WorldWideScience

Sample records for gene cluster members

  1. Myf-6, a new member of the human gene family of myogenic determination factors: evidence for a gene cluster on chromosome 12.

    OpenAIRE

    Braun, T; Bober, E; Winter, B; Rosenthal, N; Arnold, H H

    1990-01-01

    The Myf-6 gene, a novel member of the human gene family of muscle determination factors has been detected by its highly conserved sequence coding for a putative helix-loop-helix domain. This sequence motif is a common feature of all Myf factors and other regulatory proteins. The new Myf gene is located on human chromosome 12, approximately 6.5 Kb upstream of the Myf-5 locus in a closely linked cluster of myogenic determination genes. Myf-6 cDNAs were isolated from human and mouse skeletal mus...

  2. Cloning and characterization of the biosynthetic gene cluster of 16-membered macrolide antibiotic FD-891: involvement of a dual functional cytochrome P450 monooxygenase catalyzing epoxidation and hydroxylation.

    Science.gov (United States)

    Kudo, Fumitaka; Motegi, Atsushi; Mizoue, Kazutoshi; Eguchi, Tadashi

    2010-07-26

    FD-891 is a 16-membered cytotoxic antibiotic macrolide that is especially active against human leukemia such as HL-60 and Jurkat cells. We identified the FD-891 biosynthetic (gfs) gene cluster from the producer Streptomyces graminofaciens A-8890 by using typical modular type I polyketide synthase (PKS) genes as probes. The gfs gene cluster contained five typical modular type I PKS genes (gfsA, B, C, D, and E), a cytochrome P450 gene (gfsF), a methyltransferase gene (gfsG), and a regulator gene (gfsR). The gene organization of PKSs agreed well with the basic polyketide skeleton of FD-891 including the oxidation states and alpha-alkyl substituent determined by the substrate specificities of the acyltransferase (AT) domains. To clarify the involvement of the gfs genes in the FD-891 biosynthesis, the P450 gfsF gene was inactivated; this resulted in the loss of FD-891 production. Instead, the gfsF gene-disrupted mutant accumulated a novel FD-891 analogue 25-O-methyl-FD-892, which lacked the epoxide and the hydroxyl group of FD-891. Furthermore, the recombinant GfsF enzyme coexpressed with putidaredoxin and putidaredoxin reductase converted 25-O-methyl-FD-892 into FD-891. In the course of the GfsF reaction, 10-deoxy-FD-891 was isolated as an enzymatic reaction intermediate, which was also converted into FD-891 by GfsF. Therefore, it was clearly found that the cytochrome P450 GfsF catalyzes epoxidation and hydroxylation in a stepwise manner in the FD-891 biosynthesis. These results clearly confirmed that the identified gfs genes are responsible for the biosynthesis of FD-891 in S. graminofaciens.

  3. C4BPAL1, a member of the human regulator of complement activation (RCA) gene cluster that resulted from the duplication of the gene coding for the [alpha]-chain of C4b-binding protein

    Energy Technology Data Exchange (ETDEWEB)

    Sanchez-Corral, P.; Pardo-Manuel de Villena, F.; Rey-Campos, J.; Rodriguez de Cordoba, S. (Unidad de Immunologia, Madrid (Spain))

    1993-07-01

    The regulator of complement activation (RCA) gene cluster evolved by multiple gene duplications to produce a family of genes coding for proteins that collectively control the activation of the complement system. The authors report here the characterization of C4BPAL1, a member of the human RCA gene cluster that arose from the duplication of the C4BPA gene after the separation of rodent and primate lineages. C4BPAL1 maps 20 kb downstream of the C4BPA gene and is in the same 5[prime] to 3[prime] orientation found for all RCA genes characterized thus far. It includes nine exon-like regions homologous to exons 2-8, 11, and 12 of the C4BPA gene. Analysis of the C4BPAL1 sequence suggests that it is currently a pseudogene in humans. However, comparisons between C4BPAL1 and the human and murine C4BPA genes show sequence conservation, which strongly suggests that, for a long period of time, C4BPAL1 has been a functional gene coding for a protein with structural requirements similar to those of the [alpha]-chain of C4b-binding protein. 50 refs., 5 figs., 1 tab.

  4. Gene cluster statistics with gene families.

    Science.gov (United States)

    Raghupathy, Narayanan; Durand, Dannie

    2009-05-01

    Identifying genomic regions that descended from a common ancestor is important for understanding the function and evolution of genomes. In distantly related genomes, clusters of homologous gene pairs are evidence of candidate homologous regions. Demonstrating the statistical significance of such "gene clusters" is an essential component of comparative genomic analyses. However, currently there are no practical statistical tests for gene clusters that model the influence of the number of homologs in each gene family on cluster significance. In this work, we demonstrate empirically that failure to incorporate gene family size in gene cluster statistics results in overestimation of significance, leading to incorrect conclusions. We further present novel analytical methods for estimating gene cluster significance that take gene family size into account. Our methods do not require complete genome data and are suitable for testing individual clusters found in local regions, such as contigs in an unfinished assembly. We consider pairs of regions drawn from the same genome (paralogous clusters), as well as regions drawn from two different genomes (orthologous clusters). Determining cluster significance under general models of gene family size is computationally intractable. By assuming that all gene families are of equal size, we obtain analytical expressions that allow fast approximation of cluster probabilities. We evaluate the accuracy of this approximation by comparing the resulting gene cluster probabilities with cluster probabilities obtained by simulating a realistic, power-law distributed model of gene family size, with parameters inferred from genomic data. Surprisingly, despite the simplicity of the underlying assumption, our method accurately approximates the true cluster probabilities. It slightly overestimates these probabilities, yielding a conservative test. We present additional simulation results indicating the best choice of parameter values for data

  5. Support Policies in Clusters: Prioritization of Support Needs by Cluster Members According to Cluster Life Cycle

    Directory of Open Access Journals (Sweden)

    Gulcin Salıngan

    2012-07-01

    Full Text Available Economic development has always been a moving target. Both the national and local governments have been facing the challenge of implementing the effective and efficient economic policy and program in order to best utilize their limited resources. One of the recent approaches in this area is called cluster-based economic analysis and strategy development. This study reviews key literature and some of the cluster based economic policies adopted by different governments. Based on this review, it proposes “the cluster life cycle” as a determining factor to identify the support requirements of clusters. A survey, designed based on literature review of International Cluster support programs, was conducted with 30 participants from 3 clusters with different maturity stage. This paper discusses the results of this study conducted among the cluster members in Eskişehir- Bilecik-Kütahya Region in Turkey on the requirement of the support to foster the development of related clusters.

  6. FunGeneClusterS

    DEFF Research Database (Denmark)

    Vesth, Tammi Camilla; Brandl, Julian; Andersen, Mikael Rørdam

    2016-01-01

    and industrial biotechnology applications. We have previously published a method for accurate prediction of clusters from genome and transcriptome data, which could also suggest cross-chemistry, however, this method was limited both in the number of parameters which could be adjusted as well as in user......Secondary metabolites of fungi are receiving an increasing amount of interest due to their prolific bioactivities and the fact that fungal biosynthesis of secondary metabolites often occurs from co-regulated and co-located gene clusters. This makes the gene clusters attractive for synthetic biology...

  7. Confirming the least massive members of the Pleiades star cluster

    Science.gov (United States)

    Zapatero Osorio, M. R.; Béjar, V. J. S.; Lodieu, N.; Manjavacas, E.

    2018-03-01

    We present optical photometry (i and Z band) and low-resolution spectroscopy (640-1015 nm) of very faint candidate members (J = 20.2-21.2 mag) of the Pleiades star cluster (120 Myr). The main goal is to address their cluster membership via photometric, astrometric, and spectroscopic studies, and to determine the properties of the least massive population of the cluster through the comparison of the data with younger and older spectral counterparts and state-of-the art model atmospheres. We confirm three bona fide Pleiades members that have extremely red optical and infrared colours, effective temperatures of ≈1150 and ≈1350 K, and masses in the interval 11-20 MJup, and one additional likely member that shares the same motion as the cluster but does not appear to be as red as the other members with similar brightness. This latter object requires further near-infrared spectroscopy to fully address its membership in the Pleiades. The optical spectra of two bona fide members were classified as L6-L7 and show features of K I, a tentative detection of Cs I, hydrides, and water vapour with an intensity similar to high-gravity dwarfs of related classification despite their young age. The properties of the Pleiades L6-L7 members clearly indicate that very red colours of L dwarfs are not a direct evidence of ages younger than ≈100 Myr. We also report on the determination of the bolometric corrections for the coolest Pleiades members. These data can be used to interpret the observations of the atmospheres of exoplanets orbiting stars.

  8. VizieR Online Data Catalog: NGC 6802 dwarf cluster members and non-members (Tang+, 2017)

    Science.gov (United States)

    Tang, B.; Geisler, D.; Friel, E.; Villanova, S.; Smiljanic, R.; Casey, A. R.; Randich, S.; Magrini, L.; San, Roman I.; Munoz, C.; Cohen, R. E.; Mauro, F.; Bragaglia, A.; Donati, P.; Tautvaisiene, G.; Drazdauskas, A.; Zenoviene, R.; Snaith, O.; Sousa, S.; Adibekyan, V.; Costado, M. T.; Blanco-Cuaresma, S.; Jimenez-Esteban, F.; Carraro, G.; Zwitter, T.; Francois, P.; Jofre, P.; Sordo, R.; Gilmore, G.; Flaccomio, E.; Koposov, S.; Korn, A. J.; Lanzafame, A. C.; Pancino, E.; Bayo, A.; Damiani, F.; Franciosini, E.; Hourihane, A.; Lardo, C.; Lewis, J.; Monaco, L.; Morbidelli, L.; Prisinzano, L.; Sacco, G.; Worley, C. C.; Zaggia, S.

    2016-11-01

    The dwarf stars in NGC 6802 observed by GIRAFFE spectrograph are separated into four tables: 1. cluster members in the lower main sequence; 2. cluster members in the upper main sequence; 3. non-member dwarfs in the lower main sequence; 4. non-member dwarfs in the upper main sequence. The star coordinates, V band magnitude, V-I color, and radial velocity are given. (4 data files).

  9. Organization of an echinoderm Hox gene cluster

    OpenAIRE

    Martinez, Pedro; Rast, Jonathan P.; Arenas-Mena, César; Davidson, Eric H.

    1999-01-01

    The Strongylocentrotus purpuratus genome contains a single ten-gene Hox complex >0.5 megabase in length. This complex was isolated on overlapping bacterial artificial chromosome and P1 artificial chromosome genomic recombinants by using probes for individual genes and by genomic walking. Echinoderm Hox genes of Paralog Groups (PG) 1 and 2 are reported. The cluster includes genes representing all paralog groups of vertebrate Hox clusters, except that there is a sing...

  10. Mathematical model for research and analyze relations and functions between enterprises, members of cluster

    Science.gov (United States)

    Angelov, Kiril; Kaynakchieva, Vesela

    2017-12-01

    The aim of the current study is to research and analyze Mathematical model for research and analyze of relations and functions between enterprises, members of cluster, and its approbation in given cluster. Subject of the study are theoretical mechanisms for the definition of mathematical models for research and analyze of relations and functions between enterprises, members of cluster. Object of the study are production enterprises, members of cluster. Results of this study show that described theoretical mathematical model is applicable for research and analyze of functions and relations between enterprises, members of cluster from different industrial sectors. This circumstance creates alternatives for election of cluster, where is experimented this model for interaction improvement between enterprises, members of cluster.

  11. Fuzzy clustering analysis of osteosarcoma related genes.

    Science.gov (United States)

    Chen, Kai; Wu, Dajiang; Bai, Yushu; Zhu, Xiaodong; Chen, Ziqiang; Wang, Chuanfeng; Zhao, Yingchuan; Li, Ming

    2014-07-01

    Osteosarcoma is the most common malignant bone-tumor with a peak manifestation during the second and third decade of life. In order to explore the influence of genetic factors on the mechanism of osteosarcoma by analyzing the inter relationship between osteosarcoma and its related genes, and then provide potential genetic references for the prevention, diagnosis and treatment of osteosarcoma, we collected osteosarcoma related gene sequences in Genebank of National Center for Biotechnology Information (NCBI) and local alignment analysis for a pair of sequences was carried out to identify the measurement association among related sequences. Then fuzzy clustering method was used for clustering analysis so as to contact the unknown genes through the consistent osteosarcoma related genes in one class. From the result of fuzzy clustering analysis, we could classify the osteosarcoma related genes into two groups and deduced that the genes clustered into one group had similar function. Based on this knowledge, we found more genes related to the pathogenesis of osteosarcoma and these genes could exert similar function as Runx2, a risk factor confirmed in osteosarcoma, this study may help better understand the genetic mechanism and provide new molecular markers and therapies for osteosarcoma.

  12. Pichia stipitis genomics, transcriptomics, and gene clusters

    Science.gov (United States)

    Thomas W. Jeffries; Jennifer R. Headman Van Vleet

    2009-01-01

    Genome sequencing and subsequent global gene expression studies have advanced our understanding of the lignocellulose-fermenting yeast Pichia stipitis. These studies have provided an insight into its central carbon metabolism, and analysis of its genome has revealed numerous functional gene clusters and tandem repeats. Specialized physiological traits are often the...

  13. A tripartite clustering analysis on microRNA, gene and disease model.

    Science.gov (United States)

    Shen, Chengcheng; Liu, Ying

    2012-02-01

    Alteration of gene expression in response to regulatory molecules or mutations could lead to different diseases. MicroRNAs (miRNAs) have been discovered to be involved in regulation of gene expression and a wide variety of diseases. In a tripartite biological network of human miRNAs, their predicted target genes and the diseases caused by altered expressions of these genes, valuable knowledge about the pathogenicity of miRNAs, involved genes and related disease classes can be revealed by co-clustering miRNAs, target genes and diseases simultaneously. Tripartite co-clustering can lead to more informative results than traditional co-clustering with only two kinds of members and pass the hidden relational information along the relation chain by considering multi-type members. Here we report a spectral co-clustering algorithm for k-partite graph to find clusters with heterogeneous members. We use the method to explore the potential relationships among miRNAs, genes and diseases. The clusters obtained from the algorithm have significantly higher density than randomly selected clusters, which means members in the same cluster are more likely to have common connections. Results also show that miRNAs in the same family based on the hairpin sequences tend to belong to the same cluster. We also validate the clustering results by checking the correlation of enriched gene functions and disease classes in the same cluster. Finally, widely studied miR-17-92 and its paralogs are analyzed as a case study to reveal that genes and diseases co-clustered with the miRNAs are in accordance with current research findings.

  14. Functional Analysis of the Fusarielin Biosynthetic Gene Cluster

    Directory of Open Access Journals (Sweden)

    Aida Droce

    2016-12-01

    Full Text Available Fusarielins are polyketides with a decalin core produced by various species of Aspergillus and Fusarium. Although the responsible gene cluster has been identified, the biosynthetic pathway remains to be elucidated. In the present study, members of the gene cluster were deleted individually in a Fusarium graminearum strain overexpressing the local transcription factor. The results suggest that a trans-acting enoyl reductase (FSL5 assists the polyketide synthase FSL1 in biosynthesis of a polyketide product, which is released by hydrolysis by a trans-acting thioesterase (FSL2. Deletion of the epimerase (FSL3 resulted in accumulation of an unstable compound, which could be the released product. A novel compound, named prefusarielin, accumulated in the deletion mutant of the cytochrome P450 monooxygenase FSL4. Unlike the known fusarielins from Fusarium, this compound does not contain oxygenized decalin rings, suggesting that FSL4 is responsible for the oxygenation.

  15. Semi-supervised consensus clustering for gene expression data analysis

    OpenAIRE

    Wang, Yunli; Pan, Youlian

    2014-01-01

    Background Simple clustering methods such as hierarchical clustering and k-means are widely used for gene expression data analysis; but they are unable to deal with noise and high dimensionality associated with the microarray gene expression data. Consensus clustering appears to improve the robustness and quality of clustering results. Incorporating prior knowledge in clustering process (semi-supervised clustering) has been shown to improve the consistency between the data partitioning and do...

  16. Member Company Evaluation of the Effectiveness of Food Valley as a Cluster Organization

    NARCIS (Netherlands)

    Fortuin, F.T.J.M.; Omta, S.W.F.

    2012-01-01

    This paper aims to analyze the effectiveness of the different cluster organization functions (services, activities and information sources) of Food Valley Organization in the Dutch agifood innovation system, as evaluated by its member companies. It is concluded that, in accordance with cluster

  17. Gene ordering in partitive clustering using microarray expressions

    Indian Academy of Sciences (India)

    PRAKASH KUMAR

    fibroblast data and showed that our approach improves the result quality of partitive clustering solution, by identifying subclusters within big clusters, grouping functionally correlated genes within clusters, minimization of summation of gene expression distances, and the maximization of biological gene ordering using MIPS ...

  18. A survey for low-mass stellar and substellar members of the Hyades open cluster

    Science.gov (United States)

    Melnikov, Stanislav; Eislöffel, Jochen

    2018-03-01

    Context. Unlike young open clusters (with ages degrees on the sky, extended surveys are required to improve the statistics of the VLM/BD objects in the cluster. Aim. We search for new VLM stars and BD candidates in the Hyades cluster to improve the present-day cluster mass function down to substellar masses. Methods: An imaging survey of the Hyades with a completeness limit of 21.m5 in the R band and 20.m5 in the I band was carried out with the 2k × 2k CCD Schmidt camera at the 2 m Alfred Jensch Telescope in Tautenburg. We performed a photometric selection of the cluster member candidates by combining results of our survey with 2MASS JHKs photometry Results: We present a photometric and proper motion survey covering 23.4 deg2 in the Hyades cluster core region. Using optical/IR colour-magnitude diagrams, we identify 66 photometric cluster member candidates in the magnitude range 14.m7 < I < 20.m5. The proper motion measurements are based on several all-sky surveys with an epoch difference of 60-70 yr for the bright objects. The proper motions allowed us to discriminate the cluster members from field objects and resulted in 14 proper motion members of the Hyades. We rediscover Hy 6 as a proper motion member and classify it as a substellar object candidate (BD) based on the comparison of the observed colour-magnitude diagram with theoretical model isochrones. Conclusions: With our results, the mass function of the Hyades continues to be shallow below 0.15 M⊙ indicating that the Hyades have probably lost their lowest mass members by means of dynamical evolution. We conclude that the Hyades core represents the "VLM/BD desert" and that most of the substeller objects may have already left the volume of the cluster.

  19. Global Analysis of miRNA Gene Clusters and Gene Families Reveals Dynamic and Coordinated Expression

    Directory of Open Access Journals (Sweden)

    Li Guo

    2014-01-01

    Full Text Available To further understand the potential expression relationships of miRNAs in miRNA gene clusters and gene families, a global analysis was performed in 4 paired tumor (breast cancer and adjacent normal tissue samples using deep sequencing datasets. The compositions of miRNA gene clusters and families are not random, and clustered and homologous miRNAs may have close relationships with overlapped miRNA species. Members in the miRNA group always had various expression levels, and even some showed larger expression divergence. Despite the dynamic expression as well as individual difference, these miRNAs always indicated consistent or similar deregulation patterns. The consistent deregulation expression may contribute to dynamic and coordinated interaction between different miRNAs in regulatory network. Further, we found that those clustered or homologous miRNAs that were also identified as sense and antisense miRNAs showed larger expression divergence. miRNA gene clusters and families indicated important biological roles, and the specific distribution and expression further enrich and ensure the flexible and robust regulatory network.

  20. Arrangement of the Clostridium baratii F7 toxin gene cluster with identification of a σ factor that recognizes the botulinum toxin gene cluster promoters.

    Science.gov (United States)

    Dover, Nir; Barash, Jason R; Burke, Julianne N; Hill, Karen K; Detter, John C; Arnon, Stephen S

    2014-01-01

    Botulinum neurotoxin (BoNT) is the most poisonous substances known and its eight toxin types (A to H) are distinguished by the inability of polyclonal antibodies that neutralize one toxin type to neutralize any of the other seven toxin types. Infant botulism, an intestinal toxemia orphan disease, is the most common form of human botulism in the United States. It results from swallowed spores of Clostridium botulinum (or rarely, neurotoxigenic Clostridium butyricum or Clostridium baratii) that germinate and temporarily colonize the lumen of the large intestine, where, as vegetative cells, they produce botulinum toxin. Botulinum neurotoxin is encoded by the bont gene that is part of a toxin gene cluster that includes several accessory genes. We sequenced for the first time the complete botulinum neurotoxin gene cluster of nonproteolytic C. baratii type F7. Like the type E and the nonproteolytic type F6 botulinum toxin gene clusters, the C. baratii type F7 had an orfX toxin gene cluster that lacked the regulatory botR gene which is found in proteolytic C. botulinum strains and codes for an alternative σ factor. In the absence of botR, we identified a putative alternative regulatory gene located upstream of the C. baratii type F7 toxin gene cluster. This putative regulatory gene codes for a predicted σ factor that contains DNA-binding-domain homologues to the DNA-binding domains both of BotR and of other members of the TcdR-related group 5 of the σ70 family that are involved in the regulation of toxin gene expression in clostridia. We showed that this TcdR-related protein in association with RNA polymerase core enzyme specifically binds to the C. baratii type F7 botulinum toxin gene cluster promoters. This TcdR-related protein may therefore be involved in regulating the expression of the genes of the botulinum toxin gene cluster in neurotoxigenic C. baratii.

  1. Characterization of two CYP77 gene family members related to ...

    Indian Academy of Sciences (India)

    [Yue Y., Peng H., Sun J., Yang Z., Yang H., Liu G. and Hu H. 2016 Characterization of two CYP77 gene family members related to development of ornamental ... of the largest families in plants, participate in numerous reac- tions of the plant ... real-time quantitative polymerase chain reaction (RT-qPCR), indicating that they ...

  2. Microarray gene cluster identification and annotation through cluster ensemble and EM-based informative textual summarization.

    Science.gov (United States)

    Hu, Xiaohua; Park, E K; Zhang, Xiaodan

    2009-09-01

    Generating high-quality gene clusters and identifying the underlying biological mechanism of the gene clusters are the important goals of clustering gene expression analysis. To get high-quality cluster results, most of the current approaches rely on choosing the best cluster algorithm, in which the design biases and assumptions meet the underlying distribution of the dataset. There are two issues for this approach: 1) usually, the underlying data distribution of the gene expression datasets is unknown and 2) there are so many clustering algorithms available and it is very challenging to choose the proper one. To provide a textual summary of the gene clusters, the most explored approach is the extractive approach that essentially builds upon techniques borrowed from the information retrieval, in which the objective is to provide terms to be used for query expansion, and not to act as a stand-alone summary for the entire document sets. Another drawback is that the clustering quality and cluster interpretation are treated as two isolated research problems and are studied separately. In this paper, we design and develop a unified system Gene Expression Miner to address these challenging issues in a principled and general manner by integrating cluster ensemble, text clustering, and multidocument summarization and provide an environment for comprehensive gene expression data analysis. We present a novel cluster ensemble approach to generate high-quality gene cluster. In our text summarization module, given a gene cluster, our expectation-maximization based algorithm can automatically identify subtopics and extract most probable terms for each topic. Then, the extracted top k topical terms from each subtopic are combined to form the biological explanation of each gene cluster. Experimental results demonstrate that our system can obtain high-quality clusters and provide informative key terms for the gene clusters.

  3. Some statistical properties of gene expression clustering for array data

    DEFF Research Database (Denmark)

    Abreu, G C G; Pinheiro, A; Drummond, R D

    2010-01-01

    DNA arrays have been a rich source of data for the study of genomic expression of a wide variety of biological systems. Gene clustering is one of the paradigms quite used to assess the significance of a gene (or group of genes). However, most of the gene clustering techniques are applied to cDNA...

  4. Indole-Diterpene Gene Cluster from Aspergillus flavus

    Science.gov (United States)

    Zhang, Shuguang; Monahan, Brendon J.; Tkacz, Jan S.; Scott, Barry

    2004-01-01

    Aflatrem is a potent tremorgenic mycotoxin produced by the soil fungus Aspergillus flavus and is a member of a large structurally diverse group of secondary metabolites known as indole-diterpenes. By using degenerate primers for conserved domains of fungal geranylgeranyl diphosphate synthases, we cloned two genes, atmG and ggsA (an apparent pseudogene), from A. flavus. Adjacent to atmG are two other genes, atmC and atmM. These three genes have 64 to 70% amino acid sequence similarity and conserved synteny with a cluster of orthologous genes, paxG, paxC, and paxM, from Penicillium paxilli which are required for indole-diterpene biosynthesis. atmG, atmC, and atmM are coordinately expressed, with transcript levels dramatically increasing at the onset of aflatrem biosynthesis. A genomic copy of atmM can complement a paxM deletion mutant of P. paxilli, demonstrating that atmM is a functional homolog of paxM. Thus, atmG, atmC, and atmM are necessary, but not sufficient, for aflatrem biosynthesis by A. flavus. This provides the first genetic evidence for the biosynthetic pathway of aflatrem in A. flavus. PMID:15528556

  5. The GALAH survey: chemical tagging of star clusters and new members in the Pleiades

    Science.gov (United States)

    Kos, Janez; Bland-Hawthorn, Joss; Freeman, Ken; Buder, Sven; Traven, Gregor; De Silva, Gayandhi M.; Sharma, Sanjib; Asplund, Martin; Duong, Ly; Lin, Jane; Lind, Karin; Martell, Sarah; Simpson, Jeffrey D.; Stello, Dennis; Zucker, Daniel B.; Zwitter, Tomaž; Anguiano, Borja; Da Costa, Gary; D'Orazi, Valentina; Horner, Jonathan; Kafle, Prajwal R.; Lewis, Geraint; Munari, Ulisse; Nataf, David M.; Ness, Melissa; Reid, Warren; Schlesinger, Katie; Ting, Yuan-Sen; Wyse, Rosemary

    2018-02-01

    The technique of chemical tagging uses the elemental abundances of stellar atmospheres to 'reconstruct' chemically homogeneous star clusters that have long since dispersed. The GALAH spectroscopic survey - which aims to observe one million stars using the Anglo-Australian Telescope - allows us to measure up to 30 elements or dimensions in the stellar chemical abundance space, many of which are not independent. How to find clustering reliably in a noisy high-dimensional space is a difficult problem that remains largely unsolved. Here, we explore t-distributed stochastic neighbour embedding (t-SNE) - which identifies an optimal mapping of a high-dimensional space into fewer dimensions - whilst conserving the original clustering information. Typically, the projection is made to a 2D space to aid recognition of clusters by eye. We show that this method is a reliable tool for chemical tagging because it can: (i) resolve clustering in chemical space alone, (ii) recover known open and globular clusters with high efficiency and low contamination, and (iii) relate field stars to known clusters. t-SNE also provides a useful visualization of a high-dimensional space. We demonstrate the method on a data set of 13 abundances measured in the spectra of 187 000 stars by the GALAH survey. We recover seven of the nine observed clusters (six globular and three open clusters) in chemical space with minimal contamination from field stars and low numbers of outliers. With chemical tagging, we also identify two Pleiades supercluster members (which we confirm kinematically), one as far as 6° - one tidal radius away from the cluster centre.

  6. Gene ordering in partitive clustering using microarray expressions

    Indian Academy of Sciences (India)

    PRAKASH KUMAR

    the new hybrid approach, finds comparable or sometimes superior biological gene order in less computation time than those obtained by optimal leaf ordering in hierarchical clustering solution. Ray S S, Bandyopadhyay S and Pal S K 2007 Gene ordering in partitive clustering using microarray expressions; J. Biosci.

  7. Unique nucleotide polymorphism of ankyrin gene cluster in ...

    Indian Academy of Sciences (India)

    The ankyrin (ANK) gene cluster is a part of a multigene family encoding ANK transmembrane proteins in Arabidopsis thaliana, and plays an important role in protein–protein interactions and in signal pathways. In contrast to other regions of a genome, the ANK gene cluster exhibits an extremely high level of DNA ...

  8. A deep staring campaign in the σ Orionis cluster. Variability in substellar members

    Science.gov (United States)

    Elliott, P.; Scholz, A.; Jayawardhana, R.; Eislöffel, J.; Hébrard, E. M.

    2017-12-01

    Context. The young star cluster near σ Orionis is one of the primary environments to study the properties of young brown dwarfs down to masses comparable to those of giant planets. Aims: Deep optical imaging is used to study time-domain properties of young brown dwarfs over typical rotational timescales and to search for new substellar and planetary-mass cluster members. Methods: We used the Visible Multi Object Spectrograph (VIMOS) at the Very Large Telescope (VLT) to monitor a 24'× 16' field in the I-band. We stared at the same area over a total integration time of 21 h, spanning three observing nights. Using the individual images from this run we investigated the photometric time series of nine substellar cluster members with masses from 10 to 60 MJup. The deep stacked image shows cluster members down to ≈5 MJup. We searched for new planetary-mass objects by combining our deep I-band photometry with public J-band magnitudes and by examining the nearby environment of known very low mass members for possible companions. Results: We find two brown dwarfs, with significantly variable, aperiodic light curves, both with masses around 50 MJup, one of which was previously unknown to be variable. The physical mechanism responsible for the observed variability is likely to be different for the two objects. The variability of the first object, a single-lined spectroscopic binary, is most likely linked to its accretion disc; the second may be caused by variable extinction by large grains. We find five new candidate members from the colour-magnitude diagram and three from a search for companions within 2000 au. We rule all eight sources out as potential members based on non-stellar shape and/or infrared colours. The I-band photometry is made available as a public dataset. Conclusions: We present two variable brown dwarfs. One is consistent with ongoing accretion, the other exhibits apparent transient variability without the presence of an accretion disc. Our analysis

  9. A Single Gene Cluster for Chalcomycins and Aldgamycins: Genetic Basis for Bifurcation of Their Biosynthesis.

    Science.gov (United States)

    Tang, Xiao-Long; Dai, Ping; Gao, Hao; Wang, Chuan-Xi; Chen, Guo-Dong; Hong, Kui; Hu, Dan; Yao, Xin-Sheng

    2016-07-01

    Aldgamycins are 16-membered macrolide antibiotics with a rare branched-chain sugar d-aldgarose or decarboxylated d-aldgarose at C-5. In our efforts to clone the gene cluster for aldgamycins from a marine-derived Streptomyces sp. HK-2006-1 capable of producing both aldgamycins and chalcomycins, we found that both are biosynthesized from a single gene cluster. Whole-genome sequencing combined with gene disruption established the entire gene cluster of aldgamycins: nine new genes are incorporated with the previously identified chalcomycin gene cluster. Functional analysis of these genes revealed that almDI/almDII, (encoding α/β subunits of pyruvate dehydrogenase) triggers the biosynthesis of aldgamycins, whereas almCI (encoding an oxidoreductase) initiates chalcomycins biosynthesis. This is the first report that aldgamycins and chalcomycins are derived from a single gene cluster and of the genetic basis for bifurcation in their biosynthesis. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  10. Distinct functions of Egr gene family members in cognitive processes

    Directory of Open Access Journals (Sweden)

    Roseline Poirier

    2008-07-01

    Full Text Available The different gene members of the Egr family of transcriptional regulators have often been considered to have related functions in brain, based on their co-expression in many cell-types and structures, the relatively high homology of the translated proteins and their ability to bind to the same consensus DNA binding sequence. Recent research, however, suggest this might not be the case. In this review, we focus on the current understanding of the functional roles of the different Egr family members in learning and memory. We briefly outline evidence from mutant mice that Egr1 is required specifically for the consolidation of long-term memory, while Egr3 is primarily essential for short-term memory. We also review our own recent findings from newly generated forebrain-specific conditional Egr2 mutant mice, which revealed that Egr2, as opposed to Egr1 and Egr3, is dispensable for several forms of learning and memory and on the contrary can act as an inhibitory constraint for certain cognitive functions. The studies reviewed here highlight the fact that Egr family members may have different, and in certain circumstances antagonistic functions in the adult brain.

  11. Bioinformatics Prediction of Polyketide Synthase Gene Clusters from Mycosphaerella fijiensis.

    Directory of Open Access Journals (Sweden)

    Roslyn D Noar

    Full Text Available Mycosphaerella fijiensis, causal agent of black Sigatoka disease of banana, is a Dothideomycete fungus closely related to fungi that produce polyketides important for plant pathogenicity. We utilized the M. fijiensis genome sequence to predict PKS genes and their gene clusters and make bioinformatics predictions about the types of compounds produced by these clusters. Eight PKS gene clusters were identified in the M. fijiensis genome, placing M. fijiensis into the 23rd percentile for the number of PKS genes compared to other Dothideomycetes. Analysis of the PKS domains identified three of the PKS enzymes as non-reducing and two as highly reducing. Gene clusters contained types of genes frequently found in PKS clusters including genes encoding transporters, oxidoreductases, methyltransferases, and non-ribosomal peptide synthases. Phylogenetic analysis identified a putative PKS cluster encoding melanin biosynthesis. None of the other clusters were closely aligned with genes encoding known polyketides, however three of the PKS genes fell into clades with clusters encoding alternapyrone, fumonisin, and solanapyrone produced by Alternaria and Fusarium species. A search for homologs among available genomic sequences from 103 Dothideomycetes identified close homologs (>80% similarity for six of the PKS sequences. One of the PKS sequences was not similar (< 60% similarity to sequences in any of the 103 genomes, suggesting that it encodes a unique compound. Comparison of the M. fijiensis PKS sequences with those of two other banana pathogens, M. musicola and M. eumusae, showed that these two species have close homologs to five of the M. fijiensis PKS sequences, but three others were not found in either species. RT-PCR and RNA-Seq analysis showed that the melanin PKS cluster was down-regulated in infected banana as compared to growth in culture. Three other clusters, however were strongly upregulated during disease development in banana, suggesting that

  12. Bioinformatics Prediction of Polyketide Synthase Gene Clusters from Mycosphaerella fijiensis.

    Science.gov (United States)

    Noar, Roslyn D; Daub, Margaret E

    2016-01-01

    Mycosphaerella fijiensis, causal agent of black Sigatoka disease of banana, is a Dothideomycete fungus closely related to fungi that produce polyketides important for plant pathogenicity. We utilized the M. fijiensis genome sequence to predict PKS genes and their gene clusters and make bioinformatics predictions about the types of compounds produced by these clusters. Eight PKS gene clusters were identified in the M. fijiensis genome, placing M. fijiensis into the 23rd percentile for the number of PKS genes compared to other Dothideomycetes. Analysis of the PKS domains identified three of the PKS enzymes as non-reducing and two as highly reducing. Gene clusters contained types of genes frequently found in PKS clusters including genes encoding transporters, oxidoreductases, methyltransferases, and non-ribosomal peptide synthases. Phylogenetic analysis identified a putative PKS cluster encoding melanin biosynthesis. None of the other clusters were closely aligned with genes encoding known polyketides, however three of the PKS genes fell into clades with clusters encoding alternapyrone, fumonisin, and solanapyrone produced by Alternaria and Fusarium species. A search for homologs among available genomic sequences from 103 Dothideomycetes identified close homologs (>80% similarity) for six of the PKS sequences. One of the PKS sequences was not similar (< 60% similarity) to sequences in any of the 103 genomes, suggesting that it encodes a unique compound. Comparison of the M. fijiensis PKS sequences with those of two other banana pathogens, M. musicola and M. eumusae, showed that these two species have close homologs to five of the M. fijiensis PKS sequences, but three others were not found in either species. RT-PCR and RNA-Seq analysis showed that the melanin PKS cluster was down-regulated in infected banana as compared to growth in culture. Three other clusters, however were strongly upregulated during disease development in banana, suggesting that they may encode

  13. Nearest Neighbor Networks: clustering expression data based on gene neighborhoods

    Directory of Open Access Journals (Sweden)

    Olszewski Kellen L

    2007-07-01

    Full Text Available Abstract Background The availability of microarrays measuring thousands of genes simultaneously across hundreds of biological conditions represents an opportunity to understand both individual biological pathways and the integrated workings of the cell. However, translating this amount of data into biological insight remains a daunting task. An important initial step in the analysis of microarray data is clustering of genes with similar behavior. A number of classical techniques are commonly used to perform this task, particularly hierarchical and K-means clustering, and many novel approaches have been suggested recently. While these approaches are useful, they are not without drawbacks; these methods can find clusters in purely random data, and even clusters enriched for biological functions can be skewed towards a small number of processes (e.g. ribosomes. Results We developed Nearest Neighbor Networks (NNN, a graph-based algorithm to generate clusters of genes with similar expression profiles. This method produces clusters based on overlapping cliques within an interaction network generated from mutual nearest neighborhoods. This focus on nearest neighbors rather than on absolute distance measures allows us to capture clusters with high connectivity even when they are spatially separated, and requiring mutual nearest neighbors allows genes with no sufficiently similar partners to remain unclustered. We compared the clusters generated by NNN with those generated by eight other clustering methods. NNN was particularly successful at generating functionally coherent clusters with high precision, and these clusters generally represented a much broader selection of biological processes than those recovered by other methods. Conclusion The Nearest Neighbor Networks algorithm is a valuable clustering method that effectively groups genes that are likely to be functionally related. It is particularly attractive due to its simplicity, its success in the

  14. Differential Retention of Gene Functions in a Secondary Metabolite Cluster.

    Science.gov (United States)

    Reynolds, Hannah T; Slot, Jason C; Divon, Hege H; Lysøe, Erik; Proctor, Robert H; Brown, Daren W

    2017-08-01

    In fungi, distribution of secondary metabolite (SM) gene clusters is often associated with host- or environment-specific benefits provided by SMs. In the plant pathogen Alternaria brassicicola (Dothideomycetes), the DEP cluster confers an ability to synthesize the SM depudecin, a histone deacetylase inhibitor that contributes weakly to virulence. The DEP cluster includes genes encoding enzymes, a transporter, and a transcription regulator. We investigated the distribution and evolution of the DEP cluster in 585 fungal genomes and found a wide but sporadic distribution among Dothideomycetes, Sordariomycetes, and Eurotiomycetes. We confirmed DEP gene expression and depudecin production in one fungus, Fusarium langsethiae. Phylogenetic analyses suggested 6-10 horizontal gene transfers (HGTs) of the cluster, including a transfer that led to the presence of closely related cluster homologs in Alternaria and Fusarium. The analyses also indicated that HGTs were frequently followed by loss/pseudogenization of one or more DEP genes. Independent cluster inactivation was inferred in at least four fungal classes. Analyses of transitions among functional, pseudogenized, and absent states of DEP genes among Fusarium species suggest enzyme-encoding genes are lost at higher rates than the transporter (DEP3) and regulatory (DEP6) genes. The phenotype of an experimentally-induced DEP3 mutant of Fusarium did not support the hypothesis that selective retention of DEP3 and DEP6 protects fungi from exogenous depudecin. Together, the results suggest that HGT and gene loss have contributed significantly to DEP cluster distribution, and that some DEP genes provide a greater fitness benefit possibly due to a differential tendency to form network connections. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution 2017. This work is written by US Government employees and is in the public domain in the US.

  15. A remarkably stable TipE gene cluster: evolution of insect Para sodium channel auxiliary subunits

    Directory of Open Access Journals (Sweden)

    Li Jia

    2011-11-01

    Full Text Available Abstract Background First identified in fruit flies with temperature-sensitive paralysis phenotypes, the Drosophila melanogaster TipE locus encodes four voltage-gated sodium (NaV channel auxiliary subunits. This cluster of TipE-like genes on chromosome 3L, and a fifth family member on chromosome 3R, are important for the optional expression and functionality of the Para NaV channel but appear quite distinct from auxiliary subunits in vertebrates. Here, we exploited available arthropod genomic resources to trace the origin of TipE-like genes by mapping their evolutionary histories and examining their genomic architectures. Results We identified a remarkably conserved synteny block of TipE-like orthologues with well-maintained local gene arrangements from 21 insect species. Homologues in the water flea, Daphnia pulex, suggest an ancestral pancrustacean repertoire of four TipE-like genes; a subsequent gene duplication may have generated functional redundancy allowing gene losses in the silk moth and mosquitoes. Intronic nesting of the insect TipE gene cluster probably occurred following the divergence from crustaceans, but in the flour beetle and silk moth genomes the clusters apparently escaped from nesting. Across Pancrustacea, TipE gene family members have experienced intronic nesting, escape from nesting, retrotransposition, translocation, and gene loss events while generally maintaining their local gene neighbourhoods. D. melanogaster TipE-like genes exhibit coordinated spatial and temporal regulation of expression distinct from their host gene but well-correlated with their regulatory target, the Para NaV channel, suggesting that functional constraints may preserve the TipE gene cluster. We identified homology between TipE-like NaV channel regulators and vertebrate Slo-beta auxiliary subunits of big-conductance calcium-activated potassium (BKCa channels, which suggests that ion channel regulatory partners have evolved distinct lineage

  16. The genome of tolypocladium inflatum: evolution, organization, and expression of the cyclosporin biosynthetic gene cluster.

    Directory of Open Access Journals (Sweden)

    Kathryn E Bushley

    2013-06-01

    Full Text Available The ascomycete fungus Tolypocladium inflatum, a pathogen of beetle larvae, is best known as the producer of the immunosuppressant drug cyclosporin. The draft genome of T. inflatum strain NRRL 8044 (ATCC 34921, the isolate from which cyclosporin was first isolated, is presented along with comparative analyses of the biosynthesis of cyclosporin and other secondary metabolites in T. inflatum and related taxa. Phylogenomic analyses reveal previously undetected and complex patterns of homology between the nonribosomal peptide synthetase (NRPS that encodes for cyclosporin synthetase (simA and those of other secondary metabolites with activities against insects (e.g., beauvericin, destruxins, etc., and demonstrate the roles of module duplication and gene fusion in diversification of NRPSs. The secondary metabolite gene cluster responsible for cyclosporin biosynthesis is described. In addition to genes necessary for cyclosporin biosynthesis, it harbors a gene for a cyclophilin, which is a member of a family of immunophilins known to bind cyclosporin. Comparative analyses support a lineage specific origin of the cyclosporin gene cluster rather than horizontal gene transfer from bacteria or other fungi. RNA-Seq transcriptome analyses in a cyclosporin-inducing medium delineate the boundaries of the cyclosporin cluster and reveal high levels of expression of the gene cluster cyclophilin. In medium containing insect hemolymph, weaker but significant upregulation of several genes within the cyclosporin cluster, including the highly expressed cyclophilin gene, was observed. T. inflatum also represents the first reference draft genome of Ophiocordycipitaceae, a third family of insect pathogenic fungi within the fungal order Hypocreales, and supports parallel and qualitatively distinct radiations of insect pathogens. The T. inflatum genome provides additional insight into the evolution and biosynthesis of cyclosporin and lays a foundation for further

  17. Clustering Algorithms: Their Application to Gene Expression Data.

    Science.gov (United States)

    Oyelade, Jelili; Isewon, Itunuoluwa; Oladipupo, Funke; Aromolaran, Olufemi; Uwoghiren, Efosa; Ameh, Faridah; Achas, Moses; Adebiyi, Ezekiel

    2016-01-01

    Gene expression data hide vital information required to understand the biological process that takes place in a particular organism in relation to its environment. Deciphering the hidden patterns in gene expression data proffers a prodigious preference to strengthen the understanding of functional genomics. The complexity of biological networks and the volume of genes present increase the challenges of comprehending and interpretation of the resulting mass of data, which consists of millions of measurements; these data also inhibit vagueness, imprecision, and noise. Therefore, the use of clustering techniques is a first step toward addressing these challenges, which is essential in the data mining process to reveal natural structures and identify interesting patterns in the underlying data. The clustering of gene expression data has been proven to be useful in making known the natural structure inherent in gene expression data, understanding gene functions, cellular processes, and subtypes of cells, mining useful information from noisy data, and understanding gene regulation. The other benefit of clustering gene expression data is the identification of homology, which is very important in vaccine design. This review examines the various clustering algorithms applicable to the gene expression data in order to discover and provide useful knowledge of the appropriate clustering technique that will guarantee stability and high degree of accuracy in its analysis procedure.

  18. Gene cluster encoding cholate catabolism in Rhodococcus spp.

    Science.gov (United States)

    Mohn, William W; Wilbrink, Maarten H; Casabon, Israël; Stewart, Gordon R; Liu, Jie; van der Geize, Robert; Eltis, Lindsay D

    2012-12-01

    Bile acids are highly abundant steroids with important functions in vertebrate digestion. Their catabolism by bacteria is an important component of the carbon cycle, contributes to gut ecology, and has potential commercial applications. We found that Rhodococcus jostii RHA1 grows well on cholate, as well as on its conjugates, taurocholate and glycocholate. The transcriptome of RHA1 growing on cholate revealed 39 genes upregulated on cholate, occurring in a single gene cluster. Reverse transcriptase quantitative PCR confirmed that selected genes in the cluster were upregulated 10-fold on cholate versus on cholesterol. One of these genes, kshA3, encoding a putative 3-ketosteroid-9α-hydroxylase, was deleted and found essential for growth on cholate. Two coenzyme A (CoA) synthetases encoded in the cluster, CasG and CasI, were heterologously expressed. CasG was shown to transform cholate to cholyl-CoA, thus initiating side chain degradation. CasI was shown to form CoA derivatives of steroids with isopropanoyl side chains, likely occurring as degradation intermediates. Orthologous gene clusters were identified in all available Rhodococcus genomes, as well as that of Thermomonospora curvata. Moreover, Rhodococcus equi 103S, Rhodococcus ruber Chol-4 and Rhodococcus erythropolis SQ1 each grew on cholate. In contrast, several mycolic acid bacteria lacking the gene cluster were unable to grow on cholate. Our results demonstrate that the above-mentioned gene cluster encodes cholate catabolism and is distinct from a more widely occurring gene cluster encoding cholesterol catabolism.

  19. Application of Gene Shaving and Mixture Models to Cluster Microarray Gene Expression Data

    Directory of Open Access Journals (Sweden)

    S. Wen

    2007-01-01

    Full Text Available Researchers are frequently faced with the analysis of microarray data of a relatively large number of genes using a small number of tissue samples. We examine the application of two statistical methods for clustering such microarray expression data: EMMIX-GENE and GeneClust. EMMIX-GENE is a mixture-model based clustering approach, designed primarily to cluster tissue samples on the basis of the genes. GeneClust is an implementation of the gene shaving methodology, motivated by research to identify distinct sets of genes for which variation in expression could be related to a biological property of the tissue samples. We illustrate the use of these two methods in the analysis of Affymetrix oligonucleotide arrays of well-known data sets from colon tissue samples with and without tumors, and of tumor tissue samples from patients with leukemia. Although the two approaches have been developed from different perspectives, the results demonstrate a clear correspondence between gene clusters produced by GeneClust and EMMIX-GENE for the colon tissue data. It is demonstrated, for the case of ribosomal proteins and smooth muscle genes in the colon data set, that both methods can classify genes into co-regulated families. It is further demonstrated that tissue types (tumor and normal can be separated on the basis of subtle distributed patterns of genes. Application to the leukemia tissue data produces a division of tissues corresponding closely to the external classification, acute myeloid leukemia (AML and acute lymphoblastic leukaemia (ALL, for both methods. In addition, we also identify genes specifi c for the subgroup of ALL-T cell samples. Overall, we find that the gene shaving method produces gene clusters at great speed; allows variable cluster sizes and can incorporate partial or full supervision; and finds clusters of genes in which the gene expression varies greatly over the tissue samples while maintaining a high level of coherence between the

  20. The effectiveness of Cluster Organization Functions from a Member Company Perspective: The Case of Food Valley Organization

    NARCIS (Netherlands)

    Omta, S.W.F.; Fortuin, F.T.J.M.

    2011-01-01

    This paper aims to analyze the effectiveness of the different cluster organization functions (services, activities and information sources) of Food Valley Organization in the Dutch agifood innovation system, as evaluated by its member companies. It is concluded that, in accordance with cluster

  1. Characterization of the largest effector gene cluster of Ustilago maydis.

    Directory of Open Access Journals (Sweden)

    Thomas Brefort

    2014-07-01

    Full Text Available In the genome of the biotrophic plant pathogen Ustilago maydis, many of the genes coding for secreted protein effectors modulating virulence are arranged in gene clusters. The vast majority of these genes encode novel proteins whose expression is coupled to plant colonization. The largest of these gene clusters, cluster 19A, encodes 24 secreted effectors. Deletion of the entire cluster results in severe attenuation of virulence. Here we present the functional analysis of this genomic region. We show that a 19A deletion mutant behaves like an endophyte, i.e. is still able to colonize plants and complete the infection cycle. However, tumors, the most conspicuous symptoms of maize smut disease, are only rarely formed and fungal biomass in infected tissue is significantly reduced. The generation and analysis of strains carrying sub-deletions identified several genes significantly contributing to tumor formation after seedling infection. Another of the effectors could be linked specifically to anthocyanin induction in the infected tissue. As the individual contributions of these genes to tumor formation were small, we studied the response of maize plants to the whole cluster mutant as well as to several individual mutants by array analysis. This revealed distinct plant responses, demonstrating that the respective effectors have discrete plant targets. We propose that the analysis of plant responses to effector mutant strains that lack a strong virulence phenotype may be a general way to visualize differences in effector function.

  2. Characterisation of the paralytic shellfish toxin biosynthesis gene clusters in Anabaena circinalis AWQC131C and Aphanizomenon sp. NH-5

    Directory of Open Access Journals (Sweden)

    Neilan Brett A

    2009-03-01

    Full Text Available Abstract Background Saxitoxin and its analogues collectively known as the paralytic shellfish toxins (PSTs are neurotoxic alkaloids and are the cause of the syndrome named paralytic shellfish poisoning. PSTs are produced by a unique biosynthetic pathway, which involves reactions that are rare in microbial metabolic pathways. Nevertheless, distantly related organisms such as dinoflagellates and cyanobacteria appear to produce these toxins using the same pathway. Hypothesised explanations for such an unusual phylogenetic distribution of this shared uncommon metabolic pathway, include a polyphyletic origin, an involvement of symbiotic bacteria, and horizontal gene transfer. Results We describe the identification, annotation and bioinformatic characterisation of the putative paralytic shellfish toxin biosynthesis clusters in an Australian isolate of Anabaena circinalis and an American isolate of Aphanizomenon sp., both members of the Nostocales. These putative PST gene clusters span approximately 28 kb and contain genes coding for the biosynthesis and export of the toxin. A putative insertion/excision site in the Australian Anabaena circinalis AWQC131C was identified, and the organization and evolution of the gene clusters are discussed. A biosynthetic pathway leading to the formation of saxitoxin and its analogues in these organisms is proposed. Conclusion The PST biosynthesis gene cluster presents a mosaic structure, whereby genes have apparently transposed in segments of varying size, resulting in different gene arrangements in all three sxt clusters sequenced so far. The gene cluster organizational structure and sequence similarity seems to reflect the phylogeny of the producer organisms, indicating that the gene clusters have an ancient origin, or that their lateral transfer was also an ancient event. The knowledge we gain from the characterisation of the PST biosynthesis gene clusters, including the identity and sequence of the genes involved

  3. Characterisation of the paralytic shellfish toxin biosynthesis gene clusters in Anabaena circinalis AWQC131C and Aphanizomenon sp. NH-5.

    Science.gov (United States)

    Mihali, Troco K; Kellmann, Ralf; Neilan, Brett A

    2009-03-30

    Saxitoxin and its analogues collectively known as the paralytic shellfish toxins (PSTs) are neurotoxic alkaloids and are the cause of the syndrome named paralytic shellfish poisoning. PSTs are produced by a unique biosynthetic pathway, which involves reactions that are rare in microbial metabolic pathways. Nevertheless, distantly related organisms such as dinoflagellates and cyanobacteria appear to produce these toxins using the same pathway. Hypothesised explanations for such an unusual phylogenetic distribution of this shared uncommon metabolic pathway, include a polyphyletic origin, an involvement of symbiotic bacteria, and horizontal gene transfer. We describe the identification, annotation and bioinformatic characterisation of the putative paralytic shellfish toxin biosynthesis clusters in an Australian isolate of Anabaena circinalis and an American isolate of Aphanizomenon sp., both members of the Nostocales. These putative PST gene clusters span approximately 28 kb and contain genes coding for the biosynthesis and export of the toxin. A putative insertion/excision site in the Australian Anabaena circinalis AWQC131C was identified, and the organization and evolution of the gene clusters are discussed. A biosynthetic pathway leading to the formation of saxitoxin and its analogues in these organisms is proposed. The PST biosynthesis gene cluster presents a mosaic structure, whereby genes have apparently transposed in segments of varying size, resulting in different gene arrangements in all three sxt clusters sequenced so far. The gene cluster organizational structure and sequence similarity seems to reflect the phylogeny of the producer organisms, indicating that the gene clusters have an ancient origin, or that their lateral transfer was also an ancient event. The knowledge we gain from the characterisation of the PST biosynthesis gene clusters, including the identity and sequence of the genes involved in the biosynthesis, may also afford the identification of

  4. Identification of nitrogen-fixing genes and gene clusters from metagenomic library of acid mine drainage.

    Directory of Open Access Journals (Sweden)

    Zhimin Dai

    Full Text Available Biological nitrogen fixation is an essential function of acid mine drainage (AMD microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community.

  5. An organisational perspective on the cluster paradox : exploring how members of a cluster manage the tension between continuity and renewal

    OpenAIRE

    Pinkse, Jonatan; Vernay, Anne-Lorene; D'Ippolito, Beatrice

    2018-01-01

    Clusters face what has been referred to as a 'cluster paradox'; a situation in which a collective identity breeds cohesion and efficiency in inter-organisational collaboration, yet it hinders the variety needed to adapt to disruptive change and prevent lock-in situations. Accordingly, a recurring theme in the literature on cluster evolution and cluster life-cycles is the need for constant renewal to allow clusters to adapt to a changing environment. However, how individual firms enact a proce...

  6. Calcitonin gene-related peptide antagonism and cluster headache

    DEFF Research Database (Denmark)

    Ashina, Håkan; Newman, Lawrence; Ashina, Sait

    2017-01-01

    Calcitonin gene-related peptide (CGRP) is a key signaling molecule involved in migraine pathophysiology. Efficacy of CGRP monoclonal antibodies and antagonists in migraine treatment has fueled an increasing interest in the prospect of treating cluster headache (CH) with CGRP antagonism. The exact...... role of CGRP and its mechanism of action in CH have not been fully clarified. A search for original studies and randomized controlled trials (RCTs) published in English was performed in PubMed and in ClinicalTrials.gov . The search term used was "cluster headache and calcitonin gene related peptide......" and "primary headaches and calcitonin gene related peptide." Reference lists of identified articles were also searched for additional relevant papers. Human experimental studies have reported elevated plasma CGRP levels during both spontaneous and glyceryl trinitrate-induced cluster attacks. CGRP may play...

  7. The Human Paraoxonase Gene Cluster As a Target in the Treatment of Atherosclerosis

    Science.gov (United States)

    She, Zhi-Gang; Chen, Hou-Zao; Yan, Yunfei; Li, Hongliang

    2012-01-01

    Abstract The paraoxonase (PON) gene cluster contains three adjacent gene members, PON1, PON2, and PON3. Originating from the same fungus lactonase precursor, all of the three PON genes share high sequence identity and a similar β propeller protein structure. PON1 and PON3 are primarily expressed in the liver and secreted into the serum upon expression, whereas PON2 is ubiquitously expressed and remains inside the cell. Each PON member has high catalytic activity toward corresponding artificial organophosphate, and all exhibit activities to lactones. Therefore, all three members of the family are regarded as lactonases. Under physiological conditions, they act to degrade metabolites of polyunsaturated fatty acids and homocysteine (Hcy) thiolactone, among other compounds. By detoxifying both oxidized low-density lipoprotein and Hcy thiolactone, PONs protect against atherosclerosis and coronary artery diseases, as has been illustrated by many types of in vitro and in vivo experimental evidence. Clinical observations focusing on gene polymorphisms also indicate that PON1, PON2, and PON3 are protective against coronary artery disease. Many other conditions, such as diabetes, metabolic syndrome, and aging, have been shown to relate to PONs. The abundance and/or activity of PONs can be regulated by lipoproteins and their metabolites, biological macromolecules, pharmacological treatments, dietary factors, and lifestyle. In conclusion, both previous results and ongoing studies provide evidence, making the PON cluster a prospective target for the treatment of atherosclerosis. Antioxid. Redox Signal. 16, 597–632. PMID:21867409

  8. IGSA: Individual Gene Sets Analysis, including Enrichment and Clustering.

    Science.gov (United States)

    Wu, Lingxiang; Chen, Xiujie; Zhang, Denan; Zhang, Wubing; Liu, Lei; Ma, Hongzhe; Yang, Jingbo; Xie, Hongbo; Liu, Bo; Jin, Qing

    2016-01-01

    Analysis of gene sets has been widely applied in various high-throughput biological studies. One weakness in the traditional methods is that they neglect the heterogeneity of genes expressions in samples which may lead to the omission of some specific and important gene sets. It is also difficult for them to reflect the severities of disease and provide expression profiles of gene sets for individuals. We developed an application software called IGSA that leverages a powerful analytical capacity in gene sets enrichment and samples clustering. IGSA calculates gene sets expression scores for each sample and takes an accumulating clustering strategy to let the samples gather into the set according to the progress of disease from mild to severe. We focus on gastric, pancreatic and ovarian cancer data sets for the performance of IGSA. We also compared the results of IGSA in KEGG pathways enrichment with David, GSEA, SPIA, ssGSEA and analyzed the results of IGSA clustering and different similarity measurement methods. Notably, IGSA is proved to be more sensitive and specific in finding significant pathways, and can indicate related changes in pathways with the severity of disease. In addition, IGSA provides with significant gene sets profile for each sample.

  9. Mining Association Rules among Gene Functions in Clusters of Similar Gene Expression Maps.

    Science.gov (United States)

    An, Li; Obradovic, Zoran; Smith, Desmond; Bodenreider, Olivier; Megalooikonomou, Vasileios

    2009-11-01

    Association rules mining methods have been recently applied to gene expression data analysis to reveal relationships between genes and different conditions and features. However, not much effort has focused on detecting the relation between gene expression maps and related gene functions. Here we describe such an approach to mine association rules among gene functions in clusters of similar gene expression maps on mouse brain. The experimental results show that the detected association rules make sense biologically. By inspecting the obtained clusters and the genes having the gene functions of frequent itemsets, interesting clues were discovered that provide valuable insight to biological scientists. Moreover, discovered association rules can be potentially used to predict gene functions based on similarity of gene expression maps.

  10. Gene clustering by latent semantic indexing of MEDLINE abstracts.

    Science.gov (United States)

    Homayouni, Ramin; Heinrich, Kevin; Wei, Lai; Berry, Michael W

    2005-01-01

    A major challenge in the interpretation of high-throughput genomic data is understanding the functional associations between genes. Previously, several approaches have been described to extract gene relationships from various biological databases using term-matching methods. However, more flexible automated methods are needed to identify functional relationships (both explicit and implicit) between genes from the biomedical literature. In this study, we explored the utility of Latent Semantic Indexing (LSI), a vector space model for information retrieval, to automatically identify conceptual gene relationships from titles and abstracts in MEDLINE citations. We found that LSI identified gene-to-gene and keyword-to-gene relationships with high average precision. In addition, LSI identified implicit gene relationships based on word usage patterns in the gene abstract documents. Finally, we demonstrate here that pairwise distances derived from the vector angles of gene abstract documents can be effectively used to functionally group genes by hierarchical clustering. Our results provide proof-of-principle that LSI is a robust automated method to elucidate both known (explicit) and unknown (implicit) gene relationships from the biomedical literature. These features make LSI particularly useful for the analysis of novel associations discovered in genomic experiments. The 50-gene document collection used in this study can be interactively queried at http://shad.cs.utk.edu/sgo/sgo.html.

  11. The Paraoxonase Gene Cluster Protects Against Abdominal Aortic Aneurysm Formation.

    Science.gov (United States)

    Yan, Yun-Fei; Pei, Jian-Fei; Zhang, Yang; Zhang, Ran; Wang, Fang; Gao, Peng; Zhang, Zhu-Qin; Wang, Ting-Ting; She, Zhi-Gang; Chen, Hou-Zao; Liu, De-Pei

    2017-02-01

    Abdominal aortic aneurysm (AAA) is a life-threatening vascular pathology, the pathogenesis of which is closely related to oxidative stress. However, an effective pharmaceutical treatment is lacking because the exact cause of AAA remains unknown. Here, we aimed at delineating the role of the paraoxonases (PONs) gene cluster (PC), which prevents atherosclerosis through the detoxification of oxidized substrates, in AAA formation. PC transgenic (Tg) mice were crossed to an Apoe -/- background, and an angiotensin II-induced AAA mouse model was used to analyze the effect of the PC on AAA formation. Four weeks after angiotensin II infusion, PC-Tg Apoe -/- mice had a lower AAA incidence, smaller maximal abdominal aortic external diameter, and less medial elastin degradation than Apoe -/- mice. Importantly, PC-Tg Apoe -/- mice exhibited lower aortic reactive oxidative species production and oxidative stress than did the Apoe -/- control mice. As a consequence, the PC transgene alleviated angiotensin II-induced arterial inflammation and suppressed arterial extracellular matrix degradation. Specifically, on angiotensin II stimulation, PC-Tg vascular smooth muscle cells exhibited lower levels of reactive oxidative species production and a decrease in the activities and expression levels of matrix metalloproteinase-2 and matrix metalloproteinase-9. Moreover, PC-Tg serum also enhanced vascular smooth muscle cell oxidative stress resistance and further decreased the expression levels of matrix metalloproteinase-2 and matrix metalloproteinase-9, indicating that circulatory and vascular smooth muscle cell PC members suppress oxidative stress in a synergistic manner. Our findings reveal, for the first time, a protective role of the PC in AAA formation and suggest PONs as promising targets for AAA prevention. © 2016 American Heart Association, Inc.

  12. The gentle art of gene arrangement: the meaning of gene clusters

    Science.gov (United States)

    Trowsdale, John

    2002-01-01

    Genome sequence comparisons reveal that some sets of genes are in similar linkage groups in different organisms while other sets are dispersed. Are some linkage groups maintained by chance, or is there an advantage to such an arrangement? Some insights may come from large clusters of genes, such as the major histocompatibility complex which includes many genes involved in immune defense. PMID:11897017

  13. The gentle art of gene arrangement: the meaning of gene clusters

    OpenAIRE

    Trowsdale, John

    2002-01-01

    Genome sequence comparisons reveal that some sets of genes are in similar linkage groups in different organisms while other sets are dispersed. Are some linkage groups maintained by chance, or is there an advantage to such an arrangement? Some insights may come from large clusters of genes, such as the major histocompatibility complex which includes many genes involved in immune defense.

  14. Cloning and Heterologous Expression of the Grecocycline Biosynthetic Gene Cluster.

    Directory of Open Access Journals (Sweden)

    Oksana Bilyk

    Full Text Available Transformation-associated recombination (TAR in yeast is a rapid and inexpensive method for cloning and assembly of large DNA fragments, which relies on natural homologous recombination. Two vectors, based on p15a and F-factor replicons that can be maintained in yeast, E. coli and streptomycetes have been constructed. These vectors have been successfully employed for assembly of the grecocycline biosynthetic gene cluster from Streptomyces sp. Acta 1362. Fragments of the cluster were obtained by PCR and transformed together with the "capture" vector into the yeast cells, yielding a construct carrying the entire gene cluster. The obtained construct was heterologously expressed in S. albus J1074, yielding several grecocycline congeners. Grecocyclines have unique structural moieties such as a dissacharide side chain, an additional amino sugar at the C-5 position and a thiol group. Enzymes from this pathway may be used for the derivatization of known active angucyclines in order to improve their desired biological properties.

  15. Accurate prediction of secondary metabolite gene clusters in filamentous fungi

    DEFF Research Database (Denmark)

    Andersen, Mikael Rørdam; Nielsen, Jakob Blæsbjerg; Klitgaard, Andreas

    2013-01-01

    supporting enzymes for key synthases one cluster at a time. In this study, we design and apply a DNA expression array for Aspergillus nidulans in combination with legacy data to form a comprehensive gene expression compendium. We apply a guilt-by-association-based analysis to predict the extent...

  16. The fimbrial gene cluster of Haemophilus influenzae type b

    NARCIS (Netherlands)

    van Ham, S. M.; van Alphen, L.; Mooi, F. R.; van Putten, J. P.

    1994-01-01

    Haemophilus influenzae infections are preceded by airway colonization, a process facilitated by fimbriae. Here, we identified the complete fimbrial gene cluster of H. influenzae type b. HifA forms the major subunit. HifB, a periplasmic chaperone, and HifC, an outer membrane usher, are typical

  17. The fimbria gene cluster of nonencapsulated Haemophilus influenzae

    NARCIS (Netherlands)

    Geluk, F.; Eijk, P. P.; van Ham, S. M.; Jansen, H. M.; van Alphen, L.

    1998-01-01

    The occurrence of fimbria gene clusters in nonencapsulated Haemophilus influenzae strains from chronic bronchitis patients (n = 58), patients with acute otitis media (n = 13), and healthy carriers (n = 12) was determined by DNA hybridization and PCR, based on sequences of fimbriate H. influenzae

  18. The ergot alkaloid gene cluster: Functional analyses and evolutionary aspects

    Czech Academy of Sciences Publication Activity Database

    Lorenz, N.; Haarmann, T.; Pažoutová, Sylvie; Jung, M.; Tudzynski, P.

    2009-01-01

    Roč. 70, 15-16 (2009), s. 1822-1832 ISSN 0031-9422 Institutional research plan: CEZ:AV0Z50200510 Keywords : Claviceps purpurea * Ergot fungus * Ergot alkaloid gene cluster Subject RIV: EE - Microbiology, Virology Impact factor: 3.104, year: 2009

  19. Unique nucleotide polymorphism of ankyrin gene cluster in ...

    Indian Academy of Sciences (India)

    Genomics 19, 478–493. Krumlauf R. 1992 Evolution of the vertebrate Hox homeobox genes. BioEssays 14, 245–252. Kuittinen H. and Aguadé M. 2000 Nucleotide variation at the. CHALCONE ISOMERASE locus in Arabidopsis thaliana. Ge- netics 155, 863–872. Lercher M. J., Urrutia A. O. and Hurst L. D. 2002 Clustering of.

  20. Gene ordering in partitive clustering using microarray expressions

    Indian Academy of Sciences (India)

    2007-06-28

    Jun 28, 2007 ... Two existing algorithms for optimally ordering cities in travelling salesman problem (TSP), namely, FRAG_GALK and Concorde, are hybridized individually with self organizing MAP to show the importance of gene ordering in partitive clustering framework. We validated our hybrid approach using yeast and ...

  1. Intact cluster and chordate-like expression of ParaHox genes in a sea star.

    Science.gov (United States)

    Annunziata, Rossella; Martinez, Pedro; Arnone, Maria Ina

    2013-06-27

    The ParaHox genes are thought to be major players in patterning the gut of several bilaterian taxa. Though this is a fundamental role that these transcription factors play, their activities are not limited to the endoderm and extend to both ectodermal and mesodermal tissues. Three genes compose the ParaHox group: Gsx, Xlox and Cdx. In some taxa (mostly chordates but to some degree also in protostomes) the three genes are arranged into a genomic cluster, in a similar fashion to what has been shown for the better-known Hox genes. Sea urchins possess the full complement of ParaHox genes but they are all dispersed throughout the genome, an arrangement that, perhaps, represented the primitive condition for all echinoderms. In order to understand the evolutionary history of this group of genes we cloned and characterized all ParaHox genes, studied their expression patterns and identified their genomic loci in a member of an earlier branching group of echinoderms, the asteroid Patiria miniata. We identified the three ParaHox orthologs in the genome of P. miniata. While one of them, PmGsx is provided as maternal message, with no zygotic activation afterwards, the other two, PmLox and PmCdx are expressed during embryogenesis, within restricted domains of both endoderm and ectoderm. Screening of a Patiria bacterial artificial chromosome (BAC) library led to the identification of a clone containing the three genes. The transcriptional directions of PmGsx and PmLox are opposed to that of the PmCdx gene within the cluster. The identification of P. miniata ParaHox genes has revealed the fact that these genes are clustered in the genome, in contrast to what has been reported for echinoids. Since the presence of an intact cluster, or at least a partial cluster, has been reported in chordates and polychaetes respectively, it becomes clear that within echinoderms, sea urchins have modified the original bilaterian arrangement. Moreover, the sea star ParaHox domains of expression show

  2. The Fusarium graminearum genome reveals more secondary metabolite gene clusters and hints of horizontal gene transfer.

    Directory of Open Access Journals (Sweden)

    Christian M K Sieber

    Full Text Available Fungal secondary metabolite biosynthesis genes are of major interest due to the pharmacological properties of their products (like mycotoxins and antibiotics. The genome of the plant pathogenic fungus Fusarium graminearum codes for a large number of candidate enzymes involved in secondary metabolite biosynthesis. However, the chemical nature of most enzymatic products of proteins encoded by putative secondary metabolism biosynthetic genes is largely unknown. Based on our analysis we present 67 gene clusters with significant enrichment of predicted secondary metabolism related enzymatic functions. 20 gene clusters with unknown metabolites exhibit strong gene expression correlation in planta and presumably play a role in virulence. Furthermore, the identification of conserved and over-represented putative transcription factor binding sites serves as additional evidence for cluster co-regulation. Orthologous cluster search provided insight into the evolution of secondary metabolism clusters. Some clusters are characteristic for the Fusarium phylum while others show evidence of horizontal gene transfer as orthologs can be found in representatives of the Botrytis or Cochliobolus lineage. The presented candidate clusters provide valuable targets for experimental examination.

  3. PEACE: Parallel Environment for Assembly and Clustering of Gene Expression.

    Science.gov (United States)

    Rao, D M; Moler, J C; Ozden, M; Zhang, Y; Liang, C; Karro, J E

    2010-07-01

    We present PEACE, a stand-alone tool for high-throughput ab initio clustering of transcript fragment sequences produced by Next Generation or Sanger Sequencing technologies. It is freely available from www.peace-tools.org. Installed and managed through a downloadable user-friendly graphical user interface (GUI), PEACE can process large data sets of transcript fragments of length 50 bases or greater, grouping the fragments by gene associations with a sensitivity comparable to leading clustering tools. Once clustered, the user can employ the GUI's analysis functions, facilitating the easy collection of statistics and allowing them to single out specific clusters for more comprehensive study or assembly. Using a novel minimum spanning tree-based clustering method, PEACE is the equal of leading tools in the literature, with an interface making it accessible to any user. It produces results of quality virtually identical to those of the WCD tool when applied to Sanger sequences, significantly improved results over WCD and TGICL when applied to the products of Next Generation Sequencing Technology and significantly improved results over Cap3 in both cases. In short, PEACE provides an intuitive GUI and a feature-rich, parallel clustering engine that proves to be a valuable addition to the leading cDNA clustering tools.

  4. Comparative genomics of natural killer cell receptor gene clusters.

    Directory of Open Access Journals (Sweden)

    James Kelley

    2005-08-01

    Full Text Available Many receptors on natural killer (NK cells recognize major histocompatibility complex class I molecules in order to monitor unhealthy tissues, such as cells infected with viruses, and some tumors. Genes encoding families of NK receptors and related sequences are organized into two main clusters in humans: the natural killer complex on Chromosome 12p13.1, which encodes C-type lectin molecules, and the leukocyte receptor complex on Chromosome 19q13.4, which encodes immunoglobulin superfamily molecules. The composition of these gene clusters differs markedly between closely related species, providing evidence for rapid, lineage-specific expansions or contractions of sets of loci. The choice of NK receptor genes is polarized in the two species most studied, mouse and human. In mouse, the C-type lectin-related Ly49 gene family predominates. Conversely, the single Ly49 sequence is a pseudogene in humans, and the immunoglobulin superfamily KIR gene family is extensive. These different gene sets encode proteins that are comparable in function and genetic diversity, even though they have undergone species-specific expansions. Understanding the biological significance of this curious situation may be aided by studying which NK receptor genes are used in other vertebrates, especially in relation to species-specific differences in genes for major histocompatibility complex class I molecules.

  5. Evolution and differential expression of a vertebrate vitellogenin gene cluster

    Directory of Open Access Journals (Sweden)

    Kongshaug Heidi

    2009-01-01

    Full Text Available Abstract Background The multiplicity or loss of the vitellogenin (vtg gene family in vertebrates has been argued to have broad implications for the mode of reproduction (placental or non-placental, cleavage pattern (meroblastic or holoblastic and character of the egg (pelagic or benthic. Earlier proposals for the existence of three forms of vertebrate vtgs present conflicting models for their origin and subsequent duplication. Results By integrating phylogenetics of novel vtg transcripts from old and modern teleosts with syntenic analyses of all available genomic variants of non-metatherian vertebrates we identify the gene orthologies between the Sarcopterygii (tetrapod branch and Actinopterygii (fish branch. We argue that the vertebrate vtg gene cluster originated in proto-chromosome m, but that vtg genes have subsequently duplicated and rearranged following whole genome duplications. Sequencing of a novel fourth vtg transcript in labrid species, and the presence of duplicated paralogs in certain model organisms supports the notion that lineage-specific gene duplications frequently occur in teleosts. The data show that the vtg gene cluster is more conserved between acanthomorph teleosts and tetrapods, than in ostariophysan teleosts such as the zebrafish. The differential expression of the labrid vtg genes are further consistent with the notion that neofunctionalized Aa-type vtgs are important determinants of the pelagic or benthic character of the eggs in acanthomorph teleosts. Conclusion The vertebrate vtg gene cluster existed prior to the separation of Sarcopterygii from Actinopterygii >450 million years ago, a period associated with the second round of whole genome duplication. The presence of higher copy numbers in a more highly expressed subcluster is particularly prevalent in teleosts. The differential expression and latent neofunctionalization of vtg genes in acanthomorph teleosts is an adaptive feature associated with oocyte hydration

  6. Clustered Xenopus keratin genes: A genomic, transcriptomic, and proteomic analysis.

    Science.gov (United States)

    Suzuki, Ken-Ichi T; Suzuki, Miyuki; Shigeta, Mitsuki; Fortriede, Joshua D; Takahashi, Shuji; Mawaribuchi, Shuuji; Yamamoto, Takashi; Taira, Masanori; Fukui, Akimasa

    2017-06-15

    Keratin genes belong to the intermediate filament superfamily and their expression is altered following morphological and physiological changes in vertebrate epithelial cells. Keratin genes are divided into two groups, type I and II, and are clustered on vertebrate genomes, including those of Xenopus species. Various keratin genes have been identified and characterized by their unique expression patterns throughout ontogeny in Xenopus laevis; however, compilation of previously reported and newly identified keratin genes in two Xenopus species is required for our further understanding of keratin gene evolution, not only in amphibians but also in all terrestrial vertebrates. In this study, 120 putative type I and II keratin genes in total were identified based on the genome data from two Xenopus species. We revealed that most of these genes are highly clustered on two homeologous chromosomes, XLA9_10 and XLA2 in X. laevis, and XTR10 and XTR2 in X. tropicalis, which are orthologous to those of human, showing conserved synteny among tetrapods. RNA-Seq data from various embryonic stages and adult tissues highlighted the unique expression profiles of orthologous and homeologous keratin genes in developmental stage- and tissue-specific manners. Moreover, we identified dozens of epidermal keratin proteins from the whole embryo, larval skin, tail, and adult skin using shotgun proteomics. In light of our results, we discuss the radiation, diversification, and unique expression of the clustered keratin genes, which are closely related to epidermal development and terrestrial adaptation during amphibian evolution, including Xenopus speciation. Copyright © 2016 Elsevier Inc. All rights reserved.

  7. Evolutionary conservation of regulatory elements in vertebrate HOX gene clusters

    Energy Technology Data Exchange (ETDEWEB)

    Santini, Simona; Boore, Jeffrey L.; Meyer, Axel

    2003-12-31

    Due to their high degree of conservation, comparisons of DNA sequences among evolutionarily distantly-related genomes permit to identify functional regions in noncoding DNA. Hox genes are optimal candidate sequences for comparative genome analyses, because they are extremely conserved in vertebrates and occur in clusters. We aligned (Pipmaker) the nucleotide sequences of HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human and mouse (over 500 million years of evolutionary distance). We identified several highly conserved intergenic sequences, likely to be important in gene regulation. Only a few of these putative regulatory elements have been previously described as being involved in the regulation of Hox genes, while several others are new elements that might have regulatory functions. The majority of these newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac). The conserved intergenic regions located between the most rostrally expressed genes in the developing embryo are longer and better retained through evolution. We document that presumed regulatory sequences are retained differentially in either A or A clusters resulting from a genome duplication in the fish lineage. This observation supports both the hypothesis that the conserved elements are involved in gene regulation and the Duplication-Deletion-Complementation model.

  8. Incorporating gene ontology into fuzzy relational clustering of microarray gene expression data.

    Science.gov (United States)

    Paul, Animesh Kumar; Shill, Pintu Chandra

    2018-01-01

    The product of gene expression works together in the cell for each living organism in order to achieve different biological processes. Many proteins are involved in different roles depending on the environment of the organism for the functioning of the cell. In this paper, we propose gene ontology (GO) annotations based semi-supervised clustering algorithm called GO fuzzy relational clustering (GO-FRC) where one gene is allowed to be assigned to multiple clusters which are the most biologically relevant behavior of genes. In the clustering process, GO-FRC utilizes useful biological knowledge which is available in the form of a gene ontology, as a prior knowledge along with the gene expression data. The prior knowledge helps to improve the coherence of the groups concerning the knowledge field. The proposed GO-FRC has been tested on the two yeast (Saccharomyces cerevisiae) expression profiles datasets (Eisen and Dream5 yeast datasets) and compared with other state-of-the-art clustering algorithms. Experimental results imply that GO-FRC is able to produce more biologically relevant clusters with the use of the small amount of GO annotations. Copyright © 2017 Elsevier B.V. All rights reserved.

  9. Acquisition and evolution of plant pathogenesis-associated gene clusters and candidate determinants of tissue-specificity in xanthomonas.

    Directory of Open Access Journals (Sweden)

    Hong Lu

    Full Text Available Xanthomonas is a large genus of plant-associated and plant-pathogenic bacteria. Collectively, members cause diseases on over 392 plant species. Individually, they exhibit marked host- and tissue-specificity. The determinants of this specificity are unknown.To assess potential contributions to host- and tissue-specificity, pathogenesis-associated gene clusters were compared across genomes of eight Xanthomonas strains representing vascular or non-vascular pathogens of rice, brassicas, pepper and tomato, and citrus. The gum cluster for extracellular polysaccharide is conserved except for gumN and sequences downstream. The xcs and xps clusters for type II secretion are conserved, except in the rice pathogens, in which xcs is missing. In the otherwise conserved hrp cluster, sequences flanking the core genes for type III secretion vary with respect to insertion sequence element and putative effector gene content. Variation at the rpf (regulation of pathogenicity factors cluster is more pronounced, though genes with established functional relevance are conserved. A cluster for synthesis of lipopolysaccharide varies highly, suggesting multiple horizontal gene transfers and reassortments, but this variation does not correlate with host- or tissue-specificity. Phylogenetic trees based on amino acid alignments of gum, xps, xcs, hrp, and rpf cluster products generally reflect strain phylogeny. However, amino acid residues at four positions correlate with tissue specificity, revealing hpaA and xpsD as candidate determinants. Examination of genome sequences of xanthomonads Xylella fastidiosa and Stenotrophomonas maltophilia revealed that the hrp, gum, and xcs clusters are recent acquisitions in the Xanthomonas lineage.Our results provide insight into the ancestral Xanthomonas genome and indicate that differentiation with respect to host- and tissue-specificity involved not major modifications or wholesale exchange of clusters, but subtle changes in a small

  10. PRECISION PHOTOMETRIC MONITORING OF VERY LOW MASS σ ORIONIS CLUSTER MEMBERS: VARIABILITY AND ROTATION AT A FEW Myr

    International Nuclear Information System (INIS)

    Cody, Ann Marie; Hillenbrand, Lynne A.

    2010-01-01

    We present high-precision photometry on 107 variable low-mass stars and brown dwarfs in the ∼3 Myr σ Orionis open cluster. We have carried out I-band photometric monitoring within two fields, encompassing 153 confirmed or candidate members of the low-mass cluster population, from 0.02 to 0.5 M sun . We are sensitive to brightness changes on timescales from 10 minutes to two weeks with amplitudes as low as 0.004 mag, and find variability on these timescales in nearly 70% of cluster members. We identify both periodic and aperiodic modes of variability, as well as semi-periodic rapid fading events that are not accounted for by the standard explanations of rotational modulation of surface features or accretion. We have incorporated both optical and infrared color data to uncover trends in variability with mass and circumstellar disks. While the data confirm that the lowest-mass objects (M sun ) rotate more rapidly than the 0.2-0.5 M sun members, they do not support a direct connection between rotation rate and the presence of a disk. Finally, we speculate on the origin of irregular variability in cluster members with no evidence for disks or accretion.

  11. Identification of Nocobactin NA Biosynthetic Gene Clusters in Nocardia farcinica▿ §

    OpenAIRE

    Hoshino, Yasutaka; Chiba, Kazuhiro; Ishino, Keiko; Fukai, Toshio; Igarashi, Yasuhiro; Yazawa, Katsukiyo; Mikami, Yuzuru; Ishikawa, Jun

    2010-01-01

    We identified the biosynthetic gene clusters of the siderophore nocobactin NA. The nbt clusters, which were discovered as genes highly homologous to the mycobactin biosynthesis genes by the genomic sequencing of Nocardia farcinica IFM 10152, consist of 10 genes separately located at two genomic regions. The gene organization of the nbt clusters and the predicted functions of the nbt genes, particularly the cyclization and epimerization domains, were in good agreement with the chemical structu...

  12. Gene duplication, modularity and adaptation in the evolution of the aflatoxin gene cluster

    Directory of Open Access Journals (Sweden)

    Jakobek Judy L

    2007-07-01

    Full Text Available Abstract Background The biosynthesis of aflatoxin (AF involves over 20 enzymatic reactions in a complex polyketide pathway that converts acetate and malonate to the intermediates sterigmatocystin (ST and O-methylsterigmatocystin (OMST, the respective penultimate and ultimate precursors of AF. Although these precursors are chemically and structurally very similar, their accumulation differs at the species level for Aspergilli. Notable examples are A. nidulans that synthesizes only ST, A. flavus that makes predominantly AF, and A. parasiticus that generally produces either AF or OMST. Whether these differences are important in the evolutionary/ecological processes of species adaptation and diversification is unknown. Equally unknown are the specific genomic mechanisms responsible for ordering and clustering of genes in the AF pathway of Aspergillus. Results To elucidate the mechanisms that have driven formation of these clusters, we performed systematic searches of aflatoxin cluster homologs across five Aspergillus genomes. We found a high level of gene duplication and identified seven modules consisting of highly correlated gene pairs (aflA/aflB, aflR/aflS, aflX/aflY, aflF/aflE, aflT/aflQ, aflC/aflW, and aflG/aflL. With the exception of A. nomius, contrasts of mean Ka/Ks values across all cluster genes showed significant differences in selective pressure between section Flavi and non-section Flavi species. A. nomius mean Ka/Ks values were more similar to partial clusters in A. fumigatus and A. terreus. Overall, mean Ka/Ks values were significantly higher for section Flavi than for non-section Flavi species. Conclusion Our results implicate several genomic mechanisms in the evolution of ST, OMST and AF cluster genes. Gene modules may arise from duplications of a single gene, whereby the function of the pre-duplication gene is retained in the copy (aflF/aflE or the copies may partition the ancestral function (aflA/aflB. In some gene modules, the

  13. Clustering gene expression regulators: new approach to disease subtyping.

    Directory of Open Access Journals (Sweden)

    Mikhail Pyatnitskiy

    Full Text Available One of the main challenges in modern medicine is to stratify different patient groups in terms of underlying disease molecular mechanisms as to develop more personalized approach to therapy. Here we propose novel method for disease subtyping based on analysis of activated expression regulators on a sample-by-sample basis. Our approach relies on Sub-Network Enrichment Analysis algorithm (SNEA which identifies gene subnetworks with significant concordant changes in expression between two conditions. Subnetwork consists of central regulator and downstream genes connected by relations extracted from global literature-extracted regulation database. Regulators found in each patient separately are clustered together and assigned activity scores which are used for final patients grouping. We show that our approach performs well compared to other related methods and at the same time provides researchers with complementary level of understanding of pathway-level biology behind a disease by identification of significant expression regulators. We have observed the reasonable grouping of neuromuscular disorders (triggered by structural damage vs triggered by unknown mechanisms, that was not revealed using standard expression profile clustering. For another experiment we were able to suggest the clusters of regulators, responsible for colorectal carcinoma vs adenoma discrimination and identify frequently genetically changed regulators that could be of specific importance for the individual characteristics of cancer development. Proposed approach can be regarded as biologically meaningful feature selection, reducing tens of thousands of genes down to dozens of clusters of regulators. Obtained clusters of regulators make possible to generate valuable biological hypotheses about molecular mechanisms related to a clinical outcome for individual patient.

  14. CHEMICAL COMPOSITION OF INTERMEDIATE-MASS STAR MEMBERS OF THE M6 (NGC 6405) OPEN CLUSTER

    Energy Technology Data Exchange (ETDEWEB)

    Kılıçoğlu, T.; Albayrak, B. [Ankara University, Faculty of Science, Department of Astronomy and Space Sciences, 06100, Tandoğan, Ankara (Turkey); Monier, R. [LESIA, UMR 8109, Observatoire de Paris Meudon, Place J. Janssen, Meudon (France); Richer, J. [Département de physique, Université de Montréal, 2900, Boulevard Edouard-Montpetit, Montréal QC, H3C 3J7 (Canada); Fossati, L., E-mail: tkilicoglu@ankara.edu.tr, E-mail: balbayrak@ankara.edu.tr, E-mail: Richard.Monier@obspm.fr, E-mail: Jacques.Richer@umontreal.ca, E-mail: lfossati@astro.uni-bonn.de [Argelander-Institut für Astronomie der Universität Bonn, Auf dem Hügel 71, D-53121, Bonn (Germany)

    2016-03-15

    We present here the first abundance analysis of 44 late B-, A-, and F-type members of the young open cluster M6 (NGC 6405, age about 75 Myr). Low- and medium-resolution spectra, covering the 4500–5840 Å wavelength range, were obtained using the FLAMES/GIRAFFE spectrograph attached to the ESO Very Large Telescopes. We determined the atmospheric parameters using calibrations of the Geneva photometry and by adjusting the H{sub β} profiles to synthetic ones. The abundances of up to 20 chemical elements, from helium to mercury, were derived for 19 late B, 16 A, and 9 F stars by iteratively adjusting synthetic spectra to the observations. We also derived a mean cluster metallicity of [Fe/H] = 0.07 ± 0.03 dex from the iron abundances of the F-type stars. We find that for most chemical elements, the normal late B- and A-type stars exhibit larger star-to-star abundance variations than the F-type stars probably because of the faster rotation of the B and A stars. The abundances of C, O, Mg, Si, and Sc appear to be anticorrelated with that of Fe, while the opposite holds for the abundances of Ca, Ti, Cr, Mn, Ni, Y, and Ba as expected if radiative diffusion is efficient in the envelopes of these stars. In the course of this analysis, we discovered five new peculiar stars: one mild Am, one Am, and one Fm star (HD 318091, CD-32 13109, GSC 07380-01211, CP1), one HgMn star (HD 318126, CP3), and one He-weak P-rich (HD 318101, CP4) star. We also discovered a new spectroscopic binary, most likely a SB2. We performed a detailed modeling of HD 318101, the new He-weak P-rich CP star, using the Montréal stellar evolution code XEVOL which self-consistently treats all particle transport processes. Although the overall abundance pattern of this star is properly reproduced, we find that detailed abundances (in particular the high P excess) resisted modeling attempts even when a range of turbulence profiles and mass-loss rates were considered. Solutions are proposed which are

  15. Chemical Composition of Intermediate-mass Star Members of the M6 (NGC 6405) Open Cluster

    Science.gov (United States)

    Kılıçoğlu, T.; Monier, R.; Richer, J.; Fossati, L.; Albayrak, B.

    2016-03-01

    We present here the first abundance analysis of 44 late B-, A-, and F-type members of the young open cluster M6 (NGC 6405, age about 75 Myr). Low- and medium-resolution spectra, covering the 4500-5840 Å wavelength range, were obtained using the FLAMES/GIRAFFE spectrograph attached to the ESO Very Large Telescopes. We determined the atmospheric parameters using calibrations of the Geneva photometry and by adjusting the Hβ profiles to synthetic ones. The abundances of up to 20 chemical elements, from helium to mercury, were derived for 19 late B, 16 A, and 9 F stars by iteratively adjusting synthetic spectra to the observations. We also derived a mean cluster metallicity of [Fe/H] = 0.07 ± 0.03 dex from the iron abundances of the F-type stars. We find that for most chemical elements, the normal late B- and A-type stars exhibit larger star-to-star abundance variations than the F-type stars probably because of the faster rotation of the B and A stars. The abundances of C, O, Mg, Si, and Sc appear to be anticorrelated with that of Fe, while the opposite holds for the abundances of Ca, Ti, Cr, Mn, Ni, Y, and Ba as expected if radiative diffusion is efficient in the envelopes of these stars. In the course of this analysis, we discovered five new peculiar stars: one mild Am, one Am, and one Fm star (HD 318091, CD-32 13109, GSC 07380-01211, CP1), one HgMn star (HD 318126, CP3), and one He-weak P-rich (HD 318101, CP4) star. We also discovered a new spectroscopic binary, most likely a SB2. We performed a detailed modeling of HD 318101, the new He-weak P-rich CP star, using the Montréal stellar evolution code XEVOL which self-consistently treats all particle transport processes. Although the overall abundance pattern of this star is properly reproduced, we find that detailed abundances (in particular the high P excess) resisted modeling attempts even when a range of turbulence profiles and mass-loss rates were considered. Solutions are proposed which are still under

  16. Evaluation of clustering algorithms for gene expression data using gene ontology annotations.

    Science.gov (United States)

    Ma, Ning; Zhang, Zheng-Guo

    2012-09-01

    Clustering is a useful exploratory technique for interpreting gene expression data to reveal groups of genes sharing common functional attributes. Biologists frequently face the problem of choosing an appropriate algorithm. We aimed to provide a standalone, easily accessible and biologically oriented criterion for expression data clustering evaluation. An external criterion utilizing annotation based similarities between genes is proposed in this work. Gene ontology information is employed as the annotation source. Comparisons among six widely used clustering algorithms over various types of gene expression data sets were carried out based on the criterion proposed. The rank of these algorithms given by the criterion coincides with our common knowledge. Single-linkage has significantly poorer performance, even worse than the random algorithm. Ward's method archives the best performance in most cases. The criterion proposed has a strong ability to distinguish among different clustering algorithms with different distance measurements. It is also demonstrated that analyzing main contributors of the criterion may offer some guidelines in finding local compact clusters. As an addition, we suggest using Ward's algorithm for gene expression data analysis.

  17. A conserved cluster of three PRD-class homeobox genes (homeobrain, rx and orthopedia in the Cnidaria and Protostomia

    Directory of Open Access Journals (Sweden)

    Mazza Maureen E

    2010-07-01

    temporal expression. Conclusion We report the first evidence for a PRD-class homeobox cluster that appears to have been conserved since the time of the cnidarian-bilaterian ancestor, and possibly even earlier, given the presence of a partial cluster in the placozoan Trichoplax. Very similar clusters comprising these three genes exist in Nematostella and diverse protostomes. Interestingly, in chordates, one member of the ancestral cluster (homeobrain has apparently been lost, and there is no linkage between rx and orthopedia in any of the vertebrates. In Nematostella, the spatial expression of these three genes along the body column is not colinear with their physical order in the cluster but the temporal expression is, therefore, using the terminology that has been applied to the Hox cluster genes, the HRO cluster would appear to exhibit temporal but not spatial colinearity. It remains to be seen whether the mechanisms responsible for the evolutionary conservation of the HRO cluster are the same mechanisms responsible for cohesion of the Hox cluster and other ANTP-class homeobox clusters that have been widely conserved throughout animal evolution.

  18. Identification and Characterization of Mycemycin Biosynthetic Gene Clusters in Streptomyces olivaceus FXJ8.012 and Streptomyces sp. FXJ1.235

    Directory of Open Access Journals (Sweden)

    Fangying Song

    2018-03-01

    Full Text Available Mycemycins A–E are new members of the dibenzoxazepinone (DBP family, derived from the gntR gene-disrupted deep sea strain Streptomyces olivaceus FXJ8.012Δ1741 and the soil strain Streptomyces sp. FXJ1.235. In this paper, we report the identification of the gene clusters and pathways’ inference for mycemycin biosynthesis in the two strains. Bioinformatics analyses of the genome sequences of S. olivaceus FXJ8.012Δ1741 and S. sp. FXJ1.235 predicted two divergent mycemycin gene clusters, mym and mye, respectively. Heterologous expression of the key enzyme genes of mym and genetic manipulation of mye as well as a feeding study in S. sp. FXJ1.235 confirmed the gene clusters and led to the proposed biosynthetic pathways for mycemycins. To the best of our knowledge, this is the first report on DBP biosynthetic gene clusters and pathways.

  19. Genetic clusters and sex-biased gene flow in a unicolonial Formica ant

    Directory of Open Access Journals (Sweden)

    Chapuisat Michel

    2009-03-01

    Full Text Available Abstract Background Animal societies are diverse, ranging from small family-based groups to extraordinarily large social networks in which many unrelated individuals interact. At the extreme of this continuum, some ant species form unicolonial populations in which workers and queens can move among multiple interconnected nests without eliciting aggression. Although unicoloniality has been mostly studied in invasive ants, it also occurs in some native non-invasive species. Unicoloniality is commonly associated with very high queen number, which may result in levels of relatedness among nestmates being so low as to raise the question of the maintenance of altruism by kin selection in such systems. However, the actual relatedness among cooperating individuals critically depends on effective dispersal and the ensuing pattern of genetic structuring. In order to better understand the evolution of unicoloniality in native non-invasive ants, we investigated the fine-scale population genetic structure and gene flow in three unicolonial populations of the wood ant F. paralugubris. Results The analysis of geo-referenced microsatellite genotypes and mitochondrial haplotypes revealed the presence of cryptic clusters of genetically-differentiated nests in the three populations of F. paralugubris. Because of this spatial genetic heterogeneity, members of the same clusters were moderately but significantly related. The comparison of nuclear (microsatellite and mitochondrial differentiation indicated that effective gene flow was male-biased in all populations. Conclusion The three unicolonial populations exhibited male-biased and mostly local gene flow. The high number of queens per nest, exchanges among neighbouring nests and restricted long-distance gene flow resulted in large clusters of genetically similar nests. The positive relatedness among clustermates suggests that kin selection may still contribute to the maintenance of altruism in unicolonial

  20. Evidence for an ergot alkaloid gene cluster in Claviceps purpurea.

    Science.gov (United States)

    Tudzynski, P; Hölter, K; Correia, T; Arntz, C; Grammel, N; Keller, U

    1999-02-01

    A gene (cpd1) coding for the dimethylallyltryptophan synthase (DMATS) that catalyzes the first specific step in the biosynthesis of ergot alkaloids, was cloned from a strain of Claviceps purpurea that produces alkaloids in axenic culture. The derived gene product (CPD1) shows only 70% similarity to the corresponding gene previously isolated from Claviceps strain ATCC 26245, which is likely to be an isolate of C. fusiformis. Therefore, the related cpd1 most probably represents the first C. purpurea gene coding for an enzymatic step of the alkaloid biosynthetic pathway to be cloned. Analysis of the 3'-flanking region of cpd1 revealed a second, closely linked ergot alkaloid biosynthetic gene named cpps1, which codes for a 356-kDa polypeptide showing significant similarity to fungal modular peptide synthetases. The protein contains three amino acid-activating modules, and in the second module a sequence is found which matches that of an internal peptide (17 amino acids in length) obtained from a tryptic digest of lysergyl peptide synthetase 1 (LPS1) of C. purpurea, thus confirming that cpps1 encodes LPS1. LPS1 activates the three amino acids of the peptide portion of ergot peptide alkaloids during D-lysergyl peptide assembly. Chromosome walking revealed the presence of additional genes upstream of cpd1 which are probably also involved in ergot alkaloid biosynthesis: cpox1 probably codes for an FAD-dependent oxidoreductase (which could represent the chanoclavine cyclase), and a second putative oxidoreductase gene, cpox2, is closely linked to it in inverse orientation. RT-PCR experiments confirm that all four genes are expressed under conditions of peptide alkaloid biosynthesis. These results strongly suggest that at least some genes of ergot alkaloid biosynthesis in C. purpurea are clustered, opening the way for a detailed molecular genetic analysis of the pathway.

  1. Translating biosynthetic gene clusters into fungal armor and weaponry.

    Science.gov (United States)

    Keller, Nancy P

    2015-09-01

    Filamentous fungi are renowned for the production of a diverse array of secondary metabolites (SMs) where the genetic material required for synthesis of a SM is typically arrayed in a biosynthetic gene cluster (BGC). These natural products are valued for their bioactive properties stemming from their functions in fungal biology, key among those protection from abiotic and biotic stress and establishment of a secure niche. The producing fungus must not only avoid self-harm from endogenous SMs but also deliver specific SMs at the right time to the right tissue requiring biochemical aid. This review highlights functions of BGCs beyond the enzymatic assembly of SMs, considering the timing and location of SM production and other proteins in the clusters that control SM activity. Specifically, self-protection is provided by both BGC-encoded mechanisms and non-BGC subcellular containment of toxic SM precursors; delivery and timing is orchestrated through cellular trafficking patterns and stress- and developmental-responsive transcriptional programs.

  2. Elemental abundance analysis of the early-type members of the open cluster M6: Preliminary results

    Science.gov (United States)

    Kılıçoǧlu, T.; Monier, R.; Fossati, L.

    2014-11-01

    Differences in chemical composition among main sequence stars within a given cluster are probably due to differences in their masses and other effects such as radiative diffusion, magnetic field, rotation, mixing mechanisms, mass loss, accretion and multiplicity. The early type main-sequence members of open clusters of different ages support studies of the competition between radiative diffusion and mixing mechanisms. We have analysed low- and high-resolution spectra covering the spectral range λ 4500-5840 Å of late B-, A- and F-type members of the open cluster M6 (age ˜100 Myr). The spectra were obtained with the FLAMES/GIRAFFE spectrograph mounted at UT2, the 8-m VLT telescope. The effective temperatures, surface gravities and microturbulent velocities of the stars were derived from both photometric and spectral methods. We have also performed a chemical abundance analysis using synthetic spectra. Abundances were determined for the elements C, O, Mg, Si, Ca, Sc, Ti, Cr, Mn, Fe, Ni, Y and Ba. The star-to-star variations in element abundances among the members of M6 are discussed.

  3. Age and helium content of the open cluster NGC 6791 from multiple eclipsing binary members. II

    DEFF Research Database (Denmark)

    Brogaard, K.; VandenBerg, D. A.; Bruntt, H.

    2012-01-01

    Models of stellar structure and evolution can be constrained by measuring accurate parameters of detached eclipsing binaries in open clusters. Multiple binary stars provide the means to determine helium abundances in these old stellar systems, and in turn, to improve estimates of their age....... In the first paper of this series, we demonstrated how measurements of multiple eclipsing binaries in the old open cluster NGC6791 sets tighter constraints on the properties of stellar models than has previously been possible, thereby potentially improving both the accuracy and precision of the cluster age....... Here we add additional constraints and perform an extensive model comparison to determine the best estimates of the cluster age and helium content, employing as many observational constraints as possible. We improve our photometry and correct empirically for differential reddening effects. We...

  4. VizieR Online Data Catalog: Gaia DR1 open cluster members (Gaia Collaboration+, 2017)

    Science.gov (United States)

    Gaia Collaboration; van Leeuwen F.; Vallenari, A.; Jordi, C.; Lindegren, L.; Bastian, U.; Prusti, T.; de Bruijne, J. H. J.; Brown, A. G. A.; Babusiaux, C.; Bailer-Jones, C. A. L.; Biermann, M.; Evans, D. W.; Eyer, L.; Jansen, F.; Klioner, S. A.; Lammers, U.; Luri, X.; Mignard, F.; Panem, C.; Pourbaix, D.; Randich, S.; Sartoretti, P.; Siddiqui, H. I.; Soubiran, C.; Valette, V.; Walton, N. A.; Aerts, C.; Arenou, F.; Cropper, M.; Drimmel, R.; Hog, E.; Katz, D.; Lattanzi, M. G.; O'Mullane, W.; Grebel, E. K.; Holland, A. D.; Huc, C.; Passot, X.; Perryman, M.; Bramante, L.; Cacciari, C.; Castaneda, J.; Chaoul, L.; Cheek, N.; de Angeli, F.; Fabricius, C.; Guerra, R.; Hernandez, J.; Jean-Antoine-Piccolo, A.; Masana, E.; Messineo, R.; Mowlavi, N.; Nienartowicz, K.; Ordonez-Blanco, D.; Panuzzo, P.; Portell, J.; Richards, P. J.; Riello, M.; Seabroke, G. M.; Tanga, P.; Thevenin, F.; Torra, J.; Els, S. G.; Gracia-Abril, G.; Comoretto, G.; Garcia-Reinaldos, M.; Lock, T.; Mercier, E.; Altmann, M.; Andrae, R.; Astraatmadja, T. L.; Bellas-Velidis, I.; Benson, K.; Berthier, J.; Blomme, R.; Busso, G.; Carry, B.; Cellino, A.; Clementini, G.; Cowell, S.; Creevey, O.; Cuypers, J.; Davidson, M.; De Ridder, J.; de Torres, A.; Delchambre, L.; Dell'Oro, A.; Ducourant, C.; Fremat, Y.; Garcia-Torres, M.; Gosset, E.; Halbwachs, J.-L.; Hambly, N. C.; Harrison, D. L.; Hauser, M.; Hestroffer, D.; Hodgkin, S. T.; Huckle, H. E.; Hutton, A.; Jasniewicz, G.; Jordan, S.; Kontizas, M.; Korn, A. J.; Lanzafame, A. C.; Manteiga, M.; Moitinho, A.; Muinonen, K.; Osinde, J.; Pancino, E.; Pauwels, T.; Petit, J.-M.; Recio-Blanco, A.; Robin, A. C.; Sarro, L. M.; Siopis, C.; Smith, M.; Smith, K. W.; Sozzetti, A.; Thuillot, W.; van Reeven, W.; Viala, Y.; Abbas, U.; Abreu Aramburu, A.; Accart, S.; Aguado, J. J.; Allan, P. M.; Allasia, W.; Altavilla, G.; Alvarez, M. A.; Alves, J.; Anderson, R. I.; Andrei, A. H.; Anglada Varela, E.; Antiche, E.; Antoja, T.; Anton, S.; Arcay, B.; Bach, N.; Baker, S. G.; Balaguer-Nunez, L.; Barache, C.; Barata, C.; Barbier, A.; Barblan, F.; Barrado, Y. Navascues D.; Barros, M.; Barstow, M. A.; Becciani, U.; Bellazzini, M.; Bello Garcia, A.; Belokuro, V. V.; Ben Djoya, P.; Berihuete, A.; Bianchi, L.; Bienayme, O.; Billebaud, F.; Blagorodnova, N.; Blanco-Cuaresma, S.; Boch, T.; Bombrun, A.; Borrachero, R.; Bouquillon, S.; Bourda, G.; Bouy, H.; Bragaglia, A.; Breddels, M. A.; Brouillet, N.; Bruesemeister, T.; Bucciarelli, B.; Burgess, P.; Burgon, R.; Burlacu, A.; Busonero, D.; Buzzi, R.; Caffau, E.; Cambras, J.; Campbell, H.; Cancelliere, R.; Cantat-Gaudin, T.; Carlucci, T.; Carrasco, J. M.; Castellani, M.; Charlot, P.; Charnas, J.; Chiavassa, A.; Clotet, M.; Cocozza, G.; Collins, R. S.; Costigan, G.; Crifo, F.; Cross, N. J. G.; Crosta, M.; Crowley, C.; Dafonte, C.; Damerdji, Y.; Dapergolas, A.; David, P.; David, M.; De Cat, P.; de Felice, F.; de Laverny, P.; de Luise, F.; de March, R.; de Martino, D.; de Souza, R.; Debosscher, J.; Del Pozo, E.; Delbo, M.; Delgado, A.; Delgado, H. E.; Di Matteo, P.; Diakite, S.; Distefano, E.; Dolding, C.; Dos Anjos, S.; Drazinos, P.; Duran, J.; Dzigan, Y.; Edvardsson, B.; Enke, H.; Evans, N. W.; Eynard Bontemps, G.; Fabre, C.; Fabrizio, M.; Faigler, S.; Falcao, A. J.; Farras Casas, M.; Federici, L.; Fedorets, G.; Fernandez-Hernandez, J.; Fernique, P.; Fienga, A.; Figueras, F.; Filippi, F.; Findeisen, K.; Fonti, A.; Fouesneau, M.; Fraile, E.; Fraser, M.; Fuchs, J.; Gai, M.; Galleti, S.; Galluccio, L.; Garabato, D.; Garcia-Sedano, F.; Garofalo, A.; Garralda, N.; Gavras, P.; Gerssen, J.; Geyer, R.; Gilmore, G.; Girona, S.; Giuffrida, G.; Gomes, M.; Gonzalez-Marcos, A.; Gonzalez-Nunez, J.; Gonzalez-Vidal, J. J.; Granvik, M.; Guerrier, A.; Guillout, P.; Guiraud, J.; Gurpide, A.; Gutierrez-Sanchez, R.; Guy, L. P.; Haigron, R.; Hatzidimitriou, D.; Haywood, M.; Heiter, U.; Helmi, A.; Hobbs, D.; Hofmann, W.; Holl, B.; Holland, G.; Hunt, J. A. S.; Hypki, A.; Icardi, V.; Irwin, M.; Jevardat de Fombelle, G.; Jofre, P.; Jonker, P. G.; Jorissen, A.; Julbe, F.; Karampelas, A.; Kochoska, A.; Kohley, R.; Kolenberg, K.; Kontizas, E.; Koposov, S. E.; Kordopatis, G.; Koubsky, P.; Krone-Martins, A.; Kudryashova, M.; Kull, I.; Bachchan, R. K.; Lacoste-Seris, F.; Lanza, A. F.; Lavigne, J.-B.; Le Poncin-Lafitte, C.; Lebreton, Y.; Lebzelter, T.; Leccia, S.; Lecler, C. N.; Lecoeur-Taibi, I.; Lemaitre, V.; Lenhardt, H.; Leroux, F.; Liao, S.; Licata, E.; Lindstrom, H. E. P.; Lister, T. A.; Livanou, E.; Lobel, A.; Loeffler, W.; Lopez, M.; Lorenz, D.; MacDonald, I.; Magalhaes Fernandes, T.; Managau, S.; Mann, R. G.; Mantelet, G.; Marchal, O.; Marchant, J. M.; Marconi, M.; Marinoni, S.; Marrese, P. M.; Marschalko, G.; Marshall, D. J.; Martin-Fleitas, J. M.; Martino, M.; Mary, N.; Matijevic, G.; Mazeh, T.; McMillan, P. J.; Messina, S.; Michalik, D.; Millar, N. R.; Miranda, B. M. H.; Molina, D.; Molinaro, R.; Molinaro, M.; Molnar, L.; Moniez, M.; Montegriffo, P.; Mor, R.; Mora, A.; Morbidelli, R.; Morel, T.; Morgenthaler, S.; Morris, D.; Mulone, A. F.; Muraveva, T.; Musella, I.; Narbonne, J.; Nelemans, G.; Nicastro, L.; Noval, L.; Ordenovic, C.; Ordieres-Mere, J.; Osborne, P.; Pagani, C.; Pagano, I.; Pailler, F.; Palacin, H.; Palaversa, L.; Parsons, P.; Pecoraro, M.; Pedrosa, R.; Pentikaeinen, H.; Pichon, B.; Piersimoni, A. M.; Pineau, F.-X.; Plachy, E.; Plum, G.; Poujoulet, E.; Prsa, A.; Pulone, L.; Ragaini, S.; Rago, S.; Rambaux, N.; Ramos-Lerate, M.; Ranalli, P.; Rauw, G.; Read, A.; Regibo, S.; Reyle, C.; Ribeiro, R. A.; Rimoldini, L.; Ripepi, V.; Riva, A.; Rixon, G.; Roelens, M.; Romero-Gomez, M.; Rowell, N.; Royer, F.; Ruiz-Dern, L.; Sadowski, G.; Sagrista Selles, T.; Sahlmann, J.; Salgado, J.; Salguero, E.; Sarasso, M.; Savietto, H.; Schultheis, M.; Sciacca, E.; Segol, M.; Segovia, J. C.; Segransan, D.; Shih, I.-C.; Smareglia, R.; Smart, R. L.; Solano, E.; Solitro, F.; Sordo, R.; Soria Nieto, S.; Souchay, J.; Spagna, A.; Spoto, F.; Stampa, U.; Steele, I. A.; Steidelmueller, H.; Stephenson, C. A.; Stoev, H.; Suess, F. F.; Sueveges, M.; Surdej, J.; Szabados, L.; Szegedi-Elek, E.; Tapiador, D.; Taris, F.; Tauran, G.; Taylor, M. B.; Teixeira, R.; Terrett, D.; Tingley, B.; Trager, S. C.; Turon, C.; Ulla, A.; Utrilla, E.; Valentini, G.; van Elteren, A.; van Hemelryck, E.; Vanleeuwen, M.; Varadi, M.; Vecchiato, A.; Veljanoski, J.; Via, T.; Vicente, D.; Vogt, S.; Voss, H.; Votruba, V.; Voutsinas, S.; Walmsley, G.; Weiler, M.; Weingrill, K.; Wevers, T.; Wyrzykowski, L.; Yoldas, A.; Zerjal, M.; Zucker, S.; Zurbach, C.; Zwitter, T.; Alecu, A.; Allen, M.; Allende Prieto, C.; Amorim, A.; Anglada-Escude, G.; Arsenijevic, V.; Azaz, S.; Balm, P.; Beck, M.; Bernstein, H.-H.; Bigot, L.; Bijaoui, A.; Blasco, C.; Bonfigli, M.; Bono, G.; Boudreault, S.; Bressan, A.; Brown, S.; Brunet, P.-M.; Bunclark, P.; Buonanno, R.; Butkevich, A. G.; Carret, C.; Carrion, C.; Chemin, L.; Chereau, F.; Corcione, L.; Darmigny, E.; de Boer, K. S.; de Teodoro, P.; de Zeeuw, P. T.; Delle Luche, C.; Domingues, C. D.; Dubath, P.; Fodor, F.; Frezouls, B.; Fries, A.; Fustes, D.; Fyfe, D.; Gallardo, E.; Gallegos, J.; Gardiol, D.; Gebran, M.; Gomboc, A.; Gomez, A.; Grux, E.; Gueguen, A.; Heyrovsky, A.; Hoar, J.; Iannicola, G.; Isasi Parache, Y.; Janotto, A.-M.; Joliet, E.; Jonckheere, A.; Keil, R.; Kim, D.-W.; Klagyivik, P.; Klar, J.; Knude, J.; Kochukhov, O.; Kolka, I.; Kos, J.; Kutka, A.; Lainey, V.; Lebouquin, D.; Liu, C.; Loreggia, D.; Makarov, V. V.; Marseille, M. G.; Martayan, C.; Martinez-Rubi, O.; Massart, B.; Meynadier, F.; Mignot, S.; Munari, U.; Nguyen, A.-T.; Nordlander, T.; O'Flaherty, K. S.; Ocvirk, P.; Olias Sanz, A.; Ortiz, P.; Osorio, J.; Oszkiewicz, D.; Ouzounis, A.; Palmer, M.; Park, P.; Pasquato, E.; Peltzer, C.; Peralta, J.; Peturaud, F.; Pieniluoma, T.; Pigozzi, E.; Poels, J.; Prat, G.; Prod'homme, T.; Raison, F.; Rebordao, J. M.; Risquez, D.; Rocca-Volmerange, B.; Rosen, S.; Ruiz-Fuertes, M. I.; Russo, F.; Sembay, S.; Serraller Vizcaino, I.; Short, A.; Siebert, A.; Silva, H.; Sinachopoulos, D.; Slezak, E.; Soffel, M.; Sosnowska, D.; Straizys, V.; Ter Linden, M.; Terrell, D.; Theil, S.; Tiede, C.; Troisi, L.; Tsalmantza, P.; Tur, D.; Vaccari, M.; Vachier, F.; Valles, P.; van Hamme, W.; Veltz, L.; Virtanen, J.; Wallut, J.-M.; Wichmann, R.; Wilkinson, M. I.; Ziaeepour, H.; Zschocke, S.

    2017-04-01

    We have determined and examined the astrometric data for 19 open clusters, ranging from the Hyades at just under 47pc to NGC 2422 at nearly 440pc. The clusters are : the Hyades, Coma Berenices, the Pleiades, Praesepe, alpha Per, IC 2391, IC 2602, Blanco 1, NGC 2451, NGC 6475, NGC 7092, NGC 2516, NGC 2232, IC 4665, NGC 6633, Collinder 140, NGC 2422, NGC 3532 and NGC 2547. (2 data files).

  5. Ancient Expansion of the Hox Cluster in Lepidoptera Generated Four Homeobox Genes Implicated in Extra-Embryonic Tissue Formation

    Science.gov (United States)

    Taylor, William R.; Gibbs, Melanie; Breuker, Casper J.; Holland, Peter W. H.

    2014-01-01

    Gene duplications within the conserved Hox cluster are rare in animal evolution, but in Lepidoptera an array of divergent Hox-related genes (Shx genes) has been reported between pb and zen. Here, we use genome sequencing of five lepidopteran species (Polygonia c-album, Pararge aegeria, Callimorpha dominula, Cameraria ohridella, Hepialus sylvina) plus a caddisfly outgroup (Glyphotaelius pellucidus) to trace the evolution of the lepidopteran Shx genes. We demonstrate that Shx genes originated by tandem duplication of zen early in the evolution of large clade Ditrysia; Shx are not found in a caddisfly and a member of the basally diverging Hepialidae (swift moths). Four distinct Shx genes were generated early in ditrysian evolution, and were stably retained in all descendent Lepidoptera except the silkmoth which has additional duplications. Despite extensive sequence divergence, molecular modelling indicates that all four Shx genes have the potential to encode stable homeodomains. The four Shx genes have distinct spatiotemporal expression patterns in early development of the Speckled Wood butterfly (Pararge aegeria), with ShxC demarcating the future sites of extraembryonic tissue formation via strikingly localised maternal RNA in the oocyte. All four genes are also expressed in presumptive serosal cells, prior to the onset of zen expression. Lepidopteran Shx genes represent an unusual example of Hox cluster expansion and integration of novel genes into ancient developmental regulatory networks. PMID:25340822

  6. Gene prioritization and clustering by multi-view text mining.

    Science.gov (United States)

    Yu, Shi; Tranchevent, Leon-Charles; De Moor, Bart; Moreau, Yves

    2010-01-14

    Text mining has become a useful tool for biologists trying to understand the genetics of diseases. In particular, it can help identify the most interesting candidate genes for a disease for further experimental analysis. Many text mining approaches have been introduced, but the effect of disease-gene identification varies in different text mining models. Thus, the idea of incorporating more text mining models may be beneficial to obtain more refined and accurate knowledge. However, how to effectively combine these models still remains a challenging question in machine learning. In particular, it is a non-trivial issue to guarantee that the integrated model performs better than the best individual model. We present a multi-view approach to retrieve biomedical knowledge using different controlled vocabularies. These controlled vocabularies are selected on the basis of nine well-known bio-ontologies and are applied to index the vast amounts of gene-based free-text information available in the MEDLINE repository. The text mining result specified by a vocabulary is considered as a view and the obtained multiple views are integrated by multi-source learning algorithms. We investigate the effect of integration in two fundamental computational disease gene identification tasks: gene prioritization and gene clustering. The performance of the proposed approach is systematically evaluated and compared on real benchmark data sets. In both tasks, the multi-view approach demonstrates significantly better performance than other comparing methods. In practical research, the relevance of specific vocabulary pertaining to the task is usually unknown. In such case, multi-view text mining is a superior and promising strategy for text-based disease gene identification.

  7. Coordinated evolution of co-expressed gene clusters in the Drosophila transcriptome

    Directory of Open Access Journals (Sweden)

    Jones Corbin D

    2008-01-01

    Full Text Available Abstract Background Co-expression of genes that physically cluster together is a common characteristic of eukaryotic transcriptomes. This organization of transcriptomes suggests that coordinated evolution of gene expression for clustered genes may also be common. Clusters where expression evolution of each gene is not independent of their neighbors are important units for understanding transcriptome evolution. Results We used a common microarray platform to measure gene expression in seven closely related species in the Drosophila melanogaster subgroup, accounting for confounding effects of sequence divergence. To summarize the correlation structure among genes in a chromosomal region, we analyzed the fraction of variation along the first principal component of the correlation matrix. We analyzed the correlation for blocks of consecutive genes to assess patterns of correlation that may be manifest at different scales of coordinated expression. We find that expression of physically clustered genes does evolve in a coordinated manner in many locations throughout the genome. Our analysis shows that relatively few of these clusters are near heterochromatin regions and that these clusters tend to be over-dispersed relative to the rest of the genome. This suggests that these clusters are not the byproduct of local gene clustering. We also analyzed the pattern of co-expression among neighboring genes within a single Drosophila species: D. simulans. For the co-expression clusters identified within this species, we find an under-representation of genes displaying a signature of recurrent adaptive amino acid evolution consistent with previous findings. However, clusters displaying co-evolution of expression among species are enriched for adaptively evolving genes. This finding points to a tie between adaptive sequence evolution and evolution of the transcriptome. Conclusion Our results demonstrate that co-evolution of expression in gene clusters is

  8. Gravitation field algorithm and its application in gene cluster

    Directory of Open Access Journals (Sweden)

    Zheng Ming

    2010-09-01

    Full Text Available Abstract Background Searching optima is one of the most challenging tasks in clustering genes from available experimental data or given functions. SA, GA, PSO and other similar efficient global optimization methods are used by biotechnologists. All these algorithms are based on the imitation of natural phenomena. Results This paper proposes a novel searching optimization algorithm called Gravitation Field Algorithm (GFA which is derived from the famous astronomy theory Solar Nebular Disk Model (SNDM of planetary formation. GFA simulates the Gravitation field and outperforms GA and SA in some multimodal functions optimization problem. And GFA also can be used in the forms of unimodal functions. GFA clusters the dataset well from the Gene Expression Omnibus. Conclusions The mathematical proof demonstrates that GFA could be convergent in the global optimum by probability 1 in three conditions for one independent variable mass functions. In addition to these results, the fundamental optimization concept in this paper is used to analyze how SA and GA affect the global search and the inherent defects in SA and GA. Some results and source code (in Matlab are publicly available at http://ccst.jlu.edu.cn/CSBG/GFA.

  9. Adaptive evolution of the FADS gene cluster within Africa.

    Directory of Open Access Journals (Sweden)

    Rasika A Mathias

    Full Text Available Long chain polyunsaturated fatty acids (LC-PUFAs are essential for brain structure, development, and function, and adequate dietary quantities of LC-PUFAs are thought to have been necessary for both brain expansion and the increase in brain complexity observed during modern human evolution. Previous studies conducted in largely European populations suggest that humans have limited capacity to synthesize brain LC-PUFAs such as docosahexaenoic acid (DHA from plant-based medium chain (MC PUFAs due to limited desaturase activity. Population-based differences in LC-PUFA levels and their product-to-substrate ratios can, in part, be explained by polymorphisms in the fatty acid desaturase (FADS gene cluster, which have been associated with increased conversion of MC-PUFAs to LC-PUFAs. Here, we show evidence that these high efficiency converter alleles in the FADS gene cluster were likely driven to near fixation in African populations by positive selection ∼85 kya. We hypothesize that selection at FADS variants, which increase LC-PUFA synthesis from plant-based MC-PUFAs, played an important role in allowing African populations obligatorily tethered to marine sources for LC-PUFAs in isolated geographic regions, to rapidly expand throughout the African continent 60-80 kya.

  10. The medaka novel immune-type receptor (NITR gene clusters reveal an extraordinary degree of divergence in variable domains

    Directory of Open Access Journals (Sweden)

    Litman Gary W

    2008-06-01

    Full Text Available Abstract Background Novel immune-type receptor (NITR genes are members of diversified multigene families that are found in bony fish and encode type I transmembrane proteins containing one or two extracellular immunoglobulin (Ig domains. The majority of NITRs can be classified as inhibitory receptors that possess cytoplasmic immunoreceptor tyrosine-based inhibition motifs (ITIMs. A much smaller number of NITRs can be classified as activating receptors by the lack of cytoplasmic ITIMs and presence of a positively charged residue within their transmembrane domain, which permits partnering with an activating adaptor protein. Results Forty-four NITR genes in medaka (Oryzias latipes are located in three gene clusters on chromosomes 10, 18 and 21 and can be organized into 24 families including inhibitory and activating forms. The particularly large dataset acquired in medaka makes direct comparison possible to another complete dataset acquired in zebrafish in which NITRs are localized in two clusters on different chromosomes. The two largest medaka NITR gene clusters share conserved synteny with the two zebrafish NITR gene clusters. Shared synteny between NITRs and CD8A/CD8B is limited but consistent with a potential common ancestry. Conclusion Comprehensive phylogenetic analyses between the complete datasets of NITRs from medaka and zebrafish indicate multiple species-specific expansions of different families of NITRs. The patterns of sequence variation among gene family members are consistent with recent birth-and-death events. Similar effects have been observed with mammalian immunoglobulin (Ig, T cell antigen receptor (TCR and killer cell immunoglobulin-like receptor (KIR genes. NITRs likely diverged along an independent pathway from that of the somatically rearranging antigen binding receptors but have undergone parallel evolution of V family diversity.

  11. Time-series clustering of gene expression in irradiated and bystander fibroblasts: an application of FBPA clustering

    Directory of Open Access Journals (Sweden)

    Markatou Marianthi

    2011-01-01

    Full Text Available Abstract Background The radiation bystander effect is an important component of the overall biological response of tissues and organisms to ionizing radiation, but the signaling mechanisms between irradiated and non-irradiated bystander cells are not fully understood. In this study, we measured a time-series of gene expression after α-particle irradiation and applied the Feature Based Partitioning around medoids Algorithm (FBPA, a new clustering method suitable for sparse time series, to identify signaling modules that act in concert in the response to direct irradiation and bystander signaling. We compared our results with those of an alternate clustering method, Short Time series Expression Miner (STEM. Results While computational evaluations of both clustering results were similar, FBPA provided more biological insight. After irradiation, gene clusters were enriched for signal transduction, cell cycle/cell death and inflammation/immunity processes; but only FBPA separated clusters by function. In bystanders, gene clusters were enriched for cell communication/motility, signal transduction and inflammation processes; but biological functions did not separate as clearly with either clustering method as they did in irradiated samples. Network analysis confirmed p53 and NF-κB transcription factor-regulated gene clusters in irradiated and bystander cells and suggested novel regulators, such as KDM5B/JARID1B (lysine (K-specific demethylase 5B and HDACs (histone deacetylases, which could epigenetically coordinate gene expression after irradiation. Conclusions In this study, we have shown that a new time series clustering method, FBPA, can provide new leads to the mechanisms regulating the dynamic cellular response to radiation. The findings implicate epigenetic control of gene expression in addition to transcription factor networks.

  12. Lampreys have a single gene cluster for the fast skeletal myosin heavy chain gene family.

    Directory of Open Access Journals (Sweden)

    Daisuke Ikeda

    Full Text Available Muscle tissues contain the most classic sarcomeric myosin, called myosin II, which consists of 2 heavy chains (MYHs and 4 light chains. In the case of humans (tetrapod, a total of 6 fast skeletal-type MYH genes (MYHs are clustered on a single chromosome. In contrast, torafugu (teleost contains at least 13 fast skeletal MYHs, which are distributed in 5 genomic regions; the MYHs are clustered in 3 of these regions. In the present study, the evolutionary relationship among fast skeletal MYHs is elucidated by comparing the MYHs of teleosts and tetrapods with those of cyclostome lampreys, one of two groups of extant jawless vertebrates (agnathans. We found that lampreys contain at least 3 fast skeletal MYHs, which are clustered in a head-to-tail manner in a single genomic region. Although there was apparent synteny in the corresponding MYH cluster regions between lampreys and tetrapods, phylogenetic analysis indicated that lamprey and tetrapod MYHs have independently duplicated and diversified. Subsequent transgenic approaches showed that the 5'-flanking sequences of Japanese lamprey fast skeletal MYHs function as a regulatory sequence to drive specific reporter gene expression in the fast skeletal muscle of zebrafish embryos. Although zebrafish MYH promoters showed apparent activity to direct reporter gene expression in myogenic cells derived from mice, promoters from Japanese lamprey MYHs had no activity. These results suggest that the muscle-specific regulatory mechanisms are partially conserved between teleosts and tetrapods but not between cyclostomes and tetrapods, despite the conserved synteny.

  13. Gene expression profiling in cluster headache: a pilot microarray study.

    Science.gov (United States)

    Sjöstrand, Christina; Duvefelt, Kristina; Steinberg, Anna; Remahl, Ingela Nilsson; Waldenlind, Elisabet; Hillert, Jan

    2006-01-01

    Cluster headache (CH) is a primary neurovascular headache disorder characterized by attacks of excruciating pain accompanied by ipsilateral autonomic symptoms. CH pathophysiology is presumed to involve an activation of hypothalamic and trigeminovascular systems, but inflammation and immunological mechanisms have also been hypothesized to be of importance. To identify differentially expressed genes during different clinical phases of CH, assuming that changes of pathophysiological importance would also be seen in peripheral venous blood. Blood samples were drawn at 3 consecutive occasions from 3 episodic CH patients: during attacks, between attacks and in remission, and at 1 occasion from 3 matched controls. Global gene expression was analyzed with microarray tehnology using the Affymetrix Human Genome U133 2.0 Plus GeneChip Set, covering more than 54,000 gene transcripts, corresponding to almost 22,000 genes. Quantitative RT-PCR on S100P gene expression was analyzed in 6 patients and 14 controls. Overall, quite small differences were seen intraindividually and large differences interindividually. However, pairwise comparisons of signal values showed upregulation of several S100 calcium binding proteins; S100A8 (calgranulin A), S100A12 (calgranulin C), and S100P during active phase of the disease compared to remission. Also, annexin A3 (calcium-binding) and ICAM3 showed upregulation. BIRC1 (neuronal apoptosis inhibitory protein), CREB5, HLA-DQA1, and HLA-DQB1 were upregulated in patients compared to controls. The upregulation of S100P during attack versus remission was confirmed by quantitative RT-PCR analysis. The S100A8 and S100A12 proteins are considered markers of non-infectious inflammatory disease, while the function of S100P is still largely unknown. Furthermore, upregulation of HLA-DQ genes in CH patients may also indicate an inflammatory response. Upregulation of these pro-inflammatory genes during the active phase of CH has not formerly been reported. Data

  14. The ergot alkaloid gene cluster in Claviceps purpurea: extension of the cluster sequence and intra species evolution.

    Science.gov (United States)

    Haarmann, Thomas; Machado, Caroline; Lübbe, Yvonne; Correia, Telmo; Schardl, Christopher L; Panaccione, Daniel G; Tudzynski, Paul

    2005-06-01

    The genomic region of Claviceps purpurea strain P1 containing the ergot alkaloid gene cluster [Tudzynski, P., Hölter, K., Correia, T., Arntz, C., Grammel, N., Keller, U., 1999. Evidence for an ergot alkaloid gene cluster in Claviceps purpurea. Mol. Gen. Genet. 261, 133-141] was explored by chromosome walking, and additional genes probably involved in the ergot alkaloid biosynthesis have been identified. The putative cluster sequence (extending over 68.5kb) contains 4 different nonribosomal peptide synthetase (NRPS) genes and several putative oxidases. Northern analysis showed that most of the genes were co-regulated (repressed by high phosphate), and identified probable flanking genes by lack of co-regulation. Comparison of the cluster sequences of strain P1, an ergotamine producer, with that of strain ECC93, an ergocristine producer, showed high conservation of most of the cluster genes, but significant variation in the NRPS modules, strongly suggesting that evolution of these chemical races of C. purpurea is determined by evolution of NRPS module specificity.

  15. Genomic sequence and organization of two members of a human lectin gene family

    International Nuclear Information System (INIS)

    Gitt, M.A.; Barondes, S.H.

    1991-01-01

    The authors have isolated and sequenced the genomic DNA encoding a human dimeric soluble lactose-binding lectin. The gene has four exons, and its upstream region contains sequences that suggest control by glucocorticoids, heat (environmental) shock, metals, and other factors. They have also isolated and sequenced three exons of the gene encoding another human putative lectin, the existence of which was first indicated by isolation of its cDNA. Comparisons suggest a general pattern of genomic organization of members of this lectin gene family

  16. In situ gene expression in mixed-culture biofilms: Evidence of metabolic interactions between community members

    DEFF Research Database (Denmark)

    Møller, Søren; Sternberg, Claus; Andersen, Jens Bo

    1998-01-01

    Microbial communities growing in laboratory-based pow chambers were investigated in order to study compartmentalization of specific gene expression. Among the community members studied, the focus,vas in particular on Pseudomonas putida and a strain of an Acinetobacter sp., and the genes studied...... in both community and pure-culture biofilms, while the Pm promoter was induced in the mixed community but not in a pure-culture biofilm. By sequentially adding community members, induction of Pm was shown to be a consequence of direct metabolic interactions between an Acinetobacter species and P. putida...... are involved in the biodegradation of toluene and related aromatic compounds. The upper-pathway promoter (Pu) and the meta-pathway promoter (Pm) from the TOL plasmid were fused independently to the gene coding for the green fluorescent protein (GFP), and expression from these promoters was studied in P. putida...

  17. Variations in CCL3L gene cluster sequence and non-specific gene copy numbers

    Directory of Open Access Journals (Sweden)

    Edberg Jeffrey C

    2010-03-01

    Full Text Available Abstract Background Copy number variations (CNVs of the gene CC chemokine ligand 3-like1 (CCL3L1 have been implicated in HIV-1 susceptibility, but the association has been inconsistent. CCL3L1 shares homology with a cluster of genes localized to chromosome 17q12, namely CCL3, CCL3L2, and, CCL3L3. These genes are involved in host defense and inflammatory processes. Several CNV assays have been developed for the CCL3L1 gene. Findings Through pairwise and multiple alignments of these genes, we have shown that the homology between these genes ranges from 50% to 99% in complete gene sequences and from 70-100% in the exonic regions, with CCL3L1 and CCL3L3 being identical. By use of MEGA 4 and BioEdit, we aligned sense primers, anti-sense primers, and probes used in several previously described assays against pre-multiple alignments of all four chemokine genes. Each set of probes and primers aligned and matched with overlapping sequences in at least two of the four genes, indicating that previously utilized RT-PCR based CNV assays are not specific for only CCL3L1. The four available assays measured median copies of 2 and 3-4 in European and African American, respectively. The concordance between the assays ranged from 0.44-0.83 suggesting individual discordant calls and inconsistencies with the assays from the expected gene coverage from the known sequence. Conclusions This indicates that some of the inconsistencies in the association studies could be due to assays that provide heterogenous results. Sequence information to determine CNV of the three genes separately would allow to test whether their association with the pathogenesis of a human disease or phenotype is affected by an individual gene or by a combination of these genes.

  18. Evolution of the C-Type Lectin-Like Receptor Genes of the DECTIN-1 Cluster in the NK Gene Complex

    Directory of Open Access Journals (Sweden)

    Susanne Sattler

    2012-01-01

    Full Text Available Pattern recognition receptors are crucial in initiating and shaping innate and adaptive immune responses and often belong to families of structurally and evolutionarily related proteins. The human C-type lectin-like receptors encoded in the DECTIN-1 cluster within the NK gene complex contain prominent receptors with pattern recognition function, such as DECTIN-1 and LOX-1. All members of this cluster share significant homology and are considered to have arisen from subsequent gene duplications. Recent developments in sequencing and the availability of comprehensive sequence data comprising many species showed that the receptors of the DECTIN-1 cluster are not only homologous to each other but also highly conserved between species. Even in Caenorhabditis elegans, genes displaying homology to the mammalian C-type lectin-like receptors have been detected. In this paper, we conduct a comprehensive phylogenetic survey and give an up-to-date overview of the currently available data on the evolutionary emergence of the DECTIN-1 cluster genes.

  19. High presence/absence gene variability in defense-related gene clusters of Cucumis melo.

    Science.gov (United States)

    González, Víctor M; Aventín, Núria; Centeno, Emilio; Puigdomènech, Pere

    2013-11-12

    Changes in the copy number of DNA sequences are one of the main mechanisms generating genome variability in eukaryotes. These changes are often related to phenotypic effects such as genetic disorders or novel pathogen resistance. The increasing availability of genome sequences through the application of next-generation massive sequencing technologies has allowed the study of genomic polymorphisms at both the interspecific and intraspecific levels, thus helping to understand how species adapt to changing environments through genome variability. Data on gene presence/absence variation (PAV) in melon was obtained by resequencing a cultivated accession and an old-relative melon variety, and using previously obtained resequencing data from three other melon cultivars, among them DHL92, on which the current draft melon genome sequence is based. A total of 1,697 PAV events were detected, involving 4.4% of the predicted melon gene complement. In all, an average 1.5% of genes were absent from each analyzed cultivar as compared to the DHL92 reference genome. The most populated functional category among the 304 PAV genes of known function was that of stress response proteins (30% of all classified PAVs). Our results suggest that genes from multi-copy families are five times more likely to be affected by PAV than singleton genes. Also, the chance of genes present in the genome in tandem arrays being affected by PAV is double that of isolated genes, with PAV genes tending to be in longer clusters. The highest concentration of PAV events detected in the melon genome was found in a 1.1 Mb region of linkage group V, which also shows the highest density of melon stress-response genes. In particular, this region contains the longest continuous gene-containing PAV sequence so far identified in melon. The first genome-wide report of PAV variation among several melon cultivars is presented here. Multi-copy and clustered genes, especially those with putative stress-response functions

  20. Challenges in microarray class discovery: a comprehensive examination of normalization, gene selection and clustering

    Science.gov (United States)

    2010-01-01

    Background Cluster analysis, and in particular hierarchical clustering, is widely used to extract information from gene expression data. The aim is to discover new classes, or sub-classes, of either individuals or genes. Performing a cluster analysis commonly involve decisions on how to; handle missing values, standardize the data and select genes. In addition, pre-processing, involving various types of filtration and normalization procedures, can have an effect on the ability to discover biologically relevant classes. Here we consider cluster analysis in a broad sense and perform a comprehensive evaluation that covers several aspects of cluster analyses, including normalization. Result We evaluated 2780 cluster analysis methods on seven publicly available 2-channel microarray data sets with common reference designs. Each cluster analysis method differed in data normalization (5 normalizations were considered), missing value imputation (2), standardization of data (2), gene selection (19) or clustering method (11). The cluster analyses are evaluated using known classes, such as cancer types, and the adjusted Rand index. The performances of the different analyses vary between the data sets and it is difficult to give general recommendations. However, normalization, gene selection and clustering method are all variables that have a significant impact on the performance. In particular, gene selection is important and it is generally necessary to include a relatively large number of genes in order to get good performance. Selecting genes with high standard deviation or using principal component analysis are shown to be the preferred gene selection methods. Hierarchical clustering using Ward's method, k-means clustering and Mclust are the clustering methods considered in this paper that achieves the highest adjusted Rand. Normalization can have a significant positive impact on the ability to cluster individuals, and there are indications that background correction is

  1. Combining Pareto-optimal clusters using supervised learning for identifying co-expressed genes.

    Science.gov (United States)

    Maulik, Ujjwal; Mukhopadhyay, Anirban; Bandyopadhyay, Sanghamitra

    2009-01-20

    The landscape of biological and biomedical research is being changed rapidly with the invention of microarrays which enables simultaneous view on the transcription levels of a huge number of genes across different experimental conditions or time points. Using microarray data sets, clustering algorithms have been actively utilized in order to identify groups of co-expressed genes. This article poses the problem of fuzzy clustering in microarray data as a multiobjective optimization problem which simultaneously optimizes two internal fuzzy cluster validity indices to yield a set of Pareto-optimal clustering solutions. Each of these clustering solutions possesses some amount of information regarding the clustering structure of the input data. Motivated by this fact, a novel fuzzy majority voting approach is proposed to combine the clustering information from all the solutions in the resultant Pareto-optimal set. This approach first identifies the genes which are assigned to some particular cluster with high membership degree by most of the Pareto-optimal solutions. Using this set of genes as the training set, the remaining genes are classified by a supervised learning algorithm. In this work, we have used a Support Vector Machine (SVM) classifier for this purpose. The performance of the proposed clustering technique has been demonstrated on five publicly available benchmark microarray data sets, viz., Yeast Sporulation, Yeast Cell Cycle, Arabidopsis Thaliana, Human Fibroblasts Serum and Rat Central Nervous System. Comparative studies of the use of different SVM kernels and several widely used microarray clustering techniques are reported. Moreover, statistical significance tests have been carried out to establish the statistical superiority of the proposed clustering approach. Finally, biological significance tests have been carried out using a web based gene annotation tool to show that the proposed method is able to produce biologically relevant clusters of co

  2. Combining Pareto-optimal clusters using supervised learning for identifying co-expressed genes

    Directory of Open Access Journals (Sweden)

    Bandyopadhyay Sanghamitra

    2009-01-01

    Full Text Available Abstract Background The landscape of biological and biomedical research is being changed rapidly with the invention of microarrays which enables simultaneous view on the transcription levels of a huge number of genes across different experimental conditions or time points. Using microarray data sets, clustering algorithms have been actively utilized in order to identify groups of co-expressed genes. This article poses the problem of fuzzy clustering in microarray data as a multiobjective optimization problem which simultaneously optimizes two internal fuzzy cluster validity indices to yield a set of Pareto-optimal clustering solutions. Each of these clustering solutions possesses some amount of information regarding the clustering structure of the input data. Motivated by this fact, a novel fuzzy majority voting approach is proposed to combine the clustering information from all the solutions in the resultant Pareto-optimal set. This approach first identifies the genes which are assigned to some particular cluster with high membership degree by most of the Pareto-optimal solutions. Using this set of genes as the training set, the remaining genes are classified by a supervised learning algorithm. In this work, we have used a Support Vector Machine (SVM classifier for this purpose. Results The performance of the proposed clustering technique has been demonstrated on five publicly available benchmark microarray data sets, viz., Yeast Sporulation, Yeast Cell Cycle, Arabidopsis Thaliana, Human Fibroblasts Serum and Rat Central Nervous System. Comparative studies of the use of different SVM kernels and several widely used microarray clustering techniques are reported. Moreover, statistical significance tests have been carried out to establish the statistical superiority of the proposed clustering approach. Finally, biological significance tests have been carried out using a web based gene annotation tool to show that the proposed method is able to

  3. Challenges in microarray class discovery: a comprehensive examination of normalization, gene selection and clustering

    Directory of Open Access Journals (Sweden)

    Landfors Mattias

    2010-10-01

    Full Text Available Abstract Background Cluster analysis, and in particular hierarchical clustering, is widely used to extract information from gene expression data. The aim is to discover new classes, or sub-classes, of either individuals or genes. Performing a cluster analysis commonly involve decisions on how to; handle missing values, standardize the data and select genes. In addition, pre-processing, involving various types of filtration and normalization procedures, can have an effect on the ability to discover biologically relevant classes. Here we consider cluster analysis in a broad sense and perform a comprehensive evaluation that covers several aspects of cluster analyses, including normalization. Result We evaluated 2780 cluster analysis methods on seven publicly available 2-channel microarray data sets with common reference designs. Each cluster analysis method differed in data normalization (5 normalizations were considered, missing value imputation (2, standardization of data (2, gene selection (19 or clustering method (11. The cluster analyses are evaluated using known classes, such as cancer types, and the adjusted Rand index. The performances of the different analyses vary between the data sets and it is difficult to give general recommendations. However, normalization, gene selection and clustering method are all variables that have a significant impact on the performance. In particular, gene selection is important and it is generally necessary to include a relatively large number of genes in order to get good performance. Selecting genes with high standard deviation or using principal component analysis are shown to be the preferred gene selection methods. Hierarchical clustering using Ward's method, k-means clustering and Mclust are the clustering methods considered in this paper that achieves the highest adjusted Rand. Normalization can have a significant positive impact on the ability to cluster individuals, and there are indications that

  4. Structure and expression of B-myc, a new member of the myc gene family.

    OpenAIRE

    Ingvarsson, S; Asker, C; Axelson, H; Klein, G; Sümegi, J

    1988-01-01

    The myc family of genes contains five functional members. We describe the cloning of a new member of the myc family from rat genomic and cDNA libraries, designated B-myc. A fragment of cloned B-myc was used to map the corresponding rat locus by Southern blotting of DNA prepared from rat X mouse somatic cell hybrids. B-myc mapped to rat chromosome 3. We have previously mapped the c-myc to rat chromosome 7 (J. Sümegi, J. Spira, H. Bazin, J. Szpirer, G. Levan, and G. Klein, Nature [London] 306:4...

  5. Physical and genetic map of the major nif gene cluster from Azotobacter vinelandii.

    OpenAIRE

    Jacobson, M R; Brigle, K E; Bennett, L T; Setterquist, R A; Wilson, M S; Cash, V L; Beynon, J; Newton, W E; Dean, D R

    1989-01-01

    Determination of a 28,793-base-pair DNA sequence of a region from the Azotobacter vinelandii genome that includes and flanks the nitrogenase structural gene region was completed. This information was used to revise the previously proposed organization of the major nif cluster. The major nif cluster from A. vinelandii encodes 15 nif-specific genes whose products bear significant structural identity to the corresponding nif-specific gene products from Klebsiella pneumoniae. These genes include ...

  6. Recursive Cluster Elimination (RCE for classification and feature selection from gene expression data

    Directory of Open Access Journals (Sweden)

    Showe Louise C

    2007-05-01

    Full Text Available Abstract Background Classification studies using gene expression datasets are usually based on small numbers of samples and tens of thousands of genes. The selection of those genes that are important for distinguishing the different sample classes being compared, poses a challenging problem in high dimensional data analysis. We describe a new procedure for selecting significant genes as recursive cluster elimination (RCE rather than recursive feature elimination (RFE. We have tested this algorithm on six datasets and compared its performance with that of two related classification procedures with RFE. Results We have developed a novel method for selecting significant genes in comparative gene expression studies. This method, which we refer to as SVM-RCE, combines K-means, a clustering method, to identify correlated gene clusters, and Support Vector Machines (SVMs, a supervised machine learning classification method, to identify and score (rank those gene clusters for the purpose of classification. K-means is used initially to group genes into clusters. Recursive cluster elimination (RCE is then applied to iteratively remove those clusters of genes that contribute the least to the classification performance. SVM-RCE identifies the clusters of correlated genes that are most significantly differentially expressed between the sample classes. Utilization of gene clusters, rather than individual genes, enhances the supervised classification accuracy of the same data as compared to the accuracy when either SVM or Penalized Discriminant Analysis (PDA with recursive feature elimination (SVM-RFE and PDA-RFE are used to remove genes based on their individual discriminant weights. Conclusion SVM-RCE provides improved classification accuracy with complex microarray data sets when it is compared to the classification accuracy of the same datasets using either SVM-RFE or PDA-RFE. SVM-RCE identifies clusters of correlated genes that when considered together

  7. The complete coenzyme B12 biosynthesis gene cluster of Lactobacillus reuteri CRL1098

    NARCIS (Netherlands)

    Santos, F.; Vera, J.L.; van der Heijden, R.; Valdez, G.; de Vos, W.M.; Sesma, F.; Hugenholtz, J.

    2008-01-01

    The coenzyme B12 production pathway in Lactobacillus reuteri has been deduced using a combination of genetic, biochemical and bioinformatics approaches. The coenzyme B12 gene cluster of Lb. reuteri CRL1098 has the unique feature of clustering together the cbi, cob and hem genes. It consists of 29

  8. The complete coenzyme B12 biosynthesis gene cluster of Lactobacillus reuteri CRL 1098

    NARCIS (Netherlands)

    Santos, dos F.; Vera, J.L.; Heijden, van der R.; Valdez, G.F.; Vos, de W.M.; Sesma, F.; Hugenholtz, J.

    2008-01-01

    The coenzyme B12 production pathway in Lactobacillus reuteri has been deduced using a combination of genetic, biochemical and bioinformatics approaches. The coenzyme B12 gene cluster of Lb. reuteri CRL1098 has the unique feature of clustering together the cbi, cob and hem genes. It consists of 29

  9. Rough-fuzzy clustering for grouping functionally similar genes from microarray data.

    Science.gov (United States)

    Maji, Pradipta; Paul, Sushmita

    2013-01-01

    Gene expression data clustering is one of the important tasks of functional genomics as it provides a powerful tool for studying functional relationships of genes in a biological process. Identifying coexpressed groups of genes represents the basic challenge in gene clustering problem. In this regard, a gene clustering algorithm, termed as robust rough-fuzzy c-means, is proposed judiciously integrating the merits of rough sets and fuzzy sets. While the concept of lower and upper approximations of rough sets deals with uncertainty, vagueness, and incompleteness in cluster definition, the integration of probabilistic and possibilistic memberships of fuzzy sets enables efficient handling of overlapping partitions in noisy environment. The concept of possibilistic lower bound and probabilistic boundary of a cluster, introduced in robust rough-fuzzy c-means, enables efficient selection of gene clusters. An efficient method is proposed to select initial prototypes of different gene clusters, which enables the proposed c-means algorithm to converge to an optimum or near optimum solutions and helps to discover coexpressed gene clusters. The effectiveness of the algorithm, along with a comparison with other algorithms, is demonstrated both qualitatively and quantitatively on 14 yeast microarray data sets.

  10. Dominant control region of the human β- like globin gene cluster

    NARCIS (Netherlands)

    Blom van Assendelft, Margaretha van

    1989-01-01

    The structure and regulation of the human β -like globin gene cluster has been studied extensively. Genetic disorders connected with this gene cluster are responsible for human diseases associated with high levels of morbidity and mortality, such as β-thalassaemia and sickle cell anaemia. The work

  11. A phylogenomic gene cluster resource: The phylogeneticallyinferred groups (PhlGs) database

    Energy Technology Data Exchange (ETDEWEB)

    Dehal, Paramvir S.; Boore, Jeffrey L.

    2005-08-25

    We present here the PhIGs database, a phylogenomic resource for sequenced genomes. Although many methods exist for clustering gene families, very few attempt to create truly orthologous clusters sharing descent from a single ancestral gene across a range of evolutionary depths. Although these non-phylogenetic gene family clusters have been used broadly for gene annotation, errors are known to be introduced by the artifactual association of slowly evolving paralogs and lack of annotation for those more rapidly evolving. A full phylogenetic framework is necessary for accurate inference of function and for many studies that address pattern and mechanism of the evolution of the genome. The automated generation of evolutionary gene clusters, creation of gene trees, determination of orthology and paralogy relationships, and the correlation of this information with gene annotations, expression information, and genomic context is an important resource to the scientific community.

  12. Gene expression of NOX family members and their clinical significance in hepatocellular carcinoma.

    Science.gov (United States)

    Eun, Hyuk Soo; Cho, Sang Yeon; Joo, Jong Seok; Kang, Sun Hyung; Moon, Hee Seok; Lee, Eaum Seok; Kim, Seok Hyun; Lee, Byung Seok

    2017-09-11

    Nicotinamide adenine dinucleotide phosphate (NADPH) oxidase complex-derived reactive oxygen species (ROS) promote chronic liver inflammation and remodeling that can drive hepatocellular carcinoma development. The role of NOX expression in hepatocellular carcinoma (HCC) has been partially investigated; however, the clinical relevance of collective or individual NOX family member expression for HCC survival remains unclear. Here, we obtained NOX mRNA expression data for 377 HCC samples and 21 normal liver controls from the TCGA data portal and performed Kaplan-Meier survival, gene ontology functional enrichment, and gene set enrichment analyses. Although most NOX genes exhibited little change, some were significantly induced in HCC compared to that in normal controls. In addition, HCC survival analyses indicated better overall survival in patients with high NOX4 and DUOX1 expression, whereas patients with high NOX1/2/5 expression showed poor prognoses. Gene-neighbour and gene set enrichment analyses revealed that NOX1/2/5 were strongly correlated with genes associated with cancer cell survival and metastasis, whereas increased NOX4 and DUOX1 expression was associated with genes that inhibit tumour progression. On the basis of these data, NOX family gene expression analysis could be a predictor of survival and identify putative therapeutic targets in HCC.

  13. Organization and differential regulation of a cluster of lignin peroxidase genes of Phanerochaete chrysosporium

    Science.gov (United States)

    Philip. Stewart; Daniel. Cullen

    1999-06-01

    The lignin peroxidases of Phanerochaete chrysosporium are encoded by a minimum of 10 closely related genes. Physical and genetic mapping of a cluster of eight lip genes revealed six genes occurring in pairs and transcriptionally convergent, suggesting that portions of the lip family arose by gene duplication events. The completed sequence of 1ipG and lipJ, together...

  14. An effective fuzzy kernel clustering analysis approach for gene expression data.

    Science.gov (United States)

    Sun, Lin; Xu, Jiucheng; Yin, Jiaojiao

    2015-01-01

    Fuzzy clustering is an important tool for analyzing microarray data. A major problem in applying fuzzy clustering method to microarray gene expression data is the choice of parameters with cluster number and centers. This paper proposes a new approach to fuzzy kernel clustering analysis (FKCA) that identifies desired cluster number and obtains more steady results for gene expression data. First of all, to optimize characteristic differences and estimate optimal cluster number, Gaussian kernel function is introduced to improve spectrum analysis method (SAM). By combining subtractive clustering with max-min distance mean, maximum distance method (MDM) is proposed to determine cluster centers. Then, the corresponding steps of improved SAM (ISAM) and MDM are given respectively, whose superiority and stability are illustrated through performing experimental comparisons on gene expression data. Finally, by introducing ISAM and MDM into FKCA, an effective improved FKCA algorithm is proposed. Experimental results from public gene expression data and UCI database show that the proposed algorithms are feasible for cluster analysis, and the clustering accuracy is higher than the other related clustering algorithms.

  15. A robust approach based on Weibull distribution for clustering gene expression data

    Directory of Open Access Journals (Sweden)

    Gong Binsheng

    2011-05-01

    Full Text Available Abstract Background Clustering is a widely used technique for analysis of gene expression data. Most clustering methods group genes based on the distances, while few methods group genes according to the similarities of the distributions of the gene expression levels. Furthermore, as the biological annotation resources accumulated, an increasing number of genes have been annotated into functional categories. As a result, evaluating the performance of clustering methods in terms of the functional consistency of the resulting clusters is of great interest. Results In this paper, we proposed the WDCM (Weibull Distribution-based Clustering Method, a robust approach for clustering gene expression data, in which the gene expressions of individual genes are considered as the random variables following unique Weibull distributions. Our WDCM is based on the concept that the genes with similar expression profiles have similar distribution parameters, and thus the genes are clustered via the Weibull distribution parameters. We used the WDCM to cluster three cancer gene expression data sets from the lung cancer, B-cell follicular lymphoma and bladder carcinoma and obtained well-clustered results. We compared the performance of WDCM with k-means and Self Organizing Map (SOM using functional annotation information given by the Gene Ontology (GO. The results showed that the functional annotation ratios of WDCM are higher than those of the other methods. We also utilized the external measure Adjusted Rand Index to validate the performance of the WDCM. The comparative results demonstrate that the WDCM provides the better clustering performance compared to k-means and SOM algorithms. The merit of the proposed WDCM is that it can be applied to cluster incomplete gene expression data without imputing the missing values. Moreover, the robustness of WDCM is also evaluated on the incomplete data sets. Conclusions The results demonstrate that our WDCM produces clusters

  16. CAR gene cluster and transcript levels of carotenogenic genes in Rhodotorula mucilaginosa.

    Science.gov (United States)

    Landolfo, Sara; Ianiri, Giuseppe; Camiolo, Salvatore; Porceddu, Andrea; Mulas, Giuliana; Chessa, Rossella; Zara, Giacomo; Mannazzu, Ilaria

    2018-01-01

    A molecular approach was applied to the study of the carotenoid biosynthetic pathway of Rhodotorula mucilaginosa. At first, functional annotation of the genome of R. mucilaginosa C2.5t1 was carried out and gene ontology categories were assigned to 4033 predicted proteins. Then, a set of genes involved in different steps of carotenogenesis was identified and those coding for phytoene desaturase, phytoene synthase/lycopene cyclase and carotenoid dioxygenase (CAR genes) proved to be clustered within a region of ~10 kb. Quantitative PCR of the genes involved in carotenoid biosynthesis showed that genes coding for 3-hydroxy-3-methylglutharyl-CoA reductase and mevalonate kinase are induced during exponential phase while no clear trend of induction was observed for phytoene synthase/lycopene cyclase and phytoene dehydrogenase encoding genes. Thus, in R. mucilaginosa the induction of genes involved in the early steps of carotenoid biosynthesis is transient and accompanies the onset of carotenoid production, while that of CAR genes does not correlate with the amount of carotenoids produced. The transcript levels of genes coding for carotenoid dioxygenase, superoxide dismutase and catalase A increased during the accumulation of carotenoids, thus suggesting the activation of a mechanism aimed at the protection of cell structures from oxidative stress during carotenoid biosynthesis. The data presented herein, besides being suitable for the elucidation of the mechanisms that underlie carotenoid biosynthesis, will contribute to boosting the biotechnological potential of this yeast by improving the outcome of further research efforts aimed at also exploring other features of interest.

  17. Effects of gene disruptions in the nisin gene cluster of Lactococcus lactis on nisin production and producer immunity

    NARCIS (Netherlands)

    Ra, Runar; Beerthuyzen, Marke M.; Vos, Willem M. de; Saris, Per E.J.; Kuipers, Oscar P.

    1999-01-01

    The lantibiotic nisin is produced by several strains of Lactococcus lactis subsp. lactis. The chromosomally located gene cluster nisABTCIPRKFEG is required for biosynthesis, development of immunity, and regulation of gene expression. In-frame deletions in the nisB and nisT genes, and disruption of

  18. Clustering of Drosophila melanogaster immune genes in interplay with recombination rate.

    Directory of Open Access Journals (Sweden)

    K Mathias Wegner

    Full Text Available BACKGROUND: Gene order in eukaryotic chromosomes is not random and has been linked to coordination of gene expression, chromatin structure and also recombination rate. The evolution of recombination rate is especially relevant for genes involved in immunity because host-parasite co-evolution could select for increased recombination rate (Red Queen hypothesis. To identify patterns left by the intimate interaction between hosts and parasites, I analysed the genomic parameters of the immune genes from 24 gene families/groups of Drosophila melanogaster. PRINCIPAL FINDINGS: Immune genes that directly interact with the pathogen (i.e. recognition and effector genes clustered in regions of higher recombination rates. Out of these, clustered effector genes were transcribed fastest indicating that transcriptional control might be one major cause for cluster formation. The relative position of clusters to each other, on the other hand, cannot be explained by transcriptional control per se. Drosophila immune genes that show epistatic interactions can be found at an average distance of 15.44+/-2.98 cM, which is considerably closer than genes that do not interact (30.64+/-1.95 cM. CONCLUSIONS: Epistatically interacting genes rarely belong to the same cluster, which supports recent models of optimal recombination rates between interacting genes in antagonistic host-parasite co-evolution. These patterns suggest that formation of local clusters might be a result of transcriptional control, but that in the condensed genome of D. melanogaster relative position of these clusters may be a result of selection for optimal rather than maximal recombination rates between these clusters.

  19. DNACLUST: accurate and efficient clustering of phylogenetic marker genes

    Directory of Open Access Journals (Sweden)

    Liu Bo

    2011-06-01

    Full Text Available Abstract Background Clustering is a fundamental operation in the analysis of biological sequence data. New DNA sequencing technologies have dramatically increased the rate at which we can generate data, resulting in datasets that cannot be efficiently analyzed by traditional clustering methods. This is particularly true in the context of taxonomic profiling of microbial communities through direct sequencing of phylogenetic markers (e.g. 16S rRNA - the domain that motivated the work described in this paper. Many analysis approaches rely on an initial clustering step aimed at identifying sequences that belong to the same operational taxonomic unit (OTU. When defining OTUs (which have no universally accepted definition, scientists must balance a trade-off between computational efficiency and biological accuracy, as accurately estimating an environment's phylogenetic composition requires computationally-intensive analyses. We propose that efficient and mathematically well defined clustering methods can benefit existing taxonomic profiling approaches in two ways: (i the resulting clusters can be substituted for OTUs in certain applications; and (ii the clustering effectively reduces the size of the data-sets that need to be analyzed by complex phylogenetic pipelines (e.g., only one sequence per cluster needs to be provided to downstream analyses. Results To address the challenges outlined above, we developed DNACLUST, a fast clustering tool specifically designed for clustering highly-similar DNA sequences. Given a set of sequences and a sequence similarity threshold, DNACLUST creates clusters whose radius is guaranteed not to exceed the specified threshold. Underlying DNACLUST is a greedy clustering strategy that owes its performance to novel sequence alignment and k-mer based filtering algorithms. DNACLUST can also produce multiple sequence alignments for every cluster, allowing users to manually inspect clustering results, and enabling more

  20. An Effective Tri-Clustering Algorithm Combining Expression Data with Gene Regulation Information

    Directory of Open Access Journals (Sweden)

    Ao Li

    2009-04-01

    Full Text Available Motivation: Bi-clustering algorithms aim to identify sets of genes sharing similar expression patterns across a subset of conditions. However direct interpretation or prediction of gene regulatory mechanisms may be difficult as only gene expression data is used. Information about gene regulators may also be available, most commonly about which transcription factors may bind to the promoter region and thus control the expression level of a gene. Thus a method to integrate gene expression and gene regulation information is desirable for clustering and analyzing. Methods: By incorporating gene regulatory information with gene expression data, we define regulated expression values (REV as indicators of how a gene is regulated by a specific factor. Existing bi-clustering methods are extended to a three dimensional data space by developing a heuristic TRI-Clustering algorithm. An additional approach named Automatic Boundary Searching algorithm (ABS is introduced to automatically determine the boundary threshold. Results: Results based on incorporating ChIP-chip data representing transcription factor-gene interactions show that the algorithms are efficient and robust for detecting tri-clusters. Detailed analysis of the tri-cluster extracted from yeast sporulation REV data shows genes in this cluster exhibited significant differences during the middle and late stages. The implicated regulatory network was then reconstructed for further study of defined regulatory mechanisms. Topological and statistical analysis of this network demonstrated evidence of significant changes of TF activities during the different stages of yeast sporulation, and suggests this approach might be a general way to study regulatory networks undergoing transformations.

  1. Lichen Biosynthetic Gene Clusters Part II: Homology Mapping Suggests a Functional Diversity.

    Science.gov (United States)

    Bertrand, Robert L; Abdel-Hameed, Mona; Sorensen, John L

    2018-02-27

    Lichens are renowned for their diverse natural products though little is known of the genetic programming dictating lichen natural product biosynthesis. We sequenced the genome of Cladonia uncialis and profiled its secondary metabolite biosynthetic gene clusters. Through a homology searching approach, we can now propose specific functions for gene products as well as the biosynthetic pathways that are encoded in several of these gene clusters. This analysis revealed that the lichen genome encodes the required enzymes for patulin and betaenones A-C biosynthesis, fungal toxins not known to be produced by lichens. Within several gene clusters, some (but not all) genes are genetically similar to genes devoted to secondary metabolite biosynthesis in Fungi. These lichen clusters also contain accessory tailoring genes without such genetic similarity, suggesting that the encoded tailoring enzymes perform distinct chemical transformations. We hypothesize that C. uncialis gene clusters have evolved by shuffling components of ancestral fungal clusters to create new series of chemical steps, leading to the production of hitherto undiscovered derivatives of fungal secondary metabolites.

  2. Identification and expression pattern of a new carotenoid cleavage dioxygenase gene member from Bixa orellana

    Science.gov (United States)

    Rodríguez-Ávila, N. L.; Narváez-Zapata, J. A.; Ramírez-Benítez, J. E.; Aguilar-Espinosa, M. L.; Rivera-Madrid, R.

    2011-01-01

    Carotenoid cleavage dioxygenases (CCDs) are a class of enzymes involved in the biosynthesis of a broad diversity of secondary metabolites known as apocarotenoids. In plants, CCDs are part of a genetic family with members which cleave specific double bonds of carotenoid molecules. CCDs are involved in the production of diverse and important metabolites such as vitamin A and abscisic acid (ABA). Bixa orellana L. is the main source of the natural pigment annatto or bixin, an apocarotenoid accumulated in large quantities in its seeds. Bixin biosynthesis has been studied and the involvement of a CCD has been confirmed in vitro. However, the CCD genes involved in the biosynthesis of the wide variety of apocarotenoids found in this plant have not been well documented. In this study, a new CCD1 gene member (BoCCD1) was identified and its expression was charaterized in different plant tissues of B. orellana plantlets and adult plants. The BoCCD1 sequence showed high homology with plant CCD1s involved mainly in the cleavage of carotenoids in several sites to generate multiple apocarotenoid products. Here, the expression profiles of the BoCCD1 gene were analysed and discussed in relation to total carotenoids and other important apocarotenoids such as bixin. PMID:21813796

  3. Leveraging long sequencing reads to investigate R-gene clustering and variation in sugar beet

    Science.gov (United States)

    Host-pathogen interactions are of prime importance to modern agriculture. Plants utilize various types of resistance genes to mitigate pathogen damage. Identification of the specific gene responsible for a specific resistance can be difficult due to duplication and clustering within R-gene families....

  4. Horizontal transfer of a nitrate assimilation gene cluster and ecological transitions in fungi: a phylogenetic study.

    Directory of Open Access Journals (Sweden)

    Jason C Slot

    Full Text Available High affinity nitrate assimilation genes in fungi occur in a cluster (fHANT-AC that can be coordinately regulated. The clustered genes include nrt2, which codes for a high affinity nitrate transporter; euknr, which codes for nitrate reductase; and NAD(PH-nir, which codes for nitrite reductase. Homologs of genes in the fHANT-AC occur in other eukaryotes and prokaryotes, but they have only been found clustered in the oomycete Phytophthora (heterokonts. We performed independent and concatenated phylogenetic analyses of homologs of all three genes in the fHANT-AC. Phylogenetic analyses limited to fungal sequences suggest that the fHANT-AC has been transferred horizontally from a basidiomycete (mushrooms and smuts to an ancestor of the ascomycetous mold Trichoderma reesei. Phylogenetic analyses of sequences from diverse eukaryotes and eubacteria, and cluster structure, are consistent with a hypothesis that the fHANT-AC was assembled in a lineage leading to the oomycetes and was subsequently transferred to the Dikarya (Ascomycota+Basidiomycota, which is a derived fungal clade that includes the vast majority of terrestrial fungi. We propose that the acquisition of high affinity nitrate assimilation contributed to the success of Dikarya on land by allowing exploitation of nitrate in aerobic soils, and the subsequent transfer of a complete assimilation cluster improved the fitness of T. reesei in a new niche. Horizontal transmission of this cluster of functionally integrated genes supports the "selfish operon" hypothesis for maintenance of gene clusters.

  5. High or low correlation between co-occuring gene clusters and 16S rRNA gene phylogeny.

    Science.gov (United States)

    Rudi, Knut; Sekelja, Monika

    2013-02-01

    Ribosomal RNA (rRNA) genes are universal for all living organisms. Yet, the correspondence between genome composition and rRNA phylogeny remains poorly known. The aim of this study was to use the information from genome sequence databases to address the correlation between rRNA gene phylogeny and total gene composition in bacteria. This was done by analysing 327 genomes with TIGRFAM functional gene annotations. Our approach consisted of two steps. First, we searched for discriminatory clusters of co-occurring genes. Using a multivariate statistical approach, we identified 11 such clusters which contain genes that were co-occurring only in a subset of genomes and contributed to explain the gene content differences between genome subsets. Second, we mapped the discovered clusters to 16S rRNA-based phylogeny and calculated the correlation between co-occuring genes and phylogeny. Six of the 11 clusters exhibited significant correlation with 16S rRNA gene phylogeny. The most distinct phylogenetic finding was a high correlation between iron-sulfur oxidoreductases in combination with carbon nitrogen ligases and Chlorobium. The other correlations identified covered relatively large phylogroups: Actinobacteria were positively associated with kinases, while Gammaproteobacteria were positively associated with methylases and acyltransferases. The suggested functional differences between higher phylogroups, however, need experimental verification. © 2012 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.

  6. Activation and clustering of a Plasmodium falciparum var gene are affected by subtelomeric sequences.

    Science.gov (United States)

    Duffy, Michael F; Tang, Jingyi; Sumardy, Fransisca; Nguyen, Hanh H T; Selvarajah, Shamista A; Josling, Gabrielle A; Day, Karen P; Petter, Michaela; Brown, Graham V

    2017-01-01

    The Plasmodium falciparum var multigene family encodes the cytoadhesive, variant antigen PfEMP1. P. falciparum antigenic variation and cytoadhesion specificity are controlled by epigenetic switching between the single, or few, simultaneously expressed var genes. Most var genes are maintained in perinuclear clusters of heterochromatic telomeres. The active var gene(s) occupy a single, perinuclear var expression site. It is unresolved whether the var expression site forms in situ at a telomeric cluster or whether it is an extant compartment to which single chromosomes travel, thus controlling var switching. Here we show that transcription of a var gene did not require decreased colocalisation with clusters of telomeres, supporting var expression site formation in situ. However following recombination within adjacent subtelomeric sequences, the same var gene was persistently activated and did colocalise less with telomeric clusters. Thus, participation in stable, heterochromatic, telomere clusters and var switching are independent but are both affected by subtelomeric sequences. The var expression site colocalised with the euchromatic mark H3K27ac to a greater extent than it did with heterochromatic H3K9me3. H3K27ac was enriched within the active var gene promoter even when the var gene was transiently repressed in mature parasites and thus H3K27ac may contribute to var gene epigenetic memory. © 2016 Federation of European Biochemical Societies.

  7. Evolution of a pentameral body plan was not linked to translocation of anterior Hox genes: the echinoderm HOX cluster revisited.

    Science.gov (United States)

    Byrne, Maria; Martinez, Pedro; Morris, Valerie

    2016-01-01

    Echinodermata is a large phylum of marine invertebrates characterized by an adult, pentameral body plan. This morphology is clearly derived as all members of Deuterostomia (the superphylum to which they belong) have a bilateral body plan. The origin of the pentameral plan has been the subject of intense debate. It is clear that the ancestor of Echinodermata had a bilateral plan but how this ancestor transformed its body "architecture" in such a drastic manner is not clear. Data from the fossil record and ontogeny are sparse and, so far, not very informative. The sequencing of the sea urchin genome a decade ago opened the possibility that the pentameral body plan was a consequence of a broken Hox cluster and a series of papers dwelt on the putative relationship between Hox gene arrangements in the chromosomes and the origin of pentamery. This relationship, sound as it was, is challenged by the revelation that the sea star HOX cluster is, in fact, intact, thus falsifying the hypothesis of a direct relationship between HOX cluster arrangement and the origin of the pentameral body plan. Here, we explore the relationship between Hox gene arrangements and echinoderm body "architecture," the expression of Hox genes in development and alternative scenarios for the origin of pentamery, with putative roles for signaling centers in generating multiple axes. © 2016 Wiley Periodicals, Inc.

  8. AntiSMASH 4.0 - improvements in chemistry prediction and gene cluster boundary identification

    NARCIS (Netherlands)

    Blin, Kai; Wolf, Thomas; Chevrette, Marc G.; Lu, Xiaowen; Schwalen, Christopher J.; Kautsar, Satria A.; Suarez Duran, Hernando G.; Los Santos, De Emmanuel L.C.; Kim, Hyun Uk; Nave, Mariana; Dickschat, Jeroen S.; Mitchell, Douglas A.; Shelest, Ekaterina; Breitling, Rainer; Takano, Eriko; Lee, Sang Yup; Weber, Tilmann; Medema, Marnix H.

    2017-01-01

    Many antibiotics, chemotherapeutics, crop protection agents and food preservatives originate from molecules produced by bacteria, fungi or plants. In recent years, genome mining methodologies have been widely adopted to identify and characterize the biosynthetic gene clusters encoding the production

  9. CASSIS and SMIPS: promoter-based prediction of secondary metabolite gene clusters in eukaryotic genomes.

    Science.gov (United States)

    Wolf, Thomas; Shelest, Vladimir; Nath, Neetika; Shelest, Ekaterina

    2016-04-15

    Secondary metabolites (SM) are structurally diverse natural products of high pharmaceutical importance. Genes involved in their biosynthesis are often organized in clusters, i.e., are co-localized and co-expressed. In silico cluster prediction in eukaryotic genomes remains problematic mainly due to the high variability of the clusters' content and lack of other distinguishing sequence features. We present Cluster Assignment by Islands of Sites (CASSIS), a method for SM cluster prediction in eukaryotic genomes, and Secondary Metabolites by InterProScan (SMIPS), a tool for genome-wide detection of SM key enzymes ('anchor' genes): polyketide synthases, non-ribosomal peptide synthetases and dimethylallyl tryptophan synthases. Unlike other tools based on protein similarity, CASSIS exploits the idea of co-regulation of the cluster genes, which assumes the existence of common regulatory patterns in the cluster promoters. The method searches for 'islands' of enriched cluster-specific motifs in the vicinity of anchor genes. It was validated in a series of cross-validation experiments and showed high sensitivity and specificity. CASSIS and SMIPS are freely available at https://sbi.hki-jena.de/cassis thomas.wolf@leibniz-hki.de or ekaterina.shelest@leibniz-hki.de Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.

  10. Characterization of the fumonisin B2 biosynthetic gene cluster in Aspergillus niger and A. awamori.

    Science.gov (United States)

    Aspergillus niger and A. awamori strains isolated from grapes cultivated in Mediterranean basin were examined for fumonisin B2 (FB2) production and presence/absence of sequences within the fumonisin biosynthetic gene (fum) cluster. Presence of 13 regions in the fum cluster was evaluated by PCR assay...

  11. Molecular population genetics of the β-esterase gene cluster of ...

    Indian Academy of Sciences (India)

    However there are some 'footprints' of directional and balancing selection shaping specific distribution of nucleotide polymorphism within the cluster. Intergenic epistatic selection between Est-6 and Est-6 may play an important role in the evolution of the -esterase gene cluster preserving the putative pseudogene from ...

  12. Intranuclear and higher-order chromatin organization of the major histone gene cluster in breast cancer.

    Science.gov (United States)

    Fritz, Andrew J; Ghule, Prachi N; Boyd, Joseph R; Tye, Coralee E; Page, Natalie A; Hong, Deli; Shirley, David J; Weinheimer, Adam S; Barutcu, Ahmet R; Gerrard, Diana L; Frietze, Seth; van Wijnen, Andre J; Zaidi, Sayyed K; Imbalzano, Anthony N; Lian, Jane B; Stein, Janet L; Stein, Gary S

    2018-02-01

    Alterations in nuclear morphology are common in cancer progression. However, the degree to which gross morphological abnormalities translate into compromised higher-order chromatin organization is poorly understood. To explore the functional links between gene expression and chromatin structure in breast cancer, we performed RNA-seq gene expression analysis on the basal breast cancer progression model based on human MCF10A cells. Positional gene enrichment identified the major histone gene cluster at chromosome 6p22 as one of the most significantly upregulated (and not amplified) clusters of genes from the normal-like MCF10A to premalignant MCF10AT1 and metastatic MCF10CA1a cells. This cluster is subdivided into three sub-clusters of histone genes that are organized into hierarchical topologically associating domains (TADs). Interestingly, the sub-clusters of histone genes are located at TAD boundaries and interact more frequently with each other than the regions in-between them, suggesting that the histone sub-clusters form an active chromatin hub. The anchor sites of loops within this hub are occupied by CTCF, a known chromatin organizer. These histone genes are transcribed and processed at a specific sub-nuclear microenvironment termed the major histone locus body (HLB). While the overall chromatin structure of the major HLB is maintained across breast cancer progression, we detected alterations in its structure that may relate to gene expression. Importantly, breast tumor specimens also exhibit a coordinate pattern of upregulation across the major histone gene cluster. Our results provide a novel insight into the connection between the higher-order chromatin organization of the major HLB and its regulation during breast cancer progression. © 2017 Wiley Periodicals, Inc.

  13. Clusters of Antibiotic Resistance Genes Enriched Together Stay Together in Swine Agriculture.

    Science.gov (United States)

    Johnson, Timothy A; Stedtfeld, Robert D; Wang, Qiong; Cole, James R; Hashsham, Syed A; Looft, Torey; Zhu, Yong-Guan; Tiedje, James M

    2016-04-12

    Antibiotic resistance is a worldwide health risk, but the influence of animal agriculture on the genetic context and enrichment of individual antibiotic resistance alleles remains unclear. Using quantitative PCR followed by amplicon sequencing, we quantified and sequenced 44 genes related to antibiotic resistance, mobile genetic elements, and bacterial phylogeny in microbiomes from U.S. laboratory swine and from swine farms from three Chinese regions. We identified highly abundant resistance clusters: groups of resistance and mobile genetic element alleles that cooccur. For example, the abundance of genes conferring resistance to six classes of antibiotics together with class 1 integrase and the abundance of IS6100-type transposons in three Chinese regions are directly correlated. These resistance cluster genes likely colocalize in microbial genomes in the farms. Resistance cluster alleles were dramatically enriched (up to 1 to 10% as abundant as 16S rRNA) and indicate that multidrug-resistant bacteria are likely the norm rather than an exception in these communities. This enrichment largely occurred independently of phylogenetic composition; thus, resistance clusters are likely present in many bacterial taxa. Furthermore, resistance clusters contain resistance genes that confer resistance to antibiotics independently of their particular use on the farms. Selection for these clusters is likely due to the use of only a subset of the broad range of chemicals to which the clusters confer resistance. The scale of animal agriculture and its wastes, the enrichment and horizontal gene transfer potential of the clusters, and the vicinity of large human populations suggest that managing this resistance reservoir is important for minimizing human risk. Agricultural antibiotic use results in clusters of cooccurring resistance genes that together confer resistance to multiple antibiotics. The use of a single antibiotic could select for an entire suite of resistance genes if

  14. Integrating Data Clustering and Visualization for the Analysis of 3D Gene Expression Data

    Energy Technology Data Exchange (ETDEWEB)

    Data Analysis and Visualization (IDAV) and the Department of Computer Science, University of California, Davis, One Shields Avenue, Davis CA 95616, USA,; nternational Research Training Group ``Visualization of Large and Unstructured Data Sets,' ' University of Kaiserslautern, Germany; Computational Research Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA; Genomics Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley CA 94720, USA; Life Sciences Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley CA 94720, USA,; Computer Science Division,University of California, Berkeley, CA, USA,; Computer Science Department, University of California, Irvine, CA, USA,; All authors are with the Berkeley Drosophila Transcription Network Project, Lawrence Berkeley National Laboratory,; Rubel, Oliver; Weber, Gunther H.; Huang, Min-Yu; Bethel, E. Wes; Biggin, Mark D.; Fowlkes, Charless C.; Hendriks, Cris L. Luengo; Keranen, Soile V. E.; Eisen, Michael B.; Knowles, David W.; Malik, Jitendra; Hagen, Hans; Hamann, Bernd

    2008-05-12

    The recent development of methods for extracting precise measurements of spatial gene expression patterns from three-dimensional (3D) image data opens the way for new analyses of the complex gene regulatory networks controlling animal development. We present an integrated visualization and analysis framework that supports user-guided data clustering to aid exploration of these new complex datasets. The interplay of data visualization and clustering-based data classification leads to improved visualization and enables a more detailed analysis than previously possible. We discuss (i) integration of data clustering and visualization into one framework; (ii) application of data clustering to 3D gene expression data; (iii) evaluation of the number of clusters k in the context of 3D gene expression clustering; and (iv) improvement of overall analysis quality via dedicated post-processing of clustering results based on visualization. We discuss the use of this framework to objectively define spatial pattern boundaries and temporal profiles of genes and to analyze how mRNA patterns are controlled by their regulatory transcription factors.

  15. A phylogenomic gene cluster resource: the Phylogenetically Inferred Groups (PhIGs database

    Directory of Open Access Journals (Sweden)

    Boore Jeffrey L

    2006-04-01

    Full Text Available Abstract Background We present here the PhIGs database, a phylogenomic resource for sequenced genomes. Although many methods exist for clustering gene families, very few attempt to create truly orthologous clusters sharing descent from a single ancestral gene across a range of evolutionary depths. Although these non-phylogenetic gene family clusters have been used broadly for gene annotation, errors are known to be introduced by the artifactual association of slowly evolving paralogs and lack of annotation for those more rapidly evolving. A full phylogenetic framework is necessary for accurate inference of function and for many studies that address pattern and mechanism of the evolution of the genome. The automated generation of evolutionary gene clusters, creation of gene trees, determination of orthology and paralogy relationships, and the correlation of this information with gene annotations, expression information, and genomic context is an important resource to the scientific community. Discussion The PhIGs database currently contains 23 completely sequenced genomes of fungi and metazoans, containing 409,653 genes that have been grouped into 42,645 gene clusters. Each gene cluster is built such that the gene sequence distances are consistent with the known organismal relationships and in so doing, maximizing the likelihood for the clusters to represent truly orthologous genes. The PhIGs website contains tools that allow the study of genes within their phylogenetic framework through keyword searches on annotations, such as GO and InterPro assignments, and sequence similarity searches by BLAST and HMM. In addition to displaying the evolutionary relationships of the genes in each cluster, the website also allows users to view the relative physical positions of homologous genes in specified sets of genomes. Summary Accurate analyses of genes and genomes can only be done within their full phylogenetic context. The PhIGs database and

  16. Unusual Gene Order and Organization of the Sea Urchin HoxCluster

    Energy Technology Data Exchange (ETDEWEB)

    Richardson, Paul M.; Lucas, Susan; Cameron, R. Andrew; Rowen,Lee; Nesbitt, Ryan; Bloom, Scott; Rast, Jonathan P.; Berney, Kevin; Arenas-Mena, Cesar; Martinez, Pedro; Davidson, Eric H.; Peterson, KevinJ.; Hood, Leroy

    2005-05-10

    The highly consistent gene order and axial colinear expression patterns found in vertebrate hox gene clusters are less well conserved across the rest of bilaterians. We report the first deuterostome instance of an intact hox cluster with a unique gene order where the paralog groups are not expressed in a sequential manner. The finished sequence from BAC clones from the genome of the sea urchin, Strongylocentrotus purpuratus, reveals a gene order wherein the anterior genes (Hox1, Hox2 and Hox3) lie nearest the posterior genes in the cluster such that the most 3' gene is Hox5. (The gene order is : 5'-Hox1,2, 3, 11/13c, 11/13b, '11/13a, 9/10, 8, 7, 6, 5 - 3)'. The finished sequence result is corroborated by restriction mapping evidence and BAC-end scaffold analyses. Comparisons with a putative ancestral deuterostome Hox gene cluster suggest that the rearrangements leading to the sea urchin gene order were many and complex.

  17. Methods for evaluating clustering algorithms for gene expression data using a reference set of functional classes

    Directory of Open Access Journals (Sweden)

    Datta Somnath

    2006-08-01

    Full Text Available Abstract Background A cluster analysis is the most commonly performed procedure (often regarded as a first step on a set of gene expression profiles. In most cases, a post hoc analysis is done to see if the genes in the same clusters can be functionally correlated. While past successes of such analyses have often been reported in a number of microarray studies (most of which used the standard hierarchical clustering, UPGMA, with one minus the Pearson's correlation coefficient as a measure of dissimilarity, often times such groupings could be misleading. More importantly, a systematic evaluation of the entire set of clusters produced by such unsupervised procedures is necessary since they also contain genes that are seemingly unrelated or may have more than one common function. Here we quantify the performance of a given unsupervised clustering algorithm applied to a given microarray study in terms of its ability to produce biologically meaningful clusters using a reference set of functional classes. Such a reference set may come from prior biological knowledge specific to a microarray study or may be formed using the growing databases of gene ontologies (GO for the annotated genes of the relevant species. Results In this paper, we introduce two performance measures for evaluating the results of a clustering algorithm in its ability to produce biologically meaningful clusters. The first measure is a biological homogeneity index (BHI. As the name suggests, it is a measure of how biologically homogeneous the clusters are. This can be used to quantify the performance of a given clustering algorithm such as UPGMA in grouping genes for a particular data set and also for comparing the performance of a number of competing clustering algorithms applied to the same data set. The second performance measure is called a biological stability index (BSI. For a given clustering algorithm and an expression data set, it measures the consistency of the clustering

  18. A Link-Based Cluster Ensemble Approach For Improved Gene Expression Data Analysis

    Directory of Open Access Journals (Sweden)

    P.Balaji

    2015-01-01

    Full Text Available Abstract It is difficult from possibilities to select a most suitable effective way of clustering algorithm and its dataset for a defined set of gene expression data because we have a huge number of ways and huge number of gene expressions. At present many researchers are preferring to use hierarchical clustering in different forms this is no more totally optimal. Cluster ensemble research can solve this type of problem by automatically merging multiple data partitions from a wide range of different clusterings of any dimensions to improve both the quality and robustness of the clustering result. But we have many existing ensemble approaches using an association matrix to condense sample-cluster and co-occurrence statistics and relations within the ensemble are encapsulated only at raw level while the existing among clusters are totally discriminated. Finding these missing associations can greatly expand the capability of those ensemble methodologies for microarray data clustering. We propose general K-means cluster ensemble approach for the clustering of general categorical data into required number of partitions.

  19. Salicin regulates the expression of functional 'youth gene clusters' to reflect a more youthful gene expression profile.

    Science.gov (United States)

    Gopaul, R; Knaggs, H E; Lephart, J

    2011-10-01

    There are a variety of biological mechanisms that contribute to specific characteristics of ageing skin; for example, the loss of skin structure proteins, increased susceptibility to UV-induced pigmentation and/or loss of hydration. Each of these biological processes is influenced by specific groups of genes. In this research, we have identified groups of genes associated with specific clinical signs of skin ageing and refer to these as functional 'youth gene clusters'. In this study, quantitative real-time polymerase chain reaction (qPCR) was used to investigate the effects of topical application of salicin in regulating the expression of functional 'youth gene clusters' to reflect a more youthful skin profile and reduce the appearance of attributes associated with skin ageing. Results showed that salicin significantly influences the gene expression profiles of treated human equivalent full-thickness skin, by regulating the expression of genes associated with various biological processes involving skin structure, skin hydration, pigmentation and cellular differentiation. Based on the findings from this experiment, salicin was identified as a key ingredient that may regulate functional 'youth gene clusters' to reflect a more youthful gene expression profile by increasing the expression of genes responsible for youthful skin and decreasing the expression of genes responsible for the appearance of aged skin. © 2011 The Authors. ICS © 2011 Society of Cosmetic Scientists and the Société Française de Cosmétologie.

  20. Sequencing rare marine actinomycete genomes reveals high density of unique natural product biosynthetic gene clusters

    Science.gov (United States)

    Schorn, Michelle A.; Alanjary, Mohammad M.; Aguinaldo, Kristen; Korobeynikov, Anton; Podell, Sheila; Patin, Nastassia; Lincecum, Tommie; Jensen, Paul R.; Ziemert, Nadine

    2016-01-01

    Traditional natural product discovery methods have nearly exhausted the accessible diversity of microbial chemicals, making new sources and techniques paramount in the search for new molecules. Marine actinomycete bacteria have recently come into the spotlight as fruitful producers of structurally diverse secondary metabolites, and remain relatively untapped. In this study, we sequenced 21 marine-derived actinomycete strains, rarely studied for their secondary metabolite potential and under-represented in current genomic databases. We found that genome size and phylogeny were good predictors of biosynthetic gene cluster diversity, with larger genomes rivalling the well-known marine producers in the Streptomyces and Salinispora genera. Genomes in the Micrococcineae suborder, however, had consistently the lowest number of biosynthetic gene clusters. By networking individual gene clusters into gene cluster families, we were able to computationally estimate the degree of novelty each genus contributed to the current sequence databases. Based on the similarity measures between all actinobacteria in the Joint Genome Institute's Atlas of Biosynthetic gene Clusters database, rare marine genera show a high degree of novelty and diversity, with Corynebacterium, Gordonia, Nocardiopsis, Saccharomonospora and Pseudonocardia genera representing the highest gene cluster diversity. This research validates that rare marine actinomycetes are important candidates for exploration, as they are relatively unstudied, and their relatives are historically rich in secondary metabolites. PMID:27902408

  1. Yeast homologous recombination-based promoter engineering for the activation of silent natural product biosynthetic gene clusters.

    Science.gov (United States)

    Montiel, Daniel; Kang, Hahk-Soo; Chang, Fang-Yuan; Charlop-Powers, Zachary; Brady, Sean F

    2015-07-21

    Large-scale sequencing of prokaryotic (meta)genomic DNA suggests that most bacterial natural product gene clusters are not expressed under common laboratory culture conditions. Silent gene clusters represent a promising resource for natural product discovery and the development of a new generation of therapeutics. Unfortunately, the characterization of molecules encoded by these clusters is hampered owing to our inability to express these gene clusters in the laboratory. To address this bottleneck, we have developed a promoter-engineering platform to transcriptionally activate silent gene clusters in a model heterologous host. Our approach uses yeast homologous recombination, an auxotrophy complementation-based yeast selection system and sequence orthogonal promoter cassettes to exchange all native promoters in silent gene clusters with constitutively active promoters. As part of this platform, we constructed and validated a set of bidirectional promoter cassettes consisting of orthogonal promoter sequences, Streptomyces ribosome binding sites, and yeast selectable marker genes. Using these tools we demonstrate the ability to simultaneously insert multiple promoter cassettes into a gene cluster, thereby expediting the reengineering process. We apply this method to model active and silent gene clusters (rebeccamycin and tetarimycin) and to the silent, cryptic pseudogene-containing, environmental DNA-derived Lzr gene cluster. Complete promoter refactoring and targeted gene exchange in this "dead" cluster led to the discovery of potent indolotryptoline antiproliferative agents, lazarimides A and B. This potentially scalable and cost-effective promoter reengineering platform should streamline the discovery of natural products from silent natural product biosynthetic gene clusters.

  2. Comparative phylogenomic analyses of teleost fish Hox gene clusters: lessons from the cichlid fish Astatotilapia burtoni

    Directory of Open Access Journals (Sweden)

    Kuehl Jennifer V

    2007-09-01

    Full Text Available Abstract Background Teleost fish have seven paralogous clusters of Hox genes stemming from two complete genome duplications early in vertebrate evolution, and an additional genome duplication during the evolution of ray-finned fish, followed by the secondary loss of one cluster. Gene duplications on the one hand, and the evolution of regulatory sequences on the other, are thought to be among the most important mechanisms for the evolution of new gene functions. Cichlid fish, the largest family of vertebrates with about 2500 species, are famous examples of speciation and morphological diversity. Since this diversity could be based on regulatory changes, we chose to study the coding as well as putative regulatory regions of their Hox clusters within a comparative genomic framework. Results We sequenced and characterized all seven Hox clusters of Astatotilapia burtoni, a haplochromine cichlid fish. Comparative analyses with data from other teleost fish such as zebrafish, two species of pufferfish, stickleback and medaka were performed. We traced losses of genes and microRNAs of Hox clusters, the medaka lineage seems to have lost more microRNAs than the other fish lineages. We found that each teleost genome studied so far has a unique set of Hox genes. The hoxb7a gene was lost independently several times during teleost evolution, the most recent event being within the radiation of East African cichlid fish. The conserved non-coding sequences (CNS encompass a surprisingly large part of the clusters, especially in the HoxAa, HoxCa, and HoxDa clusters. Across all clusters, we observe a trend towards an increased content of CNS towards the anterior end. Conclusion The gene content of Hox clusters in teleost fishes is more variable than expected, with each species studied so far having a different set. Although the highest loss rate of Hox genes occurred immediately after whole genome duplications, our analyses showed that gene loss continued and is

  3. Method of Selection of Bacteria Antibiotic Resistance Genes Based on Clustering of Similar Nucleotide Sequences.

    Science.gov (United States)

    Balashov, I S; Naumov, V A; Borovikov, P I; Gordeev, A B; Dubodelov, D V; Lyubasovskaya, L A; Rodchenko, Yu V; Bystritskii, A A; Aleksandrova, N V; Trofimov, D Yu; Priputnevich, T V

    2017-10-01

    A new method for selection of bacterium antibiotic resistance genes is proposed and tested for solving the problems related to selection of primers for PCR assay. The method implies clustering of similar nucleotide sequences and selection of group primers for all genes of each cluster. Clustering of resistance genes for six groups of antibiotics (aminoglycosides, β-lactams, fluoroquinolones, glycopeptides, macrolides and lincosamides, and fusidic acid) was performed. The method was tested for 81 strains of bacteria of different genera isolated from patients (K. pneumoniae, Staphylococcus spp., S. agalactiae, E. faecalis, E. coli, and G. vaginalis). The results obtained by us are comparable to those in the selection of individual genes; this allows reducing the number of primers necessary for maximum coverage of the known antibiotic resistance genes during PCR analysis.

  4. Clustering based gene expression feature selection method: A computational approach to enrich the classifier efficiency of differentially expressed genes

    KAUST Repository

    Abusamra, Heba

    2016-07-20

    The native nature of high dimension low sample size of gene expression data make the classification task more challenging. Therefore, feature (gene) selection become an apparent need. Selecting a meaningful and relevant genes for classifier not only decrease the computational time and cost, but also improve the classification performance. Among different approaches of feature selection methods, however most of them suffer from several problems such as lack of robustness, validation issues etc. Here, we present a new feature selection technique that takes advantage of clustering both samples and genes. Materials and methods We used leukemia gene expression dataset [1]. The effectiveness of the selected features were evaluated by four different classification methods; support vector machines, k-nearest neighbor, random forest, and linear discriminate analysis. The method evaluate the importance and relevance of each gene cluster by summing the expression level for each gene belongs to this cluster. The gene cluster consider important, if it satisfies conditions depend on thresholds and percentage otherwise eliminated. Results Initial analysis identified 7120 differentially expressed genes of leukemia (Fig. 15a), after applying our feature selection methodology we end up with specific 1117 genes discriminating two classes of leukemia (Fig. 15b). Further applying the same method with more stringent higher positive and lower negative threshold condition, number reduced to 58 genes have be tested to evaluate the effectiveness of the method (Fig. 15c). The results of the four classification methods are summarized in Table 11. Conclusions The feature selection method gave good results with minimum classification error. Our heat-map result shows distinct pattern of refines genes discriminating between two classes of leukemia.

  5. The Local Maximum Clustering Method and Its Application in Microarray Gene Expression Data Analysis

    Directory of Open Access Journals (Sweden)

    Chen Yidong

    2004-01-01

    Full Text Available An unsupervised data clustering method, called the local maximum clustering (LMC method, is proposed for identifying clusters in experiment data sets based on research interest. A magnitude property is defined according to research purposes, and data sets are clustered around each local maximum of the magnitude property. By properly defining a magnitude property, this method can overcome many difficulties in microarray data clustering such as reduced projection in similarities, noises, and arbitrary gene distribution. To critically evaluate the performance of this clustering method in comparison with other methods, we designed three model data sets with known cluster distributions and applied the LMC method as well as the hierarchic clustering method, the -mean clustering method, and the self-organized map method to these model data sets. The results show that the LMC method produces the most accurate clustering results. As an example of application, we applied the method to cluster the leukemia samples reported in the microarray study of Golub et al. (1999.

  6. An original SERPINA3 gene cluster: Elucidation of genomic organization and gene expression in the Bos taurus 21q24 region

    Directory of Open Access Journals (Sweden)

    Ouali Ahmed

    2008-04-01

    Full Text Available Abstract Background The superfamily of serine proteinase inhibitors (serpins is involved in numerous fundamental biological processes as inflammation, blood coagulation and apoptosis. Our interest is focused on the SERPINA3 sub-family. The major human plasma protease inhibitor, α1-antichymotrypsin, encoded by the SERPINA3 gene, is homologous to genes organized in clusters in several mammalian species. However, although there is a similar genic organization with a high degree of sequence conservation, the reactive-centre-loop domains, which are responsible for the protease specificity, show significant divergences. Results We provide additional information by analyzing the situation of SERPINA3 in the bovine genome. A cluster of eight genes and one pseudogene sharing a high degree of identity and the same structural organization was characterized. Bovine SERPINA3 genes were localized by radiation hybrid mapping on 21q24 and only spanned over 235 Kilobases. For all these genes, we propose a new nomenclature from SERPINA3-1 to SERPINA3-8. They share approximately 70% of identity with the human SERPINA3 homologue. In the cluster, we described an original sub-group of six members with an unexpected high degree of conservation for the reactive-centre-loop domain, suggesting a similar peptidase inhibitory pattern. Preliminary expression analyses of these bovSERPINA3s showed different tissue-specific patterns and diverse states of glycosylation and phosphorylation. Finally, in the context of phylogenetic analyses, we improved our knowledge on mammalian SERPINAs evolution. Conclusion Our experimental results update data of the bovine genome sequencing, substantially increase the bovSERPINA3 sub-family and enrich the phylogenetic tree of serpins. We provide new opportunities for future investigations to approach the biological functions of this unusual subset of serine proteinase inhibitors.

  7. Structural and functional characterization of three polyketide synthase gene clusters in Bacillus amyloliquefaciens FZB 42.

    Science.gov (United States)

    Chen, Xiao-Hua; Vater, Joachim; Piel, Jörn; Franke, Peter; Scholz, Romy; Schneider, Kathrin; Koumoutsi, Alexandra; Hitzeroth, Gabriele; Grammel, Nicolas; Strittmatter, Axel W; Gottschalk, Gerhard; Süssmuth, Roderich D; Borriss, Rainer

    2006-06-01

    Although bacterial polyketides are of considerable biomedical interest, the molecular biology of polyketide biosynthesis in Bacillus spp., one of the richest bacterial sources of bioactive natural products, remains largely unexplored. Here we assign for the first time complete polyketide synthase (PKS) gene clusters to Bacillus antibiotics. Three giant modular PKS systems of the trans-acyltransferase type were identified in Bacillus amyloliquefaciens FZB 42. One of them, pks1, is an ortholog of the pksX operon with a previously unknown function in the sequenced model strain Bacillus subtilis 168, while the pks2 and pks3 clusters are novel gene clusters. Cassette mutagenesis combined with advanced mass spectrometric techniques such as matrix-assisted laser desorption ionization-time of flight mass spectrometry and liquid chromatography-electrospray ionization mass spectrometry revealed that the pks1 (bae) and pks3 (dif) gene clusters encode the biosynthesis of the polyene antibiotics bacillaene and difficidin or oxydifficidin, respectively. In addition, B. subtilis OKB105 (pheA sfp(0)), a transformant of the B. subtilis 168 derivative JH642, was shown to produce bacillaene, demonstrating that the pksX gene cluster directs the synthesis of that polyketide. The GenBank accession numbers for gene clusters pks1(bae), pks2, and pks3(dif) are AJ 634060.2, AJ 6340601.2, and AJ 6340602.2, respectively.

  8. Shared gene structures and clusters of mutually exclusive spliced exons within the metazoan muscle myosin heavy chain genes.

    Directory of Open Access Journals (Sweden)

    Martin Kollmar

    Full Text Available Multicellular animals possess two to three different types of muscle tissues. Striated muscles have considerable ultrastructural similarity and contain a core set of proteins including the muscle myosin heavy chain (Mhc protein. The ATPase activity of this myosin motor protein largely dictates muscle performance at the molecular level. Two different solutions to adjusting myosin properties to different muscle subtypes have been identified so far: Vertebrates and nematodes contain many independent differentially expressed Mhc genes while arthropods have single Mhc genes with clusters of mutually exclusive spliced exons (MXEs. The availability of hundreds of metazoan genomes now allowed us to study whether the ancient bilateria already contained MXEs, how MXE complexity subsequently evolved, and whether additional scenarios to control contractile properties in different muscles could be proposed, By reconstructing the Mhc genes from 116 metazoans we showed that all intron positions within the motor domain coding regions are conserved in all bilateria analysed. The last common ancestor of the bilateria already contained a cluster of MXEs coding for part of the loop-2 actin-binding sequence. Subsequently the protostomes and later the arthropods gained many further clusters while MXEs got completely lost independently in several branches (vertebrates and nematodes and species (for example the annelid Helobdella robusta and the salmon louse Lepeophtheirus salmonis. Several bilateria have been found to encode multiple Mhc genes that might all or in part contain clusters of MXEs. Notable examples are a cluster of six tandemly arrayed Mhc genes, of which two contain MXEs, in the owl limpet Lottia gigantea and four Mhc genes with three encoding MXEs in the predatory mite Metaseiulus occidentalis. Our analysis showed that similar solutions to provide different myosin isoforms (multiple genes or clusters of MXEs or both have independently been developed

  9. Prediction of operon-like gene clusters in the Arabidopsis thaliana genome based on co-expression analysis of neighboring genes.

    Science.gov (United States)

    Wada, Masayoshi; Takahashi, Hiroki; Altaf-Ul-Amin, Md; Nakamura, Kensuke; Hirai, Masami Y; Ohta, Daisaku; Kanaya, Shigehiko

    2012-07-15

    Operon-like arrangements of genes occur in eukaryotes ranging from yeasts and filamentous fungi to nematodes, plants, and mammals. In plants, several examples of operon-like gene clusters involved in metabolic pathways have recently been characterized, e.g. the cyclic hydroxamic acid pathways in maize, the avenacin biosynthesis gene clusters in oat, the thalianol pathway in Arabidopsis thaliana, and the diterpenoid momilactone cluster in rice. Such operon-like gene clusters are defined by their co-regulation or neighboring positions within immediate vicinity of chromosomal regions. A comprehensive analysis of the expression of neighboring genes therefore accounts a crucial step to reveal the complete set of operon-like gene clusters within a genome. Genome-wide prediction of operon-like gene clusters should contribute to functional annotation efforts and provide novel insight into evolutionary aspects acquiring certain biological functions as well. We predicted co-expressed gene clusters by comparing the Pearson correlation coefficient of neighboring genes and randomly selected gene pairs, based on a statistical method that takes false discovery rate (FDR) into consideration for 1469 microarray gene expression datasets of A. thaliana. We estimated that A. thaliana contains 100 operon-like gene clusters in total. We predicted 34 statistically significant gene clusters consisting of 3 to 22 genes each, based on a stringent FDR threshold of 0.1. Functional relationships among genes in individual clusters were estimated by sequence similarity and functional annotation of genes. Duplicated gene pairs (determined based on BLAST with a cutoff of EOperon-like clusters tend to include genes encoding bio-machinery associated with ribosomes, the ubiquitin/proteasome system, secondary metabolic pathways, lipid and fatty-acid metabolism, and the lipid transfer system. Copyright © 2012 Elsevier B.V. All rights reserved.

  10. Mycobiota and identification of aflatoxin gene cluster in marketed spices in West Africa

    DEFF Research Database (Denmark)

    Gnonlonfin, G. J. B.; Adjovi, Y. C.; Tokpo, A. F.

    2013-01-01

    of Aspergillus were dominant on all marketed dried and milled spices irrespective of country. Gene characterization and amplification analysis showed that most of the Aspergillus flavus isolates possess the cluster genes for aflatoxin production. Aflatoxin B1 assessment by Thin Layer Chromatography showed...

  11. iBBiG: iterative binary bi-clustering of gene sets.

    Science.gov (United States)

    Gusenleitner, Daniel; Howe, Eleanor A; Bentink, Stefan; Quackenbush, John; Culhane, Aedín C

    2012-10-01

    Meta-analysis of genomics data seeks to identify genes associated with a biological phenotype across multiple datasets; however, merging data from different platforms by their features (genes) is challenging. Meta-analysis using functionally or biologically characterized gene sets simplifies data integration is biologically intuitive and is seen as having great potential, but is an emerging field with few established statistical methods. We transform gene expression profiles into binary gene set profiles by discretizing results of gene set enrichment analyses and apply a new iterative bi-clustering algorithm (iBBiG) to identify groups of gene sets that are coordinately associated with groups of phenotypes across multiple studies. iBBiG is optimized for meta-analysis of large numbers of diverse genomics data that may have unmatched samples. It does not require prior knowledge of the number or size of clusters. When applied to simulated data, it outperforms commonly used clustering methods, discovers overlapping clusters of diverse sizes and is robust in the presence of noise. We apply it to meta-analysis of breast cancer studies, where iBBiG extracted novel gene set-phenotype association that predicted tumor metastases within tumor subtypes. Implemented in the Bioconductor package iBBiG CONTACT: aedin@jimmy.harvard.edu.

  12. Nonlinear dimensionality reduction of gene expression data for visualization and clustering analysis of cancer tissue samples.

    Science.gov (United States)

    Shi, Jinlong; Luo, Zhigang

    2010-08-01

    Gene expression data are the representation of nonlinear interactions among genes and environmental factors. Computing analysis of these data is expected to gain knowledge of gene functions and disease mechanisms. Clustering is a classical exploratory technique of discovering similar expression patterns and function modules. However, gene expression data are usually of high dimensions and relatively small samples, which results in the main difficulty for the application of clustering algorithms. Principal component analysis (PCA) is usually used to reduce the data dimensions for further clustering analysis. While PCA estimates the similarity between expression profiles based on the Euclidean distance, which cannot reveal the nonlinear connections between genes. This paper uses nonlinear dimensionality reduction (NDR) as a preprocessing strategy for feature selection and visualization, and then applies clustering algorithms to the reduced feature spaces. In order to estimate the effectiveness of NDR for capturing biologically relevant structures, the comparative analysis between NDR and PCA is exploited to five real cancer expression datasets. Results show that NDR can perform better than PCA in visualization and clustering analysis of complex gene expression data. Copyright 2010 Elsevier Ltd. All rights reserved.

  13. Control of immune ligands by members of a cytomegalovirus gene expansion suppresses natural killer cell activation.

    Science.gov (United States)

    Fielding, Ceri A; Weekes, Michael P; Nobre, Luis V; Ruckova, Eva; Wilkie, Gavin S; Paulo, Joao A; Chang, Chiwen; Suárez, Nicolás M; Davies, James A; Antrobus, Robin; Stanton, Richard J; Aicheler, Rebecca J; Nichols, Hester; Vojtesek, Borek; Trowsdale, John; Davison, Andrew J; Gygi, Steven P; Tomasec, Peter; Lehner, Paul J; Wilkinson, Gavin W G

    2017-02-10

    The human cytomegalovirus (HCMV) US12 family consists of ten sequentially arranged genes (US12-21) with poorly characterized function. We now identify novel natural killer (NK) cell evasion functions for four members: US12, US14, US18 and US20. Using a systematic multiplexed proteomics approach to quantify ~1300 cell surface and ~7200 whole cell proteins, we demonstrate that the US12 family selectively targets plasma membrane proteins and plays key roles in regulating NK ligands, adhesion molecules and cytokine receptors. US18 and US20 work in concert to suppress cell surface expression of the critical NKp30 ligand B7-H6 thus inhibiting NK cell activation. The US12 family is therefore identified as a major new hub of immune regulation.

  14. Combining affinity propagation clustering and mutual information network to investigate key genes in fibroid.

    Science.gov (United States)

    Chen, Qian-Song; Wang, Dan; Liu, Bao-Lian; Gao, Shu-Feng; Gao, Dan-Li; Li, Gui-Rong

    2017-07-01

    The aim of the present study was to investigate key genes in fibroids based on the multiple affinity propogation-Krzanowski and Lai (mAP-KL) method, which included the maxT multiple hypothesis, Krzanowski and Lai (KL) cluster quality index, affinity propagation (AP) clustering algorithm and mutual information network (MIN) constructed by the context likelihood of relatedness (CLR) algorithm. In order to achieve this goal, mAP-KL was initially implemented to investigate exemplars in fibroid, and the maxT function was employed to rank the genes of training and test sets, and the top 200 genes were obtained for further study. In addition, the KL cluster index was applied to determine the quantity of clusters and the AP clustering algorithm was conducted to identify the clusters and their exemplars. Subsequently, the support vector machine (SVM) model was selected to evaluate the classification performance of mAP-KL. Finally, topological properties (degree, closeness, betweenness and transitivity) of exemplars in MIN constructed according to the CLR algorithm were assessed to investigate key genes in fibroid. The SVM model validated that the classification between normal controls and fibroid patients by mAP-KL had a good performance. A total of 9 clusters and exemplars were identified based on mAP-KL, which were comprised of CALCOCO2 , COL4A2 , COPS8 , SNCG , PA2G4 , C17orf70 , MARK3 , BTNL3 and TBC1D13 . By accessing the topological analysis for exemplars in MIN, SNCG and COL4A2 were identified as the two most significant genes of four types of methods, and they were denoted as key genes in the progress of fibroid. In conclusion, two key genes ( SNCG and COL4A2 ) and 9 exemplars were successfully investigated, and these may be potential biomarkers for the detection and treatment of fibroid.

  15. Structural variation of the ribosomal gene cluster within the class Insecta

    Energy Technology Data Exchange (ETDEWEB)

    Mukha, D.V.; Sidorenko, A.P.; Lazebnaya, I.V. [Vavilov Institute of General Genetics, Moscow (Russian Federation)] [and others

    1995-09-01

    General estimation of ribosomal DNA variation within the class Insecta is presented. It is shown that, using blot-hybridization, one can detect differences in the structure of the ribosomal gene cluster not only between genera within an order, but also between species within a genera, including sibling species. Structure of the ribosomal gene cluster of the Coccinellidae family (ladybirds) is analyzed. It is shown that cloned highly conservative regions of ribosomal DNA of Tetrahymena pyriformis can be used as probes for analyzing ribosomal genes in insects. 24 refs., 4 figs.

  16. Identification and manipulation of the pleuromutilin gene cluster from Clitopilus passeckerianus for increased rapid antibiotic production

    Science.gov (United States)

    Bailey, Andy M.; Alberti, Fabrizio; Kilaru, Sreedhar; Collins, Catherine M.; de Mattos-Shipley, Kate; Hartley, Amanda J.; Hayes, Patrick; Griffin, Alison; Lazarus, Colin M.; Cox, Russell J.; Willis, Christine L.; O'Dwyer, Karen; Spence, David W.; Foster, Gary D.

    2016-05-01

    Semi-synthetic derivatives of the tricyclic diterpene antibiotic pleuromutilin from the basidiomycete Clitopilus passeckerianus are important in combatting bacterial infections in human and veterinary medicine. These compounds belong to the only new class of antibiotics for human applications, with novel mode of action and lack of cross-resistance, representing a class with great potential. Basidiomycete fungi, being dikaryotic, are not generally amenable to strain improvement. We report identification of the seven-gene pleuromutilin gene cluster and verify that using various targeted approaches aimed at increasing antibiotic production in C. passeckerianus, no improvement in yield was achieved. The seven-gene pleuromutilin cluster was reconstructed within Aspergillus oryzae giving production of pleuromutilin in an ascomycete, with a significant increase (2106%) in production. This is the first gene cluster from a basidiomycete to be successfully expressed in an ascomycete, and paves the way for the exploitation of a metabolically rich but traditionally overlooked group of fungi.

  17. An Approach to Cluster EU Member States into Groups According to Pathways of Salmonella in the Farm‐to‐Consumption Chain for Pork Products

    DEFF Research Database (Denmark)

    Vigre, Håkan; Coutinho Calado Domingues, Ana Rita; Pedersen, Ulrik Bo

    2016-01-01

    The aim of the project as the cluster analysis was to in part to develop a generic structured quantitative microbiological risk assessment (QMRA) model of human salmonellosis due to pork consumption in EU member states (MSs), and the objective of the cluster analysis was to group the EU MSs...... according to the relative contribution of different pathways of Salmonella in the farm‐to‐consumption chain of pork products. In the development of the model, by selecting a case study MS from each cluster the model was developed to represent different aspects of pig production, pork production...... production and consumption in each country. Based on MS‐specific information about distribution of (i) small and large farms, (ii) small and large slaughterhouses, (iii) amount of pork meat consumed, and (iv) amount of sausages consumed we used nonhierarchical and hierarchical cluster analysis to group...

  18. Fine Mapping of Two Wheat Powdery Mildew Resistance Genes Located at the Pm1 Cluster

    Directory of Open Access Journals (Sweden)

    Junchao Liang

    2016-07-01

    Full Text Available Powdery mildew caused by (DC. f. sp. ( is a globally devastating foliar disease of wheat ( L.. More than a dozen genes against this disease, identified from wheat germplasms of different ploidy levels, have been mapped to the region surrounding the locus on the long arm of chromosome 7A, which forms a resistance (-gene cluster. and from einkorn wheat ( L. were two of the genes belonging to this cluster. This study was initiated to fine map these two genes toward map-based cloning. Comparative genomics study showed that macrocolinearity exists between L. chromosome 1 (Bd1 and the – region, which allowed us to develop markers based on the wheat sequences orthologous to genes contained in the Bd1 region. With these and other newly developed and published markers, high-resolution maps were constructed for both and using large F populations. Moreover, a physical map of was constructed through chromosome walking with bacterial artificial chromosome (BAC clones and comparative mapping. Eventually, and were restricted to a 0.12- and 0.86-cM interval, respectively. Based on the closely linked common markers, , , and (another powdery mildew resistance gene in the cluster were not allelic to one another. Severe recombination suppression and disruption of synteny were noted in the region encompassing . These results provided useful information for map-based cloning of the genes in the cluster and interpretation of their evolution.

  19. Discovery of Unusual Biaryl Polyketides by Activation of a Silent Streptomyces venezuelae Biosynthetic Gene Cluster.

    Science.gov (United States)

    Thanapipatsiri, Anyarat; Gomez-Escribano, Juan Pablo; Song, Lijiang; Bibb, Maureen J; Al-Bassam, Mahmoud; Chandra, Govind; Thamchaipenet, Arinthip; Challis, Gregory L; Bibb, Mervyn J

    2016-11-17

    Comparative transcriptional profiling of a ΔbldM mutant of Streptomyces venezuelae with its unmodified progenitor revealed that the expression of a cryptic biosynthetic gene cluster containing both type I and type III polyketide synthase genes is activated in the mutant. The 29.5 kb gene cluster, which was predicted to encode an unusual biaryl metabolite, which we named venemycin, and potentially halogenated derivatives, contains 16 genes including one-vemR-that encodes a transcriptional activator of the large ATP-binding LuxR-like (LAL) family. Constitutive expression of vemR in the ΔbldM mutant led to the production of sufficient venemycin for structural characterisation, confirming its unusual biaryl structure. Co-expression of the venemycin biosynthetic gene cluster and vemR in the heterologous host Streptomyces coelicolor also resulted in venemycin production. Although the gene cluster encodes two halogenases and a flavin reductase, constitutive expression of all three genes led to the accumulation only of a monohalogenated venemycin derivative, both in the native producer and the heterologous host. A competition experiment in which equimolar quantities of sodium chloride and sodium bromide were fed to the venemycin-producing strains resulted in the preferential incorporation of bromine, thus suggesting that bromide is the preferred substrate for one or both halogenases. © 2016 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA.

  20. A CLUSTERING OF DJA STOCKS - THE APPLICATION IN FINANCE OF A METHOD FIRST USED IN GENE TRAJECTORY STUDY

    Directory of Open Access Journals (Sweden)

    Silaghi Gheorghe Cosmin

    2009-05-01

    Full Text Available Previously we employed the Gene Trajectory Clustering methodology to search for different associations of the stocks composing the DJA index, with the aim of finding different, logic clusters, supported by economic reasons, preferably different than the

  1. Evolutionary study of vertebrate and invertebrate members of the dystrophin and utrophin gene family

    Energy Technology Data Exchange (ETDEWEB)

    Roberts, R.G.; Nicholson, L.; Bobrow, M. [Paediatric Research Unit, London (United Kingdom)] [and others

    1994-09-01

    Vertebrates express two members of the dystrophin gene family. The prototype, dystrophin, is expressed in muscle and neural tissue, and is defective in the human disorders Duchenne and Becker muscular dystrophy (DMD, BMD). The dystrophin homologue utrophin is more generally expressed but has not yet been associated with a genetic disorder. The function of neither protein is clear. A comparison of human utrophin with the known dystrophins (human, mouse, chicken, Torpedo) suggests that dystrophin and utrophin diverged before the vertebrate radiation. We have used reverse-transcript PCR (RT-PCR) directed by degenerate primers to characterize dystrophin and utrophin transcripts from a range of vertebrate and invertebrate animals. Our results suggest that the duplication leading to distinct dystrophin and utrophin genes occurred close to the point of divergence of urochordates from the cephalochordate-vertebrate lineage. This divergence may have occurred to fulfill a novel role which arose at this point, or may reflect a need for separate regulation of the neuromuscular and other functions of the ancient dystrophin. Our data include sequences of the first non-human utrophins to be characterized, and show these to be substantially more divergent than their cognate dystrophins. In addition, our results provide a large body of information regarding the tolerance of amino acid positions in the cysteine-rich and C-terminal domains to substitution. This will aid the interpretations of DMD and BMD missense mutations in these regions.

  2. MS/MS networking guided analysis of molecule and gene cluster families

    Science.gov (United States)

    Nguyen, Don Duy; Wu, Cheng-Hsuan; Moree, Wilna J.; Lamsa, Anne; Medema, Marnix H.; Zhao, Xiling; Gavilan, Ronnie G.; Aparicio, Marystella; Atencio, Librada; Jackson, Chanaye; Ballesteros, Javier; Sanchez, Joel; Watrous, Jeramie D.; Phelan, Vanessa V.; van de Wiel, Corine; Kersten, Roland D.; Mehnaz, Samina; De Mot, René; Shank, Elizabeth A.; Charusanti, Pep; Nagarajan, Harish; Duggan, Brendan M.; Moore, Bradley S.; Bandeira, Nuno; Palsson, Bernhard Ø.; Pogliano, Kit; Gutiérrez, Marcelino; Dorrestein, Pieter C.

    2013-01-01

    The ability to correlate the production of specialized metabolites to the genetic capacity of the organism that produces such molecules has become an invaluable tool in aiding the discovery of biotechnologically applicable molecules. Here, we accomplish this task by matching molecular families with gene cluster families, making these correlations to 60 microbes at one time instead of connecting one molecule to one organism at a time, such as how it is traditionally done. We can correlate these families through the use of nanospray desorption electrospray ionization MS/MS, an ambient pressure MS technique, in conjunction with MS/MS networking and peptidogenomics. We matched the molecular families of peptide natural products produced by 42 bacilli and 18 pseudomonads through the generation of amino acid sequence tags from MS/MS data of specific clusters found in the MS/MS network. These sequence tags were then linked to biosynthetic gene clusters in publicly accessible genomes, providing us with the ability to link particular molecules with the genes that produced them. As an example of its use, this approach was applied to two unsequenced Pseudoalteromonas species, leading to the discovery of the gene cluster for a molecular family, the bromoalterochromides, in the previously sequenced strain P. piscicida JCM 20779T. The approach itself is not limited to 60 related strains, because spectral networking can be readily adopted to look at molecular family–gene cluster families of hundreds or more diverse organisms in one single MS/MS network. PMID:23798442

  3. A genomics based discovery of secondary metabolite biosynthetic gene clusters in Aspergillus ustus.

    Directory of Open Access Journals (Sweden)

    Borui Pi

    Full Text Available Secondary metabolites (SMs produced by Aspergillus have been extensively studied for their crucial roles in human health, medicine and industrial production. However, the resulting information is almost exclusively derived from a few model organisms, including A. nidulans and A. fumigatus, but little is known about rare pathogens. In this study, we performed a genomics based discovery of SM biosynthetic gene clusters in Aspergillus ustus, a rare human pathogen. A total of 52 gene clusters were identified in the draft genome of A. ustus 3.3904, such as the sterigmatocystin biosynthesis pathway that was commonly found in Aspergillus species. In addition, several SM biosynthetic gene clusters were firstly identified in Aspergillus that were possibly acquired by horizontal gene transfer, including the vrt cluster that is responsible for viridicatumtoxin production. Comparative genomics revealed that A. ustus shared the largest number of SM biosynthetic gene clusters with A. nidulans, but much fewer with other Aspergilli like A. niger and A. oryzae. These findings would help to understand the diversity and evolution of SM biosynthesis pathways in genus Aspergillus, and we hope they will also promote the development of fungal identification methodology in clinic.

  4. Evolution of the bovine TLR gene family and member associations with Mycobacterium avium subspecies paratuberculosis infection.

    Directory of Open Access Journals (Sweden)

    Colleen A Fisher

    Full Text Available Members of the Toll-like receptor (TLR gene family occupy key roles in the mammalian innate immune system by functioning as sentries for the detection of invading pathogens, thereafter provoking host innate immune responses. We utilized a custom next-generation sequencing approach and allele-specific genotyping assays to detect and validate 280 biallelic variants across all 10 bovine TLR genes, including 71 nonsynonymous single nucleotide polymorphisms (SNPs and one putative nonsense SNP. Bayesian haplotype reconstructions and median joining networks revealed haplotype sharing between Bos taurus taurus and Bos taurus indicus breeds at every locus, and specialized beef and dairy breeds could not be differentiated despite an average polymorphism density of 1 marker/158 bp. Collectively, 160 tagSNPs and two tag insertion-deletion mutations (indels were sufficient to predict 100% of the variation at 280 variable sites for both Bos subspecies and their hybrids, whereas 118 tagSNPs and 1 tagIndel predictively captured 100% of the variation at 235 variable sites for B. t. taurus. Polyphen and SIFT analyses of amino acid (AA replacements encoded by bovine TLR SNPs indicated that up to 32% of the AA substitutions were expected to impact protein function. Classical and newly developed tests of diversity provide strong support for balancing selection operating on TLR3 and TLR8, and purifying selection acting on TLR10. An investigation of the persistence and continuity of linkage disequilibrium (r2≥0.50 between adjacent variable sites also supported the presence of selection acting on TLR3 and TLR8. A case-control study employing validated variants from bovine TLR genes recognizing bacterial ligands revealed six SNPs potentially eliciting small effects on susceptibility to Mycobacterium avium spp paratuberculosis infection in dairy cattle. The results of this study will broadly impact domestic cattle research by providing the necessary foundation to

  5. A Metabolic Gene Cluster in the Wheat W1 and the Barley Cer-cqu Loci Determines β-Diketone Biosynthesis and Glaucousness.

    Science.gov (United States)

    Hen-Avivi, Shelly; Savin, Orna; Racovita, Radu C; Lee, Wing-Sham; Adamski, Nikolai M; Malitsky, Sergey; Almekias-Siegl, Efrat; Levy, Matan; Vautrin, Sonia; Bergès, Hélène; Friedlander, Gilgi; Kartvelishvily, Elena; Ben-Zvi, Gil; Alkan, Noam; Uauy, Cristobal; Kanyuka, Kostya; Jetter, Reinhard; Distelfeld, Assaf; Aharoni, Asaph

    2016-06-01

    The glaucous appearance of wheat (Triticum aestivum) and barley (Hordeum vulgare) plants, that is the light bluish-gray look of flag leaf, stem, and spike surfaces, results from deposition of cuticular β-diketone wax on their surfaces; this phenotype is associated with high yield, especially under drought conditions. Despite extensive genetic and biochemical characterization, the molecular genetic basis underlying the biosynthesis of β-diketones remains unclear. Here, we discovered that the wheat W1 locus contains a metabolic gene cluster mediating β-diketone biosynthesis. The cluster comprises genes encoding proteins of several families including type-III polyketide synthases, hydrolases, and cytochrome P450s related to known fatty acid hydroxylases. The cluster region was identified in both genetic and physical maps of glaucous and glossy tetraploid wheat, demonstrating entirely different haplotypes in these accessions. Complementary evidence obtained through gene silencing in planta and heterologous expression in bacteria supports a model for a β-diketone biosynthesis pathway involving members of these three protein families. Mutations in homologous genes were identified in the barley eceriferum mutants defective in β-diketone biosynthesis, demonstrating a gene cluster also in the β-diketone biosynthesis Cer-cqu locus in barley. Hence, our findings open new opportunities to breed major cereal crops for surface features that impact yield and stress response. © 2016 American Society of Plant Biologists. All rights reserved.

  6. Regulation of Three Nitrogenase Gene Clusters in the Cyanobacterium Anabaena variabilis ATCC 29413

    Directory of Open Access Journals (Sweden)

    Teresa Thiel

    2014-12-01

    Full Text Available The filamentous cyanobacterium Anabaena variabilis ATCC 29413 fixes nitrogen under aerobic conditions in specialized cells called heterocysts that form in response to an environmental deficiency in combined nitrogen. Nitrogen fixation is mediated by the enzyme nitrogenase, which is very sensitive to oxygen. Heterocysts are microxic cells that allow nitrogenase to function in a filament comprised primarily of vegetative cells that produce oxygen by photosynthesis. A. variabilis is unique among well-characterized cyanobacteria in that it has three nitrogenase gene clusters that encode different nitrogenases, which function under different environmental conditions. The nif1 genes encode a Mo-nitrogenase that functions only in heterocysts, even in filaments grown anaerobically. The nif2 genes encode a different Mo-nitrogenase that functions in vegetative cells, but only in filaments grown under anoxic conditions. An alternative V-nitrogenase is encoded by vnf genes that are expressed only in heterocysts in an environment that is deficient in Mo. Thus, these three nitrogenases are expressed differentially in response to environmental conditions. The entire nif1 gene cluster, comprising at least 15 genes, is primarily under the control of the promoter for the first gene, nifB1. Transcriptional control of many of the downstream nif1 genes occurs by a combination of weak promoters within the coding regions of some downstream genes and by RNA processing, which is associated with increased transcript stability. The vnf genes show a similar pattern of transcriptional and post-transcriptional control of expression suggesting that the complex pattern of regulation of the nif1 cluster is conserved in other cyanobacterial nitrogenase gene clusters.

  7. Identification of biosynthetic gene clusters from metagenomic libraries using PPTase complementation in a Streptomyces host.

    Science.gov (United States)

    Bitok, J Kipchirchir; Lemetre, Christophe; Ternei, Melinda A; Brady, Sean F

    2017-09-01

    The majority of environmental bacteria are not readily cultured in the lab, leaving the natural products they make inaccessible using culture-dependent discovery methods. Cloning and heterologous expression of DNA extracted from environmental samples (environmental DNA, eDNA) provides a means of circumventing this discovery bottleneck. To facilitate the identification of clones containing biosynthetic gene clusters, we developed a model heterologous expression reporter strain Streptomyces albus::bpsA ΔPPTase. This strain carries a 4΄-phosphopantetheinyl transferase (PPTase)-dependent blue pigment synthase A gene, bpsA, in a PPTase deletion background. eDNA clones that express a functional PPTase restore production of the blue pigment, indigoidine. As PPTase genes often occur in biosynthetic gene clusters (BGCs), indigoidine production can be used to identify eDNA clones containing BGCs. We screened a soil eDNA library hosted in S. albus::bpsA ΔPPTase and identified clones containing non-ribosomal peptide synthetase (NRPS), polyketide synthase (PKS) and mixed NRPS/PKS biosynthetic gene clusters. One NRPS gene cluster was shown to confer the production of myxochelin A to S. albus::bpsA ΔPPTase. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  8. Methods for simultaneously identifying coherent local clusters with smooth global patterns in gene expression profiles

    Directory of Open Access Journals (Sweden)

    Lee Yun-Shien

    2008-03-01

    Full Text Available Abstract Background The hierarchical clustering tree (HCT with a dendrogram 1 and the singular value decomposition (SVD with a dimension-reduced representative map 2 are popular methods for two-way sorting the gene-by-array matrix map employed in gene expression profiling. While HCT dendrograms tend to optimize local coherent clustering patterns, SVD leading eigenvectors usually identify better global grouping and transitional structures. Results This study proposes a flipping mechanism for a conventional agglomerative HCT using a rank-two ellipse (R2E, an improved SVD algorithm for sorting purpose seriation by Chen 3 as an external reference. While HCTs always produce permutations with good local behaviour, the rank-two ellipse seriation gives the best global grouping patterns and smooth transitional trends. The resulting algorithm automatically integrates the desirable properties of each method so that users have access to a clustering and visualization environment for gene expression profiles that preserves coherent local clusters and identifies global grouping trends. Conclusion We demonstrate, through four examples, that the proposed method not only possesses better numerical and statistical properties, it also provides more meaningful biomedical insights than other sorting algorithms. We suggest that sorted proximity matrices for genes and arrays, in addition to the gene-by-array expression matrix, can greatly aid in the search for comprehensive understanding of gene expression structures. Software for the proposed methods can be obtained at http://gap.stat.sinica.edu.tw/Software/GAP.

  9. Sequencing and transcriptional analysis of the biosynthesis gene cluster of abscisic acid-producing Botrytis cinerea.

    Science.gov (United States)

    Gong, Tao; Shu, Dan; Yang, Jie; Ding, Zhong-Tao; Tan, Hong

    2014-09-29

    Botrytis cinerea is a model species with great importance as a pathogen of plants and has become used for biotechnological production of ABA. The ABA cluster of B. cinerea is composed of an open reading frame without significant similarities (bcaba3), followed by the genes (bcaba1 and bcaba2) encoding P450 monooxygenases and a gene probably coding for a short-chain dehydrogenase/reductase (bcaba4). In B. cinerea ATCC58025, targeted inactivation of the genes in the cluster suggested at least three genes responsible for the hydroxylation at carbon atom C-1' and C-4' or oxidation at C-4' of ABA. Our group has identified an ABA-overproducing strain, B. cinerea TB-3-H8. To differentiate TB-3-H8 from other B. cinerea strains with the functional ABA cluster, the DNA sequence of the 12.11-kb region containing the cluster of B. cinerea TB-3-H8 was determined. Full-length cDNAs were also isolated for bcaba1, bcaba2, bcaba3 and bcaba4 from B. cinerea TB-3-H8. Sequence comparison of the four genes and their flanking regions respectively derived from B. cinerea TB-3-H8, B05.10 and T4 revealed that major variations were located in intergenic sequences. In B. cinerea TB-3-H8, the expression profiles of the four function genes under ABA high-yield conditions were also analyzed by real-time PCR.

  10. Hessian regularization based non-negative matrix factorization for gene expression data clustering.

    Science.gov (United States)

    Liu, Xiao; Shi, Jun; Wang, Congzhi

    2015-01-01

    Since a key step in the analysis of gene expression data is to detect groups of genes that have similar expression patterns, clustering technique is then commonly used to analyze gene expression data. Data representation plays an important role in clustering analysis. The non-negative matrix factorization (NMF) is a widely used data representation method with great success in machine learning. Although the traditional manifold regularization method, Laplacian regularization (LR), can improve the performance of NMF, LR still suffers from the problem of its weak extrapolating power. Hessian regularization (HR) is a newly developed manifold regularization method, whose natural properties make it more extrapolating, especially for small sample data. In this work, we propose the HR-based NMF (HR-NMF) algorithm, and then apply it to represent gene expression data for further clustering task. The clustering experiments are conducted on five commonly used gene datasets, and the results indicate that the proposed HR-NMF outperforms LR-based NMM and original NMF, which suggests the potential application of HR-NMF for gene expression data.

  11. Members of the barley NAC transcription factor gene family show differential co-regulation with senescence-associated genes during senescence of flag leaves

    DEFF Research Database (Denmark)

    Christiansen, Michael W; Gregersen, Per L.

    2014-01-01

    activation of degradation processes in senescing barley leaf tissues. This picture was confirmed in a detailed quantitative reverse transcription–PCR (qRT–PCR) experiment, which also showed distinct gene expression patterns for different members of the NAC gene family, suggesting a group of ~15 out of the 47...... in the NAC transcription factor family during senescence of barley flag leaves was studied. Several members of the NAC transcription factor gene family were up-regulated during senescence in a microarray experiment, together with a large range of senescence-associated genes, reflecting the coordinated...... studied NAC genes to be important for signalling processes and for the execution of degradation processes during leaf senescence in barley. Seven models for DNA-binding motifs for NAC transcription factors were designed based on published motifs, and available promoter sequences of barley genes were...

  12. plantiSMASH: automated identification, annotation and expression analysis of plant biosynthetic gene clusters

    DEFF Research Database (Denmark)

    Kautsar, Satria A.; Suarez Duran, Hernando G.; Blin, Kai

    2017-01-01

    in specific genomic loci: biosynthetic gene clusters (BGCs). Here, we introduce plantiSMASH, a versatile online analysis platform that automates the identification of candidate plant BGCs. Moreover, it allows integration of transcriptomic data to prioritize candidate BGCs based on the coexpression patterns......Plant specialized metabolites are chemically highly diverse, play key roles in host-microbe interactions, have important nutritional value in crops and are frequently applied as medicines. It has recently become clear that plant biosynthetic pathway-encoding genes are sometimes densely clustered...... of predicted biosynthetic enzyme-coding genes, and facilitates comparative genomic analysis to study the evolutionary conservation of each cluster. Applied on 48 high-quality plant genomes, plantiSMASH identifies a rich diversity of candidate plant BGCs. These results will guide further experimental...

  13. The CHRNA5-A3-B4 Gene Cluster and Smoking: From Discovery to Therapeutics.

    Science.gov (United States)

    Lassi, Glenda; Taylor, Amy E; Timpson, Nicholas J; Kenny, Paul J; Mather, Robert J; Eisen, Tim; Munafò, Marcus R

    2016-12-01

    Genome-wide association studies (GWASs) have identified associations between the CHRNA5-CHRNA3-CHRNB4 gene cluster and smoking heaviness and nicotine dependence. Studies in rodents have described the anatomical localisation and function of the nicotinic acetylcholine receptors (nAChRs) formed by the subunits encoded by this gene cluster. Further investigations that complemented these studies highlighted the variability of individuals' smoking behaviours and their ability to adjust nicotine intake. GWASs of smoking-related health outcomes have also identified this signal in the CHRNA5-CHRNA3-CHRNB4 gene cluster. This insight underpins approaches to strengthen causal inference in observational data. Combining genetic and mechanistic studies of nicotine dependence and smoking heaviness may reveal novel targets for medication development. Validated targets can inform genetic therapeutic interventions for smoking cessation and tobacco-related diseases. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.

  14. Clustering Time-Series Gene Expression Data Using Smoothing Spline Derivatives

    Directory of Open Access Journals (Sweden)

    Martin PGP

    2007-01-01

    Full Text Available Microarray data acquired during time-course experiments allow the temporal variations in gene expression to be monitored. An original postprandial fasting experiment was conducted in the mouse and the expression of 200 genes was monitored with a dedicated macroarray at 11 time points between 0 and 72 hours of fasting. The aim of this study was to provide a relevant clustering of gene expression temporal profiles. This was achieved by focusing on the shapes of the curves rather than on the absolute level of expression. Actually, we combined spline smoothing and first derivative computation with hierarchical and partitioning clustering. A heuristic approach was proposed to tune the spline smoothing parameter using both statistical and biological considerations. Clusters are illustrated a posteriori through principal component analysis and heatmap visualization. Most results were found to be in agreement with the literature on the effects of fasting on the mouse liver and provide promising directions for future biological investigations.

  15. Clustering Time-Series Gene Expression Data Using Smoothing Spline Derivatives

    Directory of Open Access Journals (Sweden)

    S. Déjean

    2007-06-01

    Full Text Available Microarray data acquired during time-course experiments allow the temporal variations in gene expression to be monitored. An original postprandial fasting experiment was conducted in the mouse and the expression of 200 genes was monitored with a dedicated macroarray at 11 time points between 0 and 72 hours of fasting. The aim of this study was to provide a relevant clustering of gene expression temporal profiles. This was achieved by focusing on the shapes of the curves rather than on the absolute level of expression. Actually, we combined spline smoothing and first derivative computation with hierarchical and partitioning clustering. A heuristic approach was proposed to tune the spline smoothing parameter using both statistical and biological considerations. Clusters are illustrated a posteriori through principal component analysis and heatmap visualization. Most results were found to be in agreement with the literature on the effects of fasting on the mouse liver and provide promising directions for future biological investigations.

  16. Form gene clustering method about pan-ethnic-group products based on emotional semantic

    Science.gov (United States)

    Chen, Dengkai; Ding, Jingjing; Gao, Minzhuo; Ma, Danping; Liu, Donghui

    2016-09-01

    The use of pan-ethnic-group products form knowledge primarily depends on a designer's subjective experience without user participation. The majority of studies primarily focus on the detection of the perceptual demands of consumers from the target product category. A pan-ethnic-group products form gene clustering method based on emotional semantic is constructed. Consumers' perceptual images of the pan-ethnic-group products are obtained by means of product form gene extraction and coding and computer aided product form clustering technology. A case of form gene clustering about the typical pan-ethnic-group products is investigated which indicates that the method is feasible. This paper opens up a new direction for the future development of product form design which improves the agility of product design process in the era of Industry 4.0.

  17. Evaluation of gene-expression clustering via mutual information distance measure

    Directory of Open Access Journals (Sweden)

    Maimon Oded

    2007-03-01

    Full Text Available Abstract Background The definition of a distance measure plays a key role in the evaluation of different clustering solutions of gene expression profiles. In this empirical study we compare different clustering solutions when using the Mutual Information (MI measure versus the use of the well known Euclidean distance and Pearson correlation coefficient. Results Relying on several public gene expression datasets, we evaluate the homogeneity and separation scores of different clustering solutions. It was found that the use of the MI measure yields a more significant differentiation among erroneous clustering solutions. The proposed measure was also used to analyze the performance of several known clustering algorithms. A comparative study of these algorithms reveals that their "best solutions" are ranked almost oppositely when using different distance measures, despite the found correspondence between these measures when analysing the averaged scores of groups of solutions. Conclusion In view of the results, further attention should be paid to the selection of a proper distance measure for analyzing the clustering of gene expression data.

  18. Cloning of the staurosporine biosynthetic gene cluster from Streptomyces sp. TP-A0274 and its heterologous expression in Streptomyces lividans.

    Science.gov (United States)

    Onaka, Hiroyasu; Taniguchi, Shin-ichi; Igarashi, Yasuhiro; Furumai, Tamotsu

    2002-12-01

    Staurosporine is a representative member of indolocarbazole antibiotics. The entire staurosporine biosynthetic and regulatory gene cluster spanning 20-kb was cloned from Streptomyces sp. TP-A0274 and sequenced. The gene cluster consists of 14 ORFs and the amino acid sequence homology search revealed that it contains three genes, staO, staD, and staP, coding for the enzymes involved in the indolocarbazole aglycone biosynthesis, two genes, staG and staN, for the bond formation between the aglycone and deoxysugar, eight genes, staA, staB, staE, staJ, staI, staK, staMA, and staMB, for the deoxysugar biosynthesis and one gene, staR is a transcriptional regulator. Heterologous gene expression of a 38-kb fragment containing a complete set of the biosynthetic genes for staurosporine cloned into pTOYAMAcos confirmed its role in staurosporine biosynthesis. Moreover, the distribution of the gene for chromopyrrolic acid synthase, the key enzyme for the biosynthesis of indolocarbazole aglycone, in actinomycetes was investigated, and rebD homologs were shown to exist only in the strains producing indolocarbazole antibiotics.

  19. Gene microarray data analysis using parallel point-symmetry-based clustering.

    Science.gov (United States)

    Sarkar, Anasua; Maulik, Ujjwal

    2015-01-01

    Identification of co-expressed genes is the central goal in microarray gene expression analysis. Point-symmetry-based clustering is an important unsupervised learning technique for recognising symmetrical convex- or non-convex-shaped clusters. To enable fast clustering of large microarray data, we propose a distributed time-efficient scalable approach for point-symmetry-based K-Means algorithm. A natural basis for analysing gene expression data using symmetry-based algorithm is to group together genes with similar symmetrical expression patterns. This new parallel implementation also satisfies linear speedup in timing without sacrificing the quality of clustering solution on large microarray data sets. The parallel point-symmetry-based K-Means algorithm is compared with another new parallel symmetry-based K-Means and existing parallel K-Means over eight artificial and benchmark microarray data sets, to demonstrate its superiority, in both timing and validity. The statistical analysis is also performed to establish the significance of this message-passing-interface based point-symmetry K-Means implementation. We also analysed the biological relevance of clustering solutions.

  20. Two transitional type Ia supernovae located in the Fornax cluster member NGC 1404: SN 2007on and SN 2011iv

    Science.gov (United States)

    Gall, C.; Stritzinger, M. D.; Ashall, C.; Baron, E.; Burns, C. R.; Hoeflich, P.; Hsiao, E. Y.; Mazzali, P. A.; Phillips, M. M.; Filippenko, A. V.; Anderson, J. P.; Benetti, S.; Brown, P. J.; Campillay, A.; Challis, P.; Contreras, C.; Elias de la Rosa, N.; Folatelli, G.; Foley, R. J.; Fraser, M.; Holmbo, S.; Marion, G. H.; Morrell, N.; Pan, Y.-C.; Pignata, G.; Suntzeff, N. B.; Taddia, F.; Robledo, S. Torres; Valenti, S.

    2018-03-01

    We present an analysis of ultraviolet (UV) to near-infrared observations of the fast-declining Type Ia supernovae (SNe Ia) 2007on and 2011iv, hosted by the Fornax cluster member NGC 1404. The B-band light curves of SN 2007on and SN 2011iv are characterised by Δm15 (B) decline-rate values of 1.96 mag and 1.77 mag, respectively. Although they have similar decline rates, their peak B- and H-band magnitudes differ by 0.60 mag and 0.35 mag, respectively. After correcting for the luminosity vs. decline rate and the luminosity vs. colour relations, the peak B-band and H-band light curves provide distances that differ by 14% and 9%, respectively. These findings serve as a cautionary tale for the use of transitional SNe Ia located in early-type hosts in the quest to measure cosmological parameters. Interestingly, even though SN 2011iv is brighter and bluer at early times, by three weeks past maximum and extending over several months, its B - V colour is 0.12 mag redder than that of SN 2007on. To reconcile this unusual behaviour, we turn to guidance from a suite of spherical one-dimensional Chandrasekhar-mass delayed-detonation explosion models. In this context, 56Ni production depends on both the so-called transition density and the central density of the progenitor white dwarf. To first order, the transition density drives the luminosity-width relation, while the central density is an important second-order parameter. Within this context, the differences in the B - V colour evolution along the Lira regime suggest that the progenitor of SN 2011iv had a higher central density than SN 2007on. The photometry tables are only available at the CDS via anonymous ftp to http://cdsarc.u-strasbg.fr (http://130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/611/A58

  1. The rhizoferrin biosynthetic gene in the fungal pathogen Rhizopus delemar is a novel member of the NIS gene family.

    Science.gov (United States)

    Carroll, Cassandra S; Grieve, Clark L; Murugathasan, Indu; Bennet, Andrew J; Czekster, Clarissa M; Liu, Huanting; Naismith, James; Moore, Margo M

    2017-08-01

    Iron is essential for growth and in low iron environments such as serum many bacteria and fungi secrete ferric iron-chelating molecules called siderophores. All fungi produce hydroxamate siderophores with the exception of Mucorales fungi, which secrete rhizoferrin, a polycarboxylate siderophore. Here we investigated the biosynthesis of rhizoferrin by the opportunistic human pathogen, Rhizopus delemar. We searched the genome of R. delemar 99-880 for a homologue of the bacterial NRPS-independent siderophore (NIS) protein, SfnaD, that is involved in biosynthesis of staphyloferrin A in Staphylococcus aureus. A protein was identified in R. delemar with 22% identity and 37% similarity with SfnaD, containing an N-terminal IucA/IucC family domain, and a C-terminal conserved ferric iron reductase FhuF-like transporter domain. Expression of the putative fungal rhizoferrin synthetase (rfs) gene was repressed by iron. The rfs gene was cloned and expressed in E.coli and siderophore biosynthesis from citrate and diaminobutane was confirmed using high resolution LC-MS. Substrate specificity was investigated showing that Rfs produced AMP when oxaloacetic acid, tricarballylic acid, ornithine, hydroxylamine, diaminopentane and diaminopropane were employed as substrates. Based on the production of AMP and the presence of a mono-substituted rhizoferrin, we suggest that Rfs is a member of the superfamily of adenylating enzymes. We used site-directed mutagenesis to mutate selected conserved residues predicted to be in the Rfs active site. These studies revealed that H484 is essential for Rfs activity and L544 may play a role in amine recognition by the enzyme. This study on Rfs is the first characterization of a fungal NIS enzyme. Future work will determine if rhizoferrin biosynthesis is required for virulence in Mucorales fungi. Copyright © 2017 Elsevier Ltd. All rights reserved.

  2. CLASH-VLT: DISSECTING THE FRONTIER FIELDS GALAXY CLUSTER MACS J0416.1-2403 WITH ∼800 SPECTRA OF MEMBER GALAXIES

    Energy Technology Data Exchange (ETDEWEB)

    Balestra, I.; Sartoris, B.; Girardi, M.; Nonino, M.; Biviano, A. [INAF—Osservatorio Astronomico di Trieste, via G. B. Tiepolo 11, I-34131, Trieste (Italy); Mercurio, A. [INAF—Osservatorio Astronomico di Capodimonte, Via Moiariello 16, I-80131 Napoli (Italy); Grillo, C. [Dark Cosmology Centre, Niels Bohr Institute, University of Copenhagen, Juliane Maries Vej 30, DK-2100 Copenhagen (Denmark); Rosati, P. [Dipartimento di Fisica e Scienze della Terra, Università di Ferrara, Via Saragat 1, I-44122, Ferrara (Italy); Ettori, S.; Vanzella, E. [INAF—Osservatorio Astronomico di Bologna, Via Ranzani 1, I-40127 Bologna (Italy); Forman, W.; Jones, C.; Ogrean, G. A.; Weeren, R. J. van [Harvard-Smithsonian Center for Astrophysics, 60 Garden Street, Cambridge, MA 02138 (United States); Koekemoer, A. [Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21208 (United States); Medezinski, E.; Zitrin, A. [Cahill Center for Astronomy and Astrophysics, California Institute of Technology, MS 249-17, Pasadena, CA 91125 (United States); Merten, J. [Department of Physics, University of Oxford, Keble Road, Oxford OX1 3RH (United Kingdom); Tozzi, P. [INAF—Osservatorio Astrofisico di Arcetri, Largo E. Fermi 5, I-50125, Firenze (Italy); Umetsu, K., E-mail: balestra@oats.inaf.it [Institute of Astronomy and Astrophysics, Academia Sinica, P.O. Box 23-141, Taipei 10617, Taiwan (China); and others

    2016-06-01

    We present VIMOS-Very Large Telescope (VLT) spectroscopy of the Frontier Fields cluster MACS J0416.1-2403 ( z  = 0.397). Taken as part of the CLASH-VLT survey, the large spectroscopic campaign provided more than 4000 reliable redshifts over ∼600 arcmin{sup 2}, including ∼800 cluster member galaxies. The unprecedented sample of cluster members at this redshift allows us to perform a highly detailed dynamical and structural analysis of the cluster out to ∼2.2 r {sub 200} (∼4 Mpc). Our analysis of substructures reveals a complex system composed of a main massive cluster ( M {sub 200} ∼ 0.9 × 10{sup 15} M {sub ⊙} and σ{sub V,r200} ∼ 1000 km s{sup −1}) presenting two major features: (i) a bimodal velocity distribution, showing two central peaks separated by Δ V {sub rf} ∼ 1100 km s{sup −1} with comparable galaxy content and velocity dispersion, and (ii) a projected elongation of the main substructures along the NE–SW direction, with a prominent sub-clump ∼600 kpc SW of the center and an isolated BCG approximately halfway between the center and the SW clump. We also detect a low-mass structure at z  ∼ 0.390, ∼10′ south of the cluster center, projected at ∼3 Mpc, with a relative line-of-sight velocity of Δ V{sub rf} ∼ −1700 km s{sup −1}. The cluster mass profile that we obtain through our dynamical analysis deviates significantly from the “universal” NFW, being best fit by a Softened Isothermal Sphere model instead. The mass profile measured from the galaxy dynamics is found to be in relatively good agreement with those obtained from strong and weak lensing, as well as with that from the X-rays, despite the clearly unrelaxed nature of the cluster. Our results reveal an overall complex dynamical state of this massive cluster and support the hypothesis that the two main subclusters are being observed in a pre-collisional phase, in agreement with recent findings from radio and deep X-ray data. In this article, we

  3. Cloning of the biosynthetic gene cluster for naphthoxanthene antibiotic FD-594 from Streptomyces sp. TA-0256.

    Science.gov (United States)

    Kudo, Fumitaka; Yonezawa, Takanori; Komatsubara, Akiko; Mizoue, Kazutoshi; Eguchi, Tadashi

    2011-01-01

    FD-594 is an unique pyrano[4',3':6,7]naphtho[1,2-b]xanthene polyketide with a trisaccharide of 2,6-dideoxysugars. In this study, we cloned the FD-594 biosynthetic gene cluster from the producer strain Streptomyces sp. TA-0256 to investigate its biosynthesis. The identified pnx gene cluster was 38143 bp, consisting of 40 open reading frames, including a minimal PKS gene, TDP-olivose biosynthetic genes, two glycosyltransferase genes, two methyltransferase genes and many oxygenase/reductase genes. Most of these enzymes coded in the pnx cluster were reasonably assigned to a plausible biosynthetic pathway for FD-594, in which an unique ring opening process via Baeyer-Villiger-type oxidation catalyzed by a putative flavin adenine dinucleotide (FAD)-dependent monooxygenase, is speculated to lead to the unique xanthene structure. To clarify the involvement of pnx genes in the FD-594 biosynthesis, a glycosyltransferase, PnxGT2, and a methyltransferase, PnxMT2, were characterized enzymatically with the recombinant proteins expressed in Escherichia coli. As a result, PnxGT2 catalyzed the triple olivose transfers to the FD-594 aglycon with TDP-olivose as the glycosyl donor to afford triolivoside. Surprisingly, in the PnxGT2 enzymatic reaction, tetraolivoside and pentaolivoside were significantly detected along with the expected triolivoside. To our knowledge, PnxGT2 is the first contiguous oligosaccharide-forming glycosyltransferase in secondary metabolism. Furthermore, addition of PnxMT2 and S-adenosyl-L-methionine into the PnxGT2 reaction mixture afforded natural FD-594 to confirm that the PnxGT2 reaction product was the expected regiospecifically glycosylated compound. Consequently, the identified pnx gene cluster appears to be involved in FD-594 biosynthesis.

  4. Rearrangements of the beta-globin gene cluster in apparently typical betaS haplotypes.

    Science.gov (United States)

    Zago, M A; Silva, W A; Gualandro, S; Yokomizu, I K; Araujo, A G; Tavela, M H; Gerard, N; Krishnamoorthy, R; Elion, J

    2001-02-01

    The majority of the chromosomes with the betaS gene have one of the five common haplotypes, designated as Benin, Bantu, Senegal, Cameroon, and Arab-Indian haplotypes. However, 5-10% of the chromosomes have less common haplotypes, usually referred to as atypical haplotypes. We have demonstrated that most atypical haplotypes are generated by recombinations. The present study was carried out in order to explore whether recombination also occurs in chromosomes with the common (or typical) haplotypes. We screened the HS-2 region of the beta-globin gene locus control region (LCR) in 244 sickle cell patients who had typical restriction fragment length polymorphism (RFLP)-defined haplotypes of the betaS-gene cluster. For 14 cases in which the expected and the observed LCR repeat-sequence sizes were discrepant, the analysis was extended to other unexplored polymorphic markers of the bS-globin gene cluster, i.e.: pre-Ggamma framework, pre-Ggamma 6-bp deletion, HS-2 LCR (AT)xR(AT)y and pre-beta(AT)xTy repeats, and the intragenic beta-globin gene framework. In all 14 cases (15 chromosomes) in which the LCR repeat-sequence sizes were discrepant, a recombination involving a typical 3' segment of the betaS globin gene cluster was demonstrated. In most of the cases, the recombination site was located between the beta-globin gene and the betaLCR. Nine cases involving recombination were detected among 156 Brazilian HbS homozygotes and five among 88 African patients homozygotes for the Benin haplotype. INTERPRETATION AND CONCLUSIONS. Thus, 3.1% of apparently typical haplotypes linked to the sickle cell gene involve recombinations similar to those that generate the atypical haplotypes, a finding that reinforces the picture of the beta-globin gene cluster as highly dynamic.

  5. A genome-wide analysis of nonribosomal peptide synthetase gene clusters and their peptides in a Planktothrix rubescens strain

    Directory of Open Access Journals (Sweden)

    Nederbragt Alexander J

    2009-08-01

    Full Text Available Abstract Background Cyanobacteria often produce several different oligopeptides, with unknown biological functions, by nonribosomal peptide synthetases (NRPS. Although some cyanobacterial NRPS gene cluster types are well described, the entire NRPS genomic content within a single cyanobacterial strain has never been investigated. Here we have combined a genome-wide analysis using massive parallel pyrosequencing ("454" and mass spectrometry screening of oligopeptides produced in the strain Planktothrix rubescens NIVA CYA 98 in order to identify all putative gene clusters for oligopeptides. Results Thirteen types of oligopeptides were uncovered by mass spectrometry (MS analyses. Microcystin, cyanopeptolin and aeruginosin synthetases, highly similar to already characterized NRPS, were present in the genome. Two novel NRPS gene clusters were associated with production of anabaenopeptins and microginins, respectively. Sequence-depth of the genome and real-time PCR data revealed three copies of the microginin gene cluster. Since NRPS gene cluster candidates for microviridin and oscillatorin synthesis could not be found, putative (gene encoded precursor peptide sequences to microviridin and oscillatorin were found in the genes mdnA and oscA, respectively. The genes flanking the microviridin and oscillatorin precursor genes encode putative modifying enzymes of the precursor oligopeptides. We therefore propose ribosomal pathways involving modifications and cyclisation for microviridin and oscillatorin. The microviridin, anabaenopeptin and cyanopeptolin gene clusters are situated in close proximity to each other, constituting an oligopeptide island. Conclusion Altogether seven nonribosomal peptide synthetase (NRPS gene clusters and two gene clusters putatively encoding ribosomal oligopeptide biosynthetic pathways were revealed. Our results demonstrate that whole genome shotgun sequencing combined with MS-directed determination of oligopeptides successfully

  6. A phase synchronization clustering algorithm for identifying interesting groups of genes from cell cycle expression data

    Directory of Open Access Journals (Sweden)

    Tcha Hong

    2008-01-01

    Full Text Available Abstract Background The previous studies of genome-wide expression patterns show that a certain percentage of genes are cell cycle regulated. The expression data has been analyzed in a number of different ways to identify cell cycle dependent genes. In this study, we pose the hypothesis that cell cycle dependent genes are considered as oscillating systems with a rhythm, i.e. systems producing response signals with period and frequency. Therefore, we are motivated to apply the theory of multivariate phase synchronization for clustering cell cycle specific genome-wide expression data. Results We propose the strategy to find groups of genes according to the specific biological process by analyzing cell cycle specific gene expression data. To evaluate the propose method, we use the modified Kuramoto model, which is a phase governing equation that provides the long-term dynamics of globally coupled oscillators. With this equation, we simulate two groups of expression signals, and the simulated signals from each group shares their own common rhythm. Then, the simulated expression data are mixed with randomly generated expression data to be used as input data set to the algorithm. Using these simulated expression data, it is shown that the algorithm is able to identify expression signals that are involved in the same oscillating process. We also evaluate the method with yeast cell cycle expression data. It is shown that the output clusters by the proposed algorithm include genes, which are closely associated with each other by sharing significant Gene Ontology terms of biological process and/or having relatively many known biological interactions. Therefore, the evaluation analysis indicates that the method is able to identify expression signals according to the specific biological process. Our evaluation analysis also indicates that some portion of output by the proposed algorithm is not obtainable by the traditional clustering algorithm with

  7. Identification of a gene cluster associated with triclosan catabolism.

    Science.gov (United States)

    Kagle, Jeanne M; Paxson, Clayton; Johnstone, Precious; Hay, Anthony G

    2015-06-01

    Aerobic degradation of bis-aryl ethers like the antimicrobial triclosan typically proceeds through oxygenase-dependent catabolic pathways. Although several studies have reported on bacteria capable of degrading triclosan aerobically, there are no reports describing the genes responsible for this process. In this study, a gene encoding the large subunit of a putative triclosan oxygenase, designated tcsA was identified in a triclosan-degrading fosmid clone from a DNA library of Sphingomonas sp. RD1. Consistent with tcsA's similarity to two-part dioxygenases, a putative FMN-dependent ferredoxin reductase, designated tcsB was found immediately downstream of tcsA. Both tcsAB were found in the midst of a putative chlorocatechol degradation operon. We show that RD1 produces hydroxytriclosan and chlorocatechols during triclosan degradation and that tcsA is induced by triclosan. This is the first study to report on the genetics of triclosan degradation.

  8. Identification of certain cancer-mediating genes using Gaussian fuzzy cluster validity index.

    Science.gov (United States)

    Ghosh, Anupam; De, Rajat K

    2015-10-01

    In this article, we have used an index, called Gaussian fuzzy index (GFI), recently developed by the authors, based on the notion of fuzzy set theory, for validating the clusters obtained by a clustering algorithm applied on cancer gene expression data. GFI is then used for the identification of genes that have altered quite significantly from normal state to carcinogenic state with respect to their mRNA expression patterns. The effectiveness of the methodology has been demonstrated on three gene expression cancer datasets dealing with human lung, colon and leukemia. The performance of GFI is compared with 19 exiting cluster validity indices. The results are appropriately validated biologically and statistically. In this context, we have used biochemical pathways, p-value statistics of GO attributes, t-test and zscore for the validation of the results. It has been reported that GFI is capable of identifying high-quality enriched clusters of genes, and thereby is able to select more cancer-mediating genes.

  9. Mapping of the {alpha}{sub 4} subunit gene (GABRA4) to human chromosome 4 defines an {alpha}{sub 2}-{alpha}{sub 4}-{beta}{sub 1}-{gamma}{sub 1} gene cluster: Further evidence that modern GABA{sub a} receptor gene clusters are derived from an ancestral cluster

    Energy Technology Data Exchange (ETDEWEB)

    McLean, P.J.; Farb, D.H.; Russek, S.J. [Boston Univ. School of Medicine, MA (United States)] [and others

    1995-04-10

    We demonstrated previously that an {alpha}{sub 1}-{beta}{sub 2}-{gamma}{sub 2} gene cluster of the {gamma}-aminobutyric acid (GABA{sub A}) receptor is located on human chromosome 5q34-q35 and that an ancestral {alpha}-{beta}-{gamma} gene cluster probably spawned clusters on chromosomes 4, 5, and 15. Here, we report that the {alpha}{sub 4} gene (GABRA4) maps to human chromosome 4p14-q12, defining a cluster comprising the {alpha}{sub 2}, {alpha}{sub 4}, {beta}{sub 1}, and {gamma}{sub 1} genes. The existence of an {alpha}{sub 2}-{alpha}{sub 4}-{beta}{sub 1}-{gamma}{sub 2} cluster on chromosome 4 and an {alpha}{sub 1}-{alpha}{sub 6}-{beta}{sub 2}-{gamma}{sub 2} cluster on chromosome 5 provides further evidence that the number of ancestral GABA{sub A} receptor subunit genes has been expanded by duplication within an ancestral gene cluster. Moreover, if duplication of the {alpha} gene occurred before duplication of the ancestral gene cluster, then a heretofore undiscovered subtype of a subunit should be located on human chromosome 15q11-q13 within an {alpha}{sub 5}-{alpha}{sub x}-{beta}{sub 3}-{gamma}{sub 3} gene cluster at the locus for Angelman and Prader-Willi syndromes. 34 refs., 6 figs., 1 tab.

  10. clusters

    Indian Academy of Sciences (India)

    2017-09-27

    Sep 27, 2017 ... while CuCoNO, Co3NO, Cu3CoNO, Cu2Co3NO, Cu3Co3NO and Cu6CoNO clusters display stronger chemical stability. Magnetic and electronic properties are also discussed. The magnetic moment is affected by charge transfer and the spd hybridization. Keywords. CumConNO (m + n = 2–7) clusters; ...

  11. Teaching Gene Technology in an Outreach Lab: Students' Assigned Cognitive Load Clusters and the Clusters' Relationships to Learner Characteristics, Laboratory Variables, and Cognitive Achievement

    Science.gov (United States)

    Scharfenberg, Franz-Josef; Bogner, Franz X.

    2013-01-01

    This study classified students into different cognitive load (CL) groups by means of cluster analysis based on their experienced CL in a gene technology outreach lab which has instructionally been designed with regard to CL theory. The relationships of the identified student CL clusters to learner characteristics, laboratory variables, and…

  12. Unique actinomycetes from marine caves and coral reef sediments provide novel PKS and NRPS biosynthetic gene clusters.

    Science.gov (United States)

    Hodges, Tyler W; Slattery, Marc; Olson, Julie B

    2012-06-01

    In the ever-expanding search for novel bioactive molecules and enzymes, marine actinomycetes have proven to be a productive source. While open reef sediment and sponge-associated actinomycetes have been extensively examined, their marine cave counterparts remain unevaluated. Anchialine cave systems in the Bahamas offered an ideal setting to evaluate the occurrence and variation within sediment-associated actinomycete communities. While in close geographical proximity to open reef environments, these systems provide a specialized environmental niche devoid of light and direct exposure to nutrient input. In the present study, selective isolation techniques and molecular methods were used to test the hypothesis that variable distribution of actinomycetes and secondary metabolite gene clusters occur between open reef and marine cave systems. The results indicated that differences exist within the culturable sediment-associated actinomycete communities between marine caves and open reef systems, with members of the genus Streptomyces dominating cultures from open reef sediments and a more diverse suite of actinomycetes isolated from marine cave sediment samples. Within the cave isolates, members of the proposed genus Solwaraspora were the most represented. Based on PKS- and NRPS-gene-targeted PCR amplification and sequencing, geographic variation in the occurrence of these biosynthetic pathways was also observed. These findings indicate that marine cave systems are a lucrative source in the search for novel secondary metabolite producers with biotechnological applications and that environmental and geographic factors likely affect the occurrence of these biosynthetic pathways.

  13. Identification of the Viridicatumtoxin and Griseofulvin Gene Clusters from Penicillium aethiopicum

    Science.gov (United States)

    Chooi, Yit-Heng; Cacho, Ralph; Tang, Yi

    2010-01-01

    SUMMARY Penicillium aethiopicum produces two structurally interesting and biologically active polyketides: the tetracycline-like viridicatumtoxin 1 and the classic antifungal agent griseofulvin 2. Here, we report the concurrent discovery of the two corresponding biosynthetic gene clusters (vrt and gsf) by 454 shotgun sequencing. Gene deletions confirmed two nonreducing PKSs (NRPKS), vrtA and gsfA, are required for the biosynthesis of 1 and 2, respectively. Both PKSs share similar domain architectures and lack a C-terminal thioesterase domain. We identified gsfI as the chlorinase involved in the biosynthesis of 2, as deletion of gsfI resulted in the accumulation of decholorogriseofulvin 3. Comparative analysis with the P. chrysogenum genome revealed that both clusters are embedded within conserved syntenic regions of P. aethiopicum chromosomes. Discovery of the vrt and gsf clusters provided the basis for genetic and biochemical studies of the pathways. PMID:20534346

  14. Evolutionary history of the phl gene cluster in the plant-associated bacterium Pseudomonas fluorescens

    NARCIS (Netherlands)

    Moynihan, J.A.; Morrissey, J.P.; Coppoolse, E.; Stiekema, W.J.; O'Gara, F.; Boyd, E.F.

    2009-01-01

    Pseudomonas fluorescens is of agricultural and economic importance as a biological control agent largely because of its plant-association and production of secondary metabolites, in particular 2, 4-diacetylphloroglucinol (2, 4-DAPG). This polyketide, which is encoded by the eight gene phl cluster,

  15. Insights into secondary metabolism from a global analysis of prokaryotic biosynthetic gene clusters

    NARCIS (Netherlands)

    Cimermancic, P.; Medema, Marnix; Claesen, J.; Kurika, K.; Wieland Brown, L.C.; Mavrommatis, K.; Pati, A.; Godfrey, P.A.; Koehrsen, M.; Clardy, J.; Birren, B. W.; Takano, Eriko; Sali, A.; Linington, R.G.; Fischbach, M.A.

    2014-01-01

    Although biosynthetic gene clusters (BGCs) have been discovered for hundreds of bacterial metabolites, our knowledge of their diversity remains limited. Here, we used a novel algorithm to systematically identify BGCs in the extensive extant microbial sequencing data. Network analysis of the

  16. Molecular population genetics of the β-esterase gene cluster of ...

    Indian Academy of Sciences (India)

    We suggest that the demographic history (bottleneck and admixture of genetically differentiated populations) is the major factor shaping the pattern of nucleotide polymorphism in the -esterase gene cluster. However there are some 'footprints' of directional and balancing selection shaping specific distribution of nucleotide ...

  17. Molecular population genetics of the β-esterase gene cluster of ...

    Indian Academy of Sciences (India)

    Unknown

    neutrality with recombination are significant for the β−esterase gene cluster in the non-African samples but not signi- ficant in the African one. We suggest ...... I. Viability studies. Genetics 102,. 467–483. Selva E. M., New L., Crouse G. F. and Lahue R. S. 1995 Mis- match correction acts as a barrier to homologous recombina-.

  18. GenClust: A genetic algorithm for clustering gene expression data

    Directory of Open Access Journals (Sweden)

    Raimondi Alessandra

    2005-12-01

    Full Text Available Abstract Background Clustering is a key step in the analysis of gene expression data, and in fact, many classical clustering algorithms are used, or more innovative ones have been designed and validated for the task. Despite the widespread use of artificial intelligence techniques in bioinformatics and, more generally, data analysis, there are very few clustering algorithms based on the genetic paradigm, yet that paradigm has great potential in finding good heuristic solutions to a difficult optimization problem such as clustering. Results GenClust is a new genetic algorithm for clustering gene expression data. It has two key features: (a a novel coding of the search space that is simple, compact and easy to update; (b it can be used naturally in conjunction with data driven internal validation methods. We have experimented with the FOM methodology, specifically conceived for validating clusters of gene expression data. The validity of GenClust has been assessed experimentally on real data sets, both with the use of validation measures and in comparison with other algorithms, i.e., Average Link, Cast, Click and K-means. Conclusion Experiments show that none of the algorithms we have used is markedly superior to the others across data sets and validation measures; i.e., in many cases the observed differences between the worst and best performing algorithm may be statistically insignificant and they could be considered equivalent. However, there are cases in which an algorithm may be better than others and therefore worthwhile. In particular, experiments for GenClust show that, although simple in its data representation, it converges very rapidly to a local optimum and that its ability to identify meaningful clusters is comparable, and sometimes superior, to that of more sophisticated algorithms. In addition, it is well suited for use in conjunction with data driven internal validation measures and, in particular, the FOM methodology.

  19. Genomic and expression analysis of the vanG-like gene cluster of Clostridium difficile.

    Science.gov (United States)

    Peltier, Johann; Courtin, Pascal; El Meouche, Imane; Catel-Ferreira, Manuella; Chapot-Chartier, Marie-Pierre; Lemée, Ludovic; Pons, Jean-Louis

    2013-07-01

    Primary antibiotic treatment of Clostridium difficile intestinal diseases requires metronidazole or vancomycin therapy. A cluster of genes homologous to enterococcal glycopeptides resistance vanG genes was found in the genome of C. difficile 630, although this strain remains sensitive to vancomycin. This vanG-like gene cluster was found to consist of five ORFs: the regulatory region consisting of vanR and vanS and the effector region consisting of vanG, vanXY and vanT. We found that 57 out of 83 C. difficile strains, representative of the main lineages of the species, harbour this vanG-like cluster. The cluster is expressed as an operon and, when present, is found at the same genomic location in all strains. The vanG, vanXY and vanT homologues in C. difficile 630 are co-transcribed and expressed to a low level throughout the growth phases in the absence of vancomycin. Conversely, the expression of these genes is strongly induced in the presence of subinhibitory concentrations of vancomycin, indicating that the vanG-like operon is functional at the transcriptional level in C. difficile. Hydrophilic interaction liquid chromatography (HILIC-HPLC) and MS analysis of cytoplasmic peptidoglycan precursors of C. difficile 630 grown without vancomycin revealed the exclusive presence of a UDP-MurNAc-pentapeptide with an alanine at the C terminus. UDP-MurNAc-pentapeptide [d-Ala] was also the only peptidoglycan precursor detected in C. difficile grown in the presence of vancomycin, corroborating the lack of vancomycin resistance. Peptidoglycan structures of a vanG-like mutant strain and of a strain lacking the vanG-like cluster did not differ from the C. difficile 630 strain, indicating that the vanG-like cluster also has no impact on cell-wall composition.

  20. Identification, characterization and metagenome analysis of oocyte-specific genes organized in clusters in the mouse genome

    Directory of Open Access Journals (Sweden)

    Vaiman Daniel

    2005-05-01

    Full Text Available Abstract Background Genes specifically expressed in the oocyte play key roles in oogenesis, ovarian folliculogenesis, fertilization and/or early embryonic development. In an attempt to identify novel oocyte-specific genes in the mouse, we have used an in silico subtraction methodology, and we have focused our attention on genes that are organized in genomic clusters. Results In the present work, five clusters have been studied: a cluster of thirteen genes characterized by an F-box domain localized on chromosome 9, a cluster of six genes related to T-cell leukaemia/lymphoma protein 1 (Tcl1 on chromosome 12, a cluster composed of a SPErm-associated glutamate (E-Rich (Speer protein expressed in the oocyte in the vicinity of four unknown genes specifically expressed in the testis on chromosome 14, a cluster composed of the oocyte secreted protein-1 (Oosp-1 gene and two Oosp-related genes on chromosome 19, all three being characterized by a partial N-terminal zona pellucida-like domain, and another small cluster of two genes on chromosome 19 as well, composed of a TWIK-Related spinal cord K+ channel encoding-gene, and an unknown gene predicted in silico to be testis-specific. The specificity of expression was confirmed by RT-PCR and in situ hybridization for eight and five of them, respectively. Finally, we showed by comparing all of the isolated and clustered oocyte-specific genes identified so far in the mouse genome, that the oocyte-specific clusters are significantly closer to telomeres than isolated oocyte-specific genes are. Conclusion We have studied five clusters of genes specifically expressed in female, some of them being also expressed in male germ-cells. Moreover, contrarily to non-clustered oocyte-specific genes, those that are organized in clusters tend to map near chromosome ends, suggesting that this specific near-telomere position of oocyte-clusters in rodents could constitute an evolutionary advantage. Understanding the biological

  1. Multiplexed CRISPR/Cas9- and TAR-Mediated Promoter Engineering of Natural Product Biosynthetic Gene Clusters in Yeast.

    Science.gov (United States)

    Kang, Hahk-Soo; Charlop-Powers, Zachary; Brady, Sean F

    2016-09-16

    The use of DNA sequencing to guide the discovery of natural products has emerged as a new paradigm for revealing chemistries encoded in bacterial genomes. A major obstacle to implementing this approach to natural product discovery is the transcriptional silence of biosynthetic gene clusters under laboratory growth conditions. Here we describe an improved yeast-based promoter engineering platform (mCRISTAR) that combines CRISPR/Cas9 and TAR to enable single-marker multiplexed promoter engineering of large gene clusters. mCRISTAR highlights the first application of the CRISPR/Cas9 system to multiplexed promoter engineering of natural product biosynthetic gene clusters. In this method, CRISPR/Cas9 is used to induce DNA double-strand breaks in promoter regions of biosynthetic gene clusters, and the resulting operon fragments are reassembled by TAR using synthetic gene-cluster-specific promoter cassettes. mCRISTAR uses a CRISPR array to simplify the construction of a CRISPR plasmid for multiplex CRISPR and a single auxotrophic selection to improve the inefficiency of using a CRISPR array for multiplex gene cluster refactoring. mCRISTAR is a simple and generic method for multiplexed replacement of promoters in biosynthetic gene clusters that will facilitate the discovery of natural products from the rapidly growing collection of gene clusters found in microbial genome and metagenome sequencing projects.

  2. Clustering of two genes putatively involved in cyanate detoxification evolved recently and independently in multiple fungal lineages

    Science.gov (United States)

    Fungi that have the enzymes cyanase and carbonic anhydrase show a limited capacity to detoxify cyanate, a fungicide employed by both plants and humans. Here, we describe a novel two-gene cluster that comprises duplicated cyanase and carbonic anhydrase copies, which we name the CCA gene cluster, trac...

  3. Gene clustering analysis in human osteoporosis disease and modifications of the jawbone.

    Science.gov (United States)

    Toti, Paolo; Sbordone, Carolina; Martuscelli, Ranieri; Califano, Luigi; Ramaglia, Luca; Sbordone, Ludovico

    2013-08-01

    An analysis of the genes involved in both osteoporosis and modifications of the jawbone, through text mining, using a web search tool, of information regarding gene/protein interaction. The final set of genes involved in the present phenomenon was obtained by expansion-filtering loop. Using a web-available software (STRING), interactions among all genes were searched for, and a clustering procedure was performed in which only high-confidence predicted associations were considered. Two hundred forty-two genes potentially involved in osteoporosis and in modifications of the jawbone were recorded. Seven "leader genes" were identified (CTNNB1, IL1B, IL6, JUN, RUNX2, SPP1, TGFB1), while another 10 genes formed the cluster B group (BMP2, BMP7, COL1A1, ICAM1, IGF1, IL10, MMP9, NFKB1, TNFSF11, VEGFA). Ninety-eight genes had no interactions, and were defined as "orphan genes". The expansion of knowledge regarding the molecular basis causing osteoporotic traits has been brought about with the help of a de novo identification, based on the data mining of genes involved in osteoporosis and in modification of the jawbone. A comparison of the present data, in which no role was verified for 98 genes that had been previously supposed to have a role, with that of the literature, in which another 81 genes, as obtained from GWAS reviews and meta-analyses, appeared to be strongly associated with osteoporosis, probably attests to a lack of information on osteoporotic disease. Copyright © 2013 Elsevier Ltd. All rights reserved.

  4. Genome-wide identification, subcellular localization and gene expression analysis of the members of CESA gene family in common tobacco (Nicotiana tabacum L.).

    Science.gov (United States)

    Xu, Zong-Chang; Kong, Yingzhen

    2017-06-20

    Cellulose-synthase proteins (CESAs) are membrane localized proteins and they form protein complexes to produce cellulose in the plasma membrane. CESA proteins play very important roles in cell wall construction during plant growth and development. In this study, a total of 21 NtCESA gene sequences were identified by using PF03552 conserved protein sequence and 10 AtCESA protein sequences of Arabidopsis thaliana to blast against the common tobacco (Nicotiana tabacum L.) genome database with TBLASTN protocol. We analyzed the physical and chemical properties of protein sequences based on some software or on-line analysis tools. The results showed that there were no significant variances in terms of the physical and chemical properties of the 21 NtCESA proteins. First, phylogenetic tree analysis showed that 21 NtCESA genes and 10 AtCESA genes were clustered into five groups, and the gene structures were similar among the genes that are clustered into the same group. Second, in all of the 21 NtCESA proteins the conserved zinc finger domain was identified in the N-terminus, transmembrane domains were identified in the C-terminus and the DDD-QXXRW conserved domains were also identified. Third, gene expression analysis results indicated that most NtCESA genes were expressed in roots and leaves of seedling or mature tissues of tobacco, seeds and callus tissues. The genes that clustered into the same group share similar expression patterns. Importantly, NtCESA proteins that are involved in secondary cell wall cellulose synthesis have two extra transmembrane domains compared with that involved in primary cell wall cellulose biosynthesis. In addition, subcellular localization results showed that NtCESA9 and NtCESA14 were two plasma membrane anchored proteins. This study will lay a foundation for further functional characterization of these NtCESA genes.

  5. Theobroma cacao L. pathogenesis-related gene tandem array members show diverse expression dynamics in response to pathogen colonization.

    Science.gov (United States)

    Fister, Andrew S; Mejia, Luis C; Zhang, Yufan; Herre, Edward Allen; Maximova, Siela N; Guiltinan, Mark J

    2016-05-17

    The pathogenesis-related (PR) group of proteins are operationally defined as polypeptides that increase in concentration in plant tissues upon contact with a pathogen. To date, 17 classes of highly divergent proteins have been described that act through multiple mechanisms of pathogen resistance. Characterizing these families in cacao, an economically important tree crop, and comparing the families to those in other species, is an important step in understanding cacao's immune response. Using publically available resources, all members of the 17 recognized pathogenesis-related gene families in the genome of Theobroma cacao were identified and annotated resulting in a set of ~350 members in both published cacao genomes. Approximately 50 % of these genes are organized in tandem arrays scattered throughout the genome. This feature was observed in five additional plant taxa (three dicots and two monocots), suggesting that tandem duplication has played an important role in the evolution of the PR genes in higher plants. Expression profiling captured the dynamics and complexity of PR genes expression at basal levels and after induction by two cacao pathogens (the oomycete, Phytophthora palmivora, and the fungus, Colletotrichum theobromicola), identifying specific genes within families that are more responsive to pathogen challenge. Subsequent qRT-PCR validated the induction of several PR-1, PR-3, PR-4, and PR-10 family members, with greater than 1000 fold induction detected for specific genes. We describe candidate genes that are likely to be involved in cacao's defense against Phytophthora and Colletotrichum infection and could be potentially useful for marker-assisted selection for breeding of disease resistant cacao varieties. The data presented here, along with existing cacao-omics resources, will enable targeted functional genetic screening of defense genes likely to play critical functions in cacao's defense against its pathogens.

  6. Transcriptional regulation of gene expression clusters in motor neurons following spinal cord injury

    DEFF Research Database (Denmark)

    Ryge, J.; Winther, Ole; Wienecke, J.

    2010-01-01

    Background: Spinal cord injury leads to neurological dysfunctions affecting the motor, sensory as well as the autonomic systems. Increased excitability of motor neurons has been implicated in injury-induced spasticity, where the reappearance of self-sustained plateau potentials in the absence...... of modulatory inputs from the brain correlates with the development of spasticity. Results: Here we examine the dynamic transcriptional response of motor neurons to spinal cord injury as it evolves over time to unravel common gene expression patterns and their underlying regulatory mechanisms. For this we use...... a rat-tail-model with complete spinal cord transection causing injury-induced spasticity, where gene expression profiles are obtained from labeled motor neurons extracted with laser microdissection 0, 2, 7, 21 and 60 days post injury. Consensus clustering identifies 12 gene clusters with distinct time...

  7. cluster

    Indian Academy of Sciences (India)

    has been investigated electrochemically in positive and negative microenvironments, both in solution and in film. Charge nature around the active centre ... in plants, bacteria and also in mammals. This cluster is also an important constituent of a ..... selection of non-cysteine amino acid in the active centre of Rieske proteins.

  8. A scan statistic to extract causal gene clusters from case-control genome-wide rare CNV data

    Directory of Open Access Journals (Sweden)

    Scherer Stephen W

    2011-05-01

    Full Text Available Abstract Background Several statistical tests have been developed for analyzing genome-wide association data by incorporating gene pathway information in terms of gene sets. Using these methods, hundreds of gene sets are typically tested, and the tested gene sets often overlap. This overlapping greatly increases the probability of generating false positives, and the results obtained are difficult to interpret, particularly when many gene sets show statistical significance. Results We propose a flexible statistical framework to circumvent these problems. Inspired by spatial scan statistics for detecting clustering of disease occurrence in the field of epidemiology, we developed a scan statistic to extract disease-associated gene clusters from a whole gene pathway. Extracting one or a few significant gene clusters from a global pathway limits the overall false positive probability, which results in increased statistical power, and facilitates the interpretation of test results. In the present study, we applied our method to genome-wide association data for rare copy-number variations, which have been strongly implicated in common diseases. Application of our method to a simulated dataset demonstrated the high accuracy of this method in detecting disease-associated gene clusters in a whole gene pathway. Conclusions The scan statistic approach proposed here shows a high level of accuracy in detecting gene clusters in a whole gene pathway. This study has provided a sound statistical framework for analyzing genome-wide rare CNV data by incorporating topological information on the gene pathway.

  9. Two Gene Clusters Coordinate Galactose and Lactose Metabolism in Streptococcus gordonii

    Science.gov (United States)

    Zeng, Lin; Martino, Nicole C.

    2012-01-01

    Streptococcus gordonii is an early colonizer of the human oral cavity and an abundant constituent of oral biofilms. Two tandemly arranged gene clusters, designated lac and gal, were identified in the S. gordonii DL1 genome, which encode genes of the tagatose pathway (lacABCD) and sugar phosphotransferase system (PTS) enzyme II permeases. Genes encoding a predicted phospho-β-galactosidase (LacG), a DeoR family transcriptional regulator (LacR), and a transcriptional antiterminator (LacT) were also present in the clusters. Growth and PTS assays supported that the permease designated EIILac transports lactose and galactose, whereas EIIGal transports galactose. The expression of the gene for EIIGal was markedly upregulated in cells growing on galactose. Using promoter-cat fusions, a role for LacR in the regulation of the expressions of both gene clusters was demonstrated, and the gal cluster was also shown to be sensitive to repression by CcpA. The deletion of lacT caused an inability to grow on lactose, apparently because of its role in the regulation of the expression of the genes for EIILac, but had little effect on galactose utilization. S. gordonii maintained a selective advantage over Streptococcus mutans in a mixed-species competition assay, associated with its possession of a high-affinity galactose PTS, although S. mutans could persist better at low pHs. Collectively, these results support the concept that the galactose and lactose systems of S. gordonii are subject to complex regulation and that a high-affinity galactose PTS may be advantageous when S. gordonii is competing against the caries pathogen S. mutans in oral biofilms. PMID:22660715

  10. DMRT gene cluster analysis in the platypus: new insights into genomic organization and regulatory regions.

    Science.gov (United States)

    El-Mogharbel, Nisrine; Wakefield, Matthew; Deakin, Janine E; Tsend-Ayush, Enkhjargal; Grützner, Frank; Alsop, Amber; Ezaz, Tariq; Marshall Graves, Jennifer A

    2007-01-01

    We isolated and characterized a cluster of platypus DMRT genes and compared their arrangement, location, and sequence across vertebrates. The DMRT gene cluster on human 9p24.3 harbors, in order, DMRT1, DMRT3, and DMRT2, which share a DM domain. DMRT1 is highly conserved and involved in sexual development in vertebrates, and deletions in this region cause sex reversal in humans. Sequence comparisons of DMRT genes between species have been valuable in identifying exons, control regions, and conserved nongenic regions (CNGs). The addition of platypus sequences is expected to be particularly valuable, since monotremes fill a gap in the vertebrate genome coverage. We therefore isolated and fully sequenced platypus BAC clones containing DMRT3 and DMRT2 as well as DMRT1 and then generated multispecies alignments and ran prediction programs followed by experimental verification to annotate this gene cluster. We found that the three genes have 58-66% identity to their human orthologues, lie in the same order as in other vertebrates, and colocate on 1 of the 10 platypus sex chromosomes, X5. We also predict that optimal annotation of the newly sequenced platypus genome will be challenging. The analysis of platypus sequence revealed differences in structure and sequence of the DMRT gene cluster. Multispecies comparison was particularly effective for detecting CNGs, revealing several novel potential regulatory regions within DMRT3 and DMRT2 as well as DMRT1. RT-PCR indicated that platypus DMRT1 and DMRT3 are expressed specifically in the adult testis (and not ovary), but DMRT2 has a wider expression profile, as it does for other mammals. The platypus DMRT1 expression pattern, and its location on an X chromosome, suggests an involvement in monotreme sexual development.

  11. A novel method incorporating gene ontology information for unsupervised clustering and feature selection.

    Directory of Open Access Journals (Sweden)

    Shireesh Srivastava

    Full Text Available Among the primary goals of microarray analysis is the identification of genes that could distinguish between different phenotypes (feature selection. Previous studies indicate that incorporating prior information of the genes' function could help identify physiologically relevant features. However, current methods that incorporate prior functional information do not provide a relative estimate of the effect of different genes on the biological processes of interest.Here, we present a method that integrates gene ontology (GO information and expression data using Bayesian regression mixture models to perform unsupervised clustering of the samples and identify physiologically relevant discriminating features. As a model application, the method was applied to identify the genes that play a role in the cytotoxic responses of human hepatoblastoma cell line (HepG2 to saturated fatty acid (SFA and tumor necrosis factor (TNF-alpha, as compared to the non-toxic response to the unsaturated FFAs (UFA and TNF-alpha. Incorporation of prior knowledge led to a better discrimination of the toxic phenotypes from the others. The model identified roles of lysosomal ATPases and adenylate cyclase (AC9 in the toxicity of palmitate. To validate the role of AC in palmitate-treated cells, we measured the intracellular levels of cyclic AMP (cAMP. The cAMP levels were found to be significantly reduced by palmitate treatment and not by the other FFAs, in accordance with the model selection of AC9.A framework is presented that incorporates prior ontology information, which helped to (a perform unsupervised clustering of the phenotypes, and (b identify the genes relevant to each cluster of phenotypes. We demonstrate the proposed framework by applying it to identify physiologically-relevant feature genes that conferred differential toxicity to saturated vs. unsaturated FFAs. The framework can be applied to other problems to efficiently integrate ontology information and

  12. An enhanced deterministic K-Means clustering algorithm for cancer subtype prediction from gene expression data.

    Science.gov (United States)

    Nidheesh, N; Abdul Nazeer, K A; Ameer, P M

    2017-12-01

    Clustering algorithms with steps involving randomness usually give different results on different executions for the same dataset. This non-deterministic nature of algorithms such as the K-Means clustering algorithm limits their applicability in areas such as cancer subtype prediction using gene expression data. It is hard to sensibly compare the results of such algorithms with those of other algorithms. The non-deterministic nature of K-Means is due to its random selection of data points as initial centroids. We propose an improved, density based version of K-Means, which involves a novel and systematic method for selecting initial centroids. The key idea of the algorithm is to select data points which belong to dense regions and which are adequately separated in feature space as the initial centroids. We compared the proposed algorithm to a set of eleven widely used single clustering algorithms and a prominent ensemble clustering algorithm which is being used for cancer data classification, based on the performances on a set of datasets comprising ten cancer gene expression datasets. The proposed algorithm has shown better overall performance than the others. There is a pressing need in the Biomedical domain for simple, easy-to-use and more accurate Machine Learning tools for cancer subtype prediction. The proposed algorithm is simple, easy-to-use and gives stable results. Moreover, it provides comparatively better predictions of cancer subtypes from gene expression data. Copyright © 2017 Elsevier Ltd. All rights reserved.

  13. Co-ordinate regulation of cytokinin gene family members during flag leaf and reproductive development in wheat.

    Science.gov (United States)

    Song, Jiancheng; Jiang, Lijun; Jameson, Paula Elizabeth

    2012-06-06

    As the global population continues to expand, increasing yield in bread wheat is of critical importance as 20% of the world's food supply is sourced from this cereal. Several recent studies of the molecular basis of grain yield indicate that the cytokinins are a key factor in determining grain yield. In this study, cytokinin gene family members in bread wheat were isolated from four multigene families which regulate cytokinin synthesis and metabolism, the isopentenyl transferases (IPT), cytokinin oxidases (CKX), zeatin O-glucosyltransferases (ZOG), and β-glucosidases (GLU). As bread wheat is hexaploid, each gene family is also likely to be represented on the A, B and D genomes. By using a novel strategy of qRT-PCR with locus-specific primers shared among the three homoeologues of each family member, detailed expression profiles are provided of family members of these multigene families expressed during leaf, spike and seed development. The expression patterns of individual members of the IPT, CKX, ZOG, and GLU multigene families in wheat are shown to be tissue- and developmentally-specific. For instance, TaIPT2 and TaCKX1 were the most highly expressed family members during early seed development, with relative expression levels of up to 90- and 900-fold higher, respectively, than those in the lowest expressed samples. The expression of two cis-ZOG genes was sharply increased in older leaves, while an extremely high mRNA level of TaGLU1-1 was detected in young leaves. Key genes with tissue- and developmentally-specific expression have been identified which would be prime targets for genetic manipulation towards yield improvement in bread wheat breeding programmes, utilising TILLING and MAS strategies.

  14. Genetic variations and haplotype diversity of the UGT1 gene cluster in the Chinese population.

    Directory of Open Access Journals (Sweden)

    Jing Yang

    Full Text Available Vertebrates require tremendous molecular diversity to defend against numerous small hydrophobic chemicals. UDP-glucuronosyltransferases (UGTs are a large family of detoxification enzymes that glucuronidate xenobiotics and endobiotics, facilitating their excretion from the body. The UGT1 gene cluster contains a tandem array of variable first exons, each preceded by a specific promoter, and a common set of downstream constant exons, similar to the genomic organization of the protocadherin (Pcdh, immunoglobulin, and T-cell receptor gene clusters. To assist pharmacogenomics studies in Chinese, we sequenced nine first exons, promoter and intronic regions, and five common exons of the UGT1 gene cluster in a population sample of 253 unrelated Chinese individuals. We identified 101 polymorphisms and found 15 novel SNPs. We then computed allele frequencies for each polymorphism and reconstructed their linkage disequilibrium (LD map. The UGT1 cluster can be divided into five linkage blocks: Block 9 (UGT1A9, Block 9/7/6 (UGT1A9, UGT1A7, and UGT1A6, Block 5 (UGT1A5, Block 4/3 (UGT1A4 and UGT1A3, and Block 3' UTR. Furthermore, we inferred haplotypes and selected their tagSNPs. Finally, comparing our data with those of three other populations of the HapMap project revealed ethnic specificity of the UGT1 genetic diversity in Chinese. These findings have important implications for future molecular genetic studies of the UGT1 gene cluster as well as for personalized medical therapies in Chinese.

  15. Sequencing and transcriptional analysis of the Streptococcus thermophilus histamine biosynthesis gene cluster: factors that affect differential hdcA expression

    DEFF Research Database (Denmark)

    Calles-Enríquez, Marina; Hjort, Benjamin Benn; Andersen, Pia Skov

    2010-01-01

    to produce histamine. The hdc clusters of S. thermophilus CHCC1524 and CHCC6483 were sequenced, and the factors that affect histamine biosynthesis and histidine-decarboxylating gene (hdcA) expression were studied. The hdc cluster began with the hdcA gene, was followed by a transporter (hdcP), and ended...... with the hdcB gene, which is of unknown function. The three genes were orientated in the same direction. The genetic organization of the hdc cluster showed a unique organization among the lactic acid bacterial group and resembled those of Staphylococcus and Clostridium species, thus indicating possible...... acquisition through a horizontal transfer mechanism. Transcriptional analysis of the hdc cluster revealed the existence of a polycistronic mRNA covering the three genes. The histidine-decarboxylating gene (hdcA) of S. thermophilus demonstrated maximum expression during the stationary growth phase, with high...

  16. MeSH key terms for validation and annotation of gene expression clusters

    Energy Technology Data Exchange (ETDEWEB)

    Rechtsteiner, A. (Andreas); Rocha, L. M. (Luis Mateus)

    2004-01-01

    Integration of different sources of information is a great challenge for the analysis of gene expression data, and for the field of Functional Genomics in general. As the availability of numerical data from high-throughput methods increases, so does the need for technologies that assist in the validation and evaluation of the biological significance of results extracted from these data. In mRNA assaying with microarrays, for example, numerical analysis often attempts to identify clusters of co-expressed genes. The important task to find the biological significance of the results and validate them has so far mostly fallen to the biological expert who had to perform this task manually. One of the most promising avenues to develop automated and integrative technology for such tasks lies in the application of modern Information Retrieval (IR) and Knowledge Management (KM) algorithms to databases with biomedical publications and data. Examples of databases available for the field are bibliographic databases c ntaining scientific publications (e.g. MEDLINE/PUBMED), databases containing sequence data (e.g. GenBank) and databases of semantic annotations (e.g. the Gene Ontology Consortium and Medical Subject Headings (MeSH)). We present here an approach that uses the MeSH terms and their concept hierarchies to validate and obtain functional information for gene expression clusters. The controlled and hierarchical MeSH vocabulary is used by the National Library of Medicine (NLM) to index all the articles cited in MEDLINE. Such indexing with a controlled vocabulary eliminates some of the ambiguity due to polysemy (terms that have multiple meanings) and synonymy (multiple terms have similar meaning) that would be encountered if terms would be extracted directly from the articles due to differing article contexts or author preferences and background. Further, the hierarchical organization of the MeSH terms can illustrate the conceptuallfunctional relationships of genes

  17. The detection and phylogenetic analysis of the alkane 1-monooxygenase gene of members of the genus Rhodococcus.

    Science.gov (United States)

    Táncsics, András; Benedek, Tibor; Szoboszlay, Sándor; Veres, Péter G; Farkas, Milán; Máthé, István; Márialigeti, Károly; Kukolya, József; Lányi, Szabolcs; Kriszt, Balázs

    2015-02-01

    Naturally occurring and anthropogenic petroleum hydrocarbons are potential carbon sources for many bacteria. The AlkB-related alkane hydroxylases, which are integral membrane non-heme iron enzymes, play a key role in the microbial degradation of many of these hydrocarbons. Several members of the genus Rhodococcus are well-known alkane degraders and are known to harbor multiple alkB genes encoding for different alkane 1-monooxygenases. In the present study, 48 Rhodococcus strains, representing 35 species of the genus, were investigated to find out whether there was a dominant type of alkB gene widespread among species of the genus that could be used as a phylogenetic marker. Phylogenetic analysis of rhodococcal alkB gene sequences indicated that a certain type of alkB gene was present in almost every member of the genus Rhodococcus. These alkB genes were common in a unique nucleotide sequence stretch absent from other types of rhodococcal alkB genes that encoded a conserved amino acid motif: WLG(I/V/L)D(G/D)GL. The sequence identity of the targeted alkB gene in Rhodococcus ranged from 78.5 to 99.2% and showed higher nucleotide sequence variation at the inter-species level compared to the 16S rRNA gene (93.9-99.8%). The results indicated that the alkB gene type investigated might be applicable for: (i) differentiating closely related Rhodococcus species, (ii) properly assigning environmental isolates to existing Rhodococcus species, and finally (iii) assessing whether a new Rhodococcus isolate represents a novel species of the genus. Copyright © 2014 Elsevier GmbH. All rights reserved.

  18. Rearranged Biosynthetic Gene Cluster and Synthesis of Hassallidin E in Planktothrix serta PCC 8927.

    Science.gov (United States)

    Pancrace, Claire; Jokela, Jouni; Sassoon, Nathalie; Ganneau, Christelle; Desnos-Ollivier, Marie; Wahlsten, Matti; Humisto, Anu; Calteau, Alexandra; Bay, Sylvie; Fewer, David P; Sivonen, Kaarina; Gugger, Muriel

    2017-07-21

    Cyanobacteria produce a wide range of natural products with antifungal bioactivity. The cyclic glycosylated lipopeptides of the hassallidin family have potent antifungal activity and display a great degree of chemical diversity. Here, we report the discovery of a hassallidin biosynthetic gene cluster from the filamentous cyanobacterium Planktothrix serta PCC 8927. The hassallidin gene cluster showed heavy rearrangement and marks of genomic plasticity. Nucleotide bias, differences in GC content, and phylogenetic incongruence suggested the acquisition of the hassallidin biosynthetic gene cluster in Planktothrix serta PCC 8927 by horizontal gene transfer. Chemical analyses by liquid chromatography and mass spectrometry demonstrated that this strain produced hassallidin E, a new glycosylated hassallidin variant. Hassallidin E was the only structural variant produced by Planktothrix serta PCC 8927 in all tested conditions. Further evaluated on human pathogenic fungi, hassallidin E showed an antifungal bioactivity. Hassallidin production levels correlated with nitrogen availability, in the only nitrogen-fixing Planktothrix described so far. Our results provide insights into the distribution and chemical diversity of cyanobacterial antifungal compounds as well as raise questions on their ecological relevance.

  19. Gene expression analyses in individual grape (Vitis vinifera L.) berries during ripening initiation reveal that pigmentation intensity is a valid indicator of developmental staging within the cluster.

    Science.gov (United States)

    Lund, Steven T; Peng, Fred Y; Nayar, Tarun; Reid, Karen E; Schlosser, James

    2008-10-01

    Asynchronous ripening of individual grape berries within clusters can lead to inconsistent organoleptic characteristics for wine making. Ripening initiation in grape berries is non-climacteric and not well understood at the molecular level. Evidence is lacking for a single master switch controlling this process, such as the established role for ethylene in climacteric fruit ripening. We used Affymetrix microarray analyses of 32 individual Vitis vinifera cv. Cabernet Sauvignon berries sampled from two clusters at 50% ripening initiation. By delineating four developmental stages of ripening initiation, we demonstrate that pigmentation is a statistically significant indicator of transcriptional state during ripening initiation. We report on clustered gene expression patterns which were mined for genes annotated with signal transduction functions in order to advance regulatory network modeling of ripening initiation in grape berries. Abscisic acid has previously been demonstrated to be an important signaling component regulating ripening initiation in grapevine. We demonstrate via real-time RT-PCR analyses that up-regulation of a 9-cis-epoxycarotenoid gene family member, VvNCED2, in grape seed and pericarp and a putative ortholog to a reported abscisic acid receptor, VvGCR2, are correlated with ripening initiation. Our results suggest a role for these genes in abscisic acid signaling during ripening initiation.

  20. Genomic organization, tissue distribution and functional characterization of the rat Pate gene cluster.

    Directory of Open Access Journals (Sweden)

    Angireddy Rajesh

    Full Text Available The cysteine rich prostate and testis expressed (Pate proteins identified till date are thought to resemble the three fingered protein/urokinase-type plasminogen activator receptor proteins. In this study, for the first time, we report the identification, cloning and characterization of rat Pate gene cluster and also determine the expression pattern. The rat Pate genes are clustered on chromosome 8 and their predicted proteins retained the ten cysteine signature characteristic to TFP/Ly-6 protein family. PATE and PATE-F three dimensional protein structure was found to be similar to that of the toxin bucandin. Though Pate gene expression is thought to be prostate and testis specific, we observed that rat Pate genes are also expressed in seminal vesicle and epididymis and in tissues beyond the male reproductive tract. In the developing rats (20-60 day old, expression of Pate genes seem to be androgen dependent in the epididymis and testis. In the adult rat, androgen ablation resulted in down regulation of the majority of Pate genes in the epididymides. PATE and PATE-F proteins were found to be expressed abundantly in the male reproductive tract of rats and on the sperm. Recombinant PATE protein exhibited potent antibacterial activity, whereas PATE-F did not exhibit any antibacterial activity. Pate expression was induced in the epididymides when challenged with LPS. Based on our results, we conclude that rat PATE proteins may contribute to the reproductive and defense functions.

  1. Conserved gene clusters in bacterial genomes provide further support for the primacy of RNA

    Science.gov (United States)

    Siefert, J. L.; Martin, K. A.; Abdi, F.; Widger, W. R.; Fox, G. E.

    1997-01-01

    Five complete bacterial genome sequences have been released to the scientific community. These include four (eu)Bacteria, Haemophilus influenzae, Mycoplasma genitalium, M. pneumoniae, and Synechocystis PCC 6803, as well as one Archaeon, Methanococcus jannaschii. Features of organization shared by these genomes are likely to have arisen very early in the history of the bacteria and thus can be expected to provide further insight into the nature of early ancestors. Results of a genome comparison of these five organisms confirm earlier observations that gene order is remarkably unpreserved. There are, nevertheless, at least 16 clusters of two or more genes whose order remains the same among the four (eu)Bacteria and these are presumed to reflect conserved elements of coordinated gene expression that require gene proximity. Eight of these gene orders are essentially conserved in the Archaea as well. Many of these clusters are known to be regulated by RNA-level mechanisms in Escherichia coli, which supports the earlier suggestion that this type of regulation of gene expression may have arisen very early. We conclude that although the last common ancestor may have had a DNA genome, it likely was preceded by progenotes with an RNA genome.

  2. QServer: a biclustering server for prediction and assessment of co-expressed gene clusters.

    Directory of Open Access Journals (Sweden)

    Fengfeng Zhou

    Full Text Available BACKGROUND: Biclustering is a powerful technique for identification of co-expressed gene groups under any (unspecified substantial subset of given experimental conditions, which can be used for elucidation of transcriptionally co-regulated genes. RESULTS: We have previously developed a biclustering algorithm, QUBIC, which can solve more general biclustering problems than previous biclustering algorithms. To fully utilize the analysis power the algorithm provides, we have developed a web server, QServer, for prediction, computational validation and analyses of co-expressed gene clusters. Specifically, the QServer has the following capabilities in addition to biclustering by QUBIC: (i prediction and assessment of conserved cis regulatory motifs in promoter sequences of the predicted co-expressed genes; (ii functional enrichment analyses of the predicted co-expressed gene clusters using Gene Ontology (GO terms, and (iii visualization capabilities in support of interactive biclustering analyses. QServer supports the biclustering and functional analysis for a wide range of organisms, including human, mouse, Arabidopsis, bacteria and archaea, whose underlying genome database will be continuously updated. CONCLUSION: We believe that QServer provides an easy-to-use and highly effective platform useful for hypothesis formulation and testing related to transcription co-regulation.

  3. QServer: a biclustering server for prediction and assessment of co-expressed gene clusters.

    Science.gov (United States)

    Zhou, Fengfeng; Ma, Qin; Li, Guojun; Xu, Ying

    2012-01-01

    Biclustering is a powerful technique for identification of co-expressed gene groups under any (unspecified) substantial subset of given experimental conditions, which can be used for elucidation of transcriptionally co-regulated genes. We have previously developed a biclustering algorithm, QUBIC, which can solve more general biclustering problems than previous biclustering algorithms. To fully utilize the analysis power the algorithm provides, we have developed a web server, QServer, for prediction, computational validation and analyses of co-expressed gene clusters. Specifically, the QServer has the following capabilities in addition to biclustering by QUBIC: (i) prediction and assessment of conserved cis regulatory motifs in promoter sequences of the predicted co-expressed genes; (ii) functional enrichment analyses of the predicted co-expressed gene clusters using Gene Ontology (GO) terms, and (iii) visualization capabilities in support of interactive biclustering analyses. QServer supports the biclustering and functional analysis for a wide range of organisms, including human, mouse, Arabidopsis, bacteria and archaea, whose underlying genome database will be continuously updated. We believe that QServer provides an easy-to-use and highly effective platform useful for hypothesis formulation and testing related to transcription co-regulation.

  4. Spatial expression of Hox cluster genes in the ontogeny of a sea urchin

    Science.gov (United States)

    Arenas-Mena, C.; Cameron, A. R.; Davidson, E. H.

    2000-01-01

    The Hox cluster of the sea urchin Strongylocentrous purpuratus contains ten genes in a 500 kb span of the genome. Only two of these genes are expressed during embryogenesis, while all of eight genes tested are expressed during development of the adult body plan in the larval stage. We report the spatial expression during larval development of the five 'posterior' genes of the cluster: SpHox7, SpHox8, SpHox9/10, SpHox11/13a and SpHox11/13b. The five genes exhibit a dynamic, largely mesodermal program of expression. Only SpHox7 displays extensive expression within the pentameral rudiment itself. A spatially sequential and colinear arrangement of expression domains is found in the somatocoels, the paired posterior mesodermal structures that will become the adult perivisceral coeloms. No such sequential expression pattern is observed in endodermal, epidermal or neural tissues of either the larva or the presumptive juvenile sea urchin. The spatial expression patterns of the Hox genes illuminate the evolutionary process by which the pentameral echinoderm body plan emerged from a bilateral ancestor.

  5. Scavenger receptor class A member 5 (SCARA5) and suprabasin (SBSN) are hub genes of coexpression network modules associated with peripheral vein graft patency.

    Science.gov (United States)

    Kenagy, Richard D; Civelek, Mete; Kikuchi, Shinsuke; Chen, Lihua; Grieff, Anthony; Sobel, Michael; Lusis, Aldons J; Clowes, Alexander W

    2016-07-01

    Approximately 30% of autogenous vein grafts develop luminal narrowing and fail because of intimal hyperplasia or negative remodeling. We previously found that vein graft cells from patients who later develop stenosis proliferate more in vitro in response to growth factors than cells from patients who maintain patent grafts. To discover novel determinants of vein graft outcome, we have analyzed gene expression profiles of these cells using a systems biology approach to cluster the genes into modules by their coexpression patterns and to correlate the results with growth data from our prior study and with new studies of migration and matrix remodeling. RNA from 4-hour serum- or platelet-derived growth factor (PDGF)-BB-stimulated human saphenous vein cells obtained from the outer vein wall (20 cell lines) was used for microarray analysis of gene expression, followed by weighted gene coexpression network analysis. Cell migration in microchemotaxis chambers in response to PDGF-BB and cell-mediated collagen gel contraction in response to serum were also determined. Gene function was determined using short-interfering RNA to inhibit gene expression before subjecting cells to growth or collagen gel contraction assays. These cells were derived from samples of the vein grafts obtained at surgery, and the long-term fate of these bypass grafts was known. Neither migration nor cell-mediated collagen gel contraction showed a correlation with graft outcome. Although 1188 and 1340 genes were differentially expressed in response to treatment with serum and PDGF, respectively, no single gene was differentially expressed in cells isolated from patients whose grafts stenosed compared with those that remained patent. Network analysis revealed four unique groups of genes, which we term modules, associated with PDGF responses, and 20 unique modules associated with serum responses. The "yellow" and "skyblue" modules, from PDGF and serum analyses, respectively, correlated with later graft

  6. Genome Sequences of Mycobacteriophages AlanGrant, Baee, Corofin, OrangeOswald, and Vincenzo, New Members of Cluster B

    Science.gov (United States)

    Carbonara, Maria E.; Cioffi, Hanna M.; Cruz, Tyler; Dang, Brian Q.; Doyle, Alexander N.; Fan, Olivia H.; Gallagher, Molly; Gentile, Gabrielle M.; German, Brian A.; Farrell, Margaret E.; Gerwig, Madeline; Hunter, Kelsey L.; Lefever, Virginia E.; Marfisi, Nicole A.; McDonnell, Jill E.; Monga, Jappmann K.; Quiroz, Kevin G.; Pong, Alexandra C.; Rimple, Patrick A.; Situ, Michelle; Sohnen, Peri C.; Stockinger, Annmarie N.; Thompson, Paige K.; Torchio, Nicole M.; Toner, Chelsea L.; Ulbrich, Megan C.; Vohra, Neelam I.; Zakir, Aala; Adkins, Nancy L.; Brown, Bryony R.; Churilla, Bryce M.; Kramer, Zachary J.; Lapin, Jonathan S.; Montgomery, Matthew T.; Prout, Ashley K.; Grubb, Sarah R.; Warner, Marcie H.; Bowman, Charles A.; Russell, Daniel A.; Hatfull, Graham F.

    2015-01-01

    AlanGrant, Baee, Corofin, OrangeOswald, and Vincenzo are newly isolated phages of Mycobacterium smegmatis mc2155 discovered in Pittsburgh, Pennsylvania, USA. All five phages share nucleotide similarity with cluster B mycobacteriophages but span considerable diversity with Corofin and OrangeOswald in subcluster B3, AlanGrant and Vincenzo in subcluster B4, and Baee in subcluster B5. PMID:26089409

  7. Multi-class clustering of cancer subtypes through SVM based ensemble of pareto-optimal solutions for gene marker identification.

    Science.gov (United States)

    Mukhopadhyay, Anirban; Bandyopadhyay, Sanghamitra; Maulik, Ujjwal

    2010-11-12

    With the advancement of microarray technology, it is now possible to study the expression profiles of thousands of genes across different experimental conditions or tissue samples simultaneously. Microarray cancer datasets, organized as samples versus genes fashion, are being used for classification of tissue samples into benign and malignant or their subtypes. They are also useful for identifying potential gene markers for each cancer subtype, which helps in successful diagnosis of particular cancer types. In this article, we have presented an unsupervised cancer classification technique based on multiobjective genetic clustering of the tissue samples. In this regard, a real-coded encoding of the cluster centers is used and cluster compactness and separation are simultaneously optimized. The resultant set of near-Pareto-optimal solutions contains a number of non-dominated solutions. A novel approach to combine the clustering information possessed by the non-dominated solutions through Support Vector Machine (SVM) classifier has been proposed. Final clustering is obtained by consensus among the clusterings yielded by different kernel functions. The performance of the proposed multiobjective clustering method has been compared with that of several other microarray clustering algorithms for three publicly available benchmark cancer datasets. Moreover, statistical significance tests have been conducted to establish the statistical superiority of the proposed clustering method. Furthermore, relevant gene markers have been identified using the clustering result produced by the proposed clustering method and demonstrated visually. Biological relationships among the gene markers are also studied based on gene ontology. The results obtained are found to be promising and can possibly have important impact in the area of unsupervised cancer classification as well as gene marker identification for multiple cancer subtypes.

  8. The Identification of Intrinsic Chloramphenicol and Tetracycline Resistance Genes in Members of the Bacillus cereus Group (sensu lato)

    Science.gov (United States)

    Glenwright, Helen; Pohl, Susanne; Navarro, Ferran; Miro, Elisenda; Jiménez, Guillermo; Blanch, Anicet R.; Harwood, Colin R.

    2017-01-01

    Bacillus toyonensis strain BCT-7112T (NCIMB 14858T) has been widely used as an additive in animal nutrition for more than 30 years without reports of adverse toxigenic effects. However, this strain is resistant to chloramphenicol and tetracycline and it is generally considered inadvisable to introduce into the food chain resistance determinants capable of being transferred to other bacterial strains, thereby adding to the pool of such determinants in the gastro-enteric systems of livestock species. We therefore characterized the resistance phenotypes of this strain and its close relatives to determine whether they were of recent origin, and therefore likely to be transmissible. To this end we identified the genes responsible for chloramphenicol (catQ) and tetracycline (tetM) resistance and confirmed the presence of homologs in other members of the B. toyonensis taxonomic unit. Unexpectedly, closely related strains encoding these genes did not exhibit chloramphenicol and tetracycline resistance phenotypes. To understand the differences in the behaviors, we cloned and expressed the genes, together with their upstream regulatory regions, into Bacillus subtilis. The data showed that the genes encoded functional proteins, but were expressed inefficiently from their native promoters. B. toyonensis is a taxonomic unit member of the Bacillus cereus group (sensu lato). We therefore extended the analysis to determine the extent to which homologous chloramphenicol and tetracycline resistance genes were present in other species within this group. This analysis revealed that homologous genes were present in nearly all representative species within the B. cereus group (sensu lato). The absence of known transposition elements and the observations that they are found at the same genomic locations, indicates that these chloramphenicol and tetracycline resistance genes are of ancient origin and intrinsic to this taxonomic group, rather than recent acquisitions. In this context we

  9. Natural product proteomining, a quantitative proteomics platform, allows rapid discovery of biosynthetic gene clusters for different classes of natural products.

    Science.gov (United States)

    Gubbens, Jacob; Zhu, Hua; Girard, Geneviève; Song, Lijiang; Florea, Bogdan I; Aston, Philip; Ichinose, Koji; Filippov, Dmitri V; Choi, Young H; Overkleeft, Herman S; Challis, Gregory L; van Wezel, Gilles P

    2014-06-19

    Information on gene clusters for natural product biosynthesis is accumulating rapidly because of the current boom of available genome sequencing data. However, linking a natural product to a specific gene cluster remains challenging. Here, we present a widely applicable strategy for the identification of gene clusters for specific natural products, which we name natural product proteomining. The method is based on using fluctuating growth conditions that ensure differential biosynthesis of the bioactivity of interest. Subsequent combination of metabolomics and quantitative proteomics establishes correlations between abundance of natural products and concomitant changes in the protein pool, which allows identification of the relevant biosynthetic gene cluster. We used this approach to elucidate gene clusters for different natural products in Bacillus and Streptomyces, including a novel juglomycin-type antibiotic. Natural product proteomining does not require prior knowledge of the gene cluster or secondary metabolite and therefore represents a general strategy for identification of all types of gene clusters. Copyright © 2014 Elsevier Ltd. All rights reserved.

  10. Lichen Biosynthetic Gene Clusters. Part I. Genome Sequencing Reveals a Rich Biosynthetic Potential.

    Science.gov (United States)

    Bertrand, Robert L; Abdel-Hameed, Mona; Sorensen, John L

    2018-02-27

    Lichens are symbionts of fungi and algae that produce diverse secondary metabolites with useful properties. Little is known of lichen natural product biosynthesis because of the challenges of working with lichenizing fungi. We describe the first attempt to comprehensively profile the genetic secondary metabolome of a lichenizing fungus. An Illumina platform combined with the Antibiotics and Secondary Metabolites Analysis Shell (FungiSMASH, version 4.0) was used to sequence and annotate assembled contigs of the fungal partner of Cladonia uncialis. Up to 48 putative gene clusters are described comprising type I and type III polyketide synthases (PKS), nonribosomal peptide synthetases (NRPS), hybrid PKS-NRPS, and terpene synthases. The number of gene clusters revealed by this work dwarfs the number of known secondary metabolites from C. uncialis, suggesting that lichenizing fungi have an unexplored biosynthetic potential.

  11. Hierarchical clustering of breast cancer methylomes revealed differentially methylated and expressed breast cancer genes.

    Directory of Open Access Journals (Sweden)

    I-Hsuan Lin

    Full Text Available Oncogenic transformation of normal cells often involves epigenetic alterations, including histone modification and DNA methylation. We conducted whole-genome bisulfite sequencing to determine the DNA methylomes of normal breast, fibroadenoma, invasive ductal carcinomas and MCF7. The emergence, disappearance, expansion and contraction of kilobase-sized hypomethylated regions (HMRs and the hypomethylation of the megabase-sized partially methylated domains (PMDs are the major forms of methylation changes observed in breast tumor samples. Hierarchical clustering of HMR revealed tumor-specific hypermethylated clusters and differential methylated enhancers specific to normal or breast cancer cell lines. Joint analysis of gene expression and DNA methylation data of normal breast and breast cancer cells identified differentially methylated and expressed genes associated with breast and/or ovarian cancers in cancer-specific HMR clusters. Furthermore, aberrant patterns of X-chromosome inactivation (XCI was found in breast cancer cell lines as well as breast tumor samples in the TCGA BRCA (breast invasive carcinoma dataset. They were characterized with differentially hypermethylated XIST promoter, reduced expression of XIST, and over-expression of hypomethylated X-linked genes. High expressions of these genes were significantly associated with lower survival rates in breast cancer patients. Comprehensive analysis of the normal and breast tumor methylomes suggests selective targeting of DNA methylation changes during breast cancer progression. The weak causal relationship between DNA methylation and gene expression observed in this study is evident of more complex role of DNA methylation in the regulation of gene expression in human epigenetics that deserves further investigation.

  12. Molecular analysis of an inactive aflatoxin biosynthesis gene cluster in Aspergillus oryzae RIB strains.

    Science.gov (United States)

    Tominaga, Mihoko; Lee, Yun-Hae; Hayashi, Risa; Suzuki, Yuji; Yamada, Osamu; Sakamoto, Kazutoshi; Gotoh, Kuniyasu; Akita, Osamu

    2006-01-01

    To help assess the potential for aflatoxin production by Aspergillus oryzae, the structure of an aflatoxin biosynthesis gene homolog cluster in A. oryzae RIB 40 was analyzed. Although most genes in the corresponding cluster exhibited from 97 to 99% similarity to those of Aspergillus flavus, three genes shared 93% similarity or less. A 257-bp deletion in the aflT region, a frameshift mutation in norA, and a base pair substitution in verA were found in A. oryzae RIB 40. In the aflR promoter, two substitutions were found in one of the three putative AreA binding sites and in the FacB binding site. PCR primers were designed to amplify homologs of aflT, nor-1, aflR, norA, avnA, verB, and vbs and were used to detect these genes in 210 A. oryzae strains. Based on the PCR results, the A. oryzae RIB strains were classified into three groups, although most of them fell into two of the groups. Group 1, in which amplification of all seven genes was confirmed, contained 122 RIB strains (58.1% of examined strains), including RIB 40. Seventy-seven strains (36.7%) belonged to group 2, characterized by having only vbs, verB, and avnA in half of the cluster. Although slight expression of aflR was detected by reverse transcription-PCR in some group 1 strains, including RIB 40, other genes (avnA, vbs, verB, and omtA) related to aflatoxin production were not detected. aflR was not detected in group 2 strains by Southern analysis.

  13. Gene cluster analysis for the biosynthesis of elgicins, novel lantibiotics produced by paenibacillus elgii B69

    Directory of Open Access Journals (Sweden)

    Teng Yi

    2012-03-01

    Full Text Available Abstract Background The recent increase in bacterial resistance to antibiotics has promoted the exploration of novel antibacterial materials. As a result, many researchers are undertaking work to identify new lantibiotics because of their potent antimicrobial activities. The objective of this study was to provide details of a lantibiotic-like gene cluster in Paenibacillus elgii B69 and to produce the antibacterial substances coded by this gene cluster based on culture screening. Results Analysis of the P. elgii B69 genome sequence revealed the presence of a lantibiotic-like gene cluster composed of five open reading frames (elgT1, elgC, elgT2, elgB, and elgA. Screening of culture extracts for active substances possessing the predicted properties of the encoded product led to the isolation of four novel peptides (elgicins AI, AII, B, and C with a broad inhibitory spectrum. The molecular weights of these peptides were 4536, 4593, 4706, and 4820 Da, respectively. The N-terminal sequence of elgicin B was Leu-Gly-Asp-Tyr, which corresponded to the partial sequence of the peptide ElgA encoded by elgA. Edman degradation suggested that the product elgicin B is derived from ElgA. By correlating the results of electrospray ionization-mass spectrometry analyses of elgicins AI, AII, and C, these peptides are deduced to have originated from the same precursor, ElgA. Conclusions A novel lantibiotic-like gene cluster was shown to be present in P. elgii B69. Four new lantibiotics with a broad inhibitory spectrum were isolated, and these appear to be promising antibacterial agents.

  14. Gene Clusters for Insecticidal Loline Alkaloids in the Grass-Endophytic Fungus Neotyphodium uncinatum

    OpenAIRE

    Spiering, Martin J.; Moon, Christina D.; Wilkinson, Heather H.; Schardl, Christopher L.

    2005-01-01

    Loline alkaloids are produced by mutualistic fungi symbiotic with grasses, and they protect the host plants from insects. Here we identify in the fungal symbiont, Neotyphodium uncinatum, two homologous gene clusters (LOL-1 and LOL-2) associated with loline-alkaloid production. Nine genes were identified in a 25-kb region of LOL-1 and designated (in order) lolF-1, lolC-1, lolD-1, lolO-1, lolA-1, lolU-1, lolP-1, lolT-1, and lolE-1. LOL-2 contained the homologs lolC-2 through lolE-2 in the same ...

  15. Comparison of Expression of Secondary Metabolite Biosynthesis Cluster Genes in Aspergillus flavus, A. parasiticus, and A. oryzae

    Directory of Open Access Journals (Sweden)

    Kenneth C. Ehrlich

    2014-06-01

    Full Text Available Fifty six secondary metabolite biosynthesis gene clusters are predicted to be in the Aspergillus flavus genome. In spite of this, the biosyntheses of only seven metabolites, including the aflatoxins, kojic acid, cyclopiazonic acid and aflatrem, have been assigned to a particular gene cluster. We used RNA-seq to compare expression of secondary metabolite genes in gene clusters for the closely related fungi A. parasiticus, A. oryzae, and A. flavus S and L sclerotial morphotypes. The data help to refine the identification of probable functional gene clusters within these species. Our results suggest that A. flavus, a prevalent contaminant of maize, cottonseed, peanuts and tree nuts, is capable of producing metabolites which, besides aflatoxin, could be an underappreciated contributor to its toxicity.

  16. Comparison of Expression of Secondary Metabolite Biosynthesis Cluster Genes in Aspergillus flavus, A. parasiticus, and A. oryzae

    Science.gov (United States)

    Ehrlich, Kenneth C.; Mack, Brian M.

    2014-01-01

    Fifty six secondary metabolite biosynthesis gene clusters are predicted to be in the Aspergillus flavus genome. In spite of this, the biosyntheses of only seven metabolites, including the aflatoxins, kojic acid, cyclopiazonic acid and aflatrem, have been assigned to a particular gene cluster. We used RNA-seq to compare expression of secondary metabolite genes in gene clusters for the closely related fungi A. parasiticus, A. oryzae, and A. flavus S and L sclerotial morphotypes. The data help to refine the identification of probable functional gene clusters within these species. Our results suggest that A. flavus, a prevalent contaminant of maize, cottonseed, peanuts and tree nuts, is capable of producing metabolites which, besides aflatoxin, could be an underappreciated contributor to its toxicity. PMID:24960201

  17. Characterization of the biosynthetic gene cluster for cryptic phthoxazolin A in Streptomyces avermitilis.

    Directory of Open Access Journals (Sweden)

    Dian Anggraini Suroto

    Full Text Available Phthoxazolin A, an oxazole-containing polyketide, has a broad spectrum of anti-oomycete activity and herbicidal activity. We recently identified phthoxazolin A as a cryptic metabolite of Streptomyces avermitilis that produces the important anthelmintic agent avermectin. Even though genome data of S. avermitilis is publicly available, no plausible biosynthetic gene cluster for phthoxazolin A is apparent in the sequence data. Here, we identified and characterized the phthoxazolin A (ptx biosynthetic gene cluster through genome sequencing, comparative genomic analysis, and gene disruption. Sequence analysis uncovered that the putative ptx biosynthetic genes are laid on an extra genomic region that is not found in the public database, and 8 open reading frames in the extra genomic region could be assigned roles in the biosynthesis of the oxazole ring, triene polyketide and carbamoyl moieties. Disruption of the ptxA gene encoding a discrete acyltransferase resulted in a complete loss of phthoxazolin A production, confirming that the trans-AT type I PKS system is responsible for the phthoxazolin A biosynthesis. Based on the predicted functional domains in the ptx assembly line, we propose the biosynthetic pathway of phthoxazolin A.

  18. Clustering gene expression time series data using an infinite Gaussian process mixture model.

    Science.gov (United States)

    McDowell, Ian C; Manandhar, Dinesh; Vockley, Christopher M; Schmid, Amy K; Reddy, Timothy E; Engelhardt, Barbara E

    2018-01-01

    Transcriptome-wide time series expression profiling is used to characterize the cellular response to environmental perturbations. The first step to analyzing transcriptional response data is often to cluster genes with similar responses. Here, we present a nonparametric model-based method, Dirichlet process Gaussian process mixture model (DPGP), which jointly models data clusters with a Dirichlet process and temporal dependencies with Gaussian processes. We demonstrate the accuracy of DPGP in comparison to state-of-the-art approaches using hundreds of simulated data sets. To further test our method, we apply DPGP to published microarray data from a microbial model organism exposed to stress and to novel RNA-seq data from a human cell line exposed to the glucocorticoid dexamethasone. We validate our clusters by examining local transcription factor binding and histone modifications. Our results demonstrate that jointly modeling cluster number and temporal dependencies can reveal shared regulatory mechanisms. DPGP software is freely available online at https://github.com/PrincetonUniversity/DP_GP_cluster.

  19. Clustering gene expression time series data using an infinite Gaussian process mixture model.

    Directory of Open Access Journals (Sweden)

    Ian C McDowell

    2018-01-01

    Full Text Available Transcriptome-wide time series expression profiling is used to characterize the cellular response to environmental perturbations. The first step to analyzing transcriptional response data is often to cluster genes with similar responses. Here, we present a nonparametric model-based method, Dirichlet process Gaussian process mixture model (DPGP, which jointly models data clusters with a Dirichlet process and temporal dependencies with Gaussian processes. We demonstrate the accuracy of DPGP in comparison to state-of-the-art approaches using hundreds of simulated data sets. To further test our method, we apply DPGP to published microarray data from a microbial model organism exposed to stress and to novel RNA-seq data from a human cell line exposed to the glucocorticoid dexamethasone. We validate our clusters by examining local transcription factor binding and histone modifications. Our results demonstrate that jointly modeling cluster number and temporal dependencies can reveal shared regulatory mechanisms. DPGP software is freely available online at https://github.com/PrincetonUniversity/DP_GP_cluster.

  20. Burkholderia thailandensis harbors two identical rhl gene clusters responsible for the biosynthesis of rhamnolipids

    Directory of Open Access Journals (Sweden)

    Woods Donald E

    2009-12-01

    Full Text Available Abstract Background Rhamnolipids are surface active molecules composed of rhamnose and β-hydroxydecanoic acid. These biosurfactants are produced mainly by Pseudomonas aeruginosa and have been thoroughly investigated since their early discovery. Recently, they have attracted renewed attention because of their involvement in various multicellular behaviors. Despite this high interest, only very few studies have focused on the production of rhamnolipids by Burkholderia species. Results Orthologs of rhlA, rhlB and rhlC, which are responsible for the biosynthesis of rhamnolipids in P. aeruginosa, have been found in the non-infectious Burkholderia thailandensis, as well as in the genetically similar important pathogen B. pseudomallei. In contrast to P. aeruginosa, both Burkholderia species contain these three genes necessary for rhamnolipid production within a single gene cluster. Furthermore, two identical, paralogous copies of this gene cluster are found on the second chromosome of these bacteria. Both Burkholderia spp. produce rhamnolipids containing 3-hydroxy fatty acid moieties with longer side chains than those described for P. aeruginosa. Additionally, the rhamnolipids produced by B. thailandensis contain a much larger proportion of dirhamnolipids versus monorhamnolipids when compared to P. aeruginosa. The rhamnolipids produced by B. thailandensis reduce the surface tension of water to 42 mN/m while displaying a critical micelle concentration value of 225 mg/L. Separate mutations in both rhlA alleles, which are responsible for the synthesis of the rhamnolipid precursor 3-(3-hydroxyalkanoyloxyalkanoic acid, prove that both copies of the rhl gene cluster are functional, but one contributes more to the total production than the other. Finally, a double ΔrhlA mutant that is completely devoid of rhamnolipid production is incapable of swarming motility, showing that both gene clusters contribute to this phenotype. Conclusions Collectively, these

  1. CLASH: Extending galaxy strong lensing to small physical scales with distant sources highly magnified by galaxy cluster members

    International Nuclear Information System (INIS)

    Grillo, C.; Christensen, L.; Gobat, R.; Presotto, V.; Balestra, I.; Nonino, M.; Biviano, A.; Mercurio, A.; Rosati, P.; Vanzella, E.; Graves, G.; Lemze, D.; Ford, H.; Bartelmann, M.; Benitez, N.; Bouwens, R.; Bradley, L.; Coe, D.; Broadhurst, T.; Donahue, M.

    2014-01-01

    We present a complex strong lensing system in which a double source is imaged five times by two early-type galaxies. We take advantage in this target of the extraordinary multi-band photometric data set obtained as part of the Cluster Lensing And Supernova survey with Hubble (CLASH) program, complemented by the spectroscopic measurements of the VLT/VIMOS and FORS2 follow-up campaign. We use a photometric redshift value of 3.7 for the source and confirm spectroscopically the membership of the two lenses to the galaxy cluster MACS J1206.2–0847 at redshift 0.44. We exploit the excellent angular resolution of the HST/ACS images to model the two lenses in terms of singular isothermal sphere profiles and derive robust effective velocity dispersion values of 97 ± 3 and 240 ± 6 km s –1 . Interestingly, the total mass distribution of the cluster is also well characterized by using only the local information contained in this lensing system, which is located at a projected distance of more than 300 kpc from the cluster luminosity center. According to our best-fitting lensing and composite stellar population models, the source is magnified by a total factor of 50 and has a luminous mass of approximately (1.0 ± 0.5) × 10 9 M ☉ (assuming a Salpeter stellar initial mass function). By combining the total and luminous mass estimates of the two lenses, we measure luminous over total mass fractions projected within the effective radii of 0.51 ± 0.21 and 0.80 ± 0.32. Remarkably, with these lenses we can extend the analysis of the mass properties of lens early-type galaxies by factors that are approximately two and three times smaller than previously done with regard to, respectively, velocity dispersion and luminous mass. The comparison of the total and luminous quantities of our lenses with those of astrophysical objects with different physical scales, like massive early-type galaxies and dwarf spheroidals, reveals the potential of studies of this kind for improving our

  2. Loss of Major DNase I Hypersensitive Sites in Duplicatedglobin Gene Cluster Incompletely Silences HBB Gene Expression

    Czech Academy of Sciences Publication Activity Database

    Reading, N. S.; Shooter, C.; Song, J.; Miller, R.; Agarwal, A.; Láníková, Lucie; Clark, B.; Thein, S.L.; Divoký, V.; Prchal, J.T.

    2016-01-01

    Roč. 37, č. 11 (2016), s. 1153-1156 ISSN 1059-7794 R&D Projects: GA MŠk(CZ) LH15223 Institutional support: RVO:68378050 Keywords : globin gene s * regulation * sickle cell disease * HBB duplication Subject RIV: EB - Gene tics ; Molecular Biology Impact factor: 4.601, year: 2016

  3. Functional dissection of HOXD cluster genes in regulation of neuroblastoma cell proliferation and differentiation.

    Directory of Open Access Journals (Sweden)

    Yunhong Zha

    Full Text Available Retinoic acid (RA can induce growth arrest and neuronal differentiation of neuroblastoma cells and has been used in clinic for treatment of neuroblastoma. It has been reported that RA induces the expression of several HOXD genes in human neuroblastoma cell lines, but their roles in RA action are largely unknown. The HOXD cluster contains nine genes (HOXD1, HOXD3, HOXD4, and HOXD8-13 that are positioned sequentially from 3' to 5', with HOXD1 at the 3' end and HOXD13 the 5' end. Here we show that all HOXD genes are induced by RA in the human neuroblastoma BE(2-C cells, with the genes located at the 3' end being activated generally earlier than those positioned more 5' within the cluster. Individual induction of HOXD8, HOXD9, HOXD10 or HOXD12 is sufficient to induce both growth arrest and neuronal differentiation, which is associated with downregulation of cell cycle-promoting genes and upregulation of neuronal differentiation genes. However, induction of other HOXD genes either has no effect (HOXD1 or has partial effects (HOXD3, HOXD4, HOXD11 and HOXD13 on BE(2-C cell proliferation or differentiation. We further show that knockdown of HOXD8 expression, but not that of HOXD9 expression, significantly inhibits the differentiation-inducing activity of RA. HOXD8 directly activates the transcription of HOXC9, a key effector of RA action in neuroblastoma cells. These findings highlight the distinct functions of HOXD genes in RA induction of neuroblastoma cell differentiation.

  4. Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea

    Directory of Open Access Journals (Sweden)

    Wolf Yuri I

    2007-11-01

    Full Text Available Abstract Background An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt on such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs. Rapid accumulation of genome sequences creates opportunities for refining COGs but also represents a challenge because of error amplification. One of the practical strategies involves construction of refined COGs for phylogenetically compact subsets of genomes. Results New Archaeal Clusters of Orthologous Genes (arCOGs were constructed for 41 archaeal genomes (13 Crenarchaeota, 27 Euryarchaeota and one Nanoarchaeon using an improved procedure that employs a similarity tree between smaller, group-specific clusters, semi-automatically partitions orthology domains in multidomain proteins, and uses profile searches for identification of remote orthologs. The annotation of arCOGs is a consensus between three assignments based on the COGs, the CDD database, and the annotations of homologs in the NR database. The 7538 arCOGs, on average, cover ~88% of the genes in a genome compared to a ~76% coverage in COGs. The finer granularity of ortholog identification in the arCOGs is apparent from the fact that 4538 arCOGs correspond to 2362 COGs; ~40% of the arCOGs are new. The archaeal gene core (protein-coding genes found in all 41 genome consists of 166 arCOGs. The arCOGs were used to reconstruct gene loss and gene gain events during archaeal evolution and gene sets of ancestral forms. The Last Archaeal Common Ancestor (LACA is conservatively estimated to possess 996 genes compared to 1245 and 1335 genes for the last common ancestors of Crenarchaeota and Euryarchaeota, respectively. It is inferred that LACA was a chemoautotrophic hyperthermophile

  5. Using SNP genetic markers to elucidate the linkage of the Co-34/Phg-3 anthracnose and angular leaf spot resistance gene cluster with the Ur-14 resistance gene

    Science.gov (United States)

    The Ouro Negro common bean cultivar contains the Co-34/Phg-3 gene cluster that confers resistance to the anthracnose (ANT) and angular leaf spot (ALS) pathogens. These genes are tightly linked on chromosome 4. Ouro Negro also has the Ur-14 rust resistance gene, reportedly in the vicinity of Co- 34; ...

  6. Differential expression of TIR-like genes embedded in the M1-1 gene cluster in nematode-resistant and -susceptible tomato roots

    NARCIS (Netherlands)

    Seifi Abdolabad, A.R.; Visser, R.G.F.; Bai, Y.

    2011-01-01

    Transport inhibitor 1 (TIR1) is an auxin receptor that plays a pivotal role in auxin signaling. It has been reported that TIR-like genes are present in a gene cluster carrying the Mi-1 gene which confers resistance to nematodes, aphids and whiteflies. Since auxin is involved in the pathogenicity of

  7. AtSIG6 and other members of the sigma gene family jointly but differentially determine plastid target gene expression in Arabidopsis thaliana

    Directory of Open Access Journals (Sweden)

    Sylvia eBock

    2014-11-01

    Full Text Available Plants contain a nuclear gene family for plastid sigma factors, i.e. proteins that associate with the bacterial-type organellar RNA polymerase and confer the ability for correct promoter binding and transcription initiation. Questions that are still unresolved relate to the division of labour among members of the sigma family, both in terms of their range of target genes and their temporal and spatial activity during development. Clues to the in vivo role of individual sigma genes have mainly come from studies of sigma knockout lines. Despite its obvious strengths, however, this strategy does not necessarily trace-down causal relationships between mutant phenotype and a single sigma gene, if other family members act in a redundant and/or compensatory manner. We made efforts to reduce the complexity by genetic crosses of Arabidopsis single mutants (with focus on a chlorophyll-deficient sig6 line to generate double knockout lines. The latter typically had a similar visible phenotype as the parental lines, but tended to be more strongly affected in the transcript patterns of both plastid and sigma genes. Because triple mutants were lethal under our growth conditions, we exploited a strategy of transformation of single and double mutants with RNAi constructs that contained sequences from the unconserved sigma region (UCR. These RNAi/knockout lines phenotypically resembled their parental lines, but were even more strongly affected in their plastid transcript patterns. Expression patterns of sigma genes revealed both similarities and differences compared to the parental lines, with transcripts at reduced or unchanged amounts and others that were found to be present in higher (perhaps compensatory amounts. Together, our results reveal considerable flexibility of gene activity at the levels of both sigma and plastid gene expression. A (still viable basal state seems to be reached, if 2 - 3 of the 6 Arabidopsis sigma genes are functionally compromised.

  8. Defense mechanisms against herbivory in Picea: sequence evolution and expression regulation of gene family members in the phenylpropanoid pathway

    Directory of Open Access Journals (Sweden)

    Porth Ilga

    2011-12-01

    Full Text Available Abstract Background In trees, a substantial amount of carbon is directed towards production of phenolics for development and defense. This metabolic pathway is also a major factor in resistance to insect pathogens in spruce. In such gene families, environmental stimuli may have an important effect on the evolutionary fate of duplicated genes, and different expression patterns may indicate functional diversification. Results Gene families in spruce (Picea have expanded to superfamilies, including O-methyltransferases, cytochrome-P450, and dirigents/classIII-peroxidases. Neo-functionalization of superfamily members from different clades is reflected in expression diversification. Genetical genomics can provide new insights into the genetic basis and evolution of insect resistance in plants. Adopting this approach, we merged genotype data (252 SNPs in a segregating pedigree, gene expression levels (for 428 phenylpropanoid-related genes and measures of susceptibility to Pissodes stobi, using a partial-diallel crossing-design with white spruce (Picea glauca. Thirty-eight expressed phenylpropanoid-related genes co-segregated with weevil susceptibility, indicating either causative or reactive effects of these genes to weevil resistance. We identified eight regulatory genomic regions with extensive overlap of quantitative trait loci from susceptibility and growth phenotypes (pQTLs and expression QTL (eQTL hotspots. In particular, SNPs within two different CCoAOMT loci regulate phenotypic variation from a common set of 24 genes and three resistance traits. Conclusions Pest resistance was associated with individual candidate genes as well as with trans-regulatory hotspots along the spruce genome. Our results showed that specific genes within the phenylpropanoid pathway have been duplicated and diversified in the conifer in a process fundamentally different from short-lived angiosperm species. These findings add to the information about the role of the

  9. Identification of suitable genes contributes to lung adenocarcinoma clustering by multiple meta-analysis methods.

    Science.gov (United States)

    Yang, Ze-Hui; Zheng, Rui; Gao, Yuan; Zhang, Qiang

    2016-09-01

    With the widespread application of high-throughput technology, numerous meta-analysis methods have been proposed for differential expression profiling across multiple studies. We identified the suitable differentially expressed (DE) genes that contributed to lung adenocarcinoma (ADC) clustering based on seven popular multiple meta-analysis methods. Seven microarray expression profiles of ADC and normal controls were extracted from the ArrayExpress database. The Bioconductor was used to perform the data preliminary preprocessing. Then, DE genes across multiple studies were identified. Hierarchical clustering was applied to compare the classification performance for microarray data samples. The classification efficiency was compared based on accuracy, sensitivity and specificity. Across seven datasets, 573 ADC cases and 222 normal controls were collected. After filtering out unexpressed and noninformative genes, 3688 genes were remained for further analysis. The classification efficiency analysis showed that DE genes identified by sum of ranks method separated ADC from normal controls with the best accuracy, sensitivity and specificity of 0.953, 0.969 and 0.932, respectively. The gene set with the highest classification accuracy mainly participated in the regulation of response to external stimulus (P = 7.97E-04), cyclic nucleotide-mediated signaling (P = 0.01), regulation of cell morphogenesis (P = 0.01) and regulation of cell proliferation (P = 0.01). Evaluation of DE genes identified by different meta-analysis methods in classification efficiency provided a new perspective to the choice of the suitable method in a given application. Varying meta-analysis methods always present varying abilities, so synthetic consideration should be taken when providing meta-analysis methods for particular research. © 2015 John Wiley & Sons Ltd.

  10. Functional genomics and expression analysis of the Corynebacterium glutamicum fpr2-cysIXHDNYZ gene cluster involved in assimilatory sulphate reduction

    Directory of Open Access Journals (Sweden)

    Albersmeier Andreas

    2005-09-01

    the extended cluster are conserved in 11 of 17 completely sequenced members of the Actinomycetales. Conclusion The set of C. glutamicum genes involved in assimilatory sulphate reduction was identified and four novel genes involved in this pathway were found. The high degree of conservation of this cluster among the Actinomycetales supports the hypothesis that a different metabolic pathway for the reduction of inorganic sulphur compounds than that known from the well-studied model organisms E. coli and B. subtilis is used by members of this order, providing the basis for further biochemical studies.

  11. Identification and functional analysis of gene cluster involvement in biosynthesis of the cyclic lipopeptide antibiotic pelgipeptin produced by Paenibacillus elgii

    Directory of Open Access Journals (Sweden)

    Qian Chao-Dong

    2012-09-01

    Full Text Available Abstract Background Pelgipeptin, a potent antibacterial and antifungal agent, is a non-ribosomally synthesised lipopeptide antibiotic. This compound consists of a β-hydroxy fatty acid and nine amino acids. To date, there is no information about its biosynthetic pathway. Results A potential pelgipeptin synthetase gene cluster (plp was identified from Paenibacillus elgii B69 through genome analysis. The gene cluster spans 40.8 kb with eight open reading frames. Among the genes in this cluster, three large genes, plpD, plpE, and plpF, were shown to encode non-ribosomal peptide synthetases (NRPSs, with one, seven, and one module(s, respectively. Bioinformatic analysis of the substrate specificity of all nine adenylation domains indicated that the sequence of the NRPS modules is well collinear with the order of amino acids in pelgipeptin. Additional biochemical analysis of four recombinant adenylation domains (PlpD A1, PlpE A1, PlpE A3, and PlpF A1 provided further evidence that the plp gene cluster involved in pelgipeptin biosynthesis. Conclusions In this study, a gene cluster (plp responsible for the biosynthesis of pelgipeptin was identified from the genome sequence of Paenibacillus elgii B69. The identification of the plp gene cluster provides an opportunity to develop novel lipopeptide antibiotics by genetic engineering.

  12. Localization and physical mapping of a plasmid-borne 23-kb nif gene cluster from Enterobacter agglomerans showing homology to the entire nif gene cluster of Klebsiella pneumoniae M5a1.

    Science.gov (United States)

    Singh, M; Kreutzer, R; Acker, G; Klingmüller, W

    1988-01-01

    A physical and genetical map of the plasmid pEA3 indigenous to Enterobacter agglomerans is presented. pEA3 is a 111-kb large plasmid containing a 23-kb large cluster of nif genes which shows extensive homology (Southern hybridization and heteroduplex analysis) to the entire nif gene cluster of Klebsiella pneumoniae (Kp) M5a1. All the nif genes on pEA3 are organized in the same manner as in K. pneumoniae, except nifJ, which is located on the left end of pEA3 nif gene cluster (near nifQB). A BamHI restriction map of pEA3 and a detailed restriction map of the 23-kb nif region on pEA3 is also presented. The nif genes of pEA3 showed a low level of acetylene reduction in Escherichia coli, demonstrating that these genes are functional and contain the whole genetic information required to fix nitrogen. The origin of vegetative replication (OriV) of pEA3 was localized about 5.5 kb from the right end of the nif gene cluster. In addition to pEA3, large plasmids from four other strains of E. agglomerans showed homology to all the Kp nif genes tested, indicating that in diazotrophic strains of E. agglomerans nif genes are usually located on plasmids. In contrast, in most of the free-living, nitrogen-fixing bacteria the nif genes are on chromosome.

  13. The ZNF75 zinc finger gene subfamily: Isolation and mapping of the four members in humans and great apes

    Energy Technology Data Exchange (ETDEWEB)

    Villa, A.; Strina, D.; Frattini, A. [Consiglio Nazionale delle Ricerche, Milan (Italy)] [and others

    1996-07-15

    We have previously reported the characterization of the human ZNF75 gene located on Xq26, which has only limited homology (less than 65%) to other ZF genes in the databases. Here, we describe three human zinc finger genes with 86 to 95% homology to ZNF75 at the nucleotide level, which represent all the members of the human ZNF75 subfamily. One of these, ZNF75B, is a pseudogene mapped to chromosome 12q13. The other two, ZNF75A and ZNF75C, maintain on ORF in the sequenced region, and at least the latter is expressed in the U937 cell line. They were mapped to chromosomes 16 and 11, respectively. All these genes are conserved in chimpanzees, gorillas, and orangutans. The ZNF75B homologue is a pseudogene in all three great apes, and in chimpanzee it is located on chromosome 10 (phylogenetic XII), at p13 (corresponding to the human 12q13). The chimpanzee homologue of ZNF75 is also located on the Xq26 chromosome, in the same region, as detected by in situ hybridization. As expected, nucleotide changes were clearly more abundant between human and organutan than between human and chimpanzee or gorilla homologues. Members of the same class were more similar to each other than to the other homologues within the same species. This suggests that the duplication and/or retrotranscription events occurred in a common ancestor long before great ape speciation. This, together with the existance of at least two genes in cows and horses, suggests a relatively high conservation of this gene family. 20 refs., 5 figs., 1 tab.

  14. Genetic variation in a member of the laminin gene family affects variation in body composition in Drosophila and humans

    Directory of Open Access Journals (Sweden)

    Hunter Gary R

    2008-08-01

    Full Text Available Abstract Background The objective of the present study was to map candidate loci influencing naturally occurring variation in triacylglycerol (TAG storage using quantitative complementation procedures in Drosophila melanogaster. Based on our results from Drosophila, we performed a human population-based association study to investigate the effect of natural variation in LAMA5 gene on body composition in humans. Results We identified four candidate genes that contributed to differences in TAG storage between two strains of D. melanogaster, including Laminin A (LanA, which is a member of the α subfamily of laminin chains. We confirmed the effects of this gene using a viable LanA mutant and showed that female flies homozygous for the mutation had significantly lower TAG storage, body weight, and total protein content than control flies. Drosophila LanA is closely related to human LAMA5 gene, which maps to the well-replicated obesity-linkage region on chromosome 20q13.2-q13.3. We tested for association between three common single nucleotide polymorphisms (SNPs in the human LAMA5 gene and variation in body composition and lipid profile traits in a cohort of unrelated women of European American (EA and African American (AA descent. In both ethnic groups, we found that SNP rs659822 was associated with weight (EA: P = 0.008; AA: P = 0.05 and lean mass (EA: P= 0.003; AA: P = 0.03. We also found this SNP to be associated with height (P = 0.01, total fat mass (P = 0.01, and HDL-cholesterol (P = 0.003 but only in EA women. Finally, significant associations of SNP rs944895 with serum TAG levels (P = 0.02 and HDL-cholesterol (P = 0.03 were observed in AA women. Conclusion Our results suggest an evolutionarily conserved role of a member of the laminin gene family in contributing to variation in weight and body composition.

  15. Gene clusters for insecticidal loline alkaloids in the grass-endophytic fungus Neotyphodium uncinatum.

    Science.gov (United States)

    Spiering, Martin J; Moon, Christina D; Wilkinson, Heather H; Schardl, Christopher L

    2005-03-01

    Loline alkaloids are produced by mutualistic fungi symbiotic with grasses, and they protect the host plants from insects. Here we identify in the fungal symbiont, Neotyphodium uncinatum, two homologous gene clusters (LOL-1 and LOL-2) associated with loline-alkaloid production. Nine genes were identified in a 25-kb region of LOL-1 and designated (in order) lolF-1, lolC-1, lolD-1, lolO-1, lolA-1, lolU-1, lolP-1, lolT-1, and lolE-1. LOL-2 contained the homologs lolC-2 through lolE-2 in the same order and orientation. Also identified was lolF-2, but its possible linkage with either cluster was undetermined. Most lol genes were regulated in N. uncinatum and N. coenophialum, and all were expressed concomitantly with loline-alkaloid biosynthesis. A lolC-2 RNA-interference (RNAi) construct was introduced into N. uncinatum, and in two independent transformants, RNAi significantly decreased lolC expression (P lol-gene products indicate that the pathway has evolved from various different primary and secondary biosynthesis pathways.

  16. Human paraoxonase gene cluster overexpression alleviates angiotensin II-induced cardiac hypertrophy in mice.

    Science.gov (United States)

    Pei, Jian-Fei; Yan, Yun-Fei; Tang, Xiaoqiang; Zhang, Yang; Cui, Shen-Shen; Zhang, Zhu-Qin; Chen, Hou-Zao; Liu, De-Pei

    2016-11-01

    Cardiac hypertrophy is the strongest predictor of the development of heart failure, and anti-hypertrophic treatment holds the key to improving the clinical syndrome and increasing the survival rates for heart failure. The paraoxonase (PON) gene cluster (PC) protects against atherosclerosis and coronary artery diseases. However, the role of PC in the heart is largely unknown. To evaluate the roles of PC in cardiac hypertrophy, transgenic mice carrying the intact human PON1, PON2, and PON3 genes and their flanking sequences were studied. We demonstrated that the PC transgene (PC-Tg) protected mice from cardiac hypertrophy induced by Ang II; these mice had reduced heart weight/body weight ratios, decreased left ventricular wall thicknesses and increased fractional shortening compared with wild-type (WT) control. The same protective tendency was also observed with an Apoe -/- background. Mechanically, PC-Tg normalized the disequilibrium of matrix metalloproteinases (MMPs)/tissue inhibitors of MMPs (TIMPs) in hypertrophic hearts, which might contribute to the protective role of PC-Tg in cardiac fibrosis and, thus, protect against cardiac remodeling. Taken together, our results identify a novel anti-hypertrophic role for the PON gene cluster, suggesting a possible strategy for the treatment of cardiac hypertrophy through elevating the levels of the PON gene family.

  17. Updated clusters of orthologous genes for Archaea: a complex ancestor of the Archaea and the byways of horizontal gene transfer

    Directory of Open Access Journals (Sweden)

    Wolf Yuri I

    2012-12-01

    Full Text Available Abstract Background Collections of Clusters of Orthologous Genes (COGs provide indispensable tools for comparative genomic analysis, evolutionary reconstruction and functional annotation of new genomes. Initially, COGs were made for all complete genomes of cellular life forms that were available at the time. However, with the accumulation of thousands of complete genomes, construction of a comprehensive COG set has become extremely computationally demanding and prone to error propagation, necessitating the switch to taxon-specific COG collections. Previously, we reported the collection of COGs for 41 genomes of Archaea (arCOGs. Here we present a major update of the arCOGs and describe evolutionary reconstructions to reveal general trends in the evolution of Archaea. Results The updated version of the arCOG database incorporates 91% of the pangenome of 120 archaea (251,032 protein-coding genes altogether into 10,335 arCOGs. Using this new set of arCOGs, we performed maximum likelihood reconstruction of the genome content of archaeal ancestral forms and gene gain and loss events in archaeal evolution. This reconstruction shows that the last Common Ancestor of the extant Archaea was an organism of greater complexity than most of the extant archaea, probably with over 2,500 protein-coding genes. The subsequent evolution of almost all archaeal lineages was apparently dominated by gene loss resulting in genome streamlining. Overall, in the evolution of Archaea as well as a representative set of bacteria that was similarly analyzed for comparison, gene losses are estimated to outnumber gene gains at least 4 to 1. Analysis of specific patterns of gene gain in Archaea shows that, although some groups, in particular Halobacteria, acquire substantially more genes than others, on the whole, gene exchange between major groups of Archaea appears to be largely random, with no major ‘highways’ of horizontal gene transfer. Conclusions The updated collection

  18. Functional characterization of members of the microRNA-17-92 cluster in the vascular system

    OpenAIRE

    Döbele, Carmen

    2011-01-01

    Almost two decades ago, microRNAs were discovered as novel posttranscriptional regulators of gene expression. Since then, research efforts have uncovered their involvement in the control of various cellular processes including migration, proliferation and cell survival. Even more complex events, such as the formation of new blood vessels or organ development, have been shown to be tightly regulated and orchestrated by microRNAs. Due to their crucial regulatory role in tissue homeostasis in ve...

  19. ATNT: an enhanced system for expression of polycistronic secondary metabolite gene clusters in Aspergillus niger.

    Science.gov (United States)

    Geib, Elena; Brock, Matthias

    2017-01-01

    Fungi are treasure chests for yet unexplored natural products. However, exploitation of their real potential remains difficult as a significant proportion of biosynthetic gene clusters appears silent under standard laboratory conditions. Therefore, elucidation of novel products requires gene activation or heterologous expression. For heterologous gene expression, we previously developed an expression platform in Aspergillus niger that is based on the transcriptional regulator TerR and its target promoter P terA . In this study, we extended this system by regulating expression of terR  by the doxycycline inducible Tet-on system. Reporter genes cloned under the control of the target promoter P terA remained silent in the absence of doxycycline, but were strongly expressed when doxycycline was added. Reporter quantification revealed that the coupled system results in about five times higher expression rates compared to gene expression under direct control of the Tet-on system. As production of secondary metabolites generally requires the expression of several biosynthetic genes, the suitability of the self-cleaving viral peptide sequence P2A was tested in this optimised expression system. P2A allowed polycistronic expression of genes required for Asp-melanin formation in combination with the gene coding for the red fluorescent protein tdTomato. Gene expression and Asp-melanin formation was prevented in the absence of doxycycline and strongly induced by addition of doxycycline. Fluorescence studies confirmed the correct subcellular localisation of the respective enzymes. This tightly regulated but strongly inducible expression system enables high level production of secondary metabolites most likely even those with toxic potential. Furthermore, this system is compatible with polycistronic gene expression and, thus, suitable for the discovery of novel natural products.

  20. Heterologous Reconstitution of the Intact Geodin Gene Cluster in Aspergillus nidulans through a Simple and Versatile PCR Based Approach

    DEFF Research Database (Denmark)

    Nielsen, Morten Thrane; Nielsen, Jakob Blæsbjerg; Anyaogu, Dianna Chinyere

    2013-01-01

    was transferred in a two step procedure to an expression platform in A. nidulans. The individual cluster fragments were generated by PCR and assembled via efficient USER fusion prior to ransformation and integration via re-iterative gene targeting. A total of 13 open reading frames contained in 25 kb of DNA were...... of solid methodology for genetic manipulation of most species severely hampers pathway haracterization. Here we present a simple PCR based approach for heterologous reconstitution of intact gene clusters. Specifically, the putative gene cluster responsible for geodin production from Aspergillus terreus...... successfully transferred between the two species enabling geodin synthesis in A. nidulans. Subsequently, functions of three genes in the cluster were validated by genetic and chemical analyses. Specifically, ATEG_08451 (gedC) encodes a polyketide synthase, ATEG_08453 (gedR) encodes a transcription factor...

  1. Homoeologous Recombination of the V1r1-V1r2 Gene Cluster of Pheromone Receptors in an Allotetraploid Lineage of Teleosts

    Directory of Open Access Journals (Sweden)

    Lei Zhong

    2017-11-01

    Full Text Available In contrast to other olfactory receptor families that exhibit frequent lineage-specific expansions, the vomeronasal type 1 receptor (V1R family exhibits a canonical six-member repertoire in teleosts. V1r1 and V1r2 are present in no more than one copy in all examined teleosts, including salmons, which are ancient polyploids, implying strict evolutionary constraints. However, recent polyploids have not been examined. Here, we identified a young allotetraploid lineage of weatherfishes and investigated their V1r1-V1r2 cluster. We found a novel pattern that the parental V1r1-V1r2 clusters had recombined in the tetraploid genome and that the recombinant was nearly fixed in the tetraploid population. Subsequent analyses suggested strong selective pressure, for both a new combination of paralogs and homogeneity among gene duplicates, acting on the V1r1-V1r2 pair.

  2. Meta-analysis of cell- specific transcriptomic data using fuzzy c-means clustering discovers versatile viral responsive genes.

    Science.gov (United States)

    Khan, Atif; Katanic, Dejan; Thakar, Juilee

    2017-06-06

    Despite advances in the gene-set enrichment analysis methods; inadequate definitions of gene-sets cause a major limitation in the discovery of novel biological processes from the transcriptomic datasets. Typically, gene-sets are obtained from publicly available pathway databases, which contain generalized definitions frequently derived by manual curation. Recently unsupervised clustering algorithms have been proposed to identify gene-sets from transcriptomics datasets deposited in public domain. These data-driven definitions of the gene-sets can be context-specific revealing novel biological mechanisms. However, the previously proposed algorithms for identification of data-driven gene-sets are based on hard clustering which do not allow overlap across clusters, a characteristic that is predominantly observed across biological pathways. We developed a pipeline using fuzzy-C-means (FCM) soft clustering approach to identify gene-sets which recapitulates topological characteristics of biological pathways. Specifically, we apply our pipeline to derive gene-sets from transcriptomic data measuring response of monocyte derived dendritic cells and A549 epithelial cells to influenza infections. Our approach apply Ward's method for the selection of initial conditions, optimize parameters of FCM algorithm for human cell-specific transcriptomic data and identify robust gene-sets along with versatile viral responsive genes. We validate our gene-sets and demonstrate that by identifying genes associated with multiple gene-sets, FCM clustering algorithm significantly improves interpretation of transcriptomic data facilitating investigation of novel biological processes by leveraging on transcriptomic data available in the public domain. We develop an interactive 'Fuzzy Inference of Gene-sets (FIGS)' package (GitHub: https://github.com/Thakar-Lab/FIGS ) to facilitate use of of pipeline. Future extension of FIGS across different immune cell-types will improve mechanistic

  3. Comparison of loline alkaloid gene clusters across fungal endophytes: predicting the co-regulatory sequence motifs and the evolutionary history.

    Science.gov (United States)

    Kutil, Brandi L; Greenwald, Charles; Liu, Gang; Spiering, Martin J; Schardl, Christopher L; Wilkinson, Heather H

    2007-10-01

    LOL, a fungal secondary metabolite gene cluster found in Epichloë and Neotyphodium species, is responsible for production of insecticidal loline alkaloids. To analyze the genetic architecture and to predict the evolutionary history of LOL, we compared five clusters from four fungal species (single clusters from Epichloë festucae, Neotyphodium sp. PauTG-1, Neotyphodium coenophialum, and two clusters we previously characterized in Neotyphodium uncinatum). Using PhyloCon to compare putative lol gene promoter regions, we have identified four motifs conserved across the lol genes in all five clusters. Each motif has significant similarity to known fungal transcription factor binding sites in the TRANSFAC database. Conservation of these motifs is further support for the hypothesis that the lol genes are co-regulated. Interestingly, the history of asexual Neotyphodium spp. includes multiple interspecific hybridization events. Comparing clusters from three Neotyphodium species and E. festucae allowed us to determine which Epichloë ancestors are the most likely contributors of LOL in these asexual species. For example, while no present day Epichloë typhina isolates are known to produce lolines, our data support the hypothesis that the E. typhina ancestor(s) of three asexual endophyte species contained a LOL gene cluster. Thus, these data support a model of evolution in which the polymorphism in loline alkaloid production phenotypes among endophyte species is likely due to the loss of the trait over time.

  4. Noise Resistant Generalized Parametric Validity Index of Clustering for Gene Expression Data.

    Science.gov (United States)

    Fa, Rui; Nandi, Asoke K

    2014-01-01

    Validity indices have been investigated for decades. However, since there is no study of noise-resistance performance of these indices in the literature, there is no guideline for determining the best clustering in noisy data sets, especially microarray data sets. In this paper, we propose a generalized parametric validity (GPV) index which employs two tunable parameters α and β to control the proportions of objects being considered to calculate the dissimilarities. The greatest advantage of the proposed GPV index is its noise-resistance ability, which results from the flexibility of tuning the parameters. Several rules are set to guide the selection of parameter values. To illustrate the noise-resistance performance of the proposed index, we evaluate the GPV index for assessing five clustering algorithms in two gene expression data simulation models with different noise levels and compare the ability of determining the number of clusters with eight existing indices. We also test the GPV in three groups of real gene expression data sets. The experimental results suggest that the proposed GPV index has superior noise-resistance ability and provides fairly accurate judgements.

  5. Characterization of the biosynthetic gene cluster of rebeccamycin from Lechevalieria aerocolonigenes ATCC 39243.

    Science.gov (United States)

    Onaka, Hiroyasu; Taniguchi, Shin-ichi; Igarashi, Yasuhiro; Furumai, Tamotsu

    2003-01-01

    The biosynthetic gene cluster for rebeccamycin, an indolocarbazole antibiotic, from Lechevalieria aerocolonigenes ATCC 39243 has 11 ORFs. To clarify their functions, mutants with rebG, rebD, rebC, rebP, rebM, rebR, rebH, rebT, or orfD2 disrupted were constructed, and the gene products were examined. rebP disruptants produced 11,11'-dichlorochromopyrrolic acid, found to be a biosynthetic intermediate by a bioconversion experiment. Other genes encoded N-glycosyltransferase (rebG), monooxygenase (rebC), methyltransferase (rebM), a transcriptional activator (rebR), and halogenase (rebH). rebT disruptants produced rebeccamycin as much as the wild strain, so rebT was probably not involved in rebeccamycin production. Biosynthetic genes of staurosporine, an another indolocarbazole antibiotic, were cloned from Streptomyces sp. TP-A0274. staO, staD, and staP were similar to rebO, rebD, and rebP, respectively, all of which are responsible for indolocarbazole biosynthesis, But a rebC homolog, encoding a putative enzyme oxidizing the C-7 site of pyrrole rings, was not found in the staurosporine biosynthetic gene cluster. These results suggest that indolocarbazole is constructed by oxidative decarboxylation of chromopyrrolic acid (11,11'-dichlorochromopyrrolic acid in rebeccamycin) generated from two molecules of tryptophan by coupling and that the oxidation state at the C-7 position depends on the additional enzyme(s) encoded by the biosynthetic genes.

  6. Phylogenetic analysis of a gene cluster encoding an additional, rhizobial-like type III secretion system that is narrowly distributed among Pseudomonas syringae strains

    Science.gov (United States)

    2012-01-01

    Background The central role of Type III secretion systems (T3SS) in bacteria-plant interactions is well established, yet unexpected findings are being uncovered through bacterial genome sequencing. Some Pseudomonas syringae strains possess an uncharacterized cluster of genes encoding putative components of a second T3SS (T3SS-2) in addition to the well characterized Hrc1 T3SS which is associated with disease lesions in host plants and with the triggering of hypersensitive response in non-host plants. The aim of this study is to perform an in silico analysis of T3SS-2, and to compare it with other known T3SSs. Results Based on phylogenetic analysis and gene organization comparisons, the T3SS-2 cluster of the P. syringae pv. phaseolicola strain is grouped with a second T3SS found in the pNGR234b plasmid of Rhizobium sp. These additional T3SS gene clusters define a subgroup within the Rhizobium T3SS family. Although, T3SS-2 is not distributed as widely as the Hrc1 T3SS in P. syringae strains, it was found to be constitutively expressed in P. syringae pv phaseolicola through RT-PCR experiments. Conclusions The relatedness of the P. syringae T3SS-2 to a second T3SS from the pNGR234b plasmid of Rhizobium sp., member of subgroup II of the rhizobial T3SS family, indicates common ancestry and/or possible horizontal transfer events between these species. Functional analysis and genome sequencing of more rhizobia and P. syringae pathovars may shed light into why these bacteria maintain a second T3SS gene cluster in their genome. PMID:22937899

  7. Pathogen corruption and site-directed recombination at a plant disease resistance gene cluster.

    Science.gov (United States)

    Nagy, Ervin D; Bennetzen, Jeffrey L

    2008-12-01

    The Pc locus of sorghum (Sorghum bicolor) determines dominant sensitivity to a host-selective toxin produced by the fungal pathogen Periconia circinata. The Pc region was cloned by a map-based approach and found to contain three tandemly repeated genes with the structures of nucleotide binding site-leucine-rich repeat (NBS-LRR) disease resistance genes. Thirteen independent Pc-to-pc mutations were analyzed, and each was found to remove all or part of the central gene of the threesome. Hence, this central gene is Pc. Most Pc-to-pc mutations were associated with unequal recombination. Eight recombination events were localized to different sites in a 560-bp region within the approximately 3.7-kb NBS-LRR genes. Because any unequal recombination located within the flanking NBS-LRR genes would have removed Pc, the clustering of cross-over events within a 560-bp segment indicates that a site-directed recombination process exists that specifically targets unequal events to generate LRR diversity in NBS-LRR loci.

  8. PRS1 is a key member of the gene family encoding phosphoribosylpyrophosphate synthetase in Saccharomyces cerevisiae

    DEFF Research Database (Denmark)

    Carter, Andrew T.; Beiche, Flora; Hove-Jensen, Bjarne

    1997-01-01

    In Saccharomyces cerevisiae the metabolite phosphoribosyl-pyrophosphate (PRPP) is required for purine, pyrimidine, tryptophan and histidine biosynthesis. Enzymes that can synthesize PRPP can be encoded by at least four genes. We have studied 5-phospho-ribosyl-1(α)-pyrophosphate synthetases (PRS......) genetically and biochemically. Each of the four genes, all of which are transcribed, has been disrupted in haploid yeast strains of each mating type and although all disruptants are able to grow on complete medium, differences in growth rate and enzyme activity suggest that disruption of PRS1 or PRS3 has...

  9. Isoeugenol monooxygenase and its putative regulatory gene are located in the eugenol metabolic gene cluster in Pseudomonas nitroreducens Jin1.

    Science.gov (United States)

    Ryu, Ji-Young; Seo, Jiyoung; Unno, Tatsuya; Ahn, Joong-Hoon; Yan, Tao; Sadowsky, Michael J; Hur, Hor-Gil

    2010-03-01

    The plant-derived phenylpropanoids eugenol and isoeugenol have been proposed as useful precursors for the production of natural vanillin. Genes involved in the metabolism of eugenol and isoeugenol were clustered in region of about a 30 kb of Pseudomonas nitroreducens Jin1. Two of the 23 ORFs in this region, ORFs 26 (iemR) and 27 (iem), were predicted to be involved in the conversion of isoeugenol to vanillin. The deduced amino acid sequence of isoeugenol monooxygenase (Iem) of strain Jin1 had 81.4% identity to isoeugenol monooxygenase from Pseudomonas putida IE27, which also transforms isoeugenol to vanillin. Iem was expressed in E. coli BL21(DE3) and was found to lead to isoeugenol to vanillin transformation. Deletion and cloning analyses indicated that the gene iemR, located upstream of iem, is required for expression of iem in the presence of isoeugenol, suggesting it to be the iem regulatory gene. Reverse transcription, real-time PCR analyses indicated that the genes involved in the metabolism of eugenol and isoeugenol were differently induced by isoeugenol, eugenol, and vanillin.

  10. Functional classification of genes using semantic distance and fuzzy clustering approach: evaluation with reference sets and overlap analysis.

    Science.gov (United States)

    Devignes, Marie-Dominique; Benabderrahmane, Sidahmed; Smaïl-Tabbone, Malika; Napoli, Amedeo; Poch, Olivier

    2012-01-01

    Functional classification aims at grouping genes according to their molecular function or the biological process they participate in. Evaluating the validity of such unsupervised gene classification remains a challenge given the variety of distance measures and classification algorithms that can be used. We evaluate here functional classification of genes with the help of reference sets: KEGG (Kyoto Encyclopaedia of Genes and Genomes) pathways and Pfam clans. These sets represent ground truth for any distance based on GO (Gene Ontology) biological process and molecular function annotations respectively. Overlaps between clusters and reference sets are estimated by the F-score method. We test our previously described IntelliGO semantic distance with hierarchical and fuzzy C-means clustering and we compare results with the state-of-the-art DAVID (Database for Annotation Visualisation and Integrated Discovery) functional classification method. Finally, study of best matching clusters to reference sets leads us to propose a set-difference method for discovering missing information.

  11. The human serum paraoxonase/arylesterase gene (PON1) is one member of a multigene family

    Energy Technology Data Exchange (ETDEWEB)

    Primo-Parmo, S.L.; Sorenson, R.C.; La Du, B.N. [Univ. of Michigan Medical School, Ann Arbor, MI (United States); Sorenson, R.C. [Univ. of Michigan, Ann Arbor, MI (United States)

    1996-05-01

    A physiological role for paraoxonase (PON1) is still uncertain, but it catalyzes the hydrolysis of toxic organophosphates. Evidence that the human genome contains two PON1-like genes, designated PON2 and PON3, is presented here. Human PON1 and PON2 each have nine exons, and the exon/intron junctions occur at equivalent positions. PON1 and PON2 genes are both on chromosome 7 in human and on chromosome 6 in the mouse. Turkey and chicken, like most birds, lack paraoxonase activity and are very susceptible to organophosphates. However, they have a PON-like gene with {approximately}70% identity with human PON1, PON2, and PON3. Another unexpected finding is that the deduced amino acid sequences of PON2 in human, mouse, dog, turkey, and chicken and of human PON3 are all missing the amino acid residue 105, which is lysine in human PON1. The expanded number of PON genes will have important implications for future experiments designed to discover the individual functions, catalytic properties, and physiological role of the paraoxonases. 35 refs., 4 figs., 3 tabs.

  12. Determination of taste receptor type 1 member 1 (TAS1R1) gene ...

    African Journals Online (AJOL)

    In this article, the objective was to investigate variations in goat TAS1R1 gene and their associations with growth traits in 317 goats by PCR-SSCP and DNA sequencing methods. The results showed two novel single nucleotide polymorphisms (SNPs): HM449123:g. [T3974C, C4037T]. In detail, two different alleles, A and B, ...

  13. Determination of taste receptor type 1 member 1 (TAS1R1) gene ...

    African Journals Online (AJOL)

    Jane

    2011-10-10

    Oct 10, 2011 ... Most of the previous studies laid particular emphasis on the taste sensitivity to umami substances or the relation of umami taste ... AB, and BB with the frequency distribution of allele B from 0.59 to 0.67 in analyzed populations. The ... locus within TAS1R1 gene possessed moderate genetic diversity in goat.

  14. Identifying driving gene clusters in complex diseases through critical transition theory

    Science.gov (United States)

    Wolanyk, Nathaniel; Wang, Xujing; Hessner, Martin; Gao, Shouguo; Chen, Ye; Jia, Shuang

    A novel approach of looking at the human body using critical transition theory has yielded positive results: clusters of genes that act in tandem to drive complex disease progression. This cluster of genes can be thought of as the first part of a large genetic force that pushes the body from a curable, but sick, point to an incurable diseased point through a catastrophic bifurcation. The data analyzed is time course microarray blood assay data of 7 high risk individuals for Type 1 Diabetes who progressed into a clinical onset, with an additional larger study requested to be presented at the conference. The normalized data is 25,000 genes strong, which were narrowed down based on statistical metrics, and finally a machine learning algorithm using critical transition metrics found the driving network. This approach was created to be repeatable across multiple complex diseases with only progression time course data needed so that it would be applicable to identifying when an individual is at risk of developing a complex disease. Thusly, preventative measures can be enacted, and in the longer term, offers a possible solution to prevent all Type 1 Diabetes.

  15. Characterization of the aldo-keto reductase 1C gene cluster on pig chromosome 10: possible associations with reproductive traits

    Directory of Open Access Journals (Sweden)

    Nonneman Dan J

    2006-09-01

    Full Text Available Abstract Background The rate of pubertal development and weaning to estrus interval are correlated and affect reproductive efficiency of swine. Quantitative trait loci (QTL for age of puberty, nipple number and ovulation rate have been identified in Meishan crosses on pig chromosome 10q (SSC10 near the telomere, which is homologous to human chromosome 10p15 and contains an aldo-keto reductase (AKR gene cluster with at least six family members. AKRs are tissue-specific hydroxysteroid dehydrogenases that interconvert weak steroid hormones to their more potent counterparts and regulate processes involved in development, homeostasis and reproduction. Because of their location in the swine genome and their implication in reproductive physiology, this gene cluster was characterized and evaluated for effects on reproductive traits in swine. Results Screening the porcine CHORI-242 BAC library with a full-length AKR1C4 cDNA identified 7 positive clones and sample sequencing of 5 BAC clones revealed 5 distinct AKR1C genes (AKR1CL2 and AKR1C1 through 4, which mapped to 126–128 cM on SSC10. Using the IMpRH7000rad and IMNpRH212000rad radiation hybrid panels, these 5 genes mapped between microsatellite markers SWR67 and SW2067. Comparison of sequence data with the porcine BAC fingerprint map show that the cluster of genes resides in a 300 kb region. Twelve SNPs were genotyped in gilts observed for age at first estrus and ovulation rate from the F8 and F10 generations of one-quarter Meishan descendants of the USMARC resource population. Age at puberty, nipple number and ovulation rate data were analyzed for association with genotypes by MTDFREML using an animal model. One SNP, a phenylalanine to isoleucine substitution in AKR1C2, was associated with age of puberty (p = 0.07 and possibly ovulation rate (p = 0.102. Two SNP in AKR1C4 were significantly associated with nipple number (p ≤ 0.03 and another possibly associated with age at puberty (p = 0

  16. The Serratia gene cluster encoding biosynthesis of the red antibiotic, prodigiosin, shows species- and strain-dependent genome context variation

    DEFF Research Database (Denmark)

    Harris, Abigail K P; Williamson, Neil R; Slater, Holly

    2004-01-01

    The prodigiosin biosynthesis gene cluster (pig cluster) from two strains of Serratia (S. marcescens ATCC 274 and Serratia sp. ATCC 39006) has been cloned, sequenced and expressed in heterologous hosts. Sequence analysis of the respective pig clusters revealed 14 ORFs in S. marcescens ATCC 274...... from Str. coelicolor A3(2) revealed some important differences. A modified scheme for the biosynthesis of prodigiosin, based on the pathway recently suggested for the synthesis of undecylprodigiosin, is proposed. The distribution of the pig cluster within several Serratia sp. isolates is demonstrated...

  17. Generating in vivo cloning vectors for parallel cloning of large gene clusters by homologous recombination.

    Directory of Open Access Journals (Sweden)

    Jeongmin Lee

    Full Text Available A robust method for the in vivo cloning of large gene clusters was developed based on homologous recombination (HR, requiring only the transformation of PCR products into Escherichia coli cells harboring a receiver plasmid. Positive clones were selected by an acquired antibiotic resistance, which was activated by the recruitment of a short ribosome-binding site plus start codon sequence from the PCR products to the upstream position of a silent antibiotic resistance gene in receiver plasmids. This selection was highly stringent and thus the cloning efficiency of the GFPuv gene (size: 0.7 kb was comparable to that of the conventional restriction-ligation method, reaching up to 4.3 × 10(4 positive clones per μg of DNA. When we attempted parallel cloning of GFPuv fusion genes (size: 2.0 kb and carotenoid biosynthesis pathway clusters (sizes: 4 kb, 6 kb, and 10 kb, the cloning efficiency was similarly high regardless of the DNA size, demonstrating that this would be useful for the cloning of large DNA sequences carrying multiple open reading frames. However, restriction analyses of the obtained plasmids showed that the selected cells may contain significant amounts of receiver plasmids without the inserts. To minimize the amount of empty plasmid in the positive selections, the sacB gene encoding a levansucrase was introduced as a counter selection marker in receiver plasmid as it converts sucrose to a toxic levan in the E. coli cells. Consequently, this method yielded completely homogeneous plasmids containing the inserts via the direct transformation of PCR products into E. coli cells.

  18. antiSMASH 3.0-a comprehensive resource for the genome mining of biosynthetic gene clusters.

    Science.gov (United States)

    Weber, Tilmann; Blin, Kai; Duddela, Srikanth; Krug, Daniel; Kim, Hyun Uk; Bruccoleri, Robert; Lee, Sang Yup; Fischbach, Michael A; Müller, Rolf; Wohlleben, Wolfgang; Breitling, Rainer; Takano, Eriko; Medema, Marnix H

    2015-07-01

    Microbial secondary metabolism constitutes a rich source of antibiotics, chemotherapeutics, insecticides and other high-value chemicals. Genome mining of gene clusters that encode the biosynthetic pathways for these metabolites has become a key methodology for novel compound discovery. In 2011, we introduced antiSMASH, a web server and stand-alone tool for the automatic genomic identification and analysis of biosynthetic gene clusters, available at http://antismash.secondarymetabolites.org. Here, we present version 3.0 of antiSMASH, which has undergone major improvements. A full integration of the recently published ClusterFinder algorithm now allows using this probabilistic algorithm to detect putative gene clusters of unknown types. Also, a new dereplication variant of the ClusterBlast module now identifies similarities of identified clusters to any of 1172 clusters with known end products. At the enzyme level, active sites of key biosynthetic enzymes are now pinpointed through a curated pattern-matching procedure and Enzyme Commission numbers are assigned to functionally classify all enzyme-coding genes. Additionally, chemical structure prediction has been improved by incorporating polyketide reduction states. Finally, in order for users to be able to organize and analyze multiple antiSMASH outputs in a private setting, a new XML output module allows offline editing of antiSMASH annotations within the Geneious software. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  19. A genome-wide association study on androstenone levels in pigs reveals a cluster of candidate genes on chromosome 6

    Directory of Open Access Journals (Sweden)

    Groenen Martien AM

    2010-05-01

    Full Text Available Abstract Background In many countries, male piglets are castrated shortly after birth because a proportion of un-castrated male pigs produce meat with an unpleasant flavour and odour. Main compounds of boar taint are androstenone and skatole. The aim of this high-density genome-wide association study was to identify single nucleotide polymorphisms (SNPs associated with androstenone levels in a commercial sire line of pigs. The identification of major genetic effects causing boar taint would accelerate the reduction of boar taint through breeding to finally eliminate the need for castration. Results The Illumina Porcine 60K+SNP Beadchip was genotyped on 987 pigs divergent for androstenone concentration from a commercial Duroc-based sire line. The association analysis with 47,897 SNPs revealed that androstenone levels in fat tissue were significantly affected by 37 SNPs on pig chromosomes SSC1 and SSC6. Among them, the 5 most significant SNPs explained together 13.7% of the genetic variance in androstenone. On SSC6, a larger region of 10 Mb was shown to be associated with androstenone covering several candidate genes potentially involved in the synthesis and metabolism of androgens. Besides known candidate genes, such as cytochrome P450 A19 (CYP2A19, sulfotransferases SULT2A1, and SULT2B1, also new members of the cytochrome P450 CYP2 gene subfamilies and of the hydroxysteroid-dehydrogenases (HSD17B14 were found. In addition, the gene encoding the ß-chain of the luteinizing hormone (LHB which induces steroid synthesis in the Leydig cells of the testis at onset of puberty maps to this area on SSC6. Interestingly, the gene encoding the α-chain of LH is also located in one of the highly significant areas on SSC1. Conclusions This study reveals several areas of the genome at high resolution responsible for variation of androstenone levels in intact boars. Major genetic factors on SSC1 and SSC6 showing moderate to large effects on androstenone

  20. New tools for reconstruction and heterologous expression of natural product biosynthetic gene clusters.

    Science.gov (United States)

    Luo, Yunzi; Enghiad, Behnam; Zhao, Huimin

    2016-02-01

    Natural product scaffolds remain a major source and inspiration for human therapeutics. However, generation of a natural product in the post-genomic era often requires reconstruction of the corresponding biosynthetic gene cluster in a heterologous host. In the burgeoning fields of synthetic biology and metabolic engineering, a significant amount of efforts has been devoted to develop DNA assembly techniques with higher efficiency, fidelity, and modularity, and heterologous expression systems with higher productivity and yield. Here we describe recent advances in DNA assembly and host engineering and highlight their applications in natural product discovery and engineering.

  1. Genome-wide identification of physically clustered genes suggests chromatin-level co-regulation in male reproductive development in Arabidopsis thaliana.

    Science.gov (United States)

    Reimegård, Johan; Kundu, Snehangshu; Pendle, Ali; Irish, Vivian F; Shaw, Peter; Nakayama, Naomi; Sundström, Jens F; Emanuelsson, Olof

    2017-04-07

    Co-expression of physically linked genes occurs surprisingly frequently in eukaryotes. Such chromosomal clustering may confer a selective advantage as it enables coordinated gene regulation at the chromatin level. We studied the chromosomal organization of genes involved in male reproductive development in Arabidopsis thaliana. We developed an in-silico tool to identify physical clusters of co-regulated genes from gene expression data. We identified 17 clusters (96 genes) involved in stamen development and acting downstream of the transcriptional activator MS1 (MALE STERILITY 1), which contains a PHD domain associated with chromatin re-organization. The clusters exhibited little gene homology or promoter element similarity, and largely overlapped with reported repressive histone marks. Experiments on a subset of the clusters suggested a link between expression activation and chromatin conformation: qRT-PCR and mRNA in situ hybridization showed that the clustered genes were up-regulated within 48 h after MS1 induction; out of 14 chromatin-remodeling mutants studied, expression of clustered genes was consistently down-regulated only in hta9/hta11, previously associated with metabolic cluster activation; DNA fluorescence in situ hybridization confirmed that transcriptional activation of the clustered genes was correlated with open chromatin conformation. Stamen development thus appears to involve transcriptional activation of physically clustered genes through chromatin de-condensation. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  2. Apolipoprotein A-V interaction with members of the low density lipoprotein receptor gene family

    DEFF Research Database (Denmark)

    Nilsson, Stefan K; Lookene, Aivar; Beckstead, Jennifer A

    2007-01-01

    Apolipoprotein A-V is a potent modulator of plasma triacylglycerol levels. To investigate the molecular basis for this phenomenon we explored the ability of apolipoprotein A-V, in most experiments complexed to disks of dimyristoylphosphatidylcholine, to interact with two members of the low density...... lipoprotein receptor family, the low density lipoprotein receptor-related protein and the mosaic type-1 receptor, SorLA. Experiments using surface plasmon resonance showed specific binding of both free and lipid-bound apolipoprotein A-V to both receptors. The binding was calcium dependent and was inhibited......, apolipoprotein A-V (Arg210Glu/Lys211Gln), showed decreased binding to heparin and decreased ability to bind the low density lipoprotein receptor-related protein. Association of apolipoprotein A-V with the low density lipoprotein receptor-related protein or SorLA resulted in enhanced binding of human chylomicrons...

  3. Genetic interrelations in the actinomycin biosynthetic gene clusters of Streptomyces antibioticus IMRU 3720 and Streptomyces chrysomallus ATCC11523, producers of actinomycin X and actinomycin C

    Science.gov (United States)

    Crnovčić, Ivana; Rückert, Christian; Semsary, Siamak; Lang, Manuel; Kalinowski, Jörn; Keller, Ullrich

    2017-01-01

    Sequencing the actinomycin (acm) biosynthetic gene cluster of Streptomyces antibioticus IMRU 3720, which produces actinomycin X (Acm X), revealed 20 genes organized into a highly similar framework as in the bi-armed acm C biosynthetic gene cluster of Streptomyces chrysomallus but without an attached additional extra arm of orthologues as in the latter. Curiously, the extra arm of the S. chrysomallus gene cluster turned out to perfectly match the single arm of the S. antibioticus gene cluster in the same order of orthologues including the the presence of two pseudogenes, scacmM and scacmN, encoding a cytochrome P450 and its ferredoxin, respectively. Orthologues of the latter genes were both missing in the principal arm of the S. chrysomallus acm C gene cluster. All orthologues of the extra arm showed a G +C-contents different from that of their counterparts in the principal arm. Moreover, the similarities of translation products from the extra arm were all higher to the corresponding translation products of orthologue genes from the S. antibioticus acm X gene cluster than to those encoded by the principal arm of their own gene cluster. This suggests that the duplicated structure of the S. chrysomallus acm C biosynthetic gene cluster evolved from previous fusion between two one-armed acm gene clusters each from a different genetic background. However, while scacmM and scacmN in the extra arm of the S. chrysomallus acm C gene cluster are mutated and therefore are non-functional, their orthologues saacmM and saacmN in the S. antibioticus acm C gene cluster show no defects seemingly encoding active enzymes with functions specific for Acm X biosynthesis. Both acm biosynthetic gene clusters lack a kynurenine-3-monooxygenase gene necessary for biosynthesis of 3-hydroxy-4-methylanthranilic acid, the building block of the Acm chromophore, which suggests participation of a genome-encoded relevant monooxygenase during Acm biosynthesis in both S. chrysomallus and S

  4. The entire β-globin gene cluster is deleted in a form of τδβ-thalassemia.

    NARCIS (Netherlands)

    E.R. Fearon; H.H.Jr. Kazazian; P.G. Waber (Pamela); J.I. Lee (Joseph); S.E. Antonarakis; S.H. Orkin (Stuart); E.F. Vanin; P.S. Henthorn; F.G. Grosveld (Frank); A.F. Scott; G.R. Buchanan

    1983-01-01

    textabstractWe have used restriction endonuclease mapping to study a deletion involving the beta-globin gene cluster in a Mexican-American family with gamma delta beta-thalassemia. Analysis of DNA polymorphisms demonstrated deletion of the beta-globin gene from the affected chromosome. Using a DNA

  5. Ladder-like amplification of the type I interferon gene cluster in the human osteosarcoma cell line MG63

    Czech Academy of Sciences Publication Activity Database

    Marella, N.V.; Zeitz, M.J.; Malyavantham, K.S.; Pliss, A.; Matsui, S.; Goetze, S.; Bode, J.; Raška, Ivan; Berezney, R.

    2008-01-01

    Roč. 16, č. 8 (2008), s. 1177-1192 ISSN 0967-3849 Grant - others:GA MŠk(CZ) LC535 Program:LC Institutional research plan: CEZ:AV0Z50110509 Keywords : array comparative genomic hybridization * human osteosarcoma cells * interferon gene cluster Subject RIV: EB - Gene tics ; Molecular Biology Impact factor: 3.405, year: 2008

  6. Dispersed Benzoxazinone Gene Cluster: Molecular Characterization and Chromosomal Localization of Glucosyltransferase and Glucosidase Genes in Wheat and Rye1[W

    Science.gov (United States)

    Sue, Masayuki; Nakamura, Chihiro; Nomura, Taiji

    2011-01-01

    Benzoxazinones (Bxs) are major defensive secondary metabolites in wheat (Triticum aestivum), rye (Secale cereale), and maize (Zea mays). Here, we identified full sets of homeologous and paralogous genes encoding Bx glucosyltransferase (GT) and Bx-glucoside glucosidase (Glu) in hexaploid wheat (2n = 6x = 42; AABBDD). Four GT loci (TaGTa–TaGTd) were mapped on chromosomes 7A, 7B (two loci), and 7D, whereas four glu1 loci (Taglu1a–Taglu1d) were on chromosomes 2A, 2B (two loci), and 2D. Transcript levels differed greatly among the four loci; B-genome loci of both TaGT and Taglu1 genes were preferentially transcribed. Catalytic properties of the enzyme encoded by each homeolog/paralog also differed despite high levels of identity among amino acid sequences. The predominant contribution of the B genome to GT and Glu reactions was revealed, as observed previously for the five Bx biosynthetic genes, TaBx1 to TaBx5, which are separately located on homeologous groups 4 and 5 chromosomes. In rye, where the ScBx1 to ScBx5 genes are dispersed to chromosomes 7R and 5R, ScGT and Scglu were located separately on chromosomes 4R and 2R, respectively. The dispersal of Bx-pathway loci to four distinct chromosomes in hexaploid wheat and rye suggests that the clustering of Bx-pathway genes, as found in maize, is not essential for coordinated transcription. On the other hand, barley (Hordeum vulgare) was found to lack the orthologous GT and glu loci like the Bx1 to Bx5 loci despite its close phylogenetic relationship with wheat and rye. These results contribute to our understanding of the evolutionary processes that the Bx-pathway loci have undergone in grasses. PMID:21875895

  7. Dispersed benzoxazinone gene cluster: molecular characterization and chromosomal localization of glucosyltransferase and glucosidase genes in wheat and rye.

    Science.gov (United States)

    Sue, Masayuki; Nakamura, Chihiro; Nomura, Taiji

    2011-11-01

    Benzoxazinones (Bxs) are major defensive secondary metabolites in wheat (Triticum aestivum), rye (Secale cereale), and maize (Zea mays). Here, we identified full sets of homeologous and paralogous genes encoding Bx glucosyltransferase (GT) and Bx-glucoside glucosidase (Glu) in hexaploid wheat (2n = 6x = 42; AABBDD). Four GT loci (TaGTa-TaGTd) were mapped on chromosomes 7A, 7B (two loci), and 7D, whereas four glu1 loci (Taglu1a-Taglu1d) were on chromosomes 2A, 2B (two loci), and 2D. Transcript levels differed greatly among the four loci; B-genome loci of both TaGT and Taglu1 genes were preferentially transcribed. Catalytic properties of the enzyme encoded by each homeolog/paralog also differed despite high levels of identity among amino acid sequences. The predominant contribution of the B genome to GT and Glu reactions was revealed, as observed previously for the five Bx biosynthetic genes, TaBx1 to TaBx5, which are separately located on homeologous groups 4 and 5 chromosomes. In rye, where the ScBx1 to ScBx5 genes are dispersed to chromosomes 7R and 5R, ScGT and Scglu were located separately on chromosomes 4R and 2R, respectively. The dispersal of Bx-pathway loci to four distinct chromosomes in hexaploid wheat and rye suggests that the clustering of Bx-pathway genes, as found in maize, is not essential for coordinated transcription. On the other hand, barley (Hordeum vulgare) was found to lack the orthologous GT and glu loci like the Bx1 to Bx5 loci despite its close phylogenetic relationship with wheat and rye. These results contribute to our understanding of the evolutionary processes that the Bx-pathway loci have undergone in grasses.

  8. Linking secondary metabolites to gene clusters through genome sequencing of six diverse Aspergillus species

    DEFF Research Database (Denmark)

    Kjærbølling, Inge; Vesth, Tammi C.; Frisvad, Jens C.

    2018-01-01

    to determine phylogeny and genetic diversity, showing that each presented genome contains 15–27% genes not found in other sequenced Aspergilli. In particular, A. novofumigatus was compared with the pathogenic species A. fumigatus. This suggests that A. novofumigatus can produce most of the same allergens......, virulence, and pathogenicity factors as A. fumigatus, suggesting that A. novofumigatus could be as pathogenic as A. fumigatus. Furthermore, SMs were linked to gene clusters based on biological and chemical knowledge and analysis, genome sequences, and predictive algorithms. We thus identify putative SM....... campestris, A. novofumigatus, A. ochraceoroseus, and A. steynii) have been whole-genome PacBio sequenced to provide genetic references in three Aspergillus sections. A. taichungensis and A. candidus also were sequenced for SM elucidation. Thirteen Aspergillus genomes were analyzed with comparative genomics...

  9. Characteristic beta-globin gene cluster haplotypes of Evenkis and Oroqens in north China.

    Science.gov (United States)

    Shimizu, Koji; Marubayashi, Azusa; Tokimasa, Kozue; Harihara, Shinji; Omoto, Keiichi; Imanishi, Tadashi; Hao, Luping; Jin, Feng

    2004-10-01

    Haplotype frequencies of the beta-globin gene cluster were estimated for 114 Evenkis and 81 Oroqens from northeast China, and their characteristics were compared with those in Japanese, Koreans, and three Colombian Amerindian groups of South America (Wayuu, Kamsa, and Inga tribes). A major 5' subhaplotype (5' to the delta-globin gene) was + - - - - in Evenkis, whereas + - - - -, - + + - +, and - + - + + were the major subhaplotypes in Oroqens. One possible candidate for an ancestral 5' subhaplotype, - - - - -, was found in one Evenki (0.5%) and three Oroqen chromosomes (2.0%). They were observed as heterozygous forms for + ---- and -----. Major haplotypes were +-----+, + -----+-, and + - - - - + + in Evenkis, whereas they were +-----+,-++-+-+, +----+-, and -+-++-+ in Oroqens. The lowest Nei's genetic distance values of Evenkis or Oroqens based on the 5' subhaplotype frequency distributions were observed in relation to the Wayuu or Koreans, respectively, but those of Evenkis and Oroqens based on the haplotype frequency distributions were found in relation to Koreans.

  10. Heterologous reconstitution of the intact geodin gene cluster in Aspergillus nidulans through a simple and versatile PCR based approach.

    Directory of Open Access Journals (Sweden)

    Morten Thrane Nielsen

    Full Text Available Fungal natural products are a rich resource for bioactive molecules. To fully exploit this potential it is necessary to link genes to metabolites. Genetic information for numerous putative biosynthetic pathways has become available in recent years through genome sequencing. However, the lack of solid methodology for genetic manipulation of most species severely hampers pathway characterization. Here we present a simple PCR based approach for heterologous reconstitution of intact gene clusters. Specifically, the putative gene cluster responsible for geodin production from Aspergillus terreus was transferred in a two step procedure to an expression platform in A. nidulans. The individual cluster fragments were generated by PCR and assembled via efficient USER fusion prior to transformation and integration via re-iterative gene targeting. A total of 13 open reading frames contained in 25 kb of DNA were successfully transferred between the two species enabling geodin synthesis in A. nidulans. Subsequently, functions of three genes in the cluster were validated by genetic and chemical analyses. Specifically, ATEG_08451 (gedC encodes a polyketide synthase, ATEG_08453 (gedR encodes a transcription factor responsible for activation of the geodin gene cluster and ATEG_08460 (gedL encodes a halogenase that catalyzes conversion of sulochrin to dihydrogeodin. We expect that our approach for transferring intact biosynthetic pathways to a fungus with a well developed genetic toolbox will be instrumental in characterizing the many exciting pathways for secondary metabolite production that are currently being uncovered by the fungal genome sequencing projects.

  11. Expression-based clustering of CAZyme-encoding genes of Aspergillus niger.

    Science.gov (United States)

    Gruben, Birgit S; Mäkelä, Miia R; Kowalczyk, Joanna E; Zhou, Miaomiao; Benoit-Gelber, Isabelle; De Vries, Ronald P

    2017-11-23

    The Aspergillus niger genome contains a large repertoire of genes encoding carbohydrate active enzymes (CAZymes) that are targeted to plant polysaccharide degradation enabling A. niger to grow on a wide range of plant biomass substrates. Which genes need to be activated in certain environmental conditions depends on the composition of the available substrate. Previous studies have demonstrated the involvement of a number of transcriptional regulators in plant biomass degradation and have identified sets of target genes for each regulator. In this study, a broad transcriptional analysis was performed of the A. niger genes encoding (putative) plant polysaccharide degrading enzymes. Microarray data focusing on the initial response of A. niger to the presence of plant biomass related carbon sources were analyzed of a wild-type strain N402 that was grown on a large range of carbon sources and of the regulatory mutant strains ΔxlnR, ΔaraR, ΔamyR, ΔrhaR and ΔgalX that were grown on their specific inducing compounds. The cluster analysis of the expression data revealed several groups of co-regulated genes, which goes beyond the traditionally described co-regulated gene sets. Additional putative target genes of the selected regulators were identified, based on their expression profile. Notably, in several cases the expression profile puts questions on the function assignment of uncharacterized genes that was based on homology searches, highlighting the need for more extensive biochemical studies into the substrate specificity of enzymes encoded by these non-characterized genes. The data also revealed sets of genes that were upregulated in the regulatory mutants, suggesting interaction between the regulatory systems and a therefore even more complex overall regulatory network than has been reported so far. Expression profiling on a large number of substrates provides better insight in the complex regulatory systems that drive the conversion of plant biomass by fungi. In

  12. Inactivation of the indole-diterpene biosynthetic gene cluster of Claviceps paspali by Agrobacterium-mediated gene replacement.

    Science.gov (United States)

    Kozák, László; Szilágyi, Zoltán; Vágó, Barbara; Kakuk, Annamária; Tóth, László; Molnár, István; Pócsi, István

    2018-04-01

    The hypocrealean fungus Claviceps paspali is a parasite of wild grasses. This fungus is widely utilized in the pharmaceutical industry for the manufacture of ergot alkaloids, but also produces tremorgenic and neurotoxic indole-diterpene (IDT) secondary metabolites such as paspalitrems A and B. IDTs cause significant losses in agriculture and represent health hazards that threaten food security. Conversely, IDTs may also be utilized as lead compounds for pharmaceutical drug discovery. Current protoplast-mediated transformation protocols of C. paspali are inadequate as they suffer from inefficiencies in protoplast regeneration, a low frequency of DNA integration, and a low mitotic stability of the nascent transformants. We adapted and optimized Agrobacterium tumefaciens-mediated transformation (ATMT) for C. paspali and validated this method with the straightforward creation of a mutant strain of this fungus featuring a targeted replacement of key genes in the putative IDT biosynthetic gene cluster. Complete abrogation of IDT production in isolates of the mutant strain proved the predicted involvement of the target genes in the biosynthesis of IDTs. The mutant isolates continued to produce ergot alkaloids undisturbed, indicating that equivalent mutants generated in industrial ergot producers may have a better safety profile as they are devoid of IDT-type mycotoxins. Meanwhile, ATMT optimized for Claviceps spp. may open the door for the facile genetic engineering of these industrially and ecologically important organisms.

  13. The diversity and expansion of the trans-sialidase gene family is a common feature in Trypanosoma cruzi clade members.

    Science.gov (United States)

    Chiurillo, Miguel Angel; Cortez, Danielle R; Lima, Fábio M; Cortez, Caroline; Ramírez, José Luis; Martins, Andre G; Serrano, Myrna G; Teixeira, Marta M G; da Silveira, José Franco

    2016-01-01

    Trans-sialidase (TS) is a polymorphic protein superfamily described in members of the protozoan genus Trypanosoma. Of the eight TS groups recently described, TS group I proteins (some of which have catalytic activity) are present in the distantly related Trypanosoma brucei and Trypanosoma cruzi phylogenetic clades, whereas other TS groups have only been described in some species belonging to the T. cruzi clade. In the present study we analyzed the repertoire, distribution and phylogenetic relationships of TS genes among species of the T. cruzi clade based on sequence similarity, multiple sequence alignment and tree-reconstruction approaches using TS sequences obtained with the aid of PCR-based strategies or retrieved from genome databases. We included the following representative isolates of the T. cruzi clade from South America: T. cruzi, T. cruzi Tcbat, Trypanosoma cruzi marinkellei, Trypanosoma dionisii, Trypanosoma rangeli and Trypanosoma conorhini. The cloned sequences encoded conserved TS protein motifs Asp-box and VTVxNVxLYNR but lacked the FRIP motif (conserved in TS group I). The T. conorhini sequences were the most divergent. The hybridization patterns of TS probes with chromosomal bands confirmed the abundance of these sequences in species in the T. cruzi clade. Divergence and relationship analysis placed most of the TS sequences in the groups defined in T. cruzi. Further examination of members of TS group II, which includes T. cruzi surface glycoproteins implicated in host cell attachment and invasion, showed that sequences of T. cruzi Tcbat grouped with those of T. cruzi genotype TcI. Our analysis indicates that different members of the T. cruzi clade, with different vertebrate hosts, vectors and pathogenicity, share the extensive expansion and sequence diversification of the TS gene family. Altogether, our results are congruent with the evolutionary history of the T. cruzi clade and represent a contribution to the understanding of the molecular

  14. Hessian regularization based symmetric nonnegative matrix factorization for clustering gene expression and microbiome data.

    Science.gov (United States)

    Ma, Yuanyuan; Hu, Xiaohua; He, Tingting; Jiang, Xingpeng

    2016-12-01

    Nonnegative matrix factorization (NMF) has received considerable attention due to its interpretation of observed samples as combinations of different components, and has been successfully used as a clustering method. As an extension of NMF, Symmetric NMF (SNMF) inherits the advantages of NMF. Unlike NMF, however, SNMF takes a nonnegative similarity matrix as an input, and two lower rank nonnegative matrices (H, H T ) are computed as an output to approximate the original similarity matrix. Laplacian regularization has improved the clustering performance of NMF and SNMF. However, Laplacian regularization (LR), as a classic manifold regularization method, suffers some problems because of its weak extrapolating ability. In this paper, we propose a novel variant of SNMF, called Hessian regularization based symmetric nonnegative matrix factorization (HSNMF), for this purpose. In contrast to Laplacian regularization, Hessian regularization fits the data perfectly and extrapolates nicely to unseen data. We conduct extensive experiments on several datasets including text data, gene expression data and HMP (Human Microbiome Project) data. The results show that the proposed method outperforms other methods, which suggests the potential application of HSNMF in biological data clustering. Copyright © 2016. Published by Elsevier Inc.

  15. Identification, characterization, and expression of three new members of the Borrelia burgdorferi Mlp (2.9) lipoprotein gene family.

    Science.gov (United States)

    Yang, X; Popova, T G; Hagman, K E; Wikel, S K; Schoeler, G B; Caimano, M J; Radolf, J D; Norgard, M V

    1999-11-01

    We previously reported on the existence of a family of lipoprotein genes, designated 2.9 lipoprotein genes, encoded in at least seven versions on the circular (supercoiled) cp32 and cp18 plasmids of Borrelia burgdorferi 297. A distinguishing feature of the 2.9 lipoproteins were highly similar signal sequences but variable mature polypeptides that segregated into two antigenic classes. Further screenings of B. burgdorferi 297 genomic libraries led to the identification of three additional 2.9 lipoprotein genes, renamed herein mlp, for multicopy lipoprotein genes. Computer analyses and immunoblotting revealed that Mlp-9 segregated with the antigenic class I lipoproteins, whereas Mlp-8 and Mlp-10 were members of class II. Northern blotting showed that all three of the mlp genes were expressed when B. burgdorferi was cultivated in vitro at 34 degrees C, although mlp-9 and mlp-10 transcripts were expressed at very low levels. Additional combined immunoblotting and comparative reverse transcription-PCR analyses performed on borreliae cultivated in vitro at 23, 34, or 37 degrees C indicated that although Mlp-8 was substantially more abundant than Mlp-9 or Mlp-10, all three of the mlp genes were upregulated during B. burgdorferi replication at 37 degrees C. Expression of the same three lipoproteins was further enhanced upon growth of the spirochetes within dialysis membrane chambers (DMCs) implanted intraperitoneally in rats (i.e., spirochetes in a mammalian host-adapted state), suggesting that temperature alone did not account for maximal upregulation of the mlp genes. That certain mlp genes are likely expressed during the growth of B. burgdorferi in mammalian tissues was supported by findings of antibodies against all three Mlp lipoproteins in mice after challenge with Ixodes scapularis nymphs harboring B. burgdorferi 297. The combined data suggest that as opposed to being differentially expressed in any reciprocal fashion (e.g., OspA/OspC), at least three mlp genes are

  16. Human major histocompatibility complex contains a minimum of 19 genes between the complement cluster and HLA-B

    International Nuclear Information System (INIS)

    Spies, T.; Bresnahan, M.; Strominger, J.L.

    1989-01-01

    A 600-kilobase (kb) DNA segment from the human major histocompatibility complex (MHC) class III region was isolated by extension of a previous 435-kb chromosome walk. The contiguous series of cloned overlapping cosmids contains the entire 555-kb interval between C2 in the complement gene cluster and HLA-B. This region is known to encode the tumor necrosis factors (TNFs) α and β, B144, and the major heat shock protein HSP70. Moreover, a cluster of genes, BAT1-BAT5 (HLA-B-associated transcripts) have been localized in the vicinity of the genes for TNFα and TNFβ. An additional four genes were identified by isolation of corresponding cDNA clones with cosmid DNA probes. These genes for BAT6-BAT9 were mapped near the gene for C2 within a 120-kb region that includes a HSP70 gene pair. These results, together with complementary data from a similar recent study, indicated the presence of a minimum of 19 genes within the C2-HLA-B interval of the MHC class III region. Although the functional properties of most of these genes are yet unknown, they may be involved in some aspects of immunity. This idea is supported by the genetic mapping of the hematopoietic histocompatibility locus-1 (Hh-1) in recombinant mice between TNFα and H-2S, which is homologous to the complement gene cluster in humans

  17. Microbial communication leading to the activation of silent fungal secondary metabolite gene clusters

    Directory of Open Access Journals (Sweden)

    Tina eNetzker

    2015-04-01

    Full Text Available Microorganisms form diverse multispecies communities in various ecosystems. The high abundance of fungal and bacterial species in these consortia results in specific communication between the microorganisms. A key role in this communication is played by secondary metabolites (SMs, which are also called natural products. Recently, it was shown that interspecies ‘talk’ between microorganisms represents a physiological trigger to activate silent gene clusters leading to the formation of novel SMs by the involved species. This review focuses on mixed microbial cultivation, mainly between bacteria and fungi, with a special emphasis on the induced formation of fungal SMs in co-cultures. In addition, the role of chromatin remodeling in the induction is examined, and methodical perspectives for the analysis of natural products are presented. As an example for an intermicrobial interaction elucidated at the molecular level, we discuss the specific interaction between the filamentous fungi Aspergillus nidulans and Aspergillus fumigatus with the soil bacterium Streptomyces rapamycinicus, which provides an excellent model system to enlighten molecular concepts behind regulatory mechanisms and will pave the way to a novel avenue of drug discovery through targeted activation of silent SM gene clusters through co-cultivations of microorganisms.

  18. Directed natural product biosynthesis gene cluster capture and expression in the model bacterium Bacillus subtilis

    Science.gov (United States)

    Li, Yongxin; Li, Zhongrui; Yamanaka, Kazuya; Xu, Ying; Zhang, Weipeng; Vlamakis, Hera; Kolter, Roberto; Moore, Bradley S.; Qian, Pei-Yuan

    2015-03-01

    Bacilli are ubiquitous low G+C environmental Gram-positive bacteria that produce a wide assortment of specialized small molecules. Although their natural product biosynthetic potential is high, robust molecular tools to support the heterologous expression of large biosynthetic gene clusters in Bacillus hosts are rare. Herein we adapt transformation-associated recombination (TAR) in yeast to design a single genomic capture and expression vector for antibiotic production in Bacillus subtilis. After validating this direct cloning ``plug-and-play'' approach with surfactin, we genetically interrogated amicoumacin biosynthetic gene cluster from the marine isolate Bacillus subtilis 1779. Its heterologous expression allowed us to explore an unusual maturation process involving the N-acyl-asparagine pro-drug intermediates preamicoumacins, which are hydrolyzed by the asparagine-specific peptidase into the active component amicoumacin A. This work represents the first direct cloning based heterologous expression of natural products in the model organism B. subtilis and paves the way to the development of future genome mining efforts in this genus.

  19. Heterologous expression of the Halothiobacillus neapolitanus carboxysomal gene cluster in Corynebacterium glutamicum.

    Science.gov (United States)

    Baumgart, Meike; Huber, Isabel; Abdollahzadeh, Iman; Gensch, Thomas; Frunzke, Julia

    2017-09-20

    Compartmentalization represents a ubiquitous principle used by living organisms to optimize metabolic flux and to avoid detrimental interactions within the cytoplasm. Proteinaceous bacterial microcompartments (BMCs) have therefore created strong interest for the encapsulation of heterologous pathways in microbial model organisms. However, attempts were so far mostly restricted to Escherichia coli. Here, we introduced the carboxysomal gene cluster of Halothiobacillus neapolitanus into the biotechnological platform species Corynebacterium gluta-micum. Transmission electron microscopy, fluorescence microscopy and single molecule localization microscopy suggested the formation of BMC-like structures in cells expressing the complete carboxysome operon or only the shell proteins. Purified carboxysomes consisted of the expected protein components as verified by mass spectrometry. Enzymatic assays revealed the functional production of RuBisCO in C. glutamicum both in the presence and absence of carboxysomal shell proteins. Furthermore, we could show that eYFP is targeted to the carboxysomes by fusion to the large RuBisCO subunit. Overall, this study represents the first transfer of an α-carboxysomal gene cluster into a Gram-positive model species supporting the modularity and orthogonality of these microcompartments, but also identified important challenges which need to be addressed on the way towards biotechnological application. Copyright © 2017 Elsevier B.V. All rights reserved.

  20. Directed natural product biosynthesis gene cluster capture and expression in the model bacterium Bacillus subtilis

    KAUST Repository

    Li, Yongxin

    2015-03-24

    Bacilli are ubiquitous low G+C environmental Gram-positive bacteria that produce a wide assortment of specialized small molecules. Although their natural product biosynthetic potential is high, robust molecular tools to support the heterologous expression of large biosynthetic gene clusters in Bacillus hosts are rare. Herein we adapt transformation-associated recombination (TAR) in yeast to design a single genomic capture and expression vector for antibiotic production in Bacillus subtilis. After validating this direct cloning plug-and-playa approach with surfactin, we genetically interrogated amicoumacin biosynthetic gene cluster from the marine isolate Bacillus subtilis 1779. Its heterologous expression allowed us to explore an unusual maturation process involving the N-acyl-asparagine pro-drug intermediates preamicoumacins, which are hydrolyzed by the asparagine-specific peptidase into the active component amicoumacin A. This work represents the first direct cloning based heterologous expression of natural products in the model organism B. subtilis and paves the way to the development of future genome mining efforts in this genus.

  1. Microbial communication leading to the activation of silent fungal secondary metabolite gene clusters.

    Science.gov (United States)

    Netzker, Tina; Fischer, Juliane; Weber, Jakob; Mattern, Derek J; König, Claudia C; Valiante, Vito; Schroeckh, Volker; Brakhage, Axel A

    2015-01-01

    Microorganisms form diverse multispecies communities in various ecosystems. The high abundance of fungal and bacterial species in these consortia results in specific communication between the microorganisms. A key role in this communication is played by secondary metabolites (SMs), which are also called natural products. Recently, it was shown that interspecies "talk" between microorganisms represents a physiological trigger to activate silent gene clusters leading to the formation of novel SMs by the involved species. This review focuses on mixed microbial cultivation, mainly between bacteria and fungi, with a special emphasis on the induced formation of fungal SMs in co-cultures. In addition, the role of chromatin remodeling in the induction is examined, and methodical perspectives for the analysis of natural products are presented. As an example for an intermicrobial interaction elucidated at the molecular level, we discuss the specific interaction between the filamentous fungi Aspergillus nidulans and Aspergillus fumigatus with the soil bacterium Streptomyces rapamycinicus, which provides an excellent model system to enlighten molecular concepts behind regulatory mechanisms and will pave the way to a novel avenue of drug discovery through targeted activation of silent SM gene clusters through co-cultivations of microorganisms.

  2. Global analysis of biosynthetic gene clusters reveals vast potential of secondary metabolite production in Penicillium species.

    Science.gov (United States)

    Nielsen, Jens Christian; Grijseels, Sietske; Prigent, Sylvain; Ji, Boyang; Dainat, Jacques; Nielsen, Kristian Fog; Frisvad, Jens Christian; Workman, Mhairi; Nielsen, Jens

    2017-04-03

    Filamentous fungi produce a wide range of bioactive compounds with important pharmaceutical applications, such as antibiotic penicillins and cholesterol-lowering statins. However, less attention has been paid to fungal secondary metabolites compared to those from bacteria. In this study, we sequenced the genomes of 9 Penicillium species and, together with 15 published genomes, we investigated the secondary metabolism of Penicillium and identified an immense, unexploited potential for producing secondary metabolites by this genus. A total of 1,317 putative biosynthetic gene clusters (BGCs) were identified, and polyketide synthase and non-ribosomal peptide synthetase based BGCs were grouped into gene cluster families and mapped to known pathways. The grouping of BGCs allowed us to study the evolutionary trajectory of pathways based on 6-methylsalicylic acid (6-MSA) synthases. Finally, we cross-referenced the predicted pathways with published data on the production of secondary metabolites and experimentally validated the production of antibiotic yanuthones in Penicillia and identified a previously undescribed compound from the yanuthone pathway. This study is the first genus-wide analysis of the genomic diversity of Penicillia and highlights the potential of these species as a source of new antibiotics and other pharmaceuticals.

  3. [Variety of thrombotic thrombocytopenic purpura clinical course in Polish family members with ADAMTS 13 gene mutation].

    Science.gov (United States)

    Hyla-Klekot, Lidia; Kucharska, Grazyna; Słonka, Karina

    2013-03-01

    The congenital form of thrombotic thrombocytopenic purpura (Upshaw-Schulman syndrom) is a result of genetically conditioned dysfunction of protease ADAMTS 13 enzyme which is responsible for von Wiellebrand factor multimer disintegration. The disease is inherited autosomally and recessively. The decrease of ADAMTS 13 activity results in intravascular clotting process activation with rapid lowering of platelet count, haemolytic anaemia, and occurence of schistocytes. Clinically, the disease is characterized by a range of symptoms such as severe jaundice in neonatal period, embolicthrombotic incidents of nervous system and progressive dysfunction of kidneys and other organs. Delaying diagnosis and hence administering of freshly frozen plasma leads to death. Molecular diagnosis allows for identification of genetical profile of the patient, and showing lowered enzyme activity is a basis for regular prophylactic plasma administration which is the protease donor. In our study we present members of a Polish family identified with ADAMTS 13 mutation. 52 old male with heterozygotic mutation of exon 29 (4143_4144insA) and in exon 19 (c2281G>A; Gly761Ser), experienced a few episodes of ischaemic stroke with ongoing neurological deficiency and developed chronic kidney disease. His 16-year old daughter with double homozygotic mutation in exon 29 (4143_4144insA) after severe episode of TTP at the age of 4 has been receiving plasma every 2 weeks for 12 years, which prevented her from other disorders. Target treatment introduced to clinical practice by means of ADAMTS 13 obtained by genetic recombination technology raises hopes.

  4. Nonblack patients with sickle cell disease have African. beta. sup s gene cluster haplotypes

    Energy Technology Data Exchange (ETDEWEB)

    Rogers, Z.R.; Powars, D.R.; Williams, W.D. (Univ. of Southern California School of Medicine, Los Angeles (USA)); Kinney, T.R. (Duke Univ., Durham, NC (USA)); Schroeder, W.A. (California Institute of Technology, Pasadena (USA))

    1989-05-26

    Of 18 nonblack patients with sickle cell disease, 14 had sickle cell anemia, 2 had hemoglobin SC disease, and 2 had hemoglobin S-{beta}{sup o}-thalassemia. The {beta}{sup s} gene cluster haplotypes that were determined in 7 patients were of African origin and were identified as Central African Republic, Central African Republic minor II, Benin, and Senegal. The haplotype Central African Republic minor II was present on the {beta}{sup o}-thalassemia chromosome in 2 patients. None of 10 patients whose {alpha}-gene status was determined had {alpha}-thalassemia-2. These data strongly support the concept that the {beta}{sup s} gene on chromosome 11 of these individuals is of African origin and that the {alpha}-gene locus on chromosome 16 is of white or native American origin. The clinical severity of the disease in these nonblack patients is appropriate to their haplotype without {alpha}-thalassemia-2 and is comparable with that of black patients. All persons with congenital hemolytic anemia should be examined for the presence of sickle cell disease regardless of physical appearance or ethnic background.

  5. Fungal metabolic gene clusters – caravans traveling across genomes and environments

    Directory of Open Access Journals (Sweden)

    Jennifer Hughes Wisecaver

    2015-03-01

    Full Text Available Metabolic gene clusters (MGCs, physically co-localized genes participating in the same metabolic pathway, are signature features of fungal genomes. MGCs are most often observed in specialized metabolism, having evolved in individual fungal lineages in response to specific ecological needs, such as the utilization of uncommon nutrients (e.g., galactose and allantoin or the production of secondary metabolic antimicrobial compounds and virulence factors (e.g., aflatoxin and melanin. A flurry of recent studies has shown that several MGCs, whose functions are often associated with fungal virulence as well as with the evolutionary arms race between fungi and their competitors, have experienced horizontal gene transfer (HGT. In this minireview, after briefly introducing HGT as a source of gene innovation, we examine the evidence for HGT’s involvement on the evolution of MGCs and, more generally of fungal metabolism, enumerate the molecular mechanisms that mediate such transfers and the ecological circumstances that favor them, as well as discuss the types of evidence required for inferring the presence of HGT in MGCs. The currently available examples indicate that transfers of entire MGCs have taken place between closely related fungal species as well as distant ones and that they sometimes involve large chromosomal segments. These results suggest that the HGT-mediated acquisition of novel metabolism is an ongoing and successful ecological strategy for many fungal species.

  6. Isolation of Resistance Gene Candidates (RGCs) and characterization of an RGC cluster in cassava.

    Science.gov (United States)

    López, C E; Zuluaga, A P; Cooke, R; Delseny, M; Tohme, J; Verdier, V

    2003-08-01

    Plant disease resistance genes (R genes) show significant similarity amongst themselves in terms of both their DNA sequences and structural motifs present in their protein products. Oligonucleotide primers designed from NBS (Nucleotide Binding Site) domains encoded by several R-genes have been used to amplify NBS sequences from the genomic DNA of various plant species, which have been called Resistance Gene Analogues (RGAs) or Resistance Gene Candidates (RGCs). Using specific primers from the NBS and TIR (Toll/Interleukin-1 Receptor) regions, we identified twelve classes of RGCs in cassava (Manihot esculenta Crantz). Two classes were obtained from the PCR-amplification of the TIR domain. The other 10 classes correspond to the NBS sequences and were grouped into two subfamilies. Classes RCa1 to RCa5 are part of the first subfamily and were linked to a TIR domain in the N terminus. Classes RCa6 to RCa10 corresponded to non-TIR NBS-LRR encoding sequences. BAC library screening with the 12 RGC classes as probes allowed the identification of 42 BAC clones that were assembled into 10 contigs and 19 singletons. Members of the two TIR and non-TIR NBS-LRR subfamilies occurred together within individual BAC clones. The BAC screening and Southern hybridization analyses showed that all RGCs were single copy sequences except RCa6 that represented a large and diverse gene family. One BAC contained five NBS sequences and sequence analysis allowed the identification of two complete RGCs encoding two highly similar proteins. This BAC was located on linkage group J with three other RGC-containing BACs. At least one of these genes, RGC2, is expressed constitutively in cassava tissues.

  7. Plasmid Complement of Lactococcus lactis NCDO712 Reveals a Novel Pilus Gene Cluster.

    Science.gov (United States)

    Tarazanova, Mariya; Beerthuyzen, Marke; Siezen, Roland; Fernandez-Gutierrez, Marcela M; de Jong, Anne; van der Meulen, Sjoerd; Kok, Jan; Bachmann, Herwig

    2016-01-01

    Lactococcus lactis MG1363 is an important gram-positive model organism. It is a plasmid-free and phage-cured derivative of strain NCDO712. Plasmid-cured strains facilitate studies on molecular biological aspects, but many properties which make L. lactis an important organism in the dairy industry are plasmid encoded. We sequenced the total DNA of strain NCDO712 and, contrary to earlier reports, revealed that the strain carries 6 rather than 5 plasmids. A new 50-kb plasmid, designated pNZ712, encodes functional nisin immunity (nisCIP) and copper resistance (lcoRSABC). The copper resistance could be used as a marker for the conjugation of pNZ712 to L. lactis MG1614. A genome comparison with the plasmid cured daughter strain MG1363 showed that the number of single nucleotide polymorphisms that accumulated in the laboratory since the strains diverted more than 30 years ago is limited to 11 of which only 5 lead to amino acid changes. The 16-kb plasmid pSH74 was found to contain a novel 8-kb pilus gene cluster spaCB-spaA-srtC1-srtC2, which is predicted to encode a pilin tip protein SpaC, a pilus basal subunit SpaB, and a pilus backbone protein SpaA. The sortases SrtC1/SrtC2 are most likely involved in pilus polymerization while the chromosomally encoded SrtA could act to anchor the pilus to peptidoglycan in the cell wall. Overexpression of the pilus gene cluster from a multi-copy plasmid in L. lactis MG1363 resulted in cell chaining, aggregation, rapid sedimentation and increased conjugation efficiency of the cells. Electron microscopy showed that the over-expression of the pilus gene cluster leads to appendices on the cell surfaces. A deletion of the gene encoding the putative basal protein spaB, by truncating spaCB, led to more pilus-like structures on the cell surface, but cell aggregation and cell chaining were no longer observed. This is consistent with the prediction that spaB is involved in the anchoring of the pili to the cell.

  8. The biosynthetic gene cluster for the cyanogenic glucoside dhurrin in Sorghum bicolor contains its co-expressed vacuolar MATE transporter

    DEFF Research Database (Denmark)

    Darbani Shirvanehdeh, Behrooz; Motawie, Mohammed Saddik; Olsen, Carl Erik

    2016-01-01

    for the cyanogenic glucoside dhurrin in Sorghum bicolor additionally contains a gene, SbMATE2, encoding a transporter of the multidrug and toxic compound extrusion (MATE) family, which is co-expressed with the biosynthetic genes. The predicted localisation of SbMATE2 to the vacuolar membrane was demonstrated......-glucoside or the glucosinolate indol-3-yl-methyl glucosinolate. The genomic co-localisation of a transporter gene with the biosynthetic genes producing the transported compound is discussed in relation to the role self-toxicity of chemical defence compounds may play in the formation of gene clusters....

  9. Conservation of a vitellogenin gene cluster in oviparous vertebrates and identification of its traces in the platypus genome.

    Science.gov (United States)

    Babin, Patrick J

    2008-04-30

    Vitellogenin (Vtg) derivatives are the main egg-yolk proteins in most oviparous animal species, and are, therefore, key players in reproduction and embryo development. Conserved synteny and phylogeny were used to identify a Vtg gene cluster (VGC) that had been evolutionarily conserved in most oviparous vertebrates, encompassing the three linked Vtgs on chicken (Gallus gallus) chromosome 8. Tandem arranged homologs to chicken VtgII and VtgIII were retrieved in similar locations in Xenopus (Xenopus tropicalis) and homologous transcribed inverted genes were found in medaka (Oryzias latipes), stickleback (Gasterosteus aculeatus), pufferfish (Takifugu rubripes), and Tetrahodon (Tetraodon nigroviridis), while zebrafish (Danio rerio) Vtg3 may represent a residual trace of VGC in this genome. Vtgs were not conserved in the paralogous chromosomal segment attributed to a whole-genome duplication event in the ancestor of teleosts, while tandem duplicated forms have survived the recent African clawed frog (Xenopus laevis) tetraploidization. Orthologs to chicken VtgI were found in similar locations in teleost fish, as well as in the platypus (Ornithorhynchus anatinus). Additional Vtg fragments found suggested that VGC had been conserved in this egg-laying mammal. A low ratio of nonsynonymous-to-synonymous substitution values and the paucity of pseudogene features suggest functional platypus Vtg products. Genomic identification of Vtgs, Apob, and Mtp in this genome, together with maximum likelihood and Bayesian inference phylogenetic analyses, support the existence of these three large lipid transfer protein superfamily members at the base of the mammalian lineage. In conclusion, the establishment of a VGC in the vertebrate lineage predates the divergence of ray-finned fish and tetrapods and the shift in reproductive and developmental strategy observed between prototherians and therians may be associated with its loss, as shown by its absence from the genomic resources currently

  10. Genomic characterization of a new endophytic Streptomyces kebangsaanensis identifies biosynthetic pathway gene clusters for novel phenazine antibiotic production

    Directory of Open Access Journals (Sweden)

    Juwairiah Remali

    2017-11-01

    Full Text Available Background Streptomyces are well known for their capability to produce many bioactive secondary metabolites with medical and industrial importance. Here we report a novel bioactive phenazine compound, 6-((2-hydroxy-4-methoxyphenoxy carbonyl phenazine-1-carboxylic acid (HCPCA extracted from Streptomyces kebangsaanensis, an endophyte isolated from the ethnomedicinal Portulaca oleracea. Methods The HCPCA chemical structure was determined using nuclear magnetic resonance spectroscopy. We conducted whole genome sequencing for the identification of the gene cluster(s believed to be responsible for phenazine biosynthesis in order to map its corresponding pathway, in addition to bioinformatics analysis to assess the potential of S. kebangsaanensis in producing other useful secondary metabolites. Results The S. kebangsaanensis genome comprises an 8,328,719 bp linear chromosome with high GC content (71.35% consisting of 12 rRNA operons, 81 tRNA, and 7,558 protein coding genes. We identified 24 gene clusters involved in polyketide, nonribosomal peptide, terpene, bacteriocin, and siderophore biosynthesis, as well as a gene cluster predicted to be responsible for phenazine biosynthesis. Discussion The HCPCA phenazine structure was hypothesized to derive from the combination of two biosynthetic pathways, phenazine-1,6-dicarboxylic acid and 4-methoxybenzene-1,2-diol, originated from the shikimic acid pathway. The identification of a biosynthesis pathway gene cluster for phenazine antibiotics might facilitate future genetic engineering design of new synthetic phenazine antibiotics. Additionally, these findings confirm the potential of S. kebangsaanensis for producing various antibiotics and secondary metabolites.

  11. Degradation of Benzene by Pseudomonas veronii 1YdBTEX2 and 1YB2 Is Catalyzed by Enzymes Encoded in Distinct Catabolism Gene Clusters.

    Science.gov (United States)

    de Lima-Morales, Daiana; Chaves-Moreno, Diego; Wos-Oxley, Melissa L; Jáuregui, Ruy; Vilchez-Vargas, Ramiro; Pieper, Dietmar H

    2016-01-01

    Pseudomonas veronii 1YdBTEX2, a benzene and toluene degrader, and Pseudomonas veronii 1YB2, a benzene degrader, have previously been shown to be key players in a benzene-contaminated site. These strains harbor unique catabolic pathways for the degradation of benzene comprising a gene cluster encoding an isopropylbenzene dioxygenase where genes encoding downstream enzymes were interrupted by stop codons. Extradiol dioxygenases were recruited from gene clusters comprising genes encoding a 2-hydroxymuconic semialdehyde dehydrogenase necessary for benzene degradation but typically absent from isopropylbenzene dioxygenase-encoding gene clusters. The benzene dihydrodiol dehydrogenase-encoding gene was not clustered with any other aromatic degradation genes, and the encoded protein was only distantly related to dehydrogenases of aromatic degradation pathways. The involvement of the different gene clusters in the degradation pathways was suggested by real-time quantitative reverse transcription PCR. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  12. Identification of the Regulator Gene Responsible for the Acetone-Responsive Expression of the Binuclear Iron Monooxygenase Gene Cluster in Mycobacteria ▿

    Science.gov (United States)

    Furuya, Toshiki; Hirose, Satomi; Semba, Hisashi; Kino, Kuniki

    2011-01-01

    The mimABCD gene cluster encodes the binuclear iron monooxygenase that oxidizes propane and phenol in Mycobacterium smegmatis strain MC2 155 and Mycobacterium goodii strain 12523. Interestingly, expression of the mimABCD gene cluster is induced by acetone. In this study, we investigated the regulator gene responsible for this acetone-responsive expression. In the genome sequence of M. smegmatis strain MC2 155, the mimABCD gene cluster is preceded by a gene designated mimR, which is divergently transcribed. Sequence analysis revealed that MimR exhibits amino acid similarity with the NtrC family of transcriptional activators, including AcxR and AcoR, which are involved in acetone and acetoin metabolism, respectively. Unexpectedly, many homologs of the mimR gene were also found in the sequenced genomes of actinomycetes. A plasmid carrying a transcriptional fusion of the intergenic region between the mimR and mimA genes with a promoterless green fluorescent protein (GFP) gene was constructed and introduced into M. smegmatis strain MC2 155. Using a GFP reporter system, we confirmed by deletion and complementation analyses that the mimR gene product is the positive regulator of the mimABCD gene cluster expression that is responsive to acetone. M. goodii strain 12523 also utilized the same regulatory system as M. smegmatis strain MC2 155. Although transcriptional activators of the NtrC family generally control transcription using the σ54 factor, a gene encoding the σ54 factor was absent from the genome sequence of M. smegmatis strain MC2 155. These results suggest the presence of a novel regulatory system in actinomycetes, including mycobacteria. PMID:21856847

  13. Members of a large retroposon family are determinants of post-transcriptional gene expression in Leishmania.

    Directory of Open Access Journals (Sweden)

    Frédéric Bringaud

    2007-09-01

    Full Text Available Trypanosomatids are unicellular protists that include the human pathogens Leishmania spp. (leishmaniasis, Trypanosoma brucei (sleeping sickness, and Trypanosoma cruzi (Chagas disease. Analysis of their recently completed genomes confirmed the presence of non-long-terminal repeat retrotransposons, also called retroposons. Using the 79-bp signature sequence common to all trypanosomatid retroposons as bait, we identified in the Leishmania major genome two new large families of small elements--LmSIDER1 (785 copies and LmSIDER2 (1,073 copies--that fulfill all the characteristics of extinct trypanosomatid retroposons. LmSIDERs are approximately 70 times more abundant in L. major compared to T. brucei and are found almost exclusively within the 3'-untranslated regions (3'UTRs of L. major mRNAs. We provide experimental evidence that LmSIDER2 act as mRNA instability elements and that LmSIDER2-containing mRNAs are generally expressed at lower levels compared to the non-LmSIDER2 mRNAs. The considerable expansion of LmSIDERs within 3'UTRs in an organism lacking transcriptional control and their role in regulating mRNA stability indicate that Leishmania have probably recycled these short retroposons to globally modulate the expression of a number of genes. To our knowledge, this is the first example in eukaryotes of the domestication and expansion of a family of mobile elements that have evolved to fulfill a critical cellular function.

  14. A Gene Expressed during Sexual and Asexual Sporulation in Phytophthora infestans is a Member of the Puf Family of Translational Regulators

    DEFF Research Database (Denmark)

    Cvitanich, Cristina; Judelson, Howard S.

    2003-01-01

    A gene from Phytophthora infestans that was previously identified as being induced during the development of sexual spores was also found to be active during asexual sporulation. The gene, M90, was expressed as a 3.1-kb primary transcript containing two introns and was predicted to encode a member...

  15. Regulatory role of tetR gene in a novel gene cluster of Acidovorax avenae subsp. avenae RS-1 under oxidative stress

    Directory of Open Access Journals (Sweden)

    He eLiu

    2014-10-01

    Full Text Available Acidovorax avenae subsp. avenae is the causal agent of bacterial brown stripe disease in rice. In this study, we characterized a novel horizontal transfer of a gene cluster, including tetR, on the chromosome of A. avenae subsp. avenae RS-1 by genome-wide analysis. TetR acted as a repressor in this gene cluster and the oxidative stress resistance was enhanced in tetR-deletion mutant strain. Electrophoretic mobility shift assay (EMSA demonstrated that TetR regulator bound directly to the promoter of this gene cluster. Consistently, the results of quantitative real-time PCR also showed alterations in expression of associated genes. Moreover, the proteins affected by TetR under oxidative stress were revealed by comparing proteomic profiles of wild-type and mutant strains via 1D SDS-PAGE and LC-MS/MS analyses. Taken together, our results demonstrated that tetR gene in this novel gene cluster contributed to cell survival under oxidative stress, and TetR protein played an important regulatory role in growth kinetics, biofilm-forming capability, SOD and catalase activity, and oxide detoxicating ability.

  16. Regulatory role of tetR gene in a novel gene cluster of Acidovorax avenae subsp. avenae RS-1 under oxidative stress.

    Science.gov (United States)

    Liu, He; Yang, Chun-Lan; Ge, Meng-Yu; Ibrahim, Muhammad; Li, Bin; Zhao, Wen-Jun; Chen, Gong-You; Zhu, Bo; Xie, Guan-Lin

    2014-01-01

    Acidovorax avenae subsp. avenae is the causal agent of bacterial brown stripe disease in rice. In this study, we characterized a novel horizontal transfer of a gene cluster, including tetR, on the chromosome of A. avenae subsp. avenae RS-1 by genome-wide analysis. TetR acted as a repressor in this gene cluster and the oxidative stress resistance was enhanced in tetR-deletion mutant strain. Electrophoretic mobility shift assay demonstrated that TetR regulator bound directly to the promoter of this gene cluster. Consistently, the results of quantitative real-time PCR also showed alterations in expression of associated genes. Moreover, the proteins affected by TetR under oxidative stress were revealed by comparing proteomic profiles of wild-type and mutant strains via 1D SDS-PAGE and LC-MS/MS analyses. Taken together, our results demonstrated that tetR gene in this novel gene cluster contributed to cell survival under oxidative stress, and TetR protein played an important regulatory role in growth kinetics, biofilm-forming capability, superoxide dismutase and catalase activity, and oxide detoxicating ability.

  17. A gene cluster for the biosynthesis of moenomycin family antibiotics in the genome of teicoplanin producer Actinoplanes teichomyceticus.

    Science.gov (United States)

    Horbal, Liliya; Ostash, Bohdan; Luzhetskyy, Andriy; Walker, Suzanne; Kalinowski, Jorn; Fedorenko, Victor

    2016-09-01

    Moenomycins are phosphoglycolipid antibiotics notable for their extreme potency, unique mode of action, and proven record of use in animal nutrition without selection for resistant microflora. There is a keen interest in manipulation of structures of moenomycins in order to better understand their structure-activity relationships and to generate improved analogs. Only two almost identical moenomycin biosynthetic gene clusters are known, limiting our knowledge of the evolution of moenomycin pathways and our ability to genetically diversify them. Here, we report a novel gene cluster (tchm) that directs production of the phosphoglycolipid teichomycin in Actinoplanes teichomyceticus. Its overall genetic architecture is significantly different from that of the moenomycin biosynthesis (moe) gene clusters of Streptomyces ghanaensis and Streptomyces clavuligerus, featuring multiple gene rearrangements and two novel structural genes. Involvement of the tchm cluster in teichomycin biosynthesis was confirmed via heterologous co-expression of amidotransferase tchmH5 and moe genes. Our work sets the background for further engineering of moenomycins and for deeper inquiries into the evolution of this fascinating biosynthetic pathway.

  18. Evolution of C2H2-zinc finger genes and subfamilies in mammals: Species-specific duplication and loss of clusters, genes and effector domains

    Directory of Open Access Journals (Sweden)

    Aubry Muriel

    2008-06-01

    Full Text Available Abstract Background C2H2 zinc finger genes (C2H2-ZNF constitute the largest class of transcription factors in humans and one of the largest gene families in mammals. Often arranged in clusters in the genome, these genes are thought to have undergone a massive expansion in vertebrates, primarily by tandem duplication. However, this view is based on limited datasets restricted to a single chromosome or a specific subset of genes belonging to the large KRAB domain-containing C2H2-ZNF subfamily. Results Here, we present the first comprehensive study of the evolution of the C2H2-ZNF family in mammals. We assembled the complete repertoire of human C2H2-ZNF genes (718 in total, about 70% of which are organized into 81 clusters across all chromosomes. Based on an analysis of their N-terminal effector domains, we identified two new C2H2-ZNF subfamilies encoding genes with a SET or a HOMEO domain. We searched for the syntenic counterparts of the human clusters in other mammals for which complete gene data are available: chimpanzee, mouse, rat and dog. Cross-species comparisons show a large variation in the numbers of C2H2-ZNF genes within homologous mammalian clusters, suggesting differential patterns of evolution. Phylogenetic analysis of selected clusters reveals that the disparity in C2H2-ZNF gene repertoires across mammals not only originates from differential gene duplication but also from gene loss. Further, we discovered variations among orthologs in the number of zinc finger motifs and association of the effector domains, the latter often undergoing sequence degeneration. Combined with phylogenetic studies, physical maps and an analysis of the exon-intron organization of genes from the SCAN and KRAB domains-containing subfamilies, this result suggests that the SCAN subfamily emerged first, followed by the SCAN-KRAB and finally by the KRAB subfamily. Conclusion Our results are in agreement with the "birth and death hypothesis" for the evolution of

  19. Vibrio salilacus sp. nov., a new member of the Anguillarum clade with six alleles of the 16S rRNA gene from a saline lake.

    Science.gov (United States)

    Zhong, Zhi-Ping; Liu, Ying; Liu, Hong-Can; Wang, Fang; Zhou, Yu-Guang; Liu, Zhi-Pei

    2015-08-01

    A Gram-stain-negative, catalase- and oxidase-positive, facultatively aerobic bacterium, strain DSG-S6T, was isolated from Dasugan Lake (salinity 3.1%, w/w), China. Its taxonomic position was determined by using a polyphasic approach. Cells of strain DSG-S6T were non-spore-forming, slightly bent rods, and motile by means of a single polar flagellum. Growth occurred in the presence of 0-7.0% (w/v) NaCl (optimum, 2.0%), at 4-35 °C (optimum, 30 °C) and at pH 6.0-10.5 (optimum, pH 8.0-8.5). C16 : 0, C18 : 1ω7c and C16 : 1ω7c and/or C16 : 1ω6c were the major fatty acids. Six alleles of the 16S rRNA gene sharing 98.9-99.9  % similarity were detected in strain DSG-S6T, which showed highest 16S rRNA gene sequence similarity to Vibrio aestuarianus ATCC 35048T (97.7 %), then to Vibrio pacinii LMG 19999T (97.6%) and Vibrio metschnikovii CIP 69.14T (96.8%). Multilocus sequence analysis of four housekeeping genes and 16S rRNA genes clearly clustered it as a member of the Anguillarum clade. Mean DNA-DNA relatedness between strain DSG-S6T and V. aestuarianus NBRC 15629T, V. pacinii CGMCC 1.12557T and V. metschnikovii JCM 21189T was 20.6 ± 2.3, 38.1 ± 3.5 and 24.2 ± 2.8%, respectively. The DNA G+C content was 46.8 mol% (Tm). Based on the data, it is concluded that strain DSG-S6T represents a novel species of the genus Vibrio, for which the name Vibrio salilacus sp. nov. is proposed. The type strain is DSG-S6T ( = CGMCC 1.12427T = JCM 19265T).

  20. MicroRNAs located in the Hox gene clusters are implicated in huntington's disease pathogenesis.

    Directory of Open Access Journals (Sweden)

    Andrew G Hoss

    2014-02-01

    Full Text Available Transcriptional dysregulation has long been recognized as central to the pathogenesis of Huntington's disease (HD. MicroRNAs (miRNAs represent a major system of post-transcriptional regulation, by either preventing translational initiation or by targeting transcripts for storage or for degradation. Using next-generation miRNA sequencing in prefrontal cortex (Brodmann Area 9 of twelve HD and nine controls, we identified five miRNAs (miR-10b-5p, miR-196a-5p, miR-196b-5p, miR-615-3p and miR-1247-5p up-regulated in HD at genome-wide significance (FDR q-value<0.05. Three of these, miR-196a-5p, miR-196b-5p and miR-615-3p, were expressed at near zero levels in control brains. Expression was verified for all five miRNAs using reverse transcription quantitative PCR and all but miR-1247-5p were replicated in an independent sample (8HD/8C. Ectopic miR-10b-5p expression in PC12 HTT-Q73 cells increased survival by MTT assay and cell viability staining suggesting increased expression may be a protective response. All of the miRNAs but miR-1247-5p are located in intergenic regions of Hox clusters. Total mRNA sequencing in the same samples identified fifteen of 55 genes within the Hox cluster gene regions as differentially expressed in HD, and the Hox genes immediately adjacent to the four Hox cluster miRNAs as up-regulated. Pathway analysis of mRNA targets of these miRNAs implicated functions for neuronal differentiation, neurite outgrowth, cell death and survival. In regression models among the HD brains, huntingtin CAG repeat size, onset age and age at death were independently found to be inversely related to miR-10b-5p levels. CAG repeat size and onset age were independently inversely related to miR-196a-5p, onset age was inversely related to miR-196b-5p and age at death was inversely related to miR-615-3p expression. These results suggest these Hox-related miRNAs may be involved in neuroprotective response in HD. Recently, miRNAs have shown promise as

  1. Unsupervised clustering of gene expression data points at hypoxia as possible trigger for metabolic syndrome

    Directory of Open Access Journals (Sweden)

    York David

    2006-12-01

    Full Text Available Abstract Background Classification of large volumes of data produced in a microarray experiment allows for the extraction of important clues as to the nature of a disease. Results Using multi-dimensional unsupervised FOREL (FORmal ELement algorithm we have re-analyzed three public datasets of skeletal muscle gene expression in connection with insulin resistance and type 2 diabetes (DM2. Our analysis revealed the major line of variation between expression profiles of normal, insulin resistant, and diabetic skeletal muscle. A cluster of most "metabolically sound" samples occupied one end of this line. The distance along this line coincided with the classic markers of diabetes risk, namely obesity and insulin resistance, but did not follow the accepted clinical diagnosis of DM2 as defined by the presence or absence of hyperglycemia. Genes implicated in this expression pattern are those controlling skeletal muscle fiber type and glycolytic metabolism. Additionally myoglobin and hemoglobin were upregulated and ribosomal genes deregulated in insulin resistant patients. Conclusion Our findings are concordant with the changes seen in skeletal muscle with altitude hypoxia. This suggests that hypoxia and shift to glycolytic metabolism may also drive insulin resistance.

  2. Sexuality Generates Diversity in the Aflatoxin Gene Cluster: Evidence on a Global Scale

    Science.gov (United States)

    Moore, Geromy G.; Elliott, Jacalyn L.; Singh, Rakhi; Horn, Bruce W.; Dorner, Joe W.; Stone, Eric A.; Chulze, Sofia N.; Barros, German G.; Naik, Manjunath K.; Wright, Graeme C.; Hell, Kerstin; Carbone, Ignazio

    2013-01-01

    Aflatoxins are produced by Aspergillus flavus and A. parasiticus in oil-rich seed and grain crops and are a serious problem in agriculture, with aflatoxin B1 being the most carcinogenic natural compound known. Sexual reproduction in these species occurs between individuals belonging to different vegetative compatibility groups (VCGs). We examined natural genetic variation in 758 isolates of A. flavus, A. parasiticus and A. minisclerotigenes sampled from single peanut fields in the United States (Georgia), Africa (Benin), Argentina (Córdoba), Australia (Queensland) and India (Karnataka). Analysis of DNA sequence variation across multiple intergenic regions in the aflatoxin gene clusters of A. flavus, A. parasiticus and A. minisclerotigenes revealed significant linkage disequilibrium (LD) organized into distinct blocks that are conserved across different localities, suggesting that genetic recombination is nonrandom and a global occurrence. To assess the contributions of asexual and sexual reproduction to fixation and maintenance of toxin chemotype diversity in populations from each locality/species, we tested the null hypothesis of an equal number of MAT1-1 and MAT1-2 mating-type individuals, which is indicative of a sexually recombining population. All samples were clone-corrected using multi-locus sequence typing which associates closely with VCG. For both A. flavus and A. parasiticus, when the proportions of MAT1-1 and MAT1-2 were significantly different, there was more extensive LD in the aflatoxin cluster and populations were fixed for specific toxin chemotype classes, either the non-aflatoxigenic class in A. flavus or the B1-dominant and G1-dominant classes in A. parasiticus. A mating type ratio close to 1∶1 in A. flavus, A. parasiticus and A. minisclerotigenes was associated with higher recombination rates in the aflatoxin cluster and less pronounced chemotype differences in populations. This work shows that the reproductive nature of the population (more

  3. Sexuality generates diversity in the aflatoxin gene cluster: evidence on a global scale.

    Directory of Open Access Journals (Sweden)

    Geromy G Moore

    Full Text Available Aflatoxins are produced by Aspergillus flavus and A. parasiticus in oil-rich seed and grain crops and are a serious problem in agriculture, with aflatoxin B₁ being the most carcinogenic natural compound known. Sexual reproduction in these species occurs between individuals belonging to different vegetative compatibility groups (VCGs. We examined natural genetic variation in 758 isolates of A. flavus, A. parasiticus and A. minisclerotigenes sampled from single peanut fields in the United States (Georgia, Africa (Benin, Argentina (Córdoba, Australia (Queensland and India (Karnataka. Analysis of DNA sequence variation across multiple intergenic regions in the aflatoxin gene clusters of A. flavus, A. parasiticus and A. minisclerotigenes revealed significant linkage disequilibrium (LD organized into distinct blocks that are conserved across different localities, suggesting that genetic recombination is nonrandom and a global occurrence. To assess the contributions of asexual and sexual reproduction to fixation and maintenance of toxin chemotype diversity in populations from each locality/species, we tested the null hypothesis of an equal number of MAT1-1 and MAT1-2 mating-type individuals, which is indicative of a sexually recombining population. All samples were clone-corrected using multi-locus sequence typing which associates closely with VCG. For both A. flavus and A. parasiticus, when the proportions of MAT1-1 and MAT1-2 were significantly different, there was more extensive LD in the aflatoxin cluster and populations were fixed for specific toxin chemotype classes, either the non-aflatoxigenic class in A. flavus or the B₁-dominant and G₁-dominant classes in A. parasiticus. A mating type ratio close to 1∶1 in A. flavus, A. parasiticus and A. minisclerotigenes was associated with higher recombination rates in the aflatoxin cluster and less pronounced chemotype differences in populations. This work shows that the reproductive nature of

  4. Regulatory role of tetR gene in a novel gene cluster of Acidovorax avenae subsp. avenae RS-1 under oxidative stress

    OpenAIRE

    Liu, He; Yang, Chun-Lan; Ge, Meng-Yu; Ibrahim, Muhammad; Li, Bin; Zhao, Wen-Jun; Chen, Gong-You; Zhu, Bo; Xie, Guan-Lin

    2014-01-01

    Acidovorax avenae subsp. avenae is the causal agent of bacterial brown stripe disease in rice. In this study, we characterized a novel horizontal transfer of a gene cluster, including tetR, on the chromosome of A. avenae subsp. avenae RS-1 by genome-wide analysis. TetR acted as a repressor in this gene cluster and the oxidative stress resistance was enhanced in tetR-deletion mutant strain. Electrophoretic mobility shift assay demonstrated that TetR regulator bound directly to the promoter of ...

  5. antiSMASH 3.0—a comprehensive resource for the genome mining of biosynthetic gene clusters

    DEFF Research Database (Denmark)

    Weber, Tilmann; Blin, Kai; Duddela, Srikanth

    2015-01-01

    Microbial secondary metabolism constitutes a rich source of antibiotics, chemotherapeutics, insecticides and other high-value chemicals. Genome mining of gene clusters that encode the biosynthetic pathways for these metabolites has become a key methodology for novel compound discovery. In 2011, we...... introduced antiSMASH, a web server and stand-alone tool for the automatic genomic identification and analysis of biosynthetic gene clusters, available at http://antismash.secondarymetabolites.org. Here, we present version 3.0 of antiSMASH, which has undergone major improvements. A full integration...

  6. Global analysis of biosynthetic gene clusters reveals vast potential of secondary metabolite production in Penicillium species

    DEFF Research Database (Denmark)

    Nielsen, Jens Christian; Grijseels, Sietske; Prigent, Sylvain

    2017-01-01

    -referenced the predicted pathways with published data on the production of secondary metabolites and experimentally validated the production of antibiotic yanuthones in Penicillia and identified a previously undescribed compound from the yanuthone pathway. This study is the first genus-wide analysis of the genomic......Filamentous fungi produce a wide range of bioactive compounds with important pharmaceutical applications, such as antibiotic penicillins and cholesterol-lowering statins. However, less attention has been paid to fungal secondary metabolites compared to those from bacteria. In this study, we...... sequenced the genomes of 9 Penicillium species and, together with 15 published genomes, we investigated the secondary metabolism of Penicillium and identified an immense, unexploited potential for producing secondary metabolites by this genus. A total of 1,317 putative biosynthetic gene clusters (BGCs) were...

  7. antiSMASH 4.0-improvements in chemistry prediction and gene cluster boundary identification

    DEFF Research Database (Denmark)

    Blin, Kai; Wolf, Thomas; Chevrette, Marc G.

    2017-01-01

    architectures. Additionally, several usability features have been updated and improved. Together, these improvements make antiSMASH up-to-date with the latest developments in natural product research and will further facilitate computational genome mining for the discovery of novel bioactive molecules.......Many antibiotics, chemotherapeutics, crop protection agents and food preservatives originate from molecules produced by bacteria, fungi or plants. In recent years, genome mining methodologies have been widely adopted to identify and characterize the biosynthetic gene clusters encoding...... the production of such compounds. Since 2011, the 'antibiotics and secondary metabolite analysis shell-antiSMASH' has assisted researchers in efficiently performing this, both as a web server and a standalone tool. Here, we present the thoroughly updated antiSMASH version 4, which adds several novel features...

  8. Biosynthesis of antinutritional alkaloids in solanaceous crops is mediated by clustered genes.

    Science.gov (United States)

    Itkin, M; Heinig, U; Tzfadia, O; Bhide, A J; Shinde, B; Cardenas, P D; Bocobza, S E; Unger, T; Malitsky, S; Finkers, R; Tikunov, Y; Bovy, A; Chikate, Y; Singh, P; Rogachev, I; Beekwilder, J; Giri, A P; Aharoni, A

    2013-07-12

    Steroidal glycoalkaloids (SGAs) such as α-solanine found in solanaceous food plants--as, for example, potato--are antinutritional factors for humans. Comparative coexpression analysis between tomato and potato coupled with chemical profiling revealed an array of 10 genes that partake in SGA biosynthesis. We discovered that six of them exist as a cluster on chromosome 7, whereas an additional two are adjacent in a duplicated genomic region on chromosome 12. Following systematic functional analysis, we suggest a revised SGA biosynthetic pathway starting from cholesterol up to the tetrasaccharide moiety linked to the tomato SGA aglycone. Silencing GLYCOALKALOID METABOLISM 4 prevented accumulation of SGAs in potato tubers and tomato fruit. This may provide a means for removal of unsafe, antinutritional substances present in these widely used food crops.

  9. A systematic computational analysis of biosynthetic gene cluster evolution: lessons for engineering biosynthesis.

    Directory of Open Access Journals (Sweden)

    Marnix H Medema

    2014-12-01

    Full Text Available Bacterial secondary metabolites are widely used as antibiotics, anticancer drugs, insecticides and food additives. Attempts to engineer their biosynthetic gene clusters (BGCs to produce unnatural metabolites with improved properties are often frustrated by the unpredictability and complexity of the enzymes that synthesize these molecules, suggesting that genetic changes within BGCs are limited by specific constraints. Here, by performing a systematic computational analysis of BGC evolution, we derive evidence for three findings that shed light on the ways in which, despite these constraints, nature successfully invents new molecules: 1 BGCs for complex molecules often evolve through the successive merger of smaller sub-clusters, which function as independent evolutionary entities. 2 An important subset of polyketide synthases and nonribosomal peptide synthetases evolve by concerted evolution, which generates sets of sequence-homogenized domains that may hold promise for engineering efforts since they exhibit a high degree of functional interoperability, 3 Individual BGC families evolve in distinct ways, suggesting that design strategies should take into account family-specific functional constraints. These findings suggest novel strategies for using synthetic biology to rationally engineer biosynthetic pathways.

  10. Cancer Transcriptome Dataset Analysis: Comparing Methods of Pathway and Gene Regulatory Network-Based Cluster Identification.

    Science.gov (United States)

    Nam, Seungyoon

    2017-04-01

    Cancer transcriptome analysis is one of the leading areas of Big Data science, biomarker, and pharmaceutical discovery, not to forget personalized medicine. Yet, cancer transcriptomics and postgenomic medicine require innovation in bioinformatics as well as comparison of the performance of available algorithms. In this data analytics context, the value of network generation and algorithms has been widely underscored for addressing the salient questions in cancer pathogenesis. Analysis of cancer trancriptome often results in complicated networks where identification of network modularity remains critical, for example, in delineating the "druggable" molecular targets. Network clustering is useful, but depends on the network topology in and of itself. Notably, the performance of different network-generating tools for network cluster (NC) identification has been little investigated to date. Hence, using gastric cancer (GC) transcriptomic datasets, we compared two algorithms for generating pathway versus gene regulatory network-based NCs, showing that the pathway-based approach better agrees with a reference set of cancer-functional contexts. Finally, by applying pathway-based NC identification to GC transcriptome datasets, we describe cancer NCs that associate with candidate therapeutic targets and biomarkers in GC. These observations collectively inform future research on cancer transcriptomics, drug discovery, and rational development of new analysis tools for optimal harnessing of omics data.

  11. Clustered Mutation Signatures Reveal that Error-Prone DNA Repair Targets Mutations to Active Genes.

    Science.gov (United States)

    Supek, Fran; Lehner, Ben

    2017-07-27

    Many processes can cause the same nucleotide change in a genome, making the identification of the mechanisms causing mutations a difficult challenge. Here, we show that clustered mutations provide a more precise fingerprint of mutagenic processes. Of nine clustered mutation signatures identified from >1,000 tumor genomes, three relate to variable APOBEC activity and three are associated with tobacco smoking. An additional signature matches the spectrum of translesion DNA polymerase eta (POLH). In lymphoid cells, these mutations target promoters, consistent with AID-initiated somatic hypermutation. In solid tumors, however, they are associated with UV exposure and alcohol consumption and target the H3K36me3 chromatin of active genes in a mismatch repair (MMR)-dependent manner. These regions normally have a low mutation rate because error-free MMR also targets H3K36me3 chromatin. Carcinogens and error-prone repair therefore redistribute mutations to the more important regions of the genome, contributing a substantial mutation load in many tumors, including driver mutations. Copyright © 2017 Elsevier Inc. All rights reserved.

  12. Biosynthetic gene clusters for relevant secondary metabolites produced by Penicillium roqueforti in blue cheeses.

    Science.gov (United States)

    García-Estrada, Carlos; Martín, Juan-Francisco

    2016-10-01

    Ripening of blue-veined cheeses, such as the French Bleu and Roquefort, the Italian Gorgonzola, the English Stilton, the Danish Danablu or the Spanish Cabrales, Picón Bejes-Tresviso, and Valdeón, requires the growth and enzymatic activity of the mold Penicillium roqueforti, which is responsible for the characteristic texture, blue-green spots, and aroma of these types of cheeses. This filamentous fungus is able to synthesize different secondary metabolites, including andrastins, mycophenolic acid, and several mycotoxins, such as roquefortines C and D, PR-toxin and eremofortins, isofumigaclavines A and B, and festuclavine. This review provides a detailed description of the main secondary metabolites produced by P. roqueforti in blue cheese, giving a special emphasis to roquefortine, PR-toxin and mycophenolic acid, and their biosynthetic gene clusters and pathways. The knowledge of these clusters and secondary metabolism pathways, together with the ability of P. roqueforti to produce beneficial secondary metabolites, is of interest for commercial purposes.

  13. Insights into variability of actinorhodopsin genes of the LG1 cluster in two different freshwater habitats.

    Directory of Open Access Journals (Sweden)

    Jitka Jezberová

    Full Text Available Actinorhodopsins (ActRs are recently discovered proteorhodopsins present in Actinobacteria, enabling them to adapt to a wider spectrum of environmental conditions. Frequently, a large fraction of freshwater bacterioplankton belongs to the acI lineage of Actinobacteria and codes the LG1 type of ActRs. In this paper we studied the genotype variability of the LG1 ActRs. We have constructed two clone libraries originating from two environmentally different habitats located in Central Europe; the large alkaline lake Mondsee (Austria and the small humic reservoir Jiřická (the Czech Republic. The 75 yielded clones were phylogenetically analyzed together with all ActR sequences currently available in public databases. Altogether 156 sequences were analyzed and 13 clusters of ActRs were distinguished. Newly obtained clones are distributed over all three LG1 subgroups--LG1-A, B and C. Eighty percent of the sequences belonged to the acI lineage (LG1-A ActR gene bearers further divided into LG1-A1 and LG1-A2 subgroups. Interestingly, the two habitats markedly differed in genotype composition with no identical sequence found in both samples of clones. Moreover, Jiřická reservoir contained three so far not reported clusters, one of them LG1-C related, presenting thus completely new, so far undescribed, genotypes of Actinobacteria in freshwaters.

  14. Ancestral Variations of the PCDHG Gene Cluster Predispose to Dyslexia in a Multiplex Family

    Directory of Open Access Journals (Sweden)

    Teesta Naskar

    2018-02-01

    Full Text Available Dyslexia is a heritable neurodevelopmental disorder characterized by difficulties in reading and writing. In this study, we describe the identification of a set of 17 polymorphisms located across 1.9 Mb region on chromosome 5q31.3, encompassing genes of the PCDHG cluster, TAF7, PCDH1 and ARHGAP26, dominantly inherited with dyslexia in a multi-incident family. Strikingly, the non-risk form of seven variations of the PCDHG cluster, are preponderant in the human lineage, while risk alleles are ancestral and conserved across Neanderthals to non-human primates. Four of these seven ancestral variations (c.460A > C [p.Ile154Leu], c.541G > A [p.Ala181Thr], c.2036G > C [p.Arg679Pro] and c.2059A > G [p.Lys687Glu] result in amino acid alterations. p.Ile154Leu and p.Ala181Thr are present at EC2: EC3 interacting interface of γA3-PCDH and γA4-PCDH respectively might affect trans-homophilic interaction and hence neuronal connectivity. p.Arg679Pro and p.Lys687Glu are present within the linker region connecting trans-membrane to extracellular domain. Sequence analysis indicated the importance of p.Ile154, p.Arg679 and p.Lys687 in maintaining class specificity. Thus the observed association of PCDHG genes encoding neural adhesion proteins reinforces the hypothesis of aberrant neuronal connectivity in the pathophysiology of dyslexia. Additionally, the striking conservation of the identified variants indicates a role of PCDHG in the evolution of highly specialized cognitive skills critical to reading.

  15. The Sound of Silence: Activating Silent Biosynthetic Gene Clusters in Marine Microorganisms

    Directory of Open Access Journals (Sweden)

    F. Jerry Reen

    2015-07-01

    Full Text Available Unlocking the rich harvest of marine microbial ecosystems has the potential to both safeguard the existence of our species for the future, while also presenting significant lifestyle benefits for commercial gain. However, while significant advances have been made in the field of marine biodiscovery, leading to the introduction of new classes of therapeutics for clinical medicine, cosmetics and industrial products, much of what this natural ecosystem has to offer is locked in, and essentially hidden from our screening methods. Releasing this silent potential represents a significant technological challenge, the key to which is a comprehensive understanding of what controls these systems. Heterologous expression systems have been successful in awakening a number of these cryptic marine biosynthetic gene clusters (BGCs. However, this approach is limited by the typically large size of the encoding sequences. More recently, focus has shifted to the regulatory proteins associated with each BGC, many of which are signal responsive raising the possibility of exogenous activation. Abundant among these are the LysR-type family of transcriptional regulators, which are known to control production of microbial aromatic systems. Although the environmental signals that activate these regulatory systems remain unknown, it offers the exciting possibility of evoking mimic molecules and synthetic expression systems to drive production of potentially novel natural products in microorganisms. Success in this field has the potential to provide a quantum leap forward in medical and industrial bio-product development. To achieve these new endpoints, it is clear that the integrated efforts of bioinformaticians and natural product chemists will be required as we strive to uncover new and potentially unique structures from silent or cryptic marine gene clusters.

  16. Ancestral Variations of the PCDHG Gene Cluster Predispose to Dyslexia in a Multiplex Family.

    Science.gov (United States)

    Naskar, Teesta; Faruq, Mohammed; Banerjee, Priyajit; Khan, Massarat; Midha, Rashi; Kumari, Renu; Devasenapathy, Subhashree; Prajapati, Bharat; Sengupta, Sanghamitra; Jain, Deepti; Mukerji, Mitali; Singh, Nandini Chatterjee; Sinha, Subrata

    2018-02-01

    Dyslexia is a heritable neurodevelopmental disorder characterized by difficulties in reading and writing. In this study, we describe the identification of a set of 17 polymorphisms located across 1.9Mb region on chromosome 5q31.3, encompassing genes of the PCDHG cluster, TAF7, PCDH1 and ARHGAP26, dominantly inherited with dyslexia in a multi-incident family. Strikingly, the non-risk form of seven variations of the PCDHG cluster, are preponderant in the human lineage, while risk alleles are ancestral and conserved across Neanderthals to non-human primates. Four of these seven ancestral variations (c.460A>C [p.Ile154Leu], c.541G>A [p.Ala181Thr], c.2036G>C [p.Arg679Pro] and c.2059A>G [p.Lys687Glu]) result in amino acid alterations. p.Ile154Leu and p.Ala181Thr are present at EC2: EC3 interacting interface of γA3-PCDH and γA4-PCDH respectively might affect trans-homophilic interaction and hence neuronal connectivity. p.Arg679Pro and p.Lys687Glu are present within the linker region connecting trans-membrane to extracellular domain. Sequence analysis indicated the importance of p.Ile154, p.Arg679 and p.Lys687 in maintaining class specificity. Thus the observed association of PCDHG genes encoding neural adhesion proteins reinforces the hypothesis of aberrant neuronal connectivity in the pathophysiology of dyslexia. Additionally, the striking conservation of the identified variants indicates a role of PCDHG in the evolution of highly specialized cognitive skills critical to reading. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.

  17. Analysis of healthy cohorts for single nucleotide polymorphisms in C1q gene cluster

    Directory of Open Access Journals (Sweden)

    MARIA A. RADANOVA

    2015-12-01

    Full Text Available C1q is the first component of the classical pathway of complement activation. The coding region for C1q is localized on chromosome 1p34.1–36.3. Mutations or single nucleotide polymorphisms (SNPs in C1q gene cluster can cause developing of Systemic lupus erythematosus (SLE because of C1q deficiency or other unknown reason. We selected five SNPs located in 7.121 kbp region on chromosome 1, which were previously associated with SLE and/or low C1q level, but not causing C1q deficiency and analyzed them in terms of allele frequencies and genotype distribution in comparison with Hispanic, Asian, African and other Caucasian cohorts. These SNPs were: rs587585, rs292001, rs172378, rs294179 and rs631090. One hundred eighty five healthy Bulgarian volunteers were genotyped for the selected five C1q SNPs by quantative real-time PCR methods. International HapMap Project has been used for information about genotype distribution and allele frequencies of the five SNPs in, Hispanics, Asians, Africans and others Caucasian cohorts. Bulgarian healthy volunteers and another pooled Caucasian cohort had similar frequencies of genotypes and alleles of rs587585, rs292001, rs294179 and rs631090 SNPs. Nevertheless, genotype AA of rs172378 was significantly overrepresented in Bulgarians when compared to other healthy Caucasians from USA and UK (60% vs 31%. Genotype distribution of rs172378 in Bulgarians was similar to Greek-Cyriot Caucasians. For all Caucasians the major allele of rs172378 was A. This is the first study analyzing the allele frequencies and genotype distribution of C1q gene cluster SNPs in Bulgarian healthy population.

  18. An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks.

    Science.gov (United States)

    Botía, Juan A; Vandrovcova, Jana; Forabosco, Paola; Guelfi, Sebastian; D'Sa, Karishma; Hardy, John; Lewis, Cathryn M; Ryten, Mina; Weale, Michael E

    2017-04-12

    Weighted Gene Co-expression Network Analysis (WGCNA) is a widely used R software package for the generation of gene co-expression networks (GCN). WGCNA generates both a GCN and a derived partitioning of clusters of genes (modules). We propose k-means clustering as an additional processing step to conventional WGCNA, which we have implemented in the R package km2gcn (k-means to gene co-expression network, https://github.com/juanbot/km2gcn ). We assessed our method on networks created from UKBEC data (10 different human brain tissues), on networks created from GTEx data (42 human tissues, including 13 brain tissues), and on simulated networks derived from GTEx data. We observed substantially improved module properties, including: (1) few or zero misplaced genes; (2) increased counts of replicable clusters in alternate tissues (x3.1 on average); (3) improved enrichment of Gene Ontology terms (seen in 48/52 GCNs) (4) improved cell type enrichment signals (seen in 21/23 brain GCNs); and (5) more accurate partitions in simulated data according to a range of similarity indices. The results obtained from our investigations indicate that our k-means method, applied as an adjunct to standard WGCNA, results in better network partitions. These improved partitions enable more fruitful downstream analyses, as gene modules are more biologically meaningful.

  19. Genomic organization and mapping of the mouse P26s4 ATPase gene: A member of the remarkably conserved AAA gene family

    Energy Technology Data Exchange (ETDEWEB)

    Hoyle, J.; Fisher, E.M.C. [Imperial College, London (United Kingdom)

    1996-01-01

    The eukaryotic genome contains a large family of ATPases in which each member has at least one highly conserved domain of approximately 200 amino acids with an ATP binding motif (the {open_quotes}AAA{close_quotes} domain). AAA ATPases play diverse roles in the cell and are of considerable interest to researchers investigating a number of different phenomena, including control of the cell cycle. We have characterized the mouse P26s4 AAA ATPase gene that encodes a subunit of the 26S protease, a multimeric complex that is responsible for the ubiquitin- and ATP-dependent degradation of specific proteins. The normal functioning of eukaryotic cells depends on this pathway to remove regulatory proteins such as cyclins or signal transduction molecules from the intracellular environment, with the appropriate timing to allow normal cell division and development. We have isolated mouse P26s4 cDNAs and mapped the P26s4 gene to chromosome 12. We have analyzed the intron-exon structure of the P26s4 genomic locus and have determined that the gene contains at least 10 introns, the first of which separates the start methionine from the rest of the coding sequence. 18 refs., 2 figs.

  20. Identification of a Gene Cluster Enabling Lactobacillus casei BL23 To Utilize myo-Inositol▿ †

    Science.gov (United States)

    Yebra, María Jesús; Zúñiga, Manuel; Beaufils, Sophie; Pérez-Martínez, Gaspar; Deutscher, Josef; Monedero, Vicente

    2007-01-01

    Genome analysis of Lactobacillus casei BL23 revealed that, compared to L. casei ATCC 334, it carries a 12.8-kb DNA insertion containing genes involved in the catabolism of the cyclic polyol myo-inositol (MI). Indeed, L. casei ATCC 334 does not ferment MI, whereas strain BL23 is able to utilize this carbon source. The inserted DNA consists of an iolR gene encoding a DeoR family transcriptional repressor and a divergently transcribed iolTABCDG1G2EJK operon, encoding a complete MI catabolic pathway, in which the iolK gene probably codes for a malonate semialdehyde decarboxylase. The presence of iolK suggests that L. casei has two alternative pathways for the metabolism of malonic semialdehyde: (i) the classical MI catabolic pathway in which IolA (malonate semialdehyde dehydrogenase) catalyzes the formation of acetyl-coenzyme A from malonic semialdehyde and (ii) the conversion of malonic semialdehyde to acetaldehyde catalyzed by the product of iolK. The function of the iol genes was verified by the disruption of iolA, iolT, and iolD, which provided MI-negative strains. By contrast, the disruption of iolK resulted in a strain with no obvious defect in MI utilization. Transcriptional analyses conducted with different mutant strains showed that the iolTABCDG1G2EJK cluster is regulated by substrate-specific induction mediated by the inactivation of the transcriptional repressor IolR and by carbon catabolite repression mediated by the catabolite control protein A (CcpA). This is the first example of an operon for MI utilization in lactic acid bacteria and illustrates the versatility of carbohydrate utilization in L. casei BL23. PMID:17449687

  1. Genetic interrelations in the actinomycin biosynthetic gene clusters of Streptomyces antibioticus IMRU 3720 and Streptomyces chrysomallus ATCC11523, producers of actinomycin X and actinomycin C

    Directory of Open Access Journals (Sweden)

    Crnovčić I

    2017-04-01

    Full Text Available Ivana Crnovčić,1 Christian Rückert,2 Siamak Semsary,1 Manuel Lang,1 Jörn Kalinowski,2 Ullrich Keller1 1Institut für Chemie, Technische Universität Berlin, Berlin-Charlottenburg, 2Technology Platform Genomics, Center for Biotechnology, Bielefeld University, Bielefeld, Germany Abstract: Sequencing the actinomycin (acm biosynthetic gene cluster of Streptomyces antibioticus IMRU 3720, which produces actinomycin X (Acm X, revealed 20 genes organized into a highly similar framework as in the bi-armed acm C biosynthetic gene cluster of Streptomyces chrysomallus but without an attached additional extra arm of orthologues as in the latter. Curiously, the extra arm of the S. chrysomallus gene cluster turned out to perfectly match the single arm of the S. antibioticus gene cluster in the same order of orthologues including the the presence of two pseudogenes, scacmM and scacmN, encoding a cytochrome P450 and its ferredoxin, respectively. Orthologues of the latter genes were both missing in the principal arm of the S. chrysomallus acm C gene cluster. All orthologues of the extra arm showed a G +C-contents different from that of their counterparts in the principal arm. Moreover, the similarities of translation products from the extra arm were all higher to the corresponding translation products of orthologue genes from the S. antibioticus acm X gene cluster than to those encoded by the principal arm of their own gene cluster. This suggests that the duplicated structure of the S. chrysomallus acm C biosynthetic gene cluster evolved from previous fusion between two one-armed acm gene clusters each from a different genetic background. However, while scacmM and scacmN in the extra arm of the S. chrysomallus acm C gene cluster are mutated and therefore are non-functional, their orthologues saacmM and saacmN in the S. antibioticus acm C gene cluster show no defects seemingly encoding active enzymes with functions specific for Acm X biosynthesis. Both acm

  2. Automatic extraction of gene ontology annotation and its correlation with clusters in protein networks

    Directory of Open Access Journals (Sweden)

    Mazo Ilya

    2007-07-01

    Full Text Available Abstract Background Uncovering cellular roles of a protein is a task of tremendous importance and complexity that requires dedicated experimental work as well as often sophisticated data mining and processing tools. Protein functions, often referred to as its annotations, are believed to manifest themselves through topology of the networks of inter-proteins interactions. In particular, there is a growing body of evidence that proteins performing the same function are more likely to interact with each other than with proteins with other functions. However, since functional annotation and protein network topology are often studied separately, the direct relationship between them has not been comprehensively demonstrated. In addition to having the general biological significance, such demonstration would further validate the data extraction and processing methods used to compose protein annotation and protein-protein interactions datasets. Results We developed a method for automatic extraction of protein functional annotation from scientific text based on the Natural Language Processing (NLP technology. For the protein annotation extracted from the entire PubMed, we evaluated the precision and recall rates, and compared the performance of the automatic extraction technology to that of manual curation used in public Gene Ontology (GO annotation. In the second part of our presentation, we reported a large-scale investigation into the correspondence between communities in the literature-based protein networks and GO annotation groups of functionally related proteins. We found a comprehensive two-way match: proteins within biological annotation groups form significantly denser linked network clusters than expected by chance and, conversely, densely linked network communities exhibit a pronounced non-random overlap with GO groups. We also expanded the publicly available GO biological process annotation using the relations extracted by our NLP technology

  3. Protein-protein association and cellular localization of four essential gene products encoded by tellurite resistance-conferring cluster "ter" from pathogenic Escherichia coli.

    Science.gov (United States)

    Valkovicova, Lenka; Vavrova, Silvia Minarikova; Mravec, Jozef; Grones, Jozef; Turna, Jan

    2013-12-01

    Gene cluster "ter" conferring high tellurite resistance has been identified in various pathogenic bacteria including Escherichia coli O157:H7. However, the precise mechanism as well as the molecular function of the respective gene products is unclear. Here we describe protein-protein association and localization analyses of four essential Ter proteins encoded by minimal resistance-conferring fragment (terBCDE) by means of recombinant expression. By using a two-plasmid complementation system we show that the overproduced single Ter proteins are not able to mediate tellurite resistance, but all Ter members play an irreplaceable role within the cluster. We identified several types of homotypic and heterotypic protein-protein associations among the Ter proteins by in vitro and in vivo pull-down assays and determined their cellular localization by cytosol/membrane fractionation. Our results strongly suggest that Ter proteins function involves their mutual association, which probably happens at the interface of the inner plasma membrane and the cytosol.

  4. Application of bi-clustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials

    Directory of Open Access Journals (Sweden)

    Andrew Williams

    2017-12-01

    Full Text Available This article contains data related to the research article ‘Application of bi-clustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials’ (Williams and Halappanavar, 2015 [1]. The presence of diverse types of nanomaterials (NMs in commerce has grown significantly in the past decade and as a result, human exposure to these materials in the environment is inevitable. The traditional toxicity testing approaches that are reliant on animals are both time- and cost- intensive; employing which, it is not possible to complete the challenging task of safety assessment of NMs currently on the market in a timely manner. Thus, there is an urgent need for comprehensive understanding of the biological behavior of NMs, and efficient toxicity screening tools that will enable the development of predictive toxicology paradigms suited to rapidly assessing the human health impacts of exposure to NMs. In an effort to predict the long term health impacts of acute exposure to NMs, in Williams and Halappanavar (2015 [1], we applied bi-clustering and gene set enrichment analysis methods to derive essential features of altered lung transcriptome following exposure to NMs that are associated with lung-specific diseases. Several datasets from public microarray repositories describing pulmonary diseases in mouse models following exposure to a variety of substances were examined and functionally related bi-clusters showing similar gene expression profiles were identified. The identified bi-clusters were then used to conduct a gene set enrichment analysis on lung gene expression profiles derived from mice exposed to nano-titanium dioxide, carbon black or carbon nanotubes (nano-TiO2, CB and CNTs to determine the disease significance of these data-driven gene sets. The results of the analysis correctly identified all NMs to be inflammogenic, and only CB and CNTs as potentially fibrogenic. Here, we

  5. Characterization and detection of a widely distributed gene cluster that predicts anaerobic choline utilization by human gut bacteria.

    Science.gov (United States)

    Martínez-del Campo, Ana; Bodea, Smaranda; Hamer, Hilary A; Marks, Jonathan A; Haiser, Henry J; Turnbaugh, Peter J; Balskus, Emily P

    2015-04-14

    Elucidation of the molecular mechanisms underlying the human gut microbiota's effects on health and disease has been complicated by difficulties in linking metabolic functions associated with the gut community as a whole to individual microorganisms and activities. Anaerobic microbial choline metabolism, a disease-associated metabolic pathway, exemplifies this challenge, as the specific human gut microorganisms responsible for this transformation have not yet been clearly identified. In this study, we established the link between a bacterial gene cluster, the choline utilization (cut) cluster, and anaerobic choline metabolism in human gut isolates by combining transcriptional, biochemical, bioinformatic, and cultivation-based approaches. Quantitative reverse transcription-PCR analysis and in vitro biochemical characterization of two cut gene products linked the entire cluster to growth on choline and supported a model for this pathway. Analyses of sequenced bacterial genomes revealed that the cut cluster is present in many human gut bacteria, is predictive of choline utilization in sequenced isolates, and is widely but discontinuously distributed across multiple bacterial phyla. Given that bacterial phylogeny is a poor marker for choline utilization, we were prompted to develop a degenerate PCR-based method for detecting the key functional gene choline TMA-lyase (cutC) in genomic and metagenomic DNA. Using this tool, we found that new choline-metabolizing gut isolates universally possessed cutC. We also demonstrated that this gene is widespread in stool metagenomic data sets. Overall, this work represents a crucial step toward understanding anaerobic choline metabolism in the human gut microbiota and underscores the importance of examining this microbial community from a function-oriented perspective. Anaerobic choline utilization is a bacterial metabolic activity that occurs in the human gut and is linked to multiple diseases. While bacterial genes responsible for

  6. Proteomic identification of putative microRNA394 target genes in Arabidopsis thaliana identifies major latex protein family members critical for normal development

    DEFF Research Database (Denmark)

    Litholdo, Celso G; Parker, Benjamin; Eamens, Andrew L

    2016-01-01

    , the identification of proteins targeted by LCR F-box itself has proven problematic. Here, a proteomic analysis of shoot apices from plants with altered LCR levels identified a member of the Latex Protein (MLP) family gene as a potential LCR F-box target. Bioinformatic and molecular analyses also suggested that other...... MLP family members are likely to be targets for this post-translational regulation. Direct interaction between LCR F-Box and MLP423 was validated. Additional MLP members had reduction in protein accumulation, in varying degrees, mediated by LCR F-Box. Transgenic Arabidopsis lines, in which MLP28...... Taken together, the results demonstrate that MLPs are driven to degradation by LCR, and indicate that MLP gene family is target of miR394-LCR regulatory node, representing potential targets for directly post-translational regulation mediated by LCR F-Box. In addition, MLP28 family member is associated...

  7. Functional Operons in Secondary Metabolic Gene Clusters in Glarea lozoyensis (Fungi, Ascomycota, Leotiomycetes).

    Science.gov (United States)

    Yue, Qun; Chen, Li; Li, Yan; Bills, Gerald F; Zhang, Xinyu; Xiang, Meichun; Li, Shaojie; Che, Yongsheng; Wang, Chengshu; Niu, Xuemei; An, Zhiqiang; Liu, Xingzhong

    2015-06-16

    Operons are multigene transcriptional units which occur mostly in prokaryotes but rarely in eukaryotes. Protein-coding operons have not been reported in the Fungi even though they represent a very diverse kingdom of organisms. Here, we report a functional operon involved in the secondary metabolism of the fungus Glarea lozoyensis belonging to Leotiomycetes (Ascomycota). Two contiguous genes, glpks3 and glnrps7, encoding polyketide synthase and nonribosomal peptide synthetase, respectively, are cotranscribed into one dicistronic mRNA under the control of the same promoter, and the mRNA is then translated into two individual proteins, GLPKS3 and GLNRPS7. Heterologous expression in Aspergillus nidulans shows that the GLPKS3-GLNRPS7 enzyme complex catalyzes the biosynthesis of a novel pyrrolidinedione-containing compound, xenolozoyenone (compound 1), which indicates the operon is functional. Although it is structurally similar to prokaryotic operons, the glpks3-glnrps7 operon locus has a monophylogenic origin from fungi rather than having been horizontally transferred from prokaryotes. Moreover, two additional operons, glpks28-glnrps8 and glpks29-glnrps9, were verified at the transcriptional level in the same fungus. This is the first report of protein-coding operons in a member of the Fungi. Operons are multigene transcriptional units which occur mostly in prokaryotes but rarely in eukaryotes. Three operon-like gene structures for secondary metabolism that were discovered in the filamentous fungus Glarea lozoyensis are the first examples of protein-coding operons identified in a member of the Fungi. Among them, the glpks3-glnrps7 operon is responsible for the biosynthesis of xenolozoyenone, which is a novel tetramic acid-containing compound. Although structurally similar to prokaryotic operons, the glpks3-glnrps7 operon locus did not result from horizontal gene transfer from prokaryotes. In addition, operonlike structures have been predicted in silico to be common in

  8. Characterization of a multicopper oxidase gene cluster in Phanerochaete chrysosporium and evidence of altered splicing of the mco transcripts

    Science.gov (United States)

    Luis F. Larrondo; Bernardo Gonzalez; Dan Cullen; Rafael Vicuna

    2004-01-01

    A cluster of multicopper oxidase genes (mco1, mco2, mco3, mco4) from the lignin-degrading basidiomycete Phanerochaete chrysosporium is described. The four genes share the same transcriptional orientation within a 25 kb region. mco1, mco2 and mco3 are tightly grouped, with intergenic regions of 2.3 and 0.8 kb, respectively, whereas mco4 is located 11 kb upstream of mco1...

  9. Genotyping of Campylobacter jejuni strains from Danish broiler chickens by restriction fragment length polymorphism of the LPS gene cluster

    DEFF Research Database (Denmark)

    Knudsen, K.N.; Bang, Dang Duong; Nielsen, E.M.

    2005-01-01

    Aims: To apply and evaluate LG (LPS genes) genotyping, which is a genotyping method based on a cluster of genes involved in the synthesis of surface lipopolysaccharides (LPS) in Campylobacter species, for typing of Campylobacter jejuni isolates obtained from Danish broiler chickens. Furthermore...... and LG genotyping was low when applied to poultry isolates. This is in contrast to previous studies on isolates of human origin that reported a high correlation between results obtained by the two typing methods....

  10. Glutamic acid promotes monacolin K production and monacolin K biosynthetic gene cluster expression in Monascus.

    Science.gov (United States)

    Zhang, Chan; Liang, Jian; Yang, Le; Chai, Shiyuan; Zhang, Chenxi; Sun, Baoguo; Wang, Chengtao

    2017-12-01

    This study investigated the effects of glutamic acid on production of monacolin K and expression of the monacolin K biosynthetic gene cluster. When Monascus M1 was grown in glutamic medium instead of in the original medium, monacolin K production increased from 48.4 to 215.4 mg l -1 , monacolin K production increased by 3.5 times. Glutamic acid enhanced monacolin K production by upregulating the expression of mokB-mokI; on day 8, the expression level of mokA tended to decrease by Reverse Transcription-polymerase Chain Reaction. Our findings demonstrated that mokA was not a key gene responsible for the quantity of monacolin K production in the presence of glutamic acid. Observation of Monascus mycelium morphology using Scanning Electron Microscope showed glutamic acid significantly increased the content of Monascus mycelium, altered the permeability of Monascus mycelium, enhanced secretion of monacolin K from the cell, and reduced the monacolin K content in Monascus mycelium, thereby enhancing monacolin K production.

  11. Identification of a conserved cluster of skin-specific genes encoding secreted proteins.

    Science.gov (United States)

    Moffatt, Pierre; Salois, Patrick; St-Amant, Natalie; Gaumond, Marie-Hélène; Lanctôt, Christian

    2004-06-09

    Terminal differentiation of keratinocytes results in the formation of a cornified layer composed of cross-linked intracellular and extracellular material. Using a signal trap expression screening strategy, we have identified four cDNAs encoding secreted proteins potentially involved in this process. One of the cDNAs is identical to the short isoform of suprabasin, a recently described epidermis-specific protein, which is shown here to contain a functional secretory signal. The second cDNA, sk89, encodes a protein of 493 amino acids, rich in glycine and serine residues. The third cDNA encodes a C-terminal fragment of SK89 (amino acids 410-493). It comprises exons 13 to 18 of the sk89 locus but transcription starts at an isoform-specific exon encoding a distinct secretory signal. The fourth cDNA encodes keratinocyte differentiation-associated protein (KDAP), a precursor protein of 102 amino acids. Subcellular localization by immunofluorescence and detection of the tagged proteins by Western blotting confirmed that the four proteins are secreted. Northern analysis and in situ hybridization revealed that expression of the corresponding genes was restricted to the suprabasal keratinocytes of the epidermis. These genes encoding epidermis-specific secreted products are found in a conserved cluster on human chromosome 19q13.12 and on mouse chromosome 7A3.

  12. Identification of a gene cluster for biosynthesis of mannosylerythritol lipids in the basidiomycetous fungus Ustilago maydis.

    Science.gov (United States)

    Hewald, Sandra; Linne, Uwe; Scherer, Mario; Marahiel, Mohamed A; Kämper, Jörg; Bölker, Michael

    2006-08-01

    Many microorganisms produce surface-active substances that enhance the availability of water-insoluble substrates. Although many of these biosurfactants have interesting potential applications, very little is known about their biosynthesis. The basidiomycetous fungus Ustilago maydis secretes large amounts of mannosylerythritol lipids (MELs) under conditions of nitrogen starvation. We recently described a putative glycosyltransferase, Emt1, which is essential for MEL biosynthesis and whose expression is strongly induced by nitrogen limitation. We used DNA microarray analysis to identify additional genes involved in MEL biosynthesis. Here we show that emt1 is part of a gene cluster which comprises five open reading frames. Three of the newly identified proteins, Mac1, Mac2, and Mat1, contain short sequence motifs characteristic for acyl- and acetyltransferases. Mutational analysis revealed that Mac1 and Mac2 are essential for MEL production, which suggests that they are involved in the acylation of mannosylerythritol. Deletion of mat1 resulted in the secretion of completely deacetylated MELs, as determined by mass spectrometry. We overexpressed Mat1 in Escherichia coli and demonstrated that this enzyme acts as an acetyl coenzyme A-dependent acetyltransferase. Remarkably, Mat1 displays relaxed regioselectivity and is able to acetylate mannosylerythritol at both the C-4 and C-6 hydroxyl groups. Based on these results, we propose a biosynthesis pathway for the generation of mannosylerythritol lipids in U. maydis.

  13. Genetic analysis of capsular polysaccharide synthesis gene clusters in 79 capsular types of Klebsiella spp

    Science.gov (United States)

    Pan, Yi-Jiun; Lin, Tzu-Lung; Chen, Chun-Tang; Chen, Yi-Yin; Hsieh, Pei-Fang; Hsu, Chun-Ru; Wu, Meng-Chuan; Wang, Jin-Town

    2015-01-01

    A total of 79 capsular types have been reported in Klebsiella spp., whereas capsular polysaccharide synthesis (cps) regions were available in only 22 types. Due to the limitations of serotyping, complete repertoire of cps will be helpful for capsular genotyping. We therefore resolved the rest 57 cps and conducted comparative analysis. Clustering results of 1,515 predicted proteins from cps loci categorized proteins which share similarity into homology groups (HGs) revealing that 77 Wzy polymerases were classified into 56 HGs, which indicate the high specificity of wzy between different types. Accordingly, wzy-based capsular genotyping could differentiate capsule types except for those lacking wzy (K29 and K50), those sharing identical wzy (K22 vs. K37); and should be carefully applied in those exhibited high similarity (K12 vs. K41, K2 vs. K13, K74 vs. K80, K79 vs. KN1 and K30 vs. K69). Comparison of CPS structures in several capsular types that shared similarity in their gene contents implies possible functions of glycosyltransferases. Therefore, our results provide complete set of cps in various types of Klebsiella spp., which enable the understandings of relationship between genes and CPS structures and are useful for identification of documented or new capsular types. PMID:26493302

  14. pMH2, a small plasmid bearing the nif gene cluster of Enterobacter agglomerans 333 as an excisable cassette.

    Science.gov (United States)

    Stumpf, F; Halda, L; Klingmüller, W

    1993-10-01

    A small plasmid containing the entire nif gene cluster of Enterobacter agglomerans 333 as an excisable cassette has been constructed, using pACYC177 as a vector. Two cosmid clones taken from a gene library of E. agglomerans plasmid pEA3 were used as a source of nif genes. A SmaI fragment of peaMS2-2, containing the H,D,K,Y,E,N,X,U,S,V,W,Z,M,L,A and B genes and an ApaI fragment of peaMS2-16 containing nif A,B,Q,F and J were selected to construct pMH2. The resulting plasmid of 33 kb carries the complete nif gene cluster as a nif cassette on a single XbaI fragment. The nif construct pMH2 in Escherichia coli strains has significant nitrogenase activity compared to wild-type E. agglomerans 333. The nif gene cluster construct was found to be very stable.

  15. GenCLiP: a software program for clustering gene lists by literature profiling and constructing gene co-occurrence networks related to custom keywords

    Directory of Open Access Journals (Sweden)

    Zhou Yi-Bo

    2008-07-01

    Full Text Available Abstract Background Biomedical researchers often want to explore pathogenesis and pathways regulated by abnormally expressed genes, such as those identified by microarray analyses. Literature mining is an important way to assist in this task. Many literature mining tools are now available. However, few of them allows the user to make manual adjustments to zero in on what he/she wants to know in particular. Results We present our software program, GenCLiP (Gene Cluster with Literature Profiles, which is based on the methods presented by Chaussabel and Sher (Genome Biol 2002, 3(10:RESEARCH0055 that search gene lists to identify functional clusters of genes based on up-to-date literature profiling. Four features were added to this previously described method: the ability to 1 manually curate keywords extracted from the literature, 2 search genes and gene co-occurrence networks related to custom keywords, 3 compare analyzed gene results with negative and positive controls generated by GenCLiP, and 4 calculate probabilities that the resulting genes and gene networks are randomly related. In this paper, we show with a set of differentially expressed genes between keloids and normal control, how implementation of functions in GenCLiP successfully identified keywords related to the pathogenesis of keloids and unknown gene pathways involved in the pathogenesis of keloids. Conclusion With regard to the identification of disease-susceptibility genes, GenCLiP allows one to quickly acquire a primary pathogenesis profile and identify pathways involving abnormally expressed genes not previously associated with the disease.

  16. Assessment of clusters of transcription factor binding sites in relationship to human promoter, CpG islands and gene expression

    Directory of Open Access Journals (Sweden)

    Sakaki Yoshiyuki

    2004-02-01

    Full Text Available Abstract Background Gene expression is regulated mainly by transcription factors (TFs that interact with regulatory cis-elements on DNA sequences. To identify functional regulatory elements, computer searching can predict TF binding sites (TFBS using position weight matrices (PWMs that represent positional base frequencies of collected experimentally determined TFBS. A disadvantage of this approach is the large output of results for genomic DNA. One strategy to identify genuine TFBS is to utilize local concentrations of predicted TFBS. It is unclear whether there is a general tendency for TFBS to cluster at promoter regions, although this is the case for certain TFBS. Also unclear is the identification of TFs that have TFBS concentrated in promoters and to what level this occurs. This study hopes to answer some of these questions. Results We developed the cluster score measure to evaluate the correlation between predicted TFBS clusters and promoter sequences for each PWM. Non-promoter sequences were used as a control. Using the cluster score, we identified a PWM group called PWM-PCP, in which TFBS clusters positively correlate with promoters, and another PWM group called PWM-NCP, in which TFBS clusters negatively correlate with promoters. The PWM-PCP group comprises 47% of the 199 vertebrate PWMs, while the PWM-NCP group occupied 11 percent. After reducing the effect of CpG islands (CGI against the clusters using partial correlation coefficients among three properties (promoter, CGI and predicted TFBS cluster, we identified two PWM groups including those strongly correlated with CGI and those not correlated with CGI. Conclusion Not all PWMs predict TFBS correlated with human promoter sequences. Two main PWM groups were identified: (1 those that show TFBS clustered in promoters associated with CGI, and (2 those that show TFBS clustered in promoters independent of CGI. Assessment of PWM matches will allow more positive interpretation of TFBS in

  17. vanI: a novel d-Ala-d-Lac vancomycin resistance gene cluster found in Desulfitobacterium hafniense

    NARCIS (Netherlands)

    Kruse, T.; Levisson, M.; Vos, de W.M.; Smidt, H.

    2014-01-01

    The glycopeptide vancomycin was until recently considered a drug of last resort against Gram-positive bacteria. Increasing numbers of bacteria, however, are found to carry genes that confer resistance to this antibiotic. So far, 10 different vancomycin resistance clusters have been described. A

  18. Evolution and genetic population structure of prickly lettuce (Lactuca serriola) and its RGC2 resistance gene cluster

    NARCIS (Netherlands)

    Kuang, H.; Eck, van H.J.; Sicard, D.; Michelmore, R.; Nevo, E.

    2008-01-01

    Genetic structure and diversity of natural populations of prickly lettuce (Lactuca serriola) were studied using AFLP markers and then compared with the diversity of the RGC2 disease resistance gene cluster. Screening of 696 accessions from 41 populations using 319 AFLP markers showed that eastern

  19. Histone and Ribosomal RNA Repetitive Gene Clusters of the Boll Weevil are Linked in a Tandem Array

    Science.gov (United States)

    Histones are the major protein component of chromatin structure. The histone family is made up of a quintet of proteins, four core histones (H2A, H2B, H3 & H4) and the linker histones (H1). Spacers are found between the coding regions. Among insects this quintet of genes is usually clustered and ...

  20. Genetic diversity of K-antigen gene clusters of Escherichia coli and their molecular typing using a suspension array.

    Science.gov (United States)

    Yang, Shuang; Xi, Daoyi; Jing, Fuyi; Kong, Deju; Wu, Junli; Feng, Lu; Cao, Boyang; Wang, Lei

    2018-04-01

    Capsular polysaccharides (CPSs), or K-antigens, are the major surface antigens of Escherichia coli. More than 80 serologically unique K-antigens are classified into 4 groups (Groups 1-4) of capsules. Groups 1 and 4 contain the Wzy-dependent polymerization pathway and the gene clusters are in the order galF to gnd; Groups 2 and 3 contain the ABC-transporter-dependent pathway and the gene clusters consist of 3 regions, regions 1, 2 and 3. Little is known about the variations among the gene clusters. In this study, 9 serotypes of K-antigen gene clusters (K2ab, K11, K20, K24, K38, K84, K92, K96, and K102) were sequenced and correlated with their CPS chemical structures. On the basis of sequence data, a K-antigen-specific suspension array that detects 10 distinct CPSs, including the above 9 CPSs plus K30, was developed. This is the first report to catalog the genetic features of E. coli K-antigen variations and to develop a suspension array for their molecular typing. The method has a number of advantages over traditional bacteriophage and serum agglutination methods and lays the foundation for straightforward identification and detection of additional K-antigens in the future.

  1. Organization of nif gene cluster in Frankia sp. EuIK1 strain, a symbiont of Elaeagnus umbellata.

    Science.gov (United States)

    Oh, Chang Jae; Kim, Ho Bang; Kim, Jitae; Kim, Won Jin; Lee, Hyoungseok; An, Chung Sun

    2012-01-01

    The nucleotide sequence of a 20.5-kb genomic region harboring nif genes was determined and analyzed. The fragment was obtained from Frankia sp. EuIK1 strain, an indigenous symbiont of Elaeagnus umbellata. A total of 20 ORFs including 12 nif genes were identified and subjected to comparative analysis with the genome sequences of 3 Frankia strains representing diverse host plant specificities. The nucleotide and deduced amino acid sequences showed highest levels of identity with orthologous genes from an Elaeagnus-infecting strain. The gene organization patterns around the nif gene clusters were well conserved among all 4 Frankia strains. However, characteristic features appeared in the location of the nifV gene for each Frankia strain, depending on the type of host plant. Sequence analysis was performed to determine the transcription units and suggested that there could be an independent operon starting from the nifW gene in the EuIK strain. Considering the organization patterns and their total extensions on the genome, we propose that the nif gene clusters remained stable despite genetic variations occurring in the Frankia genomes.

  2. Evolution of Chromosomal Clostridium botulinum Type E Neurotoxin Gene Clusters: Evidence Provided by Their Rare Plasmid-Borne Counterparts.

    Science.gov (United States)

    Carter, Andrew T; Austin, John W; Weedmark, Kelly A; Peck, Michael W

    2016-03-02

    Analysis of more than 150 Clostridium botulinum Group II type E genomes identified a small fraction (6%) where neurotoxin-encoding genes were located on plasmids. Seven closely related (134-144 kb) neurotoxigenic plasmids of subtypes E1, E3, and E10 were characterized; all carried genes associated with plasmid mobility via conjugation. Each plasmid contained the same 24-kb neurotoxin cluster cassette (six neurotoxin cluster and six flanking genes) that had split a helicase gene, rather than the more common chromosomal rarA. The neurotoxin cluster cassettes had evolved as separate genetic units which had either exited their chromosomal rarA locus in a series of parallel events, inserting into the plasmid-borne helicase gene, or vice versa. A single intact version of the helicase gene was discovered on a nonneurotoxigenic form of this plasmid. The observed low frequency for the plasmid location may reflect one or more of the following: 1) Less efficient recombination mechanism for the helicase gene target, 2) lack of suitable target plasmids, and 3) loss of neurotoxigenic plasmids. Type E1 and E10 plasmids possessed a Clustered Regularly Interspaced Short Palindromic Repeats locus with spacers that recognized C. botulinum Group II plasmids, but not C. botulinum Group I plasmids, demonstrating their long-term separation. Clostridium botulinum Group II type E strains also carry nonneurotoxigenic plasmids closely related to C. botulinum Group II types B and F plasmids. Here, the absence of neurotoxin cassettes may be because recombination requires both a specific mechanism and specific target sequence, which are rarely found together. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  3. The short mRNA isoform of the immunoglobulin superfamily, member 1 gene encodes an intracellular glycoprotein.

    Directory of Open Access Journals (Sweden)

    Ying Wang

    Full Text Available Mutations in the immunoglobulin superfamily, member 1 gene (IGSF1/Igsf1 cause an X-linked form of central hypothyroidism. The canonical form of IGSF1 is a transmembrane glycoprotein with 12 immunoglobulin (Ig loops. The protein is co-translationally cleaved into two sub-domains. The carboxyl-terminal domain (CTD, which contains the last 7 Ig loops, is trafficked to the plasma membrane. Most pathogenic mutations in IGSF1 map to the portion of the gene encoding the CTD. IGSF1/Igsf1 encodes a variety of transcripts. A little studied, but abundant splice variant encodes a truncated form of the protein, predicted to contain the first 2 Ig loops of the full-length IGSF1. The protein (hereafter referred to as IGSF1 isoform 2 or IGSF1-2 is likely retained in most individuals with IGSF1 mutations. Here, we characterized basic biochemical properties of the protein as a foray into understanding its potential function. IGSF1-2, like the IGSF1-CTD, is a glycoprotein. In both mouse and rat, the protein is N-glycosylated at a single asparagine residue in the first Ig loop. Contrary to earlier predictions, neither the murine nor rat IGSF1-2 is secreted from heterologous or homologous cells. In addition, neither protein associates with the plasma membrane. Rather, IGSF1-2 appears to be retained in the endoplasmic reticulum. Whether the protein plays intracellular functions or is trafficked through the secretory pathway under certain physiologic or pathophysiologic conditions has yet to be determined.

  4. Insights into the evolutionary origins of clostridial neurotoxins from analysis of the Clostridium botulinum strain A neurotoxin gene cluster.

    Science.gov (United States)

    Doxey, Andrew C; Lynch, Michael D J; Müller, Kirsten M; Meiering, Elizabeth M; McConkey, Brendan J

    2008-11-14

    Clostridial neurotoxins (CNTs) are the most deadly toxins known and causal agents of botulism and tetanus neuroparalytic diseases. Despite considerable progress in understanding CNT structure and function, the evolutionary origins of CNTs remain a mystery as they are unique to Clostridium and possess a sequence and structural architecture distinct from other protein families. Uncovering the origins of CNTs would be a significant contribution to our understanding of how pathogens evolve and generate novel toxin families. The C. botulinum strain A genome was examined for potential homologues of CNTs. A key link was identified between the neurotoxin and the flagellin gene (CBO0798) located immediately upstream of the BoNT/A neurotoxin gene cluster. This flagellin sequence displayed the strongest sequence similarity to the neurotoxin and NTNH homologue out of all proteins encoded within C. botulinum strain A. The CBO0798 gene contains a unique hypervariable region, which in closely related flagellins encodes a collagenase-like domain. Remarkably, these collagenase-containing flagellins were found to possess the characteristic HEXXH zinc-protease motif responsible for the neurotoxin's endopeptidase activity. Additional links to collagenase-related sequences and functions were detected by further analysis of CNTs and surrounding genes, including sequence similarities to collagen-adhesion domains and collagenases. Furthermore, the neurotoxin's HCRn domain was found to exhibit both structural and sequence similarity to eukaryotic collagen jelly-roll domains. Multiple lines of evidence suggest that the neurotoxin and adjacent genes evolved from an ancestral collagenase-like gene cluster, linking CNTs to another major family of clostridial proteolytic toxins. Duplication, reshuffling and assembly of neighboring genes within the BoNT/A neurotoxin gene cluster may have lead to the neurotoxin's unique architecture. This work provides new insights into the evolution of C

  5. A Functional Bikaverin Biosynthesis Gene Cluster in Rare Strains of Botrytis cinerea Is Positively Controlled by VELVET

    Science.gov (United States)

    Schumacher, Julia; Gautier, Angélique; Morgant, Guillaume; Studt, Lena; Ducrot, Paul-Henri; Le Pêcheur, Pascal; Azeddine, Saad; Fillinger, Sabine; Leroux, Pierre; Tudzynski, Bettina; Viaud, Muriel

    2013-01-01

    The gene cluster responsible for the biosynthesis of the red polyketidic pigment bikaverin has only been characterized in Fusarium ssp. so far. Recently, a highly homologous but incomplete and nonfunctional bikaverin cluster has been found in the genome of the unrelated phytopathogenic fungus Botrytis cinerea. In this study, we provided evidence that rare B. cinerea strains such as 1750 have a complete and functional cluster comprising the six genes orthologous to Fusarium fujikuroi ffbik1-ffbik6 and do produce bikaverin. Phylogenetic analysis confirmed that the whole cluster was acquired from Fusarium through a horizontal gene transfer (HGT). In the bikaverin-nonproducing strain B05.10, the genes encoding bikaverin biosynthesis enzymes are nonfunctional due to deleterious mutations (bcbik2-3) or missing (bcbik1) but interestingly, the genes encoding the regulatory proteins BcBIK4 and BcBIK5 do not harbor deleterious mutations which suggests that they may still be functional. Heterologous complementation of the F. fujikuroi Δffbik4 mutant confirmed that bcbik4 of strain B05.10 is indeed fully functional. Deletion of bcvel1 in the pink strain 1750 resulted in loss of bikaverin and overproduction of melanin indicating that the VELVET protein BcVEL1 regulates the biosynthesis of the two pigments in an opposite manner. Although strain 1750 itself expresses a truncated BcVEL1 protein (100 instead of 575 aa) that is nonfunctional with regard to sclerotia formation, virulence and oxalic acid formation, it is sufficient to regulate pigment biosynthesis (bikaverin and melanin) and fenhexamid HydR2 type of resistance. Finally, a genetic cross between strain 1750 and a bikaverin-nonproducing strain sensitive to fenhexamid revealed that the functional bikaverin cluster is genetically linked to the HydR2 locus. PMID:23308280

  6. VHDL, a larval storage protein from the corn earworm, Helicoverpa zea, is a member of the vitellogenin gene family.

    Science.gov (United States)

    Sum, Herbert; Haunerland, Norbert H

    2007-10-01

    The hemolymph of last instar larvae of the corn earworm, Helicoverpa zea contains a blue very high-density lipoprotein (VHDL) that is selectively taken up into fat body prior to pupation. Its amino-terminal sequence was determined by Edman degradation, and used to design a degenerate primer for PCR amplification. With 5' and 3' RACE techniques, the entire cDNA coding for VHDL was amplified and sequenced. Conceptual translation reveals a 173 kDa protein that contains a 15 amino acid signal sequence immediately before the experimentally determined N-terminus of the mature protein. The protein contains a typical lipoprotein N-terminal domain, and shows high sequence similarity to vitellogenins from Lepidoptera and other insect species. VHDL mRNA was not detectable in adult H. zea, and antibodies raised against VHDL did not react with adult hemolymph or yolk proteins. Therefore VHDL, although a member of the vitellogenin gene family, seems to be distinct from the vitellogenin expressed in adult females.

  7. Presence of CTX gene cluster in environmental non-O1/O139 Vibrio cholerae and its potential clinical significance

    Directory of Open Access Journals (Sweden)

    B Bakhshi

    2012-01-01

    Full Text Available Purpose: The aim of this study was to understand the epidemiological linkage of clinical and environmental isolates of Vibrio cholerae and to determine their genotypes and virulence genes content. Materials and Methods: A total of 60 V. cholerae strains obtained from clinical specimens (n = 40 and surface waters (n = 20 were subjected to genotyping using PFGE and determination of their virulence-associated gene clusters. Result: PCR analysis showed the presence of chromosomally located hly and RTX genetic elements in 100% and 90% of the environmental isolates, respectively. The phage-mediated genetic elements such as CTX, TLC and VPI were detected in 5% of the environmental isolates suggesting that the environmental isolates cannot acquire certain mobile gene clusters. A total of 4 and 18 pulsotypes were obtained among the clinical and environmental V. cholerae isolates, respectively. Non-pathogenic environmentally isolated V. cholerae constituted a distinct cluster with one single non-O1, non-O139 strain (EP6 carrying the virulence genes similar to the epidemic strains. This may suggest the possible potential of conversion of non-pathogenic to a pathogenic environmental strain. Conclusions: The emergence of a single environmental isolate in our study containing the pathogenicity genes amongst the diverse non-pathogenic environmental isolates needs to be further studied in the context of V. cholerae pathogenicity sero-coversion.

  8. De novo deletion of HOXB gene cluster in a patient with failure to thrive, developmental delay, gastroesophageal reflux and bronchiectasis.

    Science.gov (United States)

    Pajusalu, Sander; Reimand, Tiia; Uibo, Oivi; Vasar, Maire; Talvik, Inga; Zilina, Olga; Tammur, Pille; Õunap, Katrin

    2015-01-01

    We report a female patient with a complex phenotype consisting of failure to thrive, developmental delay, congenital bronchiectasis, gastroesophageal reflux and bilateral inguinal hernias. Chromosomal microarray analysis revealed a 230 kilobase deletion in chromosomal region 17q21.32 (arr[hg19] 17q21.32(46 550 362-46 784 039)×1) encompassing only 9 genes - HOXB1 to HOXB9. The deletion was not found in her mother or father. This is the first report of a patient with a HOXB gene cluster deletion involving only HOXB1 to HOXB9 genes. By comparing our case to previously reported five patients with larger chromosomal aberrations involving the HOXB gene cluster, we can suppose that HOXB gene cluster deletions are responsible for growth retardation, developmental delay, and specific facial dysmorphic features. Also, we suppose that bilateral inguinal hernias, tracheo-esophageal abnormalities, and lung malformations represent features with incomplete penetrance. Interestingly, previously published knock-out mice with targeted heterozygous deletion comparable to our patient did not show phenotypic alterations. Copyright © 2015 Elsevier Masson SAS. All rights reserved.

  9. Molecular evolution of the nif gene cluster carrying nifI1 and nifI2 genes in the Gram-positive phototrophic bacterium Heliobacterium chlorum.

    Science.gov (United States)

    Enkh-Amgalan, Jigjiddorj; Kawasaki, Hiroko; Seki, Tatsuji

    2006-01-01

    A major nif cluster was detected in the strictly anaerobic, Gram-positive phototrophic bacterium Heliobacterium chlorum. The cluster consisted of 11 genes arranged within a 10 kb region in the order nifI1, nifI2, nifH, nifD, nifK, nifE, nifN, nifX, fdx, nifB and nifV. The phylogenetic position of Hbt. chlorum was the same in the NifH, NifD, NifK, NifE and NifN trees; Hbt. chlorum formed a cluster with Desulfitobacterium hafniense, the closest neighbour of heliobacteria based on the 16S rRNA phylogeny, and two species of the genus Geobacter belonging to the Deltaproteobacteria. Two nifI genes, known to occur in the nif clusters of methanogenic archaea between nifH and nifD, were found upstream of the nifH gene of Hbt. chlorum. The organization of the nif operon and the phylogeny of individual and concatenated gene products showed that the Hbt. chlorum nif operon carrying nifI genes upstream of the nifH gene was an intermediate between the nif operon with nifI downstream of nifH (group II and III of the nitrogenase classification) and the nif operon lacking nifI (group I). Thus, the phylogenetic position of Hbt. chlorum nitrogenase may reflect an evolutionary stage of a divergence of the two nitrogenase groups, with group I consisting of the aerobic diazotrophs and group II consisting of strictly anaerobic prokaryotes.

  10. Relational analysis of CpG islands methylation and gene expression in human lymphomas using possibilistic C-means clustering and modified cluster fuzzy density.

    Science.gov (United States)

    Sjahputera, Ozy; Keller, James M; Davis, J Wade; Taylor, Kristen H; Rahmatpanah, Farahnaz; Shi, Huidong; Anderson, Derek T; Blisard, Samuel N; Luke, Robert H; Popescu, Mihail; Arthur, Gerald C; Caldwell, Charles W

    2007-01-01

    Heterogeneous genetic and epigenetic alterations are commonly found in human non-Hodgkin's lymphomas (NHL). One such epigenetic alteration is aberrant methylation of gene promoter-related CpG islands, where hypermethylation frequently results in transcriptional inactivation of target genes, while a decrease or loss of promoter methylation (hypomethylation) is frequently associated with transcriptional activation. Discovering genes with these relationships in NHL or other types of cancers could lead to a better understanding of the pathobiology of these diseases. The simultaneous analysis of promoter methylation using Differential Methylation Hybridization (DMH) and its associated gene expression using Expressed CpG Island Sequence Tag (ECIST) microarrays generates a large volume of methylation-expression relational data. To analyze this data, we propose a set of algorithms based on fuzzy sets theory, in particular Possibilistic c-Means (PCM) and cluster fuzzy density. For each gene, these algorithms calculate measures of confidence of various methylation-expression relationships in each NHL subclass. Thus, these tools can be used as a means of high volume data exploration to better guide biological confirmation using independent molecular biology methods.

  11. Identification and characterization of a biosynthetic gene cluster for tryptophan dimers in deep sea-derived Streptomyces sp. SCSIO 03032.

    Science.gov (United States)

    Ma, Liang; Zhang, Wenjun; Zhu, Yiguang; Zhang, Guangtao; Zhang, Haibo; Zhang, Qingbo; Zhang, Liping; Yuan, Chengshan; Zhang, Changsheng

    2017-08-01

    Tryptophan dimers (TDs) are an important class of natural products with diverse bioactivities and share conserved biosynthetic pathways. We report the identification of a partial gene cluster (spm) responsible for the biosynthesis of a class of unusual TDs with non-planar skeletons including spiroindimicins (SPMs), indimicins (IDMs), and lynamicins (LNMs) from the deep-sea derived Streptomyces sp. SCSIO 03032. Bioinformatics analysis, targeted gene disruptions, and heterologous expression studies confirmed the involvement of the spm gene cluster in the biosynthesis of SPM/IDM/LNMs, and revealed the indispensable roles for the halogenase/reductase pair SpmHF, the amino acid oxidase SpmO, and the chromopyrrolic acid (CPA) synthase SpmD, as well as the positive regulator SpmR and the putative transporter SpmA. However, the spm gene cluster was unable to confer a heterologous host the ability to produce SPM/IDM/LNMs. In addition, the P450 enzyme SpmP and the monooxygenase SpmX2 were found to be non-relevant to the biosynthesis of SPM/IDM/LNMs. Sequence alignment and structure modeling suggested the lack of key conserved amino acid residues in the substrate-binding pocket of SpmP. Furthermore, feeding experiments in the non-producing ΔspmO mutant revealed several biosynthetic precursors en route to SPMs, indicating that key enzymes responsible for the biosynthesis of SPMs should be encoded by genes outside of the identified spm gene cluster. Finally, the biosynthetic pathways of SPM/IDM/LNMs are proposed to lay a basis for further insights into their intriguing biosynthetic machinery.

  12. Genes encoding Cher-TPR fusion proteins are predominantly found in gene clusters encoding chemosensory pathways with alternative cellular functions.

    Directory of Open Access Journals (Sweden)

    Francisco Muñoz-Martínez

    Full Text Available Chemosensory pathways correspond to major signal transduction mechanisms and can be classified into the functional families flagellum-mediated taxis, type four pili-mediated taxis or pathways with alternative cellular functions (ACF. CheR methyltransferases are core enzymes in all of these families. CheR proteins fused to tetratricopeptide repeat (TPR domains have been reported and we present an analysis of this uncharacterized family. We show that CheR-TPRs are widely distributed in GRAM-negative but almost absent from GRAM-positive bacteria. Most strains contain a single CheR-TPR and its abundance does not correlate with the number of chemoreceptors. The TPR domain fused to CheR is comparatively short and frequently composed of 2 repeats. The majority of CheR-TPR genes were found in gene clusters that harbor multidomain response regulators in which the REC domain is fused to different output domains like HK, GGDEF, EAL, HPT, AAA, PAS, GAF, additional REC, HTH, phosphatase or combinations thereof. The response regulator architectures coincide with those reported for the ACF family of pathways. Since the presence of multidomain response regulators is a distinctive feature of this pathway family, we conclude that CheR-TPR proteins form part of ACF type pathways. The diversity of response regulator output domains suggests that the ACF pathways form a superfamily which regroups many different regulatory mechanisms, in which all CheR-TPR proteins appear to participate. In the second part we characterize WspC of Pseudomonas putida, a representative example of CheR-TPR. The affinities of WspC-Pp for S-adenosylmethionine and S-adenosylhomocysteine were comparable to those of prototypal CheR, indicating that WspC-Pp activity is in analogy to prototypal CheRs controlled by product feed-back inhibition. The removal of the TPR domain did not impact significantly on the binding constants and consequently not on the product feed-back inhibition. WspC-Pp was

  13. Genes encoding Cher-TPR fusion proteins are predominantly found in gene clusters encoding chemosensory pathways with alternative cellular functions.

    Science.gov (United States)

    Muñoz-Martínez, Francisco; García-Fontana, Cristina; Rico-Jiménez, Miriam; Alfonso, Carlos; Krell, Tino

    2012-01-01

    Chemosensory pathways correspond to major signal transduction mechanisms and can be classified into the functional families flagellum-mediated taxis, type four pili-mediated taxis or pathways with alternative cellular functions (ACF). CheR methyltransferases are core enzymes in all of these families. CheR proteins fused to tetratricopeptide repeat (TPR) domains have been reported and we present an analysis of this uncharacterized family. We show that CheR-TPRs are widely distributed in GRAM-negative but almost absent from GRAM-positive bacteria. Most strains contain a single CheR-TPR and its abundance does not correlate with the number of chemoreceptors. The TPR domain fused to CheR is comparatively short and frequently composed of 2 repeats. The majority of CheR-TPR genes were found in gene clusters that harbor multidomain response regulators in which the REC domain is fused to different output domains like HK, GGDEF, EAL, HPT, AAA, PAS, GAF, additional REC, HTH, phosphatase or combinations thereof. The response regulator architectures coincide with those reported for the ACF family of pathways. Since the presence of multidomain response regulators is a distinctive feature of this pathway family, we conclude that CheR-TPR proteins form part of ACF type pathways. The diversity of response regulator output domains suggests that the ACF pathways form a superfamily which regroups many different regulatory mechanisms, in which all CheR-TPR proteins appear to participate. In the second part we characterize WspC of Pseudomonas putida, a representative example of CheR-TPR. The affinities of WspC-Pp for S-adenosylmethionine and S-adenosylhomocysteine were comparable to those of prototypal CheR, indicating that WspC-Pp activity is in analogy to prototypal CheRs controlled by product feed-back inhibition. The removal of the TPR domain did not impact significantly on the binding constants and consequently not on the product feed-back inhibition. WspC-Pp was found to be

  14. Sequencing and Transcriptional Analysis of the Biosynthesis Gene Cluster of Putrescine-Producing Lactococcus lactis ▿ †

    Science.gov (United States)

    Ladero, Victor; Rattray, Fergal P.; Mayo, Baltasar; Martín, María Cruz; Fernández, María; Alvarez, Miguel A.

    2011-01-01

    Lactococcus lactis is a prokaryotic microorganism with great importance as a culture starter and has become the model species among the lactic acid bacteria. The long and safe history of use of L. lactis in dairy fermentations has resulted in the classification of this species as GRAS (General Regarded As Safe) or QPS (Qualified Presumption of Safety). However, our group has identified several strains of L. lactis subsp. lactis and L. lactis subsp. cremoris that are able to produce putrescine from agmatine via the agmatine deiminase (AGDI) pathway. Putrescine is a biogenic amine that confers undesirable flavor characteristics and may even have toxic effects. The AGDI cluster of L. lactis is composed of a putative regulatory gene, aguR, followed by the genes (aguB, aguD, aguA, and aguC) encoding the catabolic enzymes. These genes are transcribed as an operon that is induced in the presence of agmatine. In some strains, an insertion (IS) element interrupts the transcription of the cluster, which results in a non-putrescine-producing phenotype. Based on this knowledge, a PCR-based test was developed in order to differentiate nonproducing L. lactis strains from those with a functional AGDI cluster. The analysis of the AGDI cluster and their flanking regions revealed that the capacity to produce putrescine via the AGDI pathway could be a specific characteristic that was lost during the adaptation to the milk environment by a process of reductive genome evolution. PMID:21803900

  15. Genome-wide analysis and evolutionary study of sucrose non-fermenting 1-related protein kinase 2 (SnRK2) gene family members in Arabidopsis and Oryza.

    Science.gov (United States)

    Saha, Jayita; Chatterjee, Chitrita; Sengupta, Atreyee; Gupta, Kamala; Gupta, Bhaskar

    2014-04-01

    The over-expression of plant specific SnRK2 gene family members by hyperosmotic stress and some by abscisic acid is well established. In this report, we have analyzed the evolution of SnRK2 gene family in different plant lineages including green algae, moss, lycophyte, dicot and monocot. Our results provide some evidences to indicate that the natural selection pressure had considerable influence on cis-regulatory promoter region and coding region of SnRK2 members in Arabidopsis and Oryza independently through time. Observed degree of sequence/motif conservation amongst SnRK2 homolog in all the analyzed plant lineages strongly supported their inclusion as members of this family. The chromosomal distributions of duplicated SnRK2 members have also been analyzed in Arabidopsis and Oryza. Massively Parallel Signature Sequencing (MPSS) database derived expression data and the presence of abiotic stress related promoter elements within the 1 kb upstream promoter region of these SnRK2 family members further strengthen the observations of previous workers. Additionally, the phylogenetic relationships of SnRK2 have been studied in all plant lineages along with their respective exon-intron structural patterns. Our results indicate that the ancestral SnRK2 gene of land plants gradually evolved by duplication and diversification and modified itself through exon-intron loss events to survive under environmental stress conditions. Copyright © 2013 Elsevier Ltd. All rights reserved.

  16. Endophytic actinobacteria: Diversity, secondary metabolism and mechanisms to unsilence biosynthetic gene clusters.

    Science.gov (United States)

    Dinesh, Raghavan; Srinivasan, Veeraraghavan; T E, Sheeja; Anandaraj, Muthuswamy; Srambikkal, Hamza

    2017-09-01

    Endophytic actinobacteria, which reside in the inner tissues of host plants, are gaining serious attention due to their capacity to produce a plethora of secondary metabolites (e.g. antibiotics) possessing a wide variety of biological activity with diverse functions. This review encompasses the recent reports on endophytic actinobacterial species diversity, in planta habitats and mechanisms underlying their mode of entry into plants. Besides, their metabolic potential, novel bioactive compounds they produce and mechanisms to unravel their hidden metabolic repertoire by activation of cryptic or silent biosynthetic gene clusters (BGCs) for eliciting novel secondary metabolite production are discussed. The study also reviews the classical conservative techniques (chemical/biological/physical elicitation, co-culturing) as well as modern microbiology tools (e.g. next generation sequencing) that are being gainfully employed to uncover the vast hidden scaffolds for novel secondary metabolites produced by these endophytes, which would subsequently herald a revolution in drug engineering. The potential role of these endophytes in the agro-environment as promising biological candidates for inhibition of phytopathogens and the way forward to thoroughly exploit this unique microbial community by inducing expression of cryptic BGCs for encoding unseen products with novel therapeutic properties are also discussed.

  17. The N‐acetylglucosamine catabolic gene cluster in Trichoderma reesei is controlled by the Ndt80‐like transcription factor RON1

    Science.gov (United States)

    Kappel, Lisa; Gaderer, Romana; Flipphi, Michel

    2015-01-01

    Summary Chitin is an important structural constituent of fungal cell walls composed of N‐acetylglucosamine (GlcNAc) monosaccharides, but catabolism of GlcNAc has not been studied in filamentous fungi so far. In the yeast C andida albicans, the genes encoding the three enzymes responsible for stepwise conversion of GlcNAc to fructose‐6‐phosphate are clustered. In this work, we analysed GlcNAc catabolism in ascomycete filamentous fungi and found that the respective genes are also clustered in these fungi. In contrast to C . albicans, the cluster often contains a gene for an Ndt80‐like transcription factor, which we named RON1 (regulator of N‐acetylglucosamine catabolism 1). Further, a gene for a glycoside hydrolase 3 protein related to bacterial N‐acetylglucosaminidases can be found in the GlcNAc gene cluster in filamentous fungi. Functional analysis in T richoderma reesei showed that the transcription factor RON1 is a key activator of the GlcNAc gene cluster and essential for GlcNAc catabolism. Furthermore, we present an evolutionary analysis of Ndt80‐like proteins in Ascomycota. All GlcNAc cluster genes, as well as the GlcNAc transporter gene ngt1, and an additional transcriptional regulator gene, csp2, encoding the homolog of N eurospora crassa  CSP2/GRHL, were functionally characterised by gene expression analysis and phenotypic characterisation of knockout strains in T . reesei. PMID:26481444

  18. Molecular Typing and Virulence Gene Profiles of Enterotoxin Gene Cluster (egc)-Positive Staphylococcus aureus Isolates Obtained from Various Food and Clinical Specimens.

    Science.gov (United States)

    Song, Minghui; Shi, Chunlei; Xu, Xuebing; Shi, Xianming

    2016-11-01

    The enterotoxin gene cluster (egc) has been proposed to contribute to the Staphylococcus aureus colonization, which highlights the need to evaluate genetic diversity and virulence gene profiles of the egc-positive population. Here, a total of 43 egc-positive isolates (16.2%) were identified from 266 S. aureus isolates that were obtained from various food and clinical specimens in Shanghai. Seven different egc profiles were found based on the polymerase chain reaction (PCR) result for egc genes. Then, these 43 egc-positive isolates were further typed by multilocus sequence typing, pulsed-field gel electrophoresis (PFGE), multiple-locus variable-number tandem-repeat analysis (MLVA), and accessory gene regulatory (agr) typing. It showed that the 43 egc-positive isolates displayed 17 sequence types, 28 PFGE patterns, 29 MLVA types, and 4 agr types, respectively. Among them, the dominant clonal lineage was CC5-agr II (48.84%). Thirty toxin and 20 adhesion-associated genes were detected by PCR in egc-positive isolates. Notably, invasive toxin genes showed a high prevalence, such as 76.7% for Panton-Valentine leukocidin encoding genes, 27.9% for sec, and 23.3% for tsst-1. Most of the examined adhesion-associated genes were found to be conserved (76.7-100%), whereas the fnbB gene was only found in 8 (18.6%) isolates. In addition, 33 toxin gene profiles and 13 adhesion gene profiles were identified, respectively. Our results imply that isolates belonging to the same clonal lineage harbored similar adhesion gene profiles but diverse toxin gene profiles. Overall, the high prevalence of invasive virulence genes increases the potential risk of egc-positive isolates in S. aureus infection.

  19. Identification of the chelocardin biosynthetic gene cluster from Amycolatopsis sulphurea: a platform for producing novel tetracycline antibiotics.

    Science.gov (United States)

    Lukežič, Tadeja; Lešnik, Urška; Podgoršek, Ajda; Horvat, Jaka; Polak, Tomaž; Šala, Martin; Jenko, Branko; Raspor, Peter; Herron, Paul R; Hunter, Iain S; Petković, Hrvoje

    2013-12-01

    Tetracyclines (TCs) are medically important antibiotics from the polyketide family of natural products. Chelocardin (CHD), produced by Amycolatopsis sulphurea, is a broad-spectrum tetracyclic antibiotic with potent bacteriolytic activity against a number of Gram-positive and Gram-negative multi-resistant pathogens. CHD has an unknown mode of action that is different from TCs. It has some structural features that define it as 'atypical' and, notably, is active against tetracycline-resistant pathogens. Identification and characterization of the chelocardin biosynthetic gene cluster from A. sulphurea revealed 18 putative open reading frames including a type II polyketide synthase. Compared to typical TCs, the chd cluster contains a number of features that relate to its classification as 'atypical': an additional gene for a putative two-component cyclase/aromatase that may be responsible for the different aromatization pattern, a gene for a putative aminotransferase for C-4 with the opposite stereochemistry to TCs and a gene for a putative C-9 methylase that is a unique feature of this biosynthetic cluster within the TCs. Collectively, these enzymes deliver a molecule with different aromatization of ring C that results in an unusual planar structure of the TC backbone. This is a likely contributor to its different mode of action. In addition CHD biosynthesis is primed with acetate, unlike the TCs, which are primed with malonamate, and offers a biosynthetic engineering platform that represents a unique opportunity for efficient generation of novel tetracyclic backbones using combinatorial biosynthesis.

  20. Manual annotation and analysis of the defensin gene cluster in the C57BL/6J mouse reference genome

    Directory of Open Access Journals (Sweden)

    Dougan Gordon

    2009-12-01

    Full Text Available Abstract Background Host defense peptides are a critical component of the innate immune system. Human alpha- and beta-defensin genes are subject to copy number variation (CNV and historically the organization of mouse alpha-defensin genes has been poorly defined. Here we present the first full manual genomic annotation of the mouse defensin region on Chromosome 8 of the reference strain C57BL/6J, and the analysis of the orthologous regions of the human and rat genomes. Problems were identified with the reference assemblies of all three genomes. Defensins have been studied for over two decades and their naming has become a critical issue due to incorrect identification of defensin genes derived from different mouse strains and the duplicated nature of this region. Results The defensin gene cluster region on mouse Chromosome 8 A2 contains 98 gene loci: 53 are likely active defensin genes and 22 defensin pseudogenes. Several TATA box motifs were found for human and mouse defensin genes that likely impact gene expression. Three novel defensin genes belonging to the Cryptdin Related Sequences (CRS family were identified. All additional mouse defensin loci on Chromosomes 1, 2 and 14 were annotated and unusual splice variants identified. Comparison of the mouse alpha-defensins in the three main mouse reference gene sets Ensembl, Mouse Genome Informatics (MGI, and NCBI RefSeq reveals significant inconsistencies in annotation and nomenclature. We are collaborating with the Mouse Genome Nomenclature Committee (MGNC to establish a standardized naming scheme for alpha-defensins. Conclusions Prior to this analysis, there was no reliable reference gene set available for the mouse strain C57BL/6J defensin genes, demonstrating that manual intervention is still critical for the annotation of complex gene families and heavily duplicated regions. Accurate gene annotation is facilitated by the annotation of pseudogenes and regulatory elements. Manually curated gene

  1. Defining reference sequences for Nocardia species by similarity and clustering analyses of 16S rRNA gene sequence data.

    Directory of Open Access Journals (Sweden)

    Manal Helal

    Full Text Available BACKGROUND: The intra- and inter-species genetic diversity of bacteria and the absence of 'reference', or the most representative, sequences of individual species present a significant challenge for sequence-based identification. The aims of this study were to determine the utility, and compare the performance of several clustering and classification algorithms to identify the species of 364 sequences of 16S rRNA gene with a defined species in GenBank, and 110 sequences of 16S rRNA gene with no defined species, all within the genus Nocardia. METHODS: A total of 364 16S rRNA gene sequences of Nocardia species were studied. In addition, 110 16S rRNA gene sequences assigned only to the Nocardia genus level at the time of submission to GenBank were used for machine learning classification experiments. Different clustering algorithms were compared with a novel algorithm or the linear mapping (LM of the distance matrix. Principal Components Analysis was used for the dimensionality reduction and visualization. RESULTS: The LM algorithm achieved the highest performance and classified the set of 364 16S rRNA sequences into 80 clusters, the majority of which (83.52% corresponded with the original species. The most representative 16S rRNA sequences for individual Nocardia species have been identified as 'centroids' in respective clusters from which the distances to all other sequences were minimized; 110 16S rRNA gene sequences with identifications recorded only at the genus level were classified using machine learning methods. Simple kNN machine learning demonstrated the highest performance and classified Nocardia species sequences with an accuracy of 92.7% and a mean frequency of 0.578. CONCLUSION: The identification of centroids of 16S rRNA gene sequence clusters using novel distance matrix clustering enables the identification of the most representative sequences for each individual species of Nocardia and allows the quantitation of inter- and intra

  2. Clustered organization, polycistronic transcription, and evolution of modification-guide snoRNA genes in Euglena gracilis.

    Science.gov (United States)

    Moore, Ashley N; Russell, Anthony G

    2012-01-01

    Previous studies have shown that the eukaryotic microbe Euglena gracilis contains an unusually large assortment of small nucleolar RNAs (snoRNAs) and ribosomal RNA (rRNA) modification sites. However, little is known about the evolutionary mechanisms contributing to this situation. In this study, we have examined the organization and evolution of snoRNA genes in Euglena with the additional objective of determining how these properties relate to the rRNA modification pattern in this protist. We have identified and extensively characterized a clustered pattern of genes encoding previously biochemically isolated snoRNA sequences in E. gracilis. We show that polycistronic transcription is a prevalent snoRNA gene expression strategy in this organism. Further, we have identified 121 new snoRNA coding regions through sequence analysis of these clusters. We have identified an E. gracilis U14 snoRNA homolog clustered with modification-guide snoRNA genes. The U14 snoRNAs in other eukaryotic organisms examined to date typically contain both a modification and a processing domain. E. gracilis U14 lacks the modification domain but retains the processing domain. Our analysis of U14 structure and evolution in Euglena and other eukaryotes allows us to propose a model for its evolution and suggest its processing role may be its more important function, explaining its conservation in many eukaryotes. The preponderance of apparent small and larger-scale duplication events in the genomic regions we have characterized in Euglena provides a mechanism for the generation of the unusually diverse collection and abundance of snoRNAs and modified rRNA sites. Our findings provide the framework for more extensive whole genome analysis to elucidate whether these snoRNA gene clusters are spread across multiple chromosomes and/or form dense "arrays" at a limited number of chromosomal loci.

  3. The DUB/USP17 deubiquitinating enzymes: A gene family within a tandemly repeated sequence, is also embedded within the copy number variable Beta-defensin cluster

    Directory of Open Access Journals (Sweden)

    Scott Christopher J

    2010-04-01

    Full Text Available Abstract Background The DUB/USP17 subfamily of deubiquitinating enzymes were originally identified as immediate early genes induced in response to cytokine stimulation in mice (DUB-1, DUB-1A, DUB-2, DUB-2A. Subsequently we have identified a number of human family members and shown that one of these (DUB-3 is also cytokine inducible. We originally showed that constitutive expression of DUB-3 can block cell proliferation and more recently we have demonstrated that this is due to its regulation of the ubiquitination and activity of the 'CAAX' box protease RCE1. Results Here we demonstrate that the human DUB/USP17 family members are found on both chromosome 4p16.1, within a block of tandem repeats, and on chromosome 8p23.1, embedded within the copy number variable beta-defensin cluster. In addition, we show that the multiple genes observed in humans and other distantly related mammals have arisen due to the independent expansion of an ancestral sequence within each species. However, it is also apparent when sequences from humans and the more closely related chimpanzee are compared, that duplication events have taken place prior to these species separating. Conclusions The observation that the DUB/USP17 genes, which can influence cell growth and survival, have evolved from an unstable ancestral sequence which has undergone multiple and varied duplications in the species examined marks this as a unique family. In addition, their presence within the beta-defensin repeat raises the question whether they may contribute to the influence of this repeat on immune related conditions.

  4. A pseudogene cluster in the leader region of the Euglena chloroplast 16S-23S rRNA genes.

    Science.gov (United States)

    Miyata, T; Kikuno, R; Ohshima, Y

    1982-01-01

    The nucleotide sequence of a region (leader region) preceding the 5'-end of 16S-23S rRNA gene region of Euglena gracilis chloroplast DNA was compared with the homologous sequences that code for the 16S-23S rRNA operons of Euglena and E. coli. The leader region shows close homology in sequence to the 16S-23S rRNA gene region of Euglena (Orozco et al. (1980) J. Biol.Chem. 255, 10997-11003) as well as to the rrnD operon of E. coli, suggesting that it was derived from the 16S-23S rRNA gene region by gene duplication. It was shown that the leader region had accumulated nucleotide substitutions at an extremely rapid rate in its entirety, similar to the rate of tRNAIle pseudogene identified in the leader region. In addition, the leader region shows an unique base content which is quite distinct from those of 16S-23S rRNA gene regions of Euglena and E. coli, but again is similar to that of the tRNAIle pseudogene. The above two results strongly suggest that the leader region contains a pseudogene cluster which was derived from a gene cluster coding for the functional 16S-23S rRNA operon possibly by imperfect duplication during evolution of Euglena chloroplast DNA. PMID:7041094

  5. Phylogeography of var gene repertoires reveals fine-scale geospatial clustering of Plasmodium falciparum populations in a highly endemic area.

    Science.gov (United States)

    Tessema, Sofonias K; Monk, Stephanie L; Schultz, Mark B; Tavul, Livingstone; Reeder, John C; Siba, Peter M; Mueller, Ivo; Barry, Alyssa E

    2015-01-01

    Plasmodium falciparum malaria is a major global health problem that is being targeted for progressive elimination. Knowledge of local disease transmission patterns in endemic countries is critical to these elimination efforts. To investigate fine-scale patterns of malaria transmission, we have compared repertoires of rapidly evolving var genes in a highly endemic area. A total of 3680 high-quality DBLα-sequences were obtained from 68 P. falciparum isolates from ten villages spread over two distinct catchment areas on the north coast of Papua New Guinea (PNG). Modelling of the extent of var gene diversity in the two parasite populations predicts more than twice as many var gene alleles circulating within each catchment (Mugil = 906; Wosera = 1094) than previously recognized in PNG (Amele = 369). In addition, there were limited levels of var gene sharing between populations, consistent with local parasite population structure. Phylogeographic analyses demonstrate that while neutrally evolving microsatellite markers identified population structure only at the catchment level, var gene repertoires reveal further fine-scale geospatial clustering of parasite isolates. The clustering of parasite isolates by village in Mugil, but not in Wosera was consistent with the physical and cultural isolation of the human populations in the two catchments. The study highlights the microheterogeneity of P. falciparum transmission in highly endemic areas and demonstrates the potential of var genes as markers of local patterns of parasite population structure. © 2014 John Wiley & Sons Ltd.

  6. Molecular identification and characterization of clustered regularly interspaced short palindromic repeat (CRISPR) gene cluster in Taylorella equigenitalis.

    Science.gov (United States)

    Hara, Yasushi; Hayashi, Kyohei; Nakajima, Takuya; Kagawa, Shizuko; Tazumi, Akihiro; Moore, John E; Matsuda, Motoo

    2013-09-01

    Clustered regularly interspaced short palindromic repeats (CRISPRs), of approximately 10,000 base pairs (bp) in length, were shown to occur in the Japanese Taylorella equigenitalis strain, EQ59. The locus was composed of the putative CRISPRs-associated with 5 (cas5), RAMP csd1, csd2, recB, cas1, a leader region, 13 CRISPR consensus sequence repeats (each 32 bp; 5'-TCAGCCACGTTCGCGTGGCTGTGTGTTTAAAG-3'). These were in turn separated by 12 non repetitive unique spacer regions of similar length. In addition, a leader region, a transposase/IS protein, a leader region, and cas3 were also seen. All seven putative open reading frames carry their ribosome binding sites. Promoter consensus sequences at the -35 and -10 regions and putative intrinsic ρ-independent transcription terminator regions also occurred. A possible long overlap of 170 bp in length occurred between the recB and cas1 loci. Positive reverse transcription PCR signals of cas5, RAMP csd1, csd2-recB/cas1, and cas3 were generated. A putative secondary structure of the CRISPR consensus repeats was constructed. Following this, CRISPR results of the T. equigenitalis EQ59 isolate were subsequently compared with those from the Taylorella asinigenitalis MCE3 isolate.

  7. Overexpression of Hoxc13 in differentiating keratinocytes results in downregulation of a novel hair keratin gene cluster and alopecia.

    Science.gov (United States)

    Tkatchenko, A V; Visconti, R P; Shang, L; Papenbrock, T; Pruett, N D; Ito, T; Ogawa, M; Awgulewitsch, A

    2001-05-01

    Studying the roles of Hox genes in normal and pathological development of skin and hair requires identification of downstream target genes in genetically defined animal models. We show that transgenic mice overexpressing Hoxc13 in differentiating keratinocytes of hair follicles develop alopecia, accompanied by a progressive pathological skin condition that resembles ichthyosis. Large-scale analysis of differential gene expression in postnatal skin of these mice identified 16 previously unknown and 13 known genes as presumptive Hoxc13 targets. The majority of these targets are downregulated and belong to a subgroup of genes that encode hair-specific keratin-associated proteins (KAPs). Genomic mapping using a mouse hamster radiation hybrid panel showed these genes to reside in a novel KAP gene cluster on mouse chromosome 16 in a region of conserved linkage with human chromosome 21q22.11. Furthermore, data obtained by Hoxc13/lacZ reporter gene analysis in mice that overexpress Hoxc13 suggest negative autoregulatory feedback control of Hoxc13 expression levels, thus providing an entry point for elucidating currently unknown mechanisms that are required for regulating quantitative levels of Hox gene expression. Combined, these results provide a framework for understanding molecular mechanisms of Hoxc13 function in hair growth and development.

  8. Prevalence of the lmo0036-0043 gene cluster encoding arginine deiminase and agmatine deiminase systems in Listeria monocytogenes.

    Science.gov (United States)

    Chen, Jianshun; Chen, Fan; Cheng, Changyong; Fang, Weihuan

    2013-04-01

    Arginine deiminase and agmatine deiminase systems are involved in acid tolerance, and their encoding genes form the cluster lmo0036-0043 in Listeria monocytogenes. While lmo0042 and lmo0043 were conserved in all L. monocytogenes strains, the lmo0036-0041 region of this cluster was identified in all lineages I and II, and the majority of lineage IV (83.3%) strains, but absent in all lineage III and a small fraction of lineage IV (16.7%) strains, suggesting that the presence of the complete lmo0036-0043 cluster is dependent on lineages. lmo0036-0043-complete and -deficient lineage IV strains exhibit specific ascB-dapE profiles, which might represent two subpopulations with distinct genetic characteristics.

  9. Mapping in an apple (Malus x domestica) F1 segregating population based on physical clustering of differentially expressed genes.

    Science.gov (United States)

    Jensen, Philip J; Fazio, Gennaro; Altman, Naomi; Praul, Craig; McNellis, Timothy W

    2014-04-04

    Apple tree breeding is slow and difficult due to long generation times, self-incompatibility, and complex genetics. The identification of molecular markers linked to traits of interest is a way to expedite the breeding process. In the present study, we aimed to identify genes whose steady-state transcript abundance was associated with inheritance of specific traits segregating in an apple (Malus × domestica) rootstock F1 breeding population, including resistance to powdery mildew (Podosphaera leucotricha) disease and woolly apple aphid (Eriosoma lanigerum). Transcription profiling was performed for 48 individual F1 apple trees from a cross of two highly heterozygous parents, using RNA isolated from healthy, actively-growing shoot tips and a custom apple DNA oligonucleotide microarray representing 26,000 unique transcripts. Genome-wide expression profiles were not clear indicators of powdery mildew or woolly apple aphid resistance phenotype. However, standard differential gene expression analysis between phenotypic groups of trees revealed relatively small sets of genes with trait-associated expression levels. For example, thirty genes were identified that were differentially expressed between trees resistant and susceptible to powdery mildew. Interestingly, the genes encoding twenty-four of these transcripts were physically clustered on chromosome 12. Similarly, seven genes were identified that were differentially expressed between trees resistant and susceptible to woolly apple aphid, and the genes encoding five of these transcripts were also clustered, this time on chromosome 17. In each case, the gene clusters were in the vicinity of previously identified major quantitative trait loci for the corresponding trait. Similar results were obtained for a series of molecular traits. Several of the differentially expressed genes were used to develop DNA polymorphism markers linked to powdery mildew disease and woolly apple aphid resistance. Gene expression profiling

  10. In silico analysis highlights the frequency and diversity of type 1 lantibiotic gene clusters in genome sequenced bacteria

    LENUS (Irish Health Repository)

    Marsh, Alan J

    2010-11-30

    Abstract Background Lantibiotics are lanthionine-containing, post-translationally modified antimicrobial peptides. These peptides have significant, but largely untapped, potential as preservatives and chemotherapeutic agents. Type 1 lantibiotics are those in which lanthionine residues are introduced into the structural peptide (LanA) through the activity of separate lanthionine dehydratase (LanB) and lanthionine synthetase (LanC) enzymes. Here we take advantage of the conserved nature of LanC enzymes to devise an in silico approach to identify potential lantibiotic-encoding gene clusters in genome sequenced bacteria. Results In total 49 novel type 1 lantibiotic clusters were identified which unexpectedly were associated with species, genera and even phyla of bacteria which have not previously been associated with lantibiotic production. Conclusions Multiple type 1 lantibiotic gene clusters were identified at a frequency that suggests that these antimicrobials are much more widespread than previously thought. These clusters represent a rich repository which can yield a large number of valuable novel antimicrobials and biosynthetic enzymes.

  11. Identification and activation of novel biosynthetic gene clusters by genome mining in the kirromycin producer Streptomyces collinus Tü 365

    DEFF Research Database (Denmark)

    Iftime, Dumitrita; Kulik, Andreas; Härtner, Thomas

    2016-01-01

    Streptomycetes are prolific sources of novel biologically active secondary metabolites with pharmaceutical potential. S. collinus Tü 365 is a Streptomyces strain, isolated 1972 from Kouroussa (Guinea). It is best known as producer of the antibiotic kirromycin, an inhibitor of the protein biosynth...... of a lanthipeptide, a carotenoid, five terpenoid compounds, an ectoine, a siderophore and a spore pigment-associated gene cluster to their respective biosynthesis products....

  12. Variants in linkage disequilibrium with the late cornified envelope gene cluster deletion are associated with susceptibility to psoriatic arthritis.

    LENUS (Irish Health Repository)

    Bowes, John

    2010-12-01

    A common deletion mapping to the psoriasis susceptibility locus 4 on chromosome 1q21, encompassing two genes of the late cornified envelope (LCE) gene cluster, has been associated with an increased risk of psoriasis vulgaris (PsV). One previous report found no association of the deletion with psoriatic arthritis (PsA), suggesting it may be a specific risk factor for PsV. Given the genetic overlap between PsA and PsV, a study was undertaken to investigate whether single nucleotide polymorphisms (SNPs) mapping to this locus are risk factors for PsA in a UK and Irish population.

  13. Beta-globin gene cluster haplotypes in the Mapuche Indians of Argentina

    Directory of Open Access Journals (Sweden)

    Letícia Kaufman

    1998-12-01

    Full Text Available Haplotypes derived from five polymorphic restriction sites in the beta-globin gene cluster were investigated in 86 chromosomes from the Argentinian Mapuche. These results were integrated with those previously obtained for ten Brazilian Indian tribes. Eight haplotypes were identified, the most frequent being 2 (57% and 6 (27%. The presence of haplotype 3 in 2% of the Mapuche chromosomes is probably an evidence of admixture with individuals of African ancestry. Due to the high number of haplotypes observed, heterozygosity as measured by the Gini-Simpson index was higher in the Mapuche than in Brazilian Indians. The haplotypic distribution in the Mapuche was also significantly different from those of all Brazilian tribes investigated. This heterogeneity could be at least partially explained by admixture with non-Indian populations.Haplótipos derivados de cinco sítios de restrição polimórficos presentes no agrupamento da globina beta foram investigados em 86 cromossomos da população mapuche da Argentina. Esses resultados foram analisados em conjunto com os previamente obtidos para dez tribos indígenas brasileiras. Oito haplótipos foram identificados, dos quais os mais freqüentes foram o 2 (57% e o 6 (27%. A presença do haplótipo 3 em 2% dos cromossomos dos Mapuches é uma evidência de mistura com indivíduos de ancestralidade africana. Devido ao alto número de haplótipos, a heterozigosidade medida pelo índice Gini-Simpson é mais alta nos Mapuches do que nos índios brasileiros. A distribuição haplotípica nos Mapuches é também significativamente diferente da observada nas tribos brasileiras. Essa heterogeneidade poderia ser parcialmente explicada pela mistura com populações não-indígenas.

  14. Total alpha-globin gene cluster deletion has high frequency in Filipinos

    Energy Technology Data Exchange (ETDEWEB)

    Hunt, J.A.; Haruyama, A.Z.; Chu, B.M. [Kapiolani Medical Center, Honolulu, HI (United States)] [and others

    1994-09-01

    Most {alpha}-thalassemias [Thal] are due to large deletions. In Southeast Asians, the (--{sup SEA}) double {alpha}-globin gene deletion is common, 3 (--{sup Tot}) total {alpha}-globin cluster deletions are known: Filipino (--{sup Fil}), Thai (--{sup Thai}), and Chinese (--{sup Chin}). In a Hawaii Thal project, provisional diagnosis of {alpha}-Thal-1 heterozygotes was based on microcytosis, normal isoelectric focusing, and no iron deficiency. One in 10 unselected Filipinos was an {alpha}-Thal-1 heterozygote, 2/3 of these had a (--{sup Tot}) deletion: a {var_sigma}-cDNA probe consistently showed fainter intensity of the constant 5.5 kb {var_sigma}{sub 2} BamHI band, with no heterzygosity for {var_sigma}-globin region polymorphisms; {alpha}-cDNA or {var_sigma}-cDNA probes showed no BamHI or BglII bands diagnostic of the (--{sup SEA}) deletion; bands for the (-{alpha}) {alpha}-Thal-2 single {alpha}-globin deletions were only seen in Hb H cases. A reliable monoclonal anti-{var_sigma}-peptide antibody test for the (--{sup SEA}) deletion was always negative in (--{sup Tot}) samples. Southern digests with the Lo probe, a gift from D. Higgs of Oxford Univ., confirmed that 49 of 50 (--{sup Tot}) chromosomes in Filipinos were (--{sup Fil}). Of 20 {alpha}-Thal-1 hydrops born to Filipinos, 11 were (--{sup Fil}/--{sup SEA}) compound heterozygotes; 9 were (--{sup SEA}/--{sup SEA}) homozygotes, but none was a (--{sup Fil}/--{sup Fil}).

  15. Cracking the regulatory code of biosynthetic gene clusters as a strategy for natural product discovery.

    Science.gov (United States)

    Rigali, Sébastien; Anderssen, Sinaeda; Naômé, Aymeric; van Wezel, Gilles P

    2018-01-05

    The World Health Organization (WHO) describes antibiotic resistance as "one of the biggest threats to global health, food security, and development today", as the number of multi- and pan-resistant bacteria is rising dangerously. Acquired resistance phenomena also impair antifungals, antivirals, anti-cancer drug therapy, while herbicide resistance in weeds threatens the crop industry. On the positive side, it is likely that the chemical space of natural products goes far beyond what has currently been discovered. This idea is fueled by genome sequencing of microorganisms which unveiled numerous so-called cryptic biosynthetic gene clusters (BGCs), many of which are transcriptionally silent under laboratory culture conditions, and by the fact that most bacteria cannot yet be cultivated in the laboratory. However, brute force antibiotic discovery does not yield the same results as it did in the past, and researchers have had to develop creative strategies in order to unravel the hidden potential of microorganisms such as Streptomyces and other antibiotic-producing microorganisms. Identifying the cis elements and their corresponding transcription factors(s) involved in the control of BGCs through bioinformatic approaches is a promising strategy. Theoretically, we are a few 'clicks' away from unveiling the culturing conditions or genetic changes needed to activate the production of cryptic metabolites or increase the production yield of known compounds to make them economically viable. In this opinion article, we describe and illustrate the idea beyond 'cracking' the regulatory code for natural product discovery, by presenting a series of proofs of concept, and discuss what still should be achieved to increase the rate of success of this strategy. Copyright © 2018 Elsevier Inc. All rights reserved.

  16. Serum paraoxonase activity is associated with variants in the PON gene cluster and risk of Alzheimer disease.

    Science.gov (United States)

    Erlich, Porat M; Lunetta, Kathryn L; Cupples, L Adrienne; Abraham, Carmela R; Green, Robert C; Baldwin, Clinton T; Farrer, Lindsay A

    2012-05-01

    Previous studies have shown association of single nucleotide polymorphisms (SNPs) in 3 contiguous genes (PON1, PON2, and PON3) encoding paraoxonase with risk of Alzheimer disease (AD). We evaluated the association of serum paraoxonase activity measured by phenyl acetate (PA) and thiobutyl butyrolactone (TBBL) with risk of AD and with 26 SNPs spanning the PON gene cluster in 266 AD cases and 306 sibling controls from the MIRAGE study. The odds of AD (adjusted for age, gender, and ethnicity) increased 20% for each standard deviation decrease in PA or TBBL activity. There were association signals with activity in all 3 genes. Haplotypes including SNPs spanning the PON genes were generally more significant than haplotypes comprising SNPs from 1 gene. Significant interactions were observed between SNP pairs located across the PON cluster with either serum activity measure as the outcome, and between several PON SNPs and PA activity with AD status as the outcome. Our results suggest that low serum paraoxonase activity is a risk factor for AD. Furthermore, multiple variants in PON influence serum paraoxonase activity and their effects may be synergistic. Copyright © 2012 Elsevier Inc. All rights reserved.

  17. The nitrate-reduction gene cluster components exert lineage-dependent contributions to optimization of Sinorhizobium symbiosis with soybeans.

    Science.gov (United States)

    Liu, Li Xue; Li, Qin Qin; Zhang, Yun Zeng; Hu, Yue; Jiao, Jian; Guo, Hui Juan; Zhang, Xing Xing; Zhang, Biliang; Chen, Wen Xin; Tian, Chang Fu

    2017-12-01

    Receiving nodulation and nitrogen fixation genes does not guarantee rhizobia an effective symbiosis with legumes. Here, variations in gene content were determined for three Sinorhizobium species showing contrasting symbiotic efficiency on soybeans. A nitrate-reduction gene cluster absent in S. sojae was found to be essential for symbiotic adaptations of S. fredii and S. sp. III. In S. fredii, the deletion mutation of the nap (nitrate reductase), instead of nir (nitrite reductase) and nor (nitric oxide reductase), led to defects in nitrogen-fixation (Fix - ). By contrast, none of these core nitrate-reduction genes were required for the symbiosis of S. sp. III. However, within the same gene cluster, the deletion of hemN1 (encoding oxygen-independent coproporphyrinogen III oxidase) in both S. fredii and S. sp. III led to the formation of nitrogen-fixing (Fix + ) but ineffective (Eff - ) nodules. These Fix + /Eff - nodules were characterized by significantly lower enzyme activity of glutamine synthetase indicating rhizobial modulation of nitrogen-assimilation by plants. A distant homologue of HemN1 from S. sojae can complement this defect in S. fredii and S. sp. III, but exhibited a more pleotropic role in symbiosis establishment. These findings highlighted the lineage-dependent optimization of symbiotic functions in different rhizobial species associated with the same host. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.

  18. Structure of the neutral capsular polysaccharide of Acinetobacter baumannii NIPH146 that carries the KL37 capsule gene cluster.

    Science.gov (United States)

    Arbatsky, Nikolay P; Shneider, Mikhail M; Kenyon, Johanna J; Shashkov, Alexander S; Popova, Anastasiya V; Miroshnikov, Konstantin A; Volozhantsev, Nikolay V; Knirel, Yuriy A

    2015-09-02

    Capsular polysaccharide (CPS) was isolated from Acinetobacter baumannii NIPH146, and the following structure of branched pentasaccharide repeating unit was established by sugar analyses along with 1D and 2D NMR spectroscopy: In comparison to most other known capsular polysaccharides of A. baumannii, the CPS studied is neutral and lacks any specific monosaccharide component. The synthesis, assembly and export of this structure could be attributed to genes in a novel capsule biosynthesis gene cluster, designated KL37, which was found in the NIPH146 genome. The CPS of A. baumannii NIPH146 shares the α-d-Galp-(1→6)-β-d-Glcp-(1→3)-d-GalpNAc-(1→ trisaccharide fragment with the CPS units of several A. baumannii strains, including ATCC 17978 and LUH 5537 that carry the KL3 and KL22 gene clusters, respectively. KL37 contains two genes for glycosyltransferases that are related to two glycosyltransferase genes present in both KL3 and KL22, and the encoded proteins could be tentatively assigned to linkages between sugars in the CPS repeat. Copyright © 2015 Elsevier Ltd. All rights reserved.

  19. Transcriptional analysis of the jamaicamide gene cluster from the marine cyanobacterium Lyngbya majuscula and identification of possible regulatory proteins

    Directory of Open Access Journals (Sweden)

    Dorrestein Pieter C

    2009-12-01

    Full Text Available Abstract Background The marine cyanobacterium Lyngbya majuscula is a prolific producer of bioactive secondary metabolites. Although biosynthetic gene clusters encoding several of these compounds have been identified, little is known about how these clusters of genes are transcribed or regulated, and techniques targeting genetic manipulation in Lyngbya strains have not yet been developed. We conducted transcriptional analyses of the jamaicamide gene cluster from a Jamaican strain of Lyngbya majuscula, and isolated proteins that could be involved in jamaicamide regulation. Results An unusually long untranslated leader region of approximately 840 bp is located between the jamaicamide transcription start site (TSS and gene cluster start codon. All of the intergenic regions between the pathway ORFs were transcribed into RNA in RT-PCR experiments; however, a promoter prediction program indicated the possible presence of promoters in multiple intergenic regions. Because the functionality of these promoters could not be verified in vivo, we used a reporter gene assay in E. coli to show that several of these intergenic regions, as well as the primary promoter preceding the TSS, are capable of driving β-galactosidase production. A protein pulldown assay was also used to isolate proteins that may regulate the jamaicamide pathway. Pulldown experiments using the intergenic region upstream of jamA as a DNA probe isolated two proteins that were identified by LC-MS/MS. By BLAST analysis, one of these had close sequence identity to a regulatory protein in another cyanobacterial species. Protein comparisons suggest a possible correlation between secondary metabolism regulation and light dependent complementary chromatic adaptation. Electromobility shift assays were used to evaluate binding of the recombinant proteins to the jamaicamide promoter region. Conclusion Insights into natural product regulation in cyanobacteria are of significant value to drug discovery

  20. The effect of alcohol on the differential expression of cluster of differentiation 14 gene, associated pathways, and genetic network.

    Science.gov (United States)

    Zhou, Diana X; Zhao, Yinghong; Baker, Jessica A; Gu, Qingqing; Hamre, Kristin M; Yue, Junming; Jones, Byron C; Cook, Melloni N; Lu, Lu

    2017-01-01

    Alcohol consumption affects human health in part by compromising the immune system. In this study, we examined the expression of the Cd14 (cluster of differentiation 14) gene, which is involved in the immune system through a proinflammatory cascade. Expression was evaluated in BXD mice treated with saline or acute 1.8 g/kg i.p. ethanol (12.5% v/v). Hippocampal gene expression data were generated to examine differential expression and to perform systems genetics analyses. The Cd14 gene expression showed significant changes among the BXD strains after ethanol treatment, and eQTL mapping revealed that Cd14 is a cis-regulated gene. We also identified eighteen ethanol-related phenotypes correlated with Cd14 expression related to either ethanol responses or ethanol consumption. Pathway analysis was performed to identify possible biological pathways involved in the response to ethanol and Cd14. We also constructed a genetic network for Cd14 using the top 20 correlated genes and present several genes possibly involved in Cd14 and ethanol responses based on differential gene expression. In conclusion, we found Cd14, along with several other genes and pathways, to be involved in ethanol responses in the hippocampus, such as increased susceptibility to lipopolysaccharides and neuroinflammation.

  1. The effect of alcohol on the differential expression of cluster of differentiation 14 gene, associated pathways, and genetic network.

    Directory of Open Access Journals (Sweden)

    Diana X Zhou

    Full Text Available Alcohol consumption affects human health in part by compromising the immune system. In this study, we examined the expression of the Cd14 (cluster of differentiation 14 gene, which is involved in the immune system through a proinflammatory cascade. Expression was evaluated in BXD mice treated with saline or acute 1.8 g/kg i.p. ethanol (12.5% v/v. Hippocampal gene expression data were generated to examine differential expression and to perform systems genetics analyses. The Cd14 gene expression showed significant changes among the BXD strains after ethanol treatment, and eQTL mapping revealed that Cd14 is a cis-regulated gene. We also identified eighteen ethanol-related phenotypes correlated with Cd14 expression related to either ethanol responses or ethanol consumption. Pathway analysis was performed to identify possible biological pathways involved in the response to ethanol and Cd14. We also constructed a genetic network for Cd14 using the top 20 correlated genes and present several genes possibly involved in Cd14 and ethanol responses based on differential gene expression. In conclusion, we found Cd14, along with several other genes and pathways, to be involved in ethanol responses in the hippocampus, such as increased susceptibility to lipopolysaccharides and neuroinflammation.

  2. Cloning, reassembling and integration of the entire nikkomycin biosynthetic gene cluster into Streptomyces ansochromogenes lead to an improved nikkomycin production

    Directory of Open Access Journals (Sweden)

    Yang Haihua

    2010-01-01

    Full Text Available Abstract Background Nikkomycins are a group of peptidyl nucleoside antibiotics produced by Streptomyces ansochromogenes. They are competitive inhibitors of chitin synthase and show potent fungicidal, insecticidal, and acaricidal activities. Nikkomycin X and Z are the main components produced by S. ansochromogenes. Generation of a high-producing strain is crucial to scale up nikkomycins production for further clinical trials. Results To increase the yields of nikkomycins, an additional copy of nikkomycin biosynthetic gene cluster (35 kb was introduced into nikkomycin producing strain, S. ansochromogenes 7100. The gene cluster was first reassembled into an integrative plasmid by Red/ET technology combining with classic cloning methods and then the resulting plasmid(pNIKwas introduced into S. ansochromogenes by conjugal transfer. Introduction of pNIK led to enhanced production of nikkomycins (880 mg L-1, 4 -fold nikkomycin X and 210 mg L-1, 1.8-fold nikkomycin Z in the resulting exconjugants comparing with the parent strain (220 mg L-1 nikkomycin X and 120 mg L-1 nikkomycin Z. The exconjugants are genetically stable in the absence of antibiotic resistance selection pressure. Conclusion A high nikkomycins producing strain (1100 mg L-1 nikkomycins was obtained by introduction of an extra nikkomycin biosynthetic gene cluster into the genome of S. ansochromogenes. The strategies presented here could be applicable to other bacteria to improve the yields of secondary metabolites.

  3. Non-host Plant Resistance against Phytophthora capsici Is Mediated in Part by Members of the I2 R Gene Family in Nicotiana spp.

    Science.gov (United States)

    Vega-Arreguín, Julio C; Shimada-Beltrán, Harumi; Sevillano-Serrano, Jacobo; Moffett, Peter

    2017-01-01

    The identification of host genes associated with resistance to Phytophthora capsici is crucial to developing strategies of control against this oomycete pathogen. Since there are few sources of resistance to P. capsici in crop plants, non-host plants represent a promising source of resistance genes as well as excellent models to study P. capsici - plant interactions. We have previously shown that non-host resistance to P. capsici in Nicotiana spp. is mediated by the recognition of a specific P. capsici effector protein, PcAvr3a1 in a manner that suggests the involvement of a cognate disease resistance (R) genes. Here, we have used virus-induced gene silencing (VIGS) and transgenic tobacco plants expressing dsRNA in Nicotiana spp. to identify candidate R genes that mediate non-host resistance to P. capsici . Silencing of members of the I2 multigene family in the partially resistant plant N. edwardsonii and in the resistant N. tabacum resulted in compromised resistance to P. capsici . VIGS of two other components required for R gene-mediated resistance, EDS1 and SGT1 , also enhanced susceptibility to P. capsici in N. edwardsonii , as well as in the susceptible plants N. benthamiana and N. clevelandii . The silencing of I2 family members in N. tabacum also compromised the recognition of PcAvr3a1. These results indicate that in this case, non-host resistance is mediated by the same components normally associated with race-specific resistance.

  4. Analysis of odorant-binding protein gene family members in the polyembryonic wasp, Copidosoma floridanum: evidence for caste bias and host interaction.

    Science.gov (United States)

    Donnell, David M

    2014-01-01

    The polyembryonic wasp Copidosoma floridanum produces two larval castes, soldiers and reproductives, during development within its caterpillar host. Primary structures were determined for 6 odorant-binding protein (OBP) gene family members in Copidosoma and then analyzed alongside two formerly sequenced OBP genes from this wasp. The genes were examined for caste-bias in expression patterns using reverse transcription-polymerase chain reaction (RT-PCR) and in situ expression studies. Six of the 8 genes show a clear bias in gene expression towards one or the other larval caste. Of the 3 distinct in situ probe hybridization patterns observed in this study, none lie in tissues with clear chemosensory functions. Two of the patterns suggest the majority of the Copidosoma OBP gene family members discovered thus far come into contact with host hemolymph. Most of these OBPs are expressed exclusively in the serosal membrane encompassing each of the reproductive larvae. The absence of expression in the membrane surrounding soldier larvae strongly suggests these OBPs are performing caste-specific functions in the host. Copyright © 2013 Elsevier Ltd. All rights reserved.

  5. ZmCPK1, a calcium-independent kinase member of the Zea mays CDPK gene family, functions as a negative regulator in cold stress signalling.

    Science.gov (United States)

    Weckwerth, Philipp; Ehlert, Britta; Romeis, Tina

    2015-03-01

    Calcium-dependent protein kinases (CDPKs) have been shown to play important roles in plant environmental stress signal transduction. We report on the identification of ZmCPK1 as a member of the maize (Zea mays) CDPK gene family involved in the regulation of the maize cold stress response. Based upon in silico analysis of the Z. mays cv. B73 genome, we identified that the maize CDPK gene family consists of 39 members. Two CDPK members were selected whose gene expression was either increased (Zmcpk1) or decreased (Zmcpk25) in response to cold exposure. Biochemical analysis demonstrated that ZmCPK1 displays calcium-independent protein kinase activity. The C-terminal calcium-binding domain of ZmCPK1 was sufficient to mediate calcium independency of a previously calcium-dependent enzyme in chimeric ZmCPK25-CPK1 proteins. Furthermore, co-transfection of maize mesophyll protoplasts with active full-length ZmCPK1 suppressed the expression of a cold-induced marker gene, Zmerf3 (ZmCOI6.21). In accordance, heterologous overexpression of ZmCPK1 in Arabidopsis thaliana yielded plants with altered acclimation-induced frost tolerance. Our results identify ZmCPK1 as a negative regulator of cold stress signalling in maize. © 2014 John Wiley & Sons Ltd.

  6. Systematic analysis of sequences and expression patterns of drought-responsive members of the HD-Zip gene family in maize.

    Directory of Open Access Journals (Sweden)

    Yang Zhao

    Full Text Available BACKGROUND: Members of the homeodomain-leucine zipper (HD-Zip gene family encode transcription factors that are unique to plants and have diverse functions in plant growth and development such as various stress responses, organ formation and vascular development. Although systematic characterization of this family has been carried out in Arabidopsis and rice, little is known about HD-Zip genes in maize (Zea mays L.. METHODS AND FINDINGS: In this study, we described the identification and structural characterization of HD-Zip genes in the maize genome. A complete set of 55 HD-Zip genes (Zmhdz1-55 were identified in the maize genome using Blast search tools and categorized into four classes (HD-Zip I-IV based on phylogeny. Chromosomal location of these genes revealed that they are distributed unevenly across all 10 chromosomes. Segmental duplication contributed largely to the expansion of the maize HD-ZIP gene family, while tandem duplication was only responsible for the amplification of the HD-Zip II genes. Furthermore, most of the maize HD-Zip I genes were found to contain an overabundance of stress-related cis-elements in their promoter sequences. The expression levels of the 17 HD-Zip I genes under drought stress were also investigated by quantitative real-time PCR (qRT-PCR. All of the 17 maize HD-ZIP I genes were found to be regulated by drought stress, and the duplicated genes within a sister pair exhibited the similar expression patterns, suggesting their conserved functions during the process of evolution. CONCLUSIONS: Our results reveal a comprehensive overview of the maize HD-Zip gene family and provide the first step towards the selection of Zmhdz genes for cloning and functional research to uncover their roles in maize growth and development.

  7. [Sequence analysis of 16S rDNA and pmoCAB gene cluster of trichloroethylene-degrading methanotroph].

    Science.gov (United States)

    Zhang, Yunru; Chen, Huaqing; Gao, Yanhui; Xing, Zhilin; Zhao, Tiantao

    2014-12-01

    Methanotrophs could degrade methane and various chlorinated hydrocarbons. The analysis on methane monooxygenase gene cluster sequence would help to understand its catalytic mechanism and enhance the application in pollutants biodegradation. The methanotrophs was enriched and isolated with methane as the sole carbon source in the nitrate mineral salt medium. Then, five chlorinated hydrocarbons were selected as cometabolic substrates to study the biodegradation. The phylogenetic tree of 16S rDNA using MEGE5.05 software was constructed to identify the methanotroph strain. The pmoCAB gene cluster encoding particulate methane monooxygenase (pMMO) was amplified by semi-nested PCR in segments. ExPASy was performed to analyze theoretical molecular weight of the three pMMO subunits. As a result, a strain of methanotroph was isolated. The phylogenetic analysis indicated that the strain belongs to a species of Methylocystis, and it was named as Methylocystis sp. JTC3. The degradation rate of trichloroethylene (TCE) reached 93.79% when its initial concentration was 15.64 μmol/L after 5 days. We obtained the pmoCAB gene cluster of 3 227 bp including pmoC gene of 771 bp, pmoA gene of 759 bp, pmoB gene of 1 260 bp and two noncoding sequences in the middle by semi-nested PCR, T-A cloning and sequencing. The theoretical molecular weight of their corresponding gamma, beta and alpha subunit were 29.1 kDa, 28.6 kDa and 45.6 kDa respectively analyzed using ExPASy tool. The pmoCAB gene cluster of JTC3 was highly identical with that of Methylocystis sp. strain M analyzed by Blast, and pmoA sequences is more conservative than pmoC and pmoB. Finally, Methylocystis sp. JTC3 could degrade TCE efficiently. And the detailed analysis of pmoCAB from Methylocystis sp. JTC3 laid a solid foundation to further study its active sites features and its selectivity to chlorinated hydrocarbon.

  8. Genome-Wide Analysis of Secondary Metabolite Gene Clusters in Ophiostoma ulmi and Ophiostoma novo-ulmi Reveals a Fujikurin-Like Gene Cluster with a Putative Role in Infection

    Directory of Open Access Journals (Sweden)

    Nicolau Sbaraini

    2017-06-01

    Full Text Available The emergence of new microbial pathogens can result in destructive outbreaks, since their hosts have limited resistance and pathogens may be excessively aggressive. Described as the major ecological incident of the twentieth century, Dutch elm disease, caused by ascomycete fungi from the Ophiostoma genus, has caused a significant decline in elm tree populations (Ulmus sp. in North America and Europe. Genome sequencing of the two main causative agents of Dutch elm disease (Ophiostoma ulmi and Ophiostoma novo-ulmi, along with closely related species with different lifestyles, allows for unique comparisons to be made to identify how pathogens and virulence determinants have emerged. Among several established virulence determinants, secondary metabolites (SMs have been suggested to play significant roles during phytopathogen infection. Interestingly, the secondary metabolism of Dutch elm pathogens remains almost unexplored, and little is known about how SM biosynthetic genes are organized in these species. To better understand the metabolic potential of O. ulmi and O. novo-ulmi, we performed a deep survey and description of SM biosynthetic gene clusters (BGCs in these species and assessed their conservation among eight species from the Ophiostomataceae family. Among 19 identified BGCs, a fujikurin-like gene cluster (OpPKS8 was unique to Dutch elm pathogens. Phylogenetic analysis revealed that orthologs for this gene cluster are widespread among phytopathogens and plant-associated fungi, suggesting that OpPKS8 may have been horizontally acquired by the Ophiostoma genus. Moreover, the detailed identification of several BGCs paves the way for future in-depth research and supports the potential impact of secondary metabolism on Ophiostoma genus’ lifestyle.

  9. Taxonomic revision of the Malagasy members of the Nesomyrmex angulatus species group using the automated morphological species delineation protocol NC-PART-clustering

    Directory of Open Access Journals (Sweden)

    Sándor Csősz

    2016-03-01

    Full Text Available Background. Applying quantitative morphological approaches in systematics research is a promising way to discover cryptic biological diversity. Information obtained through twenty-first century science poses new challenges to taxonomy by offering the possibility of increased objectivity in independent and automated hypothesis formation. In recent years a number of promising new algorithmic approaches have been developed to recognize morphological diversity among insects based on multivariate morphometric analyses. These algorithms objectively delimit components in the data by automatically assigning objects into clusters. Method. In this paper, hypotheses on the diversity of the Malagasy Nesomyrmex angulatus group are formulated via a highly automated protocol involving a fusion of two algorithms, (1 Nest Centroid clustering (NC clustering and (2 Partitioning Algorithm based on Recursive Thresholding (PART. Both algorithms assign samples into clusters, making the class assignment results of different algorithms readily inferable. The results were tested by confirmatory cross-validated Linear Discriminant Analysis (LOOCV-LDA. Results. Here we reveal the diversity of a unique and largely unexplored fragment of the Malagasy ant fauna using NC-PART-clustering on continuous morphological data, an approach that brings increased objectivity to taxonomy. We describe eight morphologically distinct species, including seven new species: Nesomyrmex angulatus (Mayr, 1862, N. bidentatus sp. n., N. clypeatus sp. n., N. devius sp. n., N. exiguus sp. n., N. fragilis sp. n., N. gracilis sp. n., and N. hirtellus sp. n.. An identification key for their worker castes using morphometric data is provided. Conclusions. Combining the dimensionality reduction feature of NC clustering with the assignment of samples into clusters by PART advances the automatization of morphometry-based alpha taxonomy.

  10. Identification and functional clustering of genes regulating muscle protein degradation from amongst the known C. elegans muscle mutants.

    Directory of Open Access Journals (Sweden)

    Freya Shephard

    Full Text Available Loss of muscle mass via protein degradation is an important clinical problem but we know little of how muscle protein degradation is regulated genetically. To gain insight our labs developed C. elegans into a model for understanding the regulation of muscle protein degradation. Past studies uncovered novel functional roles for genes affecting muscle and/or involved in signalling in other cells or tissues. Here we examine most of the genes previously identified as the sites of mutations affecting muscle for novel roles in regulating degradation. We evaluate genomic (RNAi knockdown approaches and combine them with our established genetic (mutant and pharmacologic (drugs approaches to examine these 159 genes. We find that RNAi usually recapitulates both organismal and sub-cellular mutant phenotypes but RNAi, unlike mutants, can frequently be used acutely to study gene function solely in differentiated muscle. In the majority of cases where RNAi does not produce organismal level phenotypes, sub-cellular defects can be detected; disrupted proteostasis is most commonly observed. We identify 48 genes in which mutation or RNAi knockdown causes excessive protein degradation; myofibrillar and/or mitochondrial morphologies are also disrupted in 19 of these 48 cases. These 48 genes appear to act via at least three sub-networks to control bulk degradation of protein in muscle cytosol. Attachment to the extracellular matrix regulates degradation via unidentified proteases and affects myofibrillar and mitochondrial morphology. Growth factor imbalance and calcium overload promote lysosome based degradation whereas calcium deficit promotes proteasome based degradation, in both cases myofibrillar and mitochondrial morphologies are largely unaffected. Our results provide a framework for effectively using RNAi to identify and functionally cluster novel regulators of degradation. This clustering allows prioritization of candidate genes/pathways for future

  11. Genome-wide identification, phylogenetic analysis, expression profiling, and protein–protein interaction properties of TOPLESS gene family members in tomato

    Science.gov (United States)

    Hao, Yanwei; Wang, Xinyu; van der Rest, Benoit; Zouine, Mohamed

    2014-01-01

    Members of the TOPLESS gene family emerged recently as key players in gene repression in several mechanisms, especially in auxin perception. The TOPLESS genes constitute, in ‘higher-plant’ genomes, a small multigenic family comprising four to 11 members. In this study, this family was investigated in tomato, a model plant for Solanaceae species and fleshy fruits. Six open reading frames predicted to encode topless-like proteins (SlTPLs) containing the canonical domains (LisH, CTLH, and two WD40 repeats) were identified in the tomato genome. Nuclear localization was confirmed for all members of the SlTPL family with the exception SlTPL6, which localized at the cytoplasm and was excluded from the nucleus. SlTPL genes displayed distinctive expression patterns in different tomato organs, with SlTPL1 showing the highest levels of transcript accumulation in all tissues tested except in ripening fruit where SlTPL3 and SlTPL4 were the most prominently expressed. To gain insight into the specificity of the different TOPLESS paralogues, a protein–protein interaction map between TOPLESS and auxin/indole-3-acetic acid (Aux/IAA) proteins was built using a yeast two-hybrid approach. The PPI map enabled the distinction of two patterns: TOPLESS isoforms interacting with the majority of Aux/IAA, and isoforms with limited capacity for interaction with these protein partners. Interestingly, evolutionary analyses of the TOPLESS gene family revealed that the highly expressed isoforms (SlTPL1, SlTPL3, and SlTPL4) corresponded to the three TPL-related genes undergoing the strongest purifying selection, while the selection was much weaker for SlTPL6, which was expressed at a low level and encoded a protein lacking the capacity to interact with Aux/IAAs. PMID:24399174

  12. Organization and subcloning of the dacA-rodA-pbpA cluster of cell shape genes in Escherichia coli.

    Science.gov (United States)

    Stoker, N G; Broome-Smith, J K; Edelman, A; Spratt, B G

    1983-01-01

    The transducing bacteriophage lambda pBS10 carries a small cluster of Escherichia coli penicillin-binding protein/cell shape genes, including pbpA, rodA, and dacA. Deletion mapping and subcloning showed that these genes, and the gene for a cytoplasmic membrane protein of molecular weight 54,000, are located within a 5.6-kilobase region and are probably contiguous. The dacA gene, which codes for penicillin-binding protein 5, was cloned on a 1.5-kilobase fragment into a low-copy-number plasmid vector, but insertion into high-copy-number plasmids produced deleterious effects on bacterial growth, and the plasmids could not be stably maintained. The direction of transcription of dacA was determined. The rodA gene was cloned on a 1.6-kilobase fragment into both low- and high-copy-number plasmids, and the identification of its gene product is described in the accompanying paper (Stoker et al., J. Bacteriol. 155:854-859). The pbpA gene, which codes for penicillin-binding protein 2, was cloned on a 3.7-kilobase fragment in low-copy-number plasmids, but insertion of the fragment into high-copy-number plasmids resulted in deleterious effects on bacterial growth, and the plasmids could not be stably maintained. Images PMID:6348028

  13. Rapid transcriptional plasticity of duplicated gene clusters enables a clonally reproducing aphid to colonise diverse plant species.

    Science.gov (United States)

    Mathers, Thomas C; Chen, Yazhou; Kaithakottil, Gemy; Legeai, Fabrice; Mugford, Sam T; Baa-Puyoulet, Patrice; Bretaudeau, Anthony; Clavijo, Bernardo; Colella, Stefano; Collin, Olivier; Dalmay, Tamas; Derrien, Thomas; Feng, Honglin; Gabaldón, Toni; Jordan, Anna; Julca, Irene; Kettles, Graeme J; Kowitwanich, Krissana; Lavenier, Dominique; Lenzi, Paolo; Lopez-Gomollon, Sara; Loska, Damian; Mapleson, Daniel; Maumus, Florian; Moxon, Simon; Price, Daniel R G; Sugio, Akiko; van Munster, Manuella; Uzest, Marilyne; Waite, Darren; Jander, Georg; Tagu, Denis; Wilson, Alex C C; van Oosterhout, Cock; Swarbreck, David; Hogenhout, Saskia A

    2017-02-13

    The prevailing paradigm of host-parasite evolution is that arms races lead to increasing specialisation via genetic adaptation. Insect herbivores are no exception and the majority have evolved to colonise a small number of closely related host species. Remarkably, the green peach aphid, Myzus persicae, colonises plant species across 40 families and single M. persicae clonal lineages can colonise distantly related plants. This remarkable ability makes M. persicae a highly destructive pest of many important crop species. To investigate the exceptional phenotypic plasticity of M. persicae, we sequenced the M. persicae genome and assessed how one clonal lineage responds to host plant species of different families. We show that genetically identical individuals are able to colonise distantly related host species through the differential regulation of genes belonging to aphid-expanded gene families. Multigene clusters collectively upregulate in single aphids within two days upon host switch. Furthermore, we demonstrate the functional significance of this rapid transcriptional change using RNA interference (RNAi)-mediated knock-down of genes belonging to the cathepsin B gene family. Knock-down of cathepsin B genes reduced aphid fitness, but only on the host that induced upregulation of these genes. Previous research has focused on the role of genetic adaptation of parasites to their hosts. Here we show that the generalist aphid pest M. persicae is able to colonise diverse host plant species in the absence of genetic specialisation. This is achieved through rapid transcriptional plasticity of genes that have duplicated during aphid evolution.

  14. Draft genome sequence of Streptomyces coelicoflavus ZG0656 reveals the putative biosynthetic gene cluster of acarviostatin family α-amylase inhibitors.

    Science.gov (United States)

    Guo, X; Geng, P; Bai, F; Bai, G; Sun, T; Li, X; Shi, L; Zhong, Q

    2012-08-01

    The aims of this study are to obtain the draft genome sequence of Streptomyces coelicoflavus ZG0656, which produces novel acarviostatin family α-amylase inhibitors, and then to reveal the putative acarviostatin-related gene cluster and the biosynthetic pathway. The draft genome sequence of S. coelicoflavus ZG0656 was generated using a shotgun approach employing a combination of 454 and Solexa sequencing technologies. Genome analysis revealed a putative gene cluster for acarviostatin biosynthesis, termed sct-cluster. The cluster contains 13 acarviostatin synthetic genes, six transporter genes, four starch degrading or transglycosylation enzyme genes and two regulator genes. On the basis of bioinformatic analysis, we proposed a putative biosynthetic pathway of acarviostatins. The intracellular steps produce a structural core, acarviostatin I00-7-P, and the extracellular assemblies lead to diverse acarviostatin end products. The draft genome sequence of S. coelicoflavus ZG0656 revealed the putative biosynthetic gene cluster of acarviostatins and a putative pathway of acarviostatin production. To our knowledge, S. coelicoflavus ZG0656 is the first strain in this species for which a genome sequence has been reported. The analysis of sct-cluster provided important insights into the biosynthesis of acarviostatins. This work will be a platform for producing novel variants and yield improvement. © 2012 The Authors. Letters in Applied Microbiology © 2012 The Society for Applied Microbiology.

  15. Two gene clusters co-ordinate for a functional N-acetylglucosamine catabolic pathway in Vibrio cholerae.

    Science.gov (United States)

    Ghosh, Swagata; Rao, K Hanumantha; Sengupta, Manjistha; Bhattacharya, Sujit K; Datta, Asis

    2011-06-01

    Pathogenic microorganisms like Vibrio cholerae are capable of adapting to diverse living conditions, especially when they transit from their environmental reservoirs to human host. V. cholerae attaches to N-acetylglucosamine (GlcNAc) residues in glycoproteins and lipids present in the intestinal epithelium and chitinous surface of zoo-phytoplanktons in the aquatic environment for its survival and colonization. GlcNAc utilization thus appears to be important for the pathogen to reach sufficient titres in the intestine for producing clinical symptoms of cholera. We report here the involvement of a second cluster of genes working in combination with the classical genes of GlcNAc catabolism, suggesting the occurrence of a novel variant of the process of biochemical conversion of GlcNAc to Fructose-6-phosphate as has been described in other organisms. Colonization was severely attenuated in mutants that were incapable of utilizing GlcNAc. It was also shown that N-acetylglucosamine specific repressor (NagC) performs a dual role - while the classical GlcNAc catabolic genes are under its negative control, the genes belonging to the second cluster are positively regulated by it. Further application of tandem affinity purification to NagC revealed its interaction with a novel partner. Our results provide a genetic program that probably enables V. cholerae to successfully utilize amino - sugars and also highlights a new mode of transcriptional regulation, not described in this organism. © 2011 Blackwell Publishing Ltd.

  16. Functional genomic analysis supports conservation of function among cellulose synthase-like a gene family members and suggests diverse roles of mannans in plants

    DEFF Research Database (Denmark)

    Liepman, Aaron H; Nairn, C Joseph; Willats, William G T

    2007-01-01

    in insect cells, and each CslA protein catalyzed mannan and glucomannan synthase reactions in vitro. Microarray mining and quantitative real-time reverse transcription-polymerase chain reaction analysis demonstrated that transcripts of Arabidopsis and loblolly pine (Pinus taeda) CslA genes display tissue...... they are prevalent at cell junctions and in buds. Taken together, these results demonstrate that members of the CslA gene family from diverse plant species encode glucomannan synthases and support the hypothesis that mannans function in metabolic networks devoted to other cellular processes in addition to cell wall...

  17. Rice Snl6, a cinnamoyl-CoA reductase-like gene family member, is required for NH1-mediated immunity to Xanthomonas oryzae pv. oryzae.

    Directory of Open Access Journals (Sweden)

    Rebecca S Bart

    2010-09-01

    Full Text Available Rice NH1 (NPR1 homolog 1 is a key mediator of innate immunity. In both plants and animals, the innate immune response is often accompanied by rapid cell death at the site of pathogen infection. Over-expression of NH1 in rice results in resistance to the bacterial pathogen, Xanthomonas oryzae pv. oryzae (Xoo, constitutive expression of defense related genes and enhanced benzothiadiazole (BTH- mediated cell death. Here we describe a forward genetic screen that identified a suppressor of NH1-mediated lesion formation and resistance, snl6. Comparative genome hybridization and fine mapping rapidly identified the genomic location of the Snl6 gene. Snl6 is a member of the cinnamoyl-CoA reductase (CCR-like gene family. We show that Snl6 is required for NH1-mediated resistance to Xoo. Further, we show that Snl6 is required for pathogenesis-related gene expression. In contrast to previously described CCR family members, disruption of Snl6 does not result in an obvious morphologic phenotype. Snl6 mutants have reduced lignin content and increased sugar extractability, an important trait for the production of cellulosic biofuels. These results suggest the existence of a conserved group of CCR-like genes involved in the defense response, and with the potential to alter lignin content without affecting development.

  18. Comparison of the gene encoding, and the predicted amino acid composition of, platelet membrane receptor subunit glycoprotein Ibα in members of the family Felidae.

    Science.gov (United States)

    Boudreaux, Mary K; Christopherson, Pete W; Blair, Cori

    2016-03-01

    There is minimal information regarding platelet receptors in the family Felidae. Comparative studies assist with identifying amino acids critical for protein structure and function. The purpose of the study was to compare the gene encoding, and the predicted amino acid composition of, platelet membrane receptor subunit GPIbα in Felidae family members. Genomic DNA samples isolated from whole blood of 13 domestic cats and 50 big cats representing 8 different species were subjected to PCR using primers designed to flank the coding region of GPIbα in overlapping fashion. PCR products were separated via electrophoresis on agarose gels, and extracted products were submitted for sequencing. DNA sequences were used to predict the length and amino acid composition of the protein. Varying protein lengths were predicted in Felidae family members which were primarily due to polymorphisms in the variable number of tandem repeats region encoding the macroglycopeptide region of GPIbα. Other areas of the gene and predicted amino acid compositions were fairly conserved when compared to human sequences and between Felidae family members. Various polymorphisms within GPIbα, including length variants encoding the macroglycopeptide region, were identified in members of the family Felidae. More studies are needed to determine if a correlation exists between various polymorphisms and predisposition for hemorrhage or thrombosis as suggested in people. © 2016 American Society for Veterinary Clinical Pathology.

  19. Ancestral and derived attributes of the dlx gene repertoire, cluster structure and expression patterns in an African cichlid fish

    Directory of Open Access Journals (Sweden)

    Renz Adina J

    2011-01-01

    Full Text Available Abstract Background Cichlid fishes have undergone rapid, expansive evolutionary radiations that are manifested in the diversification of their trophic morphologies, tooth patterning and coloration. Understanding the molecular mechanisms that underlie the cichlids' unique patterns of evolution requires a thorough examination of genes that pattern the neural crest, from which these diverse phenotypes are derived. Among those genes, the homeobox-containing Dlx gene family is of particular interest since it is involved in the patterning of the brain, jaws and teeth. Results In this study, we characterized the dlx genes of an African cichlid fish, Astatotilapia burtoni, to provide a baseline to later allow cross-species comparison within Cichlidae. We identified seven dlx paralogs (dlx1a, -2a, -4a, -3b, -4b, -5a and -6a, whose orthologies were validated with molecular phylogenetic trees. The intergenic regions of three dlx gene clusters (dlx1a-2a, dlx3b-4b, and dlx5a-6a were amplified with long PCR. Intensive cross-species comparison revealed a number of conserved non-coding elements (CNEs that are shared with other percomorph fishes. This analysis highlighted additional lineage-specific gains/losses of CNEs in different teleost fish lineages and a novel CNE that had previously not been identified. Our gene expression analyses revealed overlapping but distinct expression of dlx orthologs in the developing brain and pharyngeal arches. Notably, four of the seven A. burtoni dlx genes, dlx2a, dlx3b, dlx4a and dlx5a, were expressed in the developing pharyngeal teeth. Conclusion This comparative study of the dlx genes of A. burtoni has deepened our knowledge of the diversity of the Dlx gene family, in terms of gene repertoire, expression patterns and non-coding elements. We have identified possible cichlid lineage-specific changes, including losses of a subset of dlx expression domains in the pharyngeal teeth, which will be the targets of future functional

  20. Molecular characterization of a conserved archaeal copper resistance (cop) gene cluster and its copper-responsive regulator in Sulfolobus solfataricus P2

    NARCIS (Netherlands)

    Ettema, T.J.G.; Brinkman, A.B.; Lamers, P.P.; Kornet, N.; Vos, de W.M.; Oost, van der J.

    2006-01-01

    Using a comparative genomics approach, a copper resistance gene cluster has been identified in multiple archaeal genomes. The cop cluster is predicted to encode a metallochaperone (CopM), a P-type copper-exporting ATPase (CopA) and a novel, archaea-specific transcriptional regulator (CopT) which

  1. Haplotype diversity of VvTFL1A gene and association with cluster traits in grapevine (V. vinifera).

    Science.gov (United States)

    Fernandez, Lucie; Le Cunff, Loïc; Tello, Javier; Lacombe, Thierry; Boursiquot, Jean Michel; Fournier-Level, Alexandre; Bravo, Gema; Lalet, Sandrine; Torregrosa, Laurent; This, Patrice; Martinez-Zapater, José Miguel

    2014-08-05

    Interaction between TERMINAL FLOWER 1 (TFL1) and LEAFY (LFY) seem to determine the inflorescence architecture in Arabidopsis. In a parallel way, overexpression of VvTFL1A, a grapevine TFL1 homolog, causes delayed flowering and production of a ramose cluster in the reiterated reproductive meristem (RRM) somatic variant of cultivar Carignan. To analyze the possible contribution of this gene to cluster phenotypic variation in a diversity panel of cultivated grapevine (Vitis vinifera L. subsp. vinifera) its nucleotide diversity was characterized and association analyses among detected sequence polymorphisms and phenology and cluster traits was carried out. A total of 3.6 kb of the VvTFL1A gene, including its promoter, was sequenced in a core collection of 140 individuals designed to maximize phenotypic variation at agronomical relevant traits. Nucleotide variation for VvTFL1A within this collection was higher in the promoter and intron sequences than in the exon regions; where few polymorphisms were located in agreement with a high conservation of coding sequence. Characterization of the VvTFL1A haplotype network identified three major haplogroups, consistent with the geographic origins and the use of the cultivars that could correspond to three major ancestral alleles or evolutionary branches, based on the existence of mutations in linkage disequilibrium. Genetic association studies with cluster traits revealed the presence of major INDEL polymorphisms, explaining 16%, 13% and 25% of flowering time, cluster width and berry weight, respectively, and also structuring the three haplogroups. At least three major VvTFL1A haplogroups are present in cultivated grapevines, which are defined by the presence of three main polymorphism LD blocks and associated to characteristic phenotypic values for flowering time, cluster width and berry size. Phenotypic differences between haplogroups are consistent with differences observed between Eastern and Western grapevine cultivars and

  2. Diverse and Abundant Secondary Metabolism Biosynthetic Gene Clusters in the Genomes of Marine Sponge Derived Streptomyces spp. Isolates

    Directory of Open Access Journals (Sweden)

    Stephen A. Jackson

    2018-02-01

    Full Text Available The genus Streptomyces produces secondary metabolic compounds that are rich in biological activity. Many of these compounds are genetically encoded by large secondary metabolism biosynthetic gene clusters (smBGCs such as polyketide synthases (PKS and non-ribosomal peptide synthetases (NRPS which are modular and can be highly repetitive. Due to the repeats, these gene clusters can be difficult to resolve using short read next generation datasets and are often quite poorly predicted using standard approaches. We have sequenced the genomes of 13 Streptomyces spp. strains isolated from shallow water and deep-sea sponges that display antimicrobial activities against a number of clinically relevant bacterial and yeast species. Draft genomes have been assembled and smBGCs have been identified using the antiSMASH (antibiotics and Secondary Metabolite Analysis Shell web platform. We have compared the smBGCs amongst strains in the search for novel sequences conferring the potential to produce novel bioactive secondary metabolites. The strains in this study recruit to four distinct clades within the genus Streptomyces. The marine strains host abundant smBGCs which encode polyketides, NRPS, siderophores, bacteriocins and lantipeptides. The deep-sea strains appear to be enriched with gene clusters encoding NRPS. Marine adaptations are evident in the sponge-derived strains which are enriched for genes involved in the biosynthesis and transport of compatible solutes and for heat-shock proteins. Streptomyces spp. from marine environments are a promising source of novel bioactive secondary metabolites as the abundance and diversity of smBGCs show high degrees of novelty. Sponge derived Streptomyces spp. isolates appear to display genomic adaptations to marine living when compared to terrestrial strains.

  3. IMG-ABC: A Knowledge Base To Fuel Discovery of Biosynthetic Gene Clusters and Novel Secondary Metabolites.

    Science.gov (United States)

    Hadjithomas, Michalis; Chen, I-Min Amy; Chu, Ken; Ratner, Anna; Palaniappan, Krishna; Szeto, Ernest; Huang, Jinghua; Reddy, T B K; Cimermančič, Peter; Fischbach, Michael A; Ivanova, Natalia N; Markowitz, Victor M; Kyrpides, Nikos C; Pati, Amrita

    2015-07-14

    In the discovery of secondary metabolites, analysis of sequence data is a promising exploration path that remains largely underutilized due to the lack of computational platforms that enable such a systematic approach on a large scale. In this work, we present IMG-ABC (https://img.jgi.doe.gov/abc), an atlas of biosynthetic gene clusters within the Integrated Microbial Genomes (IMG) system, which is aimed at harnessing the power of "big" genomic data for discovering small molecules. IMG-ABC relies on IMG's comprehensive integrated structural and functional genomic data for the analysis of biosynthetic gene clusters (BCs) and associated secondary metabolites (SMs). SMs and BCs serve as the two main classes of objects in IMG-ABC, each with a rich collection of attributes. A unique feature of IMG-ABC is the incorporation of both experimentally validated and computationally predicted BCs in genomes as well as metagenomes, thus identifying BCs in uncultured populations and rare taxa. We demonstrate the strength of IMG-ABC's focused integrated analysis tools in enabling the exploration of microbial secondary metabolism on a global scale, through the discovery of phenazine-producing clusters for the first time in Alphaproteobacteria. IMG-ABC strives to fill the long-existent void of resources for computational exploration of the secondary metabolism universe; its underlying scalable framework enables traversal of uncovered phylogenetic and chemical structure space, serving as a doorway to a new era in the discovery of novel molecules. IMG-ABC is the largest publicly available database of predicted and experimental biosynthetic gene clusters and the secondary metabolites they produce. The system also includes powerful search and analysis tools that are integrated with IMG's extensive genomic/metagenomic data and analysis tool kits. As new research on biosynthetic gene clusters and secondary metabolites is published and more genomes are sequenced, IMG-ABC will continue to

  4. An epigenetic switch involving overlapping fur and DNA methylation optimizes expression of a type VI secretion gene cluster.

    Directory of Open Access Journals (Sweden)

    Yannick R Brunet

    2011-07-01

    Full Text Available Type VI secretion systems (T6SS are macromolecular machines of the cell envelope of Gram-negative bacteria responsible for bacterial killing and/or virulence towards different host cells. Here, we characterized the regulatory mechanism underlying expression of the enteroagregative Escherichia coli sci1 T6SS gene cluster. We identified Fur as the main regulator of the sci1 cluster. A detailed analysis of the promoter region showed the presence of three GATC motifs, which are target of the DNA adenine methylase Dam. Using a combination of reporter fusion, gel shift, and in vivo and in vitro Dam methylation assays, we dissected the regulatory role of Fur and Dam-dependent methylation. We showed that the sci1 gene cluster expression is under the control of an epigenetic switch depending on methylation: fur binding prevents methylation of a GATC motif, whereas methylation at this specific site decreases the affinity of Fur for its binding box. A model is proposed in which the sci1 promoter is regulated by iron availability, adenine methylation, and DNA replication.

  5. The N?acetylglucosamine catabolic gene cluster in Trichoderma reesei is controlled by the Ndt80?like transcription factor RON1

    OpenAIRE

    Kappel, Lisa; Gaderer, Romana; Flipphi, Michel; Seidl?Seiboth, Verena

    2015-01-01

    Summary Chitin is an important structural constituent of fungal cell walls composed of N ?acetylglucosamine (GlcNAc) monosaccharides, but catabolism of GlcNAc has not been studied in filamentous fungi so far. In the yeast C andida albicans, the genes encoding the three enzymes responsible for stepwise conversion of GlcNAc to fructose?6?phosphate are clustered. In this work, we analysed GlcNAc catabolism in ascomycete filamentous fungi and found that the respective genes are also clustered in ...

  6. High GC Content Cas9-Mediated Genome-Editing and Biosynthetic Gene Cluster Activation in Saccharopolyspora erythraea.

    Science.gov (United States)

    Liu, Yong; Wei, Wen-Ping; Ye, Bang-Ce

    2018-04-17

    The overexpression of bacterial secondary metabolite biosynthetic enzymes is the basis for industrial overproducing strains. Genome editing tools can be used to further improve gene expression and yield. Saccharopolyspora erythraea produces erythromycin, which has extensive clinical applications. In this study, the CRISPR-Cas9 system was used to edit genes in the S. erythraea genome. A temperature-sensitive plasmid containing the PermE promoter, to drive Cas9 expression, and the Pj23119 and PkasO promoters, to drive sgRNAs, was designed. Erythromycin esterase, encoded by S. erythraea SACE_1765, inactivates erythromycin by hydrolyzing the macrolactone ring. Sequencing and qRT-PCR confirmed that reporter genes were successfully inserted into the SACE_1765 gene. Deletion of SACE_1765 in a high-producing strain resulted in a 12.7% increase in erythromycin levels. Subsequent PermE- egfp knock-in at the SACE_0712 locus resulted in an 80.3% increase in erythromycin production compared with that of wild type. Further investigation showed that PermE promoter knock-in activated the erythromycin biosynthetic gene clusters at the SACE_0712 locus. Additionally, deletion of indA (SACE_1229) using dual sgRNA targeting without markers increased the editing efficiency to 65%. In summary, we have successfully applied Cas9-based genome editing to a bacterial strain, S. erythraea, with a high GC content. This system has potential application for both genome-editing and biosynthetic gene cluster activation in Actinobacteria.

  7. Cluster editing

    DEFF Research Database (Denmark)

    Böcker, S.; Baumbach, Jan

    2013-01-01

    . The problem has been the inspiration for numerous algorithms in bioinformatics, aiming at clustering entities such as genes, proteins, phenotypes, or patients. In this paper, we review exact and heuristic methods that have been proposed for the Cluster Editing problem, and also applications...

  8. NIFTI: An evolutionary approach for finding number of clusters in microarray data

    Directory of Open Access Journals (Sweden)

    Srinivasan Rajagopalan

    2009-01-01

    Full Text Available Abstract Background Clustering techniques are routinely used in gene expression data analysis to organize the massive data. Clustering techniques arrange a large number of genes or assays into a few clusters while maximizing the intra-cluster similarity and inter-cluster separation. While clustering of genes facilitates learning the functions of un-characterized genes using their association with known genes, clustering of assays reveals the disease stages and subtypes. Many clustering algorithms require the user to specify the number of clusters a priori. A wrong specification of number of clusters generally leads to either failure to detect novel clusters (disease subtypes or unnecessary splitting of natural clusters. Results We have developed a novel method to find the number of clusters in gene expression data. Our procedure evaluates different partitions (each with different number of clusters from the clustering algorithm and finds the partition that best describes the data. In contrast to the existing methods that evaluate the partitions independently, our procedure considers the dynamic rearrangement of cluster members when a new cluster is added. Partition quality is measured based on a new index called Net InFormation Transfer Index (NIFTI that measures the information change when an additional cluster is introduced. Information content of a partition increases when clusters do not intersect and decreases if they are not clearly separated. A partition with the highest Total Information Content (TIC is selected as the optimal one. We illustrate our method using four publicly available microarray datasets. Conclusion In all four case studies, the proposed method correctly identified the number of clusters and performs better than other well known methods. Our method also showed invariance to the clustering techniques.

  9. NJ cluster analysis of the SnRK2, PYR/PYL/RCAR, and ABF genes in Tibetan hulless barley.

    Science.gov (United States)

    Yuan, H J; Wang, Y L; Wei, Z X; Xu, Q J; Zeng, X Q; Tang, Y W; Nyima, T S

    2016-11-03

    The abscisic acid (ABA) signaling pathway is known as one of the most important signaling pathways in plants and is mediated by multiple regulators. The genes SnRK2, PYR/PYL/RCAR, and ABF are relevant to both ABA-dependent and -independent signaling pathways. To elucidate the profile of these genes from Tibetan hulless barley (Hordeum vulgare L. var. nudum Hook. f.), we collected available sequences from RNA-Seq data, together with NCBI data from five other model plant species (Arabidopsis thaliana, Brachypodium distachyon, Oryza sativa, Populus trichocarpa, and Sorghum bicolor). Gene trees of SnRK2, PYR/PYL/RCAR, and ABF were constructed using a neighbor joining (NJ) method. For all genes, we identified a dominant group in which all six species were represented. Three, four, and five groups were found in the NJ trees of SnRK2, PYR/PYL/RCAR, and ABF, respectively. For each gene, Tibetan hulless barley was divided into three groups. Our analyses indicated that Tibetan hulless barley was associated with B. distachyon. The NJ cluster analysis also suggested that Tibetan hulless barley was affiliated with S. bicolor (SnRK2), A. thaliana (PYR/PYL/RCAR), and O. sativa (ABF). These results illustrate a diverse expression of genes SnRK2, PYR/PYL/RCAR, and ABF, and suggest a relationship among the six species studied. Collectively, our characterization of the three components of the ABA signaling pathway may contribute to improve stress tolerance in Tibetan hulless barley.

  10. Functional characterization of diverse ring-hydroxylating oxygenases and induction of complex aromatic catabolic gene clusters in Sphingobium sp. PNB

    Directory of Open Access Journals (Sweden)

    Pratick Khara

    2014-01-01

    Full Text Available Sphingobium sp. PNB, like other sphingomonads, has multiple ring-hydroxylating oxygenase (RHO genes. Three different fosmid clones have been sequenced to identify the putative genes responsible for the degradation of various aromatics in this bacterial strain. Comparison of the map of the catabolic genes with that of different sphingomonads revealed a similar arrangement of gene clusters that harbors seven sets of RHO terminal components and a sole set of electron transport (ET proteins. The presence of distinctly conserved amino acid residues in ferredoxin and in silico molecular docking analyses of ferredoxin with the well characterized terminal oxygenase components indicated the structural uniqueness of the ET component in sphingomonads. The predicted substrate specificities, derived from the phylogenetic relationship of each of the RHOs, were examined based on transformation of putative substrates and their structural homologs by the recombinant strains expressing each of the oxygenases and the sole set of available ET proteins. The RHO AhdA1bA2b was functionally characterized for the first time and was found to be capable of transforming ethylbenzene, propylbenzene, cumene, p-cymene and biphenyl, in addition to a number of polycyclic aromatic hydrocarbons. Overexpression of aromatic catabolic genes in strain PNB, revealed by real-time PCR analyses, is a way forward to understand the complex regulation of degradative genes in sphingomonads.

  11. The ArcD1 and ArcD2 arginine/ornithine exchangers encoded in the arginine deiminase (ADI) pathway gene cluster of Lactococcus lactis

    NARCIS (Netherlands)

    Noens, Elke E E; Kaczmarek, Michał B; Żygo, Monika; Lolkema, Juke S

    2015-01-01

    The arginine deiminase pathway (ADI) gene cluster in Lactococcus lactis contains two copies of a gene encoding an L-arginine/L-ornithine exchanger, the arcD1 and arcD2 genes. The physiological function of ArcD1 and ArcD2 was studied by deleting the two genes. Deletion of arcD1 resulted in loss of

  12. ConGEMs: Condensed Gene Co-Expression Module Discovery Through Rule-Based Clustering and Its Application to Carcinogenesis

    Directory of Open Access Journals (Sweden)

    Saurav Mallik

    2017-12-01

    Full Text Available For transcriptomic analysis, there are numerous microarray-based genomic data, especially those generated for cancer research. The typical analysis measures the difference between a cancer sample-group and a matched control group for each transcript or gene. Association rule mining is used to discover interesting item sets through rule-based methodology. Thus, it has advantages to find causal effect relationships between the transcripts. In this work, we introduce two new rule-based similarity measures—weighted rank-based Jaccard and Cosine measures—and then propose a novel computational framework to detect condensed gene co-expression modules ( C o n G E M s through the association rule-based learning system and the weighted similarity scores. In practice, the list of evolved condensed markers that consists of both singular and complex markers in nature depends on the corresponding condensed gene sets in either antecedent or consequent of the rules of the resultant modules. In our evaluation, these markers could be supported by literature evidence, KEGG (Kyoto Encyclopedia of Genes and Genomes pathway and Gene Ontology annotations. Specifically, we preliminarily identified differentially expressed genes using an empirical Bayes test. A recently developed algorithm—RANWAR—was then utilized to determine the association rules from these genes. Based on that, we computed the integrated similarity scores of these rule-based similarity measures between each rule-pair, and the resultant scores were used for clustering to identify the co-expressed rule-modules. We applied our method to a gene expression dataset for lung squamous cell carcinoma and a genome methylation dataset for uterine cervical carcinogenesis. Our proposed module discovery method produced better results than the traditional gene-module discovery measures. In summary, our proposed rule-based method is useful for exploring biomarker modules from transcriptomic data.

  13. Identification and characterization of plant-specific NAC gene family in canola (Brassica napus L.) reveal novel members involved in cell death.

    Science.gov (United States)

    Wang, Boya; Guo, Xiaohua; Wang, Chen; Ma, Jieyu; Niu, Fangfang; Zhang, Hanfeng; Yang, Bo; Liang, Wanwan; Han, Feng; Jiang, Yuan-Qing

    2015-03-01

    NAC transcription factors are plant-specific and play important roles in plant development processes, response to biotic and abiotic cues and hormone signaling. However, to date, little is known about the NAC genes in canola (or oilseed rape, Brassica napus L.). In this study, a total of 60 NAC genes were identified from canola through a systematical analysis and mining of expressed sequence tags. Among these, the cDNA sequences of 41 NAC genes were successfully cloned. The translated protein sequences of canola NAC genes with the NAC genes from representative species were phylogenetically clustered into three major groups and multiple subgroups. The transcriptional activities of these BnaNAC proteins were assayed in yeast. In addition, by quantitative real-time RT-PCR, we further observed that some of these BnaNACs were regulated by different hormone stimuli or abiotic stresses. Interestingly, we successfully identified two novel BnaNACs, BnaNAC19 and BnaNAC82, which could elicit hypersensitive response-like cell death when expressed in Nicotiana benthamiana leaves, which was mediated by accumulation of reactive oxygen species. Overall, our work has laid a solid foundation for further characterization of this important NAC gene family in canola.

  14. Prevalence of genes encoding for members of the staphylococcal leukotoxin family among clinical isolates of Staphylococcus aureus

    NARCIS (Netherlands)

    von Eiff, Christof; Friedrich, Alexander W.; Peters, Georg; Becker, Karsten

    Well-characterized Staphylococcus aureus nasal and blood isolates (N = 429) were tested by polymerase chain reaction for the prevalence of genes that encode leukocidal toxins. The leukotoxin genes lukE+lukD were found at high prevalence, significantly more so in blood (82%) than in nasal isolates

  15. Neural networks and Fuzzy clustering methods for assessing the efficacy of microarray based intrinsic gene signatures in breast cancer classification and the character and relations of identified subtypes.

    Science.gov (United States)

    Samarasinghe, Sandhya; Chaiboonchoe, Amphun

    2015-01-01

    In the classification of breast cancer subtypes using microarray data, hierarchical clustering is commonly used. Although this form of clustering shows basic cluster patterns, more needs to be done to investigate the accuracy of clusters as well as to extract meaningful cluster characteristics and their relations to increase our confidence in their use in a clinical setting. In this study, an in-depth investigation of the efficacy of three reported gene subsets in distinguishing breast cancer subtypes was performed using four advanced computational intelligence methods-Self-Organizing Maps (SOM), Emergent Self-Organizing Maps (ESOM), Fuzzy Clustering by Local Approximation of Memberships (FLAME), and Fuzzy C-means (FCM)-each differing in the way they view data in terms of distance measures and fuzzy or crisp clustering. The gene subsets consisted of 71, 93, and 71 genes reported in the literature from three comprehensive experimental studies for distinguishing Luminal (A and B), Basal, Normal breast-like, and HER2 subtypes. Given the costly procedures involved in clinical studies, the proposed 93-gene set can be used for preliminary classification of breast cancer. Then, as a decision aid, SOM can be used to map the gene signature of a new patient to locate them with respect to all subtypes to get a comprehensive view of the classification. These can be followed by a deeper investigation in the light of the observations made in this study regarding overlapping subtypes. Results from the study could be used as the base for further refining the gene signatures from later experiments and from new experiments designed to separate overlapping clusters as well as to maximally separate all clusters.

  16. IMG-ABC: An Atlas of Biosynthetic Gene Clusters to Fuel the Discovery of Novel Secondary Metabolites

    Energy Technology Data Exchange (ETDEWEB)

    Chen, I-Min; Chu, Ken; Ratner, Anna; Palaniappan, Krishna; Huang, Jinghua; Reddy, T. B.K.; Cimermancic, Peter; Fischbach, Michael; Ivanova, Natalia; Markowitz, Victor; Kyrpides, Nikos; Pati, Amrita

    2014-10-28

    In the discovery of secondary metabolites (SMs), large-scale analysis of sequence data is a promising exploration path that remains largely underutilized due to the lack of relevant computational resources. We present IMG-ABC (https://img.jgi.doe.gov/abc/) -- An Atlas of Biosynthetic gene Clusters within the Integrated Microbial Genomes (IMG) system1. IMG-ABC is a rich repository of both validated and predicted biosynthetic clusters (BCs) in cultured isolates, single-cells and metagenomes linked with the SM chemicals they produce and enhanced with focused analysis tools within IMG. The underlying scalable framework enables traversal of phylogenetic dark matter and chemical structure space -- serving as a doorway to a new era in the discovery of novel molecules.