Grigoriev, Igor V.; Banks, Jo Ann; Nishiyama, Tomoaki; Hasebe, Mitsuyasu; Bowman, John L.; Gribskov, Michael; dePamphilis, Claude; Albert, Victor A.; Aono, Naoki; Aoyama, Tsuyoshi; Ambrose, Barbara A.; Ashton, Neil W.; Axtell, Michael J.; Barker, Elizabeth; Barker, Michael S.; Bennetzen, Jeffrey L.; Bonawitz, Nicholas D.; Chapple, Clint; Cheng, Chaoyang; Correa, Luiz Gustavo Guedes; Dacre, Michael; DeBarry, Jeremy; Dreyer, Ingo; Elias, Marek; Engstrom, Eric M.; Estelle, Mark; Feng, Liang; Finet, Cedric; Floyd, Sandra K.; Frommer, Wolf B.; Fujita, Tomomichi; Gramzow, Lydia; Gutensohn, Michael; Harholt, Jesper; Hattori, Mitsuru; Heyl, Alexander; Hirai, Tadayoshi; Hiwatashi, Yuji; Ishikawa, Masaki; Iwata, Mineko; Karol, Kenneth G.; Koehler, Barbara; Kolukisaoglu, Uener; Kubo, Minoru; Kurata, Tetsuya; Lalonde, Sylvie; Li, Kejie; Li, Ying; Litt, Amy; Lyons, Eric; Manning, Gerard; Maruyama, Takeshi; Michael, Todd P.; Mikami, Koji; Miyazaki, Saori; Morinaga, Shin-ichi; Murata, Takashi; Mueller-Roeber, Bernd; Nelson, David R.; Obara, Mari; Oguri, Yasuko; Olmstead, Richard G.; Onodera, Naoko; Petersen, Bent Larsen; Pils, Birgit; Prigge, Michael; Rensing, Stefan A.; Riano-Pachon, Diego Mauricio; Roberts, Alison W.; Sato, Yoshikatsu; Scheller, Henrik Vibe; Schulz, Burkhard; Schulz, Christian; Shakirov, Eugene V.; Shibagaki, Nakako; Shinohara, Naoki; Shippen, Dorothy E.; Sorensen, Iben; Sotooka, Ryo; Sugimoto, Nagisa; Sugita, Mamoru; Sumikawa, Naomi; Tanurdzic, Milos; Theilsen, Gunter; Ulvskov, Peter; Wakazuki, Sachiko; Weng, Jing-Ke; Willats, William W.G.T.; Wipf, Daniel; Wolf, Paul G.; Yang, Lixing; Zimmer, Andreas D.; Zhu, Qihui; Mitros, Therese; Hellsten, Uffe; Loque, Dominique; Otillar, Robert; Salamov, Asaf; Schmutz, Jeremy; Shapiro, Harris; Lindquist, Erika; Lucas, Susan; Rokhsar, Daniel
We report the genome sequence of the nonseed vascular plant, Selaginella moellendorffii, and by comparative genomics identify genes that likely played important roles in the early evolution of vascular plants and their subsequent evolution
González-Plaza, Juan J; Ortiz-Martín, Inmaculada; Muñoz-Mérida, Antonio; García-López, Carmen; Sánchez-Sevilla, José F; Luque, Francisco; Trelles, Oswaldo; Bejarano, Eduardo R; De La Rosa, Raúl; Valpuesta, Victoriano; Beuzón, Carmen R
Plant architecture is a critical trait in fruit crops that can significantly influence yield, pruning, planting density and harvesting. Little is known about how plant architecture is genetically determined in olive, were most of the existing varieties are traditional with an architecture poorly suited for modern growing and harvesting systems. In the present study, we have carried out microarray analysis of meristematic tissue to compare expression profiles of olive varieties displaying differences in architecture, as well as seedlings from their cross pooled on the basis of their sharing architecture-related phenotypes. The microarray used, previously developed by our group has already been applied to identify candidates genes involved in regulating juvenile to adult transition in the shoot apex of seedlings. Varieties with distinct architecture phenotypes and individuals from segregating progenies displaying opposite architecture features were used to link phenotype to expression. Here, we identify 2252 differentially expressed genes (DEGs) associated to differences in plant architecture. Microarray results were validated by quantitative RT-PCR carried out on genes with functional annotation likely related to plant architecture. Twelve of these genes were further analyzed in individual seedlings of the corresponding pool. We also examined Arabidopsis mutants in putative orthologs of these targeted candidate genes, finding altered architecture for most of them. This supports a functional conservation between species and potential biological relevance of the candidate genes identified. This study is the first to identify genes associated to plant architecture in olive, and the results obtained could be of great help in future programs aimed at selecting phenotypes adapted to modern cultivation practices in this species.
Full Text Available Abstract Background The elucidation of transcriptional regulation in plant genes is important area of research for plant scientists, following the mapping of various plant genomes, such as A. thaliana, O. sativa and Z. mays. A variety of bioinformatic servers or databases of plant promoters have been established, although most have been focused only on annotating transcription factor binding sites in a single gene and have neglected some important regulatory elements (tandem repeats and CpG/CpNpG islands in promoter regions. Additionally, the combinatorial interaction of transcription factors (TFs is important in regulating the gene group that is associated with the same expression pattern. Therefore, a tool for detecting the co-regulation of transcription factors in a group of gene promoters is required. Results This study develops a database-assisted system, PlantPAN (Plant Promoter Analysis Navigator, for recognizing combinatorial cis-regulatory elements with a distance constraint in sets of plant genes. The system collects the plant transcription factor binding profiles from PLACE, TRANSFAC (public release 7.0, AGRIS, and JASPER databases and allows users to input a group of gene IDs or promoter sequences, enabling the co-occurrence of combinatorial transcription factor binding sites (TFBSs within a defined distance (20 bp to 200 bp to be identified. Furthermore, the new resource enables other regulatory features in a plant promoter, such as CpG/CpNpG islands and tandem repeats, to be displayed. The regulatory elements in the conserved regions of the promoters across homologous genes are detected and presented. Conclusion In addition to providing a user-friendly input/output interface, PlantPAN has numerous advantages in the analysis of a plant promoter. Several case studies have established the effectiveness of PlantPAN. This novel analytical resource is now freely available at http://PlantPAN.mbc.nctu.edu.tw.
Egelund, Jack; Skjøt, Michael; Geshi, Naomi
Plant cell wall (CW) synthesizing enzymes can be divided into the glycan (i.e. cellulose and callose) synthases, which are multimembrane spanning proteins located at the plasma membrane, and the glycosyltransferases (GTs), which are Golgi localized single membrane spanning proteins, believed....... Although much is known with regard to composition and fine structures of the plant CW, only a handful of CW biosynthetic GT genes-all classified in the CAZy system-have been characterized. In an effort to identify CW GTs that have not yet been classified in the CAZy database, a simple bioinformatics...... approach was adopted. First, the entire Arabidopsis proteome was run through the Transmembrane Hidden Markov Model 2.0 server and proteins containing one or, more rarely, two transmembrane domains within the N-terminal 150 amino acids were collected. Second, these sequences were submitted...
Blavet, Nicolas; Blavet, Hana; Muyle, A.; Käfer, J.; Cegan, R.; Deschamps, C.; Zemp, N.; Mousset, S.; Aubourg, S.; Bergero, R.; Charlesworth, D.; Hobza, Roman; Widmer, A.; Marais, G.A.B.
Roč. 16, JUL 25 (2015), s. 546 ISSN 1471-2164 R&D Projects: GA ČR GAP501/12/2220 Institutional support: RVO:61389030 Keywords : Sex chromosomes * Sex-linked genes * Plant Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 3.867, year: 2015
Ranjan, Aashish; Ichihashi, Yasunori; Farhi, Moran; Zumstein, Kristina; Townsley, Brad; David-Schwartz, Rakefet; Sinha, Neelima R
Parasitic flowering plants are one of the most destructive agricultural pests and have major impact on crop yields throughout the world. Being dependent on finding a host plant for growth, parasitic plants penetrate their host using specialized organs called haustoria. Haustoria establish vascular connections with the host, which enable the parasite to steal nutrients and water. The underlying molecular and developmental basis of parasitism by plants is largely unknown. In order to investigate the process of parasitism, RNAs from different stages (i.e. seed, seedling, vegetative strand, prehaustoria, haustoria, and flower) were used to de novo assemble and annotate the transcriptome of the obligate plant stem parasite dodder (Cuscuta pentagona). The assembled transcriptome was used to dissect transcriptional dynamics during dodder development and parasitism and identified key gene categories involved in the process of plant parasitism. Host plant infection is accompanied by increased expression of parasite genes underlying transport and transporter categories, response to stress and stimuli, as well as genes encoding enzymes involved in cell wall modifications. By contrast, expression of photosynthetic genes is decreased in the dodder infective stages compared with normal stem. In addition, genes relating to biosynthesis, transport, and response of phytohormones, such as auxin, gibberellins, and strigolactone, were differentially expressed in the dodder infective stages compared with stems and seedlings. This analysis sheds light on the transcriptional changes that accompany plant parasitism and will aid in identifying potential gene targets for use in controlling the infestation of crops by parasitic weeds. © 2014 American Society of Plant Biologists. All Rights Reserved.
Su, Junji; Li, Libei; Zhang, Chi; Wang, Caixiang; Gu, Lijiao; Wang, Hantao; Wei, Hengling; Liu, Qibao; Huang, Long; Yu, Shuxun
Thirty significant associations between 22 SNPs and five plant architecture component traits in Chinese upland cotton were identified via GWAS. Four peak SNP loci located on chromosome D03 were simultaneously associated with more plant architecture component traits. A candidate gene, Gh_D03G0922, might be responsible for plant height in upland cotton. A compact plant architecture is increasingly required for mechanized harvesting processes in China. Therefore, cotton plant architecture is an important trait, and its components, such as plant height, fruit branch length and fruit branch angle, affect the suitability of a cultivar for mechanized harvesting. To determine the genetic basis of cotton plant architecture, a genome-wide association study (GWAS) was performed using a panel composed of 355 accessions and 93,250 single nucleotide polymorphisms (SNPs) identified using the specific-locus amplified fragment sequencing method. Thirty significant associations between 22 SNPs and five plant architecture component traits were identified via GWAS. Most importantly, four peak SNP loci located on chromosome D03 were simultaneously associated with more plant architecture component traits, and these SNPs were harbored in one linkage disequilibrium block. Furthermore, 21 candidate genes for plant architecture were predicted in a 0.95-Mb region including the four peak SNPs. One of these genes (Gh_D03G0922) was near the significant SNP D03_31584163 (8.40 kb), and its Arabidopsis homologs contain MADS-box domains that might be involved in plant growth and development. qRT-PCR showed that the expression of Gh_D03G0922 was upregulated in the apical buds and young leaves of the short and compact cotton varieties, and virus-induced gene silencing (VIGS) proved that the silenced plants exhibited increased PH. These results indicate that Gh_D03G0922 is likely the candidate gene for PH in cotton. The genetic variations and candidate genes identified in this study lay a foundation
Jan E Aagaard
Full Text Available Understanding the genetic basis of reproductive isolation promises insight into speciation and the origins of biological diversity. While progress has been made in identifying genes underlying barriers to reproduction that function after fertilization (post-zygotic isolation, we know much less about earlier acting pre-zygotic barriers. Of particular interest are barriers involved in mating and fertilization that can evolve extremely rapidly under sexual selection, suggesting they may play a prominent role in the initial stages of reproductive isolation. A significant challenge to the field of speciation genetics is developing new approaches for identification of candidate genes underlying these barriers, particularly among non-traditional model systems. We employ powerful proteomic and genomic strategies to study the genetic basis of conspecific pollen precedence, an important component of pre-zygotic reproductive isolation among yellow monkeyflowers (Mimulus spp. resulting from male pollen competition. We use isotopic labeling in combination with shotgun proteomics to identify more than 2,000 male function (pollen tube proteins within maternal reproductive structures (styles of M. guttatus flowers where pollen competition occurs. We then sequence array-captured pollen tube exomes from a large outcrossing population of M. guttatus, and identify those genes with evidence of selective sweeps or balancing selection consistent with their role in pollen competition. We also test for evidence of positive selection on these genes more broadly across yellow monkeyflowers, because a signal of adaptive divergence is a common feature of genes causing reproductive isolation. Together the molecular evolution studies identify 159 pollen tube proteins that are candidate genes for conspecific pollen precedence. Our work demonstrates how powerful proteomic and genomic tools can be readily adapted to non-traditional model systems, allowing for genome-wide screens
Full Text Available Abstract Background We have used the genomic data in the Integrated Microbial Genomes system of the Department of Energy’s Joint Genome Institute to make predictions about rhizobial open reading frames that play a role in nodulation of host plants. The genomic data was screened by searching for ORFs conserved in α-proteobacterial rhizobia, but not conserved in closely-related non-nitrogen-fixing α-proteobacteria. Results Using this approach, we identified many genes known to be involved in nodulation or nitrogen fixation, as well as several new candidate genes. We knocked out selected new genes and assayed for the presence of nodulation phenotypes and/or nodule-specific expression. One of these genes, SMc00911, is strongly expressed by bacterial cells within host plant nodules, but is expressed minimally by free-living bacterial cells. A strain carrying an insertion mutation in SMc00911 is not defective in the symbiosis with host plants, but in contrast to expectations, this mutant strain is able to out-compete the S. meliloti 1021 wild type strain for nodule occupancy in co-inoculation experiments. The SMc00911 ORF is predicted to encode a “SodM-like” (superoxide dismutase-like protein containing a rhodanese sulfurtransferase domain at the N-terminus and a chromate-resistance superfamily domain at the C-terminus. Several other ORFs (SMb20360, SMc01562, SMc01266, SMc03964, and the SMc01424-22 operon identified in the screen are expressed at a moderate level by bacteria within nodules, but not by free-living bacteria. Conclusions Based on the analysis of ORFs identified in this study, we conclude that this comparative genomics approach can identify rhizobial genes involved in the nitrogen-fixing symbiosis with host plants, although none of the newly identified genes were found to be essential for this process.
Reusch Thorsten BH
Full Text Available Abstract Background Seagrasses are a polyphyletic group of monocotyledonous angiosperms that have adapted to a completely submerged lifestyle in marine waters. Here, we exploit two collections of expressed sequence tags (ESTs of two wide-spread and ecologically important seagrass species, the Mediterranean seagrass Posidonia oceanica (L. Delile and the eelgrass Zostera marina L., which have independently evolved from aquatic ancestors. This replicated, yet independent evolutionary history facilitates the identification of traits that may have evolved in parallel and are possible instrumental candidates for adaptation to a marine habitat. Results In our study, we provide the first quantitative perspective on molecular adaptations in two seagrass species. By constructing orthologous gene clusters shared between two seagrasses (Z. marina and P. oceanica and eight distantly related terrestrial angiosperm species, 51 genes could be identified with detection of positive selection along the seagrass branches of the phylogenetic tree. Characterization of these positively selected genes using KEGG pathways and the Gene Ontology uncovered that these genes are mostly involved in translation, metabolism, and photosynthesis. Conclusions These results provide first insights into which seagrass genes have diverged from their terrestrial counterparts via an initial aquatic stage characteristic of the order and to the derived fully-marine stage characteristic of seagrasses. We discuss how adaptive changes in these processes may have contributed to the evolution towards an aquatic and marine existence.
Wissler, Lothar; Codoñer, Francisco M; Gu, Jenny; Reusch, Thorsten B H; Olsen, Jeanine L; Procaccini, Gabriele; Bornberg-Bauer, Erich
Seagrasses are a polyphyletic group of monocotyledonous angiosperms that have adapted to a completely submerged lifestyle in marine waters. Here, we exploit two collections of expressed sequence tags (ESTs) of two wide-spread and ecologically important seagrass species, the Mediterranean seagrass Posidonia oceanica (L.) Delile and the eelgrass Zostera marina L., which have independently evolved from aquatic ancestors. This replicated, yet independent evolutionary history facilitates the identification of traits that may have evolved in parallel and are possible instrumental candidates for adaptation to a marine habitat. In our study, we provide the first quantitative perspective on molecular adaptations in two seagrass species. By constructing orthologous gene clusters shared between two seagrasses (Z. marina and P. oceanica) and eight distantly related terrestrial angiosperm species, 51 genes could be identified with detection of positive selection along the seagrass branches of the phylogenetic tree. Characterization of these positively selected genes using KEGG pathways and the Gene Ontology uncovered that these genes are mostly involved in translation, metabolism, and photosynthesis. These results provide first insights into which seagrass genes have diverged from their terrestrial counterparts via an initial aquatic stage characteristic of the order and to the derived fully-marine stage characteristic of seagrasses. We discuss how adaptive changes in these processes may have contributed to the evolution towards an aquatic and marine existence.
Hussey, Richard S; Huang, Guozhong; Allen, Rex
Identifying parasitism genes encoding proteins secreted from a plant-parasitic nematode's esophageal gland cells and injected through its stylet into plant tissue is the key to understanding the molecular basis of nematode parasitism of plants. Parasitism genes have been cloned by directly microaspirating the cytoplasm from the esophageal gland cells of different parasitic stages of cyst or root-knot nematodes to provide mRNA to create a gland cell-specific cDNA library by long-distance reverse-transcriptase polymerase chain reaction. cDNA clones are sequenced and deduced protein sequences with a signal peptide for secretion are identified for high-throughput in situ hybridization to confirm gland-specific expression.
Singh, Vinay Kumar; Ambwani, Sonu; Marla, Soma; Kumar, Anil
We describe the development of a user friendly tool that would assist in the retrieval of information relating to Cry genes in transgenic crops. The tool also helps in detection of transformed Cry genes from Bacillus thuringiensis present in transgenic plants by providing suitable designed primers for PCR identification of these genes. The tool designed based on relational database model enables easy retrieval of information from the database with simple user queries. The tool also enables users to access related information about Cry genes present in various databases by interacting with different sources (nucleotide sequences, protein sequence, sequence comparison tools, published literature, conserved domains, evolutionary and structural data). http://insilicogenomics.in/Cry-btIdentifier/welcome.html.
Fusarium oxysporum is the causative agent of fungal wilt disease in a variety of crops. The capacity of a fungal pathogen such as F. oxysporum f. sp. nicotianae to establish infection on its tobacco (Nicotiana tabacum) host depends in part on its capacity to evade the toxicity of tobacco defense proteins, such as osmotin. Fusarium genes that control resistance to osmotin would therefore reflect coevolutionary pressures and include genes that control mutual recognition, avoidance, and detoxification. We identified FOR (Fusarium Osmotin Resistance) genes on the basis of their ability to confer osmotin resistance to an osmotin-sensitive strain of Saccharomyces cerevisiae. FOR1 encodes a putative cell wall glycoprotein. FOR2 encodes the structural gene for glutamine:fructose-6-phosphate amidotransferase, the first and rate-limiting step in the biosynthesis of hexosamine and cell wall chitin. FOR3 encodes a homolog of SSD1, which controls cell wall composition, longevity, and virulence in S. cerevisiae. A for3 null mutation increased osmotin sensitivity of conidia and hyphae of F. oxysporum f. sp. nicotianae and also reduced cell wall β-1,3-glucan content. Together our findings show that conserved fungal genes that determine cell wall properties play a crucial role in regulating fungal susceptibility to the plant defense protein osmotin.
Lee, H.; Damsz, B.; Woloshuk, C. P.; Bressan, R. A.; Narasimhan, Meena L.
Fusarium oxysporum is the causative agent of fungal wilt disease in a variety of crops. The capacity of a fungal pathogen such as F. oxysporum f. sp. nicotianae to establish infection on its tobacco (Nicotiana tabacum) host depends in part on its capacity to evade the toxicity of tobacco defense proteins, such as osmotin. Fusarium genes that control resistance to osmotin would therefore reflect coevolutionary pressures and include genes that control mutual recognition, avoidance, and detoxification. We identified FOR (Fusarium Osmotin Resistance) genes on the basis of their ability to confer osmotin resistance to an osmotin-sensitive strain of Saccharomyces cerevisiae. FOR1 encodes a putative cell wall glycoprotein. FOR2 encodes the structural gene for glutamine:fructose-6-phosphate amidotransferase, the first and rate-limiting step in the biosynthesis of hexosamine and cell wall chitin. FOR3 encodes a homolog of SSD1, which controls cell wall composition, longevity, and virulence in S. cerevisiae. A for3 null mutation increased osmotin sensitivity of conidia and hyphae of F. oxysporum f. sp. nicotianae and also reduced cell wall β-1,3-glucan content. Together our findings show that conserved fungal genes that determine cell wall properties play a crucial role in regulating fungal susceptibility to the plant defense protein osmotin.
Full Text Available BACKGROUND: Reaumuria soongorica is an extreme xerophyte shrub widely distributed in the desert regions including sand dune, Gobi and marginal loess of central Asia which plays a crucial role to sustain and restore fragile desert ecosystems. However, due to the lacking of the genomic sequences, studies on R. soongorica had mainly limited in physiological responses to drought stress. Here, a deep transcriptomic sequencing of R. soongorica will facilitate molecular functional studies and pave the path to understand drought adaptation for a desert plant. METHODOLOGY/PRINCIPAL FINDINGS: A total of 53,193,660 clean paired-end reads was generated from the Illumina HiSeq™ 2000 platform. By assembly with Trinity, we got 173,700 contigs and 77,647 unigenes with mean length of 677 bp and N50 of 1109 bp. Over 55% (43,054 unigenes were successfully annotated based on sequence similarity against public databases as well as Rfam and Pfam database. Local BLAST and Kyoto Encyclopedia of Genes and Genomes (KEGG maps were used to further exhausting seek for candidate genes related to drought adaptation and a set of 123 putative candidate genes were identified. Moreover, all the C4 photosynthesis genes existed and were active in R. soongorica, which has been regarded as a typical C3 plant. CONCLUSION/SIGNIFICANCE: The assembled unigenes in present work provide abundant genomic information for the functional assignments in an extreme xerophyte R. soongorica, and will help us exploit the genetic basis of how desert plants adapt to drought environment in the near future.
Full Text Available P. minus is an aromatic plant, the leaf of which is widely used as a food additive and in the perfume industry. The leaf also accumulates secondary metabolites that act as active ingredients such as flavonoid. Due to limited genomic and transcriptomic data, the biosynthetic pathway of flavonoids is currently unclear. Identification of candidate genes involved in the flavonoid biosynthetic pathway will significantly contribute to understanding the biosynthesis of active compounds. We have constructed a standard cDNA library from P. minus leaves, and two normalized full-length enriched cDNA libraries were constructed from stem and root organs in order to create a gene resource for the biosynthesis of secondary metabolites, especially flavonoid biosynthesis. Thus, large‑scale sequencing of P. minus cDNA libraries identified 4196 expressed sequences tags (ESTs which were deposited in dbEST in the National Center of Biotechnology Information (NCBI. From the three constructed cDNA libraries, 11 ESTs encoding seven genes were mapped to the flavonoid biosynthetic pathway. Finally, three flavonoid biosynthetic pathway-related ESTs chalcone synthase, CHS (JG745304, flavonol synthase, FLS (JG705819 and leucoanthocyanidin dioxygenase, LDOX (JG745247 were selected for further examination by quantitative RT-PCR (qRT-PCR in different P. minus organs. Expression was detected in leaf, stem and root. Gene expression studies have been initiated in order to better understand the underlying physiological processes.
Vandelle, Elodie; Puttilli, Maria Rita; Chini, Andrea; Devescovi, Giulia; Venturi, Vittorio; Polverari, Annalisa
The life cycle of bacterial phytopathogens consists of a benign epiphytic phase, during which the bacteria grow in the soil or on the plant surface, and a virulent endophytic phase involving the penetration of host defenses and the colonization of plant tissues. Innovative strategies are urgently required to integrate copper treatments that control the epiphytic phase with complementary tools that control the virulent endophytic phase, thus reducing the quantity of chemicals applied to economically and ecologically acceptable levels. Such strategies include targeted treatments that weaken bacterial pathogens, particularly those inhibiting early infection steps rather than tackling established infections. This chapter describes a reporter gene-based chemical genomic high-throughput screen for the induction of bacterial virulence by plant molecules. Specifically, we describe a chemical genomic screening method to identify agonist and antagonist molecules for the induction of targeted bacterial virulence genes by plant extracts, focusing on the experimental controls required to avoid false positives and thus ensuring the results are reliable and reproducible.
Ranjan, Aashish; Ichihashi, Yasunori; Farhi, Moran; Zumstein, Kristina; Townsley, Brad; David-Schwartz, Rakefet; Sinha, Neelima R.
Parasitic flowering plants are one of the most destructive agricultural pests and have major impact on crop yields throughout the world. Being dependent on finding a host plant for growth, parasitic plants penetrate their host using specialized organs called haustoria. Haustoria establish vascular connections with the host, which enable the parasite to steal nutrients and water. The underlying molecular and developmental basis of parasitism by plants is largely unknown. In order to investigate the process of parasitism, RNAs from different stages (i.e. seed, seedling, vegetative strand, prehaustoria, haustoria, and flower) were used to de novo assemble and annotate the transcriptome of the obligate plant stem parasite dodder (Cuscuta pentagona). The assembled transcriptome was used to dissect transcriptional dynamics during dodder development and parasitism and identified key gene categories involved in the process of plant parasitism. Host plant infection is accompanied by increased expression of parasite genes underlying transport and transporter categories, response to stress and stimuli, as well as genes encoding enzymes involved in cell wall modifications. By contrast, expression of photosynthetic genes is decreased in the dodder infective stages compared with normal stem. In addition, genes relating to biosynthesis, transport, and response of phytohormones, such as auxin, gibberellins, and strigolactone, were differentially expressed in the dodder infective stages compared with stems and seedlings. This analysis sheds light on the transcriptional changes that accompany plant parasitism and will aid in identifying potential gene targets for use in controlling the infestation of crops by parasitic weeds. PMID:24399359
Full Text Available Bursaphelenchus mucronatus (B. mucronatus isolates that originate from different regions may vary in their virulence, but their virulence-associated genes and proteins are poorly understood. Thus, we conducted an integrated study coupling RNA-Seq and isobaric tags for relative and absolute quantitation (iTRAQ to analyse transcriptomic and proteomic data of highly and weakly virulent B. mucronatus isolates during the pathogenic processes. Approximately 40,000 annotated unigenes and 5000 proteins were gained from the isolates. When we matched all of the proteins with their detected transcripts, a low correlation coefficient of r = 0.138 was found, indicating probable post-transcriptional gene regulation involved in the pathogenic processes. A functional analysis showed that five differentially expressed proteins which were all highly expressed in the highly virulent isolate were involved in the pathogenic processes of nematodes. Peroxiredoxin, fatty acid- and retinol-binding protein, and glutathione peroxidase relate to resistance against plant defence responses, while β-1,4-endoglucanase and expansin are associated with the breakdown of plant cell walls. Thus, the pathogenesis of B. mucronatus depends on its successful survival in host plants. Our work adds to the understanding of B. mucronatus’ pathogenesis, and will aid in controlling B. mucronatus and other pinewood nematode species complexes in the future.
Swertia mussotii Franch. is an important traditional Tibetan medicinal plant with pharmacological properties useful for the treatment of various ailments, such as hepatitis. Secoiridoids, including swertiamarin, are the major bioactive compounds in S. mussotii. The development of genomic resources ...
Zubko, E.; Adams, Ch.; Macháčková, Ivana; Malbeck, Jiří; Scollan, C.; Meyer, P.
Roč. 29, č. 6 (2002), s. 797-808 ISSN 0960-7412 R&D Projects: GA ČR GA206/00/1354; GA MŠk LN00A081; GA ČR GA206/02/0967; GA ČR GA522/02/0530 Institutional research plan: CEZ:AV0Z5038910 Keywords : cytokinin * isopentenyl transferase * plant hormones Subject RIV: EF - Botanics Impact factor: 5.850, year: 2002
... News From NIH NIH Researchers Identify OCD Risk Gene Past Issues / Summer 2006 Table of Contents For ... and Alcoholism (NIAAA) have identified a previously unknown gene variant that doubles an individual's risk for obsessive- ...
The genetic modification of plants by gene technology is of immense potential benefits, but there may be possible risks. ... As a new endeavour, however, people have a mixed ... reality by gene biotechnology (Watson, 1997). Industrial ...
Kress, W. John; Wurdack, Kenneth J.; Zimmer, Elizabeth A.; Weigt, Lee A.; Janzen, Daniel H.
Methods for identifying species by using short orthologous DNA sequences, known as “DNA barcodes,” have been proposed and initiated to facilitate biodiversity studies, identify juveniles, associate sexes, and enhance forensic analyses. The cytochrome c oxidase 1 sequence, which has been found to be widely applicable in animal barcoding, is not appropriate for most species of plants because of a much slower rate of cytochrome c oxidase 1 gene evolution in higher plants than in animals. We ther...
Full Text Available Quinclorac is a highly selective auxin-type herbicide, and is widely used in the effective control of barnyard grass in paddy rice fields, improving the world’s rice yield. The herbicide mode of action of quinclorac has been proposed and hormone interactions affect quinclorac signaling. Because of widespread use, quinclorac may be transported outside rice fields with the drainage waters, leading to soil and water pollution and environmental health problems.In this study, we used 57K Affymetrix rice whole-genome array to identify quinclorac signaling response genes to study the molecular mechanisms of action and detoxification of quinclorac in rice plants. Overall, 637 probe sets were identified with differential expression levels under either 6 or 24 h of quinclorac treatment. Auxin-related genes such as GH3 and OsIAAs responded to quinclorac treatment. Gene Ontology analysis showed that genes of detoxification-related family genes were significantly enriched, including cytochrome P450, GST, UGT, and ABC and drug transporter genes. Moreover, real-time RT-PCR analysis showed that top candidate P450 families such as CYP81, CYP709C and CYP72A genes were universally induced by different herbicides. Some Arabidopsis genes for the same P450 family were up-regulated under quinclorac treatment.We conduct rice whole-genome GeneChip analysis and the first global identification of quinclorac response genes. This work may provide potential markers for detoxification of quinclorac and biomonitors of environmental chemical pollution.
Rubio, M. Belén; Quijada, Narciso M.; Pérez, Esclaudys; Domínguez, Sara; Hermosa, Rosa
Trichoderma parareesei and Trichoderma reesei (teleomorph Hypocrea jecorina) produce cellulases and xylanases of industrial interest. Here, the anamorphic strain T6 (formerly T. reesei) has been identified as T. parareesei, showing biocontrol potential against fungal and oomycete phytopathogens and enhanced hyphal growth in the presence of tomato exudates or plant cell wall polymers in in vitro assays. A Trichoderma microarray was used to examine the transcriptomic changes in T6 at 20 h of interaction with tomato plants. Out of a total 34,138 Trichoderma probe sets deposited on the microarray, 250 showed a significant change of at least 2-fold in expression in the presence of tomato plants, with most of them being downregulated. T. parareesei T6 exerted beneficial effects on tomato plants in terms of seedling lateral root development, and in adult plants it improved defense against Botrytis cinerea and growth promotion under salt stress. Time course expression patterns (0 to 6 days) observed for defense-related genes suggest that T6 was able to prime defense responses in the tomato plants against biotic and abiotic stresses. Such responses undulated, with a maximum upregulation of the jasmonic acid (JA)/ethylene (ET)-related LOX1 and EIN2 genes and the salt tolerance SOS1 gene at 24 h and that of the salicylic acid (SA)-related PR-1 gene at 48 h after T6 inoculation. Our study demonstrates that the T. parareesei T6-tomato interaction is beneficial to both partners. PMID:24413597
At the time of the dawn of agriculture, plant domestication was very slow. As agriculture progressed, however, domestication began to evolve faster and reached its highest point with the advent of plant breeders who played a very important role in solving the world food problem. One of the fastest moving strategies was a better exploitation of genetic diversity, both natural and induced. However, intensive plant breeding activity caused a heavy fall in genetic variability. Gene banks then provided a further tool for modern agriculture, specifically to preserve genetic resources and to help breeders to further domesticate important crops and to introduce and domesticate new species. (author). 3 refs
Fehrmann, Rudolf S. N.; Karjalainen, Juha M.; Krajewska, Malgorzata
Many cancer-associated somatic copy number alterations (SCNAs) are known. Currently, one of the challenges is to identify the molecular downstream effects of these variants. Although several SCNAs are known to change gene expression levels, it is not clear whether each individual SCNA affects gen...
Gan, Susheng; Guo, Yongfeng
The present invention discloses transgenic plants having an altered level of NAP protein compared to that of a non-transgenic plant, where the transgenic plants display an altered leaf senescence phenotype relative to a non-transgenic plant, as well as mutant plants comprising an inactivated NAP gene, where mutant plants display a delayed leaf senescence phenotype compared to that of a non-mutant plant. The present invention also discloses methods for delaying leaf senescence in a plant, as well as methods of making a mutant plant having a decreased level of NAP protein compared to that of a non-mutant plant, where the mutant plant displays a delayed leaf senescence phenotype relative to a non-mutant plant. Methods for causing precocious leaf senescence or promoting leaf senescence in a plant are also disclosed. Also disclosed are methods of identifying a candidate plant suitable for breeding that displays a delayed leaf senescence and/or enhanced yield phenotype.
Balestrini, Raffaella; Lanfranco, Luisa
Arbuscular mycorrhizas (AMs) are a unique example of symbiosis between two eukaryotes, soil fungi and plants. This association induces important physiological changes in each partner that lead to reciprocal benefits, mainly in nutrient supply. The symbiosis results from modifications in plant and fungal cell organization caused by specific changes in gene expression. Recently, much effort has gone into studying these gene expression patterns to identify a wider spectrum of genes involved. We aim in this review to describe AM symbiosis in terms of current knowledge on plant and fungal gene expression profiles.
Zulfiqar, Asma, E-mail: email@example.com [Department of Plant, Soil, and Insect Sciences, 270 Stockbridge Road, University of Massachusetts Amherst, MA 01003 (United States); Paulose, Bibin, E-mail: firstname.lastname@example.org [Department of Plant, Soil, and Insect Sciences, 270 Stockbridge Road, University of Massachusetts Amherst, MA 01003 (United States); Chhikara, Sudesh, E-mail: email@example.com [Department of Plant, Soil, and Insect Sciences, 270 Stockbridge Road, University of Massachusetts Amherst, MA 01003 (United States); Dhankher, Om Parkash, E-mail: firstname.lastname@example.org [Department of Plant, Soil, and Insect Sciences, 270 Stockbridge Road, University of Massachusetts Amherst, MA 01003 (United States)
Chromium pollution is a serious environmental problem with few cost-effective remediation strategies available. Crambe abyssinica (a member of Brassicaseae), a non-food, fast growing high biomass crop, is an ideal candidate for phytoremediation of heavy metals contaminated soils. The present study used a PCR-Select Suppression Subtraction Hybridization approach in C. abyssinica to isolate differentially expressed genes in response to Cr exposure. A total of 72 differentially expressed subtracted cDNAs were sequenced and found to represent 43 genes. The subtracted cDNAs suggest that Cr stress significantly affects pathways related to stress/defense, ion transporters, sulfur assimilation, cell signaling, protein degradation, photosynthesis and cell metabolism. The regulation of these genes in response to Cr exposure was further confirmed by semi-quantitative RT-PCR. Characterization of these differentially expressed genes may enable the engineering of non-food, high-biomass plants, including C. abyssinica, for phytoremediation of Cr-contaminated soils and sediments. - Highlights: > Molecular mechanism of Cr uptake and detoxification in plants is not well known. > We identified differentially regulated genes upon Cr exposure in Crambe abyssinica. > 72 Cr-induced subtracted cDNAs were sequenced and found to represent 43 genes. > Pathways linked to stress, ion transport, and sulfur assimilation were affected. > This is the first Cr transcriptome study in a crop with phytoremediation potential. - This study describes the identification and isolation of differentially expressed genes involved in chromium metabolism and detoxification in a non-food industrial oil crop Crambe abyssinica.
Bowman Rayleen V
Full Text Available Abstract Chronic obstructive pulmonary disease (COPD is a major public health problem. The aim of this study was to identify genes involved in emphysema severity in COPD patients. Gene expression profiling was performed on total RNA extracted from non-tumor lung tissue from 30 smokers with emphysema. Class comparison analysis based on gas transfer measurement was performed to identify differentially expressed genes. Genes were then selected for technical validation by quantitative reverse transcriptase-PCR (qRT-PCR if also represented on microarray platforms used in previously published emphysema studies. Genes technically validated advanced to tests of biological replication by qRT-PCR using an independent test set of 62 lung samples. Class comparison identified 98 differentially expressed genes (p p Gene expression profiling of lung from emphysema patients identified seven candidate genes associated with emphysema severity including COL6A3, SERPINF1, ZNHIT6, NEDD4, CDKN2A, NRN1 and GSTM3.
Full Text Available The Toll-interleukin-1 receptor (TIR and Nucleotide-binding site (NBS domains are two major components of the TIR-NBS-leucine-rich repeat family plant disease resistance genes. Extensive functional and evolutionary studies have been performed on these genes; however, the characterization of a small group of genes that are composed of atypical TIR and NBS domains, namely XTNX genes, is limited. The present study investigated this specific gene family by conducting genome-wide analyses of 59 green plant genomes. A total of 143 XTNX genes were identified in 51 of the 52 land plant genomes, whereas no XTNX gene was detected in any green algae genomes, which indicated that XTNX genes originated upon emergence of land plants. Phylogenetic analysis revealed that the ancestral XTNX gene underwent two rounds of ancient duplications in land plants, which resulted in the formation of clades I/II and clades IIa/IIb successively. Although clades I and IIb have evolved conservatively in angiosperms, the motif composition difference and sequence divergence at the amino acid level suggest that functional divergence may have occurred since the separation of the two clades. In contrast, several features of the clade IIa genes, including the absence in the majority of dicots, the long branches in the tree, the frequent loss of ancestral motifs, and the loss of expression in all detected tissues of Zea mays, all suggest that the genes in this lineage might have undergone pseudogenization. This study highlights that XTNX genes are a gene family originated anciently in land plants and underwent specific conservative pattern in evolution.
Zulfiqar, Asma; Paulose, Bibin; Chhikara, Sudesh; Dhankher, Om Parkash
Chromium pollution is a serious environmental problem with few cost-effective remediation strategies available. Crambe abyssinica (a member of Brassicaseae), a non-food, fast growing high biomass crop, is an ideal candidate for phytoremediation of heavy metals contaminated soils. The present study used a PCR-Select Suppression Subtraction Hybridization approach in C. abyssinica to isolate differentially expressed genes in response to Cr exposure. A total of 72 differentially expressed subtracted cDNAs were sequenced and found to represent 43 genes. The subtracted cDNAs suggest that Cr stress significantly affects pathways related to stress/defense, ion transporters, sulfur assimilation, cell signaling, protein degradation, photosynthesis and cell metabolism. The regulation of these genes in response to Cr exposure was further confirmed by semi-quantitative RT-PCR. Characterization of these differentially expressed genes may enable the engineering of non-food, high-biomass plants, including C. abyssinica, for phytoremediation of Cr-contaminated soils and sediments. - Highlights: → Molecular mechanism of Cr uptake and detoxification in plants is not well known. → We identified differentially regulated genes upon Cr exposure in Crambe abyssinica. → 72 Cr-induced subtracted cDNAs were sequenced and found to represent 43 genes. → Pathways linked to stress, ion transport, and sulfur assimilation were affected. → This is the first Cr transcriptome study in a crop with phytoremediation potential. - This study describes the identification and isolation of differentially expressed genes involved in chromium metabolism and detoxification in a non-food industrial oil crop Crambe abyssinica.
Victor M. Bii
Full Text Available Identifying novel genes that drive tumor metastasis and drug resistance has significant potential to improve patient outcomes. High-throughput sequencing approaches have identified cancer genes, but distinguishing driver genes from passengers remains challenging. Insertional mutagenesis screens using replication-incompetent retroviral vectors have emerged as a powerful tool to identify cancer genes. Unlike replicating retroviruses and transposons, replication-incompetent retroviral vectors lack additional mutagenesis events that can complicate the identification of driver mutations from passenger mutations. They can also be used for almost any human cancer due to the broad tropism of the vectors. Replication-incompetent retroviral vectors have the ability to dysregulate nearby cancer genes via several mechanisms including enhancer-mediated activation of gene promoters. The integrated provirus acts as a unique molecular tag for nearby candidate driver genes which can be rapidly identified using well established methods that utilize next generation sequencing and bioinformatics programs. Recently, retroviral vector screens have been used to efficiently identify candidate driver genes in prostate, breast, liver and pancreatic cancers. Validated driver genes can be potential therapeutic targets and biomarkers. In this review, we describe the emergence of retroviral insertional mutagenesis screens using replication-incompetent retroviral vectors as a novel tool to identify cancer driver genes in different cancer types.
Ernst, Antonia M; Rüping, Boris; Jekat, Stephan B; Nordzieke, Steffen; Reineke, Anna R; Müller, Boje; Bornberg-Bauer, Erich; Prüfer, Dirk; Noll, Gundula A
Sieve element occlusion (SEO) genes encoding forisome subunits have been identified in Medicago truncatula and other legumes. Forisomes are structural phloem proteins uniquely found in Fabaceae sieve elements. They undergo a reversible conformational change after wounding, from a condensed to a dispersed state, thereby blocking sieve tube translocation and preventing the loss of photoassimilates. Recently, we identified SEO genes in several non-Fabaceae plants (lacking forisomes) and concluded that they most probably encode conventional non-forisome P-proteins. Molecular and phylogenetic analysis of the SEO gene family has identified domains that are characteristic for SEO proteins. Here, we extended our phylogenetic analysis by including additional SEO genes from several diverse species based on recently published genomic data. Our results strengthen the original assumption that SEO genes seem to be widespread in dicotyledonous angiosperms, and further underline the divergent evolution of SEO genes within the Fabaceae.
Shu, Shengqiang; Rokhsar, Dan; Goodstein, David; Hayes, David; Mitros, Therese
Plant genomes vary in size and are highly complex with a high amount of repeats, genome duplication and tandem duplication. Gene encodes a wealth of information useful in studying organism and it is critical to have high quality and stable gene annotation. Thanks to advancement of sequencing technology, many plant species genomes have been sequenced and transcriptomes are also sequenced. To use these vastly large amounts of sequence data to make gene annotation or re-annotation in a timely fashion, an automatic pipeline is needed. JGI plant genomics gene annotation pipeline, called integrated gene call (IGC), is our effort toward this aim with aid of a RNA-seq transcriptome assembly pipeline. It utilizes several gene predictors based on homolog peptides and transcript ORFs. See Methods for detail. Here we present genome annotation of JGI flagship green plants produced by this pipeline plus Arabidopsis and rice except for chlamy which is done by a third party. The genome annotations of these species and others are used in our gene family build pipeline and accessible via JGI Phytozome portal whose URL and front page snapshot are shown below.
Ward, John M; Mäser, Pascal; Schroeder, Julian I
Distinct potassium, anion, and calcium channels in the plasma membrane and vacuolar membrane of plant cells have been identified and characterized by patch clamping. Primarily owing to advances in Arabidopsis genetics and genomics, and yeast functional complementation, many of the corresponding genes have been identified. Recent advances in our understanding of ion channel genes that mediate signal transduction and ion transport are discussed here. Some plant ion channels, for example, ALMT and SLAC anion channel subunits, are unique. The majority of plant ion channel families exhibit homology to animal genes; such families include both hyperpolarization- and depolarization-activated Shaker-type potassium channels, CLC chloride transporters/channels, cyclic nucleotide-gated channels, and ionotropic glutamate receptor homologs. These plant ion channels offer unique opportunities to analyze the structural mechanisms and functions of ion channels. Here we review gene families of selected plant ion channel classes and discuss unique structure-function aspects and their physiological roles in plant cell signaling and transport.
Dhillon, Inderjit S; Marcotte, Edward M; Roshan, Usman
Clustering genes based upon their expression patterns allows us to predict gene function. Most existing clustering algorithms cluster genes together when their expression patterns show high positive correlation. However, it has been observed that genes whose expression patterns are strongly anti-correlated can also be functionally similar. Biologically, this is not unintuitive-genes responding to the same stimuli, regardless of the nature of the response, are more likely to operate in the same pathways. We present a new diametrical clustering algorithm that explicitly identifies anti-correlated clusters of genes. Our algorithm proceeds by iteratively (i). re-partitioning the genes and (ii). computing the dominant singular vector of each gene cluster; each singular vector serving as the prototype of a 'diametric' cluster. We empirically show the effectiveness of the algorithm in identifying diametrical or anti-correlated clusters. Testing the algorithm on yeast cell cycle data, fibroblast gene expression data, and DNA microarray data from yeast mutants reveals that opposed cellular pathways can be discovered with this method. We present systems whose mRNA expression patterns, and likely their functions, oppose the yeast ribosome and proteosome, along with evidence for the inverse transcriptional regulation of a number of cellular systems.
Full Text Available With the recent advances in genomics and sequencing technologies, databases of transcriptomes representing many cellular processes have been built. Meiotic transcriptomes in plants have been studied in Arabidopsis thaliana, rice (Oryza sativa, wheat (Triticum aestivum, petunia (Petunia hybrida, sunflower (Helianthus annuus, and maize (Zea mays. Studies in all organisms, but particularly in plants, indicate that a very large number of genes are expressed during meiosis, though relatively few of them seem to be required for the completion of meiosis. In this review, we focus on gene expression at the RNA level and analyze the meiotic transcriptome datasets and explore expression patterns of known meiotic genes to elucidate how gene expression could be regulated during meiosis. We also discuss mechanisms, such as chromatin organization and non-coding RNAs, that might be involved in the regulation of meiotic transcription patterns.
Weiss, Jeffrey; Hurley, Lisa A.; Harris, Rebecca M.; Finlayson, Courtney; Tong, Minghan; Fisher, Lisa A.; Moran, Jennifer L.; Beier, David R.; Mason, Christopher; Jameson, J. Larry
Genome-wide mutagenesis was performed in mice to identify candidate genes for male infertility, for which the predominant causes remain idiopathic. Mice were mutagenized using N-ethyl-N-nitrosourea (ENU), bred, and screened for phenotypes associated with the male urogenital system. Fifteen heritable lines were isolated and chromosomal loci were assigned using low density genome-wide SNP arrays. Ten of the fifteen lines were pursued further using higher resolution SNP analysis to narrow the candidate gene regions. Exon sequencing of candidate genes identified mutations in mice with cystic kidneys (Bicc1), cryptorchidism (Rxfp2), restricted germ cell deficiency (Plk4), and severe germ cell deficiency (Prdm9). In two other lines with severe hypogonadism candidate sequencing failed to identify mutations, suggesting defects in genes with previously undocumented roles in gonadal function. These genomic intervals were sequenced in their entirety and a candidate mutation was identified in SnrpE in one of the two lines. The line harboring the SnrpE variant retains substantial spermatogenesis despite small testis size, an unusual phenotype. In addition to the reproductive defects, heritable phenotypes were observed in mice with ataxia (Myo5a), tremors (Pmp22), growth retardation (unknown gene), and hydrocephalus (unknown gene). These results demonstrate that the ENU screen is an effective tool for identifying potential causes of male infertility. PMID:22258617
Rohde, Palle Duun; Edwards, Stefan McKinnon; Sarup, Pernille Merete
Identification of genes explaining variation in quantitative traits or genetic risk factors of human diseases requires both good phenotypic- and genotypic data, but also efficient statistical methods. Genome-wide association studies may reveal association between phenotypic variation and variation...... approach grouping variants accordingly to gene position, thus lowering the number of statistical tests performed and increasing the probability of identifying genes with small to moderate effects. Using this approach we identify numerous genes associated with different types of stresses in Drosophila...... melanogaster, but also identify common genes that affects the stress traits....
Wall, P. Kerr; Leebens-Mack, Jim; Müller, Kai F.; Field, Dawn; Altman, Naomi S.; dePamphilis, Claude W.
The PlantTribes database (http://fgp.huck.psu.edu/tribe.html) is a plant gene family database based on the inferred proteomes of five sequenced plant species: Arabidopsis thaliana, Carica papaya, Medicago truncatula, Oryza sativa and Populus trichocarpa. We used the graph-based clustering algorithm MCL [Van Dongen (Technical Report INS-R0010 2000) and Enright et al. (Nucleic Acids Res. 2002; 30: 1575–1584)] to classify all of these species’ protein-coding genes into putative gene families, ca...
Gao, Fang; Li, Jingyu; Zhang, Heng; Yang, Xu; An, Tiezhu
Factor-based induced reprogramming approaches have tremendous potential for human regenerative medicine, but the efficiencies of these approaches are still low. In this study, we analyzed the global transcriptional profiles of mouse induced pluripotent stem cells (miPSCs) and mouse embryonic stem cells (mESCs) from seven different labs and present here the first successful clustering according to cell type, not by lab of origin. We identified 2131 different expression genes (DEs) as candidate pluripotency-associated genes by comparing mESCs/miPSCs with somatic cells and 720 DEs between miPSCs and mESCs. Interestingly, there was a significant overlap between the two DE sets. Therefore, we defined the overlap DEs as "consensus DEs" including 313 miPSC-specific genes expressed at a higher level in miPSCs versus mESCs and 184 mESC-specific genes in total and reasoned that these may contribute to the differences in pluripotency between mESCs and miPSCs. A classification of "consensus DEs" according to their different expression levels between somatic cells and mESCs/miPSCs shows that 86% of the miPSC-specific genes are more highly expressed in somatic cells, while 73% of mESC-specific genes are highly expressed in mESCs/miPSCs, indicating that the miPSCs have not efficiently silenced the expression pattern of the somatic cells from which they are derived and failed to completely induce the genes with high expression levels in mESCs. We further revealed a strong correlation between oocyte-enriched factors and insufficiently induced mESC-specific genes and identified 11 hub genes via network analysis. In light of these findings, we postulated that these key hub genes might not only drive somatic cell nuclear transfer (SCNT) reprogramming but also augment the efficiency and quality of miPSC reprogramming.
Panwar, Vinay; Bakkeren, Guus
Cereal rust fungi are destructive pathogens, threatening grain production worldwide. Targeted breeding for resistance utilizing host resistance genes has been effective. However, breakdown of resistance occurs frequently and continued efforts are needed to understand how these fungi overcome resistance and to expand the range of available resistance genes. Whole genome sequencing, transcriptomic and proteomic studies followed by genome-wide computational and comparative analyses have identified large repertoire of genes in rust fungi among which are candidates predicted to code for pathogenicity and virulence factors. Some of these genes represent defence triggering avirulence effectors. However, functions of most genes still needs to be assessed to understand the biology of these obligate biotrophic pathogens. Since genetic manipulations such as gene deletion and genetic transformation are not yet feasible in rust fungi, performing functional gene studies is challenging. Recently, Host-induced gene silencing (HIGS) has emerged as a useful tool to characterize gene function in rust fungi while infecting and growing in host plants. We utilized Barley stripe mosaic virus-mediated virus induced gene silencing (BSMV-VIGS) to induce HIGS of candidate rust fungal genes in the wheat host to determine their role in plant-fungal interactions. Here, we describe the methods for using BSMV-VIGS in wheat for functional genomics study in cereal rust fungi.
Li, Yuanjun; Gou, Junbo; Chen, Fangfang; Li, Changfu; Zhang, Yansheng
Xanthium strumarium L. is a traditional Chinese herb belonging to the Asteraceae family. The major bioactive components of this plant are sesquiterpene lactones, which include the xanthanolides. To date, the biogenesis of xanthanolides, especiallytheir downstream pathway, remains largely unknown. In X. strumarium, xanthanolides primarily accumulate in its glandular trichomes. To identify putative gene candidates involved in the biosynthesis of xanthanolides, three X. strumarium transcriptomes...
Yu, Hong; Hatzivassiloglou, Vasileios; Rzhetsky, Andrey; Wilbur, W John
Natural language processing (NLP) techniques are used to extract information automatically from computer-readable literature. In biology, the identification of terms corresponding to biological substances (e.g., genes and proteins) is a necessary step that precedes the application of other NLP systems that extract biological information (e.g., protein-protein interactions, gene regulation events, and biochemical pathways). We have developed GPmarkup (for "gene/protein-full name mark up"), a software system that automatically identifies gene/protein terms (i.e., symbols or full names) in MEDLINE abstracts. As a part of marking up process, we also generated automatically a knowledge source of paired gene/protein symbols and full names (e.g., LARD for lymphocyte associated receptor of death) from MEDLINE. We found that many of the pairs in our knowledge source do not appear in the current GenBank database. Therefore our methods may also be used for automatic lexicon generation. GPmarkup has 73% recall and 93% precision in identifying and marking up gene/protein terms in MEDLINE abstracts. A random sample of gene/protein symbols and full names and a sample set of marked up abstracts can be viewed at http://www.cpmc.columbia.edu/homepages/yuh9001/GPmarkup/. Contact. email@example.com. Voice: 212-939-7028; fax: 212-666-0140.
Full Text Available The huge amount of gene expression data generated by microarray and next-generation sequencing technologies present challenges to exploit their biological meanings. When searching for the coexpression genes, the data mining process is largely affected by selection of algorithms. Thus, it is highly desirable to provide multiple options of algorithms in the user-friendly analytical toolkit to explore the gene expression signatures. For this purpose, we developed GESearch, an interactive graphical user interface (GUI toolkit, which is written in MATLAB and supports a variety of gene expression data files. This analytical toolkit provides four models, including the mean, the regression, the delegate, and the ensemble models, to identify the coexpression genes, and enables the users to filter data and to select gene expression patterns by browsing the display window or by importing knowledge-based genes. Subsequently, the utility of this analytical toolkit is demonstrated by analyzing two sets of real-life microarray datasets from cell-cycle experiments. Overall, we have developed an interactive GUI toolkit that allows for choosing multiple algorithms for analyzing the gene expression signatures.
Kanth, Priyanka; Bronner, Mary P.; Boucher, Kenneth M.; Burt, Randall W.; Neklason, Deborah W.; Hagedorn, Curt H.; Delker, Don A.
Sessile serrated colon adenoma/polyps (SSA/Ps) are found during routine screening colonoscopy and may account for 20–30% of colon cancers. However, differentiating SSA/Ps from hyperplastic polyps (HP) with little risk of cancer is challenging and complementary molecular markers are needed. Additionally, the molecular mechanisms of colon cancer development from SSA/Ps are poorly understood. RNA sequencing was performed on 21 SSA/Ps, 10 HPs, 10 adenomas, 21 uninvolved colon and 20 control colon specimens. Differential expression and leave-one-out cross validation methods were used to define a unique gene signature of SSA/Ps. Our SSA/P gene signature was evaluated in colon cancer RNA-Seq data from The Cancer Genome Atlas (TCGA) to identify a subtype of colon cancers that may develop from SSA/Ps. A total of 1422 differentially expressed genes were found in SSA/Ps relative to controls. Serrated polyposis syndrome (n=12) and sporadic SSA/Ps (n=9) exhibited almost complete (96%) gene overlap. A 51-gene panel in SSA/P showed similar expression in a subset of TCGA colon cancers with high microsatellite instability (MSI-H). A smaller seven-gene panel showed high sensitivity and specificity in identifying BRAF mutant, CpG island methylator phenotype high (CIMP-H) and MLH1 silenced colon cancers. We describe a unique gene signature in SSA/Ps that identifies a subset of colon cancers likely to develop through the serrated pathway. These gene panels may be utilized for improved differentiation of SSA/Ps from HPs and provide insights into novel molecular pathways altered in colon cancer arising from the serrated pathway. PMID:27026680
Background There are currently three postulated genomic subtypes of the childhood tumour neuroblastoma (NB); Type 1, Type 2A, and Type 2B. The most aggressive forms of NB are characterized by amplification of the oncogene MYCN (MNA) and low expression of the favourable marker NTRK1. Recently, mutations or high expression of the familial predisposition gene Anaplastic Lymphoma Kinase (ALK) was associated to unfavourable biology of sporadic NB. Also, various other genes have been linked to NB pathogenesis. Results The present study explores subgroup discrimination by gene expression profiling using three published microarray studies on NB (47 samples). Four distinct clusters were identified by Principal Components Analysis (PCA) in two separate data sets, which could be verified by an unsupervised hierarchical clustering in a third independent data set (101 NB samples) using a set of 74 discriminative genes. The expression signature of six NB-associated genes ALK, BIRC5, CCND1, MYCN, NTRK1, and PHOX2B, significantly discriminated the four clusters (p INSS stage 4 and/or dead of disease, p < 0.05, Fisher's exact test). Conclusions Based on expression profiling we have identified four molecular subgroups of neuroblastoma, which can be distinguished by a 6-gene signature. The fourth subgroup has not been described elsewhere, and efforts are currently made to further investigate this group's specific characteristics. PMID:21492432
Full Text Available Abstract Background There are currently three postulated genomic subtypes of the childhood tumour neuroblastoma (NB; Type 1, Type 2A, and Type 2B. The most aggressive forms of NB are characterized by amplification of the oncogene MYCN (MNA and low expression of the favourable marker NTRK1. Recently, mutations or high expression of the familial predisposition gene Anaplastic Lymphoma Kinase (ALK was associated to unfavourable biology of sporadic NB. Also, various other genes have been linked to NB pathogenesis. Results The present study explores subgroup discrimination by gene expression profiling using three published microarray studies on NB (47 samples. Four distinct clusters were identified by Principal Components Analysis (PCA in two separate data sets, which could be verified by an unsupervised hierarchical clustering in a third independent data set (101 NB samples using a set of 74 discriminative genes. The expression signature of six NB-associated genes ALK, BIRC5, CCND1, MYCN, NTRK1, and PHOX2B, significantly discriminated the four clusters (p ALK, BIRC5, and PHOX2B, and was significantly associated with higher tumour stage, poor outcome and poor survival compared to the Type 1-corresponding favourable group (INSS stage 4 and/or dead of disease, p Conclusions Based on expression profiling we have identified four molecular subgroups of neuroblastoma, which can be distinguished by a 6-gene signature. The fourth subgroup has not been described elsewhere, and efforts are currently made to further investigate this group's specific characteristics.
Pan, Qian; Peng, Jin; Zhou, Xue; Yang, Hao; Zhang, Wei
In order to screen out important genes from large gene data of gene microarray after nerve injury, we combine gene ontology (GO) method and computer pattern recognition technology to find key genes responding to nerve injury, and then verify one of these screened-out genes. Data mining and gene ontology analysis of gene chip data GSE26350 was carried out through MATLAB software. Cd44 was selected from screened-out key gene molecular spectrum by comparing genes' different GO terms and positions on score map of principal component. Function interferences were employed to influence the normal binding of Cd44 and one of its ligands, chondroitin sulfate C (CSC), to observe neurite extension. Gene ontology analysis showed that the first genes on score map (marked by red *) mainly distributed in molecular transducer activity, receptor activity, protein binding et al molecular function GO terms. Cd44 is one of six effector protein genes, and attracted us with its function diversity. After adding different reagents into the medium to interfere the normal binding of CSC and Cd44, varying-degree remissions of CSC's inhibition on neurite extension were observed. CSC can inhibit neurite extension through binding Cd44 on the neuron membrane. This verifies that important genes in given physiological processes can be identified by gene ontology analysis of gene chip data.
Cheng, Ming; An, Shoukuan; Li, Junquan
This study aimed to identify key genes associated with acute myocardial infarction (AMI) by reanalyzing microarray data. Three gene expression profile datasets GSE66360, GSE34198, and GSE48060 were downloaded from GEO database. After data preprocessing, genes without heterogeneity across different platforms were subjected to differential expression analysis between the AMI group and the control group using metaDE package. P FI) network. Then, DEGs in each module were subjected to pathway enrichment analysis using DAVID. MiRNAs and transcription factors predicted to regulate target DEGs were identified. Quantitative real-time polymerase chain reaction (RT-PCR) was applied to verify the expression of genes. A total of 913 upregulated genes and 1060 downregulated genes were identified in the AMI group. A FI network consists of 21 modules and DEGs in 12 modules were significantly enriched in pathways. The transcription factor-miRNA-gene network contains 2 transcription factors FOXO3 and MYBL2, and 2 miRNAs hsa-miR-21-5p and hsa-miR-30c-5p. RT-PCR validations showed that expression levels of FOXO3 and MYBL2 were significantly increased in AMI, and expression levels of hsa-miR-21-5p and hsa-miR-30c-5p were obviously decreased in AMI. A total of 41 DEGs, such as SOCS3, VAPA, and COL5A2, are speculated to have roles in the pathogenesis of AMI; 2 transcription factors FOXO3 and MYBL2, and 2 miRNAs hsa-miR-21-5p and hsa-miR-30c-5p may be involved in the regulation of the expression of these DEGs.
Dec 4, 2013 ... approaches could be combined in order to identify candidate genes for the genetic control of ascorbic ..... applied to other traits under the complex control of many ... Engineering increased vitamin C levels in ... Chem. Biol. 13:532–538. Giovannucci E, Rimm EB, Liu Y, Stampfer MJ, Willett WC (2002). A.
Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin
This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls) were downloaded from Gene Expression Omnibus database. Characteristic genes were identified using metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance was defined as the average gene significance of all genes used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subject to functional annotation and pathway enrichment analysis using Database for Annotation Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in totally enriched in 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs sanguinarine and papaverine were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA. © 2017 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.
Maternal genes present in mature oocytes play a crucial role in the early development of silkworm. Although maternal genes have been widely studied in many other species, there has been limited research in Bombyx mori. High-throughput next generation sequencing provides a practical method for gene discovery on a genome-wide level. Herein, a transcriptome study was used to identify maternal-related genes from silkworm eggs. Unfertilized eggs from five different stages of early development were used to detect the changing situation of gene expression. The expressed genes showed different patterns over time. Seventy-six maternal genes were annotated according to homology analysis with Drosophila melanogaster. More than half of the differentially expressed maternal genes fell into four expression patterns, while the expression patterns showed a downward trend over time. The functional annotation of these material genes was mainly related to transcription factor activity, growth factor activity, nucleic acid binding, RNA binding, ATP binding, and ion binding. Additionally, twenty-two gene clusters including maternal genes were identified from 18 scaffolds. Altogether, we plotted a profile for the maternal genes of Bombyx mori using a digital gene expression profiling method. This will provide the basis for maternal-specific signature research and improve the understanding of the early development of silkworm. PMID:29462160
Buitrago, Maria F.; Skidmore, Andrew K.; Groen, Thomas A.; Hecker, Christoph A.
Plant traits are used to define species, but also to evaluate the health status of forests, plantations and crops. Conventional methods of measuring plant traits (e.g. wet chemistry), although accurate, are inefficient and costly when applied over large areas or with intensive sampling. Spectroscopic methods, as used in the food industry and mineralogy, are nowadays applied to identify plant traits, however, most studies analysed visible to near infrared, while infrared spectra of longer wavelengths have been little used for identifying the spectral differences between plant species. This study measured the infrared spectra (1.4-16.0 μm) on individual, fresh leaves of 19 species (from herbaceous to woody species), as well as 14 leaf traits for each leaf. The results describe at which wavelengths in the infrared the leaves' spectra can differentiate most effectively between these plant species. A Quadratic Discrimination Analysis (QDA) shows that using five bands in the SWIR or the LWIR is enough to accurately differentiate these species (Kappa: 0.93, 0.94 respectively), while the MWIR has a lower classification accuracy (Kappa: 0.84). This study also shows that in the infrared spectra of fresh leaves, the identified species-specific features are correlated with leaf traits as well as changes in their values. Spectral features in the SWIR (1.66, 1.89 and 2.00 μm) are common to all species and match the main features of pure cellulose and lignin spectra. The depth of these features varies with changes of cellulose and leaf water content and can be used to differentiate species in this region. In the MWIR and LWIR, the absorption spectra of leaves are formed by key species-specific traits including lignin, cellulose, water, nitrogen and leaf thickness. The connection found in this study between leaf traits, features and spectral signatures are novel tools to assist when identifying plant species by spectroscopy and remote sensing.
Han, Y; Zheng, Q S; Wei, Y P; Chen, J; Liu, R; Wan, H J
In this study, we examined phytoene synthetase (PSY), the first key limiting enzyme in the synthesis of carotenoids and catalyzing the formation of geranylgeranyl pyrophosphate in terpenoid biosynthesis. We used known amino acid sequences of the PSY gene in tomato plants to conduct a genome-wide search and identify putative candidates in 34 sequenced plants. A total of 101 homologous genes were identified. Phylogenetic analysis revealed that PSY evolved independently in algae as well as monocotyledonous and dicotyledonous plants. Our results showed that the amino acid structures exhibited 5 motifs (motifs 1 to 5) in algae and those in higher plants were highly conserved. The PSY gene structures showed that the number of intron in algae varied widely, while the number of introns in higher plants was 4 to 5. Identification of PSY genes in plants and the analysis of the gene structure may provide a theoretical basis for studying evolutionary relationships in future analyses.
Lu, Xinguo; Lu, Jibo
Integrative analysis of molecular mechanics underlying cancer can distinguish interactions that cannot be revealed based on one kind of data for the appropriate diagnosis and treatment of cancer patients. Tumor samples exhibit heterogeneity in omics data, such as somatic mutations, Copy Number Variations CNVs), gene expression profiles and so on. In this paper we combined gene co-expression modules and mutation modulators separately in tumor patients to obtain the candidate driver genes for resistant and sensitive tumor from the heterogeneous data. The final list of modulators identified are well known in biological processes associated with ovarian cancer, such as CCL17, CACTIN, CCL16, CCL22, APOB, KDF1, CCL11, HNF1B, LRG1, MED1 and so on, which can help to facilitate the discovery of biomarkers, molecular diagnostics, and drug discovery.
Ouyang, Weiwei; An, Qiang; Zhao, Jinying; Qin, Huaizhen
In functional genomics studies, tests on mean heterogeneity have been widely employed to identify differentially expressed genes with distinct mean expression levels under different experimental conditions. Variance heterogeneity (aka, the difference between condition-specific variances) of gene expression levels is simply neglected or calibrated for as an impediment. The mean heterogeneity in the expression level of a gene reflects one aspect of its distribution alteration; and variance heterogeneity induced by condition change may reflect another aspect. Change in condition may alter both mean and some higher-order characteristics of the distributions of expression levels of susceptible genes. In this report, we put forth a conception of mean-variance differentially expressed (MVDE) genes, whose expression means and variances are sensitive to the change in experimental condition. We mathematically proved the null independence of existent mean heterogeneity tests and variance heterogeneity tests. Based on the independence, we proposed an integrative mean-variance test (IMVT) to combine gene-wise mean heterogeneity and variance heterogeneity induced by condition change. The IMVT outperformed its competitors under comprehensive simulations of normality and Laplace settings. For moderate samples, the IMVT well controlled type I error rates, and so did existent mean heterogeneity test (i.e., the Welch t test (WT), the moderated Welch t test (MWT)) and the procedure of separate tests on mean and variance heterogeneities (SMVT), but the likelihood ratio test (LRT) severely inflated type I error rates. In presence of variance heterogeneity, the IMVT appeared noticeably more powerful than all the valid mean heterogeneity tests. Application to the gene profiles of peripheral circulating B raised solid evidence of informative variance heterogeneity. After adjusting for background data structure, the IMVT replicated previous discoveries and identified novel experiment
Abstract Background There are currently three postulated genomic subtypes of the childhood tumour neuroblastoma (NB); Type 1, Type 2A, and Type 2B. The most aggressive forms of NB are characterized by amplification of the oncogene MYCN (MNA) and low expression of the favourable marker NTRK1. Recently, mutations or high expression of the familial predisposition gene Anaplastic Lymphoma Kinase (ALK) was associated to unfavourable biology of sporadic NB. Also, various other genes have been linked to NB pathogenesis. Results The present study explores subgroup discrimination by gene expression profiling using three published microarray studies on NB (47 samples). Four distinct clusters were identified by Principal Components Analysis (PCA) in two separate data sets, which could be verified by an unsupervised hierarchical clustering in a third independent data set (101 NB samples) using a set of 74 discriminative genes. The expression signature of six NB-associated genes ALK, BIRC5, CCND1, MYCN, NTRK1, and PHOX2B, significantly discriminated the four clusters (p < 0.05, one-way ANOVA test). PCA clusters p1, p2, and p3 were found to correspond well to the postulated subtypes 1, 2A, and 2B, respectively. Remarkably, a fourth novel cluster was detected in all three independent data sets. This cluster comprised mainly 11q-deleted MNA-negative tumours with low expression of ALK, BIRC5, and PHOX2B, and was significantly associated with higher tumour stage, poor outcome and poor survival compared to the Type 1-corresponding favourable group (INSS stage 4 and\\/or dead of disease, p < 0.05, Fisher\\'s exact test). Conclusions Based on expression profiling we have identified four molecular subgroups of neuroblastoma, which can be distinguished by a 6-gene signature. The fourth subgroup has not been described elsewhere, and efforts are currently made to further investigate this group\\'s specific characteristics.
Full Text Available The aluminium activated malate transporter (ALMT gene family is named after the first member of the family identified in wheat (Triticum aestivum L.. The product of this gene controls resistance to aluminium (Al toxicity. ALMT genes encode transmembrane proteins that function as anion channels and perform multiple functions involving the transport of organic anions (e.g., carboxylates and inorganic anions in cells. They share a PF11744 domain and are classified in the Fusaric acid resistance protein-like superfamily, CL0307. The proteins typically have five to seven transmembrane regions in the N-terminal half and a long hydrophillic C-terminal tail but predictions of secondary structure vary. Although widely spread in plants, relatively little information is available on the roles performed by other members of this family. In this review, we summarized functions of ALMT gene families, including Al resistance, stomatal function, mineral nutrition, microbe interactions, fruit acidity, light response and seed development.
Kim, Dong Sub; Kim, Jinbaek; Kim, Sang Hoon
In this project, we irradiated Arabidopsis plants with various doses of gamma-rays at the vegetative and reproductive stages to assess their radiation sensitivity. After the gene expression profiles and an analysis of the antioxidant response, we selected several Arabidopsis genes for uses of 'Radio marker genes (RMG)' and conducted over-expression and knock-down experiments to confirm the radio sensitivity. Based on these results, we applied two patents for the detection of two RMG (At3g28210 and At4g37990) and development of transgenic plants. Also, we developed a Genechip for use of high-throughput screening of Arabidopsis genes responding only to ionizing radiation and identified RMG to detect radiation leaks. Based on these results, we applied two patents associated with the use of Genechip for different types of radiation and different growth stages. Also, we conducted co-expression network study of specific expressed probes against gamma-ray stress and identified expressed patterns of duplicated genes formed by whole/500kb segmental genome duplication
Shen, K A; Meyers, B C; Islam-Faridi, M N; Chin, D B; Stelly, D M; Michelmore, R W
The recent cloning of genes for resistance against diverse pathogens from a variety of plants has revealed that many share conserved sequence motifs. This provides the possibility of isolating numerous additional resistance genes by polymerase chain reaction (PCR) with degenerate oligonucleotide primers. We amplified resistance gene candidates (RGCs) from lettuce with multiple combinations of primers with low degeneracy designed from motifs in the nucleotide binding sites (NBSs) of RPS2 of Arabidopsis thaliana and N of tobacco. Genomic DNA, cDNA, and bacterial artificial chromosome (BAC) clones were successfully used as templates. Four families of sequences were identified that had the same similarity to each other as to resistance genes from other species. The relationship of the amplified products to resistance genes was evaluated by several sequence and genetic criteria. The amplified products contained open reading frames with additional sequences characteristic of NBSs. Hybridization of RGCs to genomic DNA and to BAC clones revealed large numbers of related sequences. Genetic analysis demonstrated the existence of clustered multigene families for each of the four RGC sequences. This parallels classical genetic data on clustering of disease resistance genes. Two of the four families mapped to known clusters of resistance genes; these two families were therefore studied in greater detail. Additional evidence that these RGCs could be resistance genes was gained by the identification of leucine-rich repeat (LRR) regions in sequences adjoining the NBS similar to those in RPM1 and RPS2 of A. thaliana. Fluorescent in situ hybridization confirmed the clustered genomic distribution of these sequences. The use of PCR with degenerate oligonucleotide primers is therefore an efficient method to identify numerous RGCs in plants.
Full Text Available Gastric cancer is one of the most severe complex diseases with high morbidity and mortality in the world. The molecular mechanisms and risk factors for this disease are still not clear since the cancer heterogeneity caused by different genetic and environmental factors. With more and more expression data accumulated nowadays, we can perform integrative analysis for these data to understand the complexity of gastric cancer and to identify consensus players for the heterogeneous cancer. In the present work, we screened the published gene expression data and analyzed them with integrative tool, combined with pathway and gene ontology enrichment investigation. We identified several consensus differentially expressed genes and these genes were further confirmed with literature mining; at last, two genes, that is, immunoglobulin J chain and C-X-C motif chemokine ligand 17, were screened as novel gastric cancer associated genes. Experimental validation is proposed to further confirm this finding.
Kusunoki, Kazutaka; Nakano, Yuki; Tanaka, Keisuke; Sakata, Yoichi; Koyama, Hiroyuki; Kobayashi, Yuriko
Differences in the expression levels of aluminium (Al) tolerance genes are a known determinant of Al tolerance among plant varieties. We combined transcriptomic analysis of six Arabidopsis thaliana accessions with contrasting Al tolerance and a reverse genetic approach to identify Al-tolerance genes responsible for differences in Al tolerance between accession groups. Gene expression variation increased in the signal transduction process under Al stress and in growth-related processes in the absence of stress. Co-expression analysis and promoter single nucleotide polymorphism searching suggested that both trans-acting polymorphisms of Al signal transduction pathway and cis-acting polymorphisms in the promoter sequences caused the variations in gene expression associated with Al tolerance. Compared with the wild type, Al sensitivity increased in T-DNA knockout (KO) lines for five genes, including TARGET OF AVRB OPERATION1 (TAO1) and an unannotated gene (At5g22530). These were identified from 53 Al-inducible genes showing significantly higher expression in tolerant accessions than in sensitive accessions. These results indicate that the difference in transcriptional signalling is partly associated with the natural variation in Al tolerance in Arabidopsis. Our study also demonstrates the feasibility of comparative transcriptome analysis by using natural genetic variation for the identification of genes responsible for Al stress tolerance. © 2016 John Wiley & Sons Ltd.
Stimpson, Alexander; Pereira, Rhea; Kiss, John Z.; Correll, Melanie
Three experiments were performed on the International Space Station (ISS) in 2006 as part of the TROPI experiments. These experiments were performed to study graviTROPIsm and photoTROPIsm responses of Arabidopsis in microgravity (µg). Seedlings were grown with a variety of light and gravitational treatments for approximately five days. The frozen samples were returned to Earth during three space shuttle missions in 2007 and stored at -80° C. Due to the limited amount of plant biomass returned, new protocols were developed to minimize the amount of material needed for RNA extraction as a preparation for microarray analysis. Using these new protocols, RNA was extracted from several sets of seedlings grown in red light followed by blue light with one sample from 1.0g treatment and the other at µg. Using a 2-fold change criterion, microarray (Affymetrix, GeneChip) results showed that 613 genes were upregulated in the µg sample while 757 genes were downregulated. Upregulated genes in response to µg included transcription factors from the WRKY (15 genes), MYB (3) and ZF (8) families as well as those that are involved in auxin responses (10). Downregulated genes also included transcription factors such as MYB (5) and Zinc finger (10) but interestingly only two WRKY family genes were down-regulated during the µg treatment. Studies are underway to compare these results with other samples to identify the genes involved in the gravity and light signal transduction pathways (this project is Supported By: NASA NCC2-1200).
Full Text Available Abstract Background Orchids comprise one of the largest families of flowering plants and generate commercially important flowers. However, model plants, such as Arabidopsis thaliana do not contain all plant genes, and agronomic and horticulturally important genera and species must be individually studied. Results Several molecular biology tools were used to isolate flower-specific gene promoters from Oncidium 'Gower Ramsey' (Onc. GR. A cDNA library of reproductive tissues was used to construct a microarray in order to compare gene expression in flowers and leaves. Five genes were highly expressed in flower tissues, and the subcellular locations of the corresponding proteins were identified using lip transient transformation with fluorescent protein-fusion constructs. BAC clones of the 5 genes, together with 7 previously published flower- and reproductive growth-specific genes in Onc. GR, were identified for cloning of their promoter regions. Interestingly, 3 of the 5 novel flower-abundant genes were putative trypsin inhibitor (TI genes (OnTI1, OnTI2 and OnTI3, which were tandemly duplicated in the same BAC clone. Their promoters were identified using transient GUS reporter gene transformation and stable A. thaliana transformation analyses. Conclusions By combining cDNA microarray, BAC library, and bombardment assay techniques, we successfully identified flower-directed orchid genes and promoters.
Nakahara, Yoshiki; Sawabe, Shogo; Kainuma, Kenta; Katsuhara, Maki; Shibasaka, Mineo; Suzuki, Masanori; Yamamoto, Kosuke; Oguri, Suguru; Sakamoto, Hikaru
Salinity is a critical environmental factor that adversely affects crop productivity. Halophytes have evolved various mechanisms to adapt to saline environments. Salicornia europaea L. is one of the most salt-tolerant plant species. It does not have special salt-secreting structures like a salt gland or salt bladder, and is therefore a good model for studying the common mechanisms underlying plant salt tolerance. To identify candidate genes encoding key proteins in the mediation of salt tolerance in S. europaea, we performed a functional screen of a cDNA library in yeast. The library was screened for genes that allowed the yeast to grow in the presence of 1.3 M NaCl. We obtained three full-length S. europaea genes that confer salt tolerance. The genes are predicted to encode (1) a novel protein highly homologous to thaumatin-like proteins, (2) a novel coiled-coil protein of unknown function, and (3) a novel short peptide of 32 residues. Exogenous application of a synthetic peptide corresponding to the 32 residues improved salt tolerance of Arabidopsis. The approach described in this report provides a rapid assay system for large-scale screening of S. europaea genes involved in salt stress tolerance and supports the identification of genes responsible for such mechanisms. These genes may be useful candidates for improving crop salt tolerance by genetic transformation.
Full Text Available Salinity is a critical environmental factor that adversely affects crop productivity. Halophytes have evolved various mechanisms to adapt to saline environments. Salicornia europaea L. is one of the most salt-tolerant plant species. It does not have special salt-secreting structures like a salt gland or salt bladder, and is therefore a good model for studying the common mechanisms underlying plant salt tolerance. To identify candidate genes encoding key proteins in the mediation of salt tolerance in S. europaea, we performed a functional screen of a cDNA library in yeast. The library was screened for genes that allowed the yeast to grow in the presence of 1.3 M NaCl. We obtained three full-length S. europaea genes that confer salt tolerance. The genes are predicted to encode (1 a novel protein highly homologous to thaumatin-like proteins, (2 a novel coiled-coil protein of unknown function, and (3 a novel short peptide of 32 residues. Exogenous application of a synthetic peptide corresponding to the 32 residues improved salt tolerance of Arabidopsis. The approach described in this report provides a rapid assay system for large-scale screening of S. europaea genes involved in salt stress tolerance and supports the identification of genes responsible for such mechanisms. These genes may be useful candidates for improving crop salt tolerance by genetic transformation.
Forrest, Kerrie L; Bhave, Mrinal
The ubiquitous cell membrane proteins called aquaporins are now firmly established as channel proteins that control the specific transport of water molecules across cell membranes in all living organisms. The aquaporins are thus likely to be of fundamental significance to all facets of plant growth and development affected by plant-water relations. A majority of plant aquaporins have been found to share essential structural features with the human aquaporin and exhibit water-transporting ability in various functional assays, and some have been shown experimentally to be of critical importance to plant survival. Furthermore, substantial evidence is now available from a number of plant species that shows differential gene expression of aquaporins in response to abiotic stresses such as salinity, drought, or cold and clearly establishes the aquaporins as major players in the response of plants to conditions that affect water availability. This review summarizes the function and regulation of these genes to develop a greater understanding of the response of plants to water insufficiency, and particularly, to identify tolerant genotypes of major crop species including wheat and rice and plants that are important in agroforestry.
From the approximately 200,000 species of flowering plants known, only about 200 have been domesticated. The process has taken place in many regions over long periods. At present there is great interest in domesticating new species and developing new uses for existing ones in order to supply needed food, industrial raw materials, etc. It is proposed that major gene mutations were important in domestication; many key characters distinguishing cultivated from related wild species are controlled by one or very few major genes. The deliberate effort to domesticate new species requires at least the following: identification of needs and potential sources, establishment of suitable niches, choice of taxa to be domesticated, specification of the desired traits and key characters to be modified, as well as the potential role of induced mutations. (author). 14 refs
Full Text Available The role of the immune system in response to chemotherapeutic agents remains elusive. The interpatient variability observed in immune and chemotherapeutic cytotoxic responses is likely, at least in part, due to complex genetic differences. Through the use of a panel of genetically diverse mouse inbred strains, we developed a drug screening platform aimed at identifying genes underlying these chemotherapeutic cytotoxic effects on immune cells. Using genome-wide association studies (GWAS, we identified four genome-wide significant quantitative trait loci (QTL that contributed to the sensitivity of doxorubicin and idarubicin in immune cells. Of particular interest, a locus on chromosome 16 was significantly associated with cell viability following idarubicin administration (p = 5.01x10-8. Within this QTL lies App, which encodes amyloid beta precursor protein. Comparison of dose-response curves verified that T-cells in App knockout mice were more sensitive to idarubicin than those of C57BL/6J control mice (p < 0.05.In conclusion, the cellular screening approach coupled with GWAS led to the identification and subsequent validation of a gene involved in T-cell viability after idarubicin treatment. Previous studies have suggested a role for App in in vitro and in vivo cytotoxicity to anticancer agents; the overexpression of App enhances resistance, while the knockdown of this gene is deleterious to cell viability. Thus, further investigations should include performing mechanistic studies, validating additional genes from the GWAS, including Ppfia1 and Ppfibp1, and ultimately translating the findings to in vivo and human studies.
The unarmored dinoflagellate Karenia brevis is among the most prominent harmful, bloom-forming phytoplankton species in the Gulf of Mexico. During blooms, the polyketides PbTx-1 and PbTx-2 (brevetoxins) are produced by K. brevis. Brevetoxins negatively impact human health and the Gulf shellfish harvest. However, the genes underlying brevetoxin synthesis are currently unknown. Because the K. brevis genome is extremely large ( 1 × 1011 base pairs long), and with a high proportion of repetitive, non-coding DNA, it has not been sequenced. In fact, large, repetitive genomes are common among the dinoflagellate group. High-throughput RNA sequencing technology enabled us to assemble Karenia transcriptomes de novo and investigate potential genes in the brevetoxin pathway through comparative transcriptomics. The brevetoxin profile varies among K. brevis clonal cultures. For example, well-documented Wilson-CCFWC268 typically produces 8-10 pg PbTx per cell, whereas SP1 produces differences in gene expression. Of the 85,000 transcripts in the K. brevis transcriptome, 4,600 transcripts, including novel unannotated orthologs and putative polyketide synthases (PKSs), were only expressed by brevetoxin-producing K. brevis and K. papilionacea, not K. mikimotoi. Examination of gene expression between the typical- and low-toxin Wilson clones identified about 3,500 genes with significantly different expression levels, including 2 putative PKSs. One of the 2 PKSs was only found in the brevetoxin-producing Karenia species. These transcriptomes could not have been characterized without high-throughput RNA sequencing.
Büchel, Kerstin; McDowell, Eric; Nelson, Will; Descour, Anne; Gershenzon, Jonathan; Hilker, Monika; Soderlund, Carol; Gang, David R; Fenning, Trevor; Meiners, Torsten
Plants can defend themselves against herbivorous insects prior to the onset of larval feeding by responding to the eggs laid on their leaves. In the European field elm (Ulmus minor), egg laying by the elm leaf beetle ( Xanthogaleruca luteola) activates the emission of volatiles that attract specialised egg parasitoids, which in turn kill the eggs. Little is known about the transcriptional changes that insect eggs trigger in plants and how such indirect defense mechanisms are orchestrated in the context of other biological processes. Here we present the first large scale study of egg-induced changes in the transcriptional profile of a tree. Five cDNA libraries were generated from leaves of (i) untreated control elms, and elms treated with (ii) egg laying and feeding by elm leaf beetles, (iii) feeding, (iv) artificial transfer of egg clutches, and (v) methyl jasmonate. A total of 361,196 ESTs expressed sequence tags (ESTs) were identified which clustered into 52,823 unique transcripts (Unitrans) and were stored in a database with a public web interface. Among the analyzed Unitrans, 73% could be annotated by homology to known genes in the UniProt (Plant) database, particularly to those from Vitis, Ricinus, Populus and Arabidopsis. Comparative in silico analysis among the different treatments revealed differences in Gene Ontology term abundances. Defense- and stress-related gene transcripts were present in high abundance in leaves after herbivore egg laying, but transcripts involved in photosynthesis showed decreased abundance. Many pathogen-related genes and genes involved in phytohormone signaling were expressed, indicative of jasmonic acid biosynthesis and activation of jasmonic acid responsive genes. Cross-comparisons between different libraries based on expression profiles allowed the identification of genes with a potential relevance in egg-induced defenses, as well as other biological processes, including signal transduction, transport and primary metabolism
Full Text Available It has been shown that gene body DNA methylation is associated with gene expression. However, whether and how deviation of gene body DNA methylation between duplicate genes can influence their divergence remains largely unexplored. Here, we aim to elucidate the potential role of gene body DNA methylation in the fate of duplicate genes. We identified paralogous gene pairs from Arabidopsis and rice (Oryza sativa ssp. japonica genomes and reprocessed their single-base resolution methylome data. We show that methylation in paralogous genes nonlinearly correlates with several gene properties including exon number/gene length, expression level and mutation rate. Further, we demonstrated that divergence of methylation level and pattern in paralogs indeed positively correlate with their sequence and expression divergences. This result held even after controlling for other confounding factors known to influence the divergence of paralogs. We observed that methylation level divergence might be more relevant to the expression divergence of paralogs than methylation pattern divergence. Finally, we explored the mechanisms that might give rise to the divergence of gene body methylation in paralogs. We found that exonic methylation divergence more closely correlates with expression divergence than intronic methylation divergence. We show that genomic environments (e.g., flanked by transposable elements and repetitive sequences of paralogs generated by various duplication mechanisms are associated with the methylation divergence of paralogs. Overall, our results suggest that the changes in gene body DNA methylation could provide another avenue for duplicate genes to develop differential expression patterns and undergo different evolutionary fates in plant genomes.
We elucidate a recently emergent framework in unifying the two families of high temperature (high [Formula: see text]) superconductors, cuprates and iron-based superconductors. The unification suggests that the latter is simply the counterpart of the former to realize robust extended s-wave pairing symmetries in a square lattice. The unification identifies that the key ingredients (gene) of high [Formula: see text] superconductors is a quasi two dimensional electronic environment in which the d -orbitals of cations that participate in strong in-plane couplings to the p -orbitals of anions are isolated near Fermi energy. With this gene, the superexchange magnetic interactions mediated by anions could maximize their contributions to superconductivity. Creating the gene requires special arrangements between local electronic structures and crystal lattice structures. The speciality explains why high [Formula: see text] superconductors are so rare. An explicit prediction is made to realize high [Formula: see text] superconductivity in Co/Ni-based materials with a quasi two dimensional hexagonal lattice structure formed by trigonal bipyramidal complexes.
Zhu, Xinyu; Chen, Caoyi; Wang, Baohua
Plant Trx SET proteins are involved in H3K4 methylation and play a key role in plant floral development. Genes encoding Trx SET proteins constitute a multigene family in which the copy number varies among plant species and functional divergence appears to have occurred repeatedly. To investigate the evolutionary history of the Trx SET gene family, we made a comprehensive evolutionary analysis on this gene family from 13 major representatives of green plants. A novel clustering (here named as cpTrx clade), which included the III-1, III-2, and III-4 orthologous groups, previously resolved was identified. Our analysis showed that plant Trx proteins possessed a variety of domain organizations and gene structures among paralogs. Additional domains such as PHD, PWWP, and FYR were early integrated into primordial SET-PostSET domain organization of cpTrx clade. We suggested that the PostSET domain was lost in some members of III-4 orthologous group during the evolution of land plants. At least four classes of gene structures had been formed at the early evolutionary stage of land plants. Three intronless orphan Trx SET genes from the Physcomitrella patens (moss) were identified, and supposedly, their parental genes have been eliminated from the genome. The structural differences among evolutionary groups of plant Trx SET genes with different functions were described, contributing to the design of further experimental studies.
Full Text Available Non-target-site resistance (NTSR to herbicides is a worldwide concern for weed control. However, as the dominant NTSR mechanism in weeds, metabolic resistance is not yet well-characterized at the genetic level. For this study, we have identified a shortawn foxtail (Alopecurus aequalis Sobol. population displaying both TSR and NTSR to mesosulfuron-methyl and fenoxaprop-P-ethyl, yet the molecular basis for this NTSR remains unclear. To investigate the mechanisms of metabolic resistance, an RNA-Seq transcriptome analysis was used to find candidate genes that may confer metabolic resistance to the herbicide mesosulfuron-methyl in this plant population. The RNA-Seq libraries generated 831,846,736 clean reads. The de novo transcriptome assembly yielded 95,479 unigenes (averaging 944 bp in length that were assigned putative annotations. Among these, a total of 29,889 unigenes were assigned to 67 GO terms that contained three main categories, and 14,246 unigenes assigned to 32 predicted KEGG metabolic pathways. Global gene expression was measured using the reads generated from the untreated control (CK, water-only control (WCK, and mesosulfuron-methyl treatment (T of R and susceptible (S. Contigs that showed expression differences between mesosulfuron-methyl-treated R and S biotypes, and between mesosulfuron-methyl-treated, water-treated and untreated R plants were selected for further quantitative real-time PCR (qRT-PCR validation analyses. Seventeen contigs were consistently highly expressed in the resistant A. aequalis plants, including four cytochrome P450 monooxygenase (CytP450 genes, two glutathione S-transferase (GST genes, two glucosyltransferase (GT genes, two ATP-binding cassette (ABC transporter genes, and seven additional contigs with functional annotations related to oxidation, hydrolysis, and plant stress physiology. These 17 contigs could serve as major candidate genes for contributing to metabolic mesosulfuron-methyl resistance; hence
Cagliari, Alexandro; Turchetto-Zolet, Andreia Carina; Korbes, Ana Paula; Maraschin, Felipe Dos Santos; Margis, Rogerio; Margis-Pinheiro, Marcia
NF-Y is a conserved oligomeric transcription factor found in all eukaryotes. In plants, this regulator evolved with a broad diversification of the genes coding for its three subunits (NF-YA, NF-YB and NF-YC). The NF-YB members can be divided into Leafy Cotyledon1 (LEC1) and non-LEC1 types. Here we presented a comparative genomic study using phylogenetic analyses to validate an evolutionary model for the origin of LEC-type genes in plants and their emergence from non-LEC1-type genes. We identified LEC1-type members in all vascular plant genomes, but not in amoebozoa, algae, fungi, metazoa and non-vascular plant representatives, which present exclusively non-LEC1-type genes as constituents of their NF-YB subunits. The non-synonymous to synonymous nucleotide substitution rates (Ka/Ks) between LEC1 and non-LEC1-type genes indicate the presence of positive selection acting on LEC1-type members to the fixation of LEC1-specific amino acid residues. The phylogenetic analyses demonstrated that plant LEC1-type genes are evolutionary divergent from the non-LEC1-type genes of plants, fungi, amoebozoa, algae and animals. Our results point to a scenario in which LEC1-type genes have originated in vascular plants after gene expansion in plants. We suggest that processes of neofunctionalization and/or subfunctionalization were responsible for the emergence of a versatile role for LEC1-type genes in vascular plants, especially in seed plants. LEC1-type genes besides being phylogenetic divergent also present different expression profile when compared with non-LEC1-type genes. Altogether, our data provide new insights about the LEC1 and non-LEC1 evolutionary relationship during the vascular plant evolution. Copyright © 2014 Elsevier Inc. All rights reserved.
Richards, Thomas A; Soanes, Darren M; Foster, Peter G; Leonard, Guy; Thornton, Christopher R; Talbot, Nicholas J
Horizontal gene transfer (HGT) describes the transmission of genetic material across species boundaries and is an important evolutionary phenomenon in the ancestry of many microbes. The role of HGT in plant evolutionary history is, however, largely unexplored. Here, we compare the genomes of six plant species with those of 159 prokaryotic and eukaryotic species and identify 1689 genes that show the highest similarity to corresponding genes from fungi. We constructed a phylogeny for all 1689 genes identified and all homolog groups available from the rice (Oryza sativa) genome (3177 gene families) and used these to define 14 candidate plant-fungi HGT events. Comprehensive phylogenetic analyses of these 14 data sets, using methods that account for site rate heterogeneity, demonstrated support for nine HGT events, demonstrating an infrequent pattern of HGT between plants and fungi. Five HGTs were fungi-to-plant transfers and four were plant-to-fungi HGTs. None of the fungal-to-plant HGTs involved angiosperm recipients. These results alter the current view of organismal barriers to HGT, suggesting that phagotrophy, the consumption of a whole cell by another, is not necessarily a prerequisite for HGT between eukaryotes. Putative functional annotation of the HGT candidate genes suggests that two fungi-to-plant transfers have added phenotypes important for life in a soil environment. Our study suggests that genetic exchange between plants and fungi is exceedingly rare, particularly among the angiosperms, but has occurred during their evolutionary history and added important metabolic traits to plant lineages.
Full Text Available Abstract Background Plants can defend themselves against herbivorous insects prior to the onset of larval feeding by responding to the eggs laid on their leaves. In the European field elm (Ulmus minor, egg laying by the elm leaf beetle ( Xanthogaleruca luteola activates the emission of volatiles that attract specialised egg parasitoids, which in turn kill the eggs. Little is known about the transcriptional changes that insect eggs trigger in plants and how such indirect defense mechanisms are orchestrated in the context of other biological processes. Results Here we present the first large scale study of egg-induced changes in the transcriptional profile of a tree. Five cDNA libraries were generated from leaves of (i untreated control elms, and elms treated with (ii egg laying and feeding by elm leaf beetles, (iii feeding, (iv artificial transfer of egg clutches, and (v methyl jasmonate. A total of 361,196 ESTs expressed sequence tags (ESTs were identified which clustered into 52,823 unique transcripts (Unitrans and were stored in a database with a public web interface. Among the analyzed Unitrans, 73% could be annotated by homology to known genes in the UniProt (Plant database, particularly to those from Vitis, Ricinus, Populus and Arabidopsis. Comparative in silico analysis among the different treatments revealed differences in Gene Ontology term abundances. Defense- and stress-related gene transcripts were present in high abundance in leaves after herbivore egg laying, but transcripts involved in photosynthesis showed decreased abundance. Many pathogen-related genes and genes involved in phytohormone signaling were expressed, indicative of jasmonic acid biosynthesis and activation of jasmonic acid responsive genes. Cross-comparisons between different libraries based on expression profiles allowed the identification of genes with a potential relevance in egg-induced defenses, as well as other biological processes, including signal transduction
Lin Han; Chunwei Cao; Zhaotong Jia; Shiguo Liu; Zhen Liu; Ruosai Xin; Can Wang; Xinde Li; Wei Ren; Xuefeng Wang; Changgui Li
Chromosome 4q25 has been identified as a genomic region associated with gout. However, the associations of gout with the genes in this region have not yet been confirmed. Here, we performed two-stage analysis to determine whether variations in candidate genes in the 4q25 region are associated with gout in a male Chinese Han population. We first evaluated 96 tag single nucleotide polymorphisms (SNPs) in eight inflammatory/immune pathway- or glucose/lipid metabolism-related genes in the 4q25 re...
Bi, Dongbin; Ning, Hao; Liu, Shuai; Que, Xinxiang; Ding, Kejia
To explore molecular mechanisms of bladder cancer (BC), network strategy was used to find biomarkers for early detection and diagnosis. The differentially expressed genes (DEGs) between bladder carcinoma patients and normal subjects were screened using empirical Bayes method of the linear models for microarray data package. Co-expression networks were constructed by differentially co-expressed genes and links. Regulatory impact factors (RIF) metric was used to identify critical transcription factors (TFs). The protein-protein interaction (PPI) networks were constructed by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and clusters were obtained through molecular complex detection (MCODE) algorithm. Centralities analyses for complex networks were performed based on degree, stress and betweenness. Enrichment analyses were performed based on Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Co-expression networks and TFs (based on expression data of global DEGs and DEGs in different stages and grades) were identified. Hub genes of complex networks, such as UBE2C, ACTA2, FABP4, CKS2, FN1 and TOP2A, were also obtained according to analysis of degree. In gene enrichment analyses of global DEGs, cell adhesion, proteinaceous extracellular matrix and extracellular matrix structural constituent were top three GO terms. ECM-receptor interaction, focal adhesion, and cell cycle were significant pathways. Our results provide some potential underlying biomarkers of BC. However, further validation is required and deep studies are needed to elucidate the pathogenesis of BC. Copyright © 2015 Elsevier Ltd. All rights reserved.
Masle, Josette; Gilmore, Scott R; Farquhar, Graham D
Assimilation of carbon by plants incurs water costs. In the many parts of the world where water is in short supply, plant transpiration efficiency, the ratio of carbon fixation to water loss, is critical to plant survival, crop yield and vegetation dynamics. When challenged by variations in their environment, plants often seem to coordinate photosynthesis and transpiration, but significant genetic variation in transpiration efficiency has been identified both between and within species. This has allowed plant breeders to develop effective selection programmes for the improved transpiration efficiency of crops, after it was demonstrated that carbon isotopic discrimination, Delta, of plant matter was a reliable and sensitive marker negatively related to variation in transpiration efficiency. However, little is known of the genetic controls of transpiration efficiency. Here we report the isolation of a gene that regulates transpiration efficiency, ERECTA. We show that ERECTA, a putative leucine-rich repeat receptor-like kinase (LRR-RLK) known for its effects on inflorescence development, is a major contributor to a locus for Delta on Arabidopsis chromosome 2. Mechanisms include, but are not limited to, effects on stomatal density, epidermal cell expansion, mesophyll cell proliferation and cell-cell contact.
Tantong, Supaluk; Pringsulaka, Onanong; Weerawanich, Kamonwan; Meeprasert, Arthitaya; Rungrotmongkol, Thanyada; Sarnthima, Rakrudee; Roytrakul, Sittiruk; Sirikantaramas, Supaart
Defensins form an antimicrobial peptides (AMP) family, and have been widely studied in various plants because of their considerable inhibitory functions. However, their roles in rice (Oryza sativa L.) have not been characterized, even though rice is one of the most important staple crops that is susceptible to damaging infections. Additionally, a previous study identified 598 rice genes encoding cysteine-rich peptides, suggesting there are several uncharacterized AMPs in rice. We performed in silico gene expression and coexpression network analyses of all genes encoding defensin and defensin-like peptides, and determined that OsDEF7 and OsDEF8 are coexpressed with pathogen-responsive genes. Recombinant OsDEF7 and OsDEF8 could form homodimers. They inhibited the growth of the bacteria Xanthomonas oryzae pv. oryzae, X. oryzae pv. oryzicola, and Erwinia carotovora subsp. atroseptica with minimum inhibitory concentration (MIC) ranging from 0.6 to 63μg/mL. However, these OsDEFs are weakly active against the phytopathogenic fungi Helminthosporium oryzae and Fusarium oxysporum f.sp. cubense. This study describes a useful method for identifying potential plant AMPs with biological activities. Copyright © 2016 Elsevier Inc. All rights reserved.
Full Text Available Abstract Background Arsenic contamination is widespread throughout the world and this toxic metalloid is known to cause cancers of organs such as liver, kidney, skin, and lung in human. In spite of a recent surge in arsenic related studies, we are still far from a comprehensive understanding of arsenic uptake, detoxification, and sequestration in plants. Crambe abyssinica, commonly known as 'abyssinian mustard', is a non-food, high biomass oil seed crop that is naturally tolerant to heavy metals. Moreover, it accumulates significantly higher levels of arsenic as compared to other species of the Brassicaceae family. Thus, C. abyssinica has great potential to be utilized as an ideal inedible crop for phytoremediation of heavy metals and metalloids. However, the mechanism of arsenic metabolism in higher plants, including C. abyssinica, remains elusive. Results To identify the differentially expressed transcripts and the pathways involved in arsenic metabolism and detoxification, C. abyssinica plants were subjected to arsenate stress and a PCR-Select Suppression Subtraction Hybridization (SSH approach was employed. A total of 105 differentially expressed subtracted cDNAs were sequenced which were found to represent 38 genes. Those genes encode proteins functioning as antioxidants, metal transporters, reductases, enzymes involved in the protein degradation pathway, and several novel uncharacterized proteins. The transcripts corresponding to the subtracted cDNAs showed strong upregulation by arsenate stress as confirmed by the semi-quantitative RT-PCR. Conclusions Our study revealed novel insights into the plant defense mechanisms and the regulation of genes and gene networks in response to arsenate toxicity. The differential expression of transcripts encoding glutathione-S-transferases, antioxidants, sulfur metabolism, heat-shock proteins, metal transporters, and enzymes in the ubiquitination pathway of protein degradation as well as several unknown
Background Arsenic contamination is widespread throughout the world and this toxic metalloid is known to cause cancers of organs such as liver, kidney, skin, and lung in human. In spite of a recent surge in arsenic related studies, we are still far from a comprehensive understanding of arsenic uptake, detoxification, and sequestration in plants. Crambe abyssinica, commonly known as 'abyssinian mustard', is a non-food, high biomass oil seed crop that is naturally tolerant to heavy metals. Moreover, it accumulates significantly higher levels of arsenic as compared to other species of the Brassicaceae family. Thus, C. abyssinica has great potential to be utilized as an ideal inedible crop for phytoremediation of heavy metals and metalloids. However, the mechanism of arsenic metabolism in higher plants, including C. abyssinica, remains elusive. Results To identify the differentially expressed transcripts and the pathways involved in arsenic metabolism and detoxification, C. abyssinica plants were subjected to arsenate stress and a PCR-Select Suppression Subtraction Hybridization (SSH) approach was employed. A total of 105 differentially expressed subtracted cDNAs were sequenced which were found to represent 38 genes. Those genes encode proteins functioning as antioxidants, metal transporters, reductases, enzymes involved in the protein degradation pathway, and several novel uncharacterized proteins. The transcripts corresponding to the subtracted cDNAs showed strong upregulation by arsenate stress as confirmed by the semi-quantitative RT-PCR. Conclusions Our study revealed novel insights into the plant defense mechanisms and the regulation of genes and gene networks in response to arsenate toxicity. The differential expression of transcripts encoding glutathione-S-transferases, antioxidants, sulfur metabolism, heat-shock proteins, metal transporters, and enzymes in the ubiquitination pathway of protein degradation as well as several unknown novel proteins serve as
Bagger Jørgensen, Rikke; Hauser, T.P.; Mikkelsen, T.R.
The escape of engineered genes - genes inserted using recombinant DNA techniques - from cultivated plants to wild or weedy relatives has raised concern about possible risks to the environment or to health. The media have added considerably to public concern by suggesting that such gene escape...... is a new and rather unexpected phenomenon. However, transfer of engineered genes between plants is not at-all surprising, because it is mediated by exactly the same mechanisms as those responsible for transferring endogenous plant genes: it takes place by sexual crosses, with pollen as the carrier...
Full Text Available Large numbers of quantitative trait loci (QTL affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.
Large numbers of quantitative trait loci (QTL) affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs) for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.
Full Text Available In the field, plants constantly face a plethora of abiotic and biotic stresses that can impart detrimental effects on plants. In response to multiple stresses, plants can rapidly reprogram their transcriptome through a tightly regulated and highly dynamic regulatory network where WRKY transcription factors can act as activators or repressors. WRKY transcription factors have diverse biological functions in plants, but most notably are key players in plant responses to biotic and abiotic stresses. In tomato there are 83 WRKY genes identified. Here we review recent progress on functions of these tomato WRKY genes and their homologs in other plant species, such as Arabidopsis and rice, with a special focus on their involvement in responses to abiotic and biotic stresses. In particular, we highlight WRKY genes that play a role in plant responses to a combination of abiotic and biotic stresses.
Full Text Available Abstract Background The basidiomycete fungus Microbotryum violaceum is responsible for the anther-smut disease in many plants of the Caryophyllaceae family and is a model in genetics and evolutionary biology. Infection is initiated by dikaryotic hyphae produced after the conjugation of two haploid sporidia of opposite mating type. This study describes M. violaceum ESTs corresponding to nuclear genes expressed during conjugation and early hyphal production. Results A normalized cDNA library generated 24,128 sequences, which were assembled into 7,765 unique genes; 25.2% of them displayed significant similarity to annotated proteins from other organisms, 74.3% a weak similarity to the same set of known proteins, and 0.5% were orphans. We identified putative pheromone receptors and genes that in other fungi are involved in the mating process. We also identified many sequences similar to genes known to be involved in pathogenicity in other fungi. The M. violaceum EST database, MICROBASE, is available on the Web and provides access to the sequences, assembled contigs, annotations and programs to compare similarities against MICROBASE. Conclusion This study provides a basis for cloning the mating type locus, for further investigation of pathogenicity genes in the anther smut fungi, and for comparative genomics.
Full Text Available Genome-wide dissection of the heat stress response (HSR is necessary to overcome problems in crop production caused by global warming. To identify HSR genes, we profiled gene expression in two Chinese cabbage inbred lines with different thermotolerances, Chiifu and Kenshin. Many genes exhibited >2-fold changes in expression upon exposure to 0.5- 4 h at 45°C (high temperature, HT: 5.2% (2,142 genes in Chiifu and 3.7% (1,535 genes in Kenshin. The most enriched GO (Gene Ontology items included 'response to heat', 'response to reactive oxygen species (ROS', 'response to temperature stimulus', 'response to abiotic stimulus', and 'MAPKKK cascade'. In both lines, the genes most highly induced by HT encoded small heat shock proteins (Hsps and heat shock factor (Hsf-like proteins such as HsfB2A (Bra029292, whereas high-molecular weight Hsps were constitutively expressed. Other upstream HSR components were also up-regulated: ROS-scavenging genes like glutathione peroxidase 2 (BrGPX2, Bra022853, protein kinases, and phosphatases. Among heat stress (HS marker genes in Arabidopsis, only exportin 1A (XPO1A (Bra008580, Bra006382 can be applied to B. rapa for basal thermotolerance (BT and short-term acquired thermotolerance (SAT gene. CYP707A3 (Bra025083, Bra021965, which is involved in the dehydration response in Arabidopsis, was associated with membrane leakage in both lines following HS. Although many transcription factors (TF genes, including DREB2A (Bra005852, were involved in HS tolerance in both lines, Bra024224 (MYB41 and Bra021735 (a bZIP/AIR1 [Anthocyanin-Impaired-Response-1] were specific to Kenshin. Several candidate TFs involved in thermotolerance were confirmed as HSR genes by real-time PCR, and these assignments were further supported by promoter analysis. Although some of our findings are similar to those obtained using other plant species, clear differences in Brassica rapa reveal a distinct HSR in this species. Our data could also provide a
Xu, Song-Zhi; Li, Zhen-Yu; Jin, Xiao-Hua
Invasive plants have aroused attention globally for causing ecological damage and having a negative impact on the economy and human health. However, it can be extremely challenging to rapidly and accurately identify invasive plants based on morphology because they are an assemblage of many different families and many plant materials lack sufficient diagnostic characteristics during border inspections. It is therefore urgent to evaluate candidate loci and build a reliable genetic library to prevent invasive plants from entering China. In this study, five common single markers (ITS, ITS2, matK, rbcL and trnH-psbA) were evaluated using 634 species (including 469 invasive plant species in China, 10 new records to China, 16 potentially invasive plant species around the world but not introduced into China yet and 139 plant species native to China) based on three different methods. Our results indicated that ITS2 displayed largest intra- and interspecific divergence (1.72% and 91.46%). Based on NJ tree method, ITS2, ITS, matK, rbcL and trnH-psbA provided 76.84%, 76.5%, 63.21%, 52.86% and 50.68% discrimination rates, respectively. The combination of ITS + matK performed best and provided 91.03% discriminatory power, followed by ITS2 + matK (85.78%). For identifying unknown individuals, ITS + matK had 100% correct identification rate based on our database, followed by ITS/ITS2 (both 93.33%) and ITS2 + matK (91.67%). Thus, we propose ITS/ITS2 + matK as the most suitable barcode for invasive plants in China. This study also demonstrated that DNA barcoding is an efficient tool for identifying invasive species. © 2017 John Wiley & Sons Ltd.
Li, Yuanjun; Gou, Junbo; Chen, Fangfang; Li, Changfu; Zhang, Yansheng
Xanthium strumarium L. is a traditional Chinese herb belonging to the Asteraceae family. The major bioactive components of this plant are sesquiterpene lactones (STLs), which include the xanthanolides. To date, the biogenesis of xanthanolides, especially their downstream pathway, remains largely unknown. In X. strumarium, xanthanolides primarily accumulate in its glandular trichomes. To identify putative gene candidates involved in the biosynthesis of xanthanolides, three X. strumarium transcriptomes, which were derived from the young leaves of two different cultivars and the purified glandular trichomes from one of the cultivars, were constructed in this study. In total, 157 million clean reads were generated and assembled into 91,861 unigenes, of which 59,858 unigenes were successfully annotated. All the genes coding for known enzymes in the upstream pathway to the biosynthesis of xanthanolides were present in the X. strumarium transcriptomes. From a comparative analysis of the X. strumarium transcriptomes, this study identified a number of gene candidates that are putatively involved in the downstream pathway to the synthesis of xanthanolides, such as four unigenes encoding CYP71 P450s, 50 unigenes for dehydrogenases, and 27 genes for acetyltransferases. The possible functions of these four CYP71 candidates are extensively discussed. In addition, 116 transcription factors that are highly expressed in X. strumarium glandular trichomes were also identified. Their possible regulatory roles in the biosynthesis of STLs are discussed. The global transcriptomic data for X. strumarium should provide a valuable resource for further research into the biosynthesis of xanthanolides.
Full Text Available Xanthium strumarium L. is a traditional Chinese herb belonging to the Asteraceae family. The major bioactive components of this plant are sesquiterpene lactones, which include the xanthanolides. To date, the biogenesis of xanthanolides, especiallytheir downstream pathway, remains largely unknown. In X. strumarium, xanthanolides primarily accumulate in its glandular trichomes. To identify putative gene candidates involved in the biosynthesis of xanthanolides, three X. strumarium transcriptomes, which were derived from the young leaves of two different cultivars and the purified glandular trichomes from one of the cultivars, were constructed in this study. In total, 157 million clean reads were generated and assembled into 91,861 unigenes, of which 59,858 unigenes were successfully annotated. All the genes coding for known enzymes in the upstream pathway to the biosynthesis of xanthanolides were present in the X. strumarium transcriptomes. From a comparative analysis of the X. strumarium transcriptomes, this study identified a number of gene candidates that are putatively involved in the downstream pathway to the synthesis of xanthanolides, such as four unigenes encoding CYP71 P450s, 50 unigenes for dehydrogenases, and 27 genes for acetyltransferases. The possible functions of these four CYP71 candidates are extensively discussed. In addition, 116 transcription factors that were highly expressed in X. strumarium glandular trichomes were also identified. Their possible regulatory roles in the biosynthesis of sesquiterpene lactones are discussed. The global transcriptomic data for X. strumarium should provide a valuable resource for further research into the biosynthesis of xanthanolides.
Rahme, Laurence G.; Tan, Man-Wah; Le, Long; Wong, Sandy M.; Tompkins, Ronald G.; Calderwood, Stephen B.; Ausubel, Frederick M.
We used plants as an in vivo pathogenesis model for the identification of virulence factors of the human opportunistic pathogen Pseudomonas aeruginosa. Nine of nine TnphoA mutant derivatives of P. aeruginosa strain UCBPP-PA14 that were identified in a plant leaf assay for less pathogenic mutants also exhibited significantly reduced pathogenicity in a burned mouse pathogenicity model, suggesting that P. aeruginosa utilizes common strategies to infect both hosts. Seven of these nine mutants contain TnphoA insertions in previously unknown genes. These results demonstrate that an alternative nonvertebrate host of a human bacterial pathogen can be used in an in vivo high throughput screen to identify novel bacterial virulence factors involved in mammalian pathogenesis. PMID:9371831
Full Text Available Abstract Background Differential coexpression analysis (DCEA is increasingly used for investigating the global transcriptional mechanisms underlying phenotypic changes. Current DCEA methods mostly adopt a gene connectivity-based strategy to estimate differential coexpression, which is characterized by comparing the numbers of gene neighbors in different coexpression networks. Although it simplifies the calculation, this strategy mixes up the identities of different coexpression neighbors of a gene, and fails to differentiate significant differential coexpression changes from those trivial ones. Especially, the correlation-reversal is easily missed although it probably indicates remarkable biological significance. Results We developed two link-based quantitative methods, DCp and DCe, to identify differentially coexpressed genes and gene pairs (links. Bearing the uniqueness of exploiting the quantitative coexpression change of each gene pair in the coexpression networks, both methods proved to be superior to currently popular methods in simulation studies. Re-mining of a publicly available type 2 diabetes (T2D expression dataset from the perspective of differential coexpression analysis led to additional discoveries than those from differential expression analysis. Conclusions This work pointed out the critical weakness of current popular DCEA methods, and proposed two link-based DCEA algorithms that will make contribution to the development of DCEA and help extend it to a broader spectrum.
Full Text Available Background: Platycodon grandiflorum is the only species in the genus Platycodon of the family Campanulaceae, which has been traditionally used as a medicinal plant for its lung-heat-clearing, antitussive, and expectorant properties in China, Japanese and Korean. Oleanane-type triterpenoid saponins were the main chemical components of P. grandiflorum and platycodin D was the abundant and main bioactive component, but little is known about their biosynthesis in plants. Hence, P. grandiflorum is an ideal medicinal plant for studying the biosynthesis of Oleanane-type saponins. In addition, the genomic information of this important herbal plant is unavailable.Principal Findings:A total of 58,580,566 clean reads were obtained, which were assembled into 34,053 unigenes, with an average length of 936 bp and N50 of 1,661 bp by analyzing the transcriptome data of P. grandiflorum. Among these 34,053 unigenes, 22,409 unigenes (65.80% were annotated based on the information available from public databases, including Nr, NCBI, Swiss-Prot, KOG and KEGG. Furthermore, 21 candidate cytochrome P450 genes and 17 candidate UDP-glycosyltransferase genes most likely involved in triterpenoid saponins biosynthesis pathway were discovered from the transcriptome sequencing of P. grandiflorum. In addition, 10,626 SSRs were identified based on the transcriptome data, which would provide abundant candidates of molecular markers for genetic diversity and genetic map for this medicinal plant.Conclusion:The genomic data obtained from P. grandiflorum, especially the identification of putative genes involved in triterpenoid saponins biosynthesis pathway, will facilitate our understanding of the biosynthesis of triterpenoid saponins at molecular level.
Manuella Nóbrega Dourado
Full Text Available Bacteria from the genus Methylobacterium interact symbiotically (endophytically and epiphytically with different plant species. These interactions can promote plant growth or induce systemic resistance, increasing plant fitness. The plant colonization is guided by molecular communication between bacteria-bacteria and bacteria-plants, where the bacteria recognize specific exuded compounds by other bacteria (e.g. homoserine molecules and/or by the plant roots (e.g. flavonoids, ethanol and methanol, respectively. In this context, the aim of this study was to evaluate the effect of quorum sensing molecules (N-acyl-homoserine lactones and plant exudates (including ethanol in the expression of a series of bacterial genes involved in Methylobacterium-plant interaction. The selected genes are related to bacterial metabolism (mxaF, adaptation to stressful environment (crtI, phoU and sss, to interactions with plant metabolism compounds (acdS and pathogenicity (patatin and phoU. Under in vitro conditions, our results showed the differential expression of some important genes related to metabolism, stress and pathogenesis, thereby AHL molecules up-regulate all tested genes, except phoU, while plant exudates induce only mxaF gene expression. In the presence of plant exudates there is a lower bacterial density (due the endophytic and epiphytic colonization, which produce less AHL, leading to down regulation of genes when compared to the control. Therefore, bacterial density, more than plant exudate, influences the expression of genes related to plant-bacteria interaction.
Halimaa, Pauliina; Lin, Ya-Fen; Ahonen, Viivi H; Blande, Daniel; Clemens, Stephan; Gyenesei, Attila; Häikiö, Elina; Kärenlampi, Sirpa O; Laiho, Asta; Aarts, Mark G M; Pursiheimo, Juha-Pekka; Schat, Henk; Schmidt, Holger; Tuomainen, Marjo H; Tervahauta, Arja I
Populations of Noccaea caerulescens show tremendous differences in their capacity to hyperaccumulate and hypertolerate metals. To explore the differences that could contribute to these traits, we undertook SOLiD high-throughput sequencing of the root transcriptomes of three phenotypically well-characterized N. caerulescens accessions, i.e., Ganges, La Calamine, and Monte Prinzera. Genes with possible contribution to zinc, cadmium, and nickel hyperaccumulation and hypertolerance were predicted. The most significant differences between the accessions were related to metal ion (di-, trivalent inorganic cation) transmembrane transporter activity, iron and calcium ion binding, (inorganic) anion transmembrane transporter activity, and antioxidant activity. Analysis of correlation between the expression profile of each gene and the metal-related characteristics of the accessions disclosed both previously characterized (HMA4, HMA3) and new candidate genes (e.g., for nickel IRT1, ZIP10, and PDF2.3) as possible contributors to the hyperaccumulation/tolerance phenotype. A number of unknown Noccaea-specific transcripts also showed correlation with Zn(2+), Cd(2+), or Ni(2+) hyperaccumulation/tolerance. This study shows that N. caerulescens populations have evolved great diversity in the expression of metal-related genes, facilitating adaptation to various metalliferous soils. The information will be helpful in the development of improved plants for metal phytoremediation.
Jonge, de B.
Since the advent of biotechnology, plant genetic resources have become more valuable as possible sources for new products and inventions. With knowledge about the genetic make-up and functioning of a plant, biotechnologists can identify and isolate genes with interesting traits which, after long
Huang, Ling; Shi, Xinhui; Wang, Wenjia; Ryu, Kook Hui; Schiefelbein, John
The molecular genetic program for root hair development has been studied intensively in Arabidopsis ( Arabidopsis thaliana ). To understand the extent to which this program might operate in other plants, we conducted a large-scale comparative analysis of root hair development genes from diverse vascular plants, including eudicots, monocots, and a lycophyte. Combining phylogenetics and transcriptomics, we discovered conservation of a core set of root hair genes across all vascular plants, which may derive from an ancient program for unidirectional cell growth coopted for root hair development during vascular plant evolution. Interestingly, we also discovered preferential diversification in the structure and expression of root hair development genes, relative to other root hair- and root-expressed genes, among these species. These differences enabled the definition of sets of genes and gene functions that were acquired or lost in specific lineages during vascular plant evolution. In particular, we found substantial divergence in the structure and expression of genes used for root hair patterning, suggesting that the Arabidopsis transcriptional regulatory mechanism is not shared by other species. To our knowledge, this study provides the first comprehensive view of gene expression in a single plant cell type across multiple species. © 2017 American Society of Plant Biologists. All Rights Reserved.
Buck, L.; Stein, R.; Palazzolo, M.; Anderson, D. J.; Axel, R.
Nervous systems consist of diverse populations of neurons that are anatomically and functionally distinct. The diversity of neurons and the precision with which they are interconnected suggest that specific genes or sets of genes are activated in some neurons but not expressed in others. Experimentally, this problem may be considered at two levels. First, what is the total number of genes expressed in the brain, and how are they distributed among the different populations of neurons? Second, ...
Full Text Available Abstract Background Despite extensive efforts devoted to predicting protein-coding genes in genome sequences, many bona fide genes have not been found and many existing gene models are not accurate in all sequenced eukaryote genomes. This situation is partly explained by the fact that gene prediction programs have been developed based on our incomplete understanding of gene feature information such as splicing and promoter characteristics. Additionally, full-length cDNAs of many genes and their isoforms are hard to obtain due to their low level or rare expression. In order to obtain full-length sequences of all protein-coding genes, alternative approaches are required. Results In this project, we have developed a method of reconstructing full-length cDNA sequences based on short expressed sequence tags which is called sequence tag-based amplification of cDNA ends (STACE. Expressed tags are used as anchors for retrieving full-length transcripts in two rounds of PCR amplification. We have demonstrated the application of STACE in reconstructing full-length cDNA sequences using expressed tags mined in an array of serial analysis of gene expression (SAGE of C. elegans cDNA libraries. We have successfully applied STACE to recover sequence information for 12 genes, for two of which we found isoforms. STACE was used to successfully recover full-length cDNA sequences for seven of these genes. Conclusions The STACE method can be used to effectively reconstruct full-length cDNA sequences of genes that are under-represented in cDNA sequencing projects and have been missed by existing gene prediction methods, but their existence has been suggested by short sequence tags such as SAGE tags.
Han, Lin; Cao, Chunwei; Jia, Zhaotong; Liu, Shiguo; Liu, Zhen; Xin, Ruosai; Wang, Can; Li, Xinde; Ren, Wei; Wang, Xuefeng; Li, Changgui
Chromosome 4q25 has been identified as a genomic region associated with gout. However, the associations of gout with the genes in this region have not yet been confirmed. Here, we performed two-stage analysis to determine whether variations in candidate genes in the 4q25 region are associated with gout in a male Chinese Han population. We first evaluated 96 tag single nucleotide polymorphisms (SNPs) in eight inflammatory/immune pathway- or glucose/lipid metabolism-related genes in the 4q25 region in 480 male gout patients and 480 controls. The SNP rs12504538, located in the elongation of very-long-chain-fatty-acid-like family member 6 gene (Elovl6), was found to be associated with gout susceptibility (Padjusted = 0.00595). In the second stage of analysis, we performed fine mapping analysis of 93 tag SNPs in Elovl6 and in the epidermal growth factor gene (EGF) and its flanking regions in 1017 male patients gout and 1897 healthy male controls. We observed a significant association between the T allele of EGF rs2298999 and gout (odds ratio = 0.77, 95% confidence interval = 0.67–0.88, Padjusted = 6.42 × 10−3). These results provide the first evidence for an association between the EGF rs2298999 C/T polymorphism and gout. Our findings should be validated in additional populations. PMID:27506295
Han, Lin; Cao, Chunwei; Jia, Zhaotong; Liu, Shiguo; Liu, Zhen; Xin, Ruosai; Wang, Can; Li, Xinde; Ren, Wei; Wang, Xuefeng; Li, Changgui
Chromosome 4q25 has been identified as a genomic region associated with gout. However, the associations of gout with the genes in this region have not yet been confirmed. Here, we performed two-stage analysis to determine whether variations in candidate genes in the 4q25 region are associated with gout in a male Chinese Han population. We first evaluated 96 tag single nucleotide polymorphisms (SNPs) in eight inflammatory/immune pathway- or glucose/lipid metabolism-related genes in the 4q25 region in 480 male gout patients and 480 controls. The SNP rs12504538, located in the elongation of very-long-chain-fatty-acid-like family member 6 gene (Elovl6), was found to be associated with gout susceptibility (Padjusted = 0.00595). In the second stage of analysis, we performed fine mapping analysis of 93 tag SNPs in Elovl6 and in the epidermal growth factor gene (EGF) and its flanking regions in 1017 male patients gout and 1897 healthy male controls. We observed a significant association between the T allele of EGF rs2298999 and gout (odds ratio = 0.77, 95% confidence interval = 0.67-0.88, Padjusted = 6.42 × 10(-3)). These results provide the first evidence for an association between the EGF rs2298999 C/T polymorphism and gout. Our findings should be validated in additional populations.
Finet, Cédric; Floyd, Sandra K; Conway, Stephanie J; Zhong, Bojian; Scutt, Charles P; Bowman, John L
Members of the YABBY gene family of transcription factors in angiosperms have been shown to be involved in the initiation of outgrowth of the lamina, the maintenance of polarity, and establishment of the leaf margin. Although most of the dorsal-ventral polarity genes in seed plants have homologs in non-spermatophyte lineages, the presence of YABBY genes is restricted to seed plants. To gain insight into the origin and diversification of this gene family, we reconstructed the evolutionary history of YABBY gene lineages in seed plants. Our findings suggest that either one or two YABBY genes were present in the last common ancestor of extant seed plants. We also examined the expression of YABBY genes in the gymnosperms Ephedra distachya (Gnetales), Ginkgo biloba (Ginkgoales), and Pseudotsuga menziesii (Coniferales). Our data indicate that some YABBY genes are expressed in a polar (abaxial) manner in leaves and female cones in gymnosperms. We propose that YABBY genes already acted as polarity genes in the last common ancestor of extant seed plants. © 2016 Wiley Periodicals, Inc.
Bruhn, Sören; Fang, Yu; Barrenäs, Fredrik
The identification of diagnostic markers and therapeutic candidate genes in common diseases is complicated by the involvement of thousands of genes. We hypothesized that genes co-regulated with a key gene in allergy, IL13, would form a module that could help to identify candidate genes. We identi...
Dec 4, 2013 ... importance for human health and nutrition. This species has ... function to genes, proteins and metabolites is still a daunting task. Major challenges ... relation of the expression pattern of genes with the accu- mulation pattern of ..... M, Gordon JS, Rose, JKC, Martin G, Tanksley SD, Bouzayen M,. Jahn MM ...
Pan, Yufang; Li, Qiaofeng; Wang, Zhizheng; Wang, Yang; Ma, Rui; Zhu, Lili; He, Guangcun; Chen, Rongzhi
Thermosensitive genic male sterile (TGMS) lines and photoperiod-sensitive genic male sterile (PGMS) lines have been successfully used in hybridization to improve rice yields. However, the molecular mechanisms underlying male sterility transitions in most PGMS/TGMS rice lines are unclear. In the recently developed TGMS-Co27 line, the male sterility is based on co-suppression of a UDP-glucose pyrophosphorylase gene (Ugp1), but further study is needed to fully elucidate the molecular mechanisms involved. Microarray-based transcriptome profiling of TGMS-Co27 and wild-type Hejiang 19 (H1493) plants grown at high and low temperatures revealed that 15462 probe sets representing 8303 genes were differentially expressed in the two lines, under the two conditions, or both. Environmental factors strongly affected global gene expression. Some genes important for pollen development were strongly repressed in TGMS-Co27 at high temperature. More significantly, series-cluster analysis of differentially expressed genes (DEGs) between TGMS-Co27 plants grown under the two conditions showed that low temperature induced the expression of a gene cluster. This cluster was found to be essential for sterility transition. It includes many meiosis stage-related genes that are probably important for thermosensitive male sterility in TGMS-Co27, inter alia: Arg/Ser-rich domain (RS)-containing zinc finger proteins, polypyrimidine tract-binding proteins (PTBs), DEAD/DEAH box RNA helicases, ZOS (C2H2 zinc finger proteins of Oryza sativa), at least one polyadenylate-binding protein and some other RNA recognition motif (RRM) domain-containing proteins involved in post-transcriptional processes, eukaryotic initiation factor 5B (eIF5B), ribosomal proteins (L37, L1p/L10e, L27 and L24), aminoacyl-tRNA synthetases (ARSs), eukaryotic elongation factor Tu (eEF-Tu) and a peptide chain release factor protein involved in translation. The differential expression of 12 DEGs that are important for pollen
Full Text Available Understanding complex networks that modulate development in humans is hampered by genetic and phenotypic heterogeneity within and between populations. Here we present a method that exploits natural variation in highly diverse mouse genetic reference panels in which genetic and environmental factors can be tightly controlled. The aim of our study is to test a cross-species genetic mapping strategy, which compares data of gene mapping in human patients with functional data obtained by QTL mapping in recombinant inbred mouse strains in order to prioritize human disease candidate genes.We exploit evolutionary conservation of developmental phenotypes to discover gene variants that influence brain development in humans. We studied corpus callosum volume in a recombinant inbred mouse panel (C57BL/6J×DBA/2J, BXD strains using high-field strength MRI technology. We aligned mouse mapping results for this neuro-anatomical phenotype with genetic data from patients with abnormal corpus callosum (ACC development.From the 61 syndromes which involve an ACC, 51 human candidate genes have been identified. Through interval mapping, we identified a single significant QTL on mouse chromosome 7 for corpus callosum volume with a QTL peak located between 25.5 and 26.7 Mb. Comparing the genes in this mouse QTL region with those associated with human syndromes (involving ACC and those covered by copy number variations (CNV yielded a single overlap, namely HNRPU in humans and Hnrpul1 in mice. Further analysis of corpus callosum volume in BXD strains revealed that the corpus callosum was significantly larger in BXD mice with a B genotype at the Hnrpul1 locus than in BXD mice with a D genotype at Hnrpul1 (F = 22.48, p<9.87*10(-5.This approach that exploits highly diverse mouse strains provides an efficient and effective translational bridge to study the etiology of human developmental disorders, such as autism and schizophrenia.
Zhang, Yunhua; Dai, Li; Liu, Ying; Zhang, YuHang; Wang, ShaoPeng
Fruit is essential for plant reproduction and is responsible for protection and dispersal of seeds. The development and maturation of fruit is tightly regulated by numerous genetic factors that respond to environmental and internal stimulation. In this study, we attempted to identify novel fruit-related genes in a model organism, Arabidopsis thaliana, using a computational method. Based on validated fruit-related genes, the random walk with restart (RWR) algorithm was applied on a protein-protein interaction (PPI) network using these genes as seeds. The identified genes with high probabilities were filtered by the permutation test and linkage tests. In the permutation test, the genes that were selected due to the structure of the PPI network were discarded. In the linkage tests, the importance of each candidate gene was measured from two aspects: (1) its functional associations with validated genes and (2) its similarity with validated genes on gene ontology (GO) terms and KEGG pathways. Finally, 255 inferred genes were obtained, subsequent extensive analysis of important genes revealed that they mainly contribute to ubiquitination (UBQ9, UBQ8, UBQ11, UBQ10), serine hydroxymethyl transfer (SHM7, SHM5, SHM6) or glycol-metabolism (HXKL2_ARATH, CSY5, GAPCP1), suggesting essential roles during the development and maturation of fruit in Arabidopsis thaliana.
Yousaf, Sohail; Afzal, Muhammad; Reichenauer, Thomas G.; Brady, Carrie L.; Sessitsch, Angela
The genus Enterobacter comprises a range of beneficial plant-associated bacteria showing plant growth promotion. Enterobacter ludwigii belongs to the Enterobacter cloacae complex and has been reported to include human pathogens but also plant-associated strains with plant beneficial capacities. To assess the role of Enterobacter endophytes in hydrocarbon degradation, plant colonization, abundance and expression of CYP153 genes in different plant compartments, three plant species (Italian ryegrass, birdsfoot trefoil and alfalfa) were grown in sterile soil spiked with 1% diesel and inoculated with three endophytic E. ludwigii strains. Results showed that all strains were capable of hydrocarbon degradation and efficiently colonized the rhizosphere and plant interior. Two strains, ISI10-3 and BRI10-9, showed highest degradation rates of diesel fuel up to 68% and performed best in combination with Italian ryegrass and alfalfa. All strains expressed the CYP153 gene in all plant compartments, indicating an active role in degradation of diesel in association with plants. - Highlights: → E. ludwigii strains efficiently colonized plants in a non-sterile soil environment. → E. ludwigii strains efficiently expressed alkane degradation genes in plants. → E. ludwigii efficiently degraded alkane contaminations and promoted plant growth. → E. ludwigii interacted more effectively with Italian ryegrass than with other plants. → Degradation activity varied with plant and microbial genotype as well as with time. - Enterobacter ludwigii strains belonging to the E. cloacae complex are able to efficiently degrade alkanes when associated with plants and to promote plant growth.
Yousaf, Sohail [AIT Austrian Institute of Technology GmbH, Bioresources Unit, A-2444 Seibersdorf (Austria); Afzal, Muhammad [AIT Austrian Institute of Technology GmbH, Bioresources Unit, A-2444 Seibersdorf (Austria); National Institute for Biotechnology and Genetic Engineering (NIBGE), Faisalabad (Pakistan); Reichenauer, Thomas G. [AIT Austrian Institute of Technology GmbH, Environmental Resources and Technologies Unit, A-2444 Seibersdorf (Austria); Brady, Carrie L. [Forestry and Agricultural Biotechnology Institute, Department of Microbiology and Plant Pathology, University of Pretoria, Pretoria (South Africa); Sessitsch, Angela, E-mail: firstname.lastname@example.org [AIT Austrian Institute of Technology GmbH, Bioresources Unit, A-2444 Seibersdorf (Austria)
The genus Enterobacter comprises a range of beneficial plant-associated bacteria showing plant growth promotion. Enterobacter ludwigii belongs to the Enterobacter cloacae complex and has been reported to include human pathogens but also plant-associated strains with plant beneficial capacities. To assess the role of Enterobacter endophytes in hydrocarbon degradation, plant colonization, abundance and expression of CYP153 genes in different plant compartments, three plant species (Italian ryegrass, birdsfoot trefoil and alfalfa) were grown in sterile soil spiked with 1% diesel and inoculated with three endophytic E. ludwigii strains. Results showed that all strains were capable of hydrocarbon degradation and efficiently colonized the rhizosphere and plant interior. Two strains, ISI10-3 and BRI10-9, showed highest degradation rates of diesel fuel up to 68% and performed best in combination with Italian ryegrass and alfalfa. All strains expressed the CYP153 gene in all plant compartments, indicating an active role in degradation of diesel in association with plants. - Highlights: > E. ludwigii strains efficiently colonized plants in a non-sterile soil environment. > E. ludwigii strains efficiently expressed alkane degradation genes in plants. > E. ludwigii efficiently degraded alkane contaminations and promoted plant growth. > E. ludwigii interacted more effectively with Italian ryegrass than with other plants. > Degradation activity varied with plant and microbial genotype as well as with time. - Enterobacter ludwigii strains belonging to the E. cloacae complex are able to efficiently degrade alkanes when associated with plants and to promote plant growth.
Full Text Available Abstract Background Synaptotagmin genes are found in animal genomes and are known to function in the nervous system. Genes with a similar domain architecture as well as sequence similarity to synaptotagmin C2 domains have also been found in plant genomes. The plant genes share an additional region of sequence similarity with a group of animal genes named FAM62. FAM62 genes also have a similar domain architecture. Little is known about the functions of the plant genes and animal FAM62 genes. Indeed, many members of the large and diverse Syt gene family await functional characterization. Understanding the evolutionary relationships among these genes will help to realize the full implications of functional studies and lead to improved genome annotation. Results I collected and compared plant Syt-like sequences from the primary nucleotide sequence databases at NCBI. The collection comprises six groups of plant genes conserved in embryophytes: NTMC2Type1 to NTMC2Type6. I collected and compared metazoan FAM62 sequences and identified some similar sequences from other eukaryotic lineages. I found evidence of RNA editing and alternative splicing. I compared the intron patterns of Syt genes. I also compared Rabphilin and Doc2 genes. Conclusion Genes encoding proteins with N-terminal-transmembrane-C2 domain architectures resembling synaptotagmins, are widespread in eukaryotes. A collection of these genes is presented here. The collection provides a resource for studies of intron evolution. I have classified the collection into homologous gene families according to distinctive patterns of sequence conservation and intron position. The evolutionary histories of these gene families are traceable through the appearance of family members in different eukaryotic lineages. Assuming an intron-rich eukaryotic ancestor, the conserved intron patterns distinctive of individual gene families, indicate independent origins of Syt, FAM62 and NTMC2 genes. Resemblances
Broeckling, Bettina E.; Liu, Chang-Jun; Dixon, Richard A.
The invention provides enzymes that encode O-methyltransferases (OMTs) from Medicago truncatula that allow modification to plant (iso)flavonoid biosynthetic pathways. In certain aspects of the invention, the genes encoding these enzymes are provided. The invention therefore allows the modification of plants for isoflavonoid content. Transgenic plants comprising such enzymes are also provided, as well as methods for improving disease resistance in plants. Methods for producing food and nutraceuticals, and the resulting compositions, are also provided.
Full Text Available Syringa oblata Lindl. is a woody ornamental plant with high economic value and characteristics that include early flowering, multiple flower colors, and strong fragrance. Despite a long history of cultivation, the genetics and molecular biology of S. oblata are poorly understood. Transcriptome and expression profiling data are needed to identify genes and to better understand the biological mechanisms of floral pigments and scents in this species. Nine cDNA libraries were obtained from three replicates of three developmental stages: inflorescence with enlarged flower buds not protruded, inflorescence with corolla lobes not displayed, and inflorescence with flowers fully opened and emitting strong fragrance. Using the Illumina RNA-Seq technique, 319,425,972 clean reads were obtained and were assembled into 104,691 final unigenes (average length of 853 bp, 41.75% of which were annotated in the NCBI non-redundant protein database. Among the annotated unigenes, 36,967 were assigned to gene ontology categories and 19,956 were assigned to eukaryoticorthologous groups. Using the Kyoto Encyclopedia of Genes and Genomes pathway database, 12,388 unigenes were sorted into 286 pathways. Based on these transcriptomic data, we obtained a large number of candidate genes that were differentially expressed at different flower stages and that were related to floral pigment biosynthesis and fragrance metabolism. This comprehensive transcriptomic analysis provides fundamental information on the genes and pathways involved in flower secondary metabolism and development in S. oblata, providing a useful database for further research on S. oblata and other plants of genus Syringa.
Dec 5, 2011 ... Lord et al., 1998) have shed light on the influence of leptin on both the .... A weak correlation between leptin serum levels and cow body condition ... Detection of polymorphisms in the ovine leptin (LEP) gene: .... Signals that.
Abel, Frida; Dalevi, Daniel; Nethander, Maria; Jörnsten, Rebecka; De Preter, Katleen; Vermeulen, Joëlle; Stallings, Raymond; Kogner, Per; Maris, John; Nilsson, Staffan
Abstract Background There are currently three postulated genomic subtypes of the childhood tumour neuroblastoma (NB); Type 1, Type 2A, and Type 2B. The most aggressive forms of NB are characterized by amplification of the oncogene MYCN (MNA) and low expression of the favourable marker NTRK1. Recently, mutations or high expression of the familial predisposition gene Anaplastic Lymphoma Kinase (ALK) was associated to unfavourable biology of sporadic NB. Also, various other genes have been linke...
Full Text Available Argonaute protein family is the key players in pathways of gene silencing and small regulatory RNAs in different organisms. Argonaute proteins can bind small noncoding RNAs and control protein synthesis, affect messenger RNA stability, and even participate in the production of new forms of small RNAs. The aim of this study was to characterize and perform bioinformatic analysis of Argonaute proteins in 32 plant species that their genome was sequenced. A total of 437 Argonaute genes were identified and were analyzed based on lengths, gene structure, and protein structure. Results showed that Argonaute proteins were highly conserved across plant kingdom. Phylogenic analysis divided plant Argonautes into three classes. Argonaute proteins have three conserved domains PAZ, MID and PIWI. In addition to three conserved domains namely, PAZ, MID, and PIWI, we identified few more domains in AGO of some plant species. Expression profile analysis of Argonaute proteins showed that expression of these genes varies in most of tissues, which means that these proteins are involved in regulation of most pathways of the plant system. Numbers of alternative transcripts of Argonaute genes were highly variable among the plants. A thorough analysis of large number of putative Argonaute genes revealed several interesting aspects associated with this protein and brought novel information with promising usefulness for both basic and biotechnological applications.
Zhang, Luoyan; Kong, Hongzhi; Ma, Hong; Yang, Ji
Meiosis is a specialized type of cell division necessary for sexual reproduction in eukaryotes. A better understanding of the cytological procedures of meiosis has been achieved by comprehensive cytogenetic studies in plants, while the genetic mechanisms regulating meiotic progression remain incompletely understood. The increasing accumulation of complete genome sequences and large-scale gene expression datasets has provided a powerful resource for phylogenomic inference and unsupervised identification of genes involved in plant meiosis. By integrating sequence homology and expression data, 164, 131, 124 and 162 genes potentially important for meiosis were identified in the genomes of Arabidopsis thaliana, Oryza sativa, Selaginella moellendorffii and Pogonatum aloides, respectively. The predicted genes were assigned to 45 meiotic GO terms, and their functions were related to different processes occurring during meiosis in various organisms. Most of the predicted meiotic genes underwent lineage-specific duplication events during plant evolution, with about 30% of the predicted genes retaining only a single copy in higher plant genomes. The results of this study provided clues to design experiments for better functional characterization of meiotic genes in plants, promoting the phylogenomic approach to the evolutionary dynamics of the plant meiotic machineries. Copyright © 2017 Elsevier B.V. All rights reserved.
Kourelis, Jiorgos; van der Hoorn, Renier A L
Plants have many, highly variable resistance ( R ) gene loci, which provide resistance to a variety of pathogens. The first R gene to be cloned, maize ( Zea mays ) Hm1 , was published over 25 years ago, and since then, many different R genes have been identified and isolated. The encoded proteins have provided clues to the diverse molecular mechanisms underlying immunity. Here, we present a meta-analysis of 314 cloned R genes. The majority of R genes encode cell surface or intracellular receptors, and we distinguish nine molecular mechanisms by which R proteins can elevate or trigger disease resistance: direct (1) or indirect (2) perception of pathogen-derived molecules on the cell surface by receptor-like proteins and receptor-like kinases; direct (3) or indirect (4) intracellular detection of pathogen-derived molecules by nucleotide binding, leucine-rich repeat receptors, or detection through integrated domains (5); perception of transcription activator-like effectors through activation of executor genes (6); and active (7), passive (8), or host reprogramming-mediated (9) loss of susceptibility. Although the molecular mechanisms underlying the functions of R genes are only understood for a small proportion of known R genes, a clearer understanding of mechanisms is emerging and will be crucial for rational engineering and deployment of novel R genes. © 2018 American Society of Plant Biologists. All rights reserved.
Braukmann, Thomas W A; Kuzmina, Maria L; Sills, Jesse; Zakharov, Evgeny V; Hebert, Paul D N
Their relatively slow rates of molecular evolution, as well as frequent exposure to hybridization and introgression, often make it difficult to discriminate species of vascular plants with the standard barcode markers (rbcL, matK, ITS2). Previous studies have examined these constraints in narrow geographic or taxonomic contexts, but the present investigation expands analysis to consider the performance of these gene regions in discriminating the species in local floras at sites across Canada. To test identification success, we employed a DNA barcode reference library with sequence records for 96% of the 5108 vascular plant species known from Canada, but coverage varied from 94% for rbcL to 60% for ITS2 and 39% for matK. Using plant lists from 27 national parks and one scientific reserve, we tested the efficacy of DNA barcodes in identifying the plants in simulated species assemblages from six biogeographic regions of Canada using BLAST and mothur. Mean pairwise distance (MPD) and mean nearest taxon distance (MNTD) were strong predictors of barcode performance for different plant families and genera, and both metrics supported ITS2 as possessing the highest genetic diversity. All three genes performed strongly in assigning the taxa present in local floras to the correct genus with values ranging from 91% for rbcL to 97% for ITS2 and 98% for matK. However, matK delivered the highest species discrimination (~81%) followed by ITS2 (~72%) and rbcL (~44%). Despite the low number of plant taxa in the Canadian Arctic, DNA barcodes had the least success in discriminating species from this biogeographic region with resolution ranging from 36% with rbcL to 69% with matK. Species resolution was higher in the other settings, peaking in the Woodland region at 52% for rbcL and 87% for matK. Our results indicate that DNA barcoding is very effective in identifying Canadian plants to a genus, and that it performs well in discriminating species in regions where floristic diversity is
Full Text Available With advances in next-generation sequencing(NGS technologies, a large number of multiple types of high-throughput genomics data are available. A great challenge in exploring cancer progression is to identify the driver genes from the variant genes by analyzing and integrating multi-types genomics data. Breast cancer is known as a heterogeneous disease. The identification of subtype-specific driver genes is critical to guide the diagnosis, assessment of prognosis and treatment of breast cancer. We developed an integrated frame based on gene expression profiles and copy number variation (CNV data to identify breast cancer subtype-specific driver genes. In this frame, we employed statistical machine-learning method to select gene subsets and utilized an module-network analysis method to identify potential candidate driver genes. The final subtype-specific driver genes were acquired by paired-wise comparison in subtypes. To validate specificity of the driver genes, the gene expression data of these genes were applied to classify the patient samples with 10-fold cross validation and the enrichment analysis were also conducted on the identified driver genes. The experimental results show that the proposed integrative method can identify the potential driver genes and the classifier with these genes acquired better performance than with genes identified by other methods.
Guan, Xueni; Wurtele, E.S.; Nikolau, B.J.
Six biotin-containing proteins are present in plants, representing at least four different biotin enzymes. The physiological function of these biotin enzymes is not understood. Streptavidin, a protein from Streptomyces avidinii, binds tightly and specifically to biotin causing inactivation of biotin enzymes. One approach to elucidating the physiological function of biotin enzymes in plant metabolism is to create transgenic plants expressing the streptavidin gene. A plasmid containing a fused streptavidin-beta-galactosidase gene has been expressed in E. coli. We also have constructed various fusion genes that include an altered CaMV 35S promoter, signal peptides to target the streptavidin protein to specific organelles, and the streptavidin coding gene. We are examining the expression of these genes in cells of carrot
Gutiérrez, Rodrigo A; Stokes, Trevor L; Thum, Karen; Xu, Xiaodong; Obertello, Mariana; Katari, Manpreet S; Tanurdzic, Milos; Dean, Alexis; Nero, Damion C; McClung, C Robertson; Coruzzi, Gloria M
Understanding how nutrients affect gene expression will help us to understand the mechanisms controlling plant growth and development as a function of nutrient availability. Nitrate has been shown to serve as a signal for the control of gene expression in Arabidopsis. There is also evidence, on a gene-by-gene basis, that downstream products of nitrogen (N) assimilation such as glutamate (Glu) or glutamine (Gln) might serve as signals of organic N status that in turn regulate gene expression. To identify genome-wide responses to such organic N signals, Arabidopsis seedlings were transiently treated with ammonium nitrate in the presence or absence of MSX, an inhibitor of glutamine synthetase, resulting in a block of Glu/Gln synthesis. Genes that responded to organic N were identified as those whose response to ammonium nitrate treatment was blocked in the presence of MSX. We showed that some genes previously identified to be regulated by nitrate are under the control of an organic N-metabolite. Using an integrated network model of molecular interactions, we uncovered a subnetwork regulated by organic N that included CCA1 and target genes involved in N-assimilation. We validated some of the predicted interactions and showed that regulation of the master clock control gene CCA1 by Glu or a Glu-derived metabolite in turn regulates the expression of key N-assimilatory genes. Phase response curve analysis shows that distinct N-metabolites can advance or delay the CCA1 phase. Regulation of CCA1 by organic N signals may represent a novel input mechanism for N-nutrients to affect plant circadian clock function.
Tomato (Solanum lycopersicum) is one of the most important vegetables in the world with significant importance for human health and nutrition. This species has long served as model system for plant genetics, development, physiology, pathology, and fleshy fruit ripening, resulting in the accumulation of many genetic and ...
Francisco, Marta; Joseph, Bindu; Caligagan, Hart; Li, Baohua; Corwin, Jason A; Lin, Catherine; Kerwin, Rachel E; Burow, Meike; Kliebenstein, Daniel J
A key limitation in modern biology is the ability to rapidly identify genes underlying newly identified complex phenotypes. Genome wide association studies (GWAS) have become an increasingly important approach for dissecting natural variation by associating phenotypes with genotypes at a genome wide level. Recent work is showing that the Arabidopsis thaliana defense metabolite, allyl glucosinolate (GSL), may provide direct feedback regulation, linking defense metabolism outputs to the growth, and defense responses of the plant. However, there is still a need to identify genes that underlie this process. To start developing a deeper understanding of the mechanism(s) that modulate the ability of exogenous allyl GSL to alter growth and defense, we measured changes in plant biomass and defense metabolites in a collection of natural 96 A. thaliana accessions fed with 50 μM of allyl GSL. Exogenous allyl GSL was introduced exclusively to the roots and the compound transported to the leaf leading to a wide range of heritable effects upon plant biomass and endogenous GSL accumulation. Using natural variation we conducted GWAS to identify a number of new genes which potentially control allyl responses in various plant processes. This is one of the first instances in which this approach has been successfully utilized to begin dissecting a novel phenotype to the underlying molecular/polygenic basis.
Full Text Available A key limitation in modern biology is the ability to rapidly identify genes underlying newly identified complex phenotypes. Genome wide association studies (GWAS have become an increasingly important approach for dissecting natural variation by associating phenotypes with genotypes at a genome wide level. Recent work is showing that the Arabidopsis thaliana defense metabolite, allyl glucosinolate (GSL, may provide direct feedback regulation, linking defense metabolism outputs to the growth and defense responses of the plant. However, there is still a need to identify genes that underlie this process. To start developing a deeper understanding of the mechanism(s that modulate the ability of exogenous allyl GSL to alter growth and defense, we measured changes in plant biomass and defense metabolites in a collection of natural 96 A. thaliana accessions fed with 50 µM of allyl GSL. Exogenous allyl GSL was introduced exclusively to the roots and the compound transported to the leaf leading to a wide range of heritable effects upon plant biomass and endogenous GSL accumulation. Using natural variation we conducted GWAS to identify a number of new genes which potentially control allyl responses in various plant processes. This is one of the first instances in which this approach has been successfully utilized to begin dissecting a novel phenotype to the underlying molecular/polygenic basis.
Mar 19, 2007 ... Localizing genes using linkage disequilibrium in plants: integrating lessons ... reduce that association as a function of the marker distance from the QTL. ..... the gene locus enhanced the resolution power of asso- ciation tests .... agents, such as insects, birds, water and wind, so mating is determined by a ...
Taneera, Jalal; Lang, Stefan; Sharma, Amitabh
Close to 50 genetic loci have been associated with type 2 diabetes (T2D), but they explain only 15% of the heritability. In an attempt to identify additional T2D genes, we analyzed global gene expression in human islets from 63 donors. Using 48 genes located near T2D risk variants, we identified ...
Durfee, Tim [Madison, WI; Feiler, Heidi [Albany, CA; Gruissem, Wilhelm [Forch, CH; Jenkins, Susan [Martinez, CA; Roe, Judith [Manhattan, KS; Zambryski, Patricia [Berkeley, CA
This invention provides methods and compositions for altering the growth, organization, and differentiation of plant tissues. The invention is based on the discovery that, in plants, genetically altering the levels of Retinoblastoma-related gene (RRB) activity produces dramatic effects on the growth, proliferation, organization, and differentiation of plant meristem.
Methods were developed to monitor persistence of genomic DNA in decaying plants in the field. As a model, we used recombinant neomycin phosphotransferase II (rNPT-II) marker genes present in genetically engineered plants. Polymerase chain reaction (PCR) primers were designed, com...
Tran Lan T
Full Text Available Abstract Background Plant polyphenol oxidases (PPOs are enzymes that typically use molecular oxygen to oxidize ortho-diphenols to ortho-quinones. These commonly cause browning reactions following tissue damage, and may be important in plant defense. Some PPOs function as hydroxylases or in cross-linking reactions, but in most plants their physiological roles are not known. To better understand the importance of PPOs in the plant kingdom, we surveyed PPO gene families in 25 sequenced genomes from chlorophytes, bryophytes, lycophytes, and flowering plants. The PPO genes were then analyzed in silico for gene structure, phylogenetic relationships, and targeting signals. Results Many previously uncharacterized PPO genes were uncovered. The moss, Physcomitrella patens, contained 13 PPO genes and Selaginella moellendorffii (spike moss and Glycine max (soybean each had 11 genes. Populus trichocarpa (poplar contained a highly diversified gene family with 11 PPO genes, but several flowering plants had only a single PPO gene. By contrast, no PPO-like sequences were identified in several chlorophyte (green algae genomes or Arabidopsis (A. lyrata and A. thaliana. We found that many PPOs contained one or two introns often near the 3’ terminus. Furthermore, N-terminal amino acid sequence analysis using ChloroP and TargetP 1.1 predicted that several putative PPOs are synthesized via the secretory pathway, a unique finding as most PPOs are predicted to be chloroplast proteins. Phylogenetic reconstruction of these sequences revealed that large PPO gene repertoires in some species are mostly a consequence of independent bursts of gene duplication, while the lineage leading to Arabidopsis must have lost all PPO genes. Conclusion Our survey identified PPOs in gene families of varying sizes in all land plants except in the genus Arabidopsis. While we found variation in intron numbers and positions, overall PPO gene structure is congruent with the phylogenetic
Jesús Quiroz Chávez
Full Text Available Plant molecular improvement by recombinant DNA technology represents an advantage to obtain new varieties or traits. This technique is promised for genetic improvement of crop plants. Lines with increased yield, quality, disease resistance, or tolerant to abiotic stress have been obtained, with clear advantages for producers, marketers and consumers. However, they have several limitations in its application to agriculture because of its risk and hazards. The aim of the document is to show the advantages and disadvantages of GM crop plant, to develop represent an opportunity to have new exotic traits.
Binder, A; Soyano, T; Hayashi, H
to nodule primordia formation, and the infection thread initiation in the root hairs guiding bacteria towards dividing cortical cells. This chapter focuses on the plant genes involved in the recognition of the symbiotic signal produced by rhizobia, and the downstream genes, which are part of a complex...... symbiotic signalling pathway that leads to the generation of calcium spiking in the nuclear regions and activation of transcription factors controlling symbiotic genes induction...
Mukherjee, Krishanu; Brocchieri, Luciano; B?rglin, Thomas R.
The full complement of homeobox transcription factor sequences, including genes and pseudogenes, was determined from the analysis of 10 complete genomes from flowering plants, moss, Selaginella, unicellular green algae, and red algae. Our exhaustive genome-wide searches resulted in the discovery in each class of a greater number of homeobox genes than previously reported. All homeobox genes can be unambiguously classified by sequence evolutionary analysis into 14 distinct classes also charact...
Wullschleger, Stan D; Difazio, Stephen P
Microarrays have become an important technology for the global analysis of gene expression in humans, animals, plants, and microbes. Implemented in the context of a well-designed experiment, cDNA and oligonucleotide arrays can provide highthroughput, simultaneous analysis of transcript abundance for hundreds, if not thousands, of genes. However, despite widespread acceptance, the use of microarrays as a tool to better understand processes of interest to the plant physiologist is still being explored. To help illustrate current uses of microarrays in the plant sciences, several case studies that we believe demonstrate the emerging application of gene expression arrays in plant physiology were selected from among the many posters and presentations at the 2003 Plant and Animal Genome XI Conference. Based on this survey, microarrays are being used to assess gene expression in plants exposed to the experimental manipulation of air temperature, soil water content and aluminium concentration in the root zone. Analysis often includes characterizing transcript profiles for multiple post-treatment sampling periods and categorizing genes with common patterns of response using hierarchical clustering techniques. In addition, microarrays are also providing insights into developmental changes in gene expression associated with fibre and root elongation in cotton and maize, respectively. Technical and analytical limitations of microarrays are discussed and projects attempting to advance areas of microarray design and data analysis are highlighted. Finally, although much work remains, we conclude that microarrays are a valuable tool for the plant physiologist interested in the characterization and identification of individual genes and gene families with potential application in the fields of agriculture, horticulture and forestry.
Stephen P. Difazio
Full Text Available Microarrays have become an important technology for the global analysis of gene expression in humans, animals, plants, and microbes. Implemented in the context of a well-designed experiment, cDNA and oligonucleotide arrays can provide highthroughput, simultaneous analysis of transcript abundance for hundreds, if not thousands, of genes. However, despite widespread acceptance, the use of microarrays as a tool to better understand processes of interest to the plant physiologist is still being explored. To help illustrate current uses of microarrays in the plant sciences, several case studies that we believe demonstrate the emerging application of gene expression arrays in plant physiology were selected from among the many posters and presentations at the 2003 Plant and Animal Genome XI Conference. Based on this survey, microarrays are being used to assess gene expression in plants exposed to the experimental manipulation of air temperature, soil water content and aluminium concentration in the root zone. Analysis often includes characterizing transcript profiles for multiple post-treatment sampling periods and categorizing genes with common patterns of response using hierarchical clustering techniques. In addition, microarrays are also providing insights into developmental changes in gene expression associated with fibre and root elongation in cotton and maize, respectively. Technical and analytical limitations of microarrays are discussed and projects attempting to advance areas of microarray design and data analysis are highlighted. Finally, although much work remains, we conclude that microarrays are a valuable tool for the plant physiologist interested in the characterization and identification of individual genes and gene families with potential application in the fields of agriculture, horticulture and forestry.
Gramzow, Lydia; Weilandt, Lisa; Theißen, Günter
MADS-box genes comprise a gene family coding for transcription factors. This gene family expanded greatly during land plant evolution such that the number of MADS-box genes ranges from one or two in green algae to around 100 in angiosperms. Given the crucial functions of MADS-box genes for nearly all aspects of plant development, the expansion of this gene family probably contributed to the increasing complexity of plants. However, the expansion of MADS-box genes during one important step of land plant evolution, namely the origin of seed plants, remains poorly understood due to the previous lack of whole-genome data for gymnosperms. The newly available genome sequences of Picea abies, Picea glauca and Pinus taeda were used to identify the complete set of MADS-box genes in these conifers. In addition, MADS-box genes were identified in the growing number of transcriptomes available for gymnosperms. With these datasets, phylogenies were constructed to determine the ancestral set of MADS-box genes of seed plants and to infer the ancestral functions of these genes. Type I MADS-box genes are under-represented in gymnosperms and only a minimum of two Type I MADS-box genes have been present in the most recent common ancestor (MRCA) of seed plants. In contrast, a large number of Type II MADS-box genes were found in gymnosperms. The MRCA of extant seed plants probably possessed at least 11-14 Type II MADS-box genes. In gymnosperms two duplications of Type II MADS-box genes were found, such that the MRCA of extant gymnosperms had at least 14-16 Type II MADS-box genes. The implied ancestral set of MADS-box genes for seed plants shows simplicity for Type I MADS-box genes and remarkable complexity for Type II MADS-box genes in terms of phylogeny and putative functions. The analysis of transcriptome data reveals that gymnosperm MADS-box genes are expressed in a great variety of tissues, indicating diverse roles of MADS-box genes for the development of gymnosperms. This study is
Since 1976, Industrial Assessment Centers (IACs) administered by the U.S. Department of Energy have supported small and medium-sized American manufacturers to reduce their energy use and improve their productivity and competitiveness. DOE is now offering up to 50 assessments per year at no cost to industrial or municipal water and wastewater plants.
Since 1976, Industrial Assessment Centers (IACs) administered by the U.S. Department of Energy have supported small and medium-sized American manufacturers to reduce their energy use and improve their productivity and competitiveness. DOE is now offering up to 50 assessments per year at no cost to industrial or municipal water and wastewater plants.
Since 1976, Industrial Assessment Centers (IACs) administered by the U.S. Department of Energy have supported small and medium-sized American manufacturers to reduce their energy use and improve their productivity and competitiveness. DOE is now offering up to 50 assessments per year at no cost to industrial or municipal water and wastewater plants.
Ana Lúcia Anversa Segatto
Full Text Available Abstract Developmental genes are believed to contribute to major changes during plant evolution, from infrageneric to higher levels. Due to their putative high sequence conservation, developmental genes are rarely used as molecular markers, and few studies including these sequences at low taxonomic levels exist. WUSCHEL-related homeobox genes (WOX are transcription factors exclusively present in plants and are involved in developmental processes. In this study, we characterized the infrageneric genetic variation of Petunia WOX genes. We obtained phylogenetic relationships consistent with other phylogenies based on nuclear markers, but with higher statistical support, resolution in terminals, and compatibility with flower morphological changes.
Waaijenborg, S.; Zwinderman, A.H.
ABSTRACT: BACKGROUND: We generalized penalized canonical correlation analysis for analyzing microarray gene-expression measurements for checking completeness of known metabolic pathways and identifying candidate genes for incorporation in the pathway. We used Wold's method for calculation of the
Background The glycosylation process, catalyzed by ubiquitous glycosyltransferase (GT) family enzymes, is a prevalent modification of plant secondary metabolites that regulates various functions such as hormone homeostasis, detoxification of xenobiotics and biosynthesis and storage of secondary metabolites. Flax (Linum usitatissimum L.) is a commercially grown oilseed crop, important because of its essential fatty acids and health promoting lignans. Identification and characterization of UDP glycosyltransferase (UGT) genes from flax could provide valuable basic information about this important gene family and help to explain the seed specific glycosylated metabolite accumulation and other processes in plants. Plant genome sequencing projects are useful to discover complexity within this gene family and also pave way for the development of functional genomics approaches. Results Taking advantage of the newly assembled draft genome sequence of flax, we identified 137 UDP glycosyltransferase (UGT) genes from flax using a conserved signature motif. Phylogenetic analysis of these protein sequences clustered them into 14 major groups (A-N). Expression patterns of these genes were investigated using publicly available expressed sequence tag (EST), microarray data and reverse transcription quantitative real time PCR (RT-qPCR). Seventy-three per cent of these genes (100 out of 137) showed expression evidence in 15 tissues examined and indicated varied expression profiles. The RT-qPCR results of 10 selected genes were also coherent with the digital expression analysis. Interestingly, five duplicated UGT genes were identified, which showed differential expression in various tissues. Of the seven intron loss/gain positions detected, two intron positions were conserved among most of the UGTs, although a clear relationship about the evolution of these genes could not be established. Comparison of the flax UGTs with orthologs from four other sequenced dicot genomes indicated that
Barvkar Vitthal T
Full Text Available Abstract Background The glycosylation process, catalyzed by ubiquitous glycosyltransferase (GT family enzymes, is a prevalent modification of plant secondary metabolites that regulates various functions such as hormone homeostasis, detoxification of xenobiotics and biosynthesis and storage of secondary metabolites. Flax (Linum usitatissimum L. is a commercially grown oilseed crop, important because of its essential fatty acids and health promoting lignans. Identification and characterization of UDP glycosyltransferase (UGT genes from flax could provide valuable basic information about this important gene family and help to explain the seed specific glycosylated metabolite accumulation and other processes in plants. Plant genome sequencing projects are useful to discover complexity within this gene family and also pave way for the development of functional genomics approaches. Results Taking advantage of the newly assembled draft genome sequence of flax, we identified 137 UDP glycosyltransferase (UGT genes from flax using a conserved signature motif. Phylogenetic analysis of these protein sequences clustered them into 14 major groups (A-N. Expression patterns of these genes were investigated using publicly available expressed sequence tag (EST, microarray data and reverse transcription quantitative real time PCR (RT-qPCR. Seventy-three per cent of these genes (100 out of 137 showed expression evidence in 15 tissues examined and indicated varied expression profiles. The RT-qPCR results of 10 selected genes were also coherent with the digital expression analysis. Interestingly, five duplicated UGT genes were identified, which showed differential expression in various tissues. Of the seven intron loss/gain positions detected, two intron positions were conserved among most of the UGTs, although a clear relationship about the evolution of these genes could not be established. Comparison of the flax UGTs with orthologs from four other sequenced dicot
Aaron E Walworth
Full Text Available In order to identify genetic components in flowering pathways of highbush blueberry (Vaccinium corymbosum L., a transcriptome reference composed of 254,396 transcripts and 179,853 gene contigs was developed by assembly of 72.7 million reads using Trinity. Using this transcriptome reference and a query of flowering pathway genes of herbaceous plants, we identified potential flowering pathway genes/transcripts of blueberry. Transcriptome analysis of flowering pathway genes was then conducted on leaf tissue samples of transgenic blueberry cv. Aurora ('VcFT-Aurora', which overexpresses a blueberry FLOWERING LOCUS T-like gene (VcFT. Sixty-one blueberry transcripts of 40 genes showed high similarities to 33 known flowering-related genes of herbaceous plants, of which 17 down-regulated and 16 up-regulated genes were identified in 'VcFT-Aurora'. All down-regulated genes encoded transcription factors/enzymes upstream in the signaling pathway containing VcFT. A blueberry CONSTANS-LIKE 5-like (VcCOL5 gene was down-regulated and associated with five other differentially expressed (DE genes in the photoperiod-mediated flowering pathway. Three down-regulated genes, i.e., a MADS-AFFECTING FLOWERING 2-like gene (VcMAF2, a MADS-AFFECTING FLOWERING 5-like gene (VcMAF5, and a VERNALIZATION1-like gene (VcVRN1, may function as integrators in place of FLOWERING LOCUS C (FLC in the vernalization pathway. Because no CONSTAN1-like or FLOWERING LOCUS C-like genes were found in blueberry, VcCOL5 and VcMAF2/VcMAF5 or VRN1 might be the major integrator(s in the photoperiod- and vernalization-mediated flowering pathway, respectively. The major down-stream genes of VcFT, i.e., SUPPRESSOR of Overexpression of Constans 1-like (VcSOC1, LEAFY-like (VcLFY, APETALA1-like (VcAP1, CAULIFLOWER 1-like (VcCAL1, and FRUITFULL-like (VcFUL genes were present and showed high similarity to their orthologues in herbaceous plants. Moreover, overexpression of VcFT promoted expression of all of
Walworth, Aaron E.; Chai, Benli; Song, Guo-qing
In order to identify genetic components in flowering pathways of highbush blueberry (Vaccinium corymbosum L.), a transcriptome reference composed of 254,396 transcripts and 179,853 gene contigs was developed by assembly of 72.7 million reads using Trinity. Using this transcriptome reference and a query of flowering pathway genes of herbaceous plants, we identified potential flowering pathway genes/transcripts of blueberry. Transcriptome analysis of flowering pathway genes was then conducted on leaf tissue samples of transgenic blueberry cv. Aurora (‘VcFT-Aurora’), which overexpresses a blueberry FLOWERING LOCUS T-like gene (VcFT). Sixty-one blueberry transcripts of 40 genes showed high similarities to 33 known flowering-related genes of herbaceous plants, of which 17 down-regulated and 16 up-regulated genes were identified in ‘VcFT-Aurora’. All down-regulated genes encoded transcription factors/enzymes upstream in the signaling pathway containing VcFT. A blueberry CONSTANS-LIKE 5-like (VcCOL5) gene was down-regulated and associated with five other differentially expressed (DE) genes in the photoperiod-mediated flowering pathway. Three down-regulated genes, i.e., a MADS-AFFECTING FLOWERING 2-like gene (VcMAF2), a MADS-AFFECTING FLOWERING 5-like gene (VcMAF5), and a VERNALIZATION1-like gene (VcVRN1), may function as integrators in place of FLOWERING LOCUS C (FLC) in the vernalization pathway. Because no CONSTAN1-like or FLOWERING LOCUS C-like genes were found in blueberry, VcCOL5 and VcMAF2/VcMAF5 or VRN1 might be the major integrator(s) in the photoperiod- and vernalization-mediated flowering pathway, respectively. The major down-stream genes of VcFT, i.e., SUPPRESSOR of Overexpression of Constans 1-like (VcSOC1), LEAFY-like (VcLFY), APETALA1-like (VcAP1), CAULIFLOWER 1-like (VcCAL1), and FRUITFULL-like (VcFUL) genes were present and showed high similarity to their orthologues in herbaceous plants. Moreover, overexpression of VcFT promoted expression of all
Walworth, Aaron E; Chai, Benli; Song, Guo-Qing
In order to identify genetic components in flowering pathways of highbush blueberry (Vaccinium corymbosum L.), a transcriptome reference composed of 254,396 transcripts and 179,853 gene contigs was developed by assembly of 72.7 million reads using Trinity. Using this transcriptome reference and a query of flowering pathway genes of herbaceous plants, we identified potential flowering pathway genes/transcripts of blueberry. Transcriptome analysis of flowering pathway genes was then conducted on leaf tissue samples of transgenic blueberry cv. Aurora ('VcFT-Aurora'), which overexpresses a blueberry FLOWERING LOCUS T-like gene (VcFT). Sixty-one blueberry transcripts of 40 genes showed high similarities to 33 known flowering-related genes of herbaceous plants, of which 17 down-regulated and 16 up-regulated genes were identified in 'VcFT-Aurora'. All down-regulated genes encoded transcription factors/enzymes upstream in the signaling pathway containing VcFT. A blueberry CONSTANS-LIKE 5-like (VcCOL5) gene was down-regulated and associated with five other differentially expressed (DE) genes in the photoperiod-mediated flowering pathway. Three down-regulated genes, i.e., a MADS-AFFECTING FLOWERING 2-like gene (VcMAF2), a MADS-AFFECTING FLOWERING 5-like gene (VcMAF5), and a VERNALIZATION1-like gene (VcVRN1), may function as integrators in place of FLOWERING LOCUS C (FLC) in the vernalization pathway. Because no CONSTAN1-like or FLOWERING LOCUS C-like genes were found in blueberry, VcCOL5 and VcMAF2/VcMAF5 or VRN1 might be the major integrator(s) in the photoperiod- and vernalization-mediated flowering pathway, respectively. The major down-stream genes of VcFT, i.e., SUPPRESSOR of Overexpression of Constans 1-like (VcSOC1), LEAFY-like (VcLFY), APETALA1-like (VcAP1), CAULIFLOWER 1-like (VcCAL1), and FRUITFULL-like (VcFUL) genes were present and showed high similarity to their orthologues in herbaceous plants. Moreover, overexpression of VcFT promoted expression of all of these
Asthmatic individuals have been identified as a susceptible subpopulation for air pollutants. However, asthma represents a syndrome with multiple probable etiologies, and the identification of these asthma endotypes is critical to accurately define the most susceptible subpopula...
Cava, Claudia; Bertoli, Gloria; Colaprico, Antonio; Olsen, Catharina; Bontempi, Gianluca; Castiglioni, Isabella
Modern high-throughput genomic technologies represent a comprehensive hallmark of molecular changes in pan-cancer studies. Although different cancer gene signatures have been revealed, the mechanism of tumourigenesis has yet to be completely understood. Pathways and networks are important tools to explain the role of genes in functional genomic studies. However, few methods consider the functional non-equal roles of genes in pathways and the complex gene-gene interactions in a network. We present a novel method in pan-cancer analysis that identifies de-regulated genes with a functional role by integrating pathway and network data. A pan-cancer analysis of 7158 tumour/normal samples from 16 cancer types identified 895 genes with a central role in pathways and de-regulated in cancer. Comparing our approach with 15 current tools that identify cancer driver genes, we found that 35.6% of the 895 genes identified by our method have been found as cancer driver genes with at least 2/15 tools. Finally, we applied a machine learning algorithm on 16 independent GEO cancer datasets to validate the diagnostic role of cancer driver genes for each cancer. We obtained a list of the top-ten cancer driver genes for each cancer considered in this study. Our analysis 1) confirmed that there are several known cancer driver genes in common among different types of cancer, 2) highlighted that cancer driver genes are able to regulate crucial pathways.
Meagher, Richard B [Athens, GA; Balish, Rebecca S [Oxford, OH; Tehryung, Kim [Athens, GA; McKinney, Elizabeth C [Athens, GA
Plant tissue specific gene expression by way of repressor-operator complexes, has enabled outcomes including, without limitation, male sterility and engineered plants having root-specific gene expression of relevant proteins to clean environmental pollutants from soil and water. A mercury hyperaccumulation strategy requires that mercuric ion reductase coding sequence is strongly expressed. The actin promoter vector, A2pot, engineered to contain bacterial lac operator sequences, directed strong expression in all plant vegetative organs and tissues. In contrast, the expression from the A2pot construct was restricted primarily to root tissues when a modified bacterial repressor (LacIn) was coexpressed from the light-regulated rubisco small subunit promoter in above-ground tissues. Also provided are analogous repressor operator complexes for selective expression in other plant tissues, for example, to produce male sterile plants.
Johnson, Toby; Gaunt, Tom R.; Newhouse, Stephen J.; Padmanabhan, Sandosh; Tomaszewski, Maciej; Kumari, Meena; Morris, Richard W.; Tzoulaki, Ioanna; O'Brien, Eoin T.; Poulter, Neil R.; Sever, Peter; Shields, Denis C.; Thom, Simon; Wannamethee, Sasiwarang G.; Whincup, Peter H.; Brown, Morris J.; Connell, John M.; Dobson, Richard J.; Howard, Philip J.; Mein, Charles A.; Onipinla, Abiodun; Shaw-Hawkins, Sue; Zhang, Yun; Smith, George Davey; Day, Ian N. M.; Lawlor, Debbie A.; Goodall, Alison H.; Fowkes, F. Gerald; Abecasis, Goncalo R.; Elliott, Paul; Gateva, Vesela; Braund, Peter S.; Burton, Paul R.; Nelson, Christopher P.; Tobin, Martin D.; van der Harst, Pim; Glorioso, Nicola; Neuvrith, Hani; Salvi, Erika; Staessen, Jan A.; Stucchi, Andrea; Devos, Nabila; Jeunemaitre, Xavier; Plouin, Pierre-Francois; Tichet, Jean; Juhanson, Peeter; Org, Elin; Westra, Harm-Jan; Wolfs, Marcel G. M.; Franke, Lude
Raised blood pressure (BP) is a major risk factor for cardiovascular disease. Previous studies have identified 47 distinct genetic variants robustly associated with BP, but collectively these explain only a few percent of the heritability for BP phenotypes. To find additional BP loci, we used a
Nepal, Madhav P; Andersen, Ethan J; Neupane, Surendra; Benson, Benjamin V
Disease resistance genes (R genes), as part of the plant defense system, have coevolved with corresponding pathogen molecules. The main objectives of this project were to identify non-Toll interleukin receptor, nucleotide-binding site, leucine-rich repeat (nTNL) genes and elucidate their evolutionary divergence across six plant genomes. Using reference sequences from Arabidopsis , we investigated nTNL orthologs in the genomes of common bean, Medicago , soybean, poplar, and rice. We used Hidden Markov Models for sequence identification, performed model-based phylogenetic analyses, visualized chromosomal positioning, inferred gene clustering, and assessed gene expression profiles. We analyzed 908 nTNL R genes in the genomes of the six plant species, and classified them into 12 subgroups based on the presence of coiled-coil (CC), nucleotide binding site (NBS), leucine rich repeat (LRR), resistance to Powdery mildew 8 (RPW8), and BED type zinc finger domains. Traditionally classified CC-NBS-LRR (CNL) genes were nested into four clades (CNL A-D) often with abundant, well-supported homogeneous subclades of Type-II R genes. CNL-D members were absent in rice, indicating a unique R gene retention pattern in the rice genome. Genomes from Arabidopsis , common bean, poplar and soybean had one chromosome without any CNL R genes. Medicago and Arabidopsis had the highest and lowest number of gene clusters, respectively. Gene expression analyses suggested unique patterns of expression for each of the CNL clades. Differential gene expression patterns of the nTNL genes were often found to correlate with number of introns and GC content, suggesting structural and functional divergence.
Zambon Alexander C
Full Text Available Abstract Background The completion of several genome projects showed that most genes have not yet been characterized, especially in multicellular organisms. Although most genes have unknown functions, a large collection of data is available describing their transcriptional activities under many different experimental conditions. In many cases, the coregulatation of a set of genes across a set of conditions can be used to infer roles for genes of unknown function. Results We developed a search engine, the Multiple-Species Gene Recommender (MSGR, which scans gene expression datasets from multiple organisms to identify genes that participate in a genetic pathway. The MSGR takes a query consisting of a list of genes that function together in a genetic pathway from one of six organisms: Homo sapiens, Drosophila melanogaster, Caenorhabditis elegans, Saccharomyces cerevisiae, Arabidopsis thaliana, and Helicobacter pylori. Using a probabilistic method to merge searches, the MSGR identifies genes that are significantly coregulated with the query genes in one or more of those organisms. The MSGR achieves its highest accuracy for many human pathways when searches are combined across species. We describe specific examples in which new genes were identified to be involved in a neuromuscular signaling pathway and a cell-adhesion pathway. Conclusion The search engine can scan large collections of gene expression data for new genes that are significantly coregulated with a pathway of interest. By integrating searches across organisms, the MSGR can identify pathway members whose coregulation is either ancient or newly evolved.
Allman, Elizabeth S; Degnan, James H; Rhodes, John A
Gene trees are evolutionary trees representing the ancestry of genes sampled from multiple populations. Species trees represent populations of individuals-each with many genes-splitting into new populations or species. The coalescent process, which models ancestry of gene copies within populations, is often used to model the probability distribution of gene trees given a fixed species tree. This multispecies coalescent model provides a framework for phylogeneticists to infer species trees from gene trees using maximum likelihood or Bayesian approaches. Because the coalescent models a branching process over time, all trees are typically assumed to be rooted in this setting. Often, however, gene trees inferred by traditional phylogenetic methods are unrooted. We investigate probabilities of unrooted gene trees under the multispecies coalescent model. We show that when there are four species with one gene sampled per species, the distribution of unrooted gene tree topologies identifies the unrooted species tree topology and some, but not all, information in the species tree edges (branch lengths). The location of the root on the species tree is not identifiable in this situation. However, for 5 or more species with one gene sampled per species, we show that the distribution of unrooted gene tree topologies identifies the rooted species tree topology and all its internal branch lengths. The length of any pendant branch leading to a leaf of the species tree is also identifiable for any species from which more than one gene is sampled.
Takeda, Haruna; Rust, Alistair G; Ward, Jerrold M; Yew, Christopher Chin Kuan; Jenkins, Nancy A; Copeland, Neal G
Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4(+/-) mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC.
Guo, Yuan; Qiu, Caisheng; Long, Songhua; Chen, Ping; Hao, Dongmei; Preisner, Marta; Wang, Hui; Wang, Yufu
To better understand the molecular mechanisms and gene expression characteristics associated with development of bast fiber cell within flax stem phloem, the gene expression profiling of flax stem peels and leaves were screened, using Illumina's Digital Gene Expression (DGE) analysis. Four DGE libraries (2 for stem peel and 2 for leaf), ranging from 6.7 to 9.2 million clean reads were obtained, which produced 7.0 million and 6.8 million mapped reads for flax stem peel and leave, respectively. By differential gene expression analysis, a total of 975 genes, of which 708 (73%) genes have protein-coding annotation, were identified as phloem enriched genes putatively involved in the processes of polysaccharide and cell wall metabolism. Differential expression genes (DEGs) was validated using quantitative RT-PCR, the expression pattern of all nine genes determined by qRT-PCR fitted in well with that obtained by sequencing analysis. Cluster and Gene Ontology (GO) analysis revealed that a large number of genes related to metabolic process, catalytic activity and binding category were expressed predominantly in the stem peels. The Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis of the phloem enriched genes suggested approximately 111 biological pathways. The large number of genes and pathways produced from DGE sequencing will expand our understanding of the complex molecular and cellular events in flax bast fiber development and provide a foundation for future studies on fiber development in other bast fiber crops. Copyright © 2017 Elsevier B.V. All rights reserved.
Plants have many, highly variable resistance (R) gene loci, which provide resistance to a variety of pathogens. The first R gene to be cloned, maize (Zea mays) Hm1, was published over 25 years ago, and since then, many different R genes have been identified and isolated. The encoded proteins have provided clues to the diverse molecular mechanisms underlying immunity. Here, we present a meta-analysis of 314 cloned R genes. The majority of R genes encode cell surface or intracellular receptors, and we distinguish nine molecular mechanisms by which R proteins can elevate or trigger disease resistance: direct (1) or indirect (2) perception of pathogen-derived molecules on the cell surface by receptor-like proteins and receptor-like kinases; direct (3) or indirect (4) intracellular detection of pathogen-derived molecules by nucleotide binding, leucine-rich repeat receptors, or detection through integrated domains (5); perception of transcription activator-like effectors through activation of executor genes (6); and active (7), passive (8), or host reprogramming-mediated (9) loss of susceptibility. Although the molecular mechanisms underlying the functions of R genes are only understood for a small proportion of known R genes, a clearer understanding of mechanisms is emerging and will be crucial for rational engineering and deployment of novel R genes. PMID:29382771
Full Text Available Santalum album (sandalwood is one of the economically important plant species in the Santalaceae for its production of highly valued perfume oils. Sandalwood is also a hemiparasitic tree that obtains some of its water and simple nutrients by tapping into other plants through haustoria which are highly specialized organs in parasitic angiosperms. However, an understanding of the molecular mechanisms involved in haustorium development is limited. In this study, RNA sequencing (RNA-seq analyses were performed to identify changes in gene expression and metabolic pathways associated with the development of the S. album haustorium. A total of 56,011 non-redundant contigs with a mean contig size of 618 bp were obtained by de novo assembly of the transcriptome of haustoria and non-haustorial seedling roots. A substantial number of the identified differentially expressed genes were involved in cell wall metabolism and protein metabolism, as well as mitochondrial electron transport functions. Phytohormone-mediated regulation might play an important role during haustorial development. Especially, auxin signaling is likely to be essential for haustorial initiation, and genes related to cytokinin and gibberellin biosynthesis and metabolism are involved in haustorial development. Our results suggest that genes encoding nodulin-like proteins may be important for haustorial morphogenesis in S. album. The obtained sequence data will become a rich resource for future research in this interesting species. This information improves our understanding of haustorium development in root hemiparasitic species and will allow further exploration of the detailed molecular mechanisms underlying plant parasitism.
Novak, Rachel L; Harper, David P; Caudell, David; Slape, Christopher; Beachy, Sarah H; Aplan, Peter D
NUP98-HOXD13 (NHD13) and CALM-AF10 (CA10) are oncogenic fusion proteins produced by recurrent chromosomal translocations in patients with acute myeloid leukemia (AML). Transgenic mice that express these fusions develop AML with a long latency and incomplete penetrance, suggesting that collaborating genetic events are required for leukemic transformation. We employed genetic techniques to identify both preleukemic abnormalities in healthy transgenic mice as well as collaborating events leading to leukemic transformation. Candidate gene resequencing revealed that 6 of 27 (22%) CA10 AMLs spontaneously acquired a Ras pathway mutation and 8 of 27 (30%) acquired an Flt3 mutation. Two CA10 AMLs acquired an Flt3 internal-tandem duplication, demonstrating that these mutations can be acquired in murine as well as human AML. Gene expression profiles revealed a marked upregulation of Hox genes, particularly Hoxa5, Hoxa9, and Hoxa10 in both NHD13 and CA10 mice. Furthermore, mir196b, which is embedded within the Hoxa locus, was overexpressed in both CA10 and NHD13 samples. In contrast, the Hox cofactors Meis1 and Pbx3 were differentially expressed; Meis1 was increased in CA10 AMLs but not NHD13 AMLs, whereas Pbx3 was consistently increased in NHD13 but not CA10 AMLs. Silencing of Pbx3 in NHD13 cells led to decreased proliferation, increased apoptosis, and decreased colony formation in vitro, suggesting a previously unexpected role for Pbx3 in leukemic transformation. Published by Elsevier Inc.
Weile, Christian; Gardner, Paul P; Hedegaard, Mads M
neuroblastoma cell line SK-N-AS. Using this strategy, we identify thousands of human candidate RNA genes. To further verify the expression of these genes, we focused on candidate genes that had a stable hairpin structures or a high level of covariance. Using northern blotting, we verify the expression of 2 out...
Stam, Remco; Scheikl, Daniela; Tellier, Aurélien
Nod-like receptors (NLRs) are nucleotide-binding domain and leucine-rich repeats containing proteins that are important in plant resistance signaling. Many of the known pathogen resistance (R) genes in plants are NLRs and they can recognize pathogen molecules directly or indirectly. As such, divergence and copy number variants at these genes are found to be high between species. Within populations, positive and balancing selection are to be expected if plants coevolve with their pathogens. In order to understand the complexity of R-gene coevolution in wild nonmodel species, it is necessary to identify the full range of NLRs and infer their evolutionary history. Here we investigate and reveal polymorphism occurring at 220 NLR genes within one population of the partially selfing wild tomato species Solanum pennellii. We use a combination of enrichment sequencing and pooling ten individuals, to specifically sequence NLR genes in a resource and cost-effective manner. We focus on the effects which different mapping and single nucleotide polymorphism calling software and settings have on calling polymorphisms in customized pooled samples. Our results are accurately verified using Sanger sequencing of polymorphic gene fragments. Our results indicate that some NLRs, namely 13 out of 220, have maintained polymorphism within our S. pennellii population. These genes show a wide range of πN/πS ratios and differing site frequency spectra. We compare our observed rate of heterozygosity with expectations for this selfing and bottlenecked population. We conclude that our method enables us to pinpoint NLR genes which have experienced natural selection in their habitat. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Stam, Remco; Scheikl, Daniela; Tellier, Aurélien
Nod-like receptors (NLRs) are nucleotide-binding domain and leucine-rich repeats containing proteins that are important in plant resistance signaling. Many of the known pathogen resistance (R) genes in plants are NLRs and they can recognize pathogen molecules directly or indirectly. As such, divergence and copy number variants at these genes are found to be high between species. Within populations, positive and balancing selection are to be expected if plants coevolve with their pathogens. In order to understand the complexity of R-gene coevolution in wild nonmodel species, it is necessary to identify the full range of NLRs and infer their evolutionary history. Here we investigate and reveal polymorphism occurring at 220 NLR genes within one population of the partially selfing wild tomato species Solanum pennellii. We use a combination of enrichment sequencing and pooling ten individuals, to specifically sequence NLR genes in a resource and cost-effective manner. We focus on the effects which different mapping and single nucleotide polymorphism calling software and settings have on calling polymorphisms in customized pooled samples. Our results are accurately verified using Sanger sequencing of polymorphic gene fragments. Our results indicate that some NLRs, namely 13 out of 220, have maintained polymorphism within our S. pennellii population. These genes show a wide range of πN/πS ratios and differing site frequency spectra. We compare our observed rate of heterozygosity with expectations for this selfing and bottlenecked population. We conclude that our method enables us to pinpoint NLR genes which have experienced natural selection in their habitat. PMID:27189991
Hoshi, Ayaka; Oshima, Kenro; Kakizawa, Shigeyuki; Ishii, Yoshiko; Ozeki, Johji; Hashimoto, Masayoshi; Komatsu, Ken; Kagiwada, Satoshi; Yamaji, Yasuyuki; Namba, Shigetou
One of the most important themes in agricultural science is the identification of virulence factors involved in plant disease. Here, we show that a single virulence factor, tengu-su inducer (TENGU), induces witches' broom and dwarfism and is a small secreted protein of the plant-pathogenic bacterium, phytoplasma. When tengu was expressed in Nicotiana benthamiana plants, these plants showed symptoms of witches' broom and dwarfism, which are typical of phytoplasma infection. Transgenic Arabidopsis thaliana lines expressing tengu exhibited similar symptoms, confirming the effects of tengu expression on plants. Although the localization of phytoplasma was restricted to the phloem, TENGU protein was detected in apical buds by immunohistochemical analysis, suggesting that TENGU was transported from the phloem to other cells. Microarray analyses showed that auxin-responsive genes were significantly down-regulated in the tengu-transgenic plants compared with GUS-transgenic control plants. These results suggest that TENGU inhibits auxin-related pathways, thereby affecting plant development. PMID:19329488
Full Text Available Shikimate kinase (SK; EC 22.214.171.124 catalyzes the fifth reaction of the shikimate pathway, which directs carbon from the central metabolism pool to a broad range of secondary metabolites involved in plant development, growth, and stress responses. In this study, we demonstrate the role of plant SK gene duplicate evolution in the diversification of metabolic regulation and the acquisition of novel and physiologically essential function. Phylogenetic analysis of plant SK homologs resolves an orthologous cluster of plant SKs and two functionally distinct orthologous clusters. These previously undescribed genes, shikimate kinase-like 1 (SKL1 and -2 (SKL2, do not encode SK activity, are present in all major plant lineages, and apparently evolved under positive selection following SK gene duplication over 400 MYA. This is supported by functional assays using recombinant SK, SKL1, and SKL2 from Arabidopsis thaliana (At and evolutionary analyses of the diversification of SK-catalytic and -substrate binding sites based on theoretical structure models. AtSKL1 mutants yield albino and novel variegated phenotypes, which indicate SKL1 is required for chloroplast biogenesis. Extant SKL2 sequences show a strong genetic signature of positive selection, which is enriched in a protein-protein interaction module not found in other SK homologs. We also report the first kinetic characterization of plant SKs and show that gene expression diversification among the AtSK inparalogs is correlated with developmental processes and stress responses. This study examines the functional diversification of ancient and recent plant SK gene duplicates and highlights the utility of SKs as scaffolds for functional innovation.
Kim, Jeongwoo; Kim, Hyunjin; Yoon, Youngmi; Park, Sanghyun
Since the genome project in 1990s, a number of studies associated with genes have been conducted and researchers have confirmed that genes are involved in disease. For this reason, the identification of the relationships between diseases and genes is important in biology. We propose a method called LGscore, which identifies disease-related genes using Google data and literature data. To implement this method, first, we construct a disease-related gene network using text-mining results. We then extract gene-gene interactions based on co-occurrences in abstract data obtained from PubMed, and calculate the weights of edges in the gene network by means of Z-scoring. The weights contain two values: the frequency and the Google search results. The frequency value is extracted from literature data, and the Google search result is obtained using Google. We assign a score to each gene through a network analysis. We assume that genes with a large number of links and numerous Google search results and frequency values are more likely to be involved in disease. For validation, we investigated the top 20 inferred genes for five different diseases using answer sets. The answer sets comprised six databases that contain information on disease-gene relationships. We identified a significant number of disease-related genes as well as candidate genes for Alzheimer's disease, diabetes, colon cancer, lung cancer, and prostate cancer. Our method was up to 40% more accurate than existing methods. Copyright © 2015 Elsevier Inc. All rights reserved.
Benedito, V.A.; Visser, P.B.; Angenent, G.C.; Krens, F.A.
Virus-induced gene silencing (VIGS) has been shown to be of great potential in plant reverse genetics. Advantages of VIGS over other approaches, such as T-DNA or transposon tagging, include the circumvention of plant transformation, methodological simplicity and robustness, and speedy results. These
Full Text Available Autism spectrum disorder (ASD is marked by a strong genetic heterogeneity, which is underlined by the low overlap between ASD risk gene lists proposed in different studies. In this context, molecular networks can be used to analyze the results of several genome-wide studies in order to underline those network regions harboring genetic variations associated with ASD, the so-called “disease modules.” In this work, we used a recent network diffusion-based approach to jointly analyze multiple ASD risk gene lists. We defined genome-scale prioritizations of human genes in relation to ASD genes from multiple studies, found significantly connected gene modules associated with ASD and predicted genes functionally related to ASD risk genes. Most of them play a role in synapsis and neuronal development and function; many are related to syndromes that can be in comorbidity with ASD and the remaining are involved in epigenetics, cell cycle, cell adhesion and cancer.
Full Text Available The analysis of gene expression data has shown that transcriptionally coordinated (co-expressed genes are often functionally related, enabling scientists to use expression data in gene function prediction. This Focused Review discusses our original paper (Large-scale co-expression approach to dissect secondary cell wall formation across plant species, Frontiers in Plant Science 2:23. In this paper we applied cross-species analysis to co-expression networks of genes involved in cellulose biosynthesis. We show that the co-expression networks from different species are highly similar, indicating that whole biological pathways are conserved across species. This finding has two important implications. First, the analysis can transfer gene function annotation from well-studied plants, such as Arabidopsis, to other, uncharacterized plant species. As the analysis finds genes that have similar sequence and similar expression pattern across different organisms, functionally equivalent genes can be identified. Second, since co-expression analyses are often noisy, a comparative analysis should have higher performance, as parts of co-expression networks that are conserved are more likely to be functionally relevant. In this Focused Review, we outline the comparative analysis done in the original paper and comment on the recent advances and approaches that allow comparative analyses of co-function networks. We hypothesize that, in comparison to simple co-expression analysis, comparative analysis would yield more accurate gene function predictions. Finally, by combining comparative analysis with genomic information of green plants, we propose a possible composition of cellulose biosynthesis machinery during earlier stages of plant evolution.
Golomb, Benjamin L.
Lactic acid bacteria have been isolated from living, harvested, and fermented plant materials; however, the adaptations these bacteria possess for growth on plant tissues are largely unknown. In this study, we investigated plant habitat-specific traits of Lactococcus lactis during growth in an Arabidopsis thaliana leaf tissue lysate (ATL). L. lactis KF147, a strain originally isolated from plants, exhibited a higher growth rate and reached 7.9-fold-greater cell densities during growth in ATL than the dairy-associated strain L. lactis IL1403. Transcriptome profiling (RNA-seq) of KF147 identified 853 induced and 264 repressed genes during growth in ATL compared to that in GM17 laboratory culture medium. Genes induced in ATL included those involved in the arginine deiminase pathway and a total of 140 carbohydrate transport and metabolism genes, many of which are involved in xylose, arabinose, cellobiose, and hemicellulose metabolism. The induction of those genes corresponded with L. lactis KF147 nutrient consumption and production of metabolic end products in ATL as measured by gas chromatography-time of flight mass spectrometry (GC-TOF/MS) untargeted metabolomic profiling. To assess the importance of specific plant-inducible genes for L. lactis growth in ATL, xylose metabolism was targeted for gene knockout mutagenesis. Wild-type L. lactis strain KF147 but not an xylA deletion mutant was able to grow using xylose as the sole carbon source. However, both strains grew to similarly high levels in ATL, indicating redundancy in L. lactis carbohydrate metabolism on plant tissues. These findings show that certain strains of L. lactis are well adapted for growth on plants and possess specific traits relevant for plant-based food, fuel, and feed fermentations. PMID:25384484
Wang, Li; Zhu, Chen; Jin, Lin; Xiao, Aihua; Duan, Jie; Ma, Luyi
Kalanchoe (K.) daigremontiana is important for studying asexual reproduction under different environmental conditions. Here, we describe a novel KdNOVEL41 (KdN41) gene that may confer drought resistance and could thereby affect K. daigremontiana development. The detected subcellular localization of a KdN41/Yellow Fluorescent Protein (YFP) fusion protein was in the nucleus and cell membrane. Drought, salt, and heat stress treatment in tobacco plants containing the KdN41 gene promoter driving β-glucuronidase (GUS) gene transcription revealed that only drought stress triggered strong GUS staining in the vascular tissues. Overexpression (OE) of the KdN41 gene conferred improved drought resistance in tobacco plants compared to wild-type and transformed with empty vector plants by inducing higher antioxidant enzyme activities, decreasing cell membrane damage, increasing abscisic acid (ABA) content, causing reinforced drought resistance related gene expression profiles. The 3,3'-diaminobenzidine (DAB) and nitroblue tetrazolium (NBT) staining results also showed less relative oxygen species (ROS) content in KdN41-overexpressing tobacco leaf during drought stress. Surprisingly, by re-watering after drought stress, KdN41-overexpressing tobacco showed earlier flowering. Overall, the KdN41 gene plays roles in ROS scavenging and osmotic damage reduction to improve tobacco drought resistance, which may increase our understanding of the molecular network involved in developmental manipulation under drought stress in K. daigremontiana.
Full Text Available Abstract Background Being sessile organisms, plants should adjust their metabolism to dynamic changes in their environment. Such adjustments need particular coordination in branched metabolic networks in which a given metabolite can be converted into multiple other metabolites via different enzymatic chains. In the present report, we developed a novel "Gene Coordination" bioinformatics approach and use it to elucidate adjustable transcriptional interactions of two branched amino acid metabolic networks in plants in response to environmental stresses, using publicly available microarray results. Results Using our "Gene Coordination" approach, we have identified in Arabidopsis plants two oppositely regulated groups of "highly coordinated" genes within the branched Asp-family network of Arabidopsis plants, which metabolizes the amino acids Lys, Met, Thr, Ile and Gly, as well as a single group of "highly coordinated" genes within the branched aromatic amino acid metabolic network, which metabolizes the amino acids Trp, Phe and Tyr. These genes possess highly coordinated adjustable negative and positive expression responses to various stress cues, which apparently regulate adjustable metabolic shifts between competing branches of these networks. We also provide evidence implying that these highly coordinated genes are central to impose intra- and inter-network interactions between the Asp-family and aromatic amino acid metabolic networks as well as differential system interactions with other growth promoting and stress-associated genome-wide genes. Conclusion Our novel Gene Coordination elucidates that branched amino acid metabolic networks in plants are regulated by specific groups of highly coordinated genes that possess adjustable intra-network, inter-network and genome-wide transcriptional interactions. We also hypothesize that such transcriptional interactions enable regulatory metabolic adjustments needed for adaptation to the stresses.
Danchin, Etienne G J; Perfus-Barbeoch, Laetitia; Rancurel, Corinne; Thorpe, Peter; Da Rocha, Martine; Bajew, Simon; Neilson, Roy; Guzeeva, Elena Sokolova; Da Silva, Corinne; Guy, Julie; Labadie, Karine; Esmenjaud, Daniel; Helder, Johannes; Jones, John T; den Akker, Sebastian Eves-van
Nematodes have evolved the ability to parasitize plants on at least four independent occasions, with plant parasites present in Clades 1, 2, 10 and 12 of the phylum. In the case of Clades 10 and 12, horizontal gene transfer of plant cell wall degrading enzymes from bacteria and fungi has been implicated in the evolution of plant parasitism. We have used ribonucleic acid sequencing (RNAseq) to generate reference transcriptomes for two economically important nematode species, Xiphinema index and Longidorus elongatus , representative of two genera within the early-branching Clade 2 of the phylum Nematoda. We used a transcriptome-wide analysis to identify putative horizontal gene transfer events. This represents the first in-depth transcriptome analysis from any plant-parasitic nematode of this clade. For each species, we assembled ~30 million Illumina reads into a reference transcriptome. We identified 62 and 104 transcripts, from X. index and L. elongatus , respectively, that were putatively acquired via horizontal gene transfer. By cross-referencing horizontal gene transfer prediction with a phylum-wide analysis of Pfam domains, we identified Clade 2-specific events. Of these, a GH12 cellulase from X. index was analysed phylogenetically and biochemically, revealing a likely bacterial origin and canonical enzymatic function. Horizontal gene transfer was previously shown to be a phenomenon that has contributed to the evolution of plant parasitism among nematodes. Our findings underline the importance and the extensiveness of this phenomenon in the evolution of plant-parasitic life styles in this speciose and widespread animal phylum.
Full Text Available Biodiversity protection and preservation of genetic variability is based on the fact that plant varieties are irreplaceable in production process and that they are more and more jeopardized by urban and industrial development. The most common way of preserving and at the same time the safest way is a storage in a gene bank. Prior to storage comes collecting, studying and replanting for Institute Gene Bank, Central State Gene Bank and for Regional Gene Banks. Institute for Vegetable Crops in Smederevska Palanka preserves a wide variety of vegetable germplasm. This is, so called, work collection, used as a gene resource for breeding purposes. Seed samples are stored at 4±2°C and 50% relative humidity. At the moment, the collection has 2265 samples. Almost all samples have the passport data, but only 10% of samples have been further characterized and evaluated.
Schori, M.; Showalter, A.M.
DNA barcoding involves the generation of DNA sequencing data from particular genetic regions in an organism and the use of these sequence data to identify or 'barcode' that organism and distinguish it from other species. Here, DNA barcoding is being used to identify several medicinal plants found in Pakistan and distinguished them from other similar species. Several challenges to the successful implementation of plant DNA barcoding are presented and discussed. Despite these challenges, DNA barcoding has the potential to uniquely identify medicinal plants and provide quality control and standardization of the plant material supplied to the pharmaceutical industry. (author)
Rodekohr, Sherie; Harris, Clark Richard
This handbook on identifying and selecting landscape plants can be used as a reference in landscaping courses or on an individual basis. The first of two sections, Identifying Plants for the Landscape, contains the following tables: shade tree identification; flowering tree identification; evergreen tree identification; flowering shrub…
Full Text Available Gene co-expression has been widely used to hypothesize gene function through guilt-by association. However, it is not clear to what degree co-expression is informative, whether it can be applied to genes involved in different biological processes, and how the type of dataset impacts inferences about gene functions. Here our goal is to assess the utility and limitations of using co-expression as a criterion to recover functional associations between genes. By determining the percentage of gene pairs in a metabolic pathway with significant expression correlation, we found that many genes in the same pathway do not have similar transcript profiles and the choice of dataset, annotation quality, gene function, expression similarity measure, and clustering approach significantly impacts the ability to recover functional associations between genes using Arabidopsis thaliana as an example. Some datasets are more informative in capturing coordinated expression profiles and larger data sets are not always better. In addition, to recover the maximum number of known pathways and identify candidate genes with similar functions, it is important to explore rather exhaustively multiple dataset combinations, similarity measures, clustering algorithms and parameters. Finally, we validated the biological relevance of co-expression cluster memberships with an independent phenomics dataset and found that genes that consistently cluster with leucine degradation genes tend to have similar leucine levels in mutants. This study provides a framework for obtaining gene functional associations by maximizing the information that can be obtained from gene expression datasets.
Feng, Cai-ping; Mundy, J.
The present mini-review describes newer methods and strategies, including transposon and T-DNA insertions, TILLING, Deleteagene, and RNA interference, to functionally analyze genes of interest in the model plant Arabidopsis. The relative advantages and disadvantages of the systems are also discus...
Yang, Xiaowen; Li, Yajie; Zang, Juan; Li, Yexia; Bie, Pengfei; Lu, Yanli; Wu, Qingmin
Brucella spp. are facultative intracellular pathogens, that cause a contagious zoonotic disease, that can result in such outcomes as abortion or sterility in susceptible animal hosts and grave, debilitating illness in humans. For deciphering the survival mechanism of Brucella spp. in vivo, 42 Brucella complete genomes from NCBI were analyzed for the pan-genome and core genome by identification of their composition and function of Brucella genomes. The results showed that the total 132,143 protein-coding genes in these genomes were divided into 5369 clusters. Among these, 1710 clusters were associated with the core genome, 1182 clusters with strain-specific genes and 2477 clusters with dispensable genomes. COG analysis indicated that 44 % of the core genes were devoted to metabolism, which were mainly responsible for energy production and conversion (COG category C), and amino acid transport and metabolism (COG category E). Meanwhile, approximately 35 % of the core genes were in positive selection. In addition, 1252 potential essential genes were predicted in the core genome by comparison with a prokaryote database of essential genes. The results suggested that the core genes in Brucella genomes are relatively conservation, and the energy and amino acid metabolism play a more important role in the process of growth and reproduction in Brucella spp. This study might help us to better understand the mechanisms of Brucella persistent infection and provide some clues for further exploring the gene modules of the intracellular survival in Brucella spp.
Full Text Available Abstract Background Structural chromosomal rearrangements that lead to expressed fusion genes are a hallmark of acute lymphoblastic leukemia (ALL. In this study, we performed transcriptome sequencing of 134 primary ALL patient samples to comprehensively detect fusion transcripts. Methods We combined fusion gene detection with genome-wide DNA methylation analysis, gene expression profiling, and targeted sequencing to determine molecular signatures of emerging ALL subtypes. Results We identified 64 unique fusion events distributed among 80 individual patients, of which over 50% have not previously been reported in ALL. Although the majority of the fusion genes were found only in a single patient, we identified several recurrent fusion gene families defined by promiscuous fusion gene partners, such as ETV6, RUNX1, PAX5, and ZNF384, or recurrent fusion genes, such as DUX4-IGH. Our data show that patients harboring these fusion genes displayed characteristic genome-wide DNA methylation and gene expression signatures in addition to distinct patterns in single nucleotide variants and recurrent copy number alterations. Conclusion Our study delineates the fusion gene landscape in pediatric ALL, including both known and novel fusion genes, and highlights fusion gene families with shared molecular etiologies, which may provide additional information for prognosis and therapeutic options in the future.
Cohn Zachary A
Full Text Available Abstract Background Cartilage plays a fundamental role in the development of the human skeleton. Early in embryogenesis, mesenchymal cells condense and differentiate into chondrocytes to shape the early skeleton. Subsequently, the cartilage anlagen differentiate to form the growth plates, which are responsible for linear bone growth, and the articular chondrocytes, which facilitate joint function. However, despite the multiplicity of roles of cartilage during human fetal life, surprisingly little is known about its transcriptome. To address this, a whole genome microarray expression profile was generated using RNA isolated from 18–22 week human distal femur fetal cartilage and compared with a database of control normal human tissues aggregated at UCLA, termed Celsius. Results 161 cartilage-selective genes were identified, defined as genes significantly expressed in cartilage with low expression and little variation across a panel of 34 non-cartilage tissues. Among these 161 genes were cartilage-specific genes such as cartilage collagen genes and 25 genes which have been associated with skeletal phenotypes in humans and/or mice. Many of the other cartilage-selective genes do not have established roles in cartilage or are novel, unannotated genes. Quantitative RT-PCR confirmed the unique pattern of gene expression observed by microarray analysis. Conclusion Defining the gene expression pattern for cartilage has identified new genes that may contribute to human skeletogenesis as well as provided further candidate genes for skeletal dysplasias. The data suggest that fetal cartilage is a complex and transcriptionally active tissue and demonstrate that the set of genes selectively expressed in the tissue has been greatly underestimated.
Kim, Jaehee; Ogden, Robert Todd; Kim, Haseong
Time course gene expression experiments are an increasingly popular method for exploring biological processes. Temporal gene expression profiles provide an important characterization of gene function, as biological systems are both developmental and dynamic. With such data it is possible to study gene expression changes over time and thereby to detect differential genes. Much of the early work on analyzing time series expression data relied on methods developed originally for static data and thus there is a need for improved methodology. Since time series expression is a temporal process, its unique features such as autocorrelation between successive points should be incorporated into the analysis. This work aims to identify genes that show different gene expression profiles across time. We propose a statistical procedure to discover gene groups with similar profiles using a nonparametric representation that accounts for the autocorrelation in the data. In particular, we first represent each profile in terms of a Fourier basis, and then we screen out genes that are not differentially expressed based on the Fourier coefficients. Finally, we cluster the remaining gene profiles using a model-based approach in the Fourier domain. We evaluate the screening results in terms of sensitivity, specificity, FDR and FNR, compare with the Gaussian process regression screening in a simulation study and illustrate the results by application to yeast cell-cycle microarray expression data with alpha-factor synchronization.The key elements of the proposed methodology: (i) representation of gene profiles in the Fourier domain; (ii) automatic screening of genes based on the Fourier coefficients and taking into account autocorrelation in the data, while controlling the false discovery rate (FDR); (iii) model-based clustering of the remaining gene profiles. Using this method, we identified a set of cell-cycle-regulated time-course yeast genes. The proposed method is general and can be
Background Horizontal gene transfer (HGT) is relatively common in plant mitochondrial genomes but the mechanisms, extent and consequences of transfer remain largely unknown. Previous results indicate that parasitic plants are often involved as either transfer donors or recipients, suggesting that direct contact between parasite and host facilitates genetic transfer among plants. Results In order to uncover the mechanistic details of plant-to-plant HGT, the extent and evolutionary fate of transfer was investigated between two groups: the parasitic genus Cuscuta and a small clade of Plantago species. A broad polymerase chain reaction (PCR) survey of mitochondrial genes revealed that at least three genes (atp1, atp6 and matR) were recently transferred from Cuscuta to Plantago. Quantitative PCR assays show that these three genes have a mitochondrial location in the one species line of Plantago examined. Patterns of sequence evolution suggest that these foreign genes degraded into pseudogenes shortly after transfer and reverse transcription (RT)-PCR analyses demonstrate that none are detectably transcribed. Three cases of gene conversion were detected between native and foreign copies of the atp1 gene. The identical phylogenetic distribution of the three foreign genes within Plantago and the retention of cytidines at ancestral positions of RNA editing indicate that these genes were probably acquired via a single, DNA-mediated transfer event. However, samplings of multiple individuals from two of the three species in the recipient Plantago clade revealed complex and perplexing phylogenetic discrepancies and patterns of sequence divergence for all three of the foreign genes. Conclusions This study reports the best evidence to date that multiple mitochondrial genes can be transferred via a single HGT event and that transfer occurred via a strictly DNA-level intermediate. The discovery of gene conversion between co-resident foreign and native mitochondrial copies suggests
Full Text Available Abstract Background Horizontal gene transfer (HGT is relatively common in plant mitochondrial genomes but the mechanisms, extent and consequences of transfer remain largely unknown. Previous results indicate that parasitic plants are often involved as either transfer donors or recipients, suggesting that direct contact between parasite and host facilitates genetic transfer among plants. Results In order to uncover the mechanistic details of plant-to-plant HGT, the extent and evolutionary fate of transfer was investigated between two groups: the parasitic genus Cuscuta and a small clade of Plantago species. A broad polymerase chain reaction (PCR survey of mitochondrial genes revealed that at least three genes (atp1, atp6 and matR were recently transferred from Cuscuta to Plantago. Quantitative PCR assays show that these three genes have a mitochondrial location in the one species line of Plantago examined. Patterns of sequence evolution suggest that these foreign genes degraded into pseudogenes shortly after transfer and reverse transcription (RT-PCR analyses demonstrate that none are detectably transcribed. Three cases of gene conversion were detected between native and foreign copies of the atp1 gene. The identical phylogenetic distribution of the three foreign genes within Plantago and the retention of cytidines at ancestral positions of RNA editing indicate that these genes were probably acquired via a single, DNA-mediated transfer event. However, samplings of multiple individuals from two of the three species in the recipient Plantago clade revealed complex and perplexing phylogenetic discrepancies and patterns of sequence divergence for all three of the foreign genes. Conclusions This study reports the best evidence to date that multiple mitochondrial genes can be transferred via a single HGT event and that transfer occurred via a strictly DNA-level intermediate. The discovery of gene conversion between co-resident foreign and native
Ye, Wei; Wu, Hongqing; He, Xin; Wang, Lei; Zhang, Weimin; Li, Haohua; Fan, Yunfei; Tan, Guohui; Liu, Taomei; Gao, Xiaoxia
Agarwood is a traditional Chinese medicine used as a clinical sedative, carminative, and antiemetic drug. Agarwood is formed in Aquilaria sinensis when A. sinensis trees are threatened by external physical, chemical injury or endophytic fungal irritation. However, the mechanism of agarwood formation via chemical induction remains unclear. In this study, we characterized the transcriptome of different parts of a chemically induced A. sinensis trunk sample with agarwood. The Illumina sequencing platform was used to identify the genes involved in agarwood formation. A five-year-old Aquilaria sinensis treated by formic acid was selected. The white wood part (B1 sample), the transition part between agarwood and white wood (W2 sample), the agarwood part (J3 sample), and the rotten wood part (F5 sample) were collected for transcriptome sequencing. Accordingly, 54,685,634 clean reads, which were assembled into 83,467 unigenes, were obtained with a Q20 value of 97.5%. A total of 50,565 unigenes were annotated using the Nr, Nt, SWISS-PROT, KEGG, COG, and GO databases. In particular, 171,331,352 unigenes were annotated by various pathways, including the sesquiterpenoid (ko00909) and plant-pathogen interaction (ko03040) pathways. These pathways were related to sesquiterpenoid biosynthesis and defensive responses to chemical stimulation. The transcriptome data of the different parts of the chemically induced A. sinensis trunk provide a rich source of materials for discovering and identifying the genes involved in sesquiterpenoid production and in defensive responses to chemical stimulation. This study is the first to use de novo sequencing and transcriptome assembly for different parts of chemically induced A. sinensis. Results demonstrate that the sesquiterpenoid biosynthesis pathway and WRKY transcription factor play important roles in agarwood formation via chemical induction. The comparative analysis of the transcriptome data of agarwood and A. sinensis lays the foundation
Full Text Available As a pathological condition, epilepsy is caused by abnormal neuronal discharge in brain which will temporarily disrupt the cerebral functions. Epilepsy is a chronic disease which occurs in all ages and would seriously affect patients’ personal lives. Thus, it is highly required to develop effective medicines or instruments to treat the disease. Identifying epilepsy-related genes is essential in order to understand and treat the disease because the corresponding proteins encoded by the epilepsy-related genes are candidates of the potential drug targets. In this study, a pioneering computational workflow was proposed to predict novel epilepsy-related genes using the random walk with restart (RWR algorithm. As reported in the literature RWR algorithm often produces a number of false positive genes, and in this study a permutation test and functional association tests were implemented to filter the genes identified by RWR algorithm, which greatly reduce the number of suspected genes and result in only thirty-three novel epilepsy genes. Finally, these novel genes were analyzed based upon some recently published literatures. Our findings implicate that all novel genes were closely related to epilepsy. It is believed that the proposed workflow can also be applied to identify genes related to other diseases and deepen our understanding of the mechanisms of these diseases.
Perotto, Silvia; Rodda, Marco; Benetti, Alex; Sillo, Fabiano; Ercole, Enrico; Rodda, Michele; Girlanda, Mariangela; Murat, Claude; Balestrini, Raffaella
Orchids fully depend on symbiotic interactions with specific soil fungi for seed germination and early development. Germinated seeds give rise to a protocorm, a heterotrophic organ that acquires nutrients, including organic carbon, from the mycorrhizal partner. It has long been debated if this interaction is mutualistic or antagonistic. To investigate the molecular bases of the orchid response to mycorrhizal invasion, we developed a symbiotic in vitro system between Serapias vomeracea, a Mediterranean green meadow orchid, and the rhizoctonia-like fungus Tulasnella calospora. 454 pyrosequencing was used to generate an inventory of plant and fungal genes expressed in mycorrhizal protocorms, and plant genes could be reliably identified with a customized bioinformatic pipeline. A small panel of plant genes was selected and expression was assessed by real-time quantitative PCR in mycorrhizal and non-mycorrhizal protocorm tissues. Among these genes were some markers of mutualistic (e.g. nodulins) as well as antagonistic (e.g. pathogenesis-related and wound/stress-induced) genes. None of the pathogenesis or wound/stress-related genes were significantly up-regulated in mycorrhizal tissues, suggesting that fungal colonization does not trigger strong plant defence responses. In addition, the highest expression fold change in mycorrhizal tissues was found for a nodulin-like gene similar to the plastocyanin domain-containing ENOD55. Another nodulin-like gene significantly more expressed in the symbiotic tissues of mycorrhizal protocorms was similar to a sugar transporter of the SWEET family. Two genes coding for mannose-binding lectins were significantly up-regulated in the presence of the mycorrhizal fungus, but their role in the symbiosis is unclear.
Full Text Available Transposable elements (TE usually take up a substantial portion of eukaryotic genome. Activities of TEs can cause genome instability or gene mutations that are harmful or even disastrous to the host. TEs also contribute to gene and genome evolution at many aspects. Part of miRNA genes in mammals have been found to derive from transposons while convincing evidences are absent for plants. We found that a considerable number of previously annotated plant miRNAs are identical or homologous to transposons (TE-MIR, which include a small number of bona fide miRNA genes that conform to generally accepted plant miRNA annotation rules, and hairpin derived siRNAs likely to be pre-evolved miRNAs. Analysis of these TE-MIRs indicate that transitions from the medium to high copy TEs into miRNA genes may undergo steps such as inverted repeat formation, sequence speciation and adaptation to miRNA biogenesis. We also identified initial target genes of the TE-MIRs, which contain homologous sequences in their CDS as consequence of cognate TE insertions. About one-third of the initial target mRNAs are supported by publicly available degradome sequencing data for TE-MIR sRNA induced cleavages. Targets of the TE-MIRs are biased to non-TE related genes indicating their penchant to acquire cellular functions during evolution. Interestingly, most of these TE insertions span boundaries between coding and non-coding sequences indicating their incorporation into CDS through alteration of splicing or translation start or stop signals. Taken together, our findings suggest that TEs in gene rich regions can form foldbacks in non-coding part of transcripts that may eventually evolve into miRNA genes or be integrated into protein coding sequences to form potential targets in a "temperate" manner. Thus, transposons may supply as resources for the evolution of miRNA-target interactions in plants.
Ariyarathna, H A Chandima K; Oldach, Klaus H; Francki, Michael G
Although the HKT transporter genes ascertain some of the key determinants of crop salt tolerance mechanisms, the diversity and functional role of group II HKT genes are not clearly understood in bread wheat. The advanced knowledge on rice HKT and whole genome sequence was, therefore, used in comparative gene analysis to identify orthologous wheat group II HKT genes and their role in trait variation under different saline environments. The four group II HKTs in rice identified two orthologous gene families from bread wheat, including the known TaHKT2;1 gene family and a new distinctly different gene family designated as TaHKT2;2. A single copy of TaHKT2;2 was found on each homeologous chromosome arm 7AL, 7BL and 7DL and each gene was expressed in leaf blade, sheath and root tissues under non-stressed and at 200 mM salt stressed conditions. The proteins encoded by genes of the TaHKT2;2 family revealed more than 93% amino acid sequence identity but ≤52% amino acid identity compared to the proteins encoded by TaHKT2;1 family. Specifically, variations in known critical domains predicted functional differences between the two protein families. Similar to orthologous rice genes on chromosome 6L, TaHKT2;1 and TaHKT2;2 genes were located approximately 3 kb apart on wheat chromosomes 7AL, 7BL and 7DL, forming a static syntenic block in the two species. The chromosomal region on 7AL containing TaHKT2;1 7AL-1 co-located with QTL for shoot Na(+) concentration and yield in some saline environments. The differences in copy number, genes sequences and encoded proteins between TaHKT2;2 homeologous genes and other group II HKT gene families within and across species likely reflect functional diversity for ion selectivity and transport in plants. Evidence indicated that neither TaHKT2;2 nor TaHKT2;1 were associated with primary root Na(+) uptake but TaHKT2;1 may be associated with trait variation for Na(+) exclusion and yield in some but not all saline environments.
Song, Xiaoming; Duan, Weike; Huang, Zhinan; Liu, Gaofeng; Wu, Peng; Liu, Tongkun; Li, Ying; Hou, Xilin
In plants, flowering is the most important transition from vegetative to reproductive growth. The flowering patterns of monocots and eudicots are distinctly different, but few studies have described the evolutionary patterns of the flowering genes in them. In this study, we analysed the evolutionary pattern, duplication and expression level of these genes. The main results were as follows: (i) characterization of flowering genes in monocots and eudicots, including the identification of family-specific, orthologous and collinear genes; (ii) full characterization of CONSTANS-like genes in Brassica rapa (BraCOL genes), the key flowering genes; (iii) exploration of the evolution of COL genes in plant kingdom and construction of the evolutionary pattern of COL genes; (iv) comparative analysis of CO and FT genes between Brassicaceae and Grass, which identified several family-specific amino acids, and revealed that CO and FT protein structures were similar in B. rapa and Arabidopsis but different in rice; and (v) expression analysis of photoperiod pathway-related genes in B. rapa under different photoperiod treatments by RT-qPCR. This analysis will provide resources for understanding the flowering mechanisms and evolutionary pattern of COL genes. In addition, this genome-wide comparative study of COL genes may also provide clues for evolution of other flowering genes.
Full Text Available Integrative analysis of gene dosage, expression, and ontology (GO data was performed to discover driver genes in the carcinogenesis and chemoradioresistance of cervical cancers. Gene dosage and expression profiles of 102 locally advanced cervical cancers were generated by microarray techniques. Fifty-two of these patients were also analyzed with the Illumina expression method to confirm the gene expression results. An independent cohort of 41 patients was used for validation of gene expressions associated with clinical outcome. Statistical analysis identified 29 recurrent gains and losses and 3 losses (on 3p, 13q, 21q associated with poor outcome after chemoradiotherapy. The intratumor heterogeneity, assessed from the gene dosage profiles, was low for these alterations, showing that they had emerged prior to many other alterations and probably were early events in carcinogenesis. Integration of the alterations with gene expression and GO data identified genes that were regulated by the alterations and revealed five biological processes that were significantly overrepresented among the affected genes: apoptosis, metabolism, macromolecule localization, translation, and transcription. Four genes on 3p (RYBP, GBE1 and 13q (FAM48A, MED4 correlated with outcome at both the gene dosage and expression level and were satisfactorily validated in the independent cohort. These integrated analyses yielded 57 candidate drivers of 24 genetic events, including novel loci responsible for chemoradioresistance. Further mapping of the connections among genetic events, drivers, and biological processes suggested that each individual event stimulates specific processes in carcinogenesis through the coordinated control of multiple genes. The present results may provide novel therapeutic opportunities of both early and advanced stage cervical cancers.
Full Text Available Background: A large number of gene expression profiling (GEP studies on colorectal carcinogenesis have been performed but no reliable gene signature has been identified so far due to the lack of reproducibility in the reported genes. There is growing evidence that functionally related genes, rather than individual genes, contribute to the etiology of complex traits. We used, as a novel approach, pathway enrichment tools to define functionally related genes that are consistently up- or down-regulated in colorectal carcinogenesis. Materials and Methods: We started the analysis with 242 unique annotated genes that had been reported by any of three recent meta-analyses covering GEP studies on genes differentially expressed in carcinoma vs normal mucosa. Most of these genes (218, 91.9% had been reported in at least three GEP studies. These 242 genes were submitted to bioinformatic analysis using a total of nine tools to detect enrichment of Gene Ontology (GO categories or Kyoto Encyclopedia of Genes and Genomes (KEGG pathways. As a final consistency criterion the pathway categories had to be enriched by several tools to be taken into consideration. Results: Our pathway-based enrichment analysis identified the categories of ribosomal protein constituents, extracellular matrix receptor interaction, carbonic anhydrase isozymes, and a general category related to inflammation and cellular response as significantly and consistently overrepresented entities. Conclusions: We triaged the genes covered by the published GEP literature on colorectal carcinogenesis and subjected them to multiple enrichment tools in order to identify the consistently enriched gene categories. These turned out to have known functional relationships to cancer development and thus deserve further investigation.
Mehrotra, Shweta; Goyal, Vinod
Agrobacterium, the natures' genetic engineer, has been used as a vector to create transgenic plants. Agrobacterium-mediated gene transfer in plants is a highly efficient transformation process which is governed by various factors including genotype of the host plant, explant, vector, plasmid, bacterial strain, composition of culture medium, tissue damage, and temperature of co-cultivation. Agrobacterium has been successfully used to transform various economically and horticulturally important monocot and dicot species by standard tissue culture and in planta transformation techniques like floral or seedling infilteration, apical meristem transformation, and the pistil drip methods. Monocots have been comparatively difficult to transform by Agrobacterium. However, successful transformations have been reported in the last few years based on the adjustment of the parameters that govern the responses of monocots to Agrobacterium. A novel Agrobacterium transferred DNA-derived nanocomplex method has been developed which will be highly valuable for plant biology and biotechnology. Agrobacterium-mediated genetic transformation is known to be the preferred method of creating transgenic plants from a commercial and biosafety perspective. Agrobacterium-mediated gene transfer predominantly results in the integration of foreign genes at a single locus in the host plant, without associated vector backbone and is also known to produce marker free plants, which are the prerequisites for commercialization of transgenic crops. Research in Agrobacterium-mediated transformation can provide new and novel insights into the understanding of the regulatory process controlling molecular, cellular, biochemical, physiological, and developmental processes occurring during Agrobacterium-mediated transformation and also into a wide range of aspects on biological safety of transgenic crops to improve crop production to meet the demands of ever-growing world's population.
PRAKASH KUMAR G
and Walsh 1996). The balance between proliferation and ... In three lines, insertion occurred in genes previously implicated in the control of quiescence, i.e. ...... arrest-specific traps fall into different functional classes, such as cytoskeletal ...
Ding, Ruoyao; Arighi, Cecilia N.; Lee, Jung-Youn; Wu, Cathy H.; Vijay-Shanker, K.
Background Automatically detecting gene/protein names in the literature and connecting them to databases records, also known as gene normalization, provides a means to structure the information buried in free-text literature. Gene normalization is critical for improving the coverage of annotation in the databases, and is an essential component of many text mining systems and database curation pipelines. Methods In this manuscript, we describe a gene normalization system specifically tailored for plant species, called pGenN (pivot-based Gene Normalization). The system consists of three steps: dictionary-based gene mention detection, species assignment, and intra species normalization. We have developed new heuristics to improve each of these phases. Results We evaluated the performance of pGenN on an in-house expertly annotated corpus consisting of 104 plant relevant abstracts. Our system achieved an F-value of 88.9% (Precision 90.9% and Recall 87.2%) on this corpus, outperforming state-of-art systems presented in BioCreative III. We have processed over 440,000 plant-related Medline abstracts using pGenN. The gene normalization results are stored in a local database for direct query from the pGenN web interface (proteininformationresource.org/pgenn/). The annotated literature corpus is also publicly available through the PIR text mining portal (proteininformationresource.org/iprolink/). PMID:26258475
Blyth, Julie; Makrantoni, Vasso; Barton, Rachael E.; Spanos, Christos; Rappsilber, Juri; Marston, Adele L.
Meiosis is a specialized cell division that generates gametes, such as eggs and sperm. Errors in meiosis result in miscarriages and are the leading cause of birth defects; however, the molecular origins of these defects remain unknown. Studies in model organisms are beginning to identify the genes and pathways important for meiosis, but the parts list is still poorly defined. Here we present a comprehensive catalog of genes important for meiosis in the fission yeast, Schizosaccharomyces pombe. Our genome-wide functional screen surveyed all nonessential genes for roles in chromosome segregation and spore formation. Novel genes important at distinct stages of the meiotic chromosome segregation and differentiation program were identified. Preliminary characterization implicated three of these genes in centrosome/spindle pole body, centromere, and cohesion function. Our findings represent a near-complete parts list of genes important for meiosis in fission yeast, providing a valuable resource to advance our molecular understanding of meiosis. PMID:29259000
Paules Richard S
Full Text Available Abstract Background A common observation in the analysis of gene expression data is that many genes display similarity in their expression patterns and therefore appear to be co-regulated. However, the variation associated with microarray data and the complexity of the experimental designs make the acquisition of co-expressed genes a challenge. We developed a novel method for Extracting microarray gene expression Patterns and Identifying co-expressed Genes, designated as EPIG. The approach utilizes the underlying structure of gene expression data to extract patterns and identify co-expressed genes that are responsive to experimental conditions. Results Through evaluation of the correlations among profiles, the magnitude of variation in gene expression profiles, and profile signal-to-noise ratio's, EPIG extracts a set of patterns representing co-expressed genes. The method is shown to work well with a simulated data set and microarray data obtained from time-series studies of dauer recovery and L1 starvation in C. elegans and after ultraviolet (UV or ionizing radiation (IR-induced DNA damage in diploid human fibroblasts. With the simulated data set, EPIG extracted the appropriate number of patterns which were more stable and homogeneous than the set of patterns that were determined using the CLICK or CAST clustering algorithms. However, CLICK performed better than EPIG and CAST with respect to the average correlation between clusters/patterns of the simulated data. With real biological data, EPIG extracted more dauer-specific patterns than CLICK. Furthermore, analysis of the IR/UV data revealed 18 unique patterns and 2661 genes out of approximately 17,000 that were identified as significantly expressed and categorized to the patterns by EPIG. The time-dependent patterns displayed similar and dissimilar responses between IR and UV treatments. Gene Ontology analysis applied to each pattern-related subset of co-expressed genes revealed underlying
Full Text Available Rheumatoid arthritis (RA is a complex autoimmune disease. Using a gene-based association research strategy, the present study aims to detect unknown susceptibility to RA and to address the ethnic differences in genetic susceptibility to RA between European and Asian populations.Gene-based association analyses were performed with KGG 2.5 by using publicly available large RA datasets (14,361 RA cases and 43,923 controls of European subjects, 4,873 RA cases and 17,642 controls of Asian Subjects. For the newly identified RA-associated genes, gene set enrichment analyses and protein-protein interactions analyses were carried out with DAVID and STRING version 10.0, respectively. Differential expression verification was conducted using 4 GEO datasets. The expression levels of three selected 'highly verified' genes were measured by ELISA among our in-house RA cases and controls.A total of 221 RA-associated genes were newly identified by gene-based association study, including 71'overlapped', 76 'European-specific' and 74 'Asian-specific' genes. Among them, 105 genes had significant differential expressions between RA patients and health controls at least in one dataset, especially for 20 genes including 11 'overlapped' (ABCF1, FLOT1, HLA-F, IER3, TUBB, ZKSCAN4, BTN3A3, HSP90AB1, CUTA, BRD2, HLA-DMA, 5 'European-specific' (PHTF1, RPS18, BAK1, TNFRSF14, SUOX and 4 'Asian-specific' (RNASET2, HFE, BTN2A2, MAPK13 genes whose differential expressions were significant at least in three datasets. The protein expressions of two selected genes FLOT1 (P value = 1.70E-02 and HLA-DMA (P value = 4.70E-02 in plasma were significantly different in our in-house samples.Our study identified 221 novel RA-associated genes and especially highlighted the importance of 20 candidate genes on RA. The results addressed ethnic genetic background differences for RA susceptibility between European and Asian populations and detected a long list of overlapped or ethnic specific RA
Full Text Available Differential expression plays an important role in cancer diagnosis and classification. In recent years, many methods have been used to identify differentially expressed genes. However, the recognition rate and reliability of gene selection still need to be improved. In this paper, a novel constrained method named robust nonnegative matrix factorization via joint graph Laplacian and discriminative information (GLD-RNMF is proposed for identifying differentially expressed genes, in which manifold learning and the discriminative label information are incorporated into the traditional nonnegative matrix factorization model to train the objective matrix. Specifically, L2,1-norm minimization is enforced on both the error function and the regularization term which is robust to outliers and noise in gene data. Furthermore, the multiplicative update rules and the details of convergence proof are shown for the new model. The experimental results on two publicly available cancer datasets demonstrate that GLD-RNMF is an effective method for identifying differentially expressed genes.
Full Text Available Background: The presence of diverse types of nanomaterials (NMs in commerce is growing at an exponential pace. As a result, human exposure to these materials in the environment is inevitable, necessitating the need for rapid and reliable toxicity testing methods to accurately assess the potential hazards associated with NMs. In this study, we applied biclustering and gene set enrichment analysis methods to derive essential features of altered lung transcriptome following exposure to NMs that are associated with lung-specific diseases. Several datasets from public microarray repositories describing pulmonary diseases in mouse models following exposure to a variety of substances were examined and functionally related biclusters of genes showing similar expression profiles were identified. The identified biclusters were then used to conduct a gene set enrichment analysis on pulmonary gene expression profiles derived from mice exposed to nano-titanium dioxide (nano-TiO2, carbon black (CB or carbon nanotubes (CNTs to determine the disease significance of these data-driven gene sets.Results: Biclusters representing inflammation (chemokine activity, DNA binding, cell cycle, apoptosis, reactive oxygen species (ROS and fibrosis processes were identified. All of the NM studies were significant with respect to the bicluster related to chemokine activity (DAVID; FDR p-value = 0.032. The bicluster related to pulmonary fibrosis was enriched in studies where toxicity induced by CNT and CB studies was investigated, suggesting the potential for these materials to induce lung fibrosis. The pro-fibrogenic potential of CNTs is well established. Although CB has not been shown to induce fibrosis, it induces stronger inflammatory, oxidative stress and DNA damage responses than nano-TiO2 particles.Conclusion: The results of the analysis correctly identified all NMs to be inflammogenic and only CB and CNTs as potentially fibrogenic. In addition to identifying several
Full Text Available Abstract Background Domain or gene fusion analysis is a bioinformatics method for detecting gene fusions in one organism by comparing its genome to that of other organisms. The occurrence of gene fusions suggests that the two original genes that participated in the fusion are functionally linked, i.e. their gene products interact either as part of a multi-subunit protein complex, or in a metabolic pathway. Gene fusion analysis has been used to identify protein functional links in prokaryotes as well as in eukaryotic model organisms, such as yeast and Drosophila. Results In this study we have extended this approach to include a number of recently sequenced protists, four of which are pathogenic, to identify fusion linked proteins in Trypanosoma brucei, the causative agent of African sleeping sickness. We have also examined the evolution of the gene fusion events identified, to determine whether they can be attributed to fusion or fission, by looking at the conservation of the fused genes and of the individual component genes across the major eukaryotic and prokaryotic lineages. We find relatively limited occurrence of gene fusions/fissions within the protist lineages examined. Our results point to two trypanosome-specific gene fissions, which have recently been experimentally confirmed, one fusion involving proteins involved in the same metabolic pathway, as well as two novel putative functional links between fusion-linked protein pairs. Conclusions This is the first study of protein functional links in T. brucei identified by gene fusion analysis. We have used strict thresholds and only discuss results which are highly likely to be genuine and which either have already been or can be experimentally verified. We discuss the possible impact of the identification of these novel putative protein-protein interactions, to the development of new trypanosome therapeutic drugs.
Zirlinger, M.; Kreiman, Gabriel; Anderson, D. J.
Microarray technology represents a potentially powerful method for identifying cell type- and regionally restricted genes expressed in the brain. Here we have combined a microarray analysis of differential gene expression among five selected brain regions, including the amygdala, cerebellum, hippocampus, olfactory bulb, and periaqueductal gray, with in situ hybridization. On average, 0.3% of the 34,000 genes interrogated were highly enriched in each of the five regions...
Roncato-Maccari, Lauren D B; Ramos, Humberto J O; Pedrosa, Fabio O; Alquini, Yedo; Chubatsu, Leda S; Yates, Marshall G; Rigo, Liu U; Steffens, Maria Berenice R; Souza, Emanuel M
Abstract The interactions between maize, sorghum, wheat and rice plants and Herbaspirillum seropedicae were examined microscopically following inoculation with the H. seropedicae LR15 strain, a Nif(+) (Pnif::gusA) mutant obtained by the insertion of a gusA-kanamycin cassette into the nifH gene of the H. seropedicae wild-type strain. The expression of the Pnif::gusA fusion was followed during the association of the diazotroph with the gramineous species. Histochemical analysis of seedlings of maize, sorghum, wheat and rice grown in vermiculite showed that strain LR15 colonized root surfaces and inner tissues. In early steps of the endophytic association, H. seropedicae colonized root exudation sites, such as axils of secondary roots and intercellular spaces of the root cortex; it then occupied the vascular tissue and there expressed nif genes. The expression of nif genes occurred in roots, stems and leaves as detected by the GUS reporter system. The expression of nif genes was also observed in bacterial colonies located in the external mucilaginous root material, 8 days after inoculation. Moreover, the colonization of plant tissue by H. seropedicae did not depend on the nitrogen-fixing ability, since similar numbers of cells were isolated from roots or shoots of the plants inoculated with Nif(+) or Nif(-) strains.
Anna V. Shchennikova
Full Text Available Monotropa hypopitys is a mycoheterotrophic, nonphotosynthetic plant acquiring nutrients from the roots of autotrophic trees through mycorrhizal symbiosis, and, similar to other extant plants, forming asymmetrical lateral organs during development. The members of the YABBY family of transcription factors are important players in the establishment of leaf and leaf-like organ polarity in plants. This is the first report on the identification of YABBY genes in a mycoheterotrophic plant devoid of aboveground vegetative organs. Seven M. hypopitys YABBY members were identified and classified into four clades. By structural analysis of putative encoded proteins, we confirmed the presence of YABBY-defining conserved domains and identified novel clade-specific motifs. Transcriptomic and qRT-PCR analyses of different tissues revealed MhyYABBY transcriptional patterns, which were similar to those of orthologous YABBY genes from other angiosperms. These data should contribute to the understanding of the role of the YABBY genes in the regulation of developmental and physiological processes in achlorophyllous leafless plants.
Puccinia striiformis f. sp. tritici (Pst) causes stripe rust, one of the most important diseases of wheat worldwide. To identify Pst genes involved in infection and sporulation, a custom oligonucleotide Genechip was made using sequences of 442 genes selected from Pst cDNA libraries. Microarray analy...
Kaczkowski, Bogumil; Tanaka, Yuji; Kawaji, Hideya
Genes that are commonly deregulated in cancer are clinically attractive as candidate pan-diagnostic markers and therapeutic targets. To globally identify such targets, we compared Cap Analysis of Gene Expression (CAGE) profiles from 225 different cancer cell lines and 339 corresponding primary cell...
Bream, Elise N A; Leppellere, Cara R; Cooper, Margaret E
Background:The aim of this study was to identify genetic variants contributing to preterm birth (PTB) using a linkage candidate gene approach.Methods:We studied 99 single-nucleotide polymorphisms (SNPs) for 33 genes in 257 families with PTBs segregating. Nonparametric and parametric analyses were...... through the infant and/or the mother in the etiology of PTB....
Hu, H; Haas, S.A.; Chelly, J.; Esch, H. Van; Raynaud, M.; Brouwer, A.P. de; Weinert, S.; Froyen, G.; Frints, S.G.; Laumonnier, F.; Zemojtel, T.; Love, M.I.; Richard, H.; Emde, A.K.; Bienek, M.; Jensen, C.; Hambrock, M.; Fischer, U.; Langnick, C.; Feldkamp, M.; Wissink-Lindhout, W.; Lebrun, N.; Castelnau, L.; Rucci, J.; Montjean, R.; Dorseuil, O.; Billuart, P.; Stuhlmann, T.; Shaw, M.; Corbett, M.A.; Gardner, A.; Willis-Owen, S.; Tan, C.; Friend, K.L.; Belet, S.; Roozendaal, K.E. van; Jimenez-Pocquet, M.; Moizard, M.P.; Ronce, N.; Sun, R.; O'Keeffe, S.; Chenna, R.; Bommel, A. van; Goke, J.; Hackett, A.; Field, M.; Christie, L.; Boyle, J.; Haan, E.; Nelson, J.; Turner, G.; Baynam, G.; Gillessen-Kaesbach, G.; Muller, U.; Steinberger, D.; Budny, B.; Badura-Stronka, M.; Latos-Bielenska, A.; Ousager, L.B.; Wieacker, P.; Rodriguez Criado, G.; Bondeson, M.L.; Anneren, G.; Dufke, A.; Cohen, M.; Maldergem, L. Van; Vincent-Delorme, C.; Echenne, B.; Simon-Bouy, B.; Kleefstra, T.; Willemsen, M.H.; Fryns, J.P.; Devriendt, K.; Ullmann, R.; Vingron, M.; Wrogemann, K.; Wienker, T.F.; Tzschach, A.; Bokhoven, H. van; Gecz, J.; Jentsch, T.J.; Chen, W.; Ropers, H.H.; Kalscheuer, V.M.
X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes or
Hu, H; Haas, S A; Chelly, J
X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes...
Phylactides, M.; Rowntree, R.; Nuthall, H.
hypersensitive sites (DHS) within the locus. We previously identified at least 12 clusters of DHS across the CFTR gene and here further evaluate DHS in introns 2,3,10,16,17a, 18, 20 and 21 to assess their functional importance in regulation of CFTR gene expression. Transient transfections of enhancer/reporter...
Noor, Dzul Azri Mohamed; Jeyapalan, Jennie N; Alhazmi, Safiah; Carr, Matthew; Squibb, Benjamin; Wallace, Claire; Tan, Christopher; Cusack, Martin; Hughes, Jaime; Reader, Tom; Shipley, Janet; Sheer, Denise; Scotting, Paul J
Silencing of genes by DNA methylation is a common phenomenon in many types of cancer. However, the genome-wide effect of DNA methylation on gene expression has been analysed in relatively few cancers. Germ cell tumours (GCTs) are a complex group of malignancies. They are unique in developing from a pluripotent progenitor cell. Previous analyses have suggested that non-seminomas exhibit much higher levels of DNA methylation than seminomas. The genomic targets that are methylated, the extent to which this results in gene silencing and the identity of the silenced genes most likely to play a role in the tumours' biology have not yet been established. In this study, genome-wide methylation and expression analysis of GCT cell lines was combined with gene expression data from primary tumours to address this question. Genome methylation was analysed using the Illumina infinium HumanMethylome450 bead chip system and gene expression was analysed using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Regulation by methylation was confirmed by demethylation using 5-aza-2-deoxycytidine and reverse transcription-quantitative PCR. Large differences in the level of methylation of the CpG islands of individual genes between tumour cell lines correlated well with differential gene expression. Treatment of non-seminoma cells with 5-aza-2-deoxycytidine verified that methylation of all genes tested played a role in their silencing in yolk sac tumour cells and many of these genes were also differentially expressed in primary tumours. Genes silenced by methylation in the various GCT cell lines were identified. Several pluripotency-associated genes were identified as a major functional group of silenced genes.
Hultman, Jenni; Tamminen, Manu; Pärnänen, Katariina; Cairns, Johannes; Karkman, Antti; Virta, Marko
Wastewater treatment plants (WWTPs) collect wastewater from various sources for a multi-step treatment process. By mixing a large variety of bacteria and promoting their proximity, WWTPs constitute potential hotspots for the emergence of antibiotic resistant bacteria. Concerns have been expressed regarding the potential of WWTPs to spread antibiotic resistance genes (ARGs) from environmental reservoirs to human pathogens. We utilized epicPCR (Emulsion, Paired Isolation and Concatenation PCR) to detect the bacterial hosts of ARGs in two WWTPs. We identified the host distribution of four resistance-associated genes (tetM, int1, qacEΔ1and blaOXA-58) in influent and effluent. The bacterial hosts of these resistance genes varied between the WWTP influent and effluent, with a generally decreasing host range in the effluent. Through 16S rRNA gene sequencing, it was determined that the resistance gene carrying bacteria include both abundant and rare taxa. Our results suggest that the studied WWTPs mostly succeed in decreasing the host range of the resistance genes during the treatment process. Still, there were instances where effluent contained resistance genes in bacterial groups not carrying these genes in the influent. By permitting exhaustive profiling of resistance-associated gene hosts in WWTP bacterial communities, the application of epicPCR provides a new level of precision to our resistance gene risk estimates.
Fochi, Valeria; Falla, Nicole; Girlanda, Mariangela; Perotto, Silvia; Balestrini, Raffaella
Orchid mycorrhizal protocorms and roots are heterogeneous structures composed of different plant cell-types, where cells colonized by intracellular fungal coils (the pelotons) are close to non-colonized plant cells. Moreover, the fungal coils undergo rapid turnover inside the colonized cells, so that plant cells containing coils at different developmental stages can be observed in the same tissue section. Here, we have investigated by laser microdissection (LMD) the localization of specific plant gene transcripts in different cell-type populations collected from mycorrhizal protocorms and roots of the Mediterranean orchid Serapias vomeracea colonized by Tulasnella calospora. RNAs extracted from the different cell-type populations have been used to study plant gene expression, focusing on genes potentially involved in N uptake and transport and previously identified as up-regulated in symbiotic protocorms. Results clearly showed that some plant N transporters are differentially expressed in cells containing fungal coils at different developmental stages, as well as in non-colonized cells, and allowed the identification of new functional markers associated to coil-containing cells. Copyright © 2017 Elsevier B.V. All rights reserved.
Zhang, Tao; Zhao, Yun-Long; Zhao, Jian-Hua; Wang, Sheng; Jin, Yun; Chen, Zhong-Qi; Fang, Yuan-Yuan; Hua, Chen-Lei; Ding, Shou-Wei; Guo, Hui-Shan
Plant pathogenic fungi represent the largest group of disease-causing agents on crop plants, and are a constant and major threat to agriculture worldwide. Recent studies have shown that engineered production of RNA interference (RNAi)-inducing dsRNA in host plants can trigger specific fungal gene silencing and confer resistance to fungal pathogens 1-7 . Although these findings illustrate efficient uptake of host RNAi triggers by pathogenic fungi, it is unknown whether or not such an uptake mechanism has been evolved for a natural biological function in fungus-host interactions. Here, we show that in response to infection with Verticillium dahliae (a vascular fungal pathogen responsible for devastating wilt diseases in many crops) cotton plants increase production of microRNA 166 (miR166) and miR159 and export both to the fungal hyphae for specific silencing. We found that two V. dahliae genes encoding a Ca 2+ -dependent cysteine protease (Clp-1) and an isotrichodermin C-15 hydroxylase (HiC-15), and targeted by miR166 and miR159, respectively, are both essential for fungal virulence. Notably, V. dahliae strains expressing either Clp-1 or HiC-15 rendered resistant to the respective miRNA exhibited drastically enhanced virulence in cotton plants. Together, our findings identify a novel defence strategy of host plants by exporting specific miRNAs to induce cross-kingdom gene silencing in pathogenic fungi and confer disease resistance.
Liew, O. W.; Chong, Jenny P. C.; Asundi, Anand K.
This work focuses on developing a portable fibre optic fluorescence analyser for rapid identification of genetically modified plants tagged with a fluorescent marker gene. Independent transgenic tobacco plant lines expressing the enhanced green fluorescence protein (EGFP) gene were regenerated following Agrobacterium-mediated gene transfer. Molecular characterisation of these plant lines was carried out at the DNA level by PCR screening to confirm their transgenic status. Conventional transgene expression analysis was then carried out at the RNA level by RT-PCR and at the protein level by Western blotting using anti-GFP rabbit antiserum. The amount of plant-expressed EGFP on a Western blot was quantified against known amounts of purified EGFP by scanning densitometry. The expression level of EGFP in transformed plants was found to range from 0.1 - 0.6% of total extractable protein. A comparison between conventional western analysis of transformants and direct spectroscopic quantification using the fibre optic fluorescence analyser was made. The results showed that spectroscopic measurements of fluorescence emission from strong EGFP expressors correlated positively with Western blot data. However, the fluorescence analyser was also able to identify weakly expressing plant transformants below the detection limit of colorimetric Western blotting.
de O. Buanafina, Marcia Maria [Pennsylvania State Univ., University Park, PA (United States)
This proposal focuses on cell wall feruloylation and our long term goal is to identify and isolate novel genes controlling feruloylation and to characterize the phenotype of mutants in this pathway, with a spotlight on cell wall properties.
Lilley, Catherine J.; Maqbool, Abbas; Wu, Duqing; Yusup, Hazijah B.; Jones, Laura M.; Birch, Paul R. J.; Urwin, Peter E.
Plant pathogens and parasites are a major threat to global food security. Plant parasitism has arisen four times independently within the phylum Nematoda, resulting in at least one parasite of every major food crop in the world. Some species within the most economically important order (Tylenchida) secrete proteins termed effectors into their host during infection to re-programme host development and immunity. The precise detail of how nematodes evolve new effectors is not clear. Here we reconstruct the evolutionary history of a novel effector gene family. We show that during the evolution of plant parasitism in the Tylenchida, the housekeeping glutathione synthetase (GS) gene was extensively replicated. New GS paralogues acquired multiple dorsal gland promoter elements, altered spatial expression to the secretory dorsal gland, altered temporal expression to primarily parasitic stages, and gained a signal peptide for secretion. The gene products are delivered into the host plant cell during infection, giving rise to “GS-like effectors”. Remarkably, by solving the structure of GS-like effectors we show that during this process they have also diversified in biochemical activity, and likely represent the founding members of a novel class of GS-like enzyme. Our results demonstrate the re-purposing of an endogenous housekeeping gene to form a family of effectors with modified functions. We anticipate that our discovery will be a blueprint to understand the evolution of other plant-parasitic nematode effectors, and the foundation to uncover a novel enzymatic function. PMID:29641602
Full Text Available Recombinant proteins are primarily produced from cultures of mammalian, insect, and bacteria cells. In recent years, the development of deconstructed virus-based vectors has allowed plants to become a viable platform for recombinant protein production, with advantages in versatility, speed, cost, scalability, and safety over the current production paradigms. In this paper, we review the recent progress in the methodology of agroinfiltration, a solution to overcome the challenge of transgene delivery into plant cells for large-scale manufacturing of recombinant proteins. General gene delivery methodologies in plants are first summarized, followed by extensive discussion on the application and scalability of each agroinfiltration method. New development of a spray-based agroinfiltration and its application on field-grown plants is highlighted. The discussion of agroinfiltration vectors focuses on their applications for producing complex and heteromultimeric proteins and is updated with the development of bridge vectors. Progress on agroinfiltration in Nicotiana and non-Nicotiana plant hosts is subsequently showcased in context of their applications for producing high-value human biologics and low-cost and high-volume industrial enzymes. These new advancements in agroinfiltration greatly enhance the robustness and scalability of transgene delivery in plants, facilitating the adoption of plant transient expression systems for manufacturing recombinant proteins with a broad range of applications.
Plant genetic transformation usually depends on efficient adventitious regeneration systems. In almond (Prunus dulcis Mill.), regeneration of transgenic adventitious shoots was achieved but with low efficiency. Histological studies identified two main stages of organogenesis in almond explants that ...
Hu, H.; Haas, S.A.; Chelly, J.; Van Esch, H.; Raynaud, M.; de Brouwer, A.P.M.; Weinert, S.; Froyen, G.; Frints, S.G.M.; Laumonnier, F.; Zemojtel, T.; Love, M.I.; Richard, H.; Emde, A.K.; Bienek, M.
X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes or loci are yet to be identified. Here, we have investigated 405 unresolved families with XLID. We employed massively parallel sequencing of all X-chromosome exons in the index males. The majority of ...
Liang, Chaoqiong; Hao, Jianjun; Meng, Yan; Luo, Laixin; Li, Jianqiang
Cucumber green mottle mosaic virus (CGMMV) is an economically important pathogen and causes significant reduction of both yield and quality of cucumber (Cucumis sativus). Currently, there were no satisfied strategies for controlling the disease. A better understanding of microRNA (miRNA) expression related to the regulation of plant-virus interactions and virus resistance would be of great assistance when developing control strategies for CGMMV. However, accurate expression analysis is highly dependent on robust and reliable reference gene used as an internal control for normalization of miRNA expression. Most commonly used reference genes involved in CGMMV-infected cucumber are not universally expressed depending on tissue types and stages of plant development. It is therefore crucial to identify suitable reference genes in investigating the role of miRNA expression. In this study, seven reference genes, including Actin, Tubulin, EF-1α, 18S rRNA, Ubiquitin, GAPDH and Cyclophilin, were evaluated for the most accurate results in analyses using reverse transcription-quantitative polymerase chain reaction (RT-qPCR). Gene expression was assayed on cucumber leaves, stems and roots that were collected at different days post inoculation with CGMMV. The expression data were analyzed using algorithms including delta-Ct, geNorm, NormFinder, and BestKeeper as well as the comparative tool RefFinder. The reference genes were subsequently validated using miR159. The results showed that EF-1α and GAPDH were the most reliable reference genes for normalizing miRNA expression in leaf, root and stem samples, while Ubiquitin and EF-1α were the most suitable combination overall. PMID:29543906
Wu, Mingsong; Tu, Tao; Huang, Yunchao; Cao, Yi
To understand the carcinogenesis caused by accumulated genetic and epigenetic alterations and seek novel biomarkers for various cancers, studying differentially expressed genes between cancerous and normal tissues is crucial. In the study, two cDNA libraries of lung cancer were constructed and screened for identification of differentially expressed genes. Two cDNA libraries of differentially expressed genes were constructed using lung adenocarcinoma tissue and adjacent nonmalignant lung tissue by suppression subtractive hybridization. The data of the cDNA libraries were then analyzed and compared using bioinformatics analysis. Levels of mRNA and protein were measured by quantitative real-time polymerase chain reaction (q-RT-PCR) and western blot respectively, as well as expression and localization of proteins were determined by immunostaining. Gene functions were investigated using proliferation and migration assays after gene silencing and gene over-expression. Two libraries of differentially expressed genes were obtained. The forward-subtracted library (FSL) and the reverse-subtracted library (RSL) contained 177 and 59 genes, respectively. Bioinformatic analysis demonstrated that these genes were involved in a wide range of cellular functions. The vast majority of these genes were newly identified to be abnormally expressed in lung cancer. In the first stage of the screening for 16 genes, we compared lung cancer tissues with their adjacent non-malignant tissues at the mRNA level, and found six genes (ERGIC3, DDR1, HSP90B1, SDC1, RPSA, and LPCAT1) from the FSL were significantly up-regulated while two genes (GPX3 and TIMP3) from the RSL were significantly down-regulated (P < 0.05). The ERGIC3 protein was also over-expressed in lung cancer tissues and cultured cells, and expression of ERGIC3 was correlated with the differentiated degree and histological type of lung cancer. The up-regulation of ERGIC3 could promote cellular migration and proliferation in vitro. The
John Patrick Mpindi
Full Text Available BACKGROUND: Meta-analysis of gene expression microarray datasets presents significant challenges for statistical analysis. We developed and validated a new bioinformatic method for the identification of genes upregulated in subsets of samples of a given tumour type ('outlier genes', a hallmark of potential oncogenes. METHODOLOGY: A new statistical method (the gene tissue index, GTI was developed by modifying and adapting algorithms originally developed for statistical problems in economics. We compared the potential of the GTI to detect outlier genes in meta-datasets with four previously defined statistical methods, COPA, the OS statistic, the t-test and ORT, using simulated data. We demonstrated that the GTI performed equally well to existing methods in a single study simulation. Next, we evaluated the performance of the GTI in the analysis of combined Affymetrix gene expression data from several published studies covering 392 normal samples of tissue from the central nervous system, 74 astrocytomas, and 353 glioblastomas. According to the results, the GTI was better able than most of the previous methods to identify known oncogenic outlier genes. In addition, the GTI identified 29 novel outlier genes in glioblastomas, including TYMS and CDKN2A. The over-expression of these genes was validated in vivo by immunohistochemical staining data from clinical glioblastoma samples. Immunohistochemical data were available for 65% (19 of 29 of these genes, and 17 of these 19 genes (90% showed a typical outlier staining pattern. Furthermore, raltitrexed, a specific inhibitor of TYMS used in the therapy of tumour types other than glioblastoma, also effectively blocked cell proliferation in glioblastoma cell lines, thus highlighting this outlier gene candidate as a potential therapeutic target. CONCLUSIONS/SIGNIFICANCE: Taken together, these results support the GTI as a novel approach to identify potential oncogene outliers and drug targets. The algorithm is
Mufti, F.U.D.; Banaras, S.
Internal control genes are the constitutive genes which maintain the basic cellular functions and regularly express in both normal and stressed conditions in living organisms. They are used in normalization of gene expression studies in comparative analysis of target genes, as their expression remains comparatively unchanged in all varied conditions. Among internal control genes, actin is considered as a candidate gene for expression studies due to its vital role in shaping cytoskeleton and plant physiology. Unfortunately most of such knowledge is limited to only model plants or crops, not much is known about important medicinal plants. Therefore, we selected seven important medicinal wild plants for molecular identification of actin gene. We used gene specific primers designed from the conserved regions of several known orthologues or homologues of actin genes from other plants. The amplified products of 370-380 bp were sequenced and submitted to GeneBank after their confirmation using different bioinformatics tools. All the novel partial sequences of putative actin genes were submitted to GeneBank (Parthenium hysterophorus (KJ774023), Fagonia indica (KJ774024), Rhazya stricta (KJ774025), Whithania coagulans (KJ774026), Capparis decidua (KJ774027), Verbena officinalis (KJ774028) and Aerva javanica (KJ774029)). The comparisons of these partial sequences by Basic Local Alignment Search Tool (BLAST) and phylogenetic trees demonstrated high similarity with known actin genes of other plants. Our findings illustrated highly conserved nature of actin gene among these selected plants. These novel partial fragments of actin genes from these wild medicinal plants can be used as internal controls for future gene expression studies of these important plants after precise validations of their stable expression in such plants. (author)
Zhao, Yanhong; Liao, Xiaofang; Huang, Zhipeng; Chen, Peng; Zhou, Bujin; Liu, Dongmei; Kong, Xiangjun; Zhou, Ruiyang
Chimeric genes resulting from the rearrangement of a mitochondrial genome were generally thought to be a causal factor in the occurrence of cytoplasmic male sterility (CMS). In the study, earlier we reported that identifying a 47 bp deletion at 3'- flanking of atp9 that was linked to male sterile cytoplasm in kenaf. The truncated fragment was fused with atp9, a mitochondrial transit signal (MTS) and/or GFP, comprised two chimeric genes MTS-HM184-GFP and MTS-HM184. The plant expression vector pBI121 containing chimeric genes were then introduced to tobacco plants by Agrobacterium-mediated T-DNA transformation. The result showed that certain transgenic plants were male sterility or semi-sterility, while some were not. The expression analysis further demonstrated that higher level of expression were showed in the sterility plants, while no expression or less expression in fertility plants, the levels of expression of semi-sterility were in between. And the sterile plant (containing MTS-HM184-GFP) had abnormal anther produced malformed/shriveled pollen grains stained negative that failed to germinate (0%), the corresponding fruits was shrunken, the semi-sterile plants having normal anther shape produced about 10-50% normal pollen grains, the corresponding fruits were not full, and the germination rate was 58%. Meanwhile these transgenic plants which altered on fertility were further analyzed in phenotype. As a result, the metamorphosis leaves were observed in the seedling stage, the plant height of transgenic plants was shorter than wild type. The growth duration of transgenic tobacco was delayed 30-45 days compared to the wild type. The copy numbers of target genes of transgenic tobacco were analyzed using the real-time quantitative method. The results showed that these transgenic plants targeting-expression in mitochondrial containing MTS-HM184-GFP had 1 copy and 2 copies, the other two plants containing MTS-HM184 both had 3 copies, but 0 copy in wild type. In
Full Text Available Breast cancers (BCs of the luminal B subtype are estrogen receptor-positive (ER+, highly proliferative, resistant to standard therapies and have a poor prognosis. To better understand this subtype we compared DNA copy number aberrations (CNAs, DNA promoter methylation, gene expression profiles, and somatic mutations in nine selected genes, in 32 luminal B tumors with those observed in 156 BCs of the other molecular subtypes. Frequent CNAs included 8p11-p12 and 11q13.1-q13.2 amplifications, 7q11.22-q34, 8q21.12-q24.23, 12p12.3-p13.1, 12q13.11-q24.11, 14q21.1-q23.1, 17q11.1-q25.1, 20q11.23-q13.33 gains and 6q14.1-q24.2, 9p21.3-p24,3, 9q21.2, 18p11.31-p11.32 losses. A total of 237 and 101 luminal B-specific candidate oncogenes and tumor suppressor genes (TSGs presented a deregulated expression in relation with their CNAs, including 11 genes previously reported associated with endocrine resistance. Interestingly, 88% of the potential TSGs are located within chromosome arm 6q, and seven candidate oncogenes are potential therapeutic targets. A total of 100 candidate oncogenes were validated in a public series of 5,765 BCs and the overexpression of 67 of these was associated with poor survival in luminal tumors. Twenty-four genes presented a deregulated expression in relation with a high DNA methylation level. FOXO3, PIK3CA and TP53 were the most frequent mutated genes among the nine tested. In a meta-analysis of next-generation sequencing data in 875 BCs, KCNB2 mutations were associated with luminal B cases while candidate TSGs MDN1 (6q15 and UTRN (6q24, were mutated in this subtype. In conclusion, we have reported luminal B candidate genes that may play a role in the development and/or hormone resistance of this aggressive subtype.
Aznar, Aude; Chalvin, Camille; Shih, Patrick M.
the ratio of C6 to C5 sugars in the cell wall and decreasing the lignin content are two important targets in engineering of plants that are more suitable for downstream processing for second-generation biofuel production.Results: We have studied the basic mechanisms of cell wall biosynthesis and identified...... genes involved in biosynthesis of pectic galactan, including the GALS1 galactan synthase and the UDP-galactose/UDP-rhamnose transporter URGT1. We have engineered plants with a more suitable biomass composition by applying these findings, in conjunction with synthetic biology and gene stacking tools...... to vessels where this polysaccharide is essential. Finally, the high galactan and low xylan traits were stacked with the low lignin trait obtained by expressing the QsuB gene encoding dehydroshikimate dehydratase in lignifying cells.Conclusion: The results show that approaches to increasing C6 sugar content...
Jazayeri, Roshanak; Hu, Hao; Fattahi, Zohreh; Musante, Luciana; Abedini, Seyedeh Sedigheh; Hosseini, Masoumeh; Wienker, Thomas F; Ropers, Hans Hilger; Najmabadi, Hossein; Kahrizi, Kimia
Intellectual disability (ID) is a neuro-developmental disorder which causes considerable socio-economic problems. Some ID individuals are also affected by ataxia, and the condition includes different mutations affecting several genes. We used whole exome sequencing (WES) in combination with homozygosity mapping (HM) to identify the genetic defects in five consanguineous families among our cohort study, with two affected children with ID and ataxia as major clinical symptoms. We identified three novel candidate genes, RIPPLY1, MRPL10, SNX14, and a new mutation in known gene SURF1. All are autosomal genes, except RIPPLY1, which is located on the X chromosome. Two are housekeeping genes, implicated in transcription and translation regulation and intracellular trafficking, and two encode mitochondrial proteins. The pathogenesis of these variants was evaluated by mutation classification, bioinformatic methods, review of medical and biological relevance, co-segregation studies in the particular family, and a normal population study. Linkage analysis and exome sequencing of a small number of affected family members is a powerful new technique which can be used to decrease the number of candidate genes in heterogenic disorders such as ID, and may even identify the responsible gene(s).
Guo, Sujuan; Pridham, Kevin J; Virbasius, Ching-Man
Dysregulated autophagy is central to the pathogenesis and therapeutic development of cancer. However, how autophagy is regulated in cancer is not well understood and genes that modulate cancer autophagy are not fully defined. To gain more insights into autophagy regulation in cancer, we performed...... with fluorescence-activated cell sorting, we successfully isolated autophagic K562 cells where we identified 336 short hairpin RNAs. After candidate validation using Cyto-ID fluorescence spectrophotometry, LC3B immunoblotting, and quantitative RT-PCR, 82 genes were identified as autophagy-regulating genes. 20 genes...... have been reported previously and the remaining 62 candidates are novel autophagy mediators. Bioinformatic analyses revealed that most candidate genes were involved in molecular pathways regulating autophagy, rather than directly participating in the autophagy process. Further autophagy flux assays...
Geisheker, Madeleine R.; Heymann, Gabriel; Wang, Tianyun; Coe, Bradley P.; Turner, Tychele N.; Stessman, Holly A.F.; Hoekzema, Kendra; Kvarnung, Malin; Shaw, Marie; Friend, Kathryn; Liebelt, Jan; Barnett, Christopher; Thompson, Elizabeth M.; Haan, Eric; Guo, Hui; Anderlid, Britt-Marie; Nordgren, Ann; Lindstrand, Anna; Vandeweyer, Geert; Alberti, Antonino; Avola, Emanuela; Vinci, Mirella; Giusto, Stefania; Pramparo, Tiziano; Pierce, Karen; Nalabolu, Srinivasa; Michaelson, Jacob J.; Sedlacek, Zdenek; Santen, Gijs W.E.; Peeters, Hilde; Hakonarson, Hakon; Courchesne, Eric; Romano, Corrado; Kooy, R. Frank; Bernier, Raphael A.; Nordenskjöld, Magnus; Gecz, Jozef; Xia, Kun; Zweifel, Larry S.; Eichler, Evan E.
Although de novo missense mutations have been predicted to account for more cases of autism than gene-truncating mutations, most research has focused on the latter. We identified the properties of de novo missense mutations in patients with neurodevelopmental disorders (NDDs) and highlight 35 genes with excess missense mutations. Additionally, 40 amino acid sites were recurrently mutated in 36 genes, and targeted sequencing of 20 sites in 17,689 NDD patients identified 21 new patients with identical missense mutations. One recurrent site (p.Ala636Thr) occurs in a glutamate receptor subunit, GRIA1. This same amino acid substitution in the homologous but distinct mouse glutamate receptor subunit Grid2 is associated with Lurcher ataxia. Phenotypic follow-up in five individuals with GRIA1 mutations shows evidence of specific learning disabilities and autism. Overall, we find significant clustering of de novo mutations in 200 genes, highlighting specific functional domains and synaptic candidate genes important in NDD pathology. PMID:28628100
Hammarlöf, Disa L; Canals, Rocío; Hinton, Jay C D
The availability of thousands of genome sequences of bacterial pathogens poses a particular challenge because each genome contains hundreds of genes of unknown function (FUN). How can we easily discover which FUN genes encode important virulence factors? One solution is to combine two different functional genomic approaches. First, transcriptomics identifies bacterial FUN genes that show differential expression during the process of mammalian infection. Second, global mutagenesis identifies individual FUN genes that the pathogen requires to cause disease. The intersection of these datasets can reveal a small set of candidate genes most likely to encode novel virulence attributes. We demonstrate this approach with the Salmonella infection model, and propose that a similar strategy could be used for other bacterial pathogens. Copyright © 2013 Elsevier Ltd. All rights reserved.
Filippis, Ioannis; Lopez-Cobollo, Rosa; Abbott, James; Butcher, Sarah; Bishop, Gerard J
Plant organs are made from multiple cell types, and defining the expression level of a gene in any one cell or group of cells from a complex mixture is difficult. Dicotyledonous plants normally have three distinct layers of cells, L1, L2 and L3. Layer L1 is the single layer of cells making up the epidermis, layer L2 the single cell sub-epidermal layer and layer L3 constitutes the rest of the internal cells. Here we show how it is possible to harvest an organ and characterise the level of layer-specific expression by using a periclinal chimera that has its L1 layer from Solanum pennellii and its L2 and L3 layers from Solanum lycopersicum. This is possible by measuring the level of the frequency of species-specific transcripts. RNA-seq analysis enabled the genome-wide assessment of whether a gene is expressed in the L1 or L2/L3 layers. From 13 277 genes that are expressed in both the chimera and the parental lines and with at least one polymorphism between the parental alleles, we identified 382 genes that are preferentially expressed in L1 in contrast to 1159 genes in L2/L3. Gene ontology analysis shows that many genes preferentially expressed in L1 are involved in cutin and wax biosynthesis, whereas numerous genes that are preferentially expressed in L2/L3 tissue are associated with chloroplastic processes. These data indicate the use of such chimeras and provide detailed information on the level of layer-specific expression of genes. © 2013 East Malling Research The Plant Journal © 2013 John Wiley & Sons Ltd.
Full Text Available Progress in understanding complex genetic diseases has been bolstered by synthetic approaches that overlay diverse data types and analyses to identify functionally important genes. Pre-term birth (PTB, a major complication of pregnancy, is a leading cause of infant mortality worldwide. A major obstacle in addressing PTB is that the mechanisms controlling parturition and birth timing remain poorly understood. Integrative approaches that overlay datasets derived from comparative genomics with function-derived ones have potential to advance our understanding of the genetics of birth timing, and thus provide insights into the genes that may contribute to PTB. We intersected data from fast evolving coding and non-coding gene regions in the human and primate lineage with data from genes expressed in the placenta, from genes that show enriched expression only in the placenta, as well as from genes that are differentially expressed in four distinct PTB clinical subtypes. A large fraction of genes that are expressed in placenta, and differentially expressed in PTB clinical subtypes (23-34% are fast evolving, and are associated with functions that include adhesion neurodevelopmental and immune processes. Functional categories of genes that express fast evolution in coding regions differ from those linked to fast evolution in non-coding regions. Finally, there is a surprising lack of overlap between fast evolving genes that are differentially expressed in four PTB clinical subtypes. Integrative approaches, especially those that incorporate evolutionary perspectives, can be successful in identifying potential genetic contributions to complex genetic diseases, such as PTB.
Peter E Larsen
Full Text Available In mycorrhizal symbiosis, plant roots form close, mutually beneficial interactions with soil fungi. Before this mycorrhizal interaction can be established however, plant roots must be capable of detecting potential beneficial fungal partners and initiating the gene expression patterns necessary to begin symbiosis. To predict a plant root – mycorrhizal fungi sensor systems, we analyzed in vitro experiments of Populus tremuloides (aspen tree and Laccaria bicolor (mycorrhizal fungi interaction and leveraged over 200 previously published transcriptomic experimental data sets, 159 experimentally validated plant transcription factor binding motifs, and more than 120-thousand experimentally validated protein-protein interactions to generate models of pre-mycorrhizal sensor systems in aspen root. These sensor mechanisms link extracellular signaling molecules with gene regulation through a network comprised of membrane receptors, signal cascade proteins, transcription factors, and transcription factor biding DNA motifs. Modeling predicted four pre-mycorrhizal sensor complexes in aspen that interact with fifteen transcription factors to regulate the expression of 1184 genes in response to extracellular signals synthesized by Laccaria. Predicted extracellular signaling molecules include common signaling molecules such as phenylpropanoids, salicylate, and, jasmonic acid. This multi-omic computational modeling approach for predicting the complex sensory networks yielded specific, testable biological hypotheses for mycorrhizal interaction signaling compounds, sensor complexes, and mechanisms of gene regulation.
Guo, Sujuan; Pridham, Kevin J; Virbasius, Ching-Man; He, Bin; Zhang, Liqing; Varmark, Hanne; Green, Michael R; Sheng, Zhi
Dysregulated autophagy is central to the pathogenesis and therapeutic development of cancer. However, how autophagy is regulated in cancer is not well understood and genes that modulate cancer autophagy are not fully defined. To gain more insights into autophagy regulation in cancer, we performed a large-scale RNA interference screen in K562 human chronic myeloid leukemia cells using monodansylcadaverine staining, an autophagy-detecting approach equivalent to immunoblotting of the autophagy marker LC3B or fluorescence microscopy of GFP-LC3B. By coupling monodansylcadaverine staining with fluorescence-activated cell sorting, we successfully isolated autophagic K562 cells where we identified 336 short hairpin RNAs. After candidate validation using Cyto-ID fluorescence spectrophotometry, LC3B immunoblotting, and quantitative RT-PCR, 82 genes were identified as autophagy-regulating genes. 20 genes have been reported previously and the remaining 62 candidates are novel autophagy mediators. Bioinformatic analyses revealed that most candidate genes were involved in molecular pathways regulating autophagy, rather than directly participating in the autophagy process. Further autophagy flux assays revealed that 57 autophagy-regulating genes suppressed autophagy initiation, whereas 21 candidates promoted autophagy maturation. Our RNA interference screen identifies identified genes that regulate autophagy at different stages, which helps decode autophagy regulation in cancer and offers novel avenues to develop autophagy-related therapies for cancer.
Full Text Available Apoptosis is the process of programmed cell death (PCD that occurs in multicellular organisms. This process of normal cell death is required to maintain the balance of homeostasis. In addition, some diseases, such as obesity, cancer, and neurodegenerative diseases, can be cured through apoptosis, which produces few side effects. An effective comprehension of the mechanisms underlying apoptosis will be helpful to prevent and treat some diseases. The identification of genes related to apoptosis is essential to uncover its underlying mechanisms. In this study, a computational method was proposed to identify novel candidate genes related to apoptosis. First, protein-protein interaction information was used to construct a weighted graph. Second, a shortest path algorithm was applied to the graph to search for new candidate genes. Finally, the obtained genes were filtered by a permutation test. As a result, 26 genes were obtained, and we discuss their likelihood of being novel apoptosis-related genes by collecting evidence from published literature.
Duffy, Supipi; Fam, Hok Khim; Wang, Yi Kan; Styles, Erin B.; Kim, Jung-Hyun; Ang, J. Sidney; Singh, Tejomayee; Larionov, Vladimir; Shah, Sohrab P.; Andrews, Brenda; Boerkoel, Cornelius F.; Hieter, Philip
Somatic copy number amplification and gene overexpression are common features of many cancers. To determine the role of gene overexpression on chromosome instability (CIN), we performed genome-wide screens in the budding yeast for yeast genes that cause CIN when overexpressed, a phenotype we refer to as dosage CIN (dCIN), and identified 245 dCIN genes. This catalog of genes reveals human orthologs known to be recurrently overexpressed and/or amplified in tumors. We show that two genes, TDP1, a tyrosyl-DNA-phosphdiesterase, and TAF12, an RNA polymerase II TATA-box binding factor, cause CIN when overexpressed in human cells. Rhabdomyosarcoma lines with elevated human Tdp1 levels also exhibit CIN that can be partially rescued by siRNA-mediated knockdown of TDP1. Overexpression of dCIN genes represents a genetic vulnerability that could be leveraged for selective killing of cancer cells through targeting of an unlinked synthetic dosage lethal (SDL) partner. Using SDL screens in yeast, we identified a set of genes that when deleted specifically kill cells with high levels of Tdp1. One gene was the histone deacetylase RPD3, for which there are known inhibitors. Both HT1080 cells overexpressing hTDP1 and rhabdomyosarcoma cells with elevated levels of hTdp1 were more sensitive to histone deacetylase inhibitors valproic acid (VPA) and trichostatin A (TSA), recapitulating the SDL interaction in human cells and suggesting VPA and TSA as potential therapeutic agents for tumors with elevated levels of hTdp1. The catalog of dCIN genes presented here provides a candidate list to identify genes that cause CIN when overexpressed in cancer, which can then be leveraged through SDL to selectively target tumors. PMID:27551064
Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan
Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data.
Full Text Available Abstract Background The identification of gene differential co-expression patterns between cancer stages is a newly developing method to reveal the underlying molecular mechanisms of carcinogenesis. Most researches of this subject lack an algorithm useful for performing a statistical significance assessment involving cancer progression. Lacking this specific algorithm is apparently absent in identifying precise gene pairs correlating to cancer progression. Results In this investigation we studied gene pair co-expression change by using a stochastic process model for approximating the underlying dynamic procedure of the co-expression change during cancer progression. Also, we presented a novel analytical method named 'Stochastic process model for Identifying differentially co-expressed Gene pair' (SIG method. This method has been applied to two well known prostate cancer data sets: hormone sensitive versus hormone resistant, and healthy versus cancerous. From these data sets, 428,582 gene pairs and 303,992 gene pairs were identified respectively. Afterwards, we used two different current statistical methods to the same data sets, which were developed to identify gene pair differential co-expression and did not consider cancer progression in algorithm. We then compared these results from three different perspectives: progression analysis, gene pair identification effectiveness analysis, and pathway enrichment analysis. Statistical methods were used to quantify the quality and performance of these different perspectives. They included: Re-identification Scale (RS and Progression Score (PS in progression analysis, True Positive Rate (TPR in gene pair analysis, and Pathway Enrichment Score (PES in pathway analysis. Our results show small values of RS and large values of PS, TPR, and PES; thus, suggesting that gene pairs identified by the SIG method are highly correlated with cancer progression, and highly enriched in disease-specific pathways. From
David G Ashbrook
Full Text Available Bipolar disorder (BD is a significant neuropsychiatric disorder with a lifetime prevalence of ~1%. To identify genetic variants underlying BD genome-wide association studies (GWAS have been carried out. While many variants of small effect associated with BD have been identified few have yet been confirmed, partly because of the low power of GWAS due to multiple comparisons being made. Complementary mapping studies using murine models have identified genetic variants for behavioral traits linked to BD, often with high power, but these identified regions often contain too many genes for clear identification of candidate genes. In the current study we have aligned human BD GWAS results and mouse linkage studies to help define and evaluate candidate genes linked to BD, seeking to use the power of the mouse mapping with the precision of GWAS. We use quantitative trait mapping for open field test and elevated zero maze data in the largest mammalian model system, the BXD recombinant inbred mouse population, to identify genomic regions associated with these BD-like phenotypes. We then investigate these regions in whole genome data from the Psychiatric Genomics Consortium’s bipolar disorder GWAS to identify candidate genes associated with BD. Finally we establish the biological relevance and pathways of these genes in a comprehensive systems genetics analysis.We identify four genes associated with both mouse anxiety and human BD. While TNR is a novel candidate for BD, we can confirm previously suggested associations with CMYA5, MCTP1 and RXRG. A cross-species, systems genetics analysis shows that MCTP1, RXRG and TNR coexpress with genes linked to psychiatric disorders and identify the striatum as a potential site of action. CMYA5, MCTP1, RXRG and TNR are associated with mouse anxiety and human BD. We hypothesize that MCTP1, RXRG and TNR influence intercellular signaling in the striatum.
Kong, Hualei; Tong, Pan; Zhao, Xiaodong; Sun, Jielin; Li, Hua
In the past decade, molecular classification of cancer has gained high popularity owing to its high predictive power on clinical outcomes as compared with traditional methods commonly used in clinical practice. In particular, using gene expression profiles, recent studies have successfully identified a number of gene sets for the delineation of cancer subtypes that are associated with distinct prognosis. However, identification of such gene sets remains a laborious task due to the lack of tools with flexibility, integration and ease of use. To reduce the burden, we have developed an R package, CAsubtype, to efficiently identify gene sets predictive of cancer subtypes and clinical outcomes. By integrating more than 13,000 annotated gene sets, CAsubtype provides a comprehensive repertoire of candidates for new cancer subtype identification. For easy data access, CAsubtype further includes the gene expression and clinical data of more than 2000 cancer patients from TCGA. CAsubtype first employs principal component analysis to identify gene sets (from user-provided or package-integrated ones) with robust principal components representing significantly large variation between cancer samples. Based on these principal components, CAsubtype visualizes the sample distribution in low-dimensional space for better understanding of the distinction between samples and classifies samples into subgroups with prevalent clustering algorithms. Finally, CAsubtype performs survival analysis to compare the clinical outcomes between the identified subgroups, assessing their clinical value as potentially novel cancer subtypes. In conclusion, CAsubtype is a flexible and well-integrated tool in the R environment to identify gene sets for cancer subtype identification and clinical outcome prediction. Its simple R commands and comprehensive data sets enable efficient examination of the clinical value of any given gene set, thus facilitating hypothesis generating and testing in biological and
Rawal, H C; Singh, N K; Sharma, T R
Genome-wide identification and phylogenetic and syntenic comparison were performed for the genes responsible for phenylalanine ammonia lyase (PAL) and peroxidase A (POX A) enzymes in nine plant species representing very diverse groups like legumes (Glycine max and Medicago truncatula), fruits (Vitis vinifera), cereals (Sorghum bicolor, Zea mays, and Oryza sativa), trees (Populus trichocarpa), and model dicot (Arabidopsis thaliana) and monocot (Brachypodium distachyon) species. A total of 87 and 1045 genes in PAL and POX A gene families, respectively, have been identified in these species. The phylogenetic and syntenic comparison along with motif distributions shows a high degree of conservation of PAL genes, suggesting that these genes may predate monocot/eudicot divergence. The POX A family genes, present in clusters at the subtelomeric regions of chromosomes, might be evolving and expanding with higher rate than the PAL gene family. Our analysis showed that during the expansion of POX A gene family, many groups and subgroups have evolved, resulting in a high level of functional divergence among monocots and dicots. These results will act as a first step toward the understanding of monocot/eudicot evolution and functional characterization of these gene families in the future.
H. C. Rawal
Full Text Available Genome-wide identification and phylogenetic and syntenic comparison were performed for the genes responsible for phenylalanine ammonia lyase (PAL and peroxidase A (POX A enzymes in nine plant species representing very diverse groups like legumes (Glycine max and Medicago truncatula, fruits (Vitis vinifera, cereals (Sorghum bicolor, Zea mays, and Oryza sativa, trees (Populus trichocarpa, and model dicot (Arabidopsis thaliana and monocot (Brachypodium distachyon species. A total of 87 and 1045 genes in PAL and POX A gene families, respectively, have been identified in these species. The phylogenetic and syntenic comparison along with motif distributions shows a high degree of conservation of PAL genes, suggesting that these genes may predate monocot/eudicot divergence. The POX A family genes, present in clusters at the subtelomeric regions of chromosomes, might be evolving and expanding with higher rate than the PAL gene family. Our analysis showed that during the expansion of POX A gene family, many groups and subgroups have evolved, resulting in a high level of functional divergence among monocots and dicots. These results will act as a first step toward the understanding of monocot/eudicot evolution and functional characterization of these gene families in the future.
Gesing, Stefan; Schindler, Daniel; Nowrousian, Minou
Ascomycetes differentiate four major morphological types of fruiting bodies (apothecia, perithecia, pseudothecia and cleistothecia) that are derived from an ancestral fruiting body. Thus, fruiting body differentiation is most likely controlled by a set of common core genes. One way to identify such genes is to search for genes with evolutionary conserved expression patterns. Using suppression subtractive hybridization (SSH), we selected differentially expressed transcripts in Pyronema confluens (Pezizales) by comparing two cDNA libraries specific for sexual and for vegetative development, respectively. The expression patterns of selected genes from both libraries were verified by quantitative real time PCR. Expression of several corresponding homologous genes was found to be conserved in two members of the Sordariales (Sordaria macrospora and Neurospora crassa), a derived group of ascomycetes that is only distantly related to the Pezizales. Knockout studies with N. crassa orthologues of differentially regulated genes revealed a functional role during fruiting body development for the gene NCU05079, encoding a putative MFS peptide transporter. These data indicate conserved gene expression patterns and a functional role of the corresponding genes during fruiting body development; such genes are candidates of choice for further functional analysis. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Sheng, Sheng; Liao, Cheng-Wu; Zheng, Yu; Zhou, Yu; Xu, Yan; Song, Wen-Miao; He, Peng; Zhang, Jian; Wu, Fu-An
Meteorus pulchricornis is an endoparasitoid wasp which attacks the larvae of various lepidopteran pests. We present the first antennal transcriptome dataset for M. pulchricornis. A total of 48,845,072 clean reads were obtained and 34,967 unigenes were assembled. Of these, 15,458 unigenes showed a significant similarity (E-value <10 -5 ) to known proteins in the NCBI non-redundant protein database. Gene ontology (GO) and cluster of orthologous groups (COG) analyses were used to classify the functions of M. pulchricornis antennae genes. We identified 16 putative odorant-binding protein (OBP) genes, eight chemosensory protein (CSP) genes, 99 olfactory receptor (OR) genes, 19 ionotropic receptor (IR) genes and one sensory neuron membrane protein (SNMP) gene. BLASTx best hit results and phylogenetic analysis both indicated that these chemosensory genes were most closely related to those found in other hymenopteran species. Real-time quantitative PCR assays showed that 14 MpulOBP genes were antennae-specific. Of these, MpulOBP6, MpulOBP9, MpulOBP10, MpulOBP12, MpulOBP15 and MpulOBP16 were found to have greater expression in the antennae than in other body parts, while MpulOBP2 and MpulOBP3 were expressed predominately in the legs and abdomens, respectively. These results might provide a foundation for future studies of olfactory genes and chemoreception in M. pulchricornis. Copyright © 2017 Elsevier Inc. All rights reserved.
Full Text Available The passion fruit (Passiflora edulis Sims, also known as the purple granadilla, is widely cultivated as the new darling of the fruit market throughout southern China. This exotic and perennial climber is adapted to warm and humid climates, and thus is generally intolerant of cold. There is limited information about gene regulation and signaling pathways related to the cold stress response in this species. In this study, two transcriptome libraries (KEDU_AP vs. GX_AP were constructed from the aerial parts of cold-tolerant and cold-susceptible varieties of P. edulis, respectively. Overall, 126,284,018 clean reads were obtained, and 86,880 unigenes with a mean size of 1449 bp were assembled. Of these, there were 64,067 (73.74% unigenes with significant similarity to publicly available plant protein sequences. Expression profiles were generated, and 3045 genes were found to be significantly differentially expressed between the KEDU_AP and GX_AP libraries, including 1075 (35.3% up-regulated and 1970 (64.7% down-regulated. These included 36 genes in enriched pathways of plant hormone signal transduction, and 56 genes encoding putative transcription factors. Six genes involved in the ICE1–CBF–COR pathway were induced in the cold-tolerant variety, and their expression levels were further verified using quantitative real-time PCR. This report is the first to identify genes and signaling pathways involved in cold tolerance using high-throughput transcriptome sequencing in P. edulis. These findings may provide useful insights into the molecular mechanisms regulating cold tolerance and genetic breeding in Passiflora spp.
Full Text Available Abstract Background To interpret microarray experiments, several ontological analysis tools have been developed. However, current tools are limited to specific organisms. Results We developed a bioinformatics system to assign the probe set sequences of any organism to a hierarchical functional classification modelled on KEGG ontology. The GeneBins database currently supports the functional classification of expression data from four Affymetrix arrays; Arabidopsis thaliana, Oryza sativa, Glycine max and Medicago truncatula. An online analysis tool to identify relevant functions is also provided. Conclusion GeneBins provides resources to interpret gene expression results from microarray experiments. It is available at http://bioinfoserver.rsbs.anu.edu.au/utils/GeneBins/
Sharma, Vivekanand; Law, Wayne; Balick, Michael J; Sarkar, Indra Neil
The growing amount of data describing historical medicinal uses of plants from digitization efforts provides the opportunity to develop systematic approaches for identifying potential plant-based therapies. However, the task of cataloguing plant use information from natural language text is a challenging task for ethnobotanists. To date, there have been only limited adoption of informatics approaches used for supporting the identification of ethnobotanical information associated with medicinal uses. This study explored the feasibility of using biomedical terminologies and natural language processing approaches for extracting relevant plant-associated therapeutic use information from historical biodiversity literature collection available from the Biodiversity Heritage Library. The results from this preliminary study suggest that there is potential utility of informatics methods to identify medicinal plant knowledge from digitized resources as well as highlight opportunities for improvement.
Chabot, Adrien; Shrit, Ralla A; Blekhman, Ran; Gilad, Yoav
Most phenotypic differences between human and chimpanzee are likely to result from differences in gene regulation, rather than changes to protein-coding regions. To date, however, only a handful of human-chimpanzee nucleotide differences leading to changes in gene regulation have been identified. To hone in on differences in regulatory elements between human and chimpanzee, we focused on 10 genes that were previously found to be differentially expressed between the two species. We then designed reporter gene assays for the putative human and chimpanzee promoters of the 10 genes. Of seven promoters that we found to be active in human liver cell lines, human and chimpanzee promoters had significantly different activity in four cases, three of which recapitulated the gene expression difference seen in the microarray experiment. For these three genes, we were therefore able to demonstrate that a change in cis influences expression differences between humans and chimpanzees. Moreover, using site-directed mutagenesis on one construct, the promoter for the DDA3 gene, we were able to identify three nucleotides that together lead to a cis regulatory difference between the species. High-throughput application of this approach can provide a map of regulatory element differences between humans and our close evolutionary relatives.
Forno, Erick; Wang, Ting; Yan, Qi; Brehm, John; Acosta-Perez, Edna; Colon-Semidey, Angel; Alvarez, Maria; Boutaoui, Nadia; Cloutier, Michelle M; Alcorn, John F; Canino, Glorisa; Chen, Wei; Celedón, Juan C
Childhood asthma is a complex disease. In this study, we aim to identify genes associated with childhood asthma through a multiomics "vertical" approach that integrates multiple analytical steps using linear and logistic regression models. In a case-control study of childhood asthma in Puerto Ricans (n = 1,127), we used adjusted linear or logistic regression models to evaluate associations between several analytical steps of omics data, including genome-wide (GW) genotype data, GW methylation, GW expression profiling, cytokine levels, asthma-intermediate phenotypes, and asthma status. At each point, only the top genes/single-nucleotide polymorphisms/probes/cytokines were carried forward for subsequent analysis. In step 1, asthma modified the gene expression-protein level association for 1,645 genes; pathway analysis showed an enrichment of these genes in the cytokine signaling system (n = 269 genes). In steps 2-3, expression levels of 40 genes were associated with intermediate phenotypes (asthma onset age, forced expiratory volume in 1 second, exacerbations, eosinophil counts, and skin test reactivity); of those, methylation of seven genes was also associated with asthma. Of these seven candidate genes, IL5RA was also significant in analytical steps 4-8. We then measured plasma IL-5 receptor α levels, which were associated with asthma age of onset and moderate-severe exacerbations. In addition, in silico database analysis showed that several of our identified IL5RA single-nucleotide polymorphisms are associated with transcription factors related to asthma and atopy. This approach integrates several analytical steps and is able to identify biologically relevant asthma-related genes, such as IL5RA. It differs from other methods that rely on complex statistical models with various assumptions.
Zeng, L W; Singh, R S
The genes responsible for hybrid male sterility in species crosses are usually identified by introgressing chromosome segments, monitored by visible markers, between closely related species by continuous backcrosses. This commonly used method, however, suffers from two problems. First, it relies on the availability of markers to monitor the introgressed regions and so the portion of the genome examined is limited to the marked regions. Secondly, the introgressed regions are usually large and it is impossible to tell if the effects of the introgressed regions are the result of single (or few) major genes or many minor genes (polygenes). Here we introduce a simple and general method for identifying putative major hybrid male sterility genes which is free of these problems. In this method, the actual hybrid male sterility genes (rather than markers), or tightly linked gene complexes with large effects, are selectively introgressed from one species into the background of another species by repeated backcrosses. This is performed by selectively backcrossing heterozygous (for hybrid male sterility gene or genes) females producing fertile and sterile sons in roughly equal proportions to males of either parental species. As no marker gene is required for this procedure, this method can be used with any species pairs that produce unisexual sterility. With the application of this method, a small X chromosome region of Drosophila mauritiana which produces complete hybrid male sterility (aspermic testes) in the background of D. simulans was identified. Recombination analysis reveals that this region contains a second major hybrid male sterility gene linked to the forked locus located at either 62.7 +/- 0.66 map units or at the centromere region of the X chromosome of D. mauritiana.
Gupta, Anika; Sun, Min Woo; Paskov, Kelley Marie; Stockham, Nate Tyler; Jung, Jae-Yoon; Wall, Dennis Paul
Despite mounting evidence for the strong role of genetics in the phenotypic manifestation of Autism Spectrum Disorder (ASD), the specific genes responsible for the variable forms of ASD remain undefined. ASD may be best explained by a combinatorial genetic model with varying epistatic interactions across many small effect mutations. Coalitional or cooperative game theory is a technique that studies the combined effects of groups of players, known as coalitions, seeking to identify players who tend to improve the performance--the relationship to a specific disease phenotype--of any coalition they join. This method has been previously shown to boost biologically informative signal in gene expression data but to-date has not been applied to the search for cooperative mutations among putative ASD genes. We describe our approach to highlight genes relevant to ASD using coalitional game theory on alteration data of 1,965 fully sequenced genomes from 756 multiplex families. Alterations were encoded into binary matrices for ASD (case) and unaffected (control) samples, indicating likely gene-disrupting, inherited mutations in altered genes. To determine individual gene contributions given an ASD phenotype, a "player" metric, referred to as the Shapley value, was calculated for each gene in the case and control cohorts. Sixty seven genes were found to have significantly elevated player scores and likely represent significant contributors to the genetic coordination underlying ASD. Using network and cross-study analysis, we found that these genes are involved in biological pathways known to be affected in the autism cases and that a subset directly interact with several genes known to have strong associations to autism. These findings suggest that coalitional game theory can be applied to large-scale genomic data to identify hidden yet influential players in complex polygenic disorders such as autism.
López, Camilo E; Acosta, Iván F; Jara, Carlos; Pedraza, Fabio; Gaitán-Solís, Eliana; Gallego, Gerardo; Beebe, Steve; Tohme, Joe
ABSTRACT A polymerase chain reaction approach using degenerate primers that targeted the conserved domains of cloned plant disease resistance genes (R genes) was used to isolate a set of 15 resistance gene analogs (RGAs) from common bean (Phaseolus vulgaris). Eight different classes of RGAs were obtained from nucleotide binding site (NBS)-based primers and seven from not previously described Toll/Interleukin-1 receptor-like (TIR)-based primers. Putative amino acid sequences of RGAs were significantly similar to R genes and contained additional conserved motifs. The NBS-type RGAs were classified in two subgroups according to the expected final residue in the kinase-2 motif. Eleven RGAs were mapped at 19 loci on eight linkage groups of the common bean genetic map constructed at Centro Internacional de Agricultura Tropical. Genetic linkage was shown for eight RGAs with partial resistance to anthracnose, angular leaf spot (ALS) and Bean golden yellow mosaic virus (BGYMV). RGA1 and RGA2 were associated with resistance loci to anthracnose and BGYMV and were part of two clusters of R genes previously described. A new major cluster was detected by RGA7 and explained up to 63.9% of resistance to ALS and has a putative contribution to anthracnose resistance. These results show the usefulness of RGAs as candidate genes to detect and eventually isolate numerous R genes in common bean.
Niu, Erli; Shang, Xiaoguang; Cheng, Chaoze; Bao, Jianghao; Zeng, Yanda; Cai, Caiping; Du, Xiongming; Guo, Wangzhen
COBRA-Like (COBL) genes, which encode a plant-specific glycosylphosphatidylinositol (GPI) anchored protein, have been proven to be key regulators in the orientation of cell expansion and cellulose crystallinity status. Genome-wide analysis has been performed in A. thaliana, O. sativa, Z. mays and S. lycopersicum, but little in Gossypium. Here we identified 19, 18 and 33 candidate COBL genes from three sequenced cotton species, diploid cotton G. raimondii, G. arboreum and tetraploid cotton G. hirsutum acc. TM-1, respectively. These COBL members were anchored onto 10 chromosomes in G. raimondii and could be divided into two subgroups. Expression patterns of COBL genes showed highly developmental and spatial regulation in G. hirsutum acc. TM-1. Of them, GhCOBL9 and GhCOBL13 were preferentially expressed at the secondary cell wall stage of fiber development and had significantly co-upregulated expression with cellulose synthase genes GhCESA4, GhCESA7 and GhCESA8. Besides, GhCOBL9 Dt and GhCOBL13 Dt were co-localized with previously reported cotton fiber quality quantitative trait loci (QTLs) and the favorable allele types of GhCOBL9 Dt had significantly positive correlations with fiber quality traits, indicating that these two genes might play an important role in fiber development. PMID:26710066
Iaria, Domenico; Chiappetta, Adriana; Muzzalupo, Innocenzo
In olive (Olea europaea L.), the processes controlling self-incompatibility are still unclear and the molecular basis underlying this process are still not fully characterized. In order to determine compatibility relationships, using next-generation sequencing techniques and a de novo transcriptome assembly strategy, we show that pollen tubes from different olive plants, grown in vitro in a medium containing its own pistil and in combination pollen/pistil from self-sterile and self-fertile cultivars, have a distinct gene expression profile and many of the differentially expressed sequences between the samples fall within gene families involved in the development of the pollen tube, such as lipase, carboxylesterase, pectinesterase, pectin methylesterase, and callose synthase. Moreover, different genes involved in signal transduction, transcription, and growth are overrepresented. The analysis also allowed us to identify members in actin and actin depolymerization factor and fibrin gene family and member of the Ca(2+) binding gene family related to the development and polarization of pollen apical tip. The whole transcriptomic analysis, through the identification of the differentially expressed transcripts set and an extended functional annotation analysis, will lead to a better understanding of the mechanisms of pollen germination and pollen tube growth in the olive.
Chen, Lei; Pan, Hongying; Zhang, Yu-Hang; Feng, Kaiyan; Kong, XiangYin; Huang, Tao; Cai, Yu-Dong
Bone and dental diseases are serious public health problems. Most current clinical treatments for these diseases can produce side effects. Regeneration is a promising therapy for bone and dental diseases, yielding natural tissue recovery with few side effects. Because soft tissues inside the bone and dentin are densely populated with nerves and vessels, the study of bone and dentin regeneration should also consider the co-regeneration of nerves and vessels. In this study, a network-based method to identify co-regeneration genes for bone, dentin, nerve and vessel was constructed based on an extensive network of protein-protein interactions. Three procedures were applied in the network-based method. The first procedure, searching, sought the shortest paths connecting regeneration genes of one tissue type with regeneration genes of other tissues, thereby extracting possible co-regeneration genes. The second procedure, testing, employed a permutation test to evaluate whether possible genes were false discoveries; these genes were excluded by the testing procedure. The last procedure, screening, employed two rules, the betweenness ratio rule and interaction score rule, to select the most essential genes. A total of seventeen genes were inferred by the method, which were deemed to contribute to co-regeneration of at least two tissues. All these seventeen genes were extensively discussed to validate the utility of the method.
Full Text Available Abstract Background Large-scale genomic studies often identify large gene lists, for example, the genes sharing the same expression patterns. The interpretation of these gene lists is generally achieved by extracting concepts overrepresented in the gene lists. This analysis often depends on manual annotation of genes based on controlled vocabularies, in particular, Gene Ontology (GO. However, the annotation of genes is a labor-intensive process; and the vocabularies are generally incomplete, leaving some important biological domains inadequately covered. Results We propose a statistical method that uses the primary literature, i.e. free-text, as the source to perform overrepresentation analysis. The method is based on a statistical framework of mixture model and addresses the methodological flaws in several existing programs. We implemented this method within a literature mining system, BeeSpace, taking advantage of its analysis environment and added features that facilitate the interactive analysis of gene sets. Through experimentation with several datasets, we showed that our program can effectively summarize the important conceptual themes of large gene sets, even when traditional GO-based analysis does not yield informative results. Conclusions We conclude that the current work will provide biologists with a tool that effectively complements the existing ones for overrepresentation analysis from genomic experiments. Our program, Genelist Analyzer, is freely available at: http://workerbee.igb.uiuc.edu:8080/BeeSpace/Search.jsp
Cross-species multiple environmental stress responses: An integrated approach to identify candidate genes for multiple stress tolerance in sorghum (Sorghum bicolor (L. Moench and related model species.
Adugna Abdi Woldesemayat
Full Text Available Crop response to the changing climate and unpredictable effects of global warming with adverse conditions such as drought stress has brought concerns about food security to the fore; crop yield loss is a major cause of concern in this regard. Identification of genes with multiple responses across environmental stresses is the genetic foundation that leads to crop adaptation to environmental perturbations.In this paper, we introduce an integrated approach to assess candidate genes for multiple stress responses across-species. The approach combines ontology based semantic data integration with expression profiling, comparative genomics, phylogenomics, functional gene enrichment and gene enrichment network analysis to identify genes associated with plant stress phenotypes. Five different ontologies, viz., Gene Ontology (GO, Trait Ontology (TO, Plant Ontology (PO, Growth Ontology (GRO and Environment Ontology (EO were used to semantically integrate drought related information.Target genes linked to Quantitative Trait Loci (QTLs controlling yield and stress tolerance in sorghum (Sorghum bicolor (L. Moench and closely related species were identified. Based on the enriched GO terms of the biological processes, 1116 sorghum genes with potential responses to 5 different stresses, such as drought (18%, salt (32%, cold (20%, heat (8% and oxidative stress (25% were identified to be over-expressed. Out of 169 sorghum drought responsive QTLs associated genes that were identified based on expression datasets, 56% were shown to have multiple stress responses. On the other hand, out of 168 additional genes that have been evaluated for orthologous pairs, 90% were conserved across species for drought tolerance. Over 50% of identified maize and rice genes were responsive to drought and salt stresses and were co-located within multifunctional QTLs. Among the total identified multi-stress responsive genes, 272 targets were shown to be co-localized within QTLs
Zhang, Tianxiao; Hou, Liping; Chen, David T; McMahon, Francis J; Wang, Jen-Chyong; Rice, John P
Bipolar disorder is a mental illness with lifetime prevalence of about 1%. Previous genetic studies have identified multiple chromosomal linkage regions and candidate genes that might be associated with bipolar disorder. The present study aimed to identify potential susceptibility variants for bipolar disorder using 6 related case samples from a four-generation family. A combination of exome sequencing and linkage analysis was performed to identify potential susceptibility variants for bipolar disorder. Our study identified a list of five potential candidate genes for bipolar disorder. Among these five genes, GRID1(Glutamate Receptor Delta-1 Subunit), which was previously reported to be associated with several psychiatric disorders and brain related traits, is particularly interesting. Variants with functional significance in this gene were identified from two cousins in our bipolar disorder pedigree. Our findings suggest a potential role for these genes and the related rare variants in the onset and development of bipolar disorder in this one family. Additional research is needed to replicate these findings and evaluate their patho-biological significance. Copyright © 2017 Elsevier B.V. All rights reserved.
Hasselbalch, Hans Carl; Skov, Vibe; Stauffer Larsen, Thomas
Identifying a distinct gene signature for myelofibrosis may yield novel information of the genes, which are responsible for progression of essential thrombocythemia and polycythemia vera towards myelofibrosis. We aimed at identifying a simple gene signature - composed of a few genes - which were...
Fatou K. Ndiaye
Full Text Available Objectives: Genome-wide association studies (GWAS have identified >100 loci independently contributing to type 2 diabetes (T2D risk. However, translational implications for precision medicine and for the development of novel treatments have been disappointing, due to poor knowledge of how these loci impact T2D pathophysiology. Here, we aimed to measure the expression of genes located nearby T2D associated signals and to assess their effect on insulin secretion from pancreatic beta cells. Methods: The expression of 104 candidate T2D susceptibility genes was measured in a human multi-tissue panel, through PCR-free expression assay. The effects of the knockdown of beta-cell enriched genes were next investigated on insulin secretion from the human EndoC-βH1 beta-cell line. Finally, we performed RNA-sequencing (RNA-seq so as to assess the pathways affected by the knockdown of the new genes impacting insulin secretion from EndoC-βH1, and we analyzed the expression of the new genes in mouse models with altered pancreatic beta-cell function. Results: We found that the candidate T2D susceptibility genes' expression is significantly enriched in pancreatic beta cells obtained by laser capture microdissection or sorted by flow cytometry and in EndoC-βH1 cells, but not in insulin sensitive tissues. Furthermore, the knockdown of seven T2D-susceptibility genes (CDKN2A, GCK, HNF4A, KCNK16, SLC30A8, TBC1D4, and TCF19 with already known expression and/or function in beta cells changed insulin secretion, supporting our functional approach. We showed first evidence for a role in insulin secretion of four candidate T2D-susceptibility genes (PRC1, SRR, ZFAND3, and ZFAND6 with no previous knowledge of presence and function in beta cells. RNA-seq in EndoC-βH1 cells with decreased expression of PRC1, SRR, ZFAND6, or ZFAND3 identified specific gene networks related to T2D pathophysiology. Finally, a positive correlation between the expression of Ins2 and the
Peñagaricano, Francisco; Zorrilla, Pilar; Naya, Hugo; Robello, Carlos; Urioste, Jorge I
The white coat colour of sheep is an important economic trait. For unknown reasons, some animals are born with, and others develop with time, black skin spots that can also produce pigmented fibres. The presence of pigmented fibres in the white wool significantly decreases the fibre quality. The aim of this work was to study gene expression in black spots (with and without pigmented fibres) and white skin by microarray techniques, in order to identify the possible genes involved in the development of this trait. Five unrelated Corriedale sheep were used and, for each animal, the three possible comparisons (three different hybridisations) between the three samples of interest were performed. Differential gene expression patterns were analysed using different t-test approaches. Most of the major genes with well-known roles in skin pigmentation, e.g. ASIP, MC1R and C-KIT, showed no significant difference in the gene expression between white skin and black spots. On the other hand, many of the differentially expressed genes (raw P-value spots. The gene expression of C-FOS and KLF4, transcription factors involved in the cellular response to external factors such as ultraviolet light, was validated by quantitative polymerase chain reaction (PCR). This exploratory study provides a list of candidate genes that could be associated with the development of black skin spots that should be studied in more detail. Characterisation of these genes will enable us to discern the molecular mechanisms involved in the development of this feature and, hence, increase our understanding of melanocyte biology and skin pigmentation. In sheep, understanding this phenomenon is a first step towards developing molecular tools to assist in the selection against the presence of pigmented fibres in white wool.
Aguileta, Gabriela; Lengelle, Juliette; Chiapello, Hélène; Giraud, Tatiana; Viaud, Muriel; Fournier, Elisabeth; Rodolphe, François; Marthey, Sylvain; Ducasse, Aurélie; Gendrault, Annie; Poulain, Julie; Wincker, Patrick; Gout, Lilian
The rapid evolution of particular genes is essential for the adaptation of pathogens to new hosts and new environments. Powerful methods have been developed for detecting targets of selection in the genome. Here we used divergence data to compare genes among four closely related fungal pathogens adapted to different hosts to elucidate the functions putatively involved in adaptive processes. For this goal, ESTs were sequenced in the specialist fungal pathogens Botrytis tulipae and Botrytis ficariarum, and compared with genome sequences of Botrytis cinerea and Sclerotinia sclerotiorum, responsible for diseases on over 200 plant species. A maximum likelihood-based analysis of 642 predicted orthologs detected 21 genes showing footprints of positive selection. These results were validated by resequencing nine of these genes in additional Botrytis species, showing they have also been rapidly evolving in other related species. Twenty of the 21 genes had not previously been identified as pathogenicity factors in B. cinerea, but some had functions related to plant-fungus interactions. The putative functions were involved in respiratory and energy metabolism, protein and RNA metabolism, signal transduction or virulence, similarly to what was detected in previous studies using the same approach in other pathogens. Mutants of B. cinerea were generated for four of these genes as a first attempt to elucidate their functions. Copyright © 2012 Elsevier B.V. All rights reserved.
Feride İffet Şahin
Full Text Available Mutations in the SRY gene prevent the differentiation of the fetal gonads to testes and cause developing female phenotype, and as a result sex reversal and pure gonadal dysgenesis (Swyer syndrome can be developed. Different types of mutations identified in the SRY gene are responsible for 15% of the gonadal dysgenesis. In this study, we report a new mutation (R132P in the High Mobility Group (HMG region of SRY gene was detected in a patient with primary amenorrhea who has 46,XY karyotype. This mutation leads to replacement of the polar and basic arginine with a nonpolar hydrophobic proline residue at aminoacid 132 in the nuclear localization signal region of the protein. With this case report we want to emphasize the genetic approach to the patients with gonadal dysgenesis. If Y chromosome is detected during cytogenetic analysis, revealing the presence of the SRY gene and identification of mutations in this gene by sequencing analysis is become important in.
Full Text Available Abstract Background Gene expression technologies have the ability to generate vast amounts of data, yet there often resides only limited resources for subsequent validation studies. This necessitates the ability to perform sorting and prioritization of the output data. Previously described methodologies have used functional pathways or transcriptional regulatory grouping to sort genes for further study. In this paper we demonstrate a comparative genomics based method to leverage data from animal models to prioritize genes for validation. This approach allows one to develop a disease-based focus for the prioritization of gene data, a process that is essential for systems that lack significant functional pathway data yet have defined animal models. This method is made possible through the use of highly controlled spotted cDNA slide production and the use of comparative bioinformatics databases without the use of cross-species slide hybridizations. Results Using gene expression profiling we have demonstrated a similar whole transcriptome gene expression patterns in prostate cancer cells from human and rat prostate cancer cell lines both at baseline expression levels and after treatment with physiologic concentrations of the proposed chemopreventive agent Selenium. Using both the human PC3 and rat PAII prostate cancer cell lines have gone on to identify a subset of one hundred and fifty-four genes that demonstrate a similar level of differential expression to Selenium treatment in both species. Further analysis and data mining for two genes, the Insulin like Growth Factor Binding protein 3, and Retinoic X Receptor alpha, demonstrates an association with prostate cancer, functional pathway links, and protein-protein interactions that make these genes prime candidates for explaining the mechanism of Selenium's chemopreventive effect in prostate cancer. These genes are subsequently validated by western blots showing Selenium based induction and using
Guo, Wei; Zhang, Bin; Li, Yan; Duan, Hui-Quan; Sun, Chao; Xu, Yun-Qiang; Feng, Shi-Qing
The present study aimed to reveal the potential genes associated with the pathogenesis of intervertebral disc degeneration (IDD) by analyzing microarray data using bioinformatics. Gene expression profiles of two regions of the intervertebral disc were compared between patients with IDD and controls. GSE70362 containing two groups of gene expression profiles, 16 nucleus pulposus (NP) samples from patients with IDD and 8 from controls, and 16 annulus fibrosus (AF) samples from patients with IDD and 8 from controls, was downloaded from the Gene Expression Omnibus database. A total of 93 and 114 differentially expressed genes (DEGs) were identified in NP and AF samples, respectively, using a limma software package for the R programming environment. Gene Ontology (GO) function enrichment analysis was performed to identify the associated biological functions of DEGs in IDD, which indicated that the DEGs may be involved in various processes, including cell adhesion, biological adhesion and extracellular matrix organization. Pathway enrichment analysis using the Kyoto Encyclopedia of Genes and Genomes (KEGG) demonstrated that the identified DEGs were potentially involved in focal adhesion and the p53 signaling pathway. Further analysis revealed that there were 35 common DEGs observed between the two regions (NP and AF), which may be further regulated by 6 clusters of microRNAs (miRNAs) retrieved with WebGestalt. The genes in the DEG‑miRNA regulatory network were annotated using GO function and KEGG pathway enrichment analysis, among which extracellular matrix organization was the most significant disrupted biological process and focal adhesion was the most significant dysregulated pathway. In addition, the result of protein‑protein interaction network modules demonstrated the involvement of inflammatory cytokine interferon signaling in IDD. These findings may not only advance the understanding of the pathogenesis of IDD, but also identify novel potential
Thomassen, Mads; Jochumsen, Kirsten M; Mogensen, Ole
the relation of gene expression and chromosomal position to identify chromosomal regions of importance for early recurrence of ovarian cancer. By use of *Gene Set Enrichment Analysis*, we have ranked chromosomal regions according to their association to survival. Over-representation analysis including 1...... using death (P = 0.015) and recurrence (P = 0.002) as outcome. The combined mutation score is strongly associated to upregulation of several growth factor pathways....
Qi, Xiaoxiao; Wu, Jun; Wang, Lifen; Li, Leiting; Cao, Yufen; Tian, Luming; Dong, Xingguang; Zhang, Shaoling
'Kuerlexiangli' (Pyrus sinkiangensis Yu), a native pear of Xinjiang, China, is an important agricultural fruit and primary export to the international market. However, fruit with persistent calyxes affect fruit shape and quality. Although several studies have looked into the physiological aspects of the calyx abscission process, the underlying molecular mechanisms remain unknown. In order to better understand the molecular basis of the process of calyx abscission, materials at three critical stages of regulation, with 6000 × Flusilazole plus 300 × PBO treatment (calyx abscising treatment) and 50 mg.L-1GA3 treatment (calyx persisting treatment), were collected and cDNA fragments were sequenced using digital transcript abundance measurements to identify candidate genes. Digital transcript abundance measurements was performed using high-throughput Illumina GAII sequencing on seven samples that were collected at three important stages of the calyx abscission process with chemical agent treatments promoting calyx abscission and persistence. Altogether more than 251,123,845 high quality reads were obtained with approximately 8.0 M raw data for each library. The values of 69.85%-71.90% of clean data in the digital transcript abundance measurements could be mapped to the pear genome database. There were 12,054 differentially expressed genes having Gene Ontology (GO) terms and associating with 251 Kyoto Encyclopedia of Genes and Genomes (KEGG) defined pathways. The differentially expressed genes correlated with calyx abscission were mainly involved in photosynthesis, plant hormone signal transduction, cell wall modification, transcriptional regulation, and carbohydrate metabolism. Furthermore, candidate calyx abscission-specific genes, e.g. Inflorescence deficient in abscission gene, were identified. Quantitative real-time PCR was used to confirm the digital transcript abundance measurements results. We identified candidate genes that showed highly dynamic changes in
Ramirez-Córdova, Jesús; Drnevich, Jenny; Madrigal-Pulido, Jaime Alberto; Arrizon, Javier; Allen, Kirk; Martínez-Velázquez, Moisés; Alvarez-Maya, Ikuri
During ethanol fermentation, yeast cells are exposed to stress due to the accumulation of ethanol, cell growth is altered and the output of the target product is reduced. For Agave beverages, like tequila, no reports have been published on the global gene expression under ethanol stress. In this work, we used microarray analysis to identify Saccharomyces cerevisiae genes involved in the ethanol response. Gene expression of a tequila yeast strain of S. cerevisiae (AR5) was explored by comparing global gene expression with that of laboratory strain S288C, both after ethanol exposure. Additionally, we used two different culture conditions, cells grown in Agave tequilana juice as a natural fermentation media or grown in yeast-extract peptone dextrose as artificial media. Of the 6368 S. cerevisiae genes in the microarray, 657 genes were identified that had different expression responses to ethanol stress due to strain and/or media. A cluster of 28 genes was found over-expressed specifically in the AR5 tequila strain that could be involved in the adaptation to tequila yeast fermentation, 14 of which are unknown such as yor343c, ylr162w, ygr182c, ymr265c, yer053c-a or ydr415c. These could be the most suitable genes for transforming tequila yeast to increase ethanol tolerance in the tequila fermentation process. Other genes involved in response to stress (RFC4, TSA1, MLH1, PAU3, RAD53) or transport (CYB2, TIP20, QCR9) were expressed in the same cluster. Unknown genes could be good candidates for the development of recombinant yeasts with ethanol tolerance for use in industrial tequila fermentation.
Wilson, Michael H; Holman, Tara J; Sørensen, Iben
Plant cell wall composition is important for regulating growth rates, especially in roots. However, neither analyses of cell wall composition nor transcriptomes on their own can comprehensively reveal which genes and processes are mediating growth and cell elongation rates. This study reveals...... the benefits of carrying out multiple analyses in combination. Sections of roots from five anatomically and functionally defined zones in Arabidopsis thaliana were prepared and divided into three biological replicates. We used glycan microarrays and antibodies to identify the major classes of glycans......)cellular localization of many epitopes. Extensins were localized in epidermal and cortex cell walls, while AGP glycans were specific to different tissues from root-hair cells to the stele. The transcriptome analysis found several gene families peaking in the REZ. These included a large family of peroxidases (which...
Burkholderia cenocepacia infection often leads to fatal cepacia syndrome in cystic fibrosis patients. However, antibiotic therapy rarely results in complete eradication of the pathogen due to its intrinsic resistance to many clinically available antibiotics. Recent attention has turned to the identification of essential genes as the proteins encoded by these genes may serve as potential targets for development of novel antimicrobials. In this study, we utilized TraDIS (Transposon Directed Insertion-site Sequencing) as a genome-wide screening tool to facilitate the identification of B. cenocepacia genes essential for its growth and viability. A transposon mutant pool consisting of approximately 500,000 mutants was successfully constructed, with more than 400,000 unique transposon insertion sites identified by computational analysis of TraDIS datasets. The saturated library allowed for the identification of 383 genes that were predicted to be essential in B. cenocepacia. We extended the application of TraDIS to identify conditionally essential genes required for in vitro growth and revealed an additional repertoire of 439 genes to be crucial for B. cenocepacia growth under nutrient-depleted conditions. The library of B. cenocepacia mutants can subsequently be subjected to various biologically related conditions to facilitate the discovery of genes involved in niche adaptation as well as pathogenicity and virulence.
Meiosis and gamete formation are processes that are essential for sexual reproduction in all eukaryotic organisms. Multiple intracellular and extracellular signals feed into pathways that converge on transcription factors that induce the expression of meiosis-specific genes. Once triggered the meiosis-specific gene expression program proceeds in a cascade that drives progress through the events of meiosis and gamete formation. Meiosis-specific gene expression is tightly controlled by a balance of positive and negative regulatory factors that respond to a plethora of signaling pathways. The budding yeast Saccharomyces cerevisiae has proven to be an outstanding model for the dissection of gametogenesis owing to the sophisticated genetic manipulations that can be performed with the cells. It is possible to use a variety selection and screening methods to identify genes and their functions. High-throughput screening technology has been developed to allow an array of all viable yeast gene deletion mutants to be screened for phenotypes and for regulators of gene expression. This chapter describes a protocol that has been used to screen a library of homozygous diploid yeast deletion strains to identify regulators of the meiosis-specific IME1 gene.
Cai, Zhiying; Li, Guohua; Lin, Chunhua; Shi, Tao; Zhai, Ligang; Chen, Yipeng; Huang, Guixiu
To gain more insight into the molecular mechanisms of Colletotrichum gloeosporioides pathogenesis, Agrobacterium tumefaciens-mediated transformation (ATMT) was used to identify mutants of C. gloeosporioides impaired in pathogenicity. An ATMT library of 4128 C. gloeosporioides transformants was generated. Transformants were screened for defects in pathogenicity with a detached copper brown leaf assay. 32 mutants showing reproducible pathogenicity defects were obtained. Southern blot analysis showed 60.4% of the transformants had single-site T-DNA integrations. 16 Genomic sequences flanking T-DNA were recovered from mutants by thermal asymmetric interlaced PCR, and were used to isolate the tagged genes from the genome sequence of wild-type C. gloeosporioides by Basic Local Alignment Search Tool searches against the local genome database of the wild-type C. gloeosporioides. One potential pathogenicity genes encoded calcium-translocating P-type ATPase. Six potential pathogenicity genes had no known homologs in filamentous fungi and were likely to be novel fungal virulence factors. Two putative genes encoded Glycosyltransferase family 28 domain-containing protein and Mov34/MPN/PAD-1 family protein, respectively. Five potential pathogenicity genes had putative function matched with putative protein of other Colletotrichum species. Two known C. gloeosporioides pathogenicity genes were also identified, the encoding Glomerella cingulata hard-surface induced protein and C. gloeosporioides regulatory subunit of protein kinase A gene involved in cAMP-dependent PKA signal transduction pathway. Copyright © 2013 Elsevier GmbH. All rights reserved.
Cedoz, Pierre-Louis; Prunello, Marcos; Brennan, Kevin; Gevaert, Olivier
DNA methylation is an important mechanism regulating gene transcription, and its role in carcinogenesis has been extensively studied. Hyper and hypomethylation of genes is a major mechanism of gene expression deregulation in a wide range of diseases. At the same time, high-throughput DNA methylation assays have been developed generating vast amounts of genome wide DNA methylation measurements. We developed MethylMix, an algorithm implemented in R to identify disease specific hyper and hypomethylated genes. Here we present a new version of MethylMix that automates the construction of DNA-methylation and gene expression datasets from The Cancer Genome Atlas (TCGA). More precisely, MethylMix 2.0 incorporates two major updates: the automated downloading of DNA methylation and gene expression datasets from TCGA and the automated preprocessing of such datasets: value imputation, batch correction and CpG sites clustering within each gene. The resulting datasets can subsequently be analyzed with MethylMix to identify transcriptionally predictive methylation states. We show that the Differential Methylation Values created by MethylMix can be used for cancer subtyping. email@example.com. https://bioconductor.org/packages/release/bioc/manuals/MethylMix/man/MethylMix.pdf. MethylMix 2.0 was implemented as an R package and is available in bioconductor.
Wong, Yee-Chin; Abd El Ghany, Moataz; Naeem, Raeece; Lee, Kok-Wei; Tan, Yung-Chie; Pain, Arnab; Nathan, Sheila
Burkholderia cenocepacia infection often leads to fatal cepacia syndrome in cystic fibrosis patients. However, antibiotic therapy rarely results in complete eradication of the pathogen due to its intrinsic resistance to many clinically available antibiotics. Recent attention has turned to the identification of essential genes as the proteins encoded by these genes may serve as potential targets for development of novel antimicrobials. In this study, we utilized TraDIS (Transposon Directed Insertion-site Sequencing) as a genome-wide screening tool to facilitate the identification of B. cenocepacia genes essential for its growth and viability. A transposon mutant pool consisting of approximately 500,000 mutants was successfully constructed, with more than 400,000 unique transposon insertion sites identified by computational analysis of TraDIS datasets. The saturated library allowed for the identification of 383 genes that were predicted to be essential in B. cenocepacia. We extended the application of TraDIS to identify conditionally essential genes required for in vitro growth and revealed an additional repertoire of 439 genes to be crucial for B. cenocepacia growth under nutrient-depleted conditions. The library of B. cenocepacia mutants can subsequently be subjected to various biologically related conditions to facilitate the discovery of genes involved in niche adaptation as well as pathogenicity and virulence.
Full Text Available Burkholderia cenocepacia infection often leads to fatal cepacia syndrome in cystic fibrosis patients. However, antibiotic therapy rarely results in complete eradication of the pathogen due to its intrinsic resistance to many clinically available antibiotics. Recent attention has turned to the identification of essential genes as the proteins encoded by these genes may serve as potential targets for development of novel antimicrobials. In this study, we utilized TraDIS (Transposon Directed Insertion-site Sequencing as a genome-wide screening tool to facilitate the identification of B. cenocepacia genes essential for its growth and viability. A transposon mutant pool consisting of approximately 500,000 mutants was successfully constructed, with more than 400,000 unique transposon insertion sites identified by computational analysis of TraDIS datasets. The saturated library allowed for the identification of 383 genes that were predicted to be essential in B. cenocepacia. We extended the application of TraDIS to identify conditionally essential genes required for in vitro growth and revealed an additional repertoire of 439 genes to be crucial for B. cenocepacia growth under nutrient-depleted conditions. The library of B. cenocepacia mutants can subsequently be subjected to various biologically related conditions to facilitate the discovery of genes involved in niche adaptation as well as pathogenicity and virulence.
Full Text Available Kernel starch content is an important trait in maize (Zea mays L. as it accounts for 65% to 75% of the dry kernel weight and positively correlates with seed yield. A number of starch synthesis-related genes have been identified in maize in recent years. However, many loci underlying variation in starch content among maize inbred lines still remain to be identified. The current study is a genome-wide association study that used a set of 263 maize inbred lines. In this panel, the average kernel starch content was 66.99%, ranging from 60.60% to 71.58% over the three study years. These inbred lines were genotyped with the SNP50 BeadChip maize array, which is comprised of 56,110 evenly spaced, random SNPs. Population structure was controlled by a mixed linear model (MLM as implemented in the software package TASSEL. After the statistical analyses, four SNPs were identified as significantly associated with starch content (P ≤ 0.0001, among which one each are located on chromosomes 1 and 5 and two are on chromosome 2. Furthermore, 77 candidate genes associated with starch synthesis were found within the 100-kb intervals containing these four QTLs, and four highly associated genes were within 20-kb intervals of the associated SNPs. Among the four genes, Glucose-1-phosphate adenylyltransferase (APS1; Gene ID GRMZM2G163437 is known as an important regulator of kernel starch content. The identified SNPs, QTLs, and candidate genes may not only be readily used for germplasm improvement by marker-assisted selection in breeding, but can also elucidate the genetic basis of starch content. Further studies on these identified candidate genes may help determine the molecular mechanisms regulating kernel starch content in maize and other important cereal crops.
At Formosa Plastics Corporation's plant in Point Comfort, Texas, a plant-wide assessment team analyzed process energy requirements, reviewed new technologies for applicability, and found ways to improve the plant's energy efficiency. The assessment team identified the energy requirements of each process and compared actual energy consumption with theoretical process requirements. The team estimated that total annual energy savings would be about 115,000 MBtu for natural gas and nearly 14 million kWh for electricity if the plant makes several improvements, which include upgrading the gas compressor impeller, improving the vent blower system, and recovering steam condensate for reuse. Total annual cost savings could be $1.5 million. The U.S. Department of Energy's Industrial Technologies Program cosponsored this assessment.
Li, Xiao-Jie; Li, Mo; Zhou, Ying; Hu, Shan; Hu, Rong; Chen, Yun; Li, Xue-Bao
RAV (related to ABI3/VP1) protein containing an AP2 domain in the N-terminal region and a B3 domain in the C-terminal region, which belongs to AP2 transcription factor family, is unique in higher plants. In this study, a gene (GhRAV1) encoding a RAV protein of 357 amino acids was identified in cotton (Gossypium hirsutum). Transient expression analysis of the eGFP:GhRAV1 fusion genes in tobacco (Nicotiana tabacum) epidermal cells revealed that GhRAV1 protein was localized in the cell nucleus. Quantitative RT-PCR analysis indicated that expression of GhRAV1 in cotton is induced by abscisic acid (ABA), NaCl and polyethylene glycol (PEG). Overexpression of GhRAV1 in Arabidopsis resulted in plant sensitive to ABA, NaCl and PEG. With abscisic acid (ABA) treatment, seed germination and green seedling rates of the GhRAV1 transgenic plants were remarkably lower than those of wild type. In the presence of NaCl, the seed germination and seedling growth of the GhRAV1 transgenic lines were inhibited greater than those of wild type. And chlorophyll content and maximum photochemical efficiency of the transgenic plants were significantly lower than those of wild type. Under drought stress, the GhRAV1 transgenic plants displayed more severe wilting than wild type. Furthermore, expressions of the stress-related genes were altered in the GhRAV1 transgenic Arabidopsis plants under high salinity and drought stresses. Collectively, our data suggested that GhRAV1 may be involved in response to high salinity and drought stresses through regulating expressions of the stress-related genes during cotton development.
Baltussen, Tim J H; Coolen, Jordy P M; Zoll, Jan; Verweij, Paul E; Melchers, Willem J G
Aspergillus fumigatus is a saprophytic fungus that extensively produces conidia. These microscopic asexually reproductive structures are small enough to reach the lungs. Germination of conidia followed by hyphal growth inside human lungs is a key step in the establishment of infection in immunocompromised patients. RNA-Seq was used to analyze the transcriptome of dormant and germinating A. fumigatus conidia. Construction of a gene co-expression network revealed four gene clusters (modules) correlated with a growth phase (dormant, isotropic growth, polarized growth). Transcripts levels of genes encoding for secondary metabolites were high in dormant conidia. During isotropic growth, transcript levels of genes involved in cell wall modifications increased. Two modules encoding for growth and cell cycle/DNA processing were associated with polarized growth. In addition, the co-expression network was used to identify highly connected intermodular hub genes. These genes may have a pivotal role in the respective module and could therefore be compelling therapeutic targets. Generally, cell wall remodeling is an important process during isotropic and polarized growth, characterized by an increase of transcripts coding for hyphal growth and cell cycle/DNA processing when polarized growth is initiated. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Golubovskaya, Vita M.; Ho, Baotran; Conroy, Jeffrey; Liu, Song; Wang, Dan; Cance, William G.
Focal Adhesion Kinase (FAK) is a non-receptor kinase that plays an important role in many cellular processes: adhesion, proliferation, invasion, angiogenesis, metastasis and survival. Recently, we have shown that Roslin 2 or R2 (1-benzyl-15,3,5,7-tetraazatricyclo[126.96.36.199~3,7~]decane) compound disrupts FAK and p53 proteins, activates p53 transcriptional activity, and blocks tumor growth. In this report we performed a microarray gene expression analysis of R2-treated HCT116 p53 +/+ and p53 −/− cells and detected 1484 genes that were significantly up- or down-regulated (p < 0.05) in HCT116 p53 +/+ cells but not in p53 −/− cells. Among up-regulated genes in HCT p53 +/+ cells we detected critical p53 targets: Mdm-2, Noxa-1, and RIP1. Among down-regulated genes, Met, PLK2, KIF14, BIRC2 and other genes were identified. In addition, a combination of R2 compound with M13 compound that disrupts FAK and Mmd-2 complex or R2 and Nutlin-1 that disrupts Mdm-2 and p53 decreased clonogenicity of HCT116 p53 +/+ colon cancer cells more significantly than each agent alone in a p53-dependent manner. Thus, the report detects gene expression profile in response to R2 treatment and demonstrates that the combination of drugs targeting FAK, Mdm-2, and p53 can be a novel therapy approach
This study was conducted to identify mutations in the homogentisate 1,2 dioxygenase gene (HGD) in alkaptonuria patients among Jordanian population. Blood samples were collected from four alkaptonuria patients, four carriers, and two healthy volunteers. DNA was isolated from peripheral blood. All 14 exons of the HGD gene were amplified using the polymerase chain reaction (PCR) technique. The PCR products were then purified and analyzed by sequencing. Five mutations were identified in our samples. Four of them were novel C1273A, T1046G, 551-552insG, T533G and had not been previously reported, and one mutation T847C has been described before. The types of mutations identified were two missense mutations, one splice site mutation, one frameshift mutation, and one polymorphism. We present the first molecular study of the HGD gene in Jordanian alkaptonuria patients. This study provides valuable information about the molecular basis of alkaptonuria in Jordanian population.
Halabi, Najeeb M.; Martinez, Alejandra; Al-Farsi, Halema; Mery, Eliane; Puydenus, Laurence; Pujol, Pascal; Khalak, Hanif G.; McLurcan, Cameron; Ferron, Gwenael; Querleu, Denis; Al-Azwani, Iman; Al-Dous, Eman; Mohamoud, Yasmin A.; Malek, Joel A.; Rafii, Arash
Identifying genes where a variant allele is preferentially expressed in tumors could lead to a better understanding of cancer biology and optimization of targeted therapy. However, tumor sample heterogeneity complicates standard approaches for detecting preferential allele expression. We therefore developed a novel approach combining genome and transcriptome sequencing data from the same sample that corrects for sample heterogeneity and identifies significant preferentially expressed alleles. We applied this analysis to epithelial ovarian cancer samples consisting of matched primary ovary and peritoneum and lymph node metastasis. We find that preferentially expressed variant alleles include germline and somatic variants, are shared at a relatively high frequency between patients, and are in gene networks known to be involved in cancer processes. Analysis at a patient level identifies patient-specific preferentially expressed alleles in genes that are targets for known drugs. Analysis at a site level identifies patterns of site specific preferential allele expression with similar pathways being impacted in the primary and metastasis sites. We conclude that genes with preferentially expressed variant alleles can act as cancer drivers and that targeting those genes could lead to new therapeutic strategies. PMID:26735499
Barker, Gregory A; Diamond, Scott L
Some barriers to DNA lipofection are well characterized; however, there is as yet no method of finding unknown pathways that impact the process. A druggable genome small-interfering RNA (siRNA) screen against 5,520 genes was tested for its effect on lipofection of human aortic endothelial cells (HAECs). We found 130 gene targets which, when silenced by pooled siRNAs (three siRNAs per gene), resulted in enhanced luminescence after lipofection (86 gene targets showed reduced expression). In confirmation tests with single siRNAs, 18 of the 130 hits showed enhanced lipofection with two or more individual siRNAs in the absence of cytotoxicity. Of these confirmed gene targets, we identified five leading candidates, two of which are isoforms of the regulatory subunit of protein phosphatase 2A (PP2A). The best candidate siRNA targeted the PPP2R2C gene and produced a 65% increase in luminescence from lipofection, with a quantitative PCR-validated knockdown of approximately 76%. Flow cytometric analysis confirmed that the silencing of the PPP2R2C gene resulted in an improvement of 10% in transfection efficiency, thereby demonstrating an increase in the number of transfected cells. These results show that an RNA interference (RNAi) high-throughput screen (HTS) can be applied to nonviral gene transfer. We have also demonstrated that siRNAs can be co-delivered with lipofected DNA to increase the transfection efficiency in vitro.
Full Text Available Few driver genes have been well established in esophageal squamous cell carcinoma (ESCC. Identification of the genomic aberrations that contribute to changes in gene expression profiles can be used to predict driver genes.We searched for driver genes in ESCC by integrative analysis of gene expression microarray profiles and copy number data. To narrow down candidate genes, we performed survival analysis on expression data and tested the genetic vulnerability of each genes using public RNAi screening data. We confirmed the results by performing RNAi experiments and evaluating the clinical relevance of candidate genes in an independent ESCC cohort.We found 10 significantly recurrent copy number alterations accompanying gene expression changes, including loci 11q13.2, 7p11.2, 3q26.33, and 17q12, which harbored CCND1, EGFR, SOX2, and ERBB2, respectively. Analysis of survival data and RNAi screening data suggested that GRB7, located on 17q12, was a driver gene in ESCC. In ESCC cell lines harboring 17q12 amplification, knockdown of GRB7 reduced the proliferation, migration, and invasion capacities of cells. Moreover, siRNA targeting GRB7 had a synergistic inhibitory effect when combined with trastuzumab, an anti-ERBB2 antibody. Survival analysis of the independent cohort also showed that high GRB7 expression was associated with poor prognosis in ESCC.Our integrative analysis provided important insights into ESCC pathogenesis. We identified GRB7 as a novel ESCC driver gene and potential new therapeutic target.
Full Text Available The bacterial plant pathogen Pseudomonas syringae pv. phaseolicola (Pph colonises the surface of common bean plants before moving into the interior of plant tissue, via wounds and stomata. In the intercellular spaces the pathogen proliferates in the apoplastic fluid and forms microcolonies (biofilms around plant cells. If the pathogen can suppress the plant's natural resistance response, it will cause halo blight disease. The process of resistance suppression is fairly well understood, but the mechanisms used by the pathogen in colonisation are less clear. We hypothesised that we could apply in vitro genetic screens to look for changes in motility, colony formation, and adhesion, which are proxies for infection, microcolony formation and cell adhesion. We made transposon (Tn mutant libraries of Pph strains 1448A and 1302A and found 106/1920 mutants exhibited alterations in colony morphology, motility and biofilm formation. Identification of the insertion point of the Tn identified within the genome highlighted, as expected, a number of altered motility mutants bearing mutations in genes encoding various parts of the flagellum. Genes involved in nutrient biosynthesis, membrane associated proteins, and a number of conserved hypothetical protein (CHP genes were also identified. A mutation of one CHP gene caused a positive increase in in planta bacterial growth. This rapid and inexpensive screening method allows the discovery of genes important for in vitro traits that can be correlated to roles in the plant interaction.
Bordeaux John M
Full Text Available Abstract Background Global transcriptional analysis of loblolly pine (Pinus taeda L. is challenging due to limited molecular tools. PtGen2, a 26,496 feature cDNA microarray, was fabricated and used to assess drought-induced gene expression in loblolly pine propagule roots. Statistical analysis of differential expression and weighted gene correlation network analysis were used to identify drought-responsive genes and further characterize the molecular basis of drought tolerance in loblolly pine. Results Microarrays were used to interrogate root cDNA populations obtained from 12 genotype × treatment combinations (four genotypes, three watering regimes. Comparison of drought-stressed roots with roots from the control treatment identified 2445 genes displaying at least a 1.5-fold expression difference (false discovery rate = 0.01. Genes commonly associated with drought response in pine and other plant species, as well as a number of abiotic and biotic stress-related genes, were up-regulated in drought-stressed roots. Only 76 genes were identified as differentially expressed in drought-recovered roots, indicating that the transcript population can return to the pre-drought state within 48 hours. Gene correlation analysis predicts a scale-free network topology and identifies eleven co-expression modules that ranged in size from 34 to 938 members. Network topological parameters identified a number of central nodes (hubs including those with significant homology (E-values ≤ 2 × 10-30 to 9-cis-epoxycarotenoid dioxygenase, zeatin O-glucosyltransferase, and ABA-responsive protein. Identified hubs also include genes that have been associated previously with osmotic stress, phytohormones, enzymes that detoxify reactive oxygen species, and several genes of unknown function. Conclusion PtGen2 was used to evaluate transcriptome responses in loblolly pine and was leveraged to identify 2445 differentially expressed genes responding to severe drought stress in
Background Global transcriptional analysis of loblolly pine (Pinus taeda L.) is challenging due to limited molecular tools. PtGen2, a 26,496 feature cDNA microarray, was fabricated and used to assess drought-induced gene expression in loblolly pine propagule roots. Statistical analysis of differential expression and weighted gene correlation network analysis were used to identify drought-responsive genes and further characterize the molecular basis of drought tolerance in loblolly pine. Results Microarrays were used to interrogate root cDNA populations obtained from 12 genotype × treatment combinations (four genotypes, three watering regimes). Comparison of drought-stressed roots with roots from the control treatment identified 2445 genes displaying at least a 1.5-fold expression difference (false discovery rate = 0.01). Genes commonly associated with drought response in pine and other plant species, as well as a number of abiotic and biotic stress-related genes, were up-regulated in drought-stressed roots. Only 76 genes were identified as differentially expressed in drought-recovered roots, indicating that the transcript population can return to the pre-drought state within 48 hours. Gene correlation analysis predicts a scale-free network topology and identifies eleven co-expression modules that ranged in size from 34 to 938 members. Network topological parameters identified a number of central nodes (hubs) including those with significant homology (E-values ≤ 2 × 10-30) to 9-cis-epoxycarotenoid dioxygenase, zeatin O-glucosyltransferase, and ABA-responsive protein. Identified hubs also include genes that have been associated previously with osmotic stress, phytohormones, enzymes that detoxify reactive oxygen species, and several genes of unknown function. Conclusion PtGen2 was used to evaluate transcriptome responses in loblolly pine and was leveraged to identify 2445 differentially expressed genes responding to severe drought stress in roots. Many of the
Jing, Shengli; Zhang, Lei; Ma, Yinhua; Liu, Bingfang; Zhao, Yan; Yu, Hangjin; Zhou, Xi; Qin, Rui; Zhu, Lili; He, Guangcun
Insects and plants have coexisted for over 350 million years and their interactions have affected ecosystems and agricultural practices worldwide. Variation in herbivorous insects' virulence to circumvent host resistance has been extensively documented. However, despite decades of investigation, the genetic foundations of virulence are currently unknown. The brown planthopper (Nilaparvata lugens) is the most destructive rice (Oryza sativa) pest in the world. The identification of the resistance gene Bph1 and its introduction in commercial rice varieties prompted the emergence of a new virulent brown planthopper biotype that was able to break the resistance conferred by Bph1. In this study, we aimed to construct a high density linkage map for the brown planthopper and identify the loci responsible for its virulence in order to determine their genetic architecture. Based on genotyping data for hundreds of molecular markers in three mapping populations, we constructed the most comprehensive linkage map available for this species, covering 96.6% of its genome. Fifteen chromosomes were anchored with 124 gene-specific markers. Using genome-wide scanning and interval mapping, the Qhp7 locus that governs preference for Bph1 plants was mapped to a 0.1 cM region of chromosome 7. In addition, two major QTLs that govern the rate of insect growth on resistant rice plants were identified on chromosomes 5 (Qgr5) and 14 (Qgr14). This is the first study to successfully locate virulence in the genome of this important agricultural insect by marker-based genetic mapping. Our results show that the virulence which overcomes the resistance conferred by Bph1 is controlled by a few major genes and that the components of virulence originate from independent genetic characters. The isolation of these loci will enable the elucidation of the molecular mechanisms underpinning the rice-brown planthopper interaction and facilitate the development of durable approaches for controlling this most
Full Text Available Insects and plants have coexisted for over 350 million years and their interactions have affected ecosystems and agricultural practices worldwide. Variation in herbivorous insects' virulence to circumvent host resistance has been extensively documented. However, despite decades of investigation, the genetic foundations of virulence are currently unknown. The brown planthopper (Nilaparvata lugens is the most destructive rice (Oryza sativa pest in the world. The identification of the resistance gene Bph1 and its introduction in commercial rice varieties prompted the emergence of a new virulent brown planthopper biotype that was able to break the resistance conferred by Bph1. In this study, we aimed to construct a high density linkage map for the brown planthopper and identify the loci responsible for its virulence in order to determine their genetic architecture. Based on genotyping data for hundreds of molecular markers in three mapping populations, we constructed the most comprehensive linkage map available for this species, covering 96.6% of its genome. Fifteen chromosomes were anchored with 124 gene-specific markers. Using genome-wide scanning and interval mapping, the Qhp7 locus that governs preference for Bph1 plants was mapped to a 0.1 cM region of chromosome 7. In addition, two major QTLs that govern the rate of insect growth on resistant rice plants were identified on chromosomes 5 (Qgr5 and 14 (Qgr14. This is the first study to successfully locate virulence in the genome of this important agricultural insect by marker-based genetic mapping. Our results show that the virulence which overcomes the resistance conferred by Bph1 is controlled by a few major genes and that the components of virulence originate from independent genetic characters. The isolation of these loci will enable the elucidation of the molecular mechanisms underpinning the rice-brown planthopper interaction and facilitate the development of durable approaches for
Lenburg, Marc E; Liou, Louis S; Gerry, Norman P; Frampton, Garrett M; Cohen, Herbert T; Christman, Michael F
Renal cell carcinoma is a common malignancy that often presents as a metastatic-disease for which there are no effective treatments. To gain insights into the mechanism of renal cell carcinogenesis, a number of genome-wide expression profiling studies have been performed. Surprisingly, there is very poor agreement among these studies as to which genes are differentially regulated. To better understand this lack of agreement we profiled renal cell tumor gene expression using genome-wide microarrays (45,000 probe sets) and compare our analysis to previous microarray studies. We hybridized total RNA isolated from renal cell tumors and adjacent normal tissue to Affymetrix U133A and U133B arrays. We removed samples with technical defects and removed probesets that failed to exhibit sequence-specific hybridization in any of the samples. We detected differential gene expression in the resulting dataset with parametric methods and identified keywords that are overrepresented in the differentially expressed genes with the Fisher-exact test. We identify 1,234 genes that are more than three-fold changed in renal tumors by t-test, 800 of which have not been previously reported to be altered in renal cell tumors. Of the only 37 genes that have been identified as being differentially expressed in three or more of five previous microarray studies of renal tumor gene expression, our analysis finds 33 of these genes (89%). A key to the sensitivity and power of our analysis is filtering out defective samples and genes that are not reliably detected. The widespread use of sample-wise voting schemes for detecting differential expression that do not control for false positives likely account for the poor overlap among previous studies. Among the many genes we identified using parametric methods that were not previously reported as being differentially expressed in renal cell tumors are several oncogenes and tumor suppressor genes that likely play important roles in renal cell
Hu, H; Haas, S A; Chelly, J; Van Esch, H; Raynaud, M; de Brouwer, A P M; Weinert, S; Froyen, G; Frints, S G M; Laumonnier, F; Zemojtel, T; Love, M I; Richard, H; Emde, A-K; Bienek, M; Jensen, C; Hambrock, M; Fischer, U; Langnick, C; Feldkamp, M; Wissink-Lindhout, W; Lebrun, N; Castelnau, L; Rucci, J; Montjean, R; Dorseuil, O; Billuart, P; Stuhlmann, T; Shaw, M; Corbett, M A; Gardner, A; Willis-Owen, S; Tan, C; Friend, K L; Belet, S; van Roozendaal, K E P; Jimenez-Pocquet, M; Moizard, M-P; Ronce, N; Sun, R; O'Keeffe, S; Chenna, R; van Bömmel, A; Göke, J; Hackett, A; Field, M; Christie, L; Boyle, J; Haan, E; Nelson, J; Turner, G; Baynam, G; Gillessen-Kaesbach, G; Müller, U; Steinberger, D; Budny, B; Badura-Stronka, M; Latos-Bieleńska, A; Ousager, L B; Wieacker, P; Rodríguez Criado, G; Bondeson, M-L; Annerén, G; Dufke, A; Cohen, M; Van Maldergem, L; Vincent-Delorme, C; Echenne, B; Simon-Bouy, B; Kleefstra, T; Willemsen, M; Fryns, J-P; Devriendt, K; Ullmann, R; Vingron, M; Wrogemann, K; Wienker, T F; Tzschach, A; van Bokhoven, H; Gecz, J; Jentsch, T J; Chen, W; Ropers, H-H; Kalscheuer, V M
X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes or loci are yet to be identified. Here, we have investigated 405 unresolved families with XLID. We employed massively parallel sequencing of all X-chromosome exons in the index males. The majority of these males were previously tested negative for copy number variations and for mutations in a subset of known XLID genes by Sanger sequencing. In total, 745 X-chromosomal genes were screened. After stringent filtering, a total of 1297 non-recurrent exonic variants remained for prioritization. Co-segregation analysis of potential clinically relevant changes revealed that 80 families (20%) carried pathogenic variants in established XLID genes. In 19 families, we detected likely causative protein truncating and missense variants in 7 novel and validated XLID genes (CLCN4, CNKSR2, FRMPD4, KLHL15, LAS1L, RLIM and USP27X) and potentially deleterious variants in 2 novel candidate XLID genes (CDK16 and TAF1). We show that the CLCN4 and CNKSR2 variants impair protein functions as indicated by electrophysiological studies and altered differentiation of cultured primary neurons from Clcn4(-/-) mice or after mRNA knock-down. The newly identified and candidate XLID proteins belong to pathways and networks with established roles in cognitive function and intellectual disability in particular. We suggest that systematic sequencing of all X-chromosomal genes in a cohort of patients with genetic evidence for X-chromosome locus involvement may resolve up to 58% of Fragile X-negative cases.
Loughman, James; Wildsoet, Christine F.; Williams, Cathy; Guggenheim, Jeremy A.
Purpose To test the hypothesis that genes known to cause clinical syndromes featuring myopia also harbor polymorphisms contributing to nonsyndromic refractive errors. Methods Clinical phenotypes and syndromes that have refractive errors as a recognized feature were identified using the Online Mendelian Inheritance in Man (OMIM) database. One hundred fifty-four unique causative genes were identified, of which 119 were specifically linked with myopia and 114 represented syndromic myopia (i.e., myopia and at least one other clinical feature). Myopia was the only refractive error listed for 98 genes and hyperopia and the only refractive error noted for 28 genes, with the remaining 28 genes linked to phenotypes with multiple forms of refractive error. Pathway analysis was carried out to find biological processes overrepresented within these sets of genes. Genetic variants located within 50 kb of the 119 myopia-related genes were evaluated for involvement in refractive error by analysis of summary statistics from genome-wide association studies (GWAS) conducted by the CREAM Consortium and 23andMe, using both single-marker and gene-based tests. Results Pathway analysis identified several biological processes already implicated in refractive error development through prior GWAS analyses and animal studies, including extracellular matrix remodeling, focal adhesion, and axon guidance, supporting the research hypothesis. Novel pathways also implicated in myopia development included mannosylation, glycosylation, lens development, gliogenesis, and Schwann cell differentiation. Hyperopia was found to be linked to a different pattern of biological processes, mostly related to organogenesis. Comparison with GWAS findings further confirmed that syndromic myopia genes were enriched for genetic variants that influence refractive errors in the general population. Gene-based analyses implicated 21 novel candidate myopia genes (ADAMTS18, ADAMTS2, ADAMTSL4, AGK, ALDH18A1, ASXL1, COL4A1
Warnat, Patrick; Oberthuer, André; Fischer, Matthias; Westermann, Frank; Eils, Roland; Brors, Benedikt
Neuroblastoma patients show heterogeneous clinical courses ranging from life-threatening progression to spontaneous regression. Recently, gene expression profiles of neuroblastoma tumours were associated with clinically different phenotypes. However, such data is still rare for important patient subgroups, such as patients with MYCN non-amplified advanced stage disease. Prediction of the individual course of disease and optimal therapy selection in this cohort is challenging. Additional research effort is needed to describe the patterns of gene expression in this cohort and to identify reliable prognostic markers for this subset of patients. We combined gene expression data from two studies in a meta-analysis in order to investigate differences in gene expression of advanced stage (3 or 4) tumours without MYCN amplification that show contrasting outcomes (alive or dead) at five years after initial diagnosis. In addition, a predictive model for outcome was generated. Gene expression profiles from 66 patients were included from two studies using different microarray platforms. In the combined data set, 72 genes were identified as differentially expressed by meta-analysis at a false discovery rate (FDR) of 8.33%. Meta-analysis detected 34 differentially expressed genes that were not found as significant in either single study. Outcome prediction based on data of both studies resulted in a predictive accuracy of 77%. Moreover, the genes that were differentially expressed in subgroups of advanced stage patients without MYCN amplification accurately separated MYCN amplified tumours from low stage tumours without MYCN amplification. Our findings support the hypothesis that neuroblastoma consists of two biologically distinct subgroups that differ by characteristic gene expression patterns, which are associated with divergent clinical outcome
Donaldson, Lara Elizabeth; Meier, Stuart Kurt
Cyclic nucleotides (CNs) are intracellular second messengers that play an important role in mediating physiological responses to environmental and developmental signals, in species ranging from bacteria to humans. In response to these signals, CNs are synthesized by nucleotidyl cyclases and then act by binding to and altering the activity of downstream target proteins known as cyclic nucleotide-binding proteins (CNBPs). A number of CNBPs have been identified across kingdoms including transcription factors, protein kinases, phosphodiesterases, and channels, all of which harbor conserved CN-binding domains. In plants however, few CNBPs have been identified as homology searches fail to return plant sequences with significant matches to known CNBPs. Recently, affinity pull-down techniques have been successfully used to identify CNBPs in animals and have provided new insights into CN signaling. The application of these techniques to plants has not yet been extensively explored and offers an alternative approach toward the unbiased discovery of novel CNBP candidates in plants. Here, an affinity pull-down technique for the identification of the plant CN interactome is presented. In summary, the method involves an extraction of plant proteins which is incubated with a CN-bait, followed by a series of increasingly stringent elutions that eliminates proteins in a sequential manner according to their affinity to the bait. The eluted and bait-bound proteins are separated by one-dimensional gel electrophoresis, excised, and digested with trypsin after which the resultant peptides are identified by mass spectrometry - techniques that are commonplace in proteomics experiments. The discovery of plant CNBPs promises to provide valuable insight into the mechanism of CN signal transduction in plants. © Springer Science+Business Media New York 2013.
Donaldson, Lara Elizabeth
Cyclic nucleotides (CNs) are intracellular second messengers that play an important role in mediating physiological responses to environmental and developmental signals, in species ranging from bacteria to humans. In response to these signals, CNs are synthesized by nucleotidyl cyclases and then act by binding to and altering the activity of downstream target proteins known as cyclic nucleotide-binding proteins (CNBPs). A number of CNBPs have been identified across kingdoms including transcription factors, protein kinases, phosphodiesterases, and channels, all of which harbor conserved CN-binding domains. In plants however, few CNBPs have been identified as homology searches fail to return plant sequences with significant matches to known CNBPs. Recently, affinity pull-down techniques have been successfully used to identify CNBPs in animals and have provided new insights into CN signaling. The application of these techniques to plants has not yet been extensively explored and offers an alternative approach toward the unbiased discovery of novel CNBP candidates in plants. Here, an affinity pull-down technique for the identification of the plant CN interactome is presented. In summary, the method involves an extraction of plant proteins which is incubated with a CN-bait, followed by a series of increasingly stringent elutions that eliminates proteins in a sequential manner according to their affinity to the bait. The eluted and bait-bound proteins are separated by one-dimensional gel electrophoresis, excised, and digested with trypsin after which the resultant peptides are identified by mass spectrometry - techniques that are commonplace in proteomics experiments. The discovery of plant CNBPs promises to provide valuable insight into the mechanism of CN signal transduction in plants. © Springer Science+Business Media New York 2013.
Shpak, Elena D
Multiple receptor-like kinases (RLKs) enable intercellular communication that coordinates growth and development of plant tissues. ERECTA family receptors (ERfs) are an ancient family of leucine-rich repeat RLKs that in Arabidopsis consists of three genes: ERECTA, ERL1, and ERL2. ERfs sense secreted cysteine-rich peptides from the EPF/EPFL family and transmit the signal through a MAP kinase cascade. This review discusses the functions of ERfs in stomata development, in regulation of longitudinal growth of aboveground organs, during reproductive development, and in the shoot apical meristem. In addition the role of ERECTA in plant responses to biotic and abiotic factors is examined. Elena D. Shpak (Corresponding author). © 2013 Institute of Botany, Chinese Academy of Sciences.
Bettini, Priscilla P; Marvasi, Massimiliano; Fani, Fabiola; Lazzara, Luigi; Cosi, Elena; Melani, Lorenzo; Mauro, Maria Luisa
Insertion of Agrobacterium rhizogenes rolB gene into plant genome affects plant development, hormone balance and defence. However, beside the current research, the overall transcriptional response and gene expression of rolB as a modulator in plant is unknown. Transformed rolB tomato plant (Solanum lycopersicum L.) cultivar Tondino has been used to investigate the differential expression profile. Tomato is a well-known model organism both at the genetic and molecular level, and one of the most important commercial food crops in the world. Through the construction and characterization of a cDNA subtracted library, we have investigated the differential gene expression between transgenic clones of rolB and control tomato and have evaluated genes specifically transcribed in transgenic rolB plants. Among the selected genes, five genes encoding for chlorophyll a/b binding protein, carbonic anhydrase, cytochrome b 6 /f complex Fe-S subunit, potassium efflux antiporter 3, and chloroplast small heat-shock protein, all involved in chloroplast function, were identified. Measurement of photosynthesis efficiency by the level of three different photosynthetic parameters (F v /F m , rETR, NPQ) showed rolB significant increase in non-photochemical quenching and a, b chlorophyll content. Our results point to highlight the role of rolB on plant fitness by improving photosynthesis. Copyright © 2016 Elsevier GmbH. All rights reserved.
Hao, Xinyuan; Horvath, David P.; Chao, Wun S.; Yang, Yajun; Wang, Xinchao; Xiao, Bin
Reliable reference selection for the accurate quantification of gene expression under various experimental conditions is a crucial step in qRT-PCR normalization. To date, only a few housekeeping genes have been identified and used as reference genes in tea plant. The validity of those reference genes are not clear since their expression stabilities have not been rigorously examined. To identify more appropriate reference genes for qRT-PCR studies on tea plant, we examined the expression stability of 11 candidate reference genes from three different sources: the orthologs of Arabidopsis traditional reference genes and stably expressed genes identified from whole-genome GeneChip studies, together with three housekeeping gene commonly used in tea plant research. We evaluated the transcript levels of these genes in 94 experimental samples. The expression stabilities of these 11 genes were ranked using four different computation programs including geNorm, Normfinder, BestKeeper, and the comparative ∆CT method. Results showed that the three commonly used housekeeping genes of CsTUBULIN1, CsACINT1 and Cs18S rRNA1 together with CsUBQ1 were the most unstable genes in all sample ranking order. However, CsPTB1, CsEF1, CsSAND1, CsCLATHRIN1 and CsUBC1 were the top five appropriate reference genes for qRT-PCR analysis in complex experimental conditions. PMID:25474086
Ahanger, Mohammad Abass; Akram, Nudrat Aisha; Ashraf, Muhammad; Alyemeni, Mohammed Nasser; Wijaya, Leonard; Ahmad, Parvaiz
Increasing global population, urbanization and industrialization are increasing the rate of conversion of arable land into wasteland. Supplying food to an ever-increasing population is one of the biggest challenges that agriculturalists and plant scientists are currently confronting. Environmental stresses make this situation even graver. Despite the induction of several tolerance mechanisms, sensitive plants often fail to survive under environmental extremes. New technological approaches are imperative. Conventional breeding methods have a limited potential to improve plant genomes against environmental stress. Recently, genetic engineering has contributed enormously to the development of genetically modified varieties of different crops such as cotton, maize, rice, canola and soybean. The identification of stress-responsive genes and their subsequent introgression or overexpression within sensitive crop species are now being widely carried out by plant scientists. Engineering of important tolerance pathways, like antioxidant enzymes, osmolyte accumulation, membrane-localized transporters for efficient compartmentation of deleterious ions and accumulation of essential elements and resistance against pests or pathogens is also an area that has been intensively researched. In this review, the role of biotechnology and its successes, prospects and challenges in developing stress-tolerant crop cultivars are discussed.
Yang, Feiling; Hu, Jinming; Wu, Ruidong
Suitable surrogates are critical for identifying optimal priority conservation areas (PCAs) to protect regional biodiversity. This study explored the efficiency of using endangered plants and animals as surrogates for identifying PCAs at the county level in Yunnan, southwest China. We ran the Dobson algorithm under three surrogate scenarios at 75% and 100% conservation levels and identified four types of PCAs. Assessment of the protection efficiencies of the four types of PCAs showed that endangered plants had higher surrogacy values than endangered animals but that the two were not substitutable; coupled endangered plants and animals as surrogates yielded a higher surrogacy value than endangered plants or animals as surrogates; the plant-animal priority areas (PAPAs) was the optimal among the four types of PCAs for conserving both endangered plants and animals in Yunnan. PAPAs could well represent overall species diversity distribution patterns and overlap with critical biogeographical regions in Yunnan. Fourteen priority units in PAPAs should be urgently considered as optimizing Yunnan’s protected area system. The spatial pattern of PAPAs at the 100% conservation level could be conceptualized into three connected conservation belts, providing a valuable reference for optimizing the layout of the in situ protected area system in Yunnan.
Thomassen, Mads; Tan, Qihua; Kruse, Torben A
Metastasis is believed to progress in several steps including different pathways but the determination and understanding of these mechanisms is still fragmentary. Microarray analysis of gene expression patterns in breast tumors has been used to predict outcome in recent studies. Besides classification of outcome, these global expression patterns may reflect biological mechanisms involved in metastasis of breast cancer. Our purpose has been to investigate pathways and transcription factors involved in metastasis by use of gene expression data sets. We have analyzed 8 publicly available gene expression data sets. A global approach, 'gene set enrichment analysis' as well as an approach focusing on a subset of significantly differently regulated genes, GenMAPP, has been applied to rank pathway gene sets according to differential regulation in metastasizing tumors compared to non-metastasizing tumors. Meta-analysis has been used to determine overrepresentation of pathways and transcription factors targets, concordant deregulated in metastasizing breast tumors, in several data sets. The major findings are up-regulation of cell cycle pathways and a metabolic shift towards glucose metabolism reflected in several pathways in metastasizing tumors. Growth factor pathways seem to play dual roles; EGF and PDGF pathways are decreased, while VEGF and sex-hormone pathways are increased in tumors that metastasize. Furthermore, migration, proteasome, immune system, angiogenesis, DNA repair and several signal transduction pathways are associated to metastasis. Finally several transcription factors e.g. E2F, NFY, and YY1 are identified as being involved in metastasis. By pathway meta-analysis many biological mechanisms beyond major characteristics such as proliferation are identified. Transcription factor analysis identifies a number of key factors that support central pathways. Several previously proposed treatment targets are identified and several new pathways that may
Tsoi, Lam C; Qin, Tingting; Slate, Elizabeth H; Zheng, W Jim
To utilize the large volume of gene expression information generated from different microarray experiments, several meta-analysis techniques have been developed. Despite these efforts, there remain significant challenges to effectively increasing the statistical power and decreasing the Type I error rate while pooling the heterogeneous datasets from public resources. The objective of this study is to develop a novel meta-analysis approach, Consistent Differential Expression Pattern (CDEP), to identify genes with common differential expression patterns across different datasets. We combined False Discovery Rate (FDR) estimation and the non-parametric RankProd approach to estimate the Type I error rate in each microarray dataset of the meta-analysis. These Type I error rates from all datasets were then used to identify genes with common differential expression patterns. Our simulation study showed that CDEP achieved higher statistical power and maintained low Type I error rate when compared with two recently proposed meta-analysis approaches. We applied CDEP to analyze microarray data from different laboratories that compared transcription profiles between metastatic and primary cancer of different types. Many genes identified as differentially expressed consistently across different cancer types are in pathways related to metastatic behavior, such as ECM-receptor interaction, focal adhesion, and blood vessel development. We also identified novel genes such as AMIGO2, Gem, and CXCL11 that have not been shown to associate with, but may play roles in, metastasis. CDEP is a flexible approach that borrows information from each dataset in a meta-analysis in order to identify genes being differentially expressed consistently. We have shown that CDEP can gain higher statistical power than other existing approaches under a variety of settings considered in the simulation study, suggesting its robustness and insensitivity to data variation commonly associated with microarray
Etienne G.J. Danchin
Full Text Available Nematodes have evolved the ability to parasitize plants on at least four independent occasions, with plant parasites present in Clades 1, 2, 10 and 12 of the phylum. In the case of Clades 10 and 12, horizontal gene transfer of plant cell wall degrading enzymes from bacteria and fungi has been implicated in the evolution of plant parasitism. We have used ribonucleic acid sequencing (RNAseq to generate reference transcriptomes for two economically important nematode species, Xiphinema index and Longidorus elongatus, representative of two genera within the early-branching Clade 2 of the phylum Nematoda. We used a transcriptome-wide analysis to identify putative horizontal gene transfer events. This represents the first in-depth transcriptome analysis from any plant-parasitic nematode of this clade. For each species, we assembled ~30 million Illumina reads into a reference transcriptome. We identified 62 and 104 transcripts, from X. index and L. elongatus, respectively, that were putatively acquired via horizontal gene transfer. By cross-referencing horizontal gene transfer prediction with a phylum-wide analysis of Pfam domains, we identified Clade 2-specific events. Of these, a GH12 cellulase from X. index was analysed phylogenetically and biochemically, revealing a likely bacterial origin and canonical enzymatic function. Horizontal gene transfer was previously shown to be a phenomenon that has contributed to the evolution of plant parasitism among nematodes. Our findings underline the importance and the extensiveness of this phenomenon in the evolution of plant-parasitic life styles in this speciose and widespread animal phylum.
Danchin, Etienne G.J.; Perfus-Barbeoch, Laetitia; Rancurel, Corinne; Thorpe, Peter; Da Rocha, Martine; Bajew, Simon; Neilson, Roy; Sokolova (Guzeeva), Elena; Da Silva, Corinne; Guy, Julie; Labadie, Karine; Esmenjaud, Daniel; Helder, Johannes; Jones, John T.
Nematodes have evolved the ability to parasitize plants on at least four independent occasions, with plant parasites present in Clades 1, 2, 10 and 12 of the phylum. In the case of Clades 10 and 12, horizontal gene transfer of plant cell wall degrading enzymes from bacteria and fungi has been implicated in the evolution of plant parasitism. We have used ribonucleic acid sequencing (RNAseq) to generate reference transcriptomes for two economically important nematode species, Xiphinema index and Longidorus elongatus, representative of two genera within the early-branching Clade 2 of the phylum Nematoda. We used a transcriptome-wide analysis to identify putative horizontal gene transfer events. This represents the first in-depth transcriptome analysis from any plant-parasitic nematode of this clade. For each species, we assembled ~30 million Illumina reads into a reference transcriptome. We identified 62 and 104 transcripts, from X. index and L. elongatus, respectively, that were putatively acquired via horizontal gene transfer. By cross-referencing horizontal gene transfer prediction with a phylum-wide analysis of Pfam domains, we identified Clade 2-specific events. Of these, a GH12 cellulase from X. index was analysed phylogenetically and biochemically, revealing a likely bacterial origin and canonical enzymatic function. Horizontal gene transfer was previously shown to be a phenomenon that has contributed to the evolution of plant parasitism among nematodes. Our findings underline the importance and the extensiveness of this phenomenon in the evolution of plant-parasitic life styles in this speciose and widespread animal phylum. PMID:29065523
Full Text Available Cymbidium ensifolium belongs to the genus Cymbidium of the orchid family. Owing to its spectacular flower morphology, C. ensifolium has considerable ecological and cultural value. However, limited genetic data is available for this non-model plant, and the molecular mechanism underlying floral organ identity is still poorly understood. In this study, we characterize the floral transcriptome of C. ensifolium and present, for the first time, extensive sequence and transcript abundance data of individual floral organs. After sequencing, over 10 Gb clean sequence data were generated and assembled into 111,892 unigenes with an average length of 932.03 base pairs, including 1,227 clusters and 110,665 singletons. Assembled sequences were annotated with gene descriptions, gene ontology, clusters of orthologous group terms, the Kyoto Encyclopedia of Genes and Genomes, and the plant transcription factor database. From these annotations, 131 flowering-associated unigenes, 61 CONSTANS-LIKE (COL unigenes and 90 floral homeotic genes were identified. In addition, four digital gene expression libraries were constructed for the sepal, petal, labellum and gynostemium, and 1,058 genes corresponding to individual floral organ development were identified. Among them, eight MADS-box genes were further investigated by full-length cDNA sequence analysis and expression validation, which revealed two APETALA1/AGL9-like MADS-box genes preferentially expressed in the sepal and petal, two AGAMOUS-like genes particularly restricted to the gynostemium, and four DEF-like genes distinctively expressed in different floral organs. The spatial expression of these genes varied distinctly in different floral mutant corresponding to different floral morphogenesis, which validated the specialized roles of them in floral patterning and further supported the effectiveness of our in silico analysis. This dataset generated in our study provides new insights into the molecular mechanisms
Xie, Dongwei; Dai, Zhigang; Yang, Zemao; Sun, Jian; Zhao, Debao; Yang, Xue; Zhang, Liguo; Tang, Qing; Su, Jianguang
Flax ( Linum usitatissimum L.) is an important cash crop, and its agronomic traits directly affect yield and quality. Molecular studies on flax remain inadequate because relatively few flax genes have been associated with agronomic traits or have been identified as having potential applications. To identify markers and candidate genes that can potentially be used for genetic improvement of crucial agronomic traits, we examined 224 specimens of core flax germplasm; specifically, phenotypic data for key traits, including plant height, technical length, number of branches, number of fruits, and 1000-grain weight were investigated under three environmental conditions before specific-locus amplified fragment sequencing (SLAF-seq) was employed to perform a genome-wide association study (GWAS) for these five agronomic traits. Subsequently, the results were used to screen single nucleotide polymorphism (SNP) loci and candidate genes that exhibited a significant correlation with the important agronomic traits. Our analyses identified a total of 42 SNP loci that showed significant correlations with the five important agronomic flax traits. Next, candidate genes were screened in the 10 kb zone of each of the 42 SNP loci. These SNP loci were then analyzed by a more stringent screening via co-identification using both a general linear model (GLM) and a mixed linear model (MLM) as well as co-occurrences in at least two of the three environments, whereby 15 final candidate genes were obtained. Based on these results, we determined that UGT and PL are candidate genes for plant height, GRAS and XTH are candidate genes for the number of branches, Contig1437 and LU0019C12 are candidate genes for the number of fruits, and PHO1 is a candidate gene for the 1000-seed weight. We propose that the identified SNP loci and corresponding candidate genes might serve as a biological basis for improving crucial agronomic flax traits.
Full Text Available Flax (Linum usitatissimum L. is an important cash crop, and its agronomic traits directly affect yield and quality. Molecular studies on flax remain inadequate because relatively few flax genes have been associated with agronomic traits or have been identified as having potential applications. To identify markers and candidate genes that can potentially be used for genetic improvement of crucial agronomic traits, we examined 224 specimens of core flax germplasm; specifically, phenotypic data for key traits, including plant height, technical length, number of branches, number of fruits, and 1000-grain weight were investigated under three environmental conditions before specific-locus amplified fragment sequencing (SLAF-seq was employed to perform a genome-wide association study (GWAS for these five agronomic traits. Subsequently, the results were used to screen single nucleotide polymorphism (SNP loci and candidate genes that exhibited a significant correlation with the important agronomic traits. Our analyses identified a total of 42 SNP loci that showed significant correlations with the five important agronomic flax traits. Next, candidate genes were screened in the 10 kb zone of each of the 42 SNP loci. These SNP loci were then analyzed by a more stringent screening via co-identification using both a general linear model (GLM and a mixed linear model (MLM as well as co-occurrences in at least two of the three environments, whereby 15 final candidate genes were obtained. Based on these results, we determined that UGT and PL are candidate genes for plant height, GRAS and XTH are candidate genes for the number of branches, Contig1437 and LU0019C12 are candidate genes for the number of fruits, and PHO1 is a candidate gene for the 1000-seed weight. We propose that the identified SNP loci and corresponding candidate genes might serve as a biological basis for improving crucial agronomic flax traits.
Full Text Available Abstract Background The hierarchical clustering tree (HCT with a dendrogram 1 and the singular value decomposition (SVD with a dimension-reduced representative map 2 are popular methods for two-way sorting the gene-by-array matrix map employed in gene expression profiling. While HCT dendrograms tend to optimize local coherent clustering patterns, SVD leading eigenvectors usually identify better global grouping and transitional structures. Results This study proposes a flipping mechanism for a conventional agglomerative HCT using a rank-two ellipse (R2E, an improved SVD algorithm for sorting purpose seriation by Chen 3 as an external reference. While HCTs always produce permutations with good local behaviour, the rank-two ellipse seriation gives the best global grouping patterns and smooth transitional trends. The resulting algorithm automatically integrates the desirable properties of each method so that users have access to a clustering and visualization environment for gene expression profiles that preserves coherent local clusters and identifies global grouping trends. Conclusion We demonstrate, through four examples, that the proposed method not only possesses better numerical and statistical properties, it also provides more meaningful biomedical insights than other sorting algorithms. We suggest that sorted proximity matrices for genes and arrays, in addition to the gene-by-array expression matrix, can greatly aid in the search for comprehensive understanding of gene expression structures. Software for the proposed methods can be obtained at http://gap.stat.sinica.edu.tw/Software/GAP.
Wang, Ying; Ding, Jia-tong; Yang, Hai-ming; Yan, Zheng-jie; Cao, Wei; Li, Yang-bai
Monochromatic light is widely applied to promote poultry reproductive performance, yet little is currently known regarding the mechanism by which light wavelengths affect pigeon reproduction. Recently, high-throughput sequencing technologies have been used to provide genomic information for solving this problem. In this study, we employed Illumina Hiseq 2000 to identify differentially expressed genes in ovary tissue from pigeons under blue and white light conditions and de novo transcriptome assembly to construct a comprehensive sequence database containing information on the mechanisms of follicle development. A total of 157,774 unigenes (mean length: 790 bp) were obtained by the Trinity program, and 35.83% of these unigenes were matched to genes in a non-redundant protein database. Gene description, gene ontology, and the clustering of orthologous group terms were performed to annotate the transcriptome assembly. Differentially expressed genes between blue and white light conditions included those related to oocyte maturation, hormone biosynthesis, and circadian rhythm. Furthermore, 17,574 SSRs and 533,887 potential SNPs were identified in this transcriptome assembly. This work is the first transcriptome analysis of the Columba ovary using Illumina technology, and the resulting transcriptome and differentially expressed gene data can facilitate further investigations into the molecular mechanism of the effect of blue light on follicle development and reproduction in pigeons and other bird species. PMID:26599806
Full Text Available Monochromatic light is widely applied to promote poultry reproductive performance, yet little is currently known regarding the mechanism by which light wavelengths affect pigeon reproduction. Recently, high-throughput sequencing technologies have been used to provide genomic information for solving this problem. In this study, we employed Illumina Hiseq 2000 to identify differentially expressed genes in ovary tissue from pigeons under blue and white light conditions and de novo transcriptome assembly to construct a comprehensive sequence database containing information on the mechanisms of follicle development. A total of 157,774 unigenes (mean length: 790 bp were obtained by the Trinity program, and 35.83% of these unigenes were matched to genes in a non-redundant protein database. Gene description, gene ontology, and the clustering of orthologous group terms were performed to annotate the transcriptome assembly. Differentially expressed genes between blue and white light conditions included those related to oocyte maturation, hormone biosynthesis, and circadian rhythm. Furthermore, 17,574 SSRs and 533,887 potential SNPs were identified in this transcriptome assembly. This work is the first transcriptome analysis of the Columba ovary using Illumina technology, and the resulting transcriptome and differentially expressed gene data can facilitate further investigations into the molecular mechanism of the effect of blue light on follicle development and reproduction in pigeons and other bird species.
Pei, Wuhong; Xu, Lisha; Huang, Sunny C; Pettie, Kade; Idol, Jennifer; Rissone, Alberto; Jimenez, Erin; Sinclair, Jason W; Slevin, Claire; Varshney, Gaurav K; Jones, MaryPat; Carrington, Blake; Bishop, Kevin; Huang, Haigen; Sood, Raman; Lin, Shuo; Burgess, Shawn M
Regenerative medicine holds great promise for both degenerative diseases and traumatic tissue injury which represent significant challenges to the health care system. Hearing loss, which affects hundreds of millions of people worldwide, is caused primarily by a permanent loss of the mechanosensory receptors of the inner ear known as hair cells. This failure to regenerate hair cells after loss is limited to mammals, while all other non-mammalian vertebrates tested were able to completely regenerate these mechanosensory receptors after injury. To understand the mechanism of hair cell regeneration and its association with regeneration of other tissues, we performed a guided mutagenesis screen using zebrafish lateral line hair cells as a screening platform to identify genes that are essential for hair cell regeneration, and further investigated how genes essential for hair cell regeneration were involved in the regeneration of other tissues. We created genetic mutations either by retroviral insertion or CRISPR/Cas9 approaches, and developed a high-throughput screening pipeline for analyzing hair cell development and regeneration. We screened 254 gene mutations and identified 7 genes specifically affecting hair cell regeneration. These hair cell regeneration genes fell into distinct and somewhat surprising functional categories. By examining the regeneration of caudal fin and liver, we found these hair cell regeneration genes often also affected other types of tissue regeneration. Therefore, our results demonstrate guided screening is an effective approach to discover regeneration candidates, and hair cell regeneration is associated with other tissue regeneration.
Full Text Available Understanding how the limb blastema is established after the initial wound healing response is an important aspect of regeneration research. Here we performed parallel expression profile time courses of healing lateral wounds versus amputated limbs in axolotl. This comparison between wound healing and regeneration allowed us to identify amputation-specific genes. By clustering the expression profiles of these samples, we could detect three distinguishable phases of gene expression - early wound healing followed by a transition-phase leading to establishment of the limb development program, which correspond to the three phases of limb regeneration that had been defined by morphological criteria. By focusing on the transition-phase, we identified 93 strictly amputation-associated genes many of which are implicated in oxidative-stress response, chromatin modification, epithelial development or limb development. We further classified the genes based on whether they were or were not significantly expressed in the developing limb bud. The specific localization of 53 selected candidates within the blastema was investigated by in situ hybridization. In summary, we identified a set of genes that are expressed specifically during regeneration and are therefore, likely candidates for the regulation of blastema formation.
Guardiola-Serrano, Francisca; Haendeler, Judith; Lukosz, Margarete; Sturm, Karsten; Melchner, Harald von; Altschmied, Joachim
Tumor necrosis factor alpha (TNFα) is a pleiotropic cytokine involved in apoptotic cell death, cellular proliferation, differentiation, inflammation, and tumorigenesis. In tumors it is secreted by tumor associated macrophages and can have both pro- and anti-tumorigenic effects. To identify genes regulated by TNFα, we performed a gene trap screen in the mammary carcinoma cell line MCF-7 and recovered 64 unique, TNFα-induced gene trap integration sites. Among these were the genes coding for the zinc finger protein ZC3H10 and for the transcription factor grainyhead-like 3 (GRHL3). In line with the dual effects of TNFα on tumorigenesis, we found that ZC3H10 inhibits anchorage independent growth in soft agar suggesting a tumor suppressor function, whereas GRHL3 strongly stimulated the migration of endothelial cells which is consistent with an angiogenic, pro-tumorigenic function
Nowrousian, Minou; Ringelberg, Carol; Dunlap, Jay C; Loros, Jennifer J; Kück, Ulrich
The filamentous fungus Sordaria macrospora forms complex three-dimensional fruiting bodies that protect the developing ascospores and ensure their proper discharge. Several regulatory genes essential for fruiting body development were previously isolated by complementation of the sterile mutants pro1, pro11 and pro22. To establish the genetic relationships between these genes and to identify downstream targets, we have conducted cross-species microarray hybridizations using cDNA arrays derived from the closely related fungus Neurospora crassa and RNA probes prepared from wild-type S. macrospora and the three developmental mutants. Of the 1,420 genes which gave a signal with the probes from all the strains used, 172 (12%) were regulated differently in at least one of the three mutants compared to the wild type, and 17 (1.2%) were regulated differently in all three mutant strains. Microarray data were verified by Northern analysis or quantitative real time PCR. Among the genes that are up- or down-regulated in the mutant strains are genes encoding the pheromone precursors, enzymes involved in melanin biosynthesis and a lectin-like protein. Analysis of gene expression in double mutants revealed a complex network of interaction between the pro gene products.
Melvin, Vida Senkus; Feng, Weiguo; Hernandez-Lagunas, Laura; Artinger, Kristin Bruk; Williams, Trevor
BACKGROUND The regulatory mechanisms underpinning facial development are conserved between diverse species. Therefore, results from model systems provide insight into the genetic causes of human craniofacial defects. Previously, we generated a comprehensive dataset examining gene expression during development and fusion of the mouse facial prominences. Here, we used this resource to identify genes that have dynamic expression patterns in the facial prominences, but for which only limited information exists concerning developmental function. RESULTS This set of ~80 genes was used for a high throughput functional analysis in the zebrafish system using Morpholino gene knockdown technology. This screen revealed three classes of cranial cartilage phenotypes depending upon whether knockdown of the gene affected the neurocranium, viscerocranium, or both. The targeted genes that produced consistent phenotypes encoded proteins linked to transcription (meis1, meis2a, tshz2, vgll4l), signaling (pkdcc, vlk, macc1, wu:fb16h09), and extracellular matrix function (smoc2). The majority of these phenotypes were not altered by reduction of p53 levels, demonstrating that both p53 dependent and independent mechanisms were involved in the craniofacial abnormalities. CONCLUSIONS This Morpholino-based screen highlights new genes involved in development of the zebrafish craniofacial skeleton with wider relevance to formation of the face in other species, particularly mouse and human. PMID:23559552
Full Text Available Antibiotics are often used to prevent sickness and improve production in animal agriculture, and the residues in animal bodies may enter tannery wastewater during leather production. This study aimed to use Illumina high-throughput sequencing to investigate the occurrence, diversity and abundance of antibiotic resistance genes (ARGs and mobile genetic elements (MGEs in aerobic and anaerobic sludge of a full-scale tannery wastewater treatment plant (WWTP. Metagenomic analysis showed that Proteobacteria, Firmicutes, Bacteroidetes and Actinobacteria dominated in the WWTP, but the relative abundance of archaea in anaerobic sludge was higher than in aerobic sludge. Sequencing reads from aerobic and anaerobic sludge revealed differences in the abundance of functional genes between both microbial communities. Genes coding for antibiotic resistance were identified in both communities. BLAST analysis against Antibiotic Resistance Genes Database (ARDB further revealed that aerobic and anaerobic sludge contained various ARGs with high abundance, among which sulfonamide resistance gene sul1 had the highest abundance, occupying over 20% of the total ARGs reads. Tetracycline resistance genes (tet were highly rich in the anaerobic sludge, among which tet33 had the highest abundance, but was absent in aerobic sludge. Over 70 types of insertion sequences were detected in each sludge sample, and class 1 integrase genes were prevalent in the WWTP. The results highlighted prevalence of ARGs and MGEs in tannery WWTPs, which may deserve more public health concerns.
REN, ZHONGLU; WANG, WENHUI; LI, JINMING
Identifying colon cancer subtypes based on molecular signatures may allow for a more rational, patient-specific approach to therapy in the future. Classifications using gene expression data have been attempted before with little concordance between the different studies carried out. In this study we aimed to uncover subtypes of colon cancer that have distinct biological characteristics and identify a set of novel biomarkers which could best reflect the clinical and/or biological characteristi...
Wang, Feng; Liang, Yuting; Jiang, Yuji; Yang, Yunfeng; Xue, Kai; Xiong, Jinbo; Zhou, Jizhong; Sun, Bo
Plants have an important impact on soil microbial communities and their functions. However, how plants determine the microbial composition and network interactions is still poorly understood. During a four-year field experiment, we investigated the functional gene composition of three types of soils (Phaeozem, Cambisols and Acrisol) under maize planting and bare fallow regimes located in cold temperate, warm temperate and subtropical regions, respectively. The core genes were identified using high-throughput functional gene microarray (GeoChip 3.0), and functional molecular ecological networks (fMENs) were subsequently developed with the random matrix theory (RMT)-based conceptual framework. Our results demonstrated that planting significantly (P soils and 83.5% of microbial alpha-diversity can be explained by the plant factor. Moreover, planting had significant impacts on the microbial community structure and the network interactions of the microbial communities. The calculated network complexity was higher under maize planting than under bare fallow regimes. The increase of the functional genes led to an increase in both soil respiration and nitrification potential with maize planting, indicating that changes in the soil microbial communities and network interactions influenced ecological functioning.
Ramos, Geovana Brotto; Salomão, Heloisa; Francio, Angela Schneider; Fava, Vinícius Medeiros; Werneck, Renata Iani; Mira, Marcelo Távora
Genetic studies have identified several genes and genomic regions contributing to the control of host susceptibility to leprosy. Here, we test variants of the positional and functional candidate gene SOD2 for association with leprosy in 2 independent population samples. Family-based analysis revealed an association between leprosy and allele G of marker rs295340 (P = .042) and borderline evidence of an association between leprosy and alleles C and A of markers rs4880 (P = .077) and rs5746136 (P = .071), respectively. Findings were validated in an independent case-control sample for markers rs295340 (P = .049) and rs4880 (P = .038). These results suggest SOD2 as a newly identified gene conferring susceptibility to leprosy. © The Author 2016. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail firstname.lastname@example.org.
Childs, Kevin [Michigan State Univ., East Lansing, MI (United States); Buell, Robin [Michigan State Univ., East Lansing, MI (United States); Zhao, Bingyu [Virginia Polytechnic Inst. and State Univ. (Virginia Tech), Blacksburg, VA (United States); Zhang, Xunzhong [Virginia Polytechnic Inst. and State Univ. (Virginia Tech), Blacksburg, VA (United States)
Switchgrass (Panicum virgatum) is a warm-season C4 grass that is a target lignocellulosic biofuel species for use in the United States due to its local adaption capabilities and high biomass accumulation. Two ecotypes of switchgrass have been described. Members of the lowland ecotype are taller, have narrower leaf blades and generate more biomass compared to individuals from the upland ecotype. Additionally, lowland plants are generally found in the southern United States while upland switchgrass is more typically present in the northern United States. These differences are important as it is envisioned that switchgrass for biofuel production will typically be grown on marginal lands in the northern United States to supplement and diversify farmers' traditional crop incomes. While lowland switchgrass is more productive, it has poor winter survivability in northern latitudes where upland switchgrass is expected to be grown for biofuel use. Abiotic stresses likely to be encountered by switchgrass include drought and salinity. Despite initially being described as preferring wetter environments, members of the lowland ecotype have been characterized as being more drought tolerant than plants of the upland ecotype. Nonetheless, direct trials have indicated that variation for drought tolerance exists in both ecotypes, but prior to this project, only a relatively small number of switchgrass lines had been tested for drought responses. Similarly, switchgrass cultivars have not been widely tested for salt tolerance, but a few studies have shown that even mild salt stress can inhibit growth. The effects of drought and salt stress on plant growth are complex. Both drought and salinity affect the osmotic potential of plant cells and negatively affect plant growth due to reduced water potential and reduced photosynthesis that results from lower stomatal conductance of CO2. Plants respond to drought and salt stress by activating genes that directly attempt to
Full Text Available Invasive animals have been linked to the extinctions of native wildlife, and to significant agricultural financial losses or impacts. Current approaches to control invasive species require ongoing resources and management over large geographic scales, and often result in the short-term suppression of populations. New and innovative approaches are warranted. Recently, the RNA guided gene drive system based on CRISPR/Cas9 is being proposed as a potential gene editing tool that could be used by wildlife managers as a non-lethal addition or alternative to help reduce pest animal populations. While regulatory control and social acceptance are crucial issues that must be addressed, there is an opportunity now to identify the knowledge and research gaps that exist for some important invasive species. Here we systematically determine the knowledge gaps for pest species for which gene drives could potentially be applied. We apply a conceptual ecological risk framework within the gene drive context within an Australian environment to identify key requirements for undertaking work on seven exemplar invasive species in Australia. This framework allows an evaluation of the potential research on an invasive species of interest and within a gene drive and risk context. We consider the currently available biological, genetic and ecological information for the house mouse, European red fox, feral cat, European rabbit, cane toad, black rat and European starling to evaluate knowledge gaps and identify candidate species for future research. We discuss these findings in the context of future thematic areas of research worth pursuing in preparation for a more formal assessment of the use of gene drives as a novel strategy for the control of these and other invasive species. Keywords: Invasive species, Gene drive, CRISPR, Pest management, Islands
Full Text Available Intellectual disability (ID is a major health problem mostly with an unknown etiology. Recently exome sequencing of individuals with ID identified novel genes implicated in the disease. Therefore the purpose of the present study was to identify the genetic cause of ID in one syndromic and two non-syndromic Pakistani families. Whole exome of three ID probands was sequenced. Missense variations in two plausible novel genes implicated in autosomal recessive ID were identified: lysine (K-specific methyltransferase 2B (KMT2B, zinc finger protein 589 (ZNF589, as well as hedgehog acyltransferase (HHAT with a de novo mutation with autosomal dominant mode of inheritance. The KMT2B recessive variant is the first report of recessive Kleefstra syndrome-like phenotype. Identification of plausible causative mutations for two recessive and a dominant type of ID, in genes not previously implicated in disease, underscores the large genetic heterogeneity of ID. These results also support the viewpoint that large number of ID genes converge on limited number of common networks i.e. ZNF589 belongs to KRAB-domain zinc-finger proteins previously implicated in ID, HHAT is predicted to affect sonic hedgehog, which is involved in several disorders with ID, KMT2B associated with syndromic ID fits the epigenetic module underlying the Kleefstra syndromic spectrum. The association of these novel genes in three different Pakistani ID families highlights the importance of screening these genes in more families with similar phenotypes from different populations to confirm the involvement of these genes in pathogenesis of ID.
Full Text Available Background/Aims: Pediatric sepsis is a disease that threatens life of children. The incidence of pediatric sepsis is higher in developing countries due to various reasons, such as insufficient immunization and nutrition, water and air pollution, etc. Exploring the potential genes via different methods is of significance for the prevention and treatment of pediatric sepsis. This study aimed to identify potential genes associated with pediatric sepsis utilizing analysis of gene network and entropy. Methods: The mRNA expression in the blood samples collected from 20 septic children and 30 healthy controls was quantified by using Affymetrix HG-U133A microarray. Two condition-specific protein-protein interaction networks (PINs, one for the healthy control and the other one for the children with sepsis, were deduced by combining the fundamental human PINs with gene expression profiles in the two phenotypes. Subsequently, distinct modules from the two conditional networks were extracted by adopting a maximal clique-merging approach. Delta entropy (ΔS was calculated between sepsis and control modules. Results: Then, key genes displaying changes in gene composition were identified by matching the control and sepsis modules. Two objective modules were obtained, in which ribosomal protein RPL4 and RPL9 as well as TOP2A were probably considered as the key genes differentiating sepsis from healthy controls. Conclusion: According to previous reports and this work, TOP2A is the potential gene therapy target for pediatric sepsis. The relationship between pediatric sepsis and RPL4 and RPL9 needs further investigation.
Cohen William W
Full Text Available Abstract Background One step in the model organism database curation process is to find, for each article, the identifier of every gene discussed in the article. We consider a relaxation of this problem suitable for semi-automated systems, in which each article is associated with a ranked list of possible gene identifiers, and experimentally compare methods for solving this geneId ranking problem. In addition to baseline approaches based on combining named entity recognition (NER systems with a "soft dictionary" of gene synonyms, we evaluate a graph-based method which combines the outputs of multiple NER systems, as well as other sources of information, and a learning method for reranking the output of the graph-based method. Results We show that named entity recognition (NER systems with similar F-measure performance can have significantly different performance when used with a soft dictionary for geneId-ranking. The graph-based approach can outperform any of its component NER systems, even without learning, and learning can further improve the performance of the graph-based ranking approach. Conclusion The utility of a named entity recognition (NER system for geneId-finding may not be accurately predicted by its entity-level F1 performance, the most common performance measure. GeneId-ranking systems are best implemented by combining several NER systems. With appropriate combination methods, usefully accurate geneId-ranking systems can be constructed based on easily-available resources, without resorting to problem-specific, engineered components.
Basyuni, M.; Wati, R.
The present study evaluates the bioinformatics methods to analyze twenty-four predicted polyprenol reductase genes from higher plants on GenBank as well as predicted the structure, composition, similarity, subcellular localization, and phylogenetic. The physicochemical properties of plant polyprenol showed diversity among the observed genes. The percentage of the secondary structure of plant polyprenol genes followed the ratio order of α helix > random coil > extended chain structure. The values of chloroplast but not signal peptide were too low, indicated that few chloroplast transit peptide in plant polyprenol reductase genes. The possibility of the potential transit peptide showed variation among the plant polyprenol reductase, suggested the importance of understanding the variety of peptide components of plant polyprenol genes. To clarify this finding, a phylogenetic tree was drawn. The phylogenetic tree shows several branches in the tree, suggested that plant polyprenol reductase genes grouped into divergent clusters in the tree.
Nicolas, Aude; Kenna, Kevin P.; Renton, Alan E.; Ticozzi, Nicola; Faghri, Faraz; Chia, Ruth; Dominov, Janice A.; Kenna, Brendan J.; Nalls, Mike A.; Keagle, Pamela; Rivera, Alberto M.; van Rheenen, Wouter; Murphy, Natalie A.; van Vugt, Joke J.F.A.; Geiger, Joshua T.; van der Spek, Rick; Pliner, Hannah A.; Smith, Bradley N.; Marangi, Giuseppe; Topp, Simon D.; Abramzon, Yevgeniya; Gkazi, Athina Soragia; Eicher, John D.; Kenna, Aoife; Logullo, Francesco O.; Simone, Isabella L.; Logroscino, Giancarlo; Salvi, Fabrizio; Bartolomei, Ilaria; Borghero, Giuseppe; Murru, Maria Rita; Costantino, Emanuela; Pani, Carla; Puddu, Roberta; Caredda, Carla; Piras, Valeria; Tranquilli, Stefania; Cuccu, Stefania; Corongiu, Daniela; Melis, Maurizio; Milia, Antonio; Marrosu, Francesco; Marrosu, Maria Giovanna; Floris, Gianluca; Cannas, Antonino; Capasso, Margherita; Caponnetto, Claudia; Mancardi, Gianluigi; Origone, Paola; Mandich, Paola; Conforti, Francesca L.; Cavallaro, Sebastiano; Mora, Gabriele; Marinou, Kalliopi; Sideri, Riccardo; Penco, Silvana; Mosca, Lorena; Lunetta, Christian; Pinter, Giuseppe Lauria; Corbo, Massimo; Riva, Nilo; Carrera, Paola; Volanti, Paolo; Mandrioli, Jessica; Fini, Nicola; Fasano, Antonio; Tremolizzo, Lucio; Arosio, Alessandro; Ferrarese, Carlo; Trojsi, Francesca; Tedeschi, Gioacchino; Monsurrò, Maria Rosaria; Piccirillo, Giovanni; Femiano, Cinzia; Ticca, Anna; Ortu, Enzo; La Bella, Vincenzo; Spataro, Rossella; Colletti, Tiziana; Sabatelli, Mario; Zollino, Marcella; Conte, Amelia; Luigetti, Marco; Lattante, Serena; Marangi, Giuseppe; Santarelli, Marialuisa; Petrucci, Antonio; Pugliatti, Maura; Pirisi, Angelo; Parish, Leslie D.; Occhineri, Patrizia; Giannini, Fabio; Battistini, Stefania; Ricci, Claudia; Benigni, Michele; Cau, Tea B.; Loi, Daniela; Calvo, Andrea; Moglia, Cristina; Brunetti, Maura; Barberis, Marco; Restagno, Gabriella; Casale, Federico; Marrali, Giuseppe; Fuda, Giuseppe; Ossola, Irene; Cammarosano, Stefania; Canosa, Antonio; Ilardi, Antonio; Manera, Umberto; Grassano, Maurizio; Tanel, Raffaella; Pisano, Fabrizio; Mora, Gabriele; Calvo, Andrea; Mazzini, Letizia; Riva, Nilo; Mandrioli, Jessica; Caponnetto, Claudia; Battistini, Stefania; Volanti, Paolo; La Bella, Vincenzo; Conforti, Francesca L.; Borghero, Giuseppe; Messina, Sonia; Simone, Isabella L.; Trojsi, Francesca; Salvi, Fabrizio; Logullo, Francesco O.; D'Alfonso, Sandra; Corrado, Lucia; Capasso, Margherita; Ferrucci, Luigi; Harms, Matthew B.; Goldstein, David B.; Shneider, Neil A.; Goutman, Stephen A.; Simmons, Zachary; Miller, Timothy M.; Chandran, Siddharthan; Pal, Suvankar; Manousakis, George; Appel, Stanley H.; Simpson, Ericka; Wang, Leo; Baloh, Robert H.; Gibson, Summer B.; Bedlack, Richard; Lacomis, David; Sareen, Dhruv; Sherman, Alexander; Bruijn, Lucie; Penny, Michelle; Moreno, Cristiane de Araujo Martins; Kamalakaran, Sitharthan; Goldstein, David B.; Allen, Andrew S.; Appel, Stanley; Baloh, Robert H.; Bedlack, Richard S.; Boone, Braden E.; Brown, Robert; Carulli, John P.; Chesi, Alessandra; Chung, Wendy K.; Cirulli, Elizabeth T.; Cooper, Gregory M.; Couthouis, Julien; Day-Williams, Aaron G.; Dion, Patrick A.; Gibson, Summer B.; Gitler, Aaron D.; Glass, Jonathan D.; Goldstein, David B.; Han, Yujun; Harms, Matthew B.; Harris, Tim; Hayes, Sebastian D.; Jones, Angela L.; Keebler, Jonathan; Krueger, Brian J.; Lasseigne, Brittany N.; Levy, Shawn E.; Lu, Yi Fan; Maniatis, Tom; McKenna-Yasek, Diane; Miller, Timothy M.; Myers, Richard M.; Petrovski, Slavé; Pulst, Stefan M.; Raphael, Alya R.; Ravits, John M.; Ren, Zhong; Rouleau, Guy A.; Sapp, Peter C.; Shneider, Neil A.; Simpson, Ericka; Sims, Katherine B.; Staropoli, John F.; Waite, Lindsay L.; Wang, Quanli; Wimbish, Jack R.; Xin, Winnie W.; Gitler, Aaron D.; Harris, Tim; Myers, Richard M.; Phatnani, Hemali; Kwan, Justin; Sareen, Dhruv; Broach, James R.; Simmons, Zachary; Arcila-Londono, Ximena; Lee, Edward B.; Van Deerlin, Vivianna M.; Shneider, Neil A.; Fraenkel, Ernest; Ostrow, Lyle W.; Baas, Frank; Zaitlen, Noah; Berry, James D.; Malaspina, Andrea; Fratta, Pietro; Cox, Gregory A.; Thompson, Leslie M.; Finkbeiner, Steve; Dardiotis, Efthimios; Miller, Timothy M.; Chandran, Siddharthan; Pal, Suvankar; Hornstein, Eran; MacGowan, Daniel J.L.; Heiman-Patterson, Terry D.; Hammell, Molly G.; Patsopoulos, Nikolaos A.; Dubnau, Joshua; Nath, Avindra; Phatnani, Hemali; Musunuri, Rajeeva Lochan; Evani, Uday Shankar; Abhyankar, Avinash; Zody, Michael C.; Kaye, Julia; Finkbeiner, Steven; Wyman, Stacia K.; LeNail, Alexander; Lima, Leandro; Fraenkel, Ernest; Rothstein, Jeffrey D.; Svendsen, Clive N.; Thompson, Leslie M.; Van Eyk, Jenny; Maragakis, Nicholas J.; Berry, James D.; Glass, Jonathan D.; Miller, Timothy M.; Kolb, Stephen J.; Baloh, Robert H.; Cudkowicz, Merit; Baxi, Emily; Kaye, Julia; Finkbeiner, Steven; Wyman, Stacia K.; Finkbeiner, Steven; LeNail, Alex; Lima, Leandro; Fraenkel, Ernest; Fraenkel, Ernest; Svendsen, Clive N.; Svendsen, Clive N.; Thompson, Leslie M.; Thompson, Leslie M.; Van Eyk, Jennifer E.; Berry, James D.; Berry, James D.; Miller, Timothy M.; Kolb, Stephen J.; Cudkowicz, Merit; Cudkowicz, Merit; Baxi, Emily; Benatar, Michael; Taylor, J. Paul; Wu, Gang; Rampersaud, Evadnie; Wuu, Joanne; Rademakers, Rosa; Züchner, Stephan; Schule, Rebecca; McCauley, Jacob; Hussain, Sumaira; Cooley, Anne; Wallace, Marielle; Clayman, Christine; Barohn, Richard; Statland, Jeffrey; Ravits, John M.; Swenson, Andrea; Jackson, Carlayne; Trivedi, Jaya; Khan, Shaida; Katz, Jonathan; Jenkins, Liberty; Burns, Ted; Gwathmey, Kelly; Caress, James; McMillan, Corey; Elman, Lauren; Pioro, Erik P.; Heckmann, Jeannine; So, Yuen; Walk, David; Maiser, Samuel; Zhang, Jinghui; Benatar, Michael; Taylor, J. Paul; Taylor, J. Paul; Rampersaud, Evadnie; Wu, Gang; Wuu, Joanne; Silani, Vincenzo; Ticozzi, Nicola; Gellera, Cinzia; Ratti, Antonia; Taroni, Franco; Lauria, Giuseppe; Verde, Federico; Fogh, Isabella; Tiloca, Cinzia; Comi, Giacomo P.; Sorarù, Gianni; Cereda, Cristina; D'Alfonso, Sandra; Corrado, Lucia; De Marchi, Fabiola; Corti, Stefania; Ceroni, Mauro; Mazzini, Letizia; Siciliano, Gabriele; Filosto, Massimiliano; Inghilleri, Maurizio; Peverelli, Silvia; Colombrita, Claudia; Poletti, Barbara; Maderna, Luca; Del Bo, Roberto; Gagliardi, Stella; Querin, Giorgia; Bertolin, Cinzia; Pensato, Viviana; Castellotti, Barbara; Lauria, Giuseppe; Verde, Federico; Fogh, Isabella; Tiloca, Cinzia; Fogh, Isabella; Comi, Giacomo P.; Sorarù, Gianni; Cereda, Cristina; Camu, William; Mouzat, Kevin; Lumbroso, Serge; Corcia, Philippe; Meininger, Vincent; Besson, Gérard; Lagrange, Emmeline; Clavelou, Pierre; Guy, Nathalie; Couratier, Philippe; Vourch, Patrick; Danel, Véronique; Bernard, Emilien; Lemasson, Gwendal; Corcia, Philippe; Laaksovirta, Hannu; Myllykangas, Liisa; Jansson, Lilja; Valori, Miko; Ealing, John; Hamdalla, Hisham; Rollinson, Sara; Pickering-Brown, Stuart; Orrell, Richard W.; Sidle, Katie C.; Malaspina, Andrea; Hardy, John; Singleton, Andrew B.; Johnson, Janel O.; Arepalli, Sampath; Sapp, Peter C.; McKenna-Yasek, Diane; Polak, Meraida; Asress, Seneshaw; Al-Sarraj, Safa; King, Andrew; Troakes, Claire; Vance, Caroline; de Belleroche, Jacqueline; Baas, Frank; ten Asbroek, Anneloor L.M.A.; Muñoz-Blanco, José Luis; Hernandez, Dena G.; Ding, Jinhui; Gibbs, J. Raphael; Scholz, Sonja W.; Scholz, Sonja W.; Floeter, Mary Kay; Campbell, Roy H.; Landi, Francesco; Bowser, Robert; Pulst, Stefan M.; Ravits, John M.; MacGowan, Daniel J.L.; Kirby, Janine; Pioro, Erik P.; Pamphlett, Roger; Broach, James; Gerhard, Glenn; Dunckley, Travis L.; Brady, Christopher B.; Brady, Christopher B.; Kowall, Neil W.; Troncoso, Juan C.; Le Ber, Isabelle; Mouzat, Kevin; Lumbroso, Serge; Mouzat, Kevin; Lumbroso, Serge; Heiman-Patterson, Terry D.; Heiman-Patterson, Terry D.; Kamel, Freya; Van Den Bosch, Ludo; Van Den Bosch, Ludo; Baloh, Robert H.; Strom, Tim M.; Meitinger, Thomas; Strom, Tim M.; Shatunov, Aleksey; Van Eijk, Kristel R.; de Carvalho, Mamede; de Carvalho, Mamede; Kooyman, Maarten; Middelkoop, Bas; Moisse, Matthieu; McLaughlin, Russell; Van Es, Michael A.; Weber, Markus; Boylan, Kevin B.; Van Blitterswijk, Marka; Rademakers, Rosa; Morrison, Karen; Basak, A. Nazli; Mora, Jesús S.; Drory, Vivian; Shaw, Pamela; Turner, Martin R.; Talbot, Kevin; Hardiman, Orla; Williams, Kelly L.; Fifita, Jennifer A.; Nicholson, Garth A.; Blair, Ian P.; Nicholson, Garth A.; Rouleau, Guy A.; Esteban-Pérez, Jesús; García-Redondo, Alberto; Al-Chalabi, Ammar; Al Kheifat, Ahmad; Al-Chalabi, Ammar; Andersen, Peter M.; Basak, A. Nazli; Blair, Ian P.; Chio, Adriano; Cooper-Knock, Jonathan; Corcia, Philippe; Couratier, Philippe; de Carvalho, Mamede; Dekker, Annelot; Drory, Vivian; Redondo, Alberto Garcia; Gotkine, Marc; Hardiman, Orla; Hide, Winston; Iacoangeli, Alfredo; Glass, Jonathan D.; Kenna, Kevin P.; Kiernan, Matthew; Kooyman, Maarten; Landers, John E.; McLaughlin, Russell; Middelkoop, Bas; Mill, Jonathan; Neto, Miguel Mitne; Moisse, Matthieu; Pardina, Jesus Mora; Morrison, Karen; Newhouse, Stephen; Pinto, Susana; Pulit, Sara; Robberecht, Wim; Shatunov, Aleksey; Shaw, Pamela; Shaw, Chris; Silani, Vincenzo; Sproviero, William; Tazelaar, Gijs; Ticozzi, Nicola; Van Damme, Philip; van den Berg, Leonard; van der Spek, Rick; Van Eijk, Kristel R.; Van Es, Michael A.; van Rheenen, Wouter; van Vugt, Joke J.F.A.; Veldink, Jan H.; Weber, Markus; Williams, Kelly L.; Van Damme, Philip; Robberecht, Wim; Zatz, Mayana; Robberecht, Wim; Bauer, Denis C.; Twine, Natalie A.; Rogaeva, Ekaterina; Zinman, Lorne; Ostrow, Lyle W.; Maragakis, Nicholas J.; Rothstein, Jeffrey D.; Simmons, Zachary; Cooper-Knock, Johnathan; Brice, Alexis; Goutman, Stephen A.; Feldman, Eva L.; Gibson, Summer B.; Taroni, Franco; Ratti, Antonia; Ratti, Antonia; Gellera, Cinzia; Van Damme, Philip; Robberecht, Wim; Fratta, Pietro; Sabatelli, Mario; Lunetta, Christian; Ludolph, Albert C.; Andersen, Peter M.; Weishaupt, Jochen H.; Camu, William; Trojanowski, John Q.; Van Deerlin, Vivianna M.; Brown, Robert H.; van den Berg, Leonard; Veldink, Jan H.; Harms, Matthew B.; Glass, Jonathan D.; Stone, David J.; Tienari, Pentti; Silani, Vincenzo; Silani, Vincenzo; Chiò, Adriano; Shaw, Christopher E.; Chiò, Adriano; Traynor, Bryan J.; Landers, John E.; Traynor, Bryan J.
To identify novel genes associated with ALS, we undertook two lines of investigation. We carried out a genome-wide association study comparing 20,806 ALS cases and 59,804 controls. Independently, we performed a rare variant burden analysis comparing 1,138 index familial ALS cases and 19,494
Background Identifying essential genes in bacteria supports to identify potential drug targets and an understanding of minimal requirements for a synthetic cell. However, experimentally assaying the essentiality of their coding genes is resource intensive and not feasible for all bacterial organisms, in particular if they are infective. Results We developed a machine learning technique to identify essential genes using the experimental data of genome-wide knock-out screens from one bacterial organism to infer essential genes of another related bacterial organism. We used a broad variety of topological features, sequence characteristics and co-expression properties potentially associated with essentiality, such as flux deviations, centrality, codon frequencies of the sequences, co-regulation and phyletic retention. An organism-wise cross-validation on bacterial species yielded reliable results with good accuracies (area under the receiver-operator-curve of 75% - 81%). Finally, it was applied to drug target predictions for Salmonella typhimurium. We compared our predictions to the viability of experimental knock-outs of S. typhimurium and identified 35 enzymes, which are highly relevant to be considered as potential drug targets. Specifically, we detected promising drug targets in the non-mevalonate pathway. Conclusions Using elaborated features characterizing network topology, sequence information and microarray data enables to predict essential genes from a bacterial reference organism to a related query organism without any knowledge about the essentiality of genes of the query organism. In general, such a method is beneficial for inferring drug targets when experimental data about genome-wide knockout screens is not available for the investigated organism. PMID:20438628
Full Text Available Gastric cancer continues to be one of the deadliest cancers in the world and therefore identification of new drugs targeting this type of cancer is thus of significant importance. The purpose of this study was to identify and validate a therapeutic agent which might improve the outcomes for gastric cancer patients in the future.Using microarray technology, we generated a gene expression profile of human gastric cancer-specific genes from human gastric cancer tissue samples. We used this profile in the Broad Institute's Connectivity Map analysis to identify candidate therapeutic compounds for gastric cancer. We found the histone deacetylase inhibitor vorinostat as the lead compound and thus a potential therapeutic drug for gastric cancer. Vorinostat induced both apoptosis and autophagy in gastric cancer cell lines. Pharmacological and genetic inhibition of autophagy however, increased the therapeutic efficacy of vorinostat, indicating that a combination of vorinostat with autophagy inhibitors may therapeutically be more beneficial. Moreover, gene expression analysis of gastric cancer identified a collection of genes (ITGB5, TYMS, MYB, APOC1, CBX5, PLA2G2A, and KIF20A whose expression was elevated in gastric tumor tissue and downregulated more than 2-fold by vorinostat treatment in gastric cancer cell lines. In contrast, SCGB2A1, TCN1, CFD, APLP1, and NQO1 manifested a reversed pattern.We showed that analysis of gene expression signature may represent an emerging approach to discover therapeutic agents for gastric cancer, such as vorinostat. The observation of altered gene expression after vorinostat treatment may provide the clue to identify the molecular mechanism of vorinostat and those patients likely to benefit from vorinostat treatment.
Choi, Woonyoung; Park, Yun-Yong; Kim, KyoungHyun; Kim, Sang-Bae; Lee, Ju-Seog; Mills, Gordon B.; Cho, Jae Yong
Background Gastric cancer continues to be one of the deadliest cancers in the world and therefore identification of new drugs targeting this type of cancer is thus of significant importance. The purpose of this study was to identify and validate a therapeutic agent which might improve the outcomes for gastric cancer patients in the future. Methodology/Principal Findings Using microarray technology, we generated a gene expression profile of human gastric cancer–specific genes from human gastric cancer tissue samples. We used this profile in the Broad Institute's Connectivity Map analysis to identify candidate therapeutic compounds for gastric cancer. We found the histone deacetylase inhibitor vorinostat as the lead compound and thus a potential therapeutic drug for gastric cancer. Vorinostat induced both apoptosis and autophagy in gastric cancer cell lines. Pharmacological and genetic inhibition of autophagy however, increased the therapeutic efficacy of vorinostat, indicating that a combination of vorinostat with autophagy inhibitors may therapeutically be more beneficial. Moreover, gene expression analysis of gastric cancer identified a collection of genes (ITGB5, TYMS, MYB, APOC1, CBX5, PLA2G2A, and KIF20A) whose expression was elevated in gastric tumor tissue and downregulated more than 2-fold by vorinostat treatment in gastric cancer cell lines. In contrast, SCGB2A1, TCN1, CFD, APLP1, and NQO1 manifested a reversed pattern. Conclusions/Significance We showed that analysis of gene expression signature may represent an emerging approach to discover therapeutic agents for gastric cancer, such as vorinostat. The observation of altered gene expression after vorinostat treatment may provide the clue to identify the molecular mechanism of vorinostat and those patients likely to benefit from vorinostat treatment. PMID:21931799
Falkenberg, K J; Newbold, A; Gould, C M; Luu, J; Trapani, J A; Matthews, G M; Simpson, K J; Johnstone, R W
Vorinostat is an FDA-approved histone deacetylase inhibitor (HDACi) that has proven clinical success in some patients; however, it remains unclear why certain patients remain unresponsive to this agent and other HDACis. Constitutive STAT (signal transducer and activator of transcription) activation, overexpression of prosurvival Bcl-2 proteins and loss of HR23B have been identified as potential biomarkers of HDACi resistance; however, none have yet been used to aid the clinical utility of HDACi. Herein, we aimed to further elucidate vorinostat-resistance mechanisms through a functional genomics screen to identify novel genes that when knocked down by RNA interference (RNAi) sensitized cells to vorinostat-induced apoptosis. A synthetic lethal functional screen using a whole-genome protein-coding RNAi library was used to identify genes that when knocked down cooperated with vorinostat to induce tumor cell apoptosis in otherwise resistant cells. Through iterative screening, we identified 10 vorinostat-resistance candidate genes that sensitized specifically to vorinostat. One of these vorinostat-resistance genes was GLI1, an oncogene not previously known to regulate the activity of HDACi. Treatment of vorinostat-resistant cells with the GLI1 small-molecule inhibitor, GANT61, phenocopied the effect of GLI1 knockdown. The mechanism by which GLI1 loss of function sensitized tumor cells to vorinostat-induced apoptosis is at least in part through interactions with vorinostat to alter gene expression in a manner that favored apoptosis. Upon GLI1 knockdown and vorinostat treatment, BCL2L1 expression was repressed and overexpression of BCL2L1 inhibited GLI1-knockdown-mediated vorinostat sensitization. Taken together, we present the identification and characterization of GLI1 as a new HDACi resistance gene, providing a strong rationale for development of GLI1 inhibitors for clinical use in combination with HDACi therapy.
Fusco, Dahlene N; Brisac, Cynthia; John, Sinu P; Huang, Yi-Wen; Chin, Christopher R; Xie, Tiao; Zhao, Hong; Jilg, Nikolaus; Zhang, Leiliang; Chevaliez, Stephane; Wambua, Daniel; Lin, Wenyu; Peng, Lee; Chung, Raymond T; Brass, Abraham L
Hepatitis C virus (HCV) infection is a leading cause of end-stage liver disease. Interferon-α (IFNα) is an important component of anti-HCV therapy; it up-regulates transcription of IFN-stimulated genes, many of which have been investigated for their antiviral effects. However, all of the genes required for the antiviral function of IFNα (IFN effector genes [IEGs]) are not known. IEGs include not only IFN-stimulated genes, but other nontranscriptionally induced genes that are required for the antiviral effect of IFNα. In contrast to candidate approaches based on analyses of messenger RNA (mRNA) expression, identification of IEGs requires a broad functional approach. We performed an unbiased genome-wide small interfering RNA screen to identify IEGs that inhibit HCV. Huh7.5.1 hepatoma cells were transfected with small interfering RNAs incubated with IFNα and then infected with JFH1 HCV. Cells were stained using HCV core antibody, imaged, and analyzed to determine the percent infection. Candidate IEGs detected in the screen were validated and analyzed further. The screen identified 120 previously unreported IEGs. From these, we more fully evaluated the following: asparagine-linked glycosylation 10 homolog (yeast, α-1,2-glucosyltransferase); butyrylcholinesterase; dipeptidyl-peptidase 4 (CD26, adenosine deaminase complexing protein 2); glucokinase (hexokinase 4) regulator; guanylate cyclase 1, soluble, β 3; MYST histone acetyltransferase 1; protein phosphatase 3 (formerly 2B), catalytic subunit, β isoform; peroxisomal proliferator-activated receptor-γ-DBD-interacting protein 1; and solute carrier family 27 (fatty acid transporter), member 2; and demonstrated that they enabled IFNα-mediated suppression of HCV at multiple steps of its life cycle. Expression of these genes had more potent effects against flaviviridae because a subset was required for IFNα to suppress dengue virus but not influenza A virus. In addition, many of the host genes detected in this
Kordmahalleh, Mina Moradi; Sefidmazgi, Mohammad Gorji; Harrison, Scott H; Homaifar, Abdollah
The modeling of genetic interactions within a cell is crucial for a basic understanding of physiology and for applied areas such as drug design. Interactions in gene regulatory networks (GRNs) include effects of transcription factors, repressors, small metabolites, and microRNA species. In addition, the effects of regulatory interactions are not always simultaneous, but can occur after a finite time delay, or as a combined outcome of simultaneous and time delayed interactions. Powerful biotechnologies have been rapidly and successfully measuring levels of genetic expression to illuminate different states of biological systems. This has led to an ensuing challenge to improve the identification of specific regulatory mechanisms through regulatory network reconstructions. Solutions to this challenge will ultimately help to spur forward efforts based on the usage of regulatory network reconstructions in systems biology applications. We have developed a hierarchical recurrent neural network (HRNN) that identifies time-delayed gene interactions using time-course data. A customized genetic algorithm (GA) was used to optimize hierarchical connectivity of regulatory genes and a target gene. The proposed design provides a non-fully connected network with the flexibility of using recurrent connections inside the network. These features and the non-linearity of the HRNN facilitate the process of identifying temporal patterns of a GRN. Our HRNN method was implemented with the Python language. It was first evaluated on simulated data representing linear and nonlinear time-delayed gene-gene interaction models across a range of network sizes and variances of noise. We then further demonstrated the capability of our method in reconstructing GRNs of the Saccharomyces cerevisiae synthetic network for in vivo benchmarking of reverse-engineering and modeling approaches (IRMA). We compared the performance of our method to TD-ARACNE, HCC-CLINDE, TSNI and ebdbNet across different network
Prasad, Shiv S; Russell, Marsha; Nowakowska, Margeryta; Williams, Andrew; Yauk, Carole
Mild ischaemic exposures before or after severe injurious ischaemia that elicit neuroprotective responses are referred to as preconditioning and post-conditioning. The corresponding molecular mechanisms of neuroprotection are not completely understood. Identification of the genes and associated pathways of corresponding neuroprotection would provide insight into neuronal survival, potential therapeutic approaches and assessments of therapies for stroke. The objectives of this study were to use global gene expression approach to infer the molecular mechanisms in pre- and post-conditioning-derived neuroprotection in cortical neurons following oxygen and glucose deprivation (OGD) in vitro and then to apply these findings to predict corresponding functional pathways. To this end, microarray analysis was applied to rat cortical neurons with or without the pre- and post-conditioning treatments at 3-h post-reperfusion, and differentially expressed transcripts were subjected to statistical, hierarchical clustering and pathway analyses. The expression patterns of 3,431 genes altered under all conditions of ischaemia (with and without pre- or post-conditioning). We identified 1,595 genes that were commonly regulated within both the pre- and post-conditioning treatments. Cluster analysis revealed that transcription profiles clustered tightly within controls, non-conditioned OGD and neuroprotected groups. Two clusters defining neuroprotective conditions associated with up- and downregulated genes were evident. The five most upregulated genes within the neuroprotective clusters were Tagln, Nes, Ptrf, Vim and Adamts9, and the five most downregulated genes were Slc7a3, Bex1, Brunol4, Nrxn3 and Cpne4. Pathway analysis revealed that the intracellular and second messenger signalling pathways in addition to cell death were predominantly associated with downregulated pre- and post-conditioning associated genes, suggesting that modulation of cell death and signal transduction pathways
Olm, Matthew R.; Morowitz, Michael J.
ABSTRACT Antibiotic resistance in pathogens is extensively studied, and yet little is known about how antibiotic resistance genes of typical gut bacteria influence microbiome dynamics. Here, we leveraged genomes from metagenomes to investigate how genes of the premature infant gut resistome correspond to the ability of bacteria to survive under certain environmental and clinical conditions. We found that formula feeding impacts the resistome. Random forest models corroborated by statistical tests revealed that the gut resistome of formula-fed infants is enriched in class D beta-lactamase genes. Interestingly, Clostridium difficile strains harboring this gene are at higher abundance in formula-fed infants than C. difficile strains lacking this gene. Organisms with genes for major facilitator superfamily drug efflux pumps have higher replication rates under all conditions, even in the absence of antibiotic therapy. Using a machine learning approach, we identified genes that are predictive of an organism’s direction of change in relative abundance after administration of vancomycin and cephalosporin antibiotics. The most accurate results were obtained by reducing annotated genomic data to five principal components classified by boosted decision trees. Among the genes involved in predicting whether an organism increased in relative abundance after treatment are those that encode subclass B2 beta-lactamases and transcriptional regulators of vancomycin resistance. This demonstrates that machine learning applied to genome-resolved metagenomics data can identify key genes for survival after antibiotics treatment and predict how organisms in the gut microbiome will respond to antibiotic administration. IMPORTANCE The process of reconstructing genomes from environmental sequence data (genome-resolved metagenomics) allows unique insight into microbial systems. We apply this technique to investigate how the antibiotic resistance genes of bacteria affect their ability to
Wang, Pei; Song, Fan; Cai, Wanzhi
Insect mitochondrial genomes are very important to understand the molecular evolution as well as for phylogenetic and phylogeographic studies of the insects. The Miridae are the largest family of Heteroptera encompassing more than 11,000 described species and of great economic importance. For better understanding the diversity and the evolution of plant bugs, we sequence five new mitochondrial genomes and present the first comparative analysis of nine mitochondrial genomes of mirids available to date. Our result showed that gene content, gene arrangement, base composition and sequences of mitochondrial transcription termination factor were conserved in plant bugs. Intra-genus species shared more conserved genomic characteristics, such as nucleotide and amino acid composition of protein-coding genes, secondary structure and anticodon mutations of tRNAs, and non-coding sequences. Control region possessed several distinct characteristics, including: variable size, abundant tandem repetitions, and intra-genus conservation; and was useful in evolutionary and population genetic studies. The AGG codon reassignments were investigated between serine and lysine in the genera Adelphocoris and other cimicomorphans. Our analysis revealed correlated evolution between reassignments of the AGG codon and specific point mutations at the antidocons of tRNALys and tRNASer(AGN). Phylogenetic analysis indicated that mitochondrial genome sequences were useful in resolving family level relationship of Cimicomorpha. Comparative evolutionary analysis of plant bug mitochondrial genomes allowed the identification of previously neglected coding genes or non-coding regions as potential molecular markers. The finding of the AGG codon reassignments between serine and lysine indicated the parallel evolution of the genetic code in Hemiptera mitochondrial genomes. PMID:24988409
Do, Jin Hwan; Choi, Dong-Kug
The analysis of microarray data is essential for large amounts of gene expression data. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weakness and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.
Full Text Available Limb-girdle muscular dystrophies (LGMD are genetically and clinically heterogeneous conditions. We investigated a large family with autosomal dominant transmission pattern, previously classified as LGMD1F and mapped to chromosome 7q32. Affected members are characterized by muscle weakness affecting earlier the pelvic girdle and the ileopsoas muscles. We sequenced the whole exome of four family members and identified a shared heterozygous frame-shift variant in the Transportin 3 (TNPO3 gene, encoding a member of the importin-β super-family. The TNPO3 gene is mapped within the LGMD1F critical interval and its 923-amino acid human gene product is also expressed in skeletal muscle. In addition, we identified an isolated case of LGMD with a new missense mutation in the same gene. We localized the mutant TNPO3 around the nucleus, but not inside. The involvement of gene related to the nuclear transport suggests a novel disease mechanism leading to muscular dystrophy.
Serrano-Mislata, Antonio; Bencivenga, Stefano; Bush, Max; Schiessl, Katharina; Boden, Scott; Sablowski, Robert
DELLA proteins associate with transcription factors to control plant growth in response to gibberellin 1 . Semi-dwarf DELLA mutants with improved harvest index and decreased lodging greatly improved global food security during the 'green revolution' in the 1960-1970s 2 . However, DELLA mutants are pleiotropic and the developmental basis for their effects on plant architecture remains poorly understood. Here, we show that DELLA proteins have genetically separable roles in controlling stem growth and the size of the inflorescence meristem, where flowers initiate. Quantitative three-dimensional image analysis, combined with a genome-wide screen for DELLA-bound loci in the inflorescence tip, revealed that DELLAs limit meristem size in Arabidopsis by directly upregulating the cell-cycle inhibitor KRP2 in the underlying rib meristem, without affecting the canonical WUSCHEL-CLAVATA meristem size regulators 3 . Mutation of KRP2 in a DELLA semi-dwarf background restored meristem size, but not stem growth, and accelerated flower production. In barley, secondary mutations in the DELLA gain-of-function mutant Sln1d 4 also uncoupled meristem and inflorescence size from plant height. Our work reveals an unexpected and conserved role for DELLA genes in controlling shoot meristem function and suggests how dissection of pleiotropic DELLA functions could unlock further yield gains in semi-dwarf mutants.
Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich
The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild type-copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information.
Zhang, Dale; Qi, Jinfeng; Yue, Jipei; Huang, Jinling; Sun, Ting; Li, Suoping; Wen, Jian-Fan; Hettenhausen, Christian; Wu, Jinsong; Wang, Lei; Zhuang, Huifu; Wu, Jianqiang; Sun, Guiling
Besides gene duplication and de novo gene generation, horizontal gene transfer (HGT) is another important way of acquiring new genes. HGT may endow the recipients with novel phenotypic traits that are important for species evolution and adaption to new ecological niches. Parasitic systems expectedly allow the occurrence of HGT at relatively high frequencies due to their long-term physical contact. In plants, a number of HGT events have been reported between the organelles of parasites and the hosts, but HGT between host and parasite nuclear genomes has rarely been found. A thorough transcriptome screening revealed that a strictosidine synthase-like (SSL) gene in the root parasitic plant Orobanche aegyptiaca and the shoot parasitic plant Cuscuta australis showed much higher sequence similarities with those in Brassicaceae than with those in their close relatives, suggesting independent gene horizontal transfer events from Brassicaceae to these parasites. These findings were strongly supported by phylogenetic analysis and their identical unique amino acid residues and deletions. Intriguingly, the nucleus-located SSL genes in Brassicaceae belonged to a new member of SSL gene family, which were originated from gene duplication. The presence of introns indicated that the transfer occurred directly by DNA integration in both parasites. Furthermore, positive selection was detected in the foreign SSL gene in O. aegyptiaca but not in C. australis. The expression of the foreign SSL genes in these two parasitic plants was detected in multiple development stages and tissues, and the foreign SSL gene was induced after wounding treatment in C. australis stems. These data imply that the foreign genes may still retain certain functions in the recipient species. Our study strongly supports that parasitic plants can gain novel nuclear genes from distantly related host species by HGT and the foreign genes may execute certain functions in the new hosts.
Ma, Hongming; Dang, Ying; Wu, Yonggan; Jia, Gengxiang; Anaya, Edgar; Zhang, Junli; Abraham, Sojan; Choi, Jang-Gi; Shi, Guojun; Qi, Ling; Manjunath, N; Wu, Haoquan
West Nile virus (WNV) causes an acute neurological infection attended by massive neuronal cell death. However, the mechanism(s) behind the virus-induced cell death is poorly understood. Using a library containing 77,406 sgRNAs targeting 20,121 genes, we performed a genome-wide screen followed by a second screen with a sub-library. Among the genes identified, seven genes, EMC2, EMC3, SEL1L, DERL2, UBE2G2, UBE2J1, and HRD1, stood out as having the strongest phenotype, whose knockout conferred strong protection against WNV-induced cell death with two different WNV strains and in three cell lines. Interestingly, knockout of these genes did not block WNV replication. Thus, these appear to be essential genes that link WNV replication to downstream cell death pathway(s). In addition, the fact that all of these genes belong to the ER-associated protein degradation (ERAD) pathway suggests that this might be the primary driver of WNV-induced cell death. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Thomas Larsen; D. Lee Taylor; Mary Beth Leigh; Diane M. O' Brien
Amino acids play an important role in ecology as essential nutrients for animals and as currencies in symbiotic associations. Here we present a new approach to tracing the origins of amino acids by identifying unique patterns of carbon isotope signatures generated by amino acid synthesis in plants, fungi, and bacteria ("13C fingerprints...
Chintalapudi, Sumana R; Jablonski, Monica M
Loss of retinal ganglion cells (RGCs) is one of the hallmarks of retinal neurodegenerative diseases, glaucoma being one of the most common. Recently, γ-synuclein (SNCG) was shown to be highly expressed in the somas and axons of RGCs. In various mouse models of glaucoma, downregulation of Sncg gene expression correlates with RGC loss. To investigate the regulation of Sncg in RGCs, we used a systems genetics approach to identify a gene that modulates the expression of Sncg, followed by confirmatory studies in both healthy and diseased retinas. We found that chromosome 1 harbors an eQTL that modulates the expression of Sncg in the mouse retina and identified Pfdn2 as the candidate upstream modulator of Sncg expression. Downregulation of Pfdn2 in enriched RGCs causes a concomitant reduction in Sncg. In this chapter, we describe our strategy and methods for identifying and confirming a genetic modulation of a glaucoma-associated gene. A similar method can be applied to other genes expressed in other tissues.
Thakur, Nidhi; Upadhyay, Santosh Kumar; Verma, Praveen C; Chandrashekar, Krishnappa; Tuli, Rakesh; Singh, Pradhyumna K
Expression of double strand RNA (dsRNA) designed against important insect genes in transgenic plants have been shown to give protection against pests through RNA interference (RNAi), thus opening the way for a new generation of insect-resistant crops. We have earlier compared the efficacy of dsRNAs/siRNAs, against a number of target genes, for interference in growth of whitefly (Bemisia tabaci) upon oral feeding. The v-ATPase subunit A (v-ATPaseA) coding gene was identified as a crucial target. We now report the effectiveness of transgenic tobacco plants expressing siRNA to silence v-ATPaseA gene expression for the control of whitefly infestation. Transgenic tobacco lines were developed for the expression of long dsRNA precursor to make siRNA and knock down the v-ATPaseA mRNA in whitefly. Molecular analysis and insecticidal properties of the transgenic plants established the formation of siRNA targeting the whitefly v-ATPaseA, in the leaves. The transcript level of v-ATPaseA in whiteflies was reduced up to 62% after feeding on the transgenic plants. Heavy infestation of whiteflies on the control plants caused significant loss of sugar content which led to the drooping of leaves. The transgenic plants did not show drooping effect. Host plant derived pest resistance was achieved against whiteflies by genetic transformation of tobacco which generated siRNA against the whitefly v-ATPaseA gene. Transgenic tobacco lines expressing dsRNA of v-ATPaseA, delivered sufficient siRNA to whiteflies feeding on them, mounting a significant silencing response, leading to their mortality. The transcript level of the target gene was reduced in whiteflies feeding on transgenic plants. The strategy can be taken up for genetic engineering of plants to control whiteflies in field crops.
Full Text Available BACKGROUND: Expression of double strand RNA (dsRNA designed against important insect genes in transgenic plants have been shown to give protection against pests through RNA interference (RNAi, thus opening the way for a new generation of insect-resistant crops. We have earlier compared the efficacy of dsRNAs/siRNAs, against a number of target genes, for interference in growth of whitefly (Bemisia tabaci upon oral feeding. The v-ATPase subunit A (v-ATPaseA coding gene was identified as a crucial target. We now report the effectiveness of transgenic tobacco plants expressing siRNA to silence v-ATPaseA gene expression for the control of whitefly infestation. METHODOLOGY/PRINCIPAL FINDINGS: Transgenic tobacco lines were developed for the expression of long dsRNA precursor to make siRNA and knock down the v-ATPaseA mRNA in whitefly. Molecular analysis and insecticidal properties of the transgenic plants established the formation of siRNA targeting the whitefly v-ATPaseA, in the leaves. The transcript level of v-ATPaseA in whiteflies was reduced up to 62% after feeding on the transgenic plants. Heavy infestation of whiteflies on the control plants caused significant loss of sugar content which led to the drooping of leaves. The transgenic plants did not show drooping effect. CONCLUSIONS/SIGNIFICANCE: Host plant derived pest resistance was achieved against whiteflies by genetic transformation of tobacco which generated siRNA against the whitefly v-ATPaseA gene. Transgenic tobacco lines expressing dsRNA of v-ATPaseA, delivered sufficient siRNA to whiteflies feeding on them, mounting a significant silencing response, leading to their mortality. The transcript level of the target gene was reduced in whiteflies feeding on transgenic plants. The strategy can be taken up for genetic engineering of plants to control whiteflies in field crops.
Expression quantitative trait loci (eQTL) mapping is a tool that can systematically identify genetic variation affecting gene expression. eQTL mapping studies have shown that certain genomic locations, referred to as regulatory hotspots, may affect the expression levels of many genes. Recently, studies have shown that various confounding factors may induce spurious regulatory hotspots. Here, we introduce a novel statistical method that effectively eliminates spurious hotspots while retaining genuine hotspots. Applied to simulated and real datasets, we validate that our method achieves greater sensitivity while retaining low false discovery rates compared to previous methods. PMID:24708878
Chen, Dijun; Kaufmann, Kerstin
Key transcription factors (TFs) controlling the morphogenesis of flowers and leaves have been identified in the model plant Arabidopsis thaliana. Recent genome-wide approaches based on chromatin immunoprecipitation (ChIP) followed by high-throughput DNA sequencing (ChIP-seq) enable systematic identification of genome-wide TF binding sites (TFBSs) of these regulators. Here, we describe a computational pipeline for analyzing ChIP-seq data to identify TFBSs and to characterize gene regulatory networks (GRNs) with applications to the regulatory studies of flower development. In particular, we provide step-by-step instructions on how to download, analyze, visualize, and integrate genome-wide data in order to construct GRNs for beginners of bioinformatics. The practical guide presented here is ready to apply to other similar ChIP-seq datasets to characterize GRNs of interest.
Full Text Available Abstract Background Endometriosis is an enigmatic disease. Gene expression profiling of endometriosis has been used in several studies, but few studies went further to classify subtypes of endometriosis based on expression patterns and to identify possible pathways involved in endometriosis. Some of the observed pathways are more inconsistent between the studies, and these candidate pathways presumably only represent a fraction of the pathways involved in endometriosis. Methods We applied a standardised microarray preprocessing and gene set enrichment analysis to six independent studies, and demonstrated increased concordance between these gene datasets. Results We find 16 up-regulated and 19 down-regulated pathways common in ovarian endometriosis data sets, 22 up-regulated and one down-regulated pathway common in peritoneal endometriosis data sets. Among them, 12 up-regulated and 1 down-regulated were found consistent between ovarian and peritoneal endometriosis. The main canonical pathways identified are related to immunological and inflammatory disease. Early secretory phase has the most over-represented pathways in the three uterine cycle phases. There are no overlapping significant pathways between the dataset from human endometrial endothelial cells and the datasets from ovarian endometriosis which used whole tissues. Conclusion The study of complex diseases through pathway analysis is able to highlight genes weakly connected to the phenotype which may be difficult to detect by using classical univariate statistics. By standardised microarray preprocessing and GSEA, we have increased the concordance in identifying many biological mechanisms involved in endometriosis. The identified gene pathways will shed light on the understanding of endometriosis and promote the development of novel therapies.
Full Text Available Numerous comparative genome analyses have revealed the wide extent of horizontal gene transfer (HGT in living organisms, which contributes to their evolution and genetic diversity. Viruses play important roles in HGT. Endogenous viral elements (EVEs are defined as viral DNA sequences present within the genomes of non-viral organisms. In eukaryotic cells, the majority of EVEs are derived from RNA viruses using reverse transcription. In contrast, endogenous non-retroviral elements (ENREs are poorly studied. However, the increasing availability of genomic data and the rapid development of bioinformatics tools have enabled the identification of several ENREs in various eukaryotic organisms. To date, a small number of ENREs integrated into plant genomes have been identified. Of the known non-retroviruses, most identified ENREs are derived from double-strand (ds RNA viruses, followed by single-strand (ss DNA and ssRNA viruses. At least eight virus families have been identified. Of these, viruses in the family Partitiviridae are dominant, followed by viruses of the families Chrysoviridae and Geminiviridae. The identified ENREs have been primarily identified in eudicots, followed by monocots. In this review, we briefly discuss the current view on non-retroviral sequences integrated into plant genomes that are associated with plant-virus evolution and their possible roles in antiviral resistance.
Wright, Robin; Parrish, Mark L; Cadera, Emily; Larson, Lynnelle; Matson, Clinton K; Garrett-Engele, Philip; Armour, Chris; Lum, Pek Yee; Shoemaker, Daniel D
Increased levels of HMG-CoA reductase induce cell type- and isozyme-specific proliferation of the endoplasmic reticulum. In yeast, the ER proliferations induced by Hmg1p consist of nuclear-associated stacks of smooth ER membranes known as karmellae. To identify genes required for karmellae assembly, we compared the composition of populations of homozygous diploid S. cerevisiae deletion mutants following 20 generations of growth with and without karmellae. Using an initial population of 1,557 deletion mutants, 120 potential mutants were identified as a result of three independent experiments. Each experiment produced a largely non-overlapping set of potential mutants, suggesting that differences in specific growth conditions could be used to maximize the comprehensiveness of similar parallel analysis screens. Only two genes, UBC7 and YAL011W, were identified in all three experiments. Subsequent analysis of individual mutant strains confirmed that each experiment was identifying valid mutations, based on the mutant's sensitivity to elevated HMG-CoA reductase and inability to assemble normal karmellae. The largest class of HMG-CoA reductase-sensitive mutations was a subset of genes that are involved in chromatin structure and transcriptional regulation, suggesting that karmellae assembly requires changes in transcription or that the presence of karmellae may interfere with normal transcriptional regulation. Copyright 2003 John Wiley & Sons, Ltd.
Ban, Yusuke; Moriguchi, Takaya
The pigmentation of anthocyanins is one of the important determinants for consumer preference and marketability in horticultural crops such as fruits and flowers. To elucidate the mechanisms underlying the physiological process leading to the pigmentation of anthocyanins, identification of the genes differentially expressed in response to anthocyanin accumulation is a useful strategy. Currently, microarrays have been widely used to isolate differentially expressed genes. However, the use of microarrays is limited by its high cost of special apparatus and materials. Therefore, availability of microarrays is limited and does not come into common use at present. Suppression subtractive hybridization (SSH) is an alternative tool that has been widely used to identify differentially expressed genes due to its easy handling and relatively low cost. This chapter describes the procedures for SSH, including RNA extraction from polysaccharides and polyphenol-rich samples, poly(A)+ RNA purification, evaluation of subtraction efficiency, and differential screening using reverse northern in apple skin.
Gao, Long; Uzun, Yasin; Gao, Peng; He, Bing; Ma, Xiaoke; Wang, Jiahui; Han, Shizhong; Tan, Kai
Identifying noncoding risk variants remains a challenging task. Because noncoding variants exert their effects in the context of a gene regulatory network (GRN), we hypothesize that explicit use of disease-relevant GRNs can significantly improve the inference accuracy of noncoding risk variants. We describe Annotation of Regulatory Variants using Integrated Networks (ARVIN), a general computational framework for predicting causal noncoding variants. It employs a set of novel regulatory network-based features, combined with sequence-based features to infer noncoding risk variants. Using known causal variants in gene promoters and enhancers in a number of diseases, we show ARVIN outperforms state-of-the-art methods that use sequence-based features alone. Additional experimental validation using reporter assay further demonstrates the accuracy of ARVIN. Application of ARVIN to seven autoimmune diseases provides a holistic view of the gene subnetwork perturbed by the combinatorial action of the entire set of risk noncoding mutations.
Se Chang Park
Full Text Available Cyprinid herpes virus 3 (CyHV-3 diseases have been reported around the world and are associated with high mortalities of koi (Cyprinus carpio. Although little work has been conducted on the molecular analysis of this virus, glycoprotein genes identified in the present study seem to be valuable targets for genetic comparison of this virus. Three envelope glycoprotein genes (ORF25, 65 and 116 of the CyHV-3 isolates from the USA, Israel, Japan and Korea were compared, and interestingly, sequence insertions or deletions were observed in these target regions. In addition, polymorphisms were presented in microsatellite zones from two glycoprotein genes (ORF65 and 116. In phylogenetic tree analysis, the Korean isolate was remarkably distinguished from USA, Israel, Japan isolates. These findings may be suitable for many applications including isolates differentiation and phylogeny studies.
Han, Jee Eun; Kim, Ji Hyung; Renault, Tristan; Choresca, Casiano; Shin, Sang Phil; Jun, Jin Woo; Park, Se Chang
Cyprinid herpes virus 3 (CyHV-3) diseases have been reported around the world and are associated with high mortalities of koi (Cyprinus carpio). Although little work has been conducted on the molecular analysis of this virus, glycoprotein genes identified in the present study seem to be valuable targets for genetic comparison of this virus. Three envelope glycoprotein genes (ORF25, 65 and 116) of the CyHV-3 isolates from the USA, Israel, Japan and Korea were compared, and interestingly, sequence insertions or deletions were observed in these target regions. In addition, polymorphisms were presented in microsatellite zones from two glycoprotein genes (ORF65 and 116). In phylogenetic tree analysis, the Korean isolate was remarkably distinguished from USA, Israel, Japan isolates. These findings may be suitable for many applications including isolates differentiation and phylogeny studies. PMID:23435236
Baroukh, Caroline; Jenkins, Sherry L; Dannenfelser, Ruth; Ma'ayan, Avi
Word-clouds recently emerged on the web as a solution for quickly summarizing text by maximizing the display of most relevant terms about a specific topic in the minimum amount of space. As biologists are faced with the daunting amount of new research data commonly presented in textual formats, word-clouds can be used to summarize and represent biological and/or biomedical content for various applications. Genes2WordCloud is a web application that enables users to quickly identify biological themes from gene lists and research relevant text by constructing and displaying word-clouds. It provides users with several different options and ideas for the sources that can be used to generate a word-cloud. Different options for rendering and coloring the word-clouds give users the flexibility to quickly generate customized word-clouds of their choice. Genes2WordCloud is a word-cloud generator and a word-cloud viewer that is based on WordCram implemented using Java, Processing, AJAX, mySQL, and PHP. Text is fetched from several sources and then processed to extract the most relevant terms with their computed weights based on word frequencies. Genes2WordCloud is freely available for use online; it is open source software and is available for installation on any web-site along with supporting documentation at http://www.maayanlab.net/G2W. Genes2WordCloud provides a useful way to summarize and visualize large amounts of textual biological data or to find biological themes from several different sources. The open source availability of the software enables users to implement customized word-clouds on their own web-sites and desktop applications.
Full Text Available Abstract Background Word-clouds recently emerged on the web as a solution for quickly summarizing text by maximizing the display of most relevant terms about a specific topic in the minimum amount of space. As biologists are faced with the daunting amount of new research data commonly presented in textual formats, word-clouds can be used to summarize and represent biological and/or biomedical content for various applications. Results Genes2WordCloud is a web application that enables users to quickly identify biological themes from gene lists and research relevant text by constructing and displaying word-clouds. It provides users with several different options and ideas for the sources that can be used to generate a word-cloud. Different options for rendering and coloring the word-clouds give users the flexibility to quickly generate customized word-clouds of their choice. Methods Genes2WordCloud is a word-cloud generator and a word-cloud viewer that is based on WordCram implemented using Java, Processing, AJAX, mySQL, and PHP. Text is fetched from several sources and then processed to extract the most relevant terms with their computed weights based on word frequencies. Genes2WordCloud is freely available for use online; it is open source software and is available for installation on any web-site along with supporting documentation at http://www.maayanlab.net/G2W. Conclusions Genes2WordCloud provides a useful way to summarize and visualize large amounts of textual biological data or to find biological themes from several different sources. The open source availability of the software enables users to implement customized word-clouds on their own web-sites and desktop applications.
Chen, Shilin; Yao, Hui; Han, Jianping; Liu, Chang; Song, Jingyuan; Shi, Linchun; Zhu, Yingjie; Ma, Xinye; Gao, Ting; Pang, Xiaohui; Luo, Kun; Li, Ying; Li, Xiwen; Jia, Xiaocheng; Lin, Yulin; Leon, Christine
The plant working group of the Consortium for the Barcode of Life recommended the two-locus combination of rbcL+matK as the plant barcode, yet the combination was shown to successfully discriminate among 907 samples from 550 species at the species level with a probability of 72%. The group admits that the two-locus barcode is far from perfect due to the low identification rate, and the search is not over. Here, we compared seven candidate DNA barcodes (psbA-trnH, matK, rbcL, rpoC1, ycf5, ITS2, and ITS) from medicinal plant species. Our ranking criteria included PCR amplification efficiency, differential intra- and inter-specific divergences, and the DNA barcoding gap. Our data suggest that the second internal transcribed spacer (ITS2) of nuclear ribosomal DNA represents the most suitable region for DNA barcoding applications. Furthermore, we tested the discrimination ability of ITS2 in more than 6600 plant samples belonging to 4800 species from 753 distinct genera and found that the rate of successful identification with the ITS2 was 92.7% at the species level. The ITS2 region can be potentially used as a standard DNA barcode to identify medicinal plants and their closely related species. We also propose that ITS2 can serve as a novel universal barcode for the identification of a broader range of plant taxa.
Full Text Available White mold, caused by the necrotrophic fungus (Lib. de Bary, is a major disease of common bean ( L.. WM7.1 and WM8.3 are two quantitative trait loci (QTL with major effects on tolerance to the pathogen. Advanced backcross populations segregating individually for either of the two QTL, and a recombinant inbred (RI population segregating for both QTL were used to fine map and confirm the genetic location of the QTL. The QTL intervals were physically mapped using the reference common bean genome sequence, and the physical intervals for each QTL were further confirmed by sequence-based introgression mapping. Using whole-genome sequence data from susceptible and tolerant DNA pools, introgressed regions were identified as those with significantly higher numbers of single-nucleotide polymorphisms (SNPs relative to the whole genome. By combining the QTL and SNP data, WM7.1 was located to a 660-kb region that contained 41 gene models on the proximal end of chromosome Pv07, while the WM8.3 introgression was narrowed to a 1.36-Mb region containing 70 gene models. The most polymorphic candidate gene in the WM7.1 region encodes a BEACH-domain protein associated with apoptosis. Within the WM8.3 interval, a receptor-like protein with the potential to recognize pathogen effectors was the most polymorphic gene. The use of gene and sequence-based mapping identified two candidate genes whose putative functions are consistent with the current model of pathogenicity.
Walter, Vonn; Du, Ying; Danilova, Ludmila; Hayward, Michele C; Hayes, D Neil
Integrated analyses of multiple genomic datatypes are now common in cancer profiling studies. Such data present opportunities for numerous computational experiments, yet analytic pipelines are limited. Tools such as the cBioPortal and Regulome Explorer, although useful, are not easy to access programmatically or to implement locally. Here, we introduce the MVisAGe R package, which allows users to quantify gene-level associations between two genomic datatypes to investigate the effect of genomic alterations (e.g., DNA copy number changes on gene expression). Visualizing Pearson/Spearman correlation coefficients according to the genomic positions of the underlying genes provides a powerful yet novel tool for conducting exploratory analyses. We demonstrate its utility by analyzing three publicly available cancer datasets. Our approach highlights canonical oncogenes in chr11q13 that displayed the strongest associations between expression and copy number, including CCND1 and CTTN , genes not identified by copy number analysis in the primary reports. We demonstrate highly concordant usage of shared oncogenes on chr3q, yet strikingly diverse oncogene usage on chr11q as a function of HPV infection status. Regions of chr19 that display remarkable associations between methylation and gene expression were identified, as were previously unreported miRNA-gene expression associations that may contribute to the epithelial-to-mesenchymal transition. Significance: This study presents an important bioinformatics tool that will enable integrated analyses of multiple genomic datatypes. Cancer Res; 78(12); 3375-85. ©2018 AACR . ©2018 American Association for Cancer Research.
Kelley, Rowena Y; Gresham, Cathy; Harper, Jonathan; Bridges, Susan M; Warburton, Marilyn L; Hawkins, Leigh K; Pechanova, Olga; Peethambaran, Bela; Pechan, Tibor; Luthe, Dawn S; Mylroie, J E; Ankala, Arunkanth; Ozkan, Seval; Henry, W B; Williams, W P
Aspergillus flavus Link:Fr, an opportunistic fungus that produces aflatoxin, is pathogenic to maize and other oilseed crops. Aflatoxin is a potent carcinogen, and its presence markedly reduces the value of grain. Understanding and enhancing host resistance to A. flavus infection and/or subsequent aflatoxin accumulation is generally considered an efficient means of reducing grain losses to aflatoxin. Different proteomic, genomic and genetic studies of maize (Zea mays L.) have generated large data sets with the goal of identifying genes responsible for conferring resistance to A. flavus, or aflatoxin. In order to maximize the usage of different data sets in new studies, including association mapping, we have constructed a relational database with web interface integrating the results of gene expression, proteomic (both gel-based and shotgun), Quantitative Trait Loci (QTL) genetic mapping studies, and sequence data from the literature to facilitate selection of candidate genes for continued investigation. The Corn Fungal Resistance Associated Sequences Database (CFRAS-DB) (http://agbase.msstate.edu/) was created with the main goal of identifying genes important to aflatoxin resistance. CFRAS-DB is implemented using MySQL as the relational database management system running on a Linux server, using an Apache web server, and Perl CGI scripts as the web interface. The database and the associated web-based interface allow researchers to examine many lines of evidence (e.g. microarray, proteomics, QTL studies, SNP data) to assess the potential role of a gene or group of genes in the response of different maize lines to A. flavus infection and subsequent production of aflatoxin by the fungus. CFRAS-DB provides the first opportunity to integrate data pertaining to the problem of A. flavus and aflatoxin resistance in maize in one resource and to support queries across different datasets. The web-based interface gives researchers different query options for mining the database
Wu, Min; Liu, Huanlong; Han, Guomin; Cai, Ronghao; Pan, Feng; Xiang, Yan
The WRKY family are transcription factors, involved in plant development, and response to biotic and abiotic stresses. Moso bamboo is an important bamboo that has high ecological, economic and cultural value and is widely distributed in the south of China. In this study, we performed a genome-wide identification of WRKY members in moso bamboo and identified 89 members. By comparative analysis in six grass genomes, we found the WRKY gene family may have experienced or be experiencing purifying selection. Based on relative expression levels among WRKY IIc members under three abiotic stresses, PeWRKY83 functioned as a transcription factor and was selected for detailed analysis. The transgenic Arabidopsis of PeWRKY83 showed superior physiological properties compared with the WT under salt stress. Overexpression plants were less sensitive to ABA at both germination and postgermination stages and accumulated more endogenous ABA under salt stress conditions. Further studies demonstrated that overexpression of PeWRKY83 could regulate the expression of some ABA biosynthesis genes (AtAAO3, AtNCED2, AtNCED3), signaling genes (AtABI1, AtPP2CA) and responsive genes (AtRD29A, AtRD29B, AtABF1) under salt stress. Together, these results suggested that PeWRKY83 functions as a novel WRKY-related TF which plays a positive role in salt tolerance by regulating stress-induced ABA synthesis.
Chen, Yanhui; Han, Yangyang; Zhang, Meng; Zhou, Shan; Kong, Xiangzhu; Wang, Wei
Expansins are cell wall proteins that are grouped into two main families, α-expansins and β-expansins, and they are implicated in the control of cell extension via the disruption of hydrogen bonds between cellulose and matrix glucans. TaEXPA2 is an α-expansin gene identified in wheat. Based on putative cis-regulatory elements in the TaEXPA2 promoter sequence and the expression pattern induced when polyethylene glycol (PEG) is used to mimic water stress, we hypothesized that TaEXPA2 is involved in plant drought tolerance and plant development. Through transient expression of 35S::TaEXPA2-GFP in onion epidermal cells, TaEXPA2 was localized to the cell wall. Constitutive expression of TaEXPA2 in tobacco improved seed production by increasing capsule number, not seed size, without having any effect on plant growth patterns. The transgenic tobacco exhibited a significantly greater tolerance to water-deficiency stress than did wild-type (WT) plants. We found that under drought stress, the transgenic plants maintained a better water status. The accumulated content of osmotic adjustment substances, such as proline, in TaEXPA2 transgenic plants was greater than that in WT plants. Transgenic plants also displayed greater antioxidative competence as indicated by their lower malondialdehyde (MDA) content, relative electrical conductivity, and reactive oxygen species (ROS) accumulation than did WT plants. This result suggests that the transgenic plants suffer less damage from ROS under drought conditions. The activities of some antioxidant enzymes as well as expression levels of several genes encoding key antioxidant enzymes were higher in the transgenic plants than in the WT plants under drought stress. Collectively, our results suggest that ectopic expression of the wheat expansin gene TaEXPA2 improves seed production and drought tolerance in transgenic tobacco plants.
Full Text Available Abstract Background Independently derived expression profiles of the same biological condition often have few genes in common. In this study, we created populations of expression profiles from publicly available microarray datasets of cancer (breast, lymphoma and renal samples linked to clinical information with an iterative machine learning algorithm. ROC curves were used to assess the prediction error of each profile for classification. We compared the prediction error of profiles correlated with molecular phenotype against profiles correlated with relapse-free status. Prediction error of profiles identified with supervised univariate feature selection algorithms were compared to profiles selected randomly from a all genes on the microarray platform and b a list of known disease-related genes (a priori selection. We also determined the relevance of expression profiles on test arrays from independent datasets, measured on either the same or different microarray platforms. Results Highly discriminative expression profiles were produced on both simulated gene expression data and expression data from breast cancer and lymphoma datasets on the basis of ER and BCL-6 expression, respectively. Use of relapse-free status to identify profiles for prognosis prediction resulted in poorly discriminative decision rules. Supervised feature selection resulted in more accurate classifications than random or a priori selection, however, the difference in prediction error decreased as the number of features increased. These results held when decision rules were applied across-datasets to samples profiled on the same microarray platform. Conclusion Our results show that many gene sets predict molecular phenotypes accurately. Given this, expression profiles identified using different training datasets should be expected to show little agreement. In addition, we demonstrate the difficulty in predicting relapse directly from microarray data using supervised machine
Full Text Available Coenzyme Q (CoQ is an essential factor for aerobic growth and oxidative phosphorylation in the electron transport system. The biosynthetic pathway for CoQ has been proposed mainly from biochemical and genetic analyses of Escherichia coli and Saccharomyces cerevisiae; however, the biosynthetic pathway in higher eukaryotes has been explored in only a limited number of studies. We previously reported the roles of several genes involved in CoQ synthesis in the fission yeast Schizosaccharomyces pombe. Here, we expand these findings by identifying ten genes (dps1, dlp1, ppt1, and coq3-9 that are required for CoQ synthesis. CoQ10-deficient S. pombe coq deletion strains were generated and characterized. All mutant fission yeast strains were sensitive to oxidative stress, produced a large amount of sulfide, required an antioxidant to grow on minimal medium, and did not survive at the stationary phase. To compare the biosynthetic pathway of CoQ in fission yeast with that in higher eukaryotes, the ability of CoQ biosynthetic genes from humans and plants (Arabidopsis thaliana to functionally complement the S. pombe coq deletion strains was determined. With the exception of COQ9, expression of all other human and plant COQ genes recovered CoQ10 production by the fission yeast coq deletion strains, although the addition of a mitochondrial targeting sequence was required for human COQ3 and COQ7, as well as A. thaliana COQ6. In summary, this study describes the functional conservation of CoQ biosynthetic genes between yeasts, humans, and plants.
Full Text Available Localizing messenger RNAs at specific subcellular sites is a conserved mechanism for targeting the synthesis of cytoplasmic proteins to distinct subcellular domains, thereby generating the asymmetric protein distributions necessary for cellular and developmental polarity. However, the full range of transcripts that are asymmetrically distributed in specialized cell types, and the significance of their localization, especially in the nervous system, are not known. We used the EP-MS2 method, which combines EP transposon insertion with the MS2/MCP in vivo fluorescent labeling system, to screen for novel localized transcripts in polarized cells, focusing on the highly branched Drosophila class IV dendritic arborization neurons. Of a total of 541 lines screened, we identified 55 EP-MS2 insertions producing transcripts that were enriched in neuronal processes, particularly in dendrites. The 47 genes identified by these insertions encode molecularly diverse proteins, and are enriched for genes that function in neuronal development and physiology. RNAi-mediated knockdown confirmed roles for many of the candidate genes in dendrite morphogenesis. We propose that the transport of mRNAs encoded by these genes into the dendrites allows their expression to be regulated on a local scale during the dynamic developmental processes of dendrite outgrowth, branching, and/or remodeling.
Hong, R. L., Hamaguchi, L., Busch, M. A., and Weigel, D.
OAK-B135 In Arabidopsis thaliana, cis-regulatory sequences of the floral homeotic gene AGAMOUS (AG) are located in the second intron. This 3 kb intron contains binding sites for two direct activators of AG, LEAFY (LFY) and WUSCHEL (WUS), along with other putative regulatory elements. We have used phylogenetic footprinting and the related technique of phylogenetic shadowing to identify putative cis-regulatory elements in this intron. Among 29 Brassicaceae, several other motifs, but not the LFY and WUS binding sites previously identified, are largely invariant. Using reporter gene analyses, we tested six of these motifs and found that they are all functionally important for activity of AG regulatory sequences in A. thaliana. Although there is little obvious sequence similarity outside the Brassicaceae, the intron from cucumber AG has at least partial activity in A. thaliana. Our studies underscore the value of the comparative approach as a tool that complements gene-by-gene promoter dissection, but also highlight that sequence-based studies alone are insufficient for a complete identification of cis-regulatory sites.
Johnston, Iain G; Williams, Ben P
Since their endosymbiotic origin, mitochondria have lost most of their genes. Although many selective mechanisms underlying the evolution of mitochondrial genomes have been proposed, a data-driven exploration of these hypotheses is lacking, and a quantitatively supported consensus remains absent. We developed HyperTraPS, a methodology coupling stochastic modeling with Bayesian inference, to identify the ordering of evolutionary events and suggest their causes. Using 2015 complete mitochondrial genomes, we inferred evolutionary trajectories of mtDNA gene loss across the eukaryotic tree of life. We find that proteins comprising the structural cores of the electron transport chain are preferentially encoded within mitochondrial genomes across eukaryotes. A combination of high GC content and high protein hydrophobicity is required to explain patterns of mtDNA gene retention; a model that accounts for these selective pressures can also predict the success of artificial gene transfer experiments in vivo. This work provides a general method for data-driven inference of the ordering of evolutionary and progressive events, here identifying the distinct features shaping mitochondrial genomes of present-day species. Copyright © 2016 Elsevier Inc. All rights reserved.
Shen, Changbing; Gao, Jing; Sheng, Yujun; Dou, Jinfa; Zhou, Fusheng; Zheng, Xiaodong; Ko, Randy; Tang, Xianfa; Zhu, Caihong; Yin, Xianyong; Sun, Liangdan; Cui, Yong; Zhang, Xuejun
Vitiligo is an autoimmune disease with a strong genetic component, characterized by areas of depigmented skin resulting from loss of epidermal melanocytes. Genetic factors are known to play key roles in vitiligo through discoveries in association studies and family studies. Previously, vitiligo susceptibility genes were mainly revealed through linkage analysis and candidate gene studies. Recently, our understanding of the genetic basis of vitiligo has been rapidly advancing through genome-wide association study (GWAS). More than 40 robust susceptible loci have been identified and confirmed to be associated with vitiligo by using GWAS. Most of these associated genes participate in important pathways involved in the pathogenesis of vitiligo. Many susceptible loci with unknown functions in the pathogenesis of vitiligo have also been identified, indicating that additional molecular mechanisms may contribute to the risk of developing vitiligo. In this review, we summarize the key loci that are of genome-wide significance, which have been shown to influence vitiligo risk. These genetic loci may help build the foundation for genetic diagnosis and personalize treatment for patients with vitiligo in the future. However, substantial additional studies, including gene-targeted and functional studies, are required to confirm the causality of the genetic variants and their biological relevance in the development of vitiligo. PMID:26870082
Orabona, Guilherme; Morgan, Thomas; Haataja, Ritva; Hallman, Mikko; Puttonen, Hilkka; Menon, Ramkumar; Kuczynski, Edward; Norwitz, Errol; Snegovskikh, Victoria; Palotie, Aarno; Fellman, Vineta; DeFranco, Emily A.; Chaudhari, Bimal P.; McGregor, Tracy L.; McElroy, Jude J.; Oetjens, Matthew T.; Teramo, Kari; Borecki, Ingrid; Fay, Justin; Muglia, Louis
Coordination of fetal maturation with birth timing is essential for mammalian reproduction. In humans, preterm birth is a disorder of profound global health significance. The signals initiating parturition in humans have remained elusive, due to divergence in physiological mechanisms between humans and model organisms typically studied. Because of relatively large human head size and narrow birth canal cross-sectional area compared to other primates, we hypothesized that genes involved in parturition would display accelerated evolution along the human and/or higher primate phylogenetic lineages to decrease the length of gestation and promote delivery of a smaller fetus that transits the birth canal more readily. Further, we tested whether current variation in such accelerated genes contributes to preterm birth risk. Evidence from allometric scaling of gestational age suggests human gestation has been shortened relative to other primates. Consistent with our hypothesis, many genes involved in reproduction show human acceleration in their coding or adjacent noncoding regions. We screened >8,400 SNPs in 150 human accelerated genes in 165 Finnish preterm and 163 control mothers for association with preterm birth. In this cohort, the most significant association was in FSHR, and 8 of the 10 most significant SNPs were in this gene. Further evidence for association of a linkage disequilibrium block of SNPs in FSHR, rs11686474, rs11680730, rs12473870, and rs1247381 was found in African Americans. By considering human acceleration, we identified a novel gene that may be associated with preterm birth, FSHR. We anticipate other human accelerated genes will similarly be associated with preterm birth risk and elucidate essential pathways for human parturition. PMID:21533219
Velleman Sandra G
Full Text Available Abstract Background Skeletal muscle growth and development from embryo to adult consists of a series of carefully regulated changes in gene expression. Understanding these developmental changes in agriculturally important species is essential to the production of high quality meat products. For example, consumer demand for lean, inexpensive meat products has driven the turkey industry to unprecedented production through intensive genetic selection. However, achievements of increased body weight and muscle mass have been countered by an increased incidence of myopathies and meat quality defects. In a previous study, we developed and validated a turkey skeletal muscle-specific microarray as a tool for functional genomics studies. The goals of the current study were to utilize this microarray to elucidate functional pathways of genes responsible for key events in turkey skeletal muscle development and to compare differences in gene expression between two genetic lines of turkeys. To achieve these goals, skeletal muscle samples were collected at three critical stages in muscle development: 18d embryo (hyperplasia, 1d post-hatch (shift from myoblast-mediated growth to satellite cell-modulated growth by hypertrophy, and 16wk (market age from two genetic lines: a randombred control line (RBC2 maintained without selection pressure, and a line (F selected from the RBC2 line for increased 16wk body weight. Array hybridizations were performed in two experiments: Experiment 1 directly compared the developmental stages within genetic line, while Experiment 2 directly compared the two lines within each developmental stage. Results A total of 3474 genes were differentially expressed (false discovery rate; FDR Conclusions The current study identified gene pathways and uncovered novel genes important in turkey muscle growth and development. Future experiments will focus further on several of these candidate genes and the expression and mechanism of action of
Riazuddin, S; Hussain, M; Razzaq, A; Iqbal, Z; Shahzad, M; Polla, D L; Song, Y; van Beusekom, E; Khan, A A; Tomas-Roca, L; Rashid, M; Zahoor, M Y; Wissink-Lindhout, W M; Basra, M A R; Ansar, M; Agha, Z; van Heeswijk, K; Rasheed, F; Van de Vorst, M; Veltman, J A; Gilissen, C; Akram, J; Kleefstra, T; Assir, M Z; Grozeva, D; Carss, K; Raymond, F L; O'Connor, T D; Riazuddin, S A; Khan, S N; Ahmed, Z M; de Brouwer, A P M; van Bokhoven, H; Riazuddin, S
Intellectual disability (ID) is a clinically and genetically heterogeneous disorder, affecting 1-3% of the general population. Although research into the genetic causes of ID has recently gained momentum, identification of pathogenic mutations that cause autosomal recessive ID (ARID) has lagged behind, predominantly due to non-availability of sizeable families. Here we present the results of exome sequencing in 121 large consanguineous Pakistani ID families. In 60 families, we identified homozygous or compound heterozygous DNA variants in a single gene, 30 affecting reported ID genes and 30 affecting novel candidate ID genes. Potential pathogenicity of these alleles was supported by co-segregation with the phenotype, low frequency in control populations and the application of stringent bioinformatics analyses. In another eight families segregation of multiple pathogenic variants was observed, affecting 19 genes that were either known or are novel candidates for ID. Transcriptome profiles of normal human brain tissues showed that the novel candidate ID genes formed a network significantly enriched for transcriptional co-expression (P<0.0001) in the frontal cortex during fetal development and in the temporal-parietal and sub-cortex during infancy through adulthood. In addition, proteins encoded by 12 novel ID genes directly interact with previously reported ID proteins in six known pathways essential for cognitive function (P<0.0001). These results suggest that disruptions of temporal parietal and sub-cortical neurogenesis during infancy are critical to the pathophysiology of ID. These findings further expand the existing repertoire of genes involved in ARID, and provide new insights into the molecular mechanisms and the transcriptome map of ID.
Proudhon, D; Wei, J; Briat, J; Theil, E C
Ferritin, a protein widespread in nature, concentrates iron approximately 10(11)-10(12)-fold above the solubility within a spherical shell of 24 subunits; it derives in plants and animals from a common ancestor (based on sequence) but displays a cytoplasmic location in animals compared to the plastid in contemporary plants. Ferritin gene regulation in plants and animals is altered by development, hormones, and excess iron; iron signals target DNA in plants but mRNA in animals. Evolution has thus conserved the two end points of ferritin gene expression, the physiological signals and the protein structure, while allowing some divergence of the genetic mechanisms. Comparison of ferritin gene organization in plants and animals, made possible by the cloning of a dicot (soybean) ferritin gene presented here and the recent cloning of two monocot (maize) ferritin genes, shows evolutionary divergence in ferritin gene organization between plants and animals but conservation among plants or among animals; divergence in the genetic mechanism for iron regulation is reflected by the absence in all three plant genes of the IRE, a highly conserved, noncoding sequence in vertebrate animal ferritin mRNA. In plant ferritin genes, the number of introns (n = 7) is higher than in animals (n = 3). Second, no intron positions are conserved when ferritin genes of plants and animals are compared, although all ferritin gene introns are in the coding region; within kingdoms, the intron positions in ferritin genes are conserved. Finally, secondary protein structure has no apparent relationship to intron/exon boundaries in plant ferritin genes, whereas in animal ferritin genes the correspondence is high. The structural differences in introns/exons among phylogenetically related ferritin coding sequences and the high conservation of the gene structure within plant or animal kingdoms of the gene structure within plant or animal kingdoms suggest that kingdom-specific functional constraints may
Jiang, Peng; Scarpa, Joseph R; Fitzpatrick, Karrie; Losic, Bojan; Gao, Vance D; Hao, Ke; Summa, Keith C; Yang, He S; Zhang, Bin; Allada, Ravi; Vitaterna, Martha H; Turek, Fred W; Kasarskis, Andrew
Sleep dysfunction and stress susceptibility are comorbid complex traits that often precede and predispose patients to a variety of neuropsychiatric diseases. Here, we demonstrate multilevel organizations of genetic landscape, candidate genes, and molecular networks associated with 328 stress and sleep traits in a chronically stressed population of 338 (C57BL/6J × A/J) F2 mice. We constructed striatal gene co-expression networks, revealing functionally and cell-type-specific gene co-regulations important for stress and sleep. Using a composite ranking system, we identified network modules most relevant for 15 independent phenotypic categories, highlighting a mitochondria/synaptic module that links sleep and stress. The key network regulators of this module are overrepresented with genes implicated in neuropsychiatric diseases. Our work suggests that the interplay among sleep, stress, and neuropathology emerges from genetic influences on gene expression and their collective organization through complex molecular networks, providing a framework for interrogating the mechanisms underlying sleep, stress susceptibility, and related neuropsychiatric disorders. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Tuo, Youlin; An, Ning; Zhang, Ming
The aim of the present study was to investigate the feature genes in metastatic breast cancer samples. A total of 5 expression profiles of metastatic breast cancer samples were downloaded from the Gene Expression Omnibus database, which were then analyzed using the MetaQC and MetaDE packages in R language. The feature genes between metastasis and non‑metastasis samples were screened under the threshold of PSVM) classifier training and verification. The accuracy of the SVM classifier was then evaluated using another independent dataset from The Cancer Genome Atlas database. Finally, function and pathway enrichment analyses for genes in the SVM classifier were performed. A total of 541 feature genes were identified between metastatic and non‑metastatic samples. The top 10 genes with the highest betweenness centrality values in the PPI network of feature genes were Nuclear RNA Export Factor 1, cyclin‑dependent kinase 2 (CDK2), myelocytomatosis proto‑oncogene protein (MYC), Cullin 5, SHC Adaptor Protein 1, Clathrin heavy chain, Nucleolin, WD repeat domain 1, proteasome 26S subunit non‑ATPase 2 and telomeric repeat binding factor 2. The cyclin‑dependent kinase inhibitor 1A (CDKN1A), E2F transcription factor 1 (E2F1), and MYC interacted with CDK2. The SVM classifier constructed by the top 30 feature genes was able to distinguish metastatic samples from non‑metastatic samples [correct rate, specificity, positive predictive value and negative predictive value >0.89; sensitivity >0.84; area under the receiver operating characteristic curve (AUROC) >0.96]. The verification of the SVM classifier in an independent dataset (35 metastatic samples and 143 non‑metastatic samples) revealed an accuracy of 94.38% and AUROC of 0.958. Cell cycle associated functions and pathways were the most significant terms of the 30 feature genes. A SVM classifier was constructed to assess the possibility of breast cancer metastasis, which presented high accuracy in several
Full Text Available Abstract Background Hepatitis C virus (HCV RNA synthesis and protein expression affect cell homeostasis by modulation of gene expression. The impact of HCV replication on global cell transcription has not been fully evaluated. Thus, we analysed the expression profiles of different clones of human hepatoma-derived Huh-7 cells carrying a self-replicating HCV RNA which express all viral proteins (HCV replicon system. Results First, we compared the expression profile of HCV replicon clone 21-5 with both the Huh-7 parental cells and the 21-5 cured (21-5c cells. In these latter, the HCV RNA has been eliminated by IFN-α treatment. To confirm data, we also analyzed microarray results from both the 21-5 and two other HCV replicon clones, 22-6 and 21-7, compared to the Huh-7 cells. The study was carried out by using the Applied Biosystems (AB Human Genome Survey Microarray v1.0 which provides 31,700 probes that correspond to 27,868 human genes. Microarray analysis revealed a specific transcriptional program induced by HCV in replicon cells respect to both IFN-α-cured and Huh-7 cells. From the original datasets of differentially expressed genes, we selected by Venn diagrams a final list of 38 genes modulated by HCV in all clones. Most of the 38 genes have never been described before and showed high fold-change associated with significant p-value, strongly supporting data reliability. Classification of the 38 genes by Panther System identified functional categories that were significantly enriched in this gene set, such as histones and ribosomal proteins as well as extracellular matrix and intracellular protein traffic. The dataset also included new genes involved in lipid metabolism, extracellular matrix and cytoskeletal network, which may be critical for HCV replication and pathogenesis. Conclusion Our data provide a comprehensive analysis of alterations in gene expression induced by HCV replication and reveal modulation of new genes potentially useful
Zanotto-Filho, Alfeu; Dashnamoorthy, Ravi; Loranc, Eva; de Souza, Luis H T; Moreira, José C F; Suresh, Uthra; Chen, Yidong; Bishop, Alexander J R
Alkylating agents are a key component of cancer chemotherapy. Several cellular mechanisms are known to be important for its survival, particularly DNA repair and xenobiotic detoxification, yet genomic screens indicate that additional cellular components may be involved. Elucidating these components has value in either identifying key processes that can be modulated to improve chemotherapeutic efficacy or may be altered in some cancers to confer chemoresistance. We therefore set out to reevaluate our prior Drosophila RNAi screening data by comparison to gene expression arrays in order to determine if we could identify any novel processes in alkylation damage survival. We noted a consistent conservation of alkylation survival pathways across platforms and species when the analysis was conducted on a pathway/process level rather than at an individual gene level. Better results were obtained when combining gene lists from two datasets (RNAi screen plus microarray) prior to analysis. In addition to previously identified DNA damage responses (p53 signaling and Nucleotide Excision Repair), DNA-mRNA-protein metabolism (transcription/translation) and proteasome machinery, we also noted a highly conserved cross-species requirement for NRF2, glutathione (GSH)-mediated drug detoxification and Endoplasmic Reticulum stress (ER stress)/Unfolded Protein Responses (UPR) in cells exposed to alkylation. The requirement for GSH, NRF2 and UPR in alkylation survival was validated by metabolomics, protein studies and functional cell assays. From this we conclude that RNAi/gene expression fusion is a valid strategy to rapidly identify key processes that may be extendable to other contexts beyond damage survival.
Full Text Available Alkylating agents are a key component of cancer chemotherapy. Several cellular mechanisms are known to be important for its survival, particularly DNA repair and xenobiotic detoxification, yet genomic screens indicate that additional cellular components may be involved. Elucidating these components has value in either identifying key processes that can be modulated to improve chemotherapeutic efficacy or may be altered in some cancers to confer chemoresistance. We therefore set out to reevaluate our prior Drosophila RNAi screening data by comparison to gene expression arrays in order to determine if we could identify any novel processes in alkylation damage survival. We noted a consistent conservation of alkylation survival pathways across platforms and species when the analysis was conducted on a pathway/process level rather than at an individual gene level. Better results were obtained when combining gene lists from two datasets (RNAi screen plus microarray prior to analysis. In addition to previously identified DNA damage responses (p53 signaling and Nucleotide Excision Repair, DNA-mRNA-protein metabolism (transcription/translation and proteasome machinery, we also noted a highly conserved cross-species requirement for NRF2, glutathione (GSH-mediated drug detoxification and Endoplasmic Reticulum stress (ER stress/Unfolded Protein Responses (UPR in cells exposed to alkylation. The requirement for GSH, NRF2 and UPR in alkylation survival was validated by metabolomics, protein studies and functional cell assays. From this we conclude that RNAi/gene expression fusion is a valid strategy to rapidly identify key processes that may be extendable to other contexts beyond damage survival.
Williams, Ben; Johnston, Iain
Since their endosymbiotic origin, mitochondria have lost most of their genes. Although many selective mechanisms underlying the evolution of mitochondrial genomes have been proposed, a data-driven exploration of these hypotheses is lacking, and a quantitatively supported consensus remains absent. We developed HyperTraPS, a methodology coupling stochastic modelling with Bayesian inference, to identify the ordering of evolutionary events and suggest their causes. Using 2015 complete mitochondri...
Ren, Zhonglu; Wang, Wenhui; Li, Jinming
Identifying colon cancer subtypes based on molecular signatures may allow for a more rational, patient-specific approach to therapy in the future. Classifications using gene expression data have been attempted before with little concordance between the different studies carried out. In this study we aimed to uncover subtypes of colon cancer that have distinct biological characteristics and identify a set of novel biomarkers which could best reflect the clinical and/or biological characteristics of each subtype. Clustering analysis and discriminant analysis were utilized to discover the subtypes in two different molecular levels on 153 colon cancer samples from The Cancer Genome Atlas (TCGA) Data Portal. At gene expression level, we identified two major subtypes, ECL1 (expression cluster 1) and ECL2 (expression cluster 2) and a list of signature genes. Due to the heterogeneity of colon cancer, the subtype ECL1 can be further subdivided into three nested subclasses, and HOTAIR were found upregulated in subclass 2. At DNA methylation level, we uncovered three major subtypes, MCL1 (methylation cluster 1), MCL2 (methylation cluster 2) and MCL3 (methylation cluster 3). We found only three subtypes of CpG island methylator phenotype (CIMP) in colon cancer instead of the four subtypes in the previous reports, and we found no sufficient evidence to subdivide MCL3 into two distinct subgroups.
Higgins, Michael J.; Day, Colleen D.; Smilinich, Nancy J.; Ni, L.; Cooper, Paul R.; Nowak, Norma J.; Davies, Chris; de Jong, Pieter J.; Hejtmancik, Fielding; Evans, Glen A.; Smith, Richard J.H.; Shows, Thomas B.
Usher syndrome 1C (USH1C) is a congenital condition manifesting profound hearing loss, the absence of vestibular function, and eventual retinal degeneration. The USH1C locus has been mapped genetically to a 2- to 3-cM interval in 11p14–15.1 between D11S899 and D11S861. In an effort to identify the USH1C disease gene we have isolated the region between these markers in yeast artificial chromosomes (YACs) using a combination of STS content mapping and Alu–PCR hybridization. The YAC contig is ∼3.5 Mb and has located several other loci within this interval, resulting in the order CEN-LDHA-SAA1-TPH-D11S1310-(D11S1888/KCNC1)-MYOD1-D11S902D11S921-D11S1890-TEL. Subsequent haplotyping and homozygosity analysis refined the location of the disease gene to a 400-kb interval between D11S902 and D11S1890 with all affected individuals being homozygous for the internal marker D11S921. To facilitate gene identification, the critical region has been converted into P1 artificial chromosome (PAC) clones using sequence-tagged sites (STSs) mapped to the YAC contig, Alu–PCR products generated from the YACs, and PAC end probes. A contig of >50 PAC clones has been assembled between D11S1310 and D11S1890, confirming the order of markers used in haplotyping. Three PAC clones representing nearly two-thirds of the USH1C critical region have been sequenced. PowerBLAST analysis identified six clusters of expressed sequence tags (ESTs), two known genes (BIR,SUR1) mapped previously to this region, and a previously characterized but unmapped gene NEFA (DNA binding/EF hand/acidic amino-acid-rich). GRAIL analysis identified 11 CpG islands and 73 exons of excellent quality. These data allowed the construction of a transcription map for the USH1C critical region, consisting of three known genes and six or more novel transcripts. Based on their map location, these loci represent candidate disease loci for USH1C. The NEFA gene was assessed as the USH1C locus by the sequencing of an amplified NEFA
Kantor, Elizabeth D.; Hutter, Carolyn M.; Minnier, Jessica; Berndt, Sonja I.; Brenner, Hermann; Caan, Bette J.; Campbell, Peter T.; Carlson, Christopher S.; Casey, Graham; Chan, Andrew T.; Chang-Claude, Jenny; Chanock, Stephen J.; Cotterchio, Michelle; Du, Mengmeng; Duggan, David; Fuchs, Charles S.; Giovannucci, Edward L.; Gong, Jian; Harrison, Tabitha A.; Hayes, Richard B.; Henderson, Brian E.; Hoffmeister, Michael; Hopper, John L.; Jenkins, Mark A.; Jiao, Shuo; Kolonel, Laurence N.; Le Marchand, Loic; Lemire, Mathieu; Ma, Jing; Newcomb, Polly A.; Ochs-Balcom, Heather M.; Pflugeisen, Bethann M.; Potter, John D.; Rudolph, Anja; Schoen, Robert E.; Seminara, Daniela; Slattery, Martha L.; Stelling, Deanna L.; Thomas, Fridtjof; Thornquist, Mark; Ulrich, Cornelia M.; Warnick, Greg S.; Zanke, Brent W.; Peters, Ulrike; Hsu, Li; White, Emily
BACKGROUND Genome-wide association studies have identified several single nucleotide polymorphisms (SNPs) that are associated with risk of colorectal cancer (CRC). Prior research has evaluated the presence of gene-environment interaction involving the first 10 identified susceptibility loci, but little work has been conducted on interaction involving SNPs at recently identified susceptibility loci, including: rs10911251, rs6691170, rs6687758, rs11903757, rs10936599, rs647161, rs1321311, rs719725, rs1665650, rs3824999, rs7136702, rs11169552, rs59336, rs3217810, rs4925386, and rs2423279. METHODS Data on 9160 cases and 9280 controls from the Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO) and Colon Cancer Family Registry (CCFR) were used to evaluate the presence of interaction involving the above-listed SNPs and sex, body mass index (BMI), alcohol consumption, smoking, aspirin use, post-menopausal hormone (PMH) use, as well as intake of dietary calcium, dietary fiber, dietary folate, red meat, processed meat, fruit, and vegetables. Interaction was evaluated using a fixed-effects meta-analysis of an efficient Empirical Bayes estimator, and permutation was used to account for multiple comparisons. RESULTS None of the permutation-adjusted p-values reached statistical significance. CONCLUSIONS The associations between recently identified genetic susceptibility loci and CRC are not strongly modified by sex, BMI, alcohol, smoking, aspirin, PMH use, and various dietary factors. IMPACT Results suggest no evidence of strong gene-environment interactions involving the recently identified 16 susceptibility loci for CRC taken one at a time. PMID:24994789
Hua, Zhi-Gang; Lin, Yan; Yuan, Ya-Zhou; Yang, De-Chang; Wei, Wen; Guo, Feng-Biao
In 2003, we developed an ab initio program, ZCURVE 1.0, to find genes in bacterial and archaeal genomes. In this work, we present the updated version (i.e. ZCURVE 3.0). Using 422 prokaryotic genomes, the average accuracy was 93.7% with the updated version, compared with 88.7% with the original version. Such results also demonstrate that ZCURVE 3.0 is comparable with Glimmer 3.02 and may provide complementary predictions to it. In fact, the joint application of the two programs generated better results by correctly finding more annotated genes while also containing fewer false-positive predictions. As the exclusive function, ZCURVE 3.0 contains one post-processing program that can identify essential genes with high accuracy (generally >90%). We hope ZCURVE 3.0 will receive wide use with the web-based running mode. The updated ZCURVE can be freely accessed from http://cefg.uestc.edu.cn/zcurve/ or http://tubic.tju.edu.cn/zcurveb/ without any restrictions. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Hua, Zhi-Gang; Lin, Yan; Yuan, Ya-Zhou; Yang, De-Chang; Wei, Wen; Guo, Feng-Biao
In 2003, we developed an ab initio program, ZCURVE 1.0, to find genes in bacterial and archaeal genomes. In this work, we present the updated version (i.e. ZCURVE 3.0). Using 422 prokaryotic genomes, the average accuracy was 93.7% with the updated version, compared with 88.7% with the original version. Such results also demonstrate that ZCURVE 3.0 is comparable with Glimmer 3.02 and may provide complementary predictions to it. In fact, the joint application of the two programs generated better results by correctly finding more annotated genes while also containing fewer false-positive predictions. As the exclusive function, ZCURVE 3.0 contains one post-processing program that can identify essential genes with high accuracy (generally >90%). We hope ZCURVE 3.0 will receive wide use with the web-based running mode. The updated ZCURVE can be freely accessed from http://cefg.uestc.edu.cn/zcurve/ or http://tubic.tju.edu.cn/zcurveb/ without any restrictions. PMID:25977299
Bitner-Glindzicz, M; Lindley, K J; Rutland, P; Blaydon, D; Smith, V V; Milla, P J; Hussain, K; Furth-Lavi, J; Cosgrove, K E; Shepherd, R M; Barnes, P D; O'Brien, R E; Farndon, P A; Sowden, J; Liu, X Z; Scanlan, M J; Malcolm, S; Dunne, M J; Aynsley-Green, A; Glaser, B
Usher syndrome type 1 describes the association of profound, congenital sensorineural deafness, vestibular hypofunction and childhood onset retinitis pigmentosa. It is an autosomal recessive condition and is subdivided on the basis of linkage analysis into types 1A through 1E. Usher type 1C maps to the region containing the genes ABCC8 and KCNJ11 (encoding components of ATP-sensitive K + (KATP) channels), which may be mutated in patients with hyperinsulinism. We identified three individuals from two consanguineous families with severe hyperinsulinism, profound congenital sensorineural deafness, enteropathy and renal tubular dysfunction. The molecular basis of the disorder is a homozygous 122-kb deletion of 11p14-15, which includes part of ABCC8 and overlaps with the locus for Usher syndrome type 1C and DFNB18. The centromeric boundary of this deletion includes part of a gene shown to be mutated in families with type 1C Usher syndrome, and is hence assigned the name USH1C. The pattern of expression of the USH1C protein is consistent with the clinical features exhibited by individuals with the contiguous gene deletion and with isolated Usher type 1C.
Palumbo, Maria Concetta; Zenoni, Sara; Fasoli, Marianna; Massonnet, Mélanie; Farina, Lorenzo; Castiglione, Filippo; Pezzotti, Mario; Paci, Paola
We developed an approach that integrates different network-based methods to analyze the correlation network arising from large-scale gene expression data. By studying grapevine (Vitis vinifera) and tomato (Solanum lycopersicum) gene expression atlases and a grapevine berry transcriptomic data set during the transition from immature to mature growth, we identified a category named "fight-club hubs" characterized by a marked negative correlation with the expression profiles of neighboring genes in the network. A special subset named "switch genes" was identified, with the additional property of many significant negative correlations outside their own group in the network. Switch genes are involved in multiple processes and include transcription factors that may be considered master regulators of the previously reported transcriptome remodeling that marks the developmental shift from immature to mature growth. All switch genes, expressed at low levels in vegetative/green tissues, showed a significant increase in mature/woody organs, suggesting a potential regulatory role during the developmental transition. Finally, our analysis of tomato gene expression data sets showed that wild-type switch genes are downregulated in ripening-deficient mutants. The identification of known master regulators of tomato fruit maturation suggests our method is suitable for the detection of key regulators of organ development in different fleshy fruit crops. © 2014 American Society of Plant Biologists. All rights reserved.
Secretome Characterization and Correlation Analysis Reveal Putative Pathogenicity Mechanisms and Identify Candidate Avirulence Genes in the Wheat Stripe Rust Fungus Puccinia striiformis f. sp. tritici.
Xia, Chongjing; Wang, Meinan; Cornejo, Omar E; Jiwan, Derick A; See, Deven R; Chen, Xianming
Stripe (yellow) rust, caused by Puccinia striiformis f. sp. tritici ( Pst ), is one of the most destructive diseases of wheat worldwide. Planting resistant cultivars is an effective way to control this disease, but race-specific resistance can be overcome quickly due to the rapid evolving Pst population. Studying the pathogenicity mechanisms is critical for understanding how Pst virulence changes and how to develop wheat cultivars with durable resistance to stripe rust. We re-sequenced 7 Pst isolates and included additional 7 previously sequenced isolates to represent balanced virulence/avirulence profiles for several avirulence loci in seretome analyses. We observed an uneven distribution of heterozygosity among the isolates. Secretome comparison of Pst with other rust fungi identified a large portion of species-specific secreted proteins, suggesting that they may have specific roles when interacting with the wheat host. Thirty-two effectors of Pst were identified from its secretome. We identified candidates for Avr genes corresponding to six Yr genes by correlating polymorphisms for effector genes to the virulence/avirulence profiles of the 14 Pst isolates. The putative AvYr76 was present in the avirulent isolates, but absent in the virulent isolates, suggesting that deleting the coding region of the candidate avirulence gene has produced races virulent to resistance gene Yr76 . We conclude that incorporating avirulence/virulence phenotypes into correlation analysis with variations in genomic structure and secretome, particularly presence/absence polymorphisms of effectors, is an efficient way to identify candidate Avr genes in Pst . The candidate effector genes provide a rich resource for further studies to determine the evolutionary history of Pst populations and the co-evolutionary arms race between Pst and wheat. The Avr candidates identified in this study will lead to cloning avirulence genes in Pst , which will enable us to understand molecular mechanisms
Custers, D; Van Praag, N; Courselle, P; Apers, S; Deconinck, E
Erectile dysfunction (ED) is a sexual disorder characterized by the inability to achieve or maintain a sufficiently rigid erection. Despite the availability of non-invasive oral treatment options, many patients turn to herbal alternatives. Furthermore, herbal supplements are increasingly gaining popularity in industrialized countries and, as a consequence, quality control is a highly important issue. Unfortunately, this is not a simple task since plants are often crushed and mixed with other plants, which complicates their identification by usage of classical approaches such as microscopy. The aim of this study was to explore the potential use of chromatographic fingerprinting to identify plants present in herbal preparations intended for the treatment of ED. To achieve this goal, a HPLC-PDA and a HPLC-MS method were developed, using a full factorial experimental design in order to acquire characteristic fingerprints of three plants which are potentially beneficial for treating ED: Epimedium spp., Pausinystalia yohimbe and Tribulus terrestris. The full factorial design demonstrated that for all three plant references a C8 column (250mm×4.6mm; 5µm particle size) is best suited; methanol and an ammonium formate buffer (pH 3) were found to be the best constituents for the mobile phase. The suitability of this strategy was demonstrated by analysing several self-made triturations in three different botanical matrices, which mimic the influential effects that could be expected when analysing herbal supplements. To conclude, this study demonstrates that chromatographic fingerprinting could provide a useful means to identify plants in a complex herbal mixture. Copyright © 2016 Elsevier B.V. All rights reserved.
Ckurshumova, Wenzislava; Scarpella, Enrico; Goldstein, Rochelle S; Berleth, Thomas
Genes expressed in vascular tissues have been identified by several strategies, usually with a focus on mature vascular cells. In this study, we explored the possibility of using two opposite types of altered tissue compositions in combination with a double-filter selection to identify genes with a high probability of vascular expression in early organ primordia. Specifically, we generated full-transcriptome microarray profiles of plants with (a) genetically strongly reduced and (b) pharmacologically vastly increased vascular tissues and identified a reproducible cohort of 158 transcripts that fulfilled the dual requirement of being underrepresented in (a) and overrepresented in (b). In order to assess the predictive value of our identification scheme for vascular gene expression, we determined the expression patterns of genes in two unbiased subsamples. First, we assessed the expression patterns of all twenty annotated transcription factor genes from the cohort of 158 genes and found that seventeen of the twenty genes were preferentially expressed in leaf vascular cells. Remarkably, fifteen of these seventeen vascular genes were clearly expressed already very early in leaf vein development. Twelve genes with published leaf expression patterns served as a second subsample to monitor the representation of vascular genes in our cohort. Of those twelve genes, eleven were preferentially expressed in leaf vascular tissues. Based on these results we propose that our compendium of 158 genes represents a sample that is highly enriched for genes expressed in vascular tissues and that our approach is particularly suited to detect genes expressed in vascular cell lineages at early stages of their inception. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Full Text Available Similar to other malignancies, urothelial carcinoma (UC is characterized by specific recurrent chromosomal aberrations and gene mutations. However, the interconnection between specific genomic alterations, and how patterns of chromosomal alterations adhere to different molecular subgroups of UC, is less clear. We applied tiling resolution array CGH to 146 cases of UC and identified a number of regions harboring recurrent focal genomic amplifications and deletions. Several potential oncogenes were included in the amplified regions, including known oncogenes like E2F3, CCND1, and CCNE1, as well as new candidate genes, such as SETDB1 (1q21, and BCL2L1 (20q11. We next combined genome profiling with global gene expression, gene mutation, and protein expression data and identified two major genomic circuits operating in urothelial carcinoma. The first circuit was characterized by FGFR3 alterations, overexpression of CCND1, and 9q and CDKN2A deletions. The second circuit was defined by E3F3 amplifications and RB1 deletions, as well as gains of 5p, deletions at PTEN and 2q36, 16q, 20q, and elevated CDKN2A levels. TP53/MDM2 alterations were common for advanced tumors within the two circuits. Our data also suggest a possible RAS/RAF circuit. The tumors with worst prognosis showed a gene expression profile that indicated a keratinized phenotype. Taken together, our integrative approach revealed at least two separate networks of genomic alterations linked to the molecular diversity seen in UC, and that these circuits may reflect distinct pathways of tumor development.
Full Text Available Abstract Background Multiple epigenetic and genetic changes have been reported in colorectal tumors, but few of these have clinical impact. This study aims to pinpoint epigenetic markers that can discriminate between non-malignant and malignant tissue from the large bowel, i.e. markers with diagnostic potential. The methylation status of eleven genes (ADAMTS1, CDKN2A, CRABP1, HOXA9, MAL, MGMT, MLH1, NR3C1, PTEN, RUNX3, and SCGB3A1 was determined in 154 tissue samples including normal mucosa, adenomas, and carcinomas of the colorectum. The gene-specific and widespread methylation status among the carcinomas was related to patient gender and age, and microsatellite instability status. Possible CIMP tumors were identified by comparing the methylation profile with microsatellite instability (MSI, BRAF-, KRAS-, and TP53 mutation status. Results The mean number of methylated genes per sample was 0.4 in normal colon mucosa from tumor-free individuals, 1.2 in mucosa from cancerous bowels, 2.2 in adenomas, and 3.9 in carcinomas. Widespread methylation was found in both adenomas and carcinomas. The promoters of ADAMTS1, MAL, and MGMT were frequently methylated in benign samples as well as in malignant tumors, independent of microsatellite instability. In contrast, normal mucosa samples taken from bowels without tumor were rarely methylated for the same genes. Hypermethylated CRABP1, MLH1, NR3C1, RUNX3, and SCGB3A1 were shown to be identifiers of carcinomas with microsatellite instability. In agreement with the CIMP concept, MSI and mutated BRAF were associated with samples harboring hypermethylation of several target genes. Conclusion Methylated ADAMTS1, MGMT, and MAL are suitable as markers for early tumor detection.
Badea, Liviu; Herlea, Vlad; Dima, Simona Olimpia; Dumitrascu, Traian; Popescu, Irinel
The precise details of pancreatic ductal adenocarcinoma (PDAC) pathogenesis are still insufficiently known, requiring the use of high-throughput methods. However, PDAC is especially difficult to study using microarrays due to its strong desmoplastic reaction, which involves a hyperproliferating stroma that effectively "masks" the contribution of the minoritary neoplastic epithelial cells. Thus it is not clear which of the genes that have been found differentially expressed between normal and whole tumor tissues are due to the tumor epithelia and which simply reflect the differences in cellular composition. To address this problem, laser microdissection studies have been performed, but these have to deal with much smaller tissue sample quantities and therefore have significantly higher experimental noise. In this paper we combine our own large sample whole-tissue study with a previously published smaller sample microdissection study by Grützmann et al. to identify the genes that are specifically overexpressed in PDAC tumor epithelia. The overlap of this list of genes with other microarray studies of pancreatic cancer as well as with the published literature is impressive. Moreover, we find a number of genes whose over-expression appears to be inversely correlated with patient survival: keratin 7, laminin gamma 2, stratifin, platelet phosphofructokinase, annexin A2, MAP4K4 and OACT2 (MBOAT2), which are all specifically upregulated in the neoplastic epithelia, rather than the tumor stroma. We improve on other microarray studies of PDAC by putting together the higher statistical power due to a larger number of samples with information about cell-type specific expression and patient survival.
Bewick, Adam J; Niederhuth, Chad E; Ji, Lexiang; Rohr, Nicholas A; Griffin, Patrick T; Leebens-Mack, Jim; Schmitz, Robert J
The evolution of gene body methylation (gbM), its origins, and its functional consequences are poorly understood. By pairing the largest collection of transcriptomes (>1000) and methylomes (77) across Viridiplantae, we provide novel insights into the evolution of gbM and its relationship to CHROMOMETHYLASE (CMT) proteins. CMTs are evolutionary conserved DNA methyltransferases in Viridiplantae. Duplication events gave rise to what are now referred to as CMT1, 2 and 3. Independent losses of CMT1, 2, and 3 in eudicots, CMT2 and ZMET in monocots and monocots/commelinids, variation in copy number, and non-neutral evolution suggests overlapping or fluid functional evolution of this gene family. DNA methylation within genes is widespread and is found in all major taxonomic groups of Viridiplantae investigated. Genes enriched with methylated CGs (mCG) were also identified in species sister to angiosperms. The proportion of genes and DNA methylation patterns associated with gbM are restricted to angiosperms with a functional CMT3 or ortholog. However, mCG-enriched genes in the gymnosperm Pinus taeda shared some similarities with gbM genes in Amborella trichopoda. Additionally, gymnosperms and ferns share a CMT homolog closely related to CMT2 and 3. Hence, the dependency of gbM on a CMT most likely extends to all angiosperms and possibly gymnosperms and ferns. The resulting gene family phylogeny of CMT transcripts from the most diverse sampling of plants to date redefines our understanding of CMT evolution and its evolutionary consequences on DNA methylation. Future, functional tests of homologous and paralogous CMTs will uncover novel roles and consequences to the epigenome.
Seidl, M.F.; Ackerveken, van den G.; Govers, F.; Snel, B.
The taxonomic class of oomycetes contains numerous pathogens of plants and animals but is related to nonpathogenic diatoms and brown algae. Oomycetes have flexible genomes comprising large gene families that play roles in pathogenicity. The evolutionary processes that shaped the gene content have
Schoonbeek, H.; Nistelrooy, van J.G.M.; Waard, de M.A.
The role of multiple ATP-binding cassette (ABC) and major facilitator superfamily (MFS) transporter genes from the plant pathogenic fungus Botrytis cinerea in protection against natural fungitoxic compounds was studied by expression analysis and phenotyping of gene-replacement mutants. The
Wang, Yumei; Yin, Xiaoling; Yang, Fang
Sepsis is an inflammatory-related disease, and severe sepsis would induce multiorgan dysfunction, which is the most common cause of death of patients in noncoronary intensive care units. Progression of novel therapeutic strategies has proven to be of little impact on the mortality of severe sepsis, and unfortunately, its mechanisms still remain poorly understood. In this study, we analyzed gene expression profiles of severe sepsis with failure of lung, kidney, and liver for the identification of potential biomarkers. We first downloaded the gene expression profiles from the Gene Expression Omnibus and performed preprocessing of raw microarray data sets and identification of differential expression genes (DEGs) through the R programming software; then, significantly enriched functions of DEGs in lung, kidney, and liver failure sepsis samples were obtained from the Database for Annotation, Visualization, and Integrated Discovery; finally, protein-protein interaction network was constructed for DEGs based on the STRING database, and network modules were also obtained through the MCODE cluster method. As a result, lung failure sepsis has the highest number of DEGs of 859, whereas the number of DEGs in kidney and liver failure sepsis samples is 178 and 175, respectively. In addition, 17 overlaps were obtained among the three lists of DEGs. Biological processes related to immune and inflammatory response were found to be significantly enriched in DEGs. Network and module analysis identified four gene clusters in which all or most of genes were upregulated. The expression changes of Icam1 and Socs3 were further validated through quantitative PCR analysis. This study should shed light on the development of sepsis and provide potential therapeutic targets for sepsis-induced multiorgan failure.
Rice, K L; Lin, X; Wolniak, K; Ebert, B L; Berkofsky-Fessler, W; Buzzai, M; Sun, Y; Xi, C; Elkin, P; Levine, R; Golub, T; Gilliland, D G; Crispino, J D; Licht, J D; Zhang, W
Polycythemia vera (PV), essential thrombocythemia and primary myelofibrosis, are myeloproliferative neoplasms (MPNs) with distinct clinical features and are associated with the JAK2V617F mutation. To identify genomic anomalies involved in the pathogenesis of these disorders, we profiled 87 MPN patients using Affymetrix 250K single-nucleotide polymorphism (SNP) arrays. Aberrations affecting chr9 were the most frequently observed and included 9pLOH (n=16), trisomy 9 (n=6) and amplifications of 9p13.3–23.3 (n=1), 9q33.1–34.13 (n=1) and 9q34.13 (n=6). Patients with trisomy 9 were associated with elevated JAK2V617F mutant allele burden, suggesting that gain of chr9 represents an alternative mechanism for increasing JAK2V617F dosage. Gene expression profiling of patients with and without chr9 abnormalities (+9, 9pLOH), identified genes potentially involved in disease pathogenesis including JAK2, STAT5B and MAPK14. We also observed recurrent gains of 1p36.31–36.33 (n=6), 17q21.2–q21.31 (n=5) and 17q25.1–25.3 (n=5) and deletions affecting 18p11.31–11.32 (n=8). Combined SNP and gene expression analysis identified aberrations affecting components of a non-canonical PRC2 complex (EZH1, SUZ12 and JARID2) and genes comprising a ‘HSC signature' (MLLT3, SMARCA2 and PBX1). We show that NFIB, which is amplified in 7/87 MPN patients and upregulated in PV CD34+ cells, protects cells from apoptosis induced by cytokine withdrawal
Conclusion: Application of different statistical analyses to detect potential resistance genes reliably has shown to conduct interesting results that improve knowledge on molecular mechanisms of plant resistance to pathogens.
Freytag, Virginie; Probst, Sabine; Hadziselimovic, Nils; Boglari, Csaba; Hauser, Yannick; Peter, Fabian; Gabor Fenyves, Bank; Milnik, Annette; Demougin, Philippe; Vukojevic, Vanja; de Quervain, Dominique J-F; Papassotiropoulos, Andreas; Stetak, Attila
The identification of genes related to encoding, storage, and retrieval of memories is a major interest in neuroscience. In the current study, we analyzed the temporal gene expression changes in a neuronal mRNA pool during an olfactory long-term associative memory (LTAM) in Caenorhabditis elegans hermaphrodites. Here, we identified a core set of 712 (538 upregulated and 174 downregulated) genes that follows three distinct temporal peaks demonstrating multiple gene regulation waves in LTAM. Compared with the previously published positive LTAM gene set (Lakhina et al., 2015), 50% of the identified upregulated genes here overlap with the previous dataset, possibly representing stimulus-independent memory-related genes. On the other hand, the remaining genes were not previously identified in positive associative memory and may specifically regulate aversive LTAM. Our results suggest a multistep gene activation process during the formation and retrieval of long-term memory and define general memory-implicated genes as well as conditioning-type-dependent gene sets. SIGNIFICANCE STATEMENT The identification of genes regulating different steps of memory is of major interest in neuroscience. Identification of common memory genes across different learning paradigms and the temporal activation of the genes are poorly studied. Here, we investigated the temporal aspects of Caenorhabditis elegans gene expression changes using aversive olfactory associative long-term memory (LTAM) and identified three major gene activation waves. Like in previous studies, aversive LTAM is also CREB dependent, and CREB activity is necessary immediately after training. Finally, we define a list of memory paradigm-independent core gene sets as well as conditioning-dependent genes. Copyright © 2017 the authors 0270-6474/17/376661-12$15.00/0.
In this study we identified three families with Lynch syndrome from a rural cancer center in western India (KCHRC, Goraj, Gujarat, where 70-75 CRC patients are seen annually. DNA isolated from the blood of consented family members of all three families (8-10 members/family was subjected to NGS sequencing methods on an Illumina HiSeq 4000 platform. We identified unique mutations in the MLH1 gene in all three HNPCC family members. Two of the three unrelated families shared a common mutation (154delA and 156delA. Total 8 members of a family were identified as carriers for 156delA mutation of which 5 members were unaffected while 3 were affected (age of onset: 1 member <30yrs & 2 were>40yr. The family with 154delA mutation showed 2 affected members (>40yr carrying the mutations.LYS618DEL mutation found in 8 members of the third family showed that both affected and unaffected carried the mutation. Thus the common mutations identified in the MLH1 gene in two unrelated families had a high risk for lynch syndrome especially above the age of 40.
Zhu, Yan; Chen, Longxian; Zhang, Chengjun; Hao, Pei; Jing, Xinyun; Li, Xuan
Selaginella moellendorffii, a lycophyte, is a model plant to study the early evolution and development of vascular plants. As the first and only sequenced lycophyte to date, the genome of S. moellendorffii revealed many conserved genes and pathways, as well as specialized genes different from flowering plants. Despite the progress made, little is known about long noncoding RNAs (lncRNA) and the alternative splicing (AS) of coding genes in S. moellendorffii. Its coding gene models have not been fully validated with transcriptome data. Furthermore, it remains important to understand whether the regulatory mechanisms similar to flowering plants are used, and how they operate in a non-seed primitive vascular plant. RNA-sequencing (RNA-seq) was performed for three S. moellendorffii tissues, root, stem, and leaf, by constructing strand-specific RNA-seq libraries from RNA purified using RiboMinus isolation protocol. A total of 176 million reads (44 Gbp) were obtained from three tissue types, and were mapped to S. moellendorffii genome. By comparing with 22,285 existing gene models of S. moellendorffii, we identified 7930 high-confidence novel coding genes (a 35.6% increase), and for the first time reported 4422 lncRNAs in a lycophyte. Further, we refined 2461 (11.0%) of existing gene models, and identified 11,030 AS events (for 5957 coding genes) revealed for the first time for lycophytes. Tissue-specific gene expression with functional implication was analyzed, and 1031, 554, and 269 coding genes, and 174, 39, and 17 lncRNAs were identified in root, stem, and leaf tissues, respectively. The expression of critical genes for vascular development stages, i.e. formation of provascular cells, xylem specification and differentiation, and phloem specification and differentiation, was compared in S. moellendorffii tissues, indicating a less complex regulatory mechanism in lycophytes than in flowering plants. The results were further strengthened by the evolutionary trend of
Therrien, S.; Komljenovic, D.; Therrien, P.; Ruest, C.; Prevost, P.; Vaillancourt, R.
This paper presents a methodology developed to identify the boundaries of the systems important to safety (SIS) at the Gentilly-2 Nuclear Power Plant (NPP), Hydro-Quebec. The SIS boundaries identification considers nuclear safety only. Components that are not identified as important to safety are systematically identified as related to safety. A global assessment process such as WANO/INPO AP-913 'Equipment Reliability Process' will be needed to implement adequate changes in the management rules of those components. The paper depicts results in applying the methodology to the Shutdown Systems 1 and 2 (SDS 1, 2), and to the Emergency Core Cooling System (ECCS). This validation process enabled fine tuning the methodology, performing a better estimate of the effort required to evaluate a system, and identifying components important to safety of these systems. (author)
Full Text Available Glioblastoma Multiforme (GBM cells are highly invasive, infiltrating into the surrounding normal brain tissue, making it impossible to completely eradicate GBM tumors by surgery or radiation. Increasing evidence also shows that these migratory cells are highly resistant to cytotoxic reagents, but decreasing their migratory capability can re-sensitize them to chemotherapy. These evidences suggest that the migratory cell population may serve as a better therapeutic target for more effective treatment of GBM. In order to understand the regulatory mechanism underlying the motile phenotype, we carried out a genome-wide RNAi screen for genes inhibiting the migration of GBM cells. The screening identified a total of twenty-five primary hits; seven of them were confirmed by secondary screening. Further study showed that three of the genes, FLNA, KHSRP and HCFC1, also functioned in vivo, and knocking them down caused multifocal tumor in a mouse model. Interestingly, two genes, KHSRP and HCFC1, were also found to be correlated with the clinical outcome of GBM patients. These two genes have not been previously associated with cell migration.
Glebes, Tirzah Y; Sandoval, Nicholas R; Gillis, Jacob H; Gill, Ryan T
Engineering both feedstock and product tolerance is important for transitioning towards next-generation biofuels derived from renewable sources. Tolerance to chemical inhibitors typically results in complex phenotypes, for which multiple genetic changes must often be made to confer tolerance. Here, we performed a genome-wide search for furfural-tolerant alleles using the TRackable Multiplex Recombineering (TRMR) method (Warner et al. (2010), Nature Biotechnology), which uses chromosomally integrated mutations directed towards increased or decreased expression of virtually every gene in Escherichia coli. We employed various growth selection strategies to assess the role of selection design towards growth enrichments. We also compared genes with increased fitness from our TRMR selection to those from a previously reported genome-wide identification study of furfural tolerance genes using a plasmid-based genomic library approach (Glebes et al. (2014) PLOS ONE). In several cases, growth improvements were observed for the chromosomally integrated promoter/RBS mutations but not for the plasmid-based overexpression constructs. Through this assessment, four novel tolerance genes, ahpC, yhjH, rna, and dicA, were identified and confirmed for their effect on improving growth in the presence of furfural. © 2014 Wiley Periodicals, Inc.
Merritt, Melissa A; Parsons, Peter G; Newton, Tanya R; Martyn, Adam C; Webb, Penelope M; Green, Adèle C; Papadimos, David J; Boyle, Glen M
The malignant potential of serous ovarian tumors, the most common ovarian tumor subtype, varies from benign to low malignant potential (LMP) tumors to frankly invasive cancers. Given the uncertainty about the relationship between these different forms, we compared their patterns of gene expression. Expression profiling was carried out on samples of 7 benign, 7 LMP and 28 invasive (moderate and poorly differentiated) serous tumors and four whole normal ovaries using oligonucleotide microarrays representing over 21,000 genes. We identified 311 transcripts that distinguished invasive from benign tumors, and 20 transcripts that were significantly differentially expressed between invasive and LMP tumors at p < 0.01 (with multiple testing correction). Five genes that were differentially expressed between invasive and either benign or normal tissues were validated by real time PCR in an independent panel of 46 serous tumors (4 benign, 7 LMP, 35 invasive). Overexpression of SLPI and WNT7A and down-regulation of C6orf31, PDGFRA and GLTSCR2 were measured in invasive and LMP compared with benign and normal tissues. Over-expression of WNT7A in an ovarian cancer cell line led to increased migration and invasive capacity. These results highlight several genes that may play an important role across the spectrum of serous ovarian tumorigenesis
Full Text Available “Bois noir” (BN is a grapevine yellows disease, associated with phytoplasma strains related to ‘Candidatus Phytoplasma solani’, that causes severe losses to viticulture in the Euro-Mediterranean basin. Due to the complex ecological cycle of its etiological agent, BN epidemiology is only partially known, and no effective control strategies have been developed. Numerous studies have focused on molecular characterization of BN phytoplasma strains, to identify molecular markers useful to accurately describe their genetic diversity, geographic distribution and host range. In the present study, a multiple gene analysess were carried out on 16S rRNA, tuf, vmp1, and stamp genes to study the genetic variability among 18 BN phytoplasma strains detected in diverse regions of the Republic of Macedonia. Restriction fragment length polymorphism (RFLP assays showed the presence of one 16S rRNA (16SrXII-A, two tuf (tuf-type a, tuf-type b, five vmp1 (V2-TA, V3, V4, V14, V18, and three stamp (S1, S2, S3 gene patterns among the examined strains. Based on the collective RFLP patterns, seven genotypes (Mac1 to Mac7 were described as evidence for genetic heterogeneity, and highlighting their prevalence and distribution in the investigated regions. Phylogenetic analyses on vmp1 and stamp genes underlined the affiliation of Macedonian BN phytoplasma strains to clusters associated with distinct ecologies.
Full Text Available Schizophrenia (SCZ is a severe, debilitating mental illness which has a significant genetic component. The identification of genetic factors related to SCZ has been challenging and these factors remain largely unknown. To evaluate the contribution of de novo variants (DNVs to SCZ, we sequenced the exomes of 53 individuals with sporadic SCZ and of their non-affected parents. We identified 49 DNVs, 18 of which were predicted to alter gene function, including 13 damaging missense mutations, 2 conserved splice site mutations, 2 nonsense mutations, and 1 frameshift deletion. The average number of exonic DNV per proband was 0.88, which corresponds to an exonic point mutation rate of 1.7×10(-8 per nucleotide per generation. The non-synonymous-to-synonymous mutation ratio of 2.06 did not differ from neutral expectations. Overall, this study provides a list of 18 putative candidate genes for sporadic SCZ, and when combined with the results of similar reports, identifies a second proband carrying a non-synonymous DNV in the RGS12 gene.
Luisier, Raphaëlle; Unterberger, Elif B.; Goodman, Jay I.; Schwarz, Michael; Moggs, Jonathan; Terranova, Rémi; van Nimwegen, Erik
Gene regulatory interactions underlying the early stages of non-genotoxic carcinogenesis are poorly understood. Here, we have identified key candidate regulators of phenobarbital (PB)-mediated mouse liver tumorigenesis, a well-characterized model of non-genotoxic carcinogenesis, by applying a new computational modeling approach to a comprehensive collection of in vivo gene expression studies. We have combined our previously developed motif activity response analysis (MARA), which models gene expression patterns in terms of computationally predicted transcription factor binding sites with singular value decomposition (SVD) of the inferred motif activities, to disentangle the roles that different transcriptional regulators play in specific biological pathways of tumor promotion. Furthermore, transgenic mouse models enabled us to identify which of these regulatory activities was downstream of constitutive androstane receptor and β-catenin signaling, both crucial components of PB-mediated liver tumorigenesis. We propose novel roles for E2F and ZFP161 in PB-mediated hepatocyte proliferation and suggest that PB-mediated suppression of ESR1 activity contributes to the development of a tumor-prone environment. Our study shows that combining MARA with SVD allows for automated identification of independent transcription regulatory programs within a complex in vivo tissue environment and provides novel mechanistic insights into PB-mediated hepatocarcinogenesis. PMID:24464994
Full Text Available The plant hormone auxin plays pivotal roles in many aspects of plant growth and development. The auxin/indole-3-acetic acid (Aux/IAA gene family encodes short-lived nuclear proteins acting on auxin perception and signaling, but the evolutionary history of this gene family remains to be elucidated. In this study, the Aux/IAA gene family in 17 plant species covering all major lineages of plants is identified and analyzed by using multiple bioinformatics methods. A total of 434 Aux/IAA genes was found among these plant species, and the gene copy number ranges from three (Physcomitrella patens to 63 (Glycine max. The phylogenetic analysis shows that the canonical Aux/IAA proteins can be generally divided into five major clades, and the origin of Aux/IAA proteins could be traced back to the common ancestor of land plants and green algae. Many truncated Aux/IAA proteins were found, and some of these truncated Aux/IAA proteins may be generated from the C-terminal truncation of auxin response factor (ARF proteins. Our results indicate that tandem and segmental duplications play dominant roles for the expansion of the Aux/IAA gene family mainly under purifying selection. The putative nuclear localization signals (NLSs in Aux/IAA proteins are conservative, and two kinds of new primordial bipartite NLSs in P. patens and Selaginella moellendorffii were discovered. Our findings not only give insights into the origin and expansion of the Aux/IAA gene family, but also provide a basis for understanding their functions during the course of evolution.
Wu, Wentao; Liu, Yaxue; Wang, Yuqian; Li, Huimin; Liu, Jiaxi; Tan, Jiaxin; He, Jiadai; Bai, Jingwen; Ma, Haoli
The plant hormone auxin plays pivotal roles in many aspects of plant growth and development. The auxin/indole-3-acetic acid (Aux/IAA) gene family encodes short-lived nuclear proteins acting on auxin perception and signaling, but the evolutionary history of this gene family remains to be elucidated. In this study, the Aux/IAA gene family in 17 plant species covering all major lineages of plants is identified and analyzed by using multiple bioinformatics methods. A total of 434 Aux/IAA genes was found among these plant species, and the gene copy number ranges from three ( Physcomitrella patens ) to 63 ( Glycine max ). The phylogenetic analysis shows that the canonical Aux/IAA proteins can be generally divided into five major clades, and the origin of Aux/IAA proteins could be traced back to the common ancestor of land plants and green algae. Many truncated Aux/IAA proteins were found, and some of these truncated Aux/IAA proteins may be generated from the C-terminal truncation of auxin response factor (ARF) proteins. Our results indicate that tandem and segmental duplications play dominant roles for the expansion of the Aux/IAA gene family mainly under purifying selection. The putative nuclear localization signals (NLSs) in Aux/IAA proteins are conservative, and two kinds of new primordial bipartite NLSs in P. patens and Selaginella moellendorffii were discovered. Our findings not only give insights into the origin and expansion of the Aux/IAA gene family, but also provide a basis for understanding their functions during the course of evolution.
Full Text Available Many biological processes are controlled by intricate networks of transcriptional regulators. With the development of microarray technology, transcriptional changes can be examined at the whole-genome level. However, such analysis often lacks information on the hierarchical relationship between components of a given system. Systemic acquired resistance (SAR is an inducible plant defense response involving a cascade of transcriptional events induced by salicylic acid through the transcription cofactor NPR1. To identify additional regulatory nodes in the SAR network, we performed microarray analysis on Arabidopsis plants expressing the NPR1-GR (glucocorticoid receptor fusion protein. Since nuclear translocation of NPR1-GR requires dexamethasone, we were able to control NPR1-dependent transcription and identify direct transcriptional targets of NPR1. We show that NPR1 directly upregulates the expression of eight WRKY transcription factor genes. This large family of 74 transcription factors has been implicated in various defense responses, but no specific WRKY factor has been placed in the SAR network. Identification of NPR1-regulated WRKY factors allowed us to perform in-depth genetic analysis on a small number of WRKY factors and test well-defined phenotypes of single and double mutants associated with NPR1. Among these WRKY factors we found both positive and negative regulators of SAR. This genomics-directed approach unambiguously positioned five WRKY factors in the complex transcriptional regulatory network of SAR. Our work not only discovered new transcription regulatory components in the signaling network of SAR but also demonstrated that functional studies of large gene families have to take into consideration sequence similarity as well as the expression patterns of the candidates.
Armijos Jaramillo, Vinicio Danilo; Vargas, Walter Alberto; Sukno, Serenella Ana; Thon, Michael R
The genus Colletotrichum contains a large number of phytopathogenic fungi that produce enormous economic losses around the world. The effect of horizontal gene transfer (HGT) has not been studied yet in these organisms. Inter-Kingdom HGT into fungal genomes has been reported in the past but knowledge about the HGT between plants and fungi is particularly limited. We describe a gene in the genome of several species of the genus Colletotrichum with a strong resemblance to subtilisins typically found in plant genomes. Subtilisins are an important group of serine proteases, widely distributed in all of the kingdoms of life. Our hypothesis is that the gene was acquired by Colletotrichum spp. through (HGT) from plants to a Colletotrichum ancestor. We provide evidence to support this hypothesis in the form of phylogenetic analyses as well as a characterization of the similarity of the subtilisin at the primary, secondary and tertiary structural levels. The remarkable level of structural conservation of Colletotrichum plant-like subtilisin (CPLS) with plant subtilisins and the differences with the rest of Colletotrichum subtilisins suggests the possibility of molecular mimicry. Our phylogenetic analysis indicates that the HGT event would have occurred approximately 150-155 million years ago, after the divergence of the Colletotrichum lineage from other fungi. Gene expression analysis shows that the gene is modulated during the infection of maize by C. graminicola suggesting that it has a role in plant disease. Furthermore, the upregulation of the CPLS coincides with the downregulation of several plant genes encoding subtilisins. Based on the known roles of subtilisins in plant pathogenic fungi and the gene expression pattern that we observed, we postulate that the CPLSs have an important role in plant infection.
Drost, Mark; Lützen, Anne; van Hees, Sandrine
In many individuals suspected of the common cancer predisposition Lynch syndrome, variants of unclear significance (VUS), rather than an obviously pathogenic mutations, are identified in one of the DNA mismatch repair (MMR) genes. The uncertainty of whether such VUS inactivate MMR, and therefore...... function. When a residue identified as mutated in an individual suspected of Lynch syndrome is listed as critical in such a reverse diagnosis catalog, there is a high probability that the corresponding human VUS is pathogenic. To investigate the applicability of this approach, we have generated....... Nearly half of these critical residues match with VUS previously identified in individuals suspected of Lynch syndrome. This aids in the assignment of pathogenicity to these human VUS and validates the approach described here as a diagnostic tool. In a wider perspective, this work provides a model...