identify genes related: Topics by WorldWideScience.org

Sample records for identify genes related

ICan: an integrated co-alteration network to identify ovarian cancer-related genes.

Science.gov (United States)

Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan

2015-01-01

Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data.
Identifying and Analyzing Novel Epilepsy-Related Genes Using Random Walk with Restart Algorithm

Directory of Open Access Journals (Sweden)

Wei Guo

2017-01-01

Epilepsy is a pathological condition caused by abnormal neuronal discharge in the brain, which temporarily disrupts cerebral function. It is a chronic disease that occurs at all ages and can seriously affect patients' personal lives, so effective medicines or instruments to treat it are needed. Identifying epilepsy-related genes is essential for understanding and treating the disease, because the proteins encoded by these genes are candidate drug targets. In this study, a pioneering computational workflow was proposed to predict novel epilepsy-related genes using the random walk with restart (RWR) algorithm. As reported in the literature, the RWR algorithm often produces a number of false-positive genes; in this study, a permutation test and functional association tests were therefore implemented to filter the genes identified by RWR, which greatly reduced the number of suspected genes and left only thirty-three novel epilepsy genes. Finally, these novel genes were analyzed in light of recently published literature. Our findings suggest that all novel genes are closely related to epilepsy. The proposed workflow may also be applied to identify genes related to other diseases and to deepen our understanding of the mechanisms of these diseases.
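The abstract above names random walk with restart but does not give its details. A minimal sketch of the general technique follows, assuming an unweighted, undirected network stored as an adjacency dictionary; the function name, the restart probability of 0.8, and the gene labels in the usage note are illustrative assumptions, not values taken from the study.

```python
def random_walk_with_restart(adjacency, seeds, restart=0.8, tol=1e-12, max_iter=10_000):
    """Return the steady-state visiting probability of every node.

    adjacency: dict mapping node -> list of neighbours (each undirected edge
               is listed in both directions).
    seeds:     known disease genes; the walker restarts uniformly among them.
    restart:   probability of jumping back to a seed at each step (assumed value).
    """
    nodes = list(adjacency)
    seeds = [s for s in seeds if s in adjacency]
    if not seeds:
        raise ValueError("at least one seed must be present in the network")

    # Restart distribution: uniform over the seed genes.
    p0 = {n: 0.0 for n in nodes}
    for s in seeds:
        p0[s] += 1.0 / len(seeds)

    p = dict(p0)
    for _ in range(max_iter):
        # Start from the restart term, then add mass spread along edges.
        nxt = {n: restart * p0[n] for n in nodes}
        for u in nodes:
            nbrs = adjacency[u]
            if not nbrs:
                # Isolated node: keep its walking mass in place.
                nxt[u] += (1.0 - restart) * p[u]
                continue
            share = (1.0 - restart) * p[u] / len(nbrs)
            for v in nbrs:
                nxt[v] += share
        change = sum(abs(nxt[n] - p[n]) for n in nodes)
        p = nxt
        if change < tol:
            break
    return p
```

With a hypothetical chain `G1 - G2 - G3 - G4` and `G1` as the only seed, the returned probabilities decrease with distance from the seed; genes ranked highly this way would then be passed to the filtering tests the abstract describes.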
LGscore: A method to identify disease-related genes using biological literature and Google data.

Science.gov (United States)

Kim, Jeongwoo; Kim, Hyunjin; Yoon, Youngmi; Park, Sanghyun

2015-04-01

Since the genome project in the 1990s, a number of studies associated with genes have been conducted and researchers have confirmed that genes are involved in disease. For this reason, the identification of the relationships between diseases and genes is important in biology. We propose a method called LGscore, which identifies disease-related genes using Google data and literature data. To implement this method, first, we construct a disease-related gene network using text-mining results. We then extract gene-gene interactions based on co-occurrences in abstract data obtained from PubMed, and calculate the weights of edges in the gene network by means of Z-scoring. The weights contain two values: the frequency and the Google search results. The frequency value is extracted from literature data, and the Google search result is obtained using Google. We assign a score to each gene through a network analysis. We assume that genes with a large number of links and numerous Google search results and frequency values are more likely to be involved in disease. For validation, we investigated the top 20 inferred genes for five different diseases using answer sets. The answer sets comprised six databases that contain information on disease-gene relationships. We identified a significant number of disease-related genes as well as candidate genes for Alzheimer's disease, diabetes, colon cancer, lung cancer, and prostate cancer. Our method was up to 40% more accurate than existing methods.
Identifying Novel Candidate Genes Related to Apoptosis from a Protein-Protein Interaction Network

Directory of Open Access Journals (Sweden)

Baoman Wang

2015-01-01

Apoptosis is the process of programmed cell death (PCD) that occurs in multicellular organisms. This process of normal cell death is required to maintain the balance of homeostasis. In addition, some diseases, such as obesity, cancer, and neurodegenerative diseases, can be cured through apoptosis, which produces few side effects. An effective comprehension of the mechanisms underlying apoptosis will be helpful to prevent and treat some diseases. The identification of genes related to apoptosis is essential to uncover its underlying mechanisms. In this study, a computational method was proposed to identify novel candidate genes related to apoptosis. First, protein-protein interaction information was used to construct a weighted graph. Second, a shortest path algorithm was applied to the graph to search for new candidate genes. Finally, the obtained genes were filtered by a permutation test. As a result, 26 genes were obtained, and we discuss their likelihood of being novel apoptosis-related genes by collecting evidence from published literature.
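The shortest-path step described above can be sketched as follows. This is an illustration under stated assumptions rather than the authors' implementation: the graph is a dictionary of non-negative edge costs (how interaction confidence is converted to a cost is not given in the abstract), Dijkstra's algorithm is used as the shortest path algorithm, and the function names are hypothetical.

```python
import heapq
from itertools import combinations


def shortest_path(graph, source, target):
    """Dijkstra's algorithm. graph: node -> {neighbour: non-negative cost}.

    Returns the list of nodes on one cheapest path, or None if unreachable.
    """
    dist = {source: 0.0}
    prev = {}
    heap = [(0.0, source)]
    done = set()
    while heap:
        d, u = heapq.heappop(heap)
        if u in done:
            continue
        done.add(u)
        if u == target:
            break
        for v, cost in graph.get(u, {}).items():
            nd = d + cost
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                prev[v] = u
                heapq.heappush(heap, (nd, v))
    if target not in dist:
        return None
    path = [target]
    while path[-1] != source:
        path.append(prev[path[-1]])
    return path[::-1]


def candidates_on_shortest_paths(graph, known_genes):
    """Count how often each non-seed gene lies inside a shortest path
    connecting two known genes; frequently hit genes are candidates."""
    known = set(known_genes)
    counts = {}
    for a, b in combinations(sorted(known), 2):
        path = shortest_path(graph, a, b)
        if path is None:
            continue
        for node in path[1:-1]:  # interior nodes only
            if node not in known:
                counts[node] = counts.get(node, 0) + 1
    return counts
```

In a toy graph where two known genes are linked through a cheap intermediate and an expensive one, only the cheap intermediate is reported. A filter such as the permutation test mentioned in the abstract would then be applied to the resulting counts.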
Identifying novel fruit-related genes in Arabidopsis thaliana based on the random walk with restart algorithm.

Science.gov (United States)

Zhang, Yunhua; Dai, Li; Liu, Ying; Zhang, YuHang; Wang, ShaoPeng

2017-01-01

Fruit is essential for plant reproduction and is responsible for protection and dispersal of seeds. The development and maturation of fruit is tightly regulated by numerous genetic factors that respond to environmental and internal stimulation. In this study, we attempted to identify novel fruit-related genes in a model organism, Arabidopsis thaliana, using a computational method. Based on validated fruit-related genes, the random walk with restart (RWR) algorithm was applied on a protein-protein interaction (PPI) network using these genes as seeds. The identified genes with high probabilities were filtered by the permutation test and linkage tests. In the permutation test, the genes that were selected due to the structure of the PPI network were discarded. In the linkage tests, the importance of each candidate gene was measured from two aspects: (1) its functional associations with validated genes and (2) its similarity with validated genes on gene ontology (GO) terms and KEGG pathways. Finally, 255 inferred genes were obtained. Subsequent extensive analysis of important genes revealed that they mainly contribute to ubiquitination (UBQ9, UBQ8, UBQ11, UBQ10), serine hydroxymethyl transfer (SHM7, SHM5, SHM6) or glycol-metabolism (HXKL2_ARATH, CSY5, GAPCP1), suggesting essential roles during the development and maturation of fruit in Arabidopsis thaliana.
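The idea behind the permutation test, discarding genes that score highly no matter which seeds are used, can be illustrated with a short sketch. It assumes a generic scoring function that maps a seed set to a score per gene; the seed-neighbour count used here is a simple stand-in for RWR probabilities, and all names, the number of permutations, and the empirical p-value formula are assumptions rather than details from the study.

```python
import random


def seed_neighbour_score(adjacency):
    """Build a toy scorer: a gene's score is its number of seed neighbours.
    (A stand-in for RWR probabilities; any scorer with this shape works.)"""
    def score(seed_set):
        seed_set = set(seed_set)
        return {g: sum(n in seed_set for n in nbrs) for g, nbrs in adjacency.items()}
    return score


def permutation_p_values(score_fn, nodes, seeds, n_perm=1000, rng=None):
    """Empirical p-value per gene: how often a random seed set of the same
    size gives the gene a score at least as high as the real seeds do.
    A p-value near 1 suggests the score reflects network structure only."""
    rng = rng or random.Random(0)
    nodes = list(nodes)
    seeds = list(seeds)
    observed = score_fn(seeds)
    exceed = {g: 0 for g in observed}
    for _ in range(n_perm):
        fake_seeds = rng.sample(nodes, len(seeds))
        scores = score_fn(fake_seeds)
        for g in exceed:
            if scores.get(g, 0.0) >= observed[g]:
                exceed[g] += 1
    # Add-one correction keeps every p-value strictly positive.
    return {g: (exceed[g] + 1) / (n_perm + 1) for g in exceed}
```

Genes whose p-value exceeds a chosen cut-off would be dropped. For a hub connected to every other node, almost any random seed set reproduces its score, so the hub receives a large p-value and is filtered out.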
Genome-Wide Temporal Expression Profiling in Caenorhabditis elegans Identifies a Core Gene Set Related to Long-Term Memory.

Science.gov (United States)

Freytag, Virginie; Probst, Sabine; Hadziselimovic, Nils; Boglari, Csaba; Hauser, Yannick; Peter, Fabian; Gabor Fenyves, Bank; Milnik, Annette; Demougin, Philippe; Vukojevic, Vanja; de Quervain, Dominique J-F; Papassotiropoulos, Andreas; Stetak, Attila

2017-07-12

The identification of genes related to encoding, storage, and retrieval of memories is a major interest in neuroscience. In the current study, we analyzed the temporal gene expression changes in a neuronal mRNA pool during an olfactory long-term associative memory (LTAM) in Caenorhabditis elegans hermaphrodites. Here, we identified a core set of 712 (538 upregulated and 174 downregulated) genes that follows three distinct temporal peaks demonstrating multiple gene regulation waves in LTAM. Compared with the previously published positive LTAM gene set (Lakhina et al., 2015), 50% of the identified upregulated genes here overlap with the previous dataset, possibly representing stimulus-independent memory-related genes. On the other hand, the remaining genes were not previously identified in positive associative memory and may specifically regulate aversive LTAM. Our results suggest a multistep gene activation process during the formation and retrieval of long-term memory and define general memory-implicated genes as well as conditioning-type-dependent gene sets. SIGNIFICANCE STATEMENT: The identification of genes regulating different steps of memory is of major interest in neuroscience. Identification of common memory genes across different learning paradigms and the temporal activation of the genes are poorly studied. Here, we investigated the temporal aspects of Caenorhabditis elegans gene expression changes using aversive olfactory associative long-term memory (LTAM) and identified three major gene activation waves. As in previous studies, aversive LTAM is also CREB dependent, and CREB activity is necessary immediately after training. Finally, we define a list of memory paradigm-independent core gene sets as well as conditioning-dependent genes.
Weighted gene co-expression network analysis of expression data of monozygotic twins identifies specific modules and hub genes related to BMI.

Science.gov (United States)

Wang, Weijing; Jiang, Wenjie; Hou, Lin; Duan, Haiping; Wu, Yili; Xu, Chunsheng; Tan, Qihua; Li, Shuxia; Zhang, Dongfeng

2017-11-13

The therapeutic management of obesity is challenging, hence further elucidating the underlying mechanisms of obesity development and identifying new diagnostic biomarkers and therapeutic targets are urgent and necessary. Here, we performed differential gene expression analysis and weighted gene co-expression network analysis (WGCNA) to identify significant genes and specific modules related to BMI based on gene expression profile data of 7 discordant monozygotic twins. In the differential gene expression analysis, it appeared that 32 differentially expressed genes (DEGs) were with a trend of up-regulation in twins with higher BMI when compared to their siblings. Categories of positive regulation of nitric-oxide synthase biosynthetic process, positive regulation of NF-kappa B import into nucleus, and peroxidase activity were significantly enriched within GO database and NF-kappa B signaling pathway within KEGG database. DEGs of NAMPT, TLR9, PTGS2, HBD, and PCSK1N might be associated with obesity. In the WGCNA, among the total 20 distinct co-expression modules identified, coral1 module (68 genes) had the strongest positive correlation with BMI (r = 0.56, P = 0.04) and disease status (r = 0.56, P = 0.04). Categories of positive regulation of phospholipase activity, high-density lipoprotein particle clearance, chylomicron remnant clearance, reverse cholesterol transport, intermediate-density lipoprotein particle, chylomicron, low-density lipoprotein particle, very-low-density lipoprotein particle, voltage-gated potassium channel complex, cholesterol transporter activity, and neuropeptide hormone activity were significantly enriched within GO database for this module. And alcoholism and cell adhesion molecules pathways were significantly enriched within KEGG database. Several hub genes, such as GAL, ASB9, NPPB, TBX2, IL17C, APOE, ABCG4, and APOC2 were also identified. The module eigengene of saddlebrown module (212 genes) was also significantly
Consistent Differential Expression Pattern (CDEP) on microarray to identify genes related to metastatic behavior.

Science.gov (United States)

Tsoi, Lam C; Qin, Tingting; Slate, Elizabeth H; Zheng, W Jim

2011-11-11

To utilize the large volume of gene expression information generated from different microarray experiments, several meta-analysis techniques have been developed. Despite these efforts, there remain significant challenges to effectively increasing the statistical power and decreasing the Type I error rate while pooling the heterogeneous datasets from public resources. The objective of this study is to develop a novel meta-analysis approach, Consistent Differential Expression Pattern (CDEP), to identify genes with common differential expression patterns across different datasets. We combined False Discovery Rate (FDR) estimation and the non-parametric RankProd approach to estimate the Type I error rate in each microarray dataset of the meta-analysis. These Type I error rates from all datasets were then used to identify genes with common differential expression patterns. Our simulation study showed that CDEP achieved higher statistical power and maintained low Type I error rate when compared with two recently proposed meta-analysis approaches. We applied CDEP to analyze microarray data from different laboratories that compared transcription profiles between metastatic and primary cancer of different types. Many genes identified as differentially expressed consistently across different cancer types are in pathways related to metastatic behavior, such as ECM-receptor interaction, focal adhesion, and blood vessel development. We also identified novel genes such as AMIGO2, Gem, and CXCL11 that have not been shown to associate with, but may play roles in, metastasis. CDEP is a flexible approach that borrows information from each dataset in a meta-analysis in order to identify genes being differentially expressed consistently. We have shown that CDEP can gain higher statistical power than other existing approaches under a variety of settings considered in the simulation study, suggesting its robustness and insensitivity to data variation commonly associated with microarray
Identifying Tmem59 related gene regulatory network of mouse neural stem cell from a compendium of expression profiles

Directory of Open Access Journals (Sweden)

Guo Xiuyun

2011-09-01

Background: Neural stem cells offer potential treatment for neurodegenerative disorders, such as Alzheimer's disease (AD). While much progress has been made in understanding neural stem cell function, a precise description of the molecular mechanisms regulating neural stem cells is not yet established. This lack of knowledge is a major barrier holding back the discovery of therapeutic uses of neural stem cells. In this paper, the regulatory mechanism of mouse neural stem cell (NSC) differentiation by tmem59 is explored at the genome level. Results: We identified regulators of tmem59 during the differentiation of mouse NSCs from a compendium of expression profiles. Based on the microarray experiment, we developed the parallelized SWNI algorithm to reconstruct gene regulatory networks of mouse neural stem cells. From the inferred tmem59-related gene network including 36 genes, pou6f1 was identified to regulate tmem59 significantly and might play an important role in the differentiation of NSCs in mouse brain. There are four pathways shown in the gene network, indicating that tmem59 is located downstream in the signalling pathway. The real-time RT-PCR results showed that the over-expression of pou6f1 could significantly up-regulate tmem59 expression in the C17.2 NSC line. 16 out of 36 predicted genes in our constructed network have been reported to be AD-related, including Ace, aqp1, arrdc3, cd14, cd59a, cds1, cldn1, cox8b, defb11, folr1, gdi2, mmp3, mgp, myrip, Ripk4, rnd3, and sncg. The localization of tmem59-related genes and functionally related gene groups based on the Gene Ontology (GO) annotation was also identified. Conclusions: Our findings suggest that the expression of tmem59 is an important factor contributing to AD. The parallelized SWNI algorithm increased the efficiency of network reconstruction significantly. This study enables us to highlight novel genes that may be involved in NSC differentiation and provides a shortcut to
Identifying key genes in rheumatoid arthritis by weighted gene co-expression network analysis.

Science.gov (United States)

Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin

2017-08-01

This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets, GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls), were downloaded from the Gene Expression Omnibus database. Characteristic genes were identified using the metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance was defined as the average gene significance of all genes used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subject to functional annotation and pathway enrichment analysis using the Database for Annotation, Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in a total of 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs, sanguinarine and papaverine, were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA.
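The module significance defined above (the average gene significance over a module's genes) can be illustrated with a small sketch. It assumes gene significance is the absolute Pearson correlation between a gene's expression and a 0/1 disease-status vector, which is a common convention but is not stated in the abstract; the function names and toy data are hypothetical, and this is not the WGCNA R package's code.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences.
    Assumes neither sequence is constant (non-zero variance)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def module_significance(expression, modules, trait):
    """expression: gene -> expression values, one per sample.
    modules:    module name -> list of genes in that module.
    trait:      one value per sample (e.g. 1 = RA, 0 = healthy control).

    Gene significance is taken as |correlation with the trait| (assumption);
    module significance is the mean gene significance within the module."""
    gene_sig = {g: abs(pearson(values, trait)) for g, values in expression.items()}
    return {
        name: sum(gene_sig[g] for g in genes) / len(genes)
        for name, genes in modules.items()
    }
```

Modules with the highest significance would be the ones reported as most closely associated with disease status, as with the green, red and turquoise modules in the abstract.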
Identifying potential maternal genes of Bombyx mori using digital gene expression profiling

Science.gov (United States)

Xu, Pingzhen

2018-01-01

Maternal genes present in mature oocytes play a crucial role in the early development of silkworm. Although maternal genes have been widely studied in many other species, there has been limited research in Bombyx mori. High-throughput next generation sequencing provides a practical method for gene discovery on a genome-wide level. Herein, a transcriptome study was used to identify maternal-related genes from silkworm eggs. Unfertilized eggs from five different stages of early development were used to track changes in gene expression. The expressed genes showed different patterns over time. Seventy-six maternal genes were annotated according to homology analysis with Drosophila melanogaster. More than half of the differentially expressed maternal genes fell into four expression patterns, while the expression patterns showed a downward trend over time. The functional annotation of these maternal genes was mainly related to transcription factor activity, growth factor activity, nucleic acid binding, RNA binding, ATP binding, and ion binding. Additionally, twenty-two gene clusters including maternal genes were identified from 18 scaffolds. Altogether, we plotted a profile for the maternal genes of Bombyx mori using a digital gene expression profiling method. This will provide the basis for maternal-specific signature research and improve the understanding of the early development of silkworm. PMID:29462160
Suppression subtractive hybridization identified differentially expressed genes in lung adenocarcinoma: ERGIC3 as a novel lung cancer-related gene

International Nuclear Information System (INIS)

Wu, Mingsong; Tu, Tao; Huang, Yunchao; Cao, Yi

2013-01-01

To understand the carcinogenesis caused by accumulated genetic and epigenetic alterations and seek novel biomarkers for various cancers, studying differentially expressed genes between cancerous and normal tissues is crucial. In the study, two cDNA libraries of lung cancer were constructed and screened for identification of differentially expressed genes. Two cDNA libraries of differentially expressed genes were constructed using lung adenocarcinoma tissue and adjacent nonmalignant lung tissue by suppression subtractive hybridization. The data of the cDNA libraries were then analyzed and compared using bioinformatics analysis. Levels of mRNA and protein were measured by quantitative real-time polymerase chain reaction (q-RT-PCR) and western blot respectively, as well as expression and localization of proteins were determined by immunostaining. Gene functions were investigated using proliferation and migration assays after gene silencing and gene over-expression. Two libraries of differentially expressed genes were obtained. The forward-subtracted library (FSL) and the reverse-subtracted library (RSL) contained 177 and 59 genes, respectively. Bioinformatic analysis demonstrated that these genes were involved in a wide range of cellular functions. The vast majority of these genes were newly identified to be abnormally expressed in lung cancer. In the first stage of the screening for 16 genes, we compared lung cancer tissues with their adjacent non-malignant tissues at the mRNA level, and found six genes (ERGIC3, DDR1, HSP90B1, SDC1, RPSA, and LPCAT1) from the FSL were significantly up-regulated while two genes (GPX3 and TIMP3) from the RSL were significantly down-regulated (P < 0.05). The ERGIC3 protein was also over-expressed in lung cancer tissues and cultured cells, and expression of ERGIC3 was correlated with the differentiated degree and histological type of lung cancer. The up-regulation of ERGIC3 could promote cellular migration and proliferation in vitro. The
Genome-wide gene expression array identifies novel genes related to disease severity and excessive daytime sleepiness in patients with obstructive sleep apnea.

Directory of Open Access Journals (Sweden)

Yung-Che Chen

We aimed to identify novel molecular associations between chronic intermittent hypoxia with re-oxygenation and adverse consequences in obstructive sleep apnea (OSA). We analyzed gene expression profiles of peripheral blood mononuclear cells from 48 patients with sleep-disordered breathing stratified into four groups: primary snoring (PS), moderate to severe OSA (MSO), very severe OSA (VSO), and very severe OSA patients on long-term continuous positive airway pressure treatment (VSOC). Comparisons of the microarray gene expression data identified eight genes up-regulated with OSA and down-regulated with CPAP treatment, and five genes down-regulated with OSA and up-regulated with CPAP treatment. Protein expression levels of two genes related to endothelial tight junction (AMOT P130 and PLEKHH3), and three genes related to anti- or pro-apoptosis (BIRC3, ADAR1 P150, and LGALS3) were all increased in the VSO group, while AMOT P130 was further increased, and PLEKHH3, BIRC3, and ADAR1 P150 were all decreased in the VSOC group. Subgroup analyses revealed that AMOT P130 protein expression was increased in OSA patients with excessive daytime sleepiness, BIRC3 protein expression was decreased in OSA patients with hypertension, and LGALS3 protein expression was increased in OSA patients with chronic kidney disease. In vitro short-term intermittent hypoxia with re-oxygenation experiment showed immediate over-expression of ADAR1 P150. In conclusion, we identified a novel association between AMOT/PLEKHH3/BIRC3/ADAR1/LGALS3 over-expressions and high severity index in OSA patients. AMOT and GALIG may constitute an important determinant for the development of hypersomnia and kidney injury, respectively, while BIRC3 may play a protective role in the development of hypertension.
MicroRNA expression profiling to identify and validate reference genes for relative quantification in colorectal cancer.

LENUS (Irish Health Repository)

Chang, Kah Hoong

2010-01-01

BACKGROUND: Advances in high-throughput technologies and bioinformatics have transformed gene expression profiling methodologies. The results of microarray experiments are often validated using reverse transcription quantitative PCR (RT-qPCR), which is the most sensitive and reproducible method to quantify gene expression. Appropriate normalisation of RT-qPCR data using stably expressed reference genes is critical to ensure accurate and reliable results. Mi(cro)RNA expression profiles have been shown to be more accurate in disease classification than mRNA expression profiles. However, few reports detailed a robust identification and validation strategy for suitable reference genes for normalisation in miRNA RT-qPCR studies. METHODS: We adopt and report a systematic approach to identify the most stable reference genes for miRNA expression studies by RT-qPCR in colorectal cancer (CRC). High-throughput miRNA profiling was performed on ten pairs of CRC and normal tissues. By using the mean expression value of all expressed miRNAs, we identified the most stable candidate reference genes for subsequent validation. As such the stability of a panel of miRNAs was examined on 35 tumour and 39 normal tissues. The effects of normalisers on the relative quantity of established oncogenic (miR-21 and miR-31) and tumour suppressor (miR-143 and miR-145) target miRNAs were assessed. RESULTS: In the array experiment, miR-26a, miR-345, miR-425 and miR-454 were identified as having expression profiles closest to the global mean. From a panel of six miRNAs (let-7a, miR-16, miR-26a, miR-345, miR-425 and miR-454) and two small nucleolar RNA genes (RNU48 and Z30), miR-16 and miR-345 were identified as the most stably expressed reference genes. The combined use of miR-16 and miR-345 to normalise expression data enabled detection of a significant dysregulation of all four target miRNAs between tumour and normal colorectal tissue. CONCLUSIONS: Our study demonstrates that the top six most
MicroRNA expression profiling to identify and validate reference genes for relative quantification in colorectal cancer

LENUS (Irish Health Repository)

Chang, Kah Hoong

2010-04-29

Abstract Background Advances in high-throughput technologies and bioinformatics have transformed gene expression profiling methodologies. The results of microarray experiments are often validated using reverse transcription quantitative PCR (RT-qPCR), which is the most sensitive and reproducible method to quantify gene expression. Appropriate normalisation of RT-qPCR data using stably expressed reference genes is critical to ensure accurate and reliable results. Mi(cro)RNA expression profiles have been shown to be more accurate in disease classification than mRNA expression profiles. However, few reports detailed a robust identification and validation strategy for suitable reference genes for normalisation in miRNA RT-qPCR studies. Methods We adopt and report a systematic approach to identify the most stable reference genes for miRNA expression studies by RT-qPCR in colorectal cancer (CRC). High-throughput miRNA profiling was performed on ten pairs of CRC and normal tissues. By using the mean expression value of all expressed miRNAs, we identified the most stable candidate reference genes for subsequent validation. As such the stability of a panel of miRNAs was examined on 35 tumour and 39 normal tissues. The effects of normalisers on the relative quantity of established oncogenic (miR-21 and miR-31) and tumour suppressor (miR-143 and miR-145) target miRNAs were assessed. Results In the array experiment, miR-26a, miR-345, miR-425 and miR-454 were identified as having expression profiles closest to the global mean. From a panel of six miRNAs (let-7a, miR-16, miR-26a, miR-345, miR-425 and miR-454) and two small nucleolar RNA genes (RNU48 and Z30), miR-16 and miR-345 were identified as the most stably expressed reference genes. The combined use of miR-16 and miR-345 to normalise expression data enabled detection of a significant dysregulation of all four target miRNAs between tumour and normal colorectal tissue. Conclusions Our study demonstrates that the top six most
Gene Ontology and KEGG Enrichment Analyses of Genes Related to Age-Related Macular Degeneration

Directory of Open Access Journals (Sweden)

Jian Zhang

2014-01-01

Identifying disease genes is one of the most important topics in biomedicine and may facilitate studies on the mechanisms underlying disease. Age-related macular degeneration (AMD) is a serious eye disease; it typically affects older adults and results in a loss of vision due to retina damage. In this study, we attempt to develop an effective method for distinguishing AMD-related genes. Gene ontology and KEGG enrichment analyses of known AMD-related genes were performed, and a classification system was established. In detail, each gene was encoded into a vector by extracting enrichment scores of the gene set, including it and its direct neighbors in STRING, and gene ontology terms or KEGG pathways. Then certain feature-selection methods, including minimum redundancy maximum relevance and incremental feature selection, were adopted to extract key features for the classification system. As a result, 720 GO terms and 11 KEGG pathways were deemed the most important factors for predicting AMD-related genes.
Suppression subtractive hybridization as a tool to identify anthocyanin metabolism-related genes in apple skin.

Science.gov (United States)

Ban, Yusuke; Moriguchi, Takaya

2010-01-01

The pigmentation of anthocyanins is one of the important determinants for consumer preference and marketability in horticultural crops such as fruits and flowers. To elucidate the mechanisms underlying the physiological process leading to the pigmentation of anthocyanins, identification of the genes differentially expressed in response to anthocyanin accumulation is a useful strategy. Currently, microarrays have been widely used to isolate differentially expressed genes. However, the use of microarrays is limited by its high cost of special apparatus and materials. Therefore, availability of microarrays is limited and does not come into common use at present. Suppression subtractive hybridization (SSH) is an alternative tool that has been widely used to identify differentially expressed genes due to its easy handling and relatively low cost. This chapter describes the procedures for SSH, including RNA extraction from polysaccharides and polyphenol-rich samples, poly(A)+ RNA purification, evaluation of subtraction efficiency, and differential screening using reverse northern in apple skin.
Epidermal growth factor gene is a newly identified candidate gene for gout

OpenAIRE

Lin Han; Chunwei Cao; Zhaotong Jia; Shiguo Liu; Zhen Liu; Ruosai Xin; Can Wang; Xinde Li; Wei Ren; Xuefeng Wang; Changgui Li

2016-01-01

Chromosome 4q25 has been identified as a genomic region associated with gout. However, the associations of gout with the genes in this region have not yet been confirmed. Here, we performed two-stage analysis to determine whether variations in candidate genes in the 4q25 region are associated with gout in a male Chinese Han population. We first evaluated 96 tag single nucleotide polymorphisms (SNPs) in eight inflammatory/immune pathway- or glucose/lipid metabolism-related genes in the 4q25 re...
Determining Semantically Related Significant Genes.

Science.gov (United States)

Taha, Kamal

2014-01-01

GO relation embodies some aspects of existence dependency. If GO term x is existence-dependent on GO term y, the presence of y implies the presence of x. Therefore, the genes annotated with the function of the GO term y are usually functionally and semantically related to the genes annotated with the function of the GO term x. A large number of gene set enrichment analysis methods have been developed in recent years. However, most of these methods overlook the structural dependencies between GO terms in the GO graph by not considering the concept of existence dependency. We propose in this paper a biological search engine called RSGSearch that identifies enriched sets of genes annotated with different functions using the concept of existence dependency. We observe that GO term x cannot be existence-dependent on GO term y if x and y have the same specificity (biological characteristics). After encoding into a numeric format the contributions of GO terms annotating target genes to the semantics of their lowest common ancestors (LCAs), RSGSearch uses microarray experiments to identify the most significant LCA that annotates the result genes. We evaluated RSGSearch experimentally and compared it with five gene set enrichment systems. Results showed marked improvement.
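The notion of a lowest common ancestor (LCA) in a term graph, which the abstract above relies on, can be illustrated with a minimal sketch. The toy graph, the term names and both function names are hypothetical and are not taken from RSGSearch or from the real Gene Ontology; the numeric encoding and significance scoring described in the abstract are not reproduced here.

```python
from collections import deque

# Hypothetical toy DAG (child term -> parent terms); these are NOT real GO identifiers.
PARENTS = {
    "root": [],
    "metabolism": ["root"],
    "signaling": ["root"],
    "lipid_metabolism": ["metabolism"],
    "fatty_acid_oxidation": ["lipid_metabolism"],
    "lipid_signaling": ["lipid_metabolism", "signaling"],
}


def ancestors(term, parents):
    """Return the set of all ancestors of `term`, including `term` itself."""
    seen = {term}
    queue = deque([term])
    while queue:
        node = queue.popleft()
        for parent in parents[node]:
            if parent not in seen:
                seen.add(parent)
                queue.append(parent)
    return seen


def lowest_common_ancestors(a, b, parents):
    """Common ancestors of a and b that are not a proper ancestor of another common ancestor."""
    common = ancestors(a, parents) & ancestors(b, parents)
    lowest = set(common)
    for term in common:
        # Every proper ancestor of a common ancestor is, by definition, not "lowest".
        lowest -= ancestors(term, parents) - {term}
    return sorted(lowest)


print(lowest_common_ancestors("fatty_acid_oxidation", "lipid_signaling", PARENTS))
```

Because a term may have several parents, a pair of terms can have more than one LCA, which is why the sketch returns a list rather than a single term.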
NIH Researchers Identify OCD Risk Gene

Science.gov (United States)

... News From NIH: NIH Researchers Identify OCD Risk Gene. Past Issues / Summer 2006 ... and Alcoholism (NIAAA) have identified a previously unknown gene variant that doubles an individual's risk for obsessive- ...

Application of biclustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials

Directory of Open Access Journals (Sweden)

Andrew Williams

2015-12-01

Background: The presence of diverse types of nanomaterials (NMs) in commerce is growing at an exponential pace. As a result, human exposure to these materials in the environment is inevitable, necessitating rapid and reliable toxicity testing methods to accurately assess the potential hazards associated with NMs. In this study, we applied biclustering and gene set enrichment analysis methods to derive essential features of altered lung transcriptome following exposure to NMs that are associated with lung-specific diseases. Several datasets from public microarray repositories describing pulmonary diseases in mouse models following exposure to a variety of substances were examined and functionally related biclusters of genes showing similar expression profiles were identified. The identified biclusters were then used to conduct a gene set enrichment analysis on pulmonary gene expression profiles derived from mice exposed to nano-titanium dioxide (nano-TiO2), carbon black (CB) or carbon nanotubes (CNTs) to determine the disease significance of these data-driven gene sets. Results: Biclusters representing inflammation (chemokine activity), DNA binding, cell cycle, apoptosis, reactive oxygen species (ROS) and fibrosis processes were identified. All of the NM studies were significant with respect to the bicluster related to chemokine activity (DAVID; FDR p-value = 0.032). The bicluster related to pulmonary fibrosis was enriched in studies where toxicity induced by CNT and CB was investigated, suggesting the potential for these materials to induce lung fibrosis. The pro-fibrogenic potential of CNTs is well established. Although CB has not been shown to induce fibrosis, it induces stronger inflammatory, oxidative stress and DNA damage responses than nano-TiO2 particles. Conclusion: The results of the analysis correctly identified all NMs to be inflammogenic and only CB and CNTs as potentially fibrogenic. In addition to identifying several
Multiple Genes Related to Muscle Identified through a Joint Analysis of a Two-stage Genome-wide Association Study for Racing Performance of 1,156 Thoroughbreds

Directory of Open Access Journals (Sweden)

Dong-Hyun Shin

2015-06-01

Thoroughbred, a relatively recent horse breed, is best known for its use in horse racing. Although myostatin (MSTN) variants have been reported to be highly associated with horse racing performance, the trait is more likely to be polygenic in nature. The purpose of this study was to identify genetic variants strongly associated with racing performance by using estimated breeding value (EBV) for race time as a phenotype. We conducted a two-stage genome-wide association study to search for genetic variants associated with the EBV. In the first stage of the genome-wide association study, a relatively large number of markers (~54,000 single-nucleotide polymorphisms, SNPs) were evaluated in a small number of samples (240 horses). In the second stage, a relatively small number of markers identified to have large effects (170 SNPs) were evaluated in a much larger number of samples (1,156 horses). We also validated the SNPs related to MSTN known to have large effects on racing performance and found significant associations in the stage two analysis, but not in stage one. We identified 28 significant SNPs related to 17 genes. Among these, six genes have a function related to myogenesis and five genes are involved in muscle maintenance. To our knowledge, these genes are newly reported for the genetic association with racing performance of Thoroughbreds. This complements recent horse genome-wide association studies of racing performance that identified other SNPs and genes as the most significant variants. These results will help to expand our knowledge of the polygenic nature of racing performance in Thoroughbreds.
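One way to see why a two-stage design can recover associations missed in stage one is to compare per-marker significance thresholds under a multiple-testing correction. The sketch below assumes a Bonferroni correction at a family-wise alpha of 0.05; the abstract does not state which correction the authors used, so both the correction and the alpha value are illustrative assumptions. Only the marker counts (54,000 and 170) come from the abstract.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test p-value threshold that keeps the family-wise error rate at alpha."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


ALPHA = 0.05  # assumed family-wise error rate, not stated in the abstract

stage_one = bonferroni_threshold(ALPHA, 54_000)  # ~54,000 SNPs tested in 240 horses
stage_two = bonferroni_threshold(ALPHA, 170)     # 170 SNPs tested in 1,156 horses

print(f"stage one threshold: {stage_one:.2e}")
print(f"stage two threshold: {stage_two:.2e}")
print(f"stage two is {stage_two / stage_one:.0f}x less stringent per marker")
```

Under these assumptions the second stage combines a much less stringent per-marker threshold with a larger sample, which is consistent with the MSTN-related SNPs reaching significance only in stage two.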
Transcriptomic Analysis Identifies Candidate Genes Related to Intramuscular Fat Deposition and Fatty Acid Composition in the Breast Muscle of Squabs (Columba

Directory of Open Access Journals (Sweden)

Manhong Ye

2016-07-01

Despite the fact that squab is consumed throughout the world because of its high nutritional value and appreciated sensory attributes, aspects related to its characterization, and in particular genetic issues, have rarely been studied. In this study, meat traits in terms of pH, water-holding capacity, intramuscular fat content, and fatty acid profile of the breast muscle of squabs from two meat pigeon breeds were determined. Breed-specific differences were detected in fat-related traits of intramuscular fat content and fatty acid composition. RNA-Sequencing was applied to compare the transcriptomes of muscle and liver tissues between squabs of two breeds to identify candidate genes associated with the differences in the capacity of fat deposition. A total of 27 differentially expressed genes assigned to pathways of lipid metabolism were identified, of which six genes belonged to the peroxisome proliferator-activated receptor signaling pathway along with four other genes. Our results confirmed in part previous reports in livestock and also provided a number of genes which had not been related to fat deposition so far. These genes can serve as a basis for further investigations to screen markers closely associated with intramuscular fat content and fatty acid composition in squabs. The data from this study were deposited in the National Center for Biotechnology Information (NCBI)’s Sequence Read Archive under the accession numbers SRX1680021 and SRX1680022. This is the first transcriptome analysis of the muscle and liver tissue in Columba using next generation sequencing technology. Data provided here are of potential value to dissect functional genes influencing fat deposition in squabs.
A Strategy for Identifying Quantitative Trait Genes Using Gene Expression Analysis and Causal Analysis

Directory of Open Access Journals (Sweden)

Akira Ishikawa

2017-11-01

Large numbers of quantitative trait loci (QTL) affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs) for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.
A Strategy for Identifying Quantitative Trait Genes Using Gene Expression Analysis and Causal Analysis.

Science.gov (United States)

Ishikawa, Akira

2017-11-27

Large numbers of quantitative trait loci (QTL) affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs) for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.
Rice Transcriptome Analysis to Identify Possible Herbicide Quinclorac Detoxification Genes

Directory of Open Access Journals (Sweden)

Wenying Xu

2015-09-01

Quinclorac is a highly selective auxin-type herbicide, and is widely used in the effective control of barnyard grass in paddy rice fields, improving the world’s rice yield. The herbicide mode of action of quinclorac has been proposed and hormone interactions affect quinclorac signaling. Because of widespread use, quinclorac may be transported outside rice fields with the drainage waters, leading to soil and water pollution and environmental health problems. In this study, we used the 57K Affymetrix rice whole-genome array to identify quinclorac signaling response genes to study the molecular mechanisms of action and detoxification of quinclorac in rice plants. Overall, 637 probe sets were identified with differential expression levels under either 6 or 24 h of quinclorac treatment. Auxin-related genes such as GH3 and OsIAAs responded to quinclorac treatment. Gene Ontology analysis showed that detoxification-related gene families were significantly enriched, including cytochrome P450, GST, UGT, and ABC and drug transporter genes. Moreover, real-time RT-PCR analysis showed that top candidate P450 families such as CYP81, CYP709C and CYP72A genes were universally induced by different herbicides. Some Arabidopsis genes for the same P450 family were up-regulated under quinclorac treatment. We conducted rice whole-genome GeneChip analysis and the first global identification of quinclorac response genes. This work may provide potential markers for detoxification of quinclorac and biomonitors of environmental chemical pollution.
Gene-based Association Approach Identify Genes Across Stress Traits in Fruit Flies

DEFF Research Database (Denmark)

Rohde, Palle Duun; Edwards, Stefan McKinnon; Sarup, Pernille Merete

Identification of genes explaining variation in quantitative traits or genetic risk factors of human diseases requires not only good phenotypic and genotypic data, but also efficient statistical methods. Genome-wide association studies may reveal association between phenotypic variation and variation...... approach grouping variants according to gene position, thus lowering the number of statistical tests performed and increasing the probability of identifying genes with small to moderate effects. Using this approach we identify numerous genes associated with different types of stresses in Drosophila...... melanogaster, but also identify common genes that affect the stress traits....
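The idea of grouping variants by gene position, and the resulting drop in the number of tests, can be sketched as follows. All variant identifiers, gene names and coordinates are hypothetical, and the function name is an illustration only; the abstract is truncated and does not say how the authors defined gene boundaries or which gene-level statistic they computed, so only the grouping step is shown.

```python
def group_variants_by_gene(variants, genes):
    """Assign each variant to every gene whose interval contains its position.

    variants: list of (variant_id, chromosome, position)
    genes:    list of (gene_id, chromosome, start, end), coordinates inclusive
    Returns {gene_id: [variant_id, ...]} for genes with at least one variant.
    """
    groups = {}
    for gene_id, gene_chrom, start, end in genes:
        members = [
            variant_id
            for variant_id, chrom, pos in variants
            if chrom == gene_chrom and start <= pos <= end
        ]
        if members:
            groups[gene_id] = members
    return groups


# Hypothetical example data (not real Drosophila coordinates).
variants = [
    ("v1", "2L", 1_050), ("v2", "2L", 1_200), ("v3", "2L", 1_900),
    ("v4", "2L", 5_100), ("v5", "3R", 1_100), ("v6", "3R", 9_000),
]
genes = [
    ("geneA", "2L", 1_000, 2_000),
    ("geneB", "2L", 5_000, 6_000),
    ("geneC", "3R", 1_000, 1_500),
]

groups = group_variants_by_gene(variants, genes)
print(groups)
print(f"single-variant tests: {len(variants)}, gene-level tests: {len(groups)}")
```

In this toy example six single-variant tests collapse into three gene-level tests; the intergenic variant v6 is simply left unassigned, which is one of several possible design choices.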
Expression Analysis of Immune Related Genes Identified from the Coelomocytes of Sea Cucumber (Apostichopus japonicus) in Response to LPS Challenge

Directory of Open Access Journals (Sweden)

Ying Dong

2014-10-01

The sea cucumber (Apostichopus japonicus) occupies a basal position during the evolution of deuterostomes and is also an important aquaculture species. In order to identify more immune effectors, transcriptome sequencing of A. japonicus coelomocytes in response to lipopolysaccharide (LPS) challenge was performed using the Illumina HiSeq™ 2000 platform. One hundred and seven differentially expressed genes were selected and divided into four functional categories including pathogen recognition (25 genes), reorganization of cytoskeleton (27 genes), inflammation (41 genes) and apoptosis (14 genes). They were analyzed to elucidate the mechanisms of host-pathogen interactions and downstream signaling transduction. Quantitative real-time polymerase chain reactions (qRT-PCRs) of 10 representative genes validated the accuracy and reliability of RNA sequencing results with the correlation coefficients from 0.88 to 0.98 and p-value <0.05. Expression analysis of immune-related genes after LPS challenge will be useful in understanding the immune response mechanisms of A. japonicus against pathogen invasion and developing strategies for resistant markers selection.
Identifying essential genes in bacterial metabolic networks with machine learning methods

Science.gov (United States)

2010-01-01

Background Identifying essential genes in bacteria helps to identify potential drug targets and to understand the minimal requirements for a synthetic cell. However, experimentally assaying the essentiality of their coding genes is resource intensive and not feasible for all bacterial organisms, in particular if they are infective. Results We developed a machine learning technique to identify essential genes using the experimental data of genome-wide knock-out screens from one bacterial organism to infer essential genes of another related bacterial organism. We used a broad variety of topological features, sequence characteristics and co-expression properties potentially associated with essentiality, such as flux deviations, centrality, codon frequencies of the sequences, co-regulation and phyletic retention. An organism-wise cross-validation on bacterial species yielded reliable results with good accuracies (area under the receiver-operator-curve of 75% - 81%). Finally, it was applied to drug target predictions for Salmonella typhimurium. We compared our predictions to the viability of experimental knock-outs of S. typhimurium and identified 35 enzymes, which are highly relevant to be considered as potential drug targets. Specifically, we detected promising drug targets in the non-mevalonate pathway. Conclusions Using elaborated features characterizing network topology, sequence information and microarray data enables the prediction of essential genes in a related query organism from a bacterial reference organism, without any knowledge about the essentiality of genes of the query organism. In general, such a method is beneficial for inferring drug targets when experimental data about genome-wide knockout screens is not available for the investigated organism. PMID:20438628
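The accuracy measure quoted above, the area under the receiver operating characteristic curve (AUC), can be computed directly from predicted scores and known labels. The sketch below uses the standard rank-based (Mann-Whitney) formulation: the AUC equals the probability that a randomly chosen positive example scores higher than a randomly chosen negative one, with ties counted as one half. The labels and scores are made-up illustration data, not values from the study, and the function name is hypothetical.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    positives = [s for label, s in zip(labels, scores) if label == 1]
    negatives = [s for label, s in zip(labels, scores) if label == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical data: 1 = essential gene, 0 = non-essential gene.
labels = [1, 1, 0, 0]
scores = [0.9, 0.4, 0.5, 0.1]
print(roc_auc(labels, scores))
```

In an organism-wise cross-validation as described in the abstract, a classifier would be trained on one organism's knock-out data and a score like this computed on a held-out organism; that training step is outside the scope of this sketch.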
Expression profiling identifies genes involved in emphysema severity

Directory of Open Access Journals (Sweden)

Bowman Rayleen V

2009-09-01

Chronic obstructive pulmonary disease (COPD) is a major public health problem. The aim of this study was to identify genes involved in emphysema severity in COPD patients. Gene expression profiling was performed on total RNA extracted from non-tumor lung tissue from 30 smokers with emphysema. Class comparison analysis based on gas transfer measurement was performed to identify differentially expressed genes. Genes were then selected for technical validation by quantitative reverse transcriptase-PCR (qRT-PCR) if also represented on microarray platforms used in previously published emphysema studies. Genes technically validated advanced to tests of biological replication by qRT-PCR using an independent test set of 62 lung samples. Class comparison identified 98 differentially expressed genes (p ... p ... Gene expression profiling of lung from emphysema patients identified seven candidate genes associated with emphysema severity including COL6A3, SERPINF1, ZNHIT6, NEDD4, CDKN2A, NRN1 and GSTM3.
Exome sequencing of a large family identifies potential candidate genes contributing risk to bipolar disorder.

Science.gov (United States)

Zhang, Tianxiao; Hou, Liping; Chen, David T; McMahon, Francis J; Wang, Jen-Chyong; Rice, John P

2018-03-01

Bipolar disorder is a mental illness with lifetime prevalence of about 1%. Previous genetic studies have identified multiple chromosomal linkage regions and candidate genes that might be associated with bipolar disorder. The present study aimed to identify potential susceptibility variants for bipolar disorder using 6 related case samples from a four-generation family. A combination of exome sequencing and linkage analysis was performed to identify potential susceptibility variants for bipolar disorder. Our study identified a list of five potential candidate genes for bipolar disorder. Among these five genes, GRID1 (Glutamate Receptor Delta-1 Subunit), which was previously reported to be associated with several psychiatric disorders and brain related traits, is particularly interesting. Variants with functional significance in this gene were identified from two cousins in our bipolar disorder pedigree. Our findings suggest a potential role for these genes and the related rare variants in the onset and development of bipolar disorder in this one family. Additional research is needed to replicate these findings and evaluate their patho-biological significance.
Comprehensive analysis of gene expression patterns of hedgehog-related genes

Directory of Open Access Journals (Sweden)

Baillie David

2006-10-01

Background The Caenorhabditis elegans genome encodes ten proteins that share sequence similarity with the Hedgehog signaling molecule through their C-terminal autoprocessing Hint/Hog domain. These proteins contain novel N-terminal domains, and C. elegans encodes dozens of additional proteins containing only these N-terminal domains. These gene families are called warthog, groundhog, ground-like and quahog, collectively called hedgehog (hh)-related genes. Previously, the expression pattern of seventeen genes was examined, which showed that they are primarily expressed in the ectoderm. Results With the completion of the C. elegans genome sequence in November 2002, we reexamined and identified 61 hh-related ORFs. Further, we identified 49 hh-related ORFs in C. briggsae. ORF analysis revealed that 30% of the genes still had errors in their predictions and we improved these predictions here. We performed a comprehensive expression analysis using GFP fusions of the putative intergenic regulatory sequence with one or two transgenic lines for most genes. The hh-related genes are expressed in one or a few of the following tissues: hypodermis, seam cells, excretory duct and pore cells, vulval epithelial cells, rectal epithelial cells, pharyngeal muscle or marginal cells, arcade cells, support cells of sensory organs, and neuronal cells. Using time-lapse recordings, we discovered that some hh-related genes are expressed in a cyclical fashion in phase with molting during larval development. We also generated several translational GFP fusions, but they did not show any subcellular localization. In addition, we also studied the expression patterns of two genes with similarity to Drosophila frizzled, T23D8.1 and F27E11.3A, and the ortholog of the Drosophila gene dally-like, gpn-1, which is a heparan sulfate proteoglycan. The two frizzled homologs are expressed in a few neurons in the head, and gpn-1 is expressed in the pharynx. Finally, we compare the
Genome-wide transcriptome study in wheat identified candidate genes related to processing quality, majority of them showing interaction (quality x development) and having temporal and spatial distributions

Science.gov (United States)

2014-01-01

Background The cultivated bread wheat (Triticum aestivum L.) possesses unique flour quality, which can be processed into many end-use food products such as bread, pasta, chapatti (unleavened flat bread), biscuit, etc. The present wheat varieties require improvement in processing quality to meet the increasing demand of better quality food products. However, processing quality is very complex and controlled by many genes, which have not been completely explored. To identify the candidate genes whose expressions changed due to variation in processing quality and interaction (quality x development), genome-wide transcriptome studies were performed in two sets of diverse Indian wheat varieties differing in chapatti quality. It is also important to understand the temporal and spatial distributions of their expressions for designing tissue and growth specific functional genomics experiments. Results Gene-specific two-way ANOVA analysis of expression of about 55 K transcripts in two diverse sets of Indian wheat varieties for chapatti quality at three seed developmental stages identified 236 differentially expressed probe sets (10-fold). Out of 236, 110 probe sets were identified for chapatti quality. Many processing quality related key genes such as glutenin and gliadins, puroindolines, grain softness protein, alpha and beta amylases, and proteases were identified, and many other candidate genes related to cellular and molecular functions were also identified. The ANOVA analysis revealed that the expression of 56 of 110 probe sets was involved in interaction (quality x development). The majority of the probe sets showed differential expression at early stage of seed development i.e. temporal expression. Meta-analysis revealed that the majority of the genes were expressed in one or a few growth stages, indicating spatial distribution of their expressions. The differential expressions of a few candidate genes such as pre-alpha/beta-gliadin and gamma gliadin were validated by RT
Genome-wide transcriptome study in wheat identified candidate genes related to processing quality, majority of them showing interaction (quality x development) and having temporal and spatial distributions.

Science.gov (United States)

Singh, Anuradha; Mantri, Shrikant; Sharma, Monica; Chaudhury, Ashok; Tuli, Rakesh; Roy, Joy

2014-01-16

The cultivated bread wheat (Triticum aestivum L.) possesses unique flour quality, which can be processed into many end-use food products such as bread, pasta, chapatti (unleavened flat bread), biscuit, etc. The present wheat varieties require improvement in processing quality to meet the increasing demand of better quality food products. However, processing quality is very complex and controlled by many genes, which have not been completely explored. To identify the candidate genes whose expressions changed due to variation in processing quality and interaction (quality x development), genome-wide transcriptome studies were performed in two sets of diverse Indian wheat varieties differing in chapatti quality. It is also important to understand the temporal and spatial distributions of their expressions for designing tissue and growth specific functional genomics experiments. Gene-specific two-way ANOVA analysis of expression of about 55 K transcripts in two diverse sets of Indian wheat varieties for chapatti quality at three seed developmental stages identified 236 differentially expressed probe sets (10-fold). Out of 236, 110 probe sets were identified for chapatti quality. Many processing quality related key genes such as glutenin and gliadins, puroindolines, grain softness protein, alpha and beta amylases, and proteases were identified, and many other candidate genes related to cellular and molecular functions were also identified. The ANOVA analysis revealed that the expression of 56 of 110 probe sets was involved in interaction (quality x development). The majority of the probe sets showed differential expression at early stage of seed development i.e. temporal expression. Meta-analysis revealed that the majority of the genes were expressed in one or a few growth stages, indicating spatial distribution of their expressions. The differential expressions of a few candidate genes such as pre-alpha/beta-gliadin and gamma gliadin were validated by RT-PCR. Therefore, this study
Sleeping Beauty transposon mutagenesis identifies genes that cooperate with mutant Smad4 in gastric cancer development.

Science.gov (United States)

Takeda, Haruna; Rust, Alistair G; Ward, Jerrold M; Yew, Christopher Chin Kuan; Jenkins, Nancy A; Copeland, Neal G

2016-04-05

Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4(+/-) mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC.
Identifying Cancer Driver Genes Using Replication-Incompetent Retroviral Vectors

Directory of Open Access Journals (Sweden)

Victor M. Bii

2016-10-01

Identifying novel genes that drive tumor metastasis and drug resistance has significant potential to improve patient outcomes. High-throughput sequencing approaches have identified cancer genes, but distinguishing driver genes from passengers remains challenging. Insertional mutagenesis screens using replication-incompetent retroviral vectors have emerged as a powerful tool to identify cancer genes. Unlike replicating retroviruses and transposons, replication-incompetent retroviral vectors lack additional mutagenesis events that can complicate the identification of driver mutations from passenger mutations. They can also be used for almost any human cancer due to the broad tropism of the vectors. Replication-incompetent retroviral vectors have the ability to dysregulate nearby cancer genes via several mechanisms including enhancer-mediated activation of gene promoters. The integrated provirus acts as a unique molecular tag for nearby candidate driver genes which can be rapidly identified using well established methods that utilize next generation sequencing and bioinformatics programs. Recently, retroviral vector screens have been used to efficiently identify candidate driver genes in prostate, breast, liver and pancreatic cancers. Validated driver genes can be potential therapeutic targets and biomarkers. In this review, we describe the emergence of retroviral insertional mutagenesis screens using replication-incompetent retroviral vectors as a novel tool to identify cancer driver genes in different cancer types.
Protein functional links in Trypanosoma brucei, identified by gene fusion analysis

Directory of Open Access Journals (Sweden)

Trimpalis Philip

2011-07-01

Background Domain or gene fusion analysis is a bioinformatics method for detecting gene fusions in one organism by comparing its genome to that of other organisms. The occurrence of gene fusions suggests that the two original genes that participated in the fusion are functionally linked, i.e. their gene products interact either as part of a multi-subunit protein complex, or in a metabolic pathway. Gene fusion analysis has been used to identify protein functional links in prokaryotes as well as in eukaryotic model organisms, such as yeast and Drosophila. Results In this study we have extended this approach to include a number of recently sequenced protists, four of which are pathogenic, to identify fusion linked proteins in Trypanosoma brucei, the causative agent of African sleeping sickness. We have also examined the evolution of the gene fusion events identified, to determine whether they can be attributed to fusion or fission, by looking at the conservation of the fused genes and of the individual component genes across the major eukaryotic and prokaryotic lineages. We find relatively limited occurrence of gene fusions/fissions within the protist lineages examined. Our results point to two trypanosome-specific gene fissions, which have recently been experimentally confirmed, one fusion involving proteins involved in the same metabolic pathway, as well as two novel putative functional links between fusion-linked protein pairs. Conclusions This is the first study of protein functional links in T. brucei identified by gene fusion analysis. We have used strict thresholds and only discuss results which are highly likely to be genuine and which either have already been or can be experimentally verified. We discuss the possible impact of the identification of these novel putative protein-protein interactions on the development of new trypanosome therapeutic drugs.
Identifying genetic relatives without compromising privacy.

Science.gov (United States)

He, Dan; Furlotte, Nicholas A; Hormozdiari, Farhad; Joo, Jong Wha J; Wadia, Akshay; Ostrovsky, Rafail; Sahai, Amit; Eskin, Eleazar

2014-04-01

The development of high-throughput genomic technologies has impacted many areas of genetic research. While many applications of these technologies focus on the discovery of genes involved in disease from population samples, applications of genomic technologies to an individual's genome or personal genomics have recently gained much interest. One such application is the identification of relatives from genetic data. In this application, genetic information from a set of individuals is collected in a database, and each pair of individuals is compared in order to identify genetic relatives. An inherent issue that arises in the identification of relatives is privacy. In this article, we propose a method for identifying genetic relatives without compromising privacy by taking advantage of novel cryptographic techniques customized for secure and private comparison of genetic information. We demonstrate the utility of these techniques by allowing a pair of individuals to discover whether or not they are related without compromising their genetic information or revealing it to a third party. The idea is that individuals only share enough special-purpose cryptographically protected information with each other to identify whether or not they are relatives, but not enough to expose any information about their genomes. We show in HapMap and 1000 Genomes data that our method can recover first- and second-order genetic relationships and, through simulations, show that our method can identify relationships as distant as third cousins while preserving privacy.
Gene Signature in Sessile Serrated Polyps Identifies Colon Cancer Subtype

Science.gov (United States)

Kanth, Priyanka; Bronner, Mary P.; Boucher, Kenneth M.; Burt, Randall W.; Neklason, Deborah W.; Hagedorn, Curt H.; Delker, Don A.

2016-01-01

Sessile serrated colon adenoma/polyps (SSA/Ps) are found during routine screening colonoscopy and may account for 20–30% of colon cancers. However, differentiating SSA/Ps from hyperplastic polyps (HP) with little risk of cancer is challenging and complementary molecular markers are needed. Additionally, the molecular mechanisms of colon cancer development from SSA/Ps are poorly understood. RNA sequencing was performed on 21 SSA/Ps, 10 HPs, 10 adenomas, 21 uninvolved colon and 20 control colon specimens. Differential expression and leave-one-out cross validation methods were used to define a unique gene signature of SSA/Ps. Our SSA/P gene signature was evaluated in colon cancer RNA-Seq data from The Cancer Genome Atlas (TCGA) to identify a subtype of colon cancers that may develop from SSA/Ps. A total of 1422 differentially expressed genes were found in SSA/Ps relative to controls. Serrated polyposis syndrome (n=12) and sporadic SSA/Ps (n=9) exhibited almost complete (96%) gene overlap. A 51-gene panel in SSA/P showed similar expression in a subset of TCGA colon cancers with high microsatellite instability (MSI-H). A smaller seven-gene panel showed high sensitivity and specificity in identifying BRAF mutant, CpG island methylator phenotype high (CIMP-H) and MLH1 silenced colon cancers. We describe a unique gene signature in SSA/Ps that identifies a subset of colon cancers likely to develop through the serrated pathway. These gene panels may be utilized for improved differentiation of SSA/Ps from HPs and provide insights into novel molecular pathways altered in colon cancer arising from the serrated pathway. PMID:27026680
Novel Myopia Genes and Pathways Identified From Syndromic Forms of Myopia

Science.gov (United States)

Loughman, James; Wildsoet, Christine F.; Williams, Cathy; Guggenheim, Jeremy A.

2018-01-01

Purpose: To test the hypothesis that genes known to cause clinical syndromes featuring myopia also harbor polymorphisms contributing to nonsyndromic refractive errors. Methods: Clinical phenotypes and syndromes that have refractive errors as a recognized feature were identified using the Online Mendelian Inheritance in Man (OMIM) database. One hundred fifty-four unique causative genes were identified, of which 119 were specifically linked with myopia and 114 represented syndromic myopia (i.e., myopia and at least one other clinical feature). Myopia was the only refractive error listed for 98 genes and hyperopia was the only refractive error noted for 28 genes, with the remaining 28 genes linked to phenotypes with multiple forms of refractive error. Pathway analysis was carried out to find biological processes overrepresented within these sets of genes. Genetic variants located within 50 kb of the 119 myopia-related genes were evaluated for involvement in refractive error by analysis of summary statistics from genome-wide association studies (GWAS) conducted by the CREAM Consortium and 23andMe, using both single-marker and gene-based tests. Results: Pathway analysis identified several biological processes already implicated in refractive error development through prior GWAS analyses and animal studies, including extracellular matrix remodeling, focal adhesion, and axon guidance, supporting the research hypothesis. Novel pathways also implicated in myopia development included mannosylation, glycosylation, lens development, gliogenesis, and Schwann cell differentiation. Hyperopia was found to be linked to a different pattern of biological processes, mostly related to organogenesis. Comparison with GWAS findings further confirmed that syndromic myopia genes were enriched for genetic variants that influence refractive errors in the general population. Gene-based analyses implicated 21 novel candidate myopia genes (ADAMTS18, ADAMTS2, ADAMTSL4, AGK, ALDH18A1, ASXL1, COL4A1

Cross-species multiple environmental stress responses: An integrated approach to identify candidate genes for multiple stress tolerance in sorghum (Sorghum bicolor (L.) Moench) and related model species.

Directory of Open Access Journals (Sweden)

Adugna Abdi Woldesemayat

Crop response to the changing climate and unpredictable effects of global warming with adverse conditions such as drought stress has brought concerns about food security to the fore; crop yield loss is a major cause of concern in this regard. Identification of genes with multiple responses across environmental stresses is the genetic foundation that leads to crop adaptation to environmental perturbations. In this paper, we introduce an integrated approach to assess candidate genes for multiple stress responses across species. The approach combines ontology-based semantic data integration with expression profiling, comparative genomics, phylogenomics, functional gene enrichment and gene enrichment network analysis to identify genes associated with plant stress phenotypes. Five different ontologies, viz., Gene Ontology (GO), Trait Ontology (TO), Plant Ontology (PO), Growth Ontology (GRO) and Environment Ontology (EO), were used to semantically integrate drought-related information. Target genes linked to Quantitative Trait Loci (QTLs) controlling yield and stress tolerance in sorghum (Sorghum bicolor (L.) Moench) and closely related species were identified. Based on the enriched GO terms of the biological processes, 1116 sorghum genes with potential responses to 5 different stresses, namely drought (18%), salt (32%), cold (20%), heat (8%) and oxidative stress (25%), were identified to be over-expressed. Out of 169 sorghum drought-responsive, QTL-associated genes that were identified based on expression datasets, 56% were shown to have multiple stress responses. On the other hand, out of 168 additional genes that have been evaluated for orthologous pairs, 90% were conserved across species for drought tolerance. Over 50% of identified maize and rice genes were responsive to drought and salt stresses and were co-located within multifunctional QTLs. Among the total identified multi-stress responsive genes, 272 targets were shown to be co-localized within QTLs
Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes

Directory of Open Access Journals (Sweden)

Paules Richard S

2007-11-01

Background: A common observation in the analysis of gene expression data is that many genes display similarity in their expression patterns and therefore appear to be co-regulated. However, the variation associated with microarray data and the complexity of the experimental designs make the acquisition of co-expressed genes a challenge. We developed a novel method for Extracting microarray gene expression Patterns and Identifying co-expressed Genes, designated as EPIG. The approach utilizes the underlying structure of gene expression data to extract patterns and identify co-expressed genes that are responsive to experimental conditions. Results: Through evaluation of the correlations among profiles, the magnitude of variation in gene expression profiles, and profile signal-to-noise ratios, EPIG extracts a set of patterns representing co-expressed genes. The method is shown to work well with a simulated data set and microarray data obtained from time-series studies of dauer recovery and L1 starvation in C. elegans and after ultraviolet (UV)- or ionizing radiation (IR)-induced DNA damage in diploid human fibroblasts. With the simulated data set, EPIG extracted the appropriate number of patterns which were more stable and homogeneous than the set of patterns that were determined using the CLICK or CAST clustering algorithms. However, CLICK performed better than EPIG and CAST with respect to the average correlation between clusters/patterns of the simulated data. With real biological data, EPIG extracted more dauer-specific patterns than CLICK. Furthermore, analysis of the IR/UV data revealed 18 unique patterns and 2661 genes out of approximately 17,000 that were identified as significantly expressed and categorized to the patterns by EPIG. The time-dependent patterns displayed similar and dissimilar responses between IR and UV treatments. Gene Ontology analysis applied to each pattern-related subset of co-expressed genes revealed underlying
Relating genes to function: identifying enriched transcription factors using the ENCODE ChIP-Seq significance tool.

Science.gov (United States)

Auerbach, Raymond K; Chen, Bin; Butte, Atul J

2013-08-01

Biological analysis has shifted from identifying genes and transcripts to mapping these genes and transcripts to biological functions. The ENCODE Project has generated hundreds of ChIP-Seq experiments spanning multiple transcription factors and cell lines for public use, but tools for a biomedical scientist to analyze these data are either non-existent or tailored to narrow biological questions. We present the ENCODE ChIP-Seq Significance Tool, a flexible web application leveraging public ENCODE data to identify enriched transcription factors in a gene or transcript list for comparative analyses. The ENCODE ChIP-Seq Significance Tool is written in JavaScript on the client side and has been tested on Google Chrome, Apple Safari and Mozilla Firefox browsers. Server-side scripts are written in PHP and leverage R and a MySQL database. The tool is available at http://encodeqt.stanford.edu. Contact: abutte@stanford.edu. Supplementary material is available at Bioinformatics online.
A Multiomics Approach to Identify Genes Associated with Childhood Asthma Risk and Morbidity.

Science.gov (United States)

Forno, Erick; Wang, Ting; Yan, Qi; Brehm, John; Acosta-Perez, Edna; Colon-Semidey, Angel; Alvarez, Maria; Boutaoui, Nadia; Cloutier, Michelle M; Alcorn, John F; Canino, Glorisa; Chen, Wei; Celedón, Juan C

2017-10-01

Childhood asthma is a complex disease. In this study, we aim to identify genes associated with childhood asthma through a multiomics "vertical" approach that integrates multiple analytical steps using linear and logistic regression models. In a case-control study of childhood asthma in Puerto Ricans (n = 1,127), we used adjusted linear or logistic regression models to evaluate associations between several analytical steps of omics data, including genome-wide (GW) genotype data, GW methylation, GW expression profiling, cytokine levels, asthma-intermediate phenotypes, and asthma status. At each point, only the top genes/single-nucleotide polymorphisms/probes/cytokines were carried forward for subsequent analysis. In step 1, asthma modified the gene expression-protein level association for 1,645 genes; pathway analysis showed an enrichment of these genes in the cytokine signaling system (n = 269 genes). In steps 2-3, expression levels of 40 genes were associated with intermediate phenotypes (asthma onset age, forced expiratory volume in 1 second, exacerbations, eosinophil counts, and skin test reactivity); of those, methylation of seven genes was also associated with asthma. Of these seven candidate genes, IL5RA was also significant in analytical steps 4-8. We then measured plasma IL-5 receptor α levels, which were associated with asthma age of onset and moderate-severe exacerbations. In addition, in silico database analysis showed that several of our identified IL5RA single-nucleotide polymorphisms are associated with transcription factors related to asthma and atopy. This approach integrates several analytical steps and is able to identify biologically relevant asthma-related genes, such as IL5RA. It differs from other methods that rely on complex statistical models with various assumptions.
Expression microarray meta-analysis identifies genes associated with Ras/MAPK and related pathways in progression of muscle-invasive bladder transition cell carcinoma.

Directory of Open Access Journals (Sweden)

Jonathan A Ewald

The effective detection and management of muscle-invasive bladder Transition Cell Carcinoma (TCC) continues to be an urgent clinical challenge. While some differences in gene expression and function in papillary (Ta), superficial (T1) and muscle-invasive (≥T2) bladder cancers have been investigated, the understanding of mechanisms involved in the progression of bladder tumors remains incomplete. Statistical methods of pathway-enrichment, cluster analysis and text-mining can extract and help interpret functional information about gene expression patterns in large sets of genomic data. The public availability of patient-derived expression microarray data allows open access and analysis of large amounts of clinical data. Using these resources, we investigated gene expression differences associated with tumor progression and muscle-invasive TCC. Gene expression was calculated relative to Ta tumors to assess progression-associated differences, revealing a network of genes related to Ras/MAPK and PI3K signaling pathways with increased expression. Further, we identified genes within this network that are similarly expressed in superficial Ta and T1 stages but altered in muscle-invasive T2 tumors, finding 7 genes (COL3A1, COL5A1, COL11A1, FN1, ErbB3, MAPK10 and CDC25C) whose expression patterns in muscle-invasive tumors are consistent in 5 to 7 independent outside microarray studies. Further, we found increased expression of the fibrillar collagen proteins COL3A1 and COL5A1 in muscle-invasive tumor samples and metastatic T24 cells. Our results suggest that increased expression of genes involved in mitogenic signaling may support the progression of muscle-invasive bladder tumors that generally lack activating mutations in these pathways, while expression changes of fibrillar collagens, fibronectin and specific signaling proteins are associated with muscle-invasive disease. These results identify potential biomarkers and targets for TCC treatments, and
Contribution of WUSCHEL-related homeobox (WOX) genes to identify the phylogenetic relationships among Petunia species

Directory of Open Access Journals (Sweden)

Ana Lúcia Anversa Segatto

Developmental genes are believed to contribute to major changes during plant evolution, from infrageneric to higher levels. Due to their putative high sequence conservation, developmental genes are rarely used as molecular markers, and few studies including these sequences at low taxonomic levels exist. WUSCHEL-related homeobox (WOX) genes are transcription factors exclusively present in plants and are involved in developmental processes. In this study, we characterized the infrageneric genetic variation of Petunia WOX genes. We obtained phylogenetic relationships consistent with other phylogenies based on nuclear markers, but with higher statistical support, resolution in terminals, and compatibility with flower morphological changes.
Transcriptome Sequencing of Chemically Induced Aquilaria sinensis to Identify Genes Related to Agarwood Formation.

Science.gov (United States)

Ye, Wei; Wu, Hongqing; He, Xin; Wang, Lei; Zhang, Weimin; Li, Haohua; Fan, Yunfei; Tan, Guohui; Liu, Taomei; Gao, Xiaoxia

2016-01-01

Agarwood is a traditional Chinese medicine used as a clinical sedative, carminative, and antiemetic drug. Agarwood is formed in Aquilaria sinensis when A. sinensis trees are threatened by external physical, chemical injury or endophytic fungal irritation. However, the mechanism of agarwood formation via chemical induction remains unclear. In this study, we characterized the transcriptome of different parts of a chemically induced A. sinensis trunk sample with agarwood. The Illumina sequencing platform was used to identify the genes involved in agarwood formation. A five-year-old Aquilaria sinensis treated by formic acid was selected. The white wood part (B1 sample), the transition part between agarwood and white wood (W2 sample), the agarwood part (J3 sample), and the rotten wood part (F5 sample) were collected for transcriptome sequencing. Accordingly, 54,685,634 clean reads, which were assembled into 83,467 unigenes, were obtained with a Q20 value of 97.5%. A total of 50,565 unigenes were annotated using the Nr, Nt, SWISS-PROT, KEGG, COG, and GO databases. In particular, 171,331,352 unigenes were annotated by various pathways, including the sesquiterpenoid (ko00909) and plant-pathogen interaction (ko03040) pathways. These pathways were related to sesquiterpenoid biosynthesis and defensive responses to chemical stimulation. The transcriptome data of the different parts of the chemically induced A. sinensis trunk provide a rich source of materials for discovering and identifying the genes involved in sesquiterpenoid production and in defensive responses to chemical stimulation. This study is the first to use de novo sequencing and transcriptome assembly for different parts of chemically induced A. sinensis. Results demonstrate that the sesquiterpenoid biosynthesis pathway and WRKY transcription factor play important roles in agarwood formation via chemical induction. The comparative analysis of the transcriptome data of agarwood and A. sinensis lays the foundation
Integration of human adipocyte chromosomal interactions with adipose gene expression prioritizes obesity-related genes from GWAS.

Science.gov (United States)

Pan, David Z; Garske, Kristina M; Alvarez, Marcus; Bhagat, Yash V; Boocock, James; Nikkola, Elina; Miao, Zong; Raulerson, Chelsea K; Cantor, Rita M; Civelek, Mete; Glastonbury, Craig A; Small, Kerrin S; Boehnke, Michael; Lusis, Aldons J; Sinsheimer, Janet S; Mohlke, Karen L; Laakso, Markku; Pajukanta, Päivi; Ko, Arthur

2018-04-17

Increased adiposity is a hallmark of obesity and overweight, which affect 2.2 billion people world-wide. Understanding the genetic and molecular mechanisms that underlie obesity-related phenotypes can help to improve treatment options and drug development. Here we perform promoter Capture Hi-C in human adipocytes to investigate interactions between gene promoters and distal elements as a transcription-regulating mechanism contributing to these phenotypes. We find that promoter-interacting elements in human adipocytes are enriched for adipose-related transcription factor motifs, such as PPARG and CEBPB, and contribute to heritability of cis-regulated gene expression. We further intersect these data with published genome-wide association studies for BMI and BMI-related metabolic traits to identify the genes that are under genetic cis regulation in human adipocytes via chromosomal interactions. This integrative genomics approach identifies four cis-eQTL-eGene relationships associated with BMI or obesity-related traits, including rs4776984 and MAP2K5, which we further confirm by EMSA, and highlights 38 additional candidate genes.
Epidermal growth factor gene is a newly identified candidate gene for gout.

Science.gov (United States)

Han, Lin; Cao, Chunwei; Jia, Zhaotong; Liu, Shiguo; Liu, Zhen; Xin, Ruosai; Wang, Can; Li, Xinde; Ren, Wei; Wang, Xuefeng; Li, Changgui

2016-08-10

Chromosome 4q25 has been identified as a genomic region associated with gout. However, the associations of gout with the genes in this region have not yet been confirmed. Here, we performed a two-stage analysis to determine whether variations in candidate genes in the 4q25 region are associated with gout in a male Chinese Han population. We first evaluated 96 tag single nucleotide polymorphisms (SNPs) in eight inflammatory/immune pathway- or glucose/lipid metabolism-related genes in the 4q25 region in 480 male gout patients and 480 controls. The SNP rs12504538, located in the elongation of very-long-chain-fatty-acid-like family member 6 gene (Elovl6), was found to be associated with gout susceptibility (Padjusted = 0.00595). In the second stage of analysis, we performed fine mapping analysis of 93 tag SNPs in Elovl6 and in the epidermal growth factor gene (EGF) and its flanking regions in 1017 male gout patients and 1897 healthy male controls. We observed a significant association between the T allele of EGF rs2298999 and gout (odds ratio = 0.77, 95% confidence interval = 0.67-0.88, Padjusted = 6.42 × 10^-3). These results provide the first evidence for an association between the EGF rs2298999 C/T polymorphism and gout. Our findings should be validated in additional populations.
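As a reading aid, the reported odds ratio and confidence interval can be related to each other with a short calculation. The sketch below assumes the interval is a standard Wald interval on the log-odds scale (an assumption; the abstract does not say how it was computed) and recovers an approximate standard error, z statistic and unadjusted two-sided p-value. The unadjusted value is not expected to match the reported Padjusted, which reflects a multiple-testing correction not described in the abstract.

```python
import math

# Values reported in the abstract for the T allele of EGF rs2298999.
odds_ratio = 0.77
ci_low, ci_high = 0.67, 0.88

# Assumption: a Wald interval, i.e. log(OR) +/- 1.96 * SE on the log scale.
se_log_or = (math.log(ci_high) - math.log(ci_low)) / (2 * 1.96)

# z statistic for the null hypothesis OR = 1 (log OR = 0).
z = math.log(odds_ratio) / se_log_or

# Two-sided p-value from the standard normal distribution.
p_unadjusted = math.erfc(abs(z) / math.sqrt(2))

# A Wald interval is symmetric on the log scale, so the geometric mean
# of its bounds should sit close to the point estimate.
geometric_midpoint = math.sqrt(ci_low * ci_high)

print(f"SE(log OR) ~ {se_log_or:.4f}")
print(f"z ~ {z:.2f}, unadjusted two-sided p ~ {p_unadjusted:.2e}")
print(f"geometric midpoint of CI ~ {geometric_midpoint:.3f}")
```

The geometric midpoint landing near 0.77 is consistent with the Wald assumption, but it does not prove that this is how the authors derived the interval.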
Gene Network Construction from Microarray Data Identifies a Key Network Module and Several Candidate Hub Genes in Age-Associated Spatial Learning Impairment.

Science.gov (United States)

Uddin, Raihan; Singh, Shiva M

2017-01-01

As humans age, many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI-related genes and their networks based on co-expression relationships of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in "learning and memory" related functions and pathways. Subsequent differential network analysis of this "learning and memory" module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known functions of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. Taken together, they
Systematic enrichment analysis of gene expression profiling studies identifies consensus pathways implicated in colorectal cancer development

Directory of Open Access Journals (Sweden)

Jesús Lascorz

2011-01-01

Background: A large number of gene expression profiling (GEP) studies on colorectal carcinogenesis have been performed but no reliable gene signature has been identified so far due to the lack of reproducibility in the reported genes. There is growing evidence that functionally related genes, rather than individual genes, contribute to the etiology of complex traits. We used, as a novel approach, pathway enrichment tools to define functionally related genes that are consistently up- or down-regulated in colorectal carcinogenesis. Materials and Methods: We started the analysis with 242 unique annotated genes that had been reported by any of three recent meta-analyses covering GEP studies on genes differentially expressed in carcinoma vs normal mucosa. Most of these genes (218, 91.9%) had been reported in at least three GEP studies. These 242 genes were submitted to bioinformatic analysis using a total of nine tools to detect enrichment of Gene Ontology (GO) categories or Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. As a final consistency criterion the pathway categories had to be enriched by several tools to be taken into consideration. Results: Our pathway-based enrichment analysis identified the categories of ribosomal protein constituents, extracellular matrix receptor interaction, carbonic anhydrase isozymes, and a general category related to inflammation and cellular response as significantly and consistently overrepresented entities. Conclusions: We triaged the genes covered by the published GEP literature on colorectal carcinogenesis and subjected them to multiple enrichment tools in order to identify the consistently enriched gene categories. These turned out to have known functional relationships to cancer development and thus deserve further investigation.
Diametrical clustering for identifying anti-correlated gene clusters.

Science.gov (United States)

Dhillon, Inderjit S; Marcotte, Edward M; Roshan, Usman

2003-09-01

Clustering genes based upon their expression patterns allows us to predict gene function. Most existing clustering algorithms cluster genes together when their expression patterns show high positive correlation. However, it has been observed that genes whose expression patterns are strongly anti-correlated can also be functionally similar. Biologically, this is not unintuitive: genes responding to the same stimuli, regardless of the nature of the response, are more likely to operate in the same pathways. We present a new diametrical clustering algorithm that explicitly identifies anti-correlated clusters of genes. Our algorithm proceeds by iteratively (i) re-partitioning the genes and (ii) computing the dominant singular vector of each gene cluster, each singular vector serving as the prototype of a 'diametric' cluster. We empirically show the effectiveness of the algorithm in identifying diametrical or anti-correlated clusters. Testing the algorithm on yeast cell cycle data, fibroblast gene expression data, and DNA microarray data from yeast mutants reveals that opposed cellular pathways can be discovered with this method. We present systems whose mRNA expression patterns, and likely their functions, oppose the yeast ribosome and proteasome, along with evidence for the inverse transcriptional regulation of a number of cellular systems.
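The two alternating steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that each gene profile is centered and scaled to unit length, that a gene is assigned to the prototype with the largest squared dot product (so correlated and anti-correlated genes score alike), and that prototypes are seeded by a simple farthest-first rule. The function name `diametrical_clustering`, the seeding rule and the toy data are all hypothetical.

```python
import numpy as np

def diametrical_clustering(X, k, n_iter=50):
    """Sketch of diametrical clustering on a genes-by-conditions matrix X."""
    X = np.asarray(X, dtype=float)
    # Center and scale each gene so dot products equal Pearson correlations.
    # Assumes no gene has a constant profile (that would give a zero norm).
    X = X - X.mean(axis=1, keepdims=True)
    X = X / np.linalg.norm(X, axis=1, keepdims=True)

    # Hypothetical farthest-first seeding: start from the first gene, then add
    # the gene least (anti-)correlated with the prototypes chosen so far.
    protos = [X[0]]
    for _ in range(1, k):
        sim = np.max((X @ np.array(protos).T) ** 2, axis=1)
        protos.append(X[np.argmin(sim)])
    protos = np.array(protos)

    labels = None
    for _ in range(n_iter):
        # Step (i): re-partition genes by squared similarity to each prototype.
        new_labels = np.argmax((X @ protos.T) ** 2, axis=1)
        if labels is not None and np.array_equal(new_labels, labels):
            break
        labels = new_labels
        # Step (ii): the prototype is the dominant singular vector of the cluster.
        for j in range(k):
            members = X[labels == j]
            if len(members):
                _, _, vt = np.linalg.svd(members, full_matrices=False)
                protos[j] = vt[0]
    return labels, protos

# Toy data: two expression patterns, each present in direct and inverted form.
rng = np.random.default_rng(0)
t = np.arange(8)
p1, p2 = np.sin(2 * np.pi * t / 8), np.cos(2 * np.pi * t / 8)
genes = np.vstack([p1] * 5 + [-p1] * 5 + [p2] * 5 + [-p2] * 5)
genes = genes + rng.normal(scale=0.05, size=genes.shape)

labels, protos = diametrical_clustering(genes, k=2)
print(labels)
```

In this toy setting the first ten genes (a pattern and its mirror image) should share one label and the last ten the other, which is the behavior that distinguishes this approach from clustering on positive correlation alone.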
CGMIM: Automated text-mining of Online Mendelian Inheritance in Man (OMIM) to identify genetically-associated cancers and candidate genes

Directory of Open Access Journals (Sweden)

Jones Steven

2005-03-01

Background: Online Mendelian Inheritance in Man (OMIM) is a computerized database of information about genes and heritable traits in human populations, based on information reported in the scientific literature. Our objective was to establish an automated text-mining system for OMIM that will identify genetically-related cancers and cancer-related genes. We developed the computer program CGMIM to search for entries in OMIM that are related to one or more cancer types. We performed manual searches of OMIM to verify the program results. Results: In the OMIM database on September 30, 2004, CGMIM identified 1943 genes related to cancer. BRCA2 (OMIM *164757), BRAF (OMIM *164757) and CDKN2A (OMIM *600160) were each related to 14 types of cancer. There were 45 genes related to cancer of the esophagus, 121 genes related to cancer of the stomach, and 21 genes related to both. Analysis of CGMIM results indicates that, if the two cancers were unrelated, fewer than three gene entries in OMIM would be expected to mention both, and the more than seven-fold discrepancy suggests cancers of the esophagus and stomach are more genetically related than current literature suggests. Conclusion: CGMIM identifies genetically-related cancers and cancer-related genes. In several ways, cancers with shared genetic etiology are anticipated to lead to further etiologic hypotheses and advances regarding environmental agents. CGMIM results are posted monthly and the source code can be obtained free of charge from the BC Cancer Research Centre website http://www.bccrc.ca/ccr/CGMIM.
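The "fewer than three" and "more than seven-fold" figures follow from the gene counts in the abstract if the two gene sets are assumed to be drawn independently from the 1943 cancer-related genes. The sketch below reproduces that arithmetic. The hypergeometric tail probability is an added illustration under the same independence assumption, not a statistic reported by the authors, and the helper name `hypergeom_tail` is hypothetical.

```python
from math import comb

# Counts reported in the abstract.
total_cancer_genes = 1943
esophagus_genes = 45
stomach_genes = 121
observed_both = 21

# Expected overlap if membership in the two sets were independent.
expected_both = esophagus_genes * stomach_genes / total_cancer_genes
fold_excess = observed_both / expected_both

def hypergeom_tail(N, K, n, x):
    """P(overlap >= x) when n genes are drawn at random from N, K of them marked."""
    favourable = sum(comb(K, i) * comb(N - K, n - i)
                     for i in range(x, min(K, n) + 1))
    return favourable / comb(N, n)

p_overlap = hypergeom_tail(total_cancer_genes, esophagus_genes,
                           stomach_genes, observed_both)

print(f"expected overlap: {expected_both:.2f} genes")
print(f"observed / expected: {fold_excess:.1f}-fold")
print(f"hypergeometric P(overlap >= {observed_both}): {p_overlap:.2e}")
```

The tail probability treats every cancer-related gene as equally likely to be annotated with either cancer, which is a simplification: heavily studied genes tend to be linked to many cancer types at once.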
A general method for identifying major hybrid male sterility genes in Drosophila.

Science.gov (United States)

Zeng, L W; Singh, R S

1995-10-01

The genes responsible for hybrid male sterility in species crosses are usually identified by introgressing chromosome segments, monitored by visible markers, between closely related species by continuous backcrosses. This commonly used method, however, suffers from two problems. First, it relies on the availability of markers to monitor the introgressed regions and so the portion of the genome examined is limited to the marked regions. Secondly, the introgressed regions are usually large and it is impossible to tell if the effects of the introgressed regions are the result of single (or few) major genes or many minor genes (polygenes). Here we introduce a simple and general method for identifying putative major hybrid male sterility genes which is free of these problems. In this method, the actual hybrid male sterility genes (rather than markers), or tightly linked gene complexes with large effects, are selectively introgressed from one species into the background of another species by repeated backcrosses. This is performed by selectively backcrossing heterozygous (for hybrid male sterility gene or genes) females producing fertile and sterile sons in roughly equal proportions to males of either parental species. As no marker gene is required for this procedure, this method can be used with any species pairs that produce unisexual sterility. With the application of this method, a small X chromosome region of Drosophila mauritiana which produces complete hybrid male sterility (aspermic testes) in the background of D. simulans was identified. Recombination analysis reveals that this region contains a second major hybrid male sterility gene linked to the forked locus located at either 62.7 +/- 0.66 map units or at the centromere region of the X chromosome of D. mauritiana.
ENU Mutagenesis in Mice Identifies Candidate Genes For Hypogonadism

Science.gov (United States)

Weiss, Jeffrey; Hurley, Lisa A.; Harris, Rebecca M.; Finlayson, Courtney; Tong, Minghan; Fisher, Lisa A.; Moran, Jennifer L.; Beier, David R.; Mason, Christopher; Jameson, J. Larry

2012-01-01

Genome-wide mutagenesis was performed in mice to identify candidate genes for male infertility, for which the predominant causes remain idiopathic. Mice were mutagenized using N-ethyl-N-nitrosourea (ENU), bred, and screened for phenotypes associated with the male urogenital system. Fifteen heritable lines were isolated and chromosomal loci were assigned using low density genome-wide SNP arrays. Ten of the fifteen lines were pursued further using higher resolution SNP analysis to narrow the candidate gene regions. Exon sequencing of candidate genes identified mutations in mice with cystic kidneys (Bicc1), cryptorchidism (Rxfp2), restricted germ cell deficiency (Plk4), and severe germ cell deficiency (Prdm9). In two other lines with severe hypogonadism candidate sequencing failed to identify mutations, suggesting defects in genes with previously undocumented roles in gonadal function. These genomic intervals were sequenced in their entirety and a candidate mutation was identified in SnrpE in one of the two lines. The line harboring the SnrpE variant retains substantial spermatogenesis despite small testis size, an unusual phenotype. In addition to the reproductive defects, heritable phenotypes were observed in mice with ataxia (Myo5a), tremors (Pmp22), growth retardation (unknown gene), and hydrocephalus (unknown gene). These results demonstrate that the ENU screen is an effective tool for identifying potential causes of male infertility. PMID:22258617
Gene expression meta-analysis identifies chromosomal regions involved in ovarian cancer survival

DEFF Research Database (Denmark)

Thomassen, Mads; Jochumsen, Kirsten M; Mogensen, Ole

2009-01-01

the relation of gene expression and chromosomal position to identify chromosomal regions of importance for early recurrence of ovarian cancer. By use of *Gene Set Enrichment Analysis*, we have ranked chromosomal regions according to their association to survival. Over-representation analysis including 1...... using death (P = 0.015) and recurrence (P = 0.002) as outcome. The combined mutation score is strongly associated to upregulation of several growth factor pathways....
Autophagy-related genes in Helicobacter pylori infection.

Science.gov (United States)

Tanaka, Shingo; Nagashima, Hiroyuki; Uotani, Takahiro; Graham, David Y; Yamaoka, Yoshio

2017-06-01

In vitro studies have shown that Helicobacter pylori (H. pylori) infection induces autophagy in gastric epithelial cells. However, prolonged exposure to H. pylori reduces autophagy by preventing maturation of the autolysosome. The alterations of the autophagy-related genes in H. pylori infection are not yet fully understood. We analyzed autophagy-related gene expression in H. pylori-infected gastric mucosa compared with uninfected gastric mucosa obtained from 136 Bhutanese volunteers with mild dyspeptic symptoms. We also studied single nucleotide polymorphisms (SNPs) of autophagy-related genes in 283 Bhutanese participants to identify the influence on susceptibility to H. pylori infection. Microarray analysis of 226 autophagy-related genes showed that 16 genes were upregulated (7%) and nine were downregulated (4%). We used quantitative reverse transcriptase polymerase chain reaction to measure mRNA levels of the downregulated genes (ATG16L1, ATG5, ATG4D, and ATG9A) that were core molecules of autophagy. ATG16L1 and ATG5 mRNA levels in H. pylori-positive specimens (n=86) were significantly less than those in H. pylori-negative specimens (n=50). ATG16L1 mRNA levels were inversely related to H. pylori density. We also compared SNPs of ATG16L1 (rs2241880) among 206 H. pylori-positive and 77 H. pylori-negative subjects. The odds ratio for the presence of H. pylori in the GG genotype was 0.40 (95% CI: 0.18-0.91) relative to the AA/AG genotypes. Autophagy-related gene expression profiling using high-throughput microarray analysis indicated that downregulation of core autophagy machinery genes may depress autophagy functions and possibly provide a better intracellular habitat for H. pylori in gastric epithelial cells. © 2017 John Wiley & Sons Ltd.
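The odds ratio and 95% confidence interval quoted above are the kind of statistic that can be computed from a 2x2 genotype-by-infection table. The abstract does not give the cell counts or say which estimator was used, so the following is only a minimal sketch of the standard unadjusted odds ratio with a Wald interval. The function name `odds_ratio_ci` and the toy counts are assumptions for illustration and are not data from the study.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Unadjusted odds ratio and Wald confidence interval for a 2x2 table.

    a: exposed and affected      b: exposed and unaffected
    c: unexposed and affected    d: unexposed and unaffected
    z: normal quantile (1.96 gives an approximate 95% interval)
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("the Wald interval needs all four cells to be positive")
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) is the root of the summed reciprocal counts.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical toy counts, chosen only to exercise the function.
or_value, ci_low, ci_high = odds_ratio_ci(a=10, b=20, c=30, d=40)
print(f"OR = {or_value:.2f}, 95% CI = {ci_low:.2f}-{ci_high:.2f}")
```

An interval that excludes 1, as the reported 0.18-0.91 does, is what marks an association as nominally significant at the 5% level.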
Epidermal growth factor gene is a newly identified candidate gene for gout

Science.gov (United States)

Han, Lin; Cao, Chunwei; Jia, Zhaotong; Liu, Shiguo; Liu, Zhen; Xin, Ruosai; Wang, Can; Li, Xinde; Ren, Wei; Wang, Xuefeng; Li, Changgui

2016-01-01

Chromosome 4q25 has been identified as a genomic region associated with gout. However, the associations of gout with the genes in this region have not yet been confirmed. Here, we performed a two-stage analysis to determine whether variations in candidate genes in the 4q25 region are associated with gout in a male Chinese Han population. We first evaluated 96 tag single nucleotide polymorphisms (SNPs) in eight inflammatory/immune pathway- or glucose/lipid metabolism-related genes in the 4q25 region in 480 male gout patients and 480 controls. The SNP rs12504538, located in the elongation of very-long-chain-fatty-acid-like family member 6 gene (Elovl6), was found to be associated with gout susceptibility (Padjusted = 0.00595). In the second stage of analysis, we performed fine mapping analysis of 93 tag SNPs in Elovl6 and in the epidermal growth factor gene (EGF) and its flanking regions in 1017 male gout patients and 1897 healthy male controls. We observed a significant association between the T allele of EGF rs2298999 and gout (odds ratio = 0.77, 95% confidence interval = 0.67–0.88, Padjusted = 6.42 × 10^-3). These results provide the first evidence for an association between the EGF rs2298999 C/T polymorphism and gout. Our findings should be validated in additional populations. PMID:27506295
Systematically characterizing and prioritizing chemosensitivity related gene based on Gene Ontology and protein interaction network

Directory of Open Access Journals (Sweden)

Chen Xin

2012-10-01

Full Text Available Abstract Background The identification of genes that predict in vitro cellular chemosensitivity of cancer cells is of great importance. Chemosensitivity related genes (CRGs) have been widely utilized to guide clinical and cancer chemotherapy decisions. In addition, CRGs potentially share functional characteristics and network features in protein interaction networks (PPIN). Methods In this study, we proposed a method to identify CRGs based on Gene Ontology (GO) and PPIN. Firstly, we documented 150 pairs of drug-CCRG (curated chemosensitivity related gene) from 492 published papers. Secondly, we characterized CCRGs from the perspective of GO and PPIN. Thirdly, we prioritized CRGs based on CCRGs’ GO and network characteristics. Lastly, we evaluated the performance of the proposed method. Results We found that CCRG enriched GO terms were most often related to chemosensitivity and exhibited higher similarity scores compared to randomly selected genes. Moreover, CCRGs played key roles in maintaining the connectivity and controlling the information flow of PPINs. We then prioritized CRGs using CCRG enriched GO terms and CCRG network characteristics in order to obtain a database of predicted drug-CRGs that included 53 CRGs, 32 of which have been reported to affect susceptibility to drugs. Our proposed method identifies a greater number of drug-CCRGs, and drug-CCRGs are much more significantly enriched in predicted drug-CRGs, compared to a method based on the correlation of gene expression and drug activity. The mean area under ROC curve (AUC) for our method is 65.2%, whereas that for the traditional method is 55.2%. Conclusions Our method not only identifies CRGs with expression patterns strongly correlated with drug activity, but also identifies CRGs in which expression is weakly correlated with drug activity. This study provides the framework for the identification of signatures that predict in vitro cellular chemosensitivity and offers a valuable
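The area under the ROC curve used to compare the two methods in the record above can be read as the probability that a randomly chosen true positive is scored higher than a randomly chosen negative. The sketch below computes it that way by pairwise comparison; the abstract does not describe how its AUC was computed, and the function name `roc_auc` and the toy scores are assumptions made purely for illustration.

```python
def roc_auc(positive_scores, negative_scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    A tied pair counts as half a correct ranking. This pairwise form is
    quadratic in the number of scores, which is fine for small examples.
    """
    if not positive_scores or not negative_scores:
        raise ValueError("both score lists must be non-empty")
    correct = 0.0
    for pos in positive_scores:
        for neg in negative_scores:
            if pos > neg:
                correct += 1.0
            elif pos == neg:
                correct += 0.5
    return correct / (len(positive_scores) * len(negative_scores))


# Hypothetical prioritization scores, not values from the study.
known_gene_scores = [0.9, 0.8, 0.4]
other_gene_scores = [0.7, 0.3, 0.2]
toy_auc = roc_auc(known_gene_scores, other_gene_scores)
print(f"AUC = {toy_auc:.3f}")
```

On this scale 0.5 corresponds to random ranking, which is why a value such as 55.2% indicates only a weak improvement over chance.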
Transcriptomic Analysis Using Olive Varieties and Breeding Progenies Identifies Candidate Genes Involved in Plant Architecture.

Science.gov (United States)

González-Plaza, Juan J; Ortiz-Martín, Inmaculada; Muñoz-Mérida, Antonio; García-López, Carmen; Sánchez-Sevilla, José F; Luque, Francisco; Trelles, Oswaldo; Bejarano, Eduardo R; De La Rosa, Raúl; Valpuesta, Victoriano; Beuzón, Carmen R

2016-01-01

Plant architecture is a critical trait in fruit crops that can significantly influence yield, pruning, planting density and harvesting. Little is known about how plant architecture is genetically determined in olive, where most of the existing varieties are traditional with an architecture poorly suited for modern growing and harvesting systems. In the present study, we have carried out microarray analysis of meristematic tissue to compare expression profiles of olive varieties displaying differences in architecture, as well as seedlings from their cross pooled on the basis of their sharing architecture-related phenotypes. The microarray used, previously developed by our group, has already been applied to identify candidate genes involved in regulating juvenile to adult transition in the shoot apex of seedlings. Varieties with distinct architecture phenotypes and individuals from segregating progenies displaying opposite architecture features were used to link phenotype to expression. Here, we identify 2252 differentially expressed genes (DEGs) associated with differences in plant architecture. Microarray results were validated by quantitative RT-PCR carried out on genes with functional annotation likely related to plant architecture. Twelve of these genes were further analyzed in individual seedlings of the corresponding pool. We also examined Arabidopsis mutants in putative orthologs of these targeted candidate genes, finding altered architecture for most of them. This supports a functional conservation between species and potential biological relevance of the candidate genes identified. This study is the first to identify genes associated with plant architecture in olive, and the results obtained could be of great help in future programs aimed at selecting phenotypes adapted to modern cultivation practices in this species.

EST sequencing and gene expression profiling of defence-related genes from Persea americana infected with Phytophthora cinnamomi

Directory of Open Access Journals (Sweden)

Mahomed Waheed

2011-11-01

Full Text Available Abstract Background Avocado (Persea americana) belongs to the Lauraceae family and is an important commercial fruit crop in over 50 countries. The most serious pathogen affecting avocado production is Phytophthora cinnamomi which causes Phytophthora root rot (PRR). Root pathogens such as P. cinnamomi and their interactions with hosts are poorly understood and despite the importance of both the avocado crop and the effect Phytophthora has on its cultivation, there is a lack of molecular knowledge underpinning our understanding of defence strategies against the pathogen. In order to initiate a better understanding of host-specific defence we have generated EST data using 454 pyrosequencing and profiled nine defence-related genes from Pc-infected avocado roots. Results 2.0 Mb of data was generated consisting of ~10,000 reads on a single lane of the GS FLX platform. Using the Newbler assembler 371 contigs were assembled, of which 367 are novel for Persea americana. Genes were classified according to Gene Ontology terms. In addition to identifying root-specific ESTs we were also able to identify and quantify the expression of nine defence-related genes that were differentially regulated in response to P. cinnamomi. Genes such as metallothionein, thaumatin and the pathogenesis related PsemI, mlo and profilin were found to be differentially regulated. Conclusions This is the first study in elucidating the avocado root transcriptome as well as identifying defence responses of avocado roots to the root pathogen P. cinnamomi. Our data is currently the only EST data that has been generated for avocado rootstocks, and the ESTs identified in this study have already been useful in identifying defence-related genes as well as providing gene information for other studies looking at processes such as ROS regulation as well as hypoxia in avocado roots. Our EST data will aid in the elucidation of the avocado transcriptome and identification of markers for improved
Gastric Cancer Associated Genes Identified by an Integrative Analysis of Gene Expression Data

Directory of Open Access Journals (Sweden)

Bing Jiang

2017-01-01

Full Text Available Gastric cancer is one of the most severe complex diseases with high morbidity and mortality in the world. The molecular mechanisms and risk factors for this disease are still not clear because of the heterogeneity caused by different genetic and environmental factors. With more and more expression data accumulated nowadays, we can perform integrative analysis of these data to understand the complexity of gastric cancer and to identify consensus players for the heterogeneous cancer. In the present work, we screened the published gene expression data and analyzed them with an integrative tool, combined with pathway and gene ontology enrichment investigation. We identified several consensus differentially expressed genes and these genes were further confirmed with literature mining; finally, two genes, that is, immunoglobulin J chain and C-X-C motif chemokine ligand 17, were screened as novel gastric cancer associated genes. Experimental validation is proposed to further confirm this finding.
A large-scale RNA interference screen identifies genes that regulate autophagy at different stages.

Science.gov (United States)

Guo, Sujuan; Pridham, Kevin J; Virbasius, Ching-Man; He, Bin; Zhang, Liqing; Varmark, Hanne; Green, Michael R; Sheng, Zhi

2018-02-12

Dysregulated autophagy is central to the pathogenesis and therapeutic development of cancer. However, how autophagy is regulated in cancer is not well understood and genes that modulate cancer autophagy are not fully defined. To gain more insights into autophagy regulation in cancer, we performed a large-scale RNA interference screen in K562 human chronic myeloid leukemia cells using monodansylcadaverine staining, an autophagy-detecting approach equivalent to immunoblotting of the autophagy marker LC3B or fluorescence microscopy of GFP-LC3B. By coupling monodansylcadaverine staining with fluorescence-activated cell sorting, we successfully isolated autophagic K562 cells where we identified 336 short hairpin RNAs. After candidate validation using Cyto-ID fluorescence spectrophotometry, LC3B immunoblotting, and quantitative RT-PCR, 82 genes were identified as autophagy-regulating genes. 20 genes have been reported previously and the remaining 62 candidates are novel autophagy mediators. Bioinformatic analyses revealed that most candidate genes were involved in molecular pathways regulating autophagy, rather than directly participating in the autophagy process. Further autophagy flux assays revealed that 57 autophagy-regulating genes suppressed autophagy initiation, whereas 21 candidates promoted autophagy maturation. Our RNA interference screen identified genes that regulate autophagy at different stages, which helps decode autophagy regulation in cancer and offers novel avenues to develop autophagy-related therapies for cancer.
Characterization of transformation related genes in oral cancer cells.

Science.gov (United States)

Chang, D D; Park, N H; Denny, C T; Nelson, S F; Pe, M

1998-04-16

A cDNA representational difference analysis (cDNA-RDA) and an arrayed filter technique were used to characterize transformation-related genes in oral cancer. From an initial comparison of normal oral epithelial cells and a human papilloma virus (HPV)-immortalized oral epithelial cell line, we obtained 384 differentially expressed gene fragments and arrayed them on a filter. Two hundred and twelve redundant clones were identified by three rounds of back hybridization. Sequence analysis of the remaining clones revealed 99 unique clones corresponding to 69 genes. The expression of these transformation related gene fragments in three nontumorigenic HPV-immortalized oral epithelial cell lines and three oral cancer cell lines were simultaneously monitored using a cDNA array hybridization. Although there was a considerable cell line-to-cell line variability in the expression of these clones, a reliable prediction of their expression could be made from the cDNA array hybridization. Our study demonstrates the utility of combining cDNA-RDA and arrayed filters in high-throughput gene expression difference analysis. The differentially expressed genes identified in this study should be informative in studying oral epithelial cell carcinogenesis.
Genome-wide methylation analysis identifies genes silenced in non-seminoma cell lines.

Science.gov (United States)

Noor, Dzul Azri Mohamed; Jeyapalan, Jennie N; Alhazmi, Safiah; Carr, Matthew; Squibb, Benjamin; Wallace, Claire; Tan, Christopher; Cusack, Martin; Hughes, Jaime; Reader, Tom; Shipley, Janet; Sheer, Denise; Scotting, Paul J

2016-01-01

Silencing of genes by DNA methylation is a common phenomenon in many types of cancer. However, the genome-wide effect of DNA methylation on gene expression has been analysed in relatively few cancers. Germ cell tumours (GCTs) are a complex group of malignancies. They are unique in developing from a pluripotent progenitor cell. Previous analyses have suggested that non-seminomas exhibit much higher levels of DNA methylation than seminomas. The genomic targets that are methylated, the extent to which this results in gene silencing and the identity of the silenced genes most likely to play a role in the tumours' biology have not yet been established. In this study, genome-wide methylation and expression analysis of GCT cell lines was combined with gene expression data from primary tumours to address this question. Genome methylation was analysed using the Illumina infinium HumanMethylome450 bead chip system and gene expression was analysed using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Regulation by methylation was confirmed by demethylation using 5-aza-2-deoxycytidine and reverse transcription-quantitative PCR. Large differences in the level of methylation of the CpG islands of individual genes between tumour cell lines correlated well with differential gene expression. Treatment of non-seminoma cells with 5-aza-2-deoxycytidine verified that methylation of all genes tested played a role in their silencing in yolk sac tumour cells and many of these genes were also differentially expressed in primary tumours. Genes silenced by methylation in the various GCT cell lines were identified. Several pluripotency-associated genes were identified as a major functional group of silenced genes.
Transcriptomic analysis of a tertiary relict plant, extreme xerophyte Reaumuria soongorica to identify genes related to drought adaptation.

Directory of Open Access Journals (Sweden)

Yong Shi

Full Text Available BACKGROUND: Reaumuria soongorica is an extreme xerophyte shrub widely distributed in the desert regions of central Asia, including sand dune, Gobi and marginal loess, where it plays a crucial role in sustaining and restoring fragile desert ecosystems. However, due to the lack of genomic sequences, studies on R. soongorica have mainly been limited to physiological responses to drought stress. Here, deep transcriptomic sequencing of R. soongorica will facilitate molecular functional studies and pave the way to understanding drought adaptation in a desert plant. METHODOLOGY/PRINCIPAL FINDINGS: A total of 53,193,660 clean paired-end reads were generated on the Illumina HiSeq™ 2000 platform. By assembly with Trinity, we obtained 173,700 contigs and 77,647 unigenes with a mean length of 677 bp and an N50 of 1109 bp. Over 55% (43,054) of the unigenes were successfully annotated based on sequence similarity against public databases as well as the Rfam and Pfam databases. Local BLAST and Kyoto Encyclopedia of Genes and Genomes (KEGG) maps were used to search exhaustively for candidate genes related to drought adaptation, and a set of 123 putative candidate genes was identified. Moreover, all the C4 photosynthesis genes existed and were active in R. soongorica, which has been regarded as a typical C3 plant. CONCLUSION/SIGNIFICANCE: The unigenes assembled in the present work provide abundant genomic information for functional assignments in the extreme xerophyte R. soongorica, and will help us explore the genetic basis of how desert plants adapt to drought environments in the near future.
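The record above summarizes its assembly by mean length and N50. N50 is the sequence length at which, going from the longest sequence downwards, the cumulative length first reaches half of the total assembled bases, so it weights long sequences more heavily than the mean does. The sketch below is a minimal illustration of that definition; the function name `n50` and the toy lengths are assumptions and are not the study's unigene lengths.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths (in bp)."""
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running_total = 0
    for length in ordered:
        running_total += length
        # The first length that pushes the running sum to half is the N50.
        if running_total >= half_total:
            return length


# Hypothetical toy assembly, in base pairs.
toy_lengths = [100, 200, 300, 400, 500]
toy_mean = sum(toy_lengths) / len(toy_lengths)
toy_n50 = n50(toy_lengths)
print(f"mean = {toy_mean:.0f} bp, N50 = {toy_n50} bp")
```

As in the reported figures (mean 677 bp, N50 1109 bp), the N50 of a skewed length distribution typically exceeds the mean.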
Identifying genes and gene networks involved in chromium metabolism and detoxification in Crambe abyssinica

International Nuclear Information System (INIS)

Zulfiqar, Asma; Paulose, Bibin; Chhikara, Sudesh; Dhankher, Om Parkash

2011-01-01

Chromium pollution is a serious environmental problem with few cost-effective remediation strategies available. Crambe abyssinica (a member of Brassicaceae), a non-food, fast growing high biomass crop, is an ideal candidate for phytoremediation of heavy metal-contaminated soils. The present study used a PCR-Select Suppression Subtraction Hybridization approach in C. abyssinica to isolate differentially expressed genes in response to Cr exposure. A total of 72 differentially expressed subtracted cDNAs were sequenced and found to represent 43 genes. The subtracted cDNAs suggest that Cr stress significantly affects pathways related to stress/defense, ion transporters, sulfur assimilation, cell signaling, protein degradation, photosynthesis and cell metabolism. The regulation of these genes in response to Cr exposure was further confirmed by semi-quantitative RT-PCR. Characterization of these differentially expressed genes may enable the engineering of non-food, high-biomass plants, including C. abyssinica, for phytoremediation of Cr-contaminated soils and sediments. - Highlights: → Molecular mechanism of Cr uptake and detoxification in plants is not well known. → We identified differentially regulated genes upon Cr exposure in Crambe abyssinica. → 72 Cr-induced subtracted cDNAs were sequenced and found to represent 43 genes. → Pathways linked to stress, ion transport, and sulfur assimilation were affected. → This is the first Cr transcriptome study in a crop with phytoremediation potential. - This study describes the identification and isolation of differentially expressed genes involved in chromium metabolism and detoxification in a non-food industrial oil crop Crambe abyssinica.
Identifying genes and gene networks involved in chromium metabolism and detoxification in Crambe abyssinica

Energy Technology Data Exchange (ETDEWEB)

Zulfiqar, Asma, E-mail: asmazulfiqar08@yahoo.com [Department of Plant, Soil, and Insect Sciences, 270 Stockbridge Road, University of Massachusetts Amherst, MA 01003 (United States); Paulose, Bibin, E-mail: bpaulose@psis.umass.edu [Department of Plant, Soil, and Insect Sciences, 270 Stockbridge Road, University of Massachusetts Amherst, MA 01003 (United States); Chhikara, Sudesh, E-mail: sudesh@psis.umass.edu [Department of Plant, Soil, and Insect Sciences, 270 Stockbridge Road, University of Massachusetts Amherst, MA 01003 (United States); Dhankher, Om Parkash, E-mail: parkash@psis.umass.edu [Department of Plant, Soil, and Insect Sciences, 270 Stockbridge Road, University of Massachusetts Amherst, MA 01003 (United States)

2011-10-15

Chromium pollution is a serious environmental problem with few cost-effective remediation strategies available. Crambe abyssinica (a member of Brassicaceae), a non-food, fast growing high biomass crop, is an ideal candidate for phytoremediation of heavy metal-contaminated soils. The present study used a PCR-Select Suppression Subtraction Hybridization approach in C. abyssinica to isolate differentially expressed genes in response to Cr exposure. A total of 72 differentially expressed subtracted cDNAs were sequenced and found to represent 43 genes. The subtracted cDNAs suggest that Cr stress significantly affects pathways related to stress/defense, ion transporters, sulfur assimilation, cell signaling, protein degradation, photosynthesis and cell metabolism. The regulation of these genes in response to Cr exposure was further confirmed by semi-quantitative RT-PCR. Characterization of these differentially expressed genes may enable the engineering of non-food, high-biomass plants, including C. abyssinica, for phytoremediation of Cr-contaminated soils and sediments. - Highlights: > Molecular mechanism of Cr uptake and detoxification in plants is not well known. > We identified differentially regulated genes upon Cr exposure in Crambe abyssinica. > 72 Cr-induced subtracted cDNAs were sequenced and found to represent 43 genes. > Pathways linked to stress, ion transport, and sulfur assimilation were affected. > This is the first Cr transcriptome study in a crop with phytoremediation potential. - This study describes the identification and isolation of differentially expressed genes involved in chromium metabolism and detoxification in a non-food industrial oil crop Crambe abyssinica.
A cross-study gene set enrichment analysis identifies critical pathways in endometriosis

Directory of Open Access Journals (Sweden)

Bai Chunyan

2009-09-01

Full Text Available Abstract Background Endometriosis is an enigmatic disease. Gene expression profiling of endometriosis has been used in several studies, but few studies went further to classify subtypes of endometriosis based on expression patterns and to identify possible pathways involved in endometriosis. Some of the observed pathways are more inconsistent between the studies, and these candidate pathways presumably only represent a fraction of the pathways involved in endometriosis. Methods We applied a standardised microarray preprocessing and gene set enrichment analysis to six independent studies, and demonstrated increased concordance between these gene datasets. Results We find 16 up-regulated and 19 down-regulated pathways common in ovarian endometriosis data sets, 22 up-regulated and one down-regulated pathway common in peritoneal endometriosis data sets. Among them, 12 up-regulated and 1 down-regulated were found consistent between ovarian and peritoneal endometriosis. The main canonical pathways identified are related to immunological and inflammatory disease. Early secretory phase has the most over-represented pathways in the three uterine cycle phases. There are no overlapping significant pathways between the dataset from human endometrial endothelial cells and the datasets from ovarian endometriosis which used whole tissues. Conclusion The study of complex diseases through pathway analysis is able to highlight genes weakly connected to the phenotype which may be difficult to detect by using classical univariate statistics. By standardised microarray preprocessing and GSEA, we have increased the concordance in identifying many biological mechanisms involved in endometriosis. The identified gene pathways will shed light on the understanding of endometriosis and promote the development of novel therapies.
Automatically identifying gene/protein terms in MEDLINE abstracts.

Science.gov (United States)

Yu, Hong; Hatzivassiloglou, Vasileios; Rzhetsky, Andrey; Wilbur, W John

2002-01-01

Natural language processing (NLP) techniques are used to extract information automatically from computer-readable literature. In biology, the identification of terms corresponding to biological substances (e.g., genes and proteins) is a necessary step that precedes the application of other NLP systems that extract biological information (e.g., protein-protein interactions, gene regulation events, and biochemical pathways). We have developed GPmarkup (for "gene/protein-full name mark up"), a software system that automatically identifies gene/protein terms (i.e., symbols or full names) in MEDLINE abstracts. As a part of marking up process, we also generated automatically a knowledge source of paired gene/protein symbols and full names (e.g., LARD for lymphocyte associated receptor of death) from MEDLINE. We found that many of the pairs in our knowledge source do not appear in the current GenBank database. Therefore our methods may also be used for automatic lexicon generation. GPmarkup has 73% recall and 93% precision in identifying and marking up gene/protein terms in MEDLINE abstracts. A random sample of gene/protein symbols and full names and a sample set of marked up abstracts can be viewed at http://www.cpmc.columbia.edu/homepages/yuh9001/GPmarkup/. Contact. hy52@columbia.edu. Voice: 212-939-7028; fax: 212-666-0140.
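The recall and precision reported for GPmarkup can be combined into a single F1 score, the harmonic mean of the two. The abstract itself does not report an F1 value, so the figure computed here is derived only from the two published numbers. A minimal sketch follows; the function name `f1_score` is illustrative.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (both given as fractions)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall as reported in the abstract above.
gpmarkup_f1 = f1_score(precision=0.93, recall=0.73)
print(f"F1 = {gpmarkup_f1:.3f}")
```

Because the harmonic mean is pulled towards the smaller value, the result sits closer to the 73% recall than a simple average of the two numbers would.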
Identifying key genes associated with acute myocardial infarction.

Science.gov (United States)

Cheng, Ming; An, Shoukuan; Li, Junquan

2017-10-01

This study aimed to identify key genes associated with acute myocardial infarction (AMI) by reanalyzing microarray data. Three gene expression profile datasets GSE66360, GSE34198, and GSE48060 were downloaded from GEO database. After data preprocessing, genes without heterogeneity across different platforms were subjected to differential expression analysis between the AMI group and the control group using metaDE package. P FI) network. Then, DEGs in each module were subjected to pathway enrichment analysis using DAVID. MiRNAs and transcription factors predicted to regulate target DEGs were identified. Quantitative real-time polymerase chain reaction (RT-PCR) was applied to verify the expression of genes. A total of 913 upregulated genes and 1060 downregulated genes were identified in the AMI group. A FI network consists of 21 modules and DEGs in 12 modules were significantly enriched in pathways. The transcription factor-miRNA-gene network contains 2 transcription factors FOXO3 and MYBL2, and 2 miRNAs hsa-miR-21-5p and hsa-miR-30c-5p. RT-PCR validations showed that expression levels of FOXO3 and MYBL2 were significantly increased in AMI, and expression levels of hsa-miR-21-5p and hsa-miR-30c-5p were obviously decreased in AMI. A total of 41 DEGs, such as SOCS3, VAPA, and COL5A2, are speculated to have roles in the pathogenesis of AMI; 2 transcription factors FOXO3 and MYBL2, and 2 miRNAs hsa-miR-21-5p and hsa-miR-30c-5p may be involved in the regulation of the expression of these DEGs.
Integration of genome-wide association studies with biological knowledge identifies six novel genes related to kidney function.

Science.gov (United States)

Chasman, Daniel I; Fuchsberger, Christian; Pattaro, Cristian; Teumer, Alexander; Böger, Carsten A; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Taliun, Daniel; Li, Man; Gao, Xiaoyi; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C; O'Seaghdha, Conall M; Glazer, Nicole; Isaacs, Aaron; Liu, Ching-Ti; Smith, Albert V; O'Connell, Jeffrey R; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Johnson, Andrew D; Gierman, Hinco J; Feitosa, Mary F; Hwang, Shih-Jen; Atkinson, Elizabeth J; Lohman, Kurt; Cornelis, Marilyn C; Johansson, Asa; Tönjes, Anke; Dehghan, Abbas; Lambert, Jean-Charles; Holliday, Elizabeth G; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y; Murgia, Federico; Trompet, Stella; Imboden, Medea; Coassin, Stefan; Pistis, Giorgio; Harris, Tamara B; Launer, Lenore J; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D; Boerwinkle, Eric; Schmidt, Helena; Cavalieri, Margherita; Rao, Madhumathi; Hu, Frank; Demirkan, Ayse; Oostra, Ben A; de Andrade, Mariza; Turner, Stephen T; Ding, Jingzhong; Andrews, Jeanette S; Freedman, Barry I; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Meisinger, Christa; Gieger, Christian; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H; Wright, Alan F; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G; Rivadeneira, Fernando; Aulchenko, Yurii S; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Ketkar, Shamika; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, 
Bernhard K; Portas, Laura; Ford, Ian; Buckley, Brendan M; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Kim, Stuart K; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J Wouter; Probst-Hensch, Nicole M; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; Siscovick, David S; van Duijn, Cornelia M; Borecki, Ingrid B; Kardia, Sharon L R; Liu, Yongmei; Curhan, Gary C; Rudan, Igor; Gyllensten, Ulf; Wilson, James F; Franke, Andre; Pramstaller, Peter P; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M; Parsa, Afshin; Bochud, Murielle; Heid, Iris M; Kao, W H Linda; Fox, Caroline S; Köttgen, Anna

2012-12-15

In conducting genome-wide association studies (GWAS), analytical approaches leveraging biological information may further understanding of the pathophysiology of clinical traits. To discover novel associations with estimated glomerular filtration rate (eGFR), a measure of kidney function, we developed a strategy for integrating prior biological knowledge into the existing GWAS data for eGFR from the CKDGen Consortium. Our strategy focuses on single nucleotide polymorphisms (SNPs) in genes that are connected by functional evidence, determined by literature mining and gene ontology (GO) hierarchies, to genes near previously validated eGFR associations. It then requires association thresholds consistent with multiple testing, and finally evaluates novel candidates by independent replication. Among the samples of European ancestry, we identified a genome-wide significant SNP in FBXL20 (P = 5.6 × 10^-9) in meta-analysis of all available data, and additional SNPs at the INHBC, LRP2, PLEKHA1, SLC3A2 and SLC7A6 genes meeting multiple-testing corrected significance for replication and overall P-values of 4.5 × 10^-4 to 2.2 × 10^-7. Neither the novel PLEKHA1 nor FBXL20 associations, both further supported by association with eGFR among African Americans and with transcript abundance, would have been implicated by eGFR candidate gene approaches. LRP2, encoding the megalin receptor, was identified through connection with the previously known eGFR gene DAB2 and extends understanding of the megalin system in kidney function. These findings highlight integration of existing genome-wide association data with independent biological knowledge to uncover novel candidate eGFR associations, including candidates lacking known connections to kidney-specific pathways. The strategy may also be applicable to other clinical phenotypes, although more testing will be needed to assess its potential for discovery in general.
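The phrase "association thresholds consistent with multiple testing" can be illustrated with a Bonferroni correction, in which the family-wise error rate is divided by the number of tests. The abstract does not state which correction or how many tests the study used, so the sketch below is only an assumption-laden example: it uses the conventional genome-wide figure of roughly one million independent tests, which yields the familiar 5 × 10^-8 threshold. The function names and the test count are illustrative.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under a Bonferroni correction."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


def is_significant(p_value, alpha, n_tests):
    """True if p_value survives Bonferroni correction for n_tests tests."""
    return p_value < bonferroni_threshold(alpha, n_tests)


# Assumption: about one million independent common-variant tests,
# the convention behind the usual genome-wide threshold.
GENOME_WIDE_TESTS = 1_000_000
threshold = bonferroni_threshold(0.05, GENOME_WIDE_TESTS)

# P-value reported for the FBXL20 SNP in the abstract above.
fbxl20_p = 5.6e-9
print(f"threshold = {threshold:.1e}, FBXL20 passes: "
      f"{is_significant(fbxl20_p, 0.05, GENOME_WIDE_TESTS)}")
```

Restricting the analysis to SNPs in a smaller, biologically motivated gene set reduces the number of tests, which is how P-values in the 10^-4 to 10^-7 range can still meet a corrected threshold.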
Calcitonin gene-related peptide antagonism and cluster headache

DEFF Research Database (Denmark)

Ashina, Håkan; Newman, Lawrence; Ashina, Sait

2017-01-01

Calcitonin gene-related peptide (CGRP) is a key signaling molecule involved in migraine pathophysiology. Efficacy of CGRP monoclonal antibodies and antagonists in migraine treatment has fueled an increasing interest in the prospect of treating cluster headache (CH) with CGRP antagonism. The exact...... role of CGRP and its mechanism of action in CH have not been fully clarified. A search for original studies and randomized controlled trials (RCTs) published in English was performed in PubMed and in ClinicalTrials.gov . The search term used was "cluster headache and calcitonin gene related peptide......" and "primary headaches and calcitonin gene related peptide." Reference lists of identified articles were also searched for additional relevant papers. Human experimental studies have reported elevated plasma CGRP levels during both spontaneous and glyceryl trinitrate-induced cluster attacks. CGRP may play...
Conidiogenesis-related DNA photolyase gene in Beauveria bassiana.

Science.gov (United States)

Lee, Se Jin; Lee, Mi Rong; Kim, Sihyeon; Kim, Jong Cheol; Park, So Eun; Shin, Tae Young; Kim, Jae Su

2018-03-01

Beauveria bassiana is an entomopathogenic fungus used in environmentally mindful pest management. Its main active ingredient, conidia, is commercially available as a fungal biopesticide. Many studies of conidia production have focused on how to optimize culture conditions for maximum productivity and stability against unfavorable abiotic factors. However, how conidiogenesis-related genes improve conidial production remains unclear. In this study, we focus on identifying conidiogenesis-related genes in B. bassiana ERL1170 using a random mutagenesis technique. Transformation of ERL1170 using restriction enzyme-mediated integration generated one morphologically different transformant, ERL1170-pABeG #163. The transformant was confirmed to represent B. bassiana, and the binary vector was successfully integrated into the genome of ERL1170. Compared to the wild type, transformant #163 showed very slow hyphal growth and within 6 days only produced bassiana exhibits thread-like hyphae and conidiophore structures and circular conidia. To determine the location of the randomly inserted DNA, we conducted thermal asymmetric interlaced (TAIL) PCR and Escherichia coli cloning to clearly sequence the disrupted region. We identified one colony (colony No. 7) with an insertion site identified as DNA photolyase. This was confirmed through a gene knock-out study. It is possible the gene that encodes for DNA photolyase was disrupted during the insertion process and might be involved in fungal conidiogenesis. This work serves as a platform for exploring the function of a variety of B. bassiana genes involved in pest management and their downstream processing. Copyright © 2018 Elsevier Inc. All rights reserved.
Using reporter gene assays to identify cis regulatory differences between humans and chimpanzees.

Science.gov (United States)

Chabot, Adrien; Shrit, Ralla A; Blekhman, Ran; Gilad, Yoav

2007-08-01

Most phenotypic differences between human and chimpanzee are likely to result from differences in gene regulation, rather than changes to protein-coding regions. To date, however, only a handful of human-chimpanzee nucleotide differences leading to changes in gene regulation have been identified. To hone in on differences in regulatory elements between human and chimpanzee, we focused on 10 genes that were previously found to be differentially expressed between the two species. We then designed reporter gene assays for the putative human and chimpanzee promoters of the 10 genes. Of seven promoters that we found to be active in human liver cell lines, human and chimpanzee promoters had significantly different activity in four cases, three of which recapitulated the gene expression difference seen in the microarray experiment. For these three genes, we were therefore able to demonstrate that a change in cis influences expression differences between humans and chimpanzees. Moreover, using site-directed mutagenesis on one construct, the promoter for the DDA3 gene, we were able to identify three nucleotides that together lead to a cis regulatory difference between the species. High-throughput application of this approach can provide a map of regulatory element differences between humans and our close evolutionary relatives.
Network Diffusion-Based Prioritization of Autism Risk Genes Identifies Significantly Connected Gene Modules

Directory of Open Access Journals (Sweden)

Ettore Mosca

2017-09-01

Autism spectrum disorder (ASD) is marked by a strong genetic heterogeneity, which is underlined by the low overlap between ASD risk gene lists proposed in different studies. In this context, molecular networks can be used to analyze the results of several genome-wide studies in order to underline those network regions harboring genetic variations associated with ASD, the so-called “disease modules.” In this work, we used a recent network diffusion-based approach to jointly analyze multiple ASD risk gene lists. We defined genome-scale prioritizations of human genes in relation to ASD genes from multiple studies, found significantly connected gene modules associated with ASD and predicted genes functionally related to ASD risk genes. Most of them play a role in synapsis and neuronal development and function; many are related to syndromes that can be in comorbidity with ASD and the remaining are involved in epigenetics, cell cycle, cell adhesion and cancer.
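To illustrate what network diffusion-based prioritization means in general, here is a minimal sketch of random walk with restart on a toy network. This is a generic diffusion scheme and not necessarily the specific method used in the study above. The node names, the toy edges and the restart probability are all hypothetical.

```python
def random_walk_with_restart(adj, seeds, restart=0.3, tol=1e-10, max_iter=1000):
    """Diffuse probability mass from seed nodes over an undirected network.

    adj   : dict mapping each node to a list of its neighbours
    seeds : set of nodes that receive the restart mass (e.g. known risk genes)
    """
    nodes = sorted(adj)
    p0 = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    p = dict(p0)
    for _ in range(max_iter):
        # Each step: return to the seeds with probability `restart` ...
        nxt = {n: restart * p0[n] for n in nodes}
        for src in nodes:
            neighbours = adj[src]
            if not neighbours:
                # Isolated node: its remaining mass stays in place.
                nxt[src] += (1.0 - restart) * p[src]
                continue
            # ... otherwise move to a uniformly chosen neighbour.
            share = (1.0 - restart) * p[src] / len(neighbours)
            for dst in neighbours:
                nxt[dst] += share
        change = sum(abs(nxt[n] - p[n]) for n in nodes)
        p = nxt
        if change < tol:
            break
    return p


# Hypothetical four-gene chain A - B - C - D with A as the only seed gene.
toy_network = {"A": ["B"], "B": ["A", "C"], "C": ["B", "D"], "D": ["C"]}
scores = random_walk_with_restart(toy_network, seeds={"A"})
ranking = sorted(scores, key=scores.get, reverse=True)
print(ranking)
```

Genes that are close to the seeds in the network end up with higher scores, which is the property that prioritization schemes of this kind rely on.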
Comparative Transcriptome Analysis Identifies Candidate Genes Related to Skin Color Differentiation in Red Tilapia.

Science.gov (United States)

Zhu, Wenbin; Wang, Lanmei; Dong, Zaijie; Chen, Xingting; Song, Feibiao; Liu, Nian; Yang, Hui; Fu, Jianjun

2016-08-11

Red tilapia has become more popular for aquaculture production in China in recent years. However, pigmentation differentiation during genetic breeding is the main problem limiting the development of commercial red tilapia culture, and the genetic basis of skin color variation is still unknown. In this study, we conducted Illumina transcriptome sequencing of three color varieties of red tilapia. A total of 224,895,758 reads were generated, resulting in 160,762 assembled contigs that were used as reference contigs. The contigs of the red tilapia transcriptome had hits in the range of 53.4% to 86.7% of the unique proteins of zebrafish, fugu, medaka, three-spined stickleback and tilapia. In addition, 44,723 contigs containing 77,423 simple sequence repeats (SSRs) were identified, with 16,646 contigs containing more than one SSR. Three skin transcriptomes were compared pairwise and the results revealed that there were 148 common significantly differentially expressed unigenes and several key genes related to pigment synthesis, i.e. tyr, tyrp1, silv, sox10, slc24a5, cbs and slc7a11, were included. The results will facilitate understanding the molecular mechanisms of skin pigmentation differentiation in red tilapia and accelerate the molecular selection of the specific strain with consistent skin colors.
Next-generation sequencing identifies transportin 3 as the causative gene for LGMD1F.

Directory of Open Access Journals (Sweden)

Annalaura Torella

Limb-girdle muscular dystrophies (LGMD) are genetically and clinically heterogeneous conditions. We investigated a large family with autosomal dominant transmission pattern, previously classified as LGMD1F and mapped to chromosome 7q32. Affected members are characterized by muscle weakness affecting earlier the pelvic girdle and the iliopsoas muscles. We sequenced the whole exome of four family members and identified a shared heterozygous frame-shift variant in the Transportin 3 (TNPO3) gene, encoding a member of the importin-β super-family. The TNPO3 gene is mapped within the LGMD1F critical interval and its 923-amino acid human gene product is also expressed in skeletal muscle. In addition, we identified an isolated case of LGMD with a new missense mutation in the same gene. We localized the mutant TNPO3 around the nucleus, but not inside. The involvement of a gene related to the nuclear transport suggests a novel disease mechanism leading to muscular dystrophy.
Preferential Allele Expression Analysis Identifies Shared Germline and Somatic Driver Genes in Advanced Ovarian Cancer

Science.gov (United States)

Halabi, Najeeb M.; Martinez, Alejandra; Al-Farsi, Halema; Mery, Eliane; Puydenus, Laurence; Pujol, Pascal; Khalak, Hanif G.; McLurcan, Cameron; Ferron, Gwenael; Querleu, Denis; Al-Azwani, Iman; Al-Dous, Eman; Mohamoud, Yasmin A.; Malek, Joel A.; Rafii, Arash

2016-01-01

Identifying genes where a variant allele is preferentially expressed in tumors could lead to a better understanding of cancer biology and optimization of targeted therapy. However, tumor sample heterogeneity complicates standard approaches for detecting preferential allele expression. We therefore developed a novel approach combining genome and transcriptome sequencing data from the same sample that corrects for sample heterogeneity and identifies significant preferentially expressed alleles. We applied this analysis to epithelial ovarian cancer samples consisting of matched primary ovary and peritoneum and lymph node metastasis. We find that preferentially expressed variant alleles include germline and somatic variants, are shared at a relatively high frequency between patients, and are in gene networks known to be involved in cancer processes. Analysis at a patient level identifies patient-specific preferentially expressed alleles in genes that are targets for known drugs. Analysis at a site level identifies patterns of site specific preferential allele expression with similar pathways being impacted in the primary and metastasis sites. We conclude that genes with preferentially expressed variant alleles can act as cancer drivers and that targeting those genes could lead to new therapeutic strategies. PMID:26735499
Loss-of-function of neuroplasticity-related genes confers risk for human neurodevelopmental disorders.

Science.gov (United States)

Smith, Milo R; Glicksberg, Benjamin S; Li, Li; Chen, Rong; Morishita, Hirofumi; Dudley, Joel T

2018-01-01

High and increasing prevalence of neurodevelopmental disorders place enormous personal and economic burdens on society. Given the growing realization that the roots of neurodevelopmental disorders often lie in early childhood, there is an urgent need to identify childhood risk factors. Neurodevelopment is marked by periods of heightened experience-dependent neuroplasticity wherein neural circuitry is optimized by the environment. If these critical periods are disrupted, development of normal brain function can be permanently altered, leading to neurodevelopmental disorders. Here, we aim to systematically identify human variants in neuroplasticity-related genes that confer risk for neurodevelopmental disorders. Historically, this knowledge has been limited by a lack of techniques to identify genes related to neurodevelopmental plasticity in a high-throughput manner and a lack of methods to systematically identify mutations in these genes that confer risk for neurodevelopmental disorders. Using an integrative genomics approach, we determined loss-of-function (LOF) variants in putative plasticity genes, identified from transcriptional profiles of brain from mice with elevated plasticity, that were associated with neurodevelopmental disorders. From five shared differentially expressed genes found in two mouse models of juvenile-like elevated plasticity (juvenile wild-type or adult Lynx1-/- relative to adult wild-type) that were also genotyped in the Mount Sinai BioMe Biobank we identified multiple associations between LOF genes and increased risk for neurodevelopmental disorders across 10,510 patients linked to the Mount Sinai Electronic Medical Records (EMR), including epilepsy and schizophrenia. This work demonstrates a novel approach to identify neurodevelopmental risk genes and points toward a promising avenue to discover new drug targets to address the unmet therapeutic needs of neurodevelopmental disease.

Cry-Bt identifier: a biological database for PCR detection of Cry genes present in transgenic plants.

Science.gov (United States)

Singh, Vinay Kumar; Ambwani, Sonu; Marla, Soma; Kumar, Anil

2009-10-23

We describe the development of a user friendly tool that would assist in the retrieval of information relating to Cry genes in transgenic crops. The tool also helps in detection of transformed Cry genes from Bacillus thuringiensis present in transgenic plants by providing suitable designed primers for PCR identification of these genes. The tool designed based on relational database model enables easy retrieval of information from the database with simple user queries. The tool also enables users to access related information about Cry genes present in various databases by interacting with different sources (nucleotide sequences, protein sequence, sequence comparison tools, published literature, conserved domains, evolutionary and structural data). http://insilicogenomics.in/Cry-btIdentifier/welcome.html.
Gene expression analysis identifies global gene dosage sensitivity in cancer

DEFF Research Database (Denmark)

Fehrmann, Rudolf S. N.; Karjalainen, Juha M.; Krajewska, Malgorzata

2015-01-01

Many cancer-associated somatic copy number alterations (SCNAs) are known. Currently, one of the challenges is to identify the molecular downstream effects of these variants. Although several SCNAs are known to change gene expression levels, it is not clear whether each individual SCNA affects gen...
Molecular analysis of expansion, differentiation, and growth factor treatment of human chondrocytes identifies differentiation markers and growth-related genes.

Science.gov (United States)

Benz, Karin; Breit, Stephen; Lukoschek, Martin; Mau, Hans; Richter, Wiltrud

2002-04-26

This study is intended to optimise expansion and differentiation of cultured human chondrocytes by growth factor application and to identify molecular markers to monitor their differentiation state. We dissected the molecular consequences of matrix release, monolayer, and 3D-alginate culture, growth factor optimised expansion, and re-differentiation protocols by gene expression analysis. Among 19 common cartilage molecules assessed by cDNA array, six proved best to monitor differentiation. Instant down-regulation at release of cells from the matrix was strongest for COL 2A1, fibromodulin, and PRELP while LUM, CHI3L1, and CHI3L2 were expansion-related. Both gene sets reflected the physiologic effects of the most potent growth-inducing (PDGF-BB) and proteoglycan-inducing (BMP-4) factors. Only CRTAC1 expression correlated with 2D/3D switches while the molecular phenotype of native chondrocytes was not restored. The markers and optimised protocols we suggest can help to improve cell therapy of cartilage defects and chondrocyte differentiation from stem cell sources.
NHR-23 dependent collagen and hedgehog-related genes required for molting

International Nuclear Information System (INIS)

Kouns, Nathaniel A.; Nakielna, Johana; Behensky, Frantisek; Krause, Michael W.; Kostrouch, Zdenek; Kostrouchova, Marta

2011-01-01

Highlights: → NHR-23 is a critical regulator of nematode development and molting. → The manuscript characterizes the loss-of-function phenotype of an nhr-23 mutant. → Whole genome expression analysis identifies new potential targets of NHR-23. → Hedgehog-related genes are identified as NHR-23 dependent genes. → New link between sterol mediated signaling and regulation by NHR-23 is found. -- Abstract: NHR-23, a conserved member of the nuclear receptor family of transcription factors, is required for normal development in Caenorhabditis elegans where it plays a critical role in growth and molting. In a search for NHR-23 dependent genes, we performed whole genome comparative expression microarrays on both control and nhr-23 inhibited synchronized larvae. Genes that decreased in response to nhr-23 RNAi included several collagen genes. Unexpectedly, several hedgehog-related genes were also down-regulated after nhr-23 RNAi. A homozygous nhr-23 deletion allele was used to confirm the RNAi knockdown phenotypes and the changes in gene expression. Our results indicate that NHR-23 is a critical co-regulator of functionally linked genes involved in growth and molting and reveal evolutionary parallels among the ecdysozoa.
NHR-23 dependent collagen and hedgehog-related genes required for molting

Energy Technology Data Exchange (ETDEWEB)

Kouns, Nathaniel A.; Nakielna, Johana; Behensky, Frantisek [Laboratory of Model Systems, Institute of Inherited Metabolic Disorders, First Faculty of Medicine, Charles University, Prague (Czech Republic); Krause, Michael W. [Laboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD (United States); Kostrouch, Zdenek [Laboratory of Model Systems, Institute of Inherited Metabolic Disorders, First Faculty of Medicine, Charles University, Prague (Czech Republic); Kostrouchova, Marta, E-mail: marta.kostrouchova@lf1.cuni.cz [Laboratory of Model Systems, Institute of Inherited Metabolic Disorders, First Faculty of Medicine, Charles University, Prague (Czech Republic)

2011-10-07

Highlights: → NHR-23 is a critical regulator of nematode development and molting. → The manuscript characterizes the loss-of-function phenotype of an nhr-23 mutant. → Whole genome expression analysis identifies new potential targets of NHR-23. → Hedgehog-related genes are identified as NHR-23 dependent genes. → New link between sterol mediated signaling and regulation by NHR-23 is found. -- Abstract: NHR-23, a conserved member of the nuclear receptor family of transcription factors, is required for normal development in Caenorhabditis elegans where it plays a critical role in growth and molting. In a search for NHR-23 dependent genes, we performed whole genome comparative expression microarrays on both control and nhr-23 inhibited synchronized larvae. Genes that decreased in response to nhr-23 RNAi included several collagen genes. Unexpectedly, several hedgehog-related genes were also down-regulated after nhr-23 RNAi. A homozygous nhr-23 deletion allele was used to confirm the RNAi knockdown phenotypes and the changes in gene expression. Our results indicate that NHR-23 is a critical co-regulator of functionally linked genes involved in growth and molting and reveal evolutionary parallels among the ecdysozoa.
Tensor decomposition-based unsupervised feature extraction identifies candidate genes that induce post-traumatic stress disorder-mediated heart diseases.

Science.gov (United States)

Taguchi, Y-H

2017-12-21

Although post-traumatic stress disorder (PTSD) is primarily a mental disorder, it can cause additional symptoms that do not seem to be directly related to the central nervous system, which PTSD is assumed to directly affect. PTSD-mediated heart diseases are some of such secondary disorders. In spite of the significant correlations between PTSD and heart diseases, spatial separation between the heart and brain (where PTSD is primarily active) prevents researchers from elucidating the mechanisms that bridge the two disorders. Our purpose was to identify genes linking PTSD and heart diseases. In this study, gene expression profiles of various murine tissues observed under various types of stress or without stress were analyzed in an integrated manner using tensor decomposition (TD). Based upon the obtained features, ∼ 400 genes were identified as candidate genes that may mediate heart diseases associated with PTSD. Various gene enrichment analyses supported biological reliability of the identified genes. Ten genes encoding protein-, DNA-, or mRNA-interacting proteins-ILF2, ILF3, ESR1, ESR2, RAD21, HTT, ATF2, NR3C1, TP53, and TP63-were found to be likely to regulate expression of most of these ∼ 400 genes and therefore are candidate primary genes that cause PTSD-mediated heart diseases. Approximately 400 genes in the heart were also found to be strongly affected by various drugs whose known adverse effects are related to heart diseases and/or fear memory conditioning; these data support the reliability of our findings. TD-based unsupervised feature extraction turned out to be a useful method for gene selection and successfully identified possible genes causing PTSD-mediated heart diseases.
Genome-wide association study identifies candidate genes for starch content regulation in maize kernels

Directory of Open Access Journals (Sweden)

Na Liu

2016-07-01

Kernel starch content is an important trait in maize (Zea mays L.) as it accounts for 65% to 75% of the dry kernel weight and positively correlates with seed yield. A number of starch synthesis-related genes have been identified in maize in recent years. However, many loci underlying variation in starch content among maize inbred lines still remain to be identified. The current study is a genome-wide association study that used a set of 263 maize inbred lines. In this panel, the average kernel starch content was 66.99%, ranging from 60.60% to 71.58% over the three study years. These inbred lines were genotyped with the SNP50 BeadChip maize array, which is comprised of 56,110 evenly spaced, random SNPs. Population structure was controlled by a mixed linear model (MLM) as implemented in the software package TASSEL. After the statistical analyses, four SNPs were identified as significantly associated with starch content (P ≤ 0.0001), among which one each are located on chromosomes 1 and 5 and two are on chromosome 2. Furthermore, 77 candidate genes associated with starch synthesis were found within the 100-kb intervals containing these four QTLs, and four highly associated genes were within 20-kb intervals of the associated SNPs. Among the four genes, Glucose-1-phosphate adenylyltransferase (APS1; Gene ID GRMZM2G163437) is known as an important regulator of kernel starch content. The identified SNPs, QTLs, and candidate genes may not only be readily used for germplasm improvement by marker-assisted selection in breeding, but can also elucidate the genetic basis of starch content. Further studies on these identified candidate genes may help determine the molecular mechanisms regulating kernel starch content in maize and other important cereal crops.
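The step of collecting candidate genes within 100-kb and 20-kb intervals of associated SNPs can be sketched as a simple coordinate overlap query. The gene names and positions below are hypothetical. The sketch also assumes the window is measured on each side of the SNP, which the abstract does not specify.

```python
from typing import NamedTuple


class Gene(NamedTuple):
    name: str
    chrom: str
    start: int  # first base of the gene (bp)
    end: int    # last base of the gene (bp)


def genes_within_window(genes, chrom, pos, window_bp):
    """Names of genes on `chrom` that overlap [pos - window_bp, pos + window_bp]."""
    lo, hi = pos - window_bp, pos + window_bp
    return [g.name for g in genes
            if g.chrom == chrom and g.start <= hi and g.end >= lo]


# Hypothetical annotation and a hypothetical associated SNP at chr2:20,000.
annotation = [
    Gene("geneA", "2", 1_000, 5_000),
    Gene("geneB", "2", 60_000, 65_000),
    Gene("geneC", "2", 150_000, 155_000),
    Gene("geneD", "5", 2_000, 4_000),
]
near_20kb = genes_within_window(annotation, "2", 20_000, 20_000)
near_100kb = genes_within_window(annotation, "2", 20_000, 100_000)
print(near_20kb, near_100kb)
```

A narrower window returns fewer, more tightly linked candidates, which mirrors the distinction the abstract draws between the 77 genes in the wider intervals and the four genes close to the SNPs.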
Identifying novel genes and biological processes relevant to the development of cancer therapy-induced mucositis: An informative gene network analysis.

Directory of Open Access Journals (Sweden)

Cielito C Reyes-Gibby

Mucositis is a complex, dose-limiting toxicity of chemotherapy or radiotherapy that leads to painful mouth ulcers, difficulty eating or swallowing, gastrointestinal distress, and reduced quality of life for patients with cancer. Mucositis is most common for those undergoing high-dose chemotherapy and hematopoietic stem cell transplantation and for those being treated for malignancies of the head and neck. Treatment and management of mucositis remain challenging. It is expected that multiple genes are involved in the formation, severity, and persistence of mucositis. We used Ingenuity Pathway Analysis (IPA), a novel network-based approach that integrates complex intracellular and intercellular interactions involved in diseases, to systematically explore the molecular complexity of mucositis. As a first step, we searched the literature to identify genes that harbor or are close to the genetic variants significantly associated with mucositis. Our literature review identified 27 candidate genes, of which ERCC1, XRCC1, and MTHFR were the most frequently studied for mucositis. On the basis of this 27-gene list, we used IPA to generate gene networks for mucositis. The most biologically significant novel molecules identified through IPA analyses included TP53, CTNNB1, MYC, RB1, P38 MAPK, and EP300. Additionally, uracil degradation II (reductive) and thymine degradation pathways (p = 1.06-08) were most significant. Finally, utilizing 66 SNPs within the 8 most connected IPA-derived candidate molecules, we conducted a genetic association study for oral mucositis in the head and neck cancer patients who were treated using chemotherapy and/or radiation therapy (186 head and neck cancer patients with oral mucositis vs. 699 head and neck cancer patients without oral mucositis). The top ranked gene identified through this association analysis was RB1 (rs2227311, p-value = 0.034, odds ratio = 0.67). In conclusion, gene network analysis identified novel molecules and
Identifying novel genes and biological processes relevant to the development of cancer therapy-induced mucositis: An informative gene network analysis.

Science.gov (United States)

Reyes-Gibby, Cielito C; Melkonian, Stephanie C; Wang, Jian; Yu, Robert K; Shelburne, Samuel A; Lu, Charles; Gunn, Gary Brandon; Chambers, Mark S; Hanna, Ehab Y; Yeung, Sai-Ching J; Shete, Sanjay

2017-01-01

Mucositis is a complex, dose-limiting toxicity of chemotherapy or radiotherapy that leads to painful mouth ulcers, difficulty eating or swallowing, gastrointestinal distress, and reduced quality of life for patients with cancer. Mucositis is most common for those undergoing high-dose chemotherapy and hematopoietic stem cell transplantation and for those being treated for malignancies of the head and neck. Treatment and management of mucositis remain challenging. It is expected that multiple genes are involved in the formation, severity, and persistence of mucositis. We used Ingenuity Pathway Analysis (IPA), a novel network-based approach that integrates complex intracellular and intercellular interactions involved in diseases, to systematically explore the molecular complexity of mucositis. As a first step, we searched the literature to identify genes that harbor or are close to the genetic variants significantly associated with mucositis. Our literature review identified 27 candidate genes, of which ERCC1, XRCC1, and MTHFR were the most frequently studied for mucositis. On the basis of this 27-gene list, we used IPA to generate gene networks for mucositis. The most biologically significant novel molecules identified through IPA analyses included TP53, CTNNB1, MYC, RB1, P38 MAPK, and EP300. Additionally, uracil degradation II (reductive) and thymine degradation pathways (p = 1.06-08) were most significant. Finally, utilizing 66 SNPs within the 8 most connected IPA-derived candidate molecules, we conducted a genetic association study for oral mucositis in the head and neck cancer patients who were treated using chemotherapy and/or radiation therapy (186 head and neck cancer patients with oral mucositis vs. 699 head and neck cancer patients without oral mucositis). The top ranked gene identified through this association analysis was RB1 (rs2227311, p-value = 0.034, odds ratio = 0.67). In conclusion, gene network analysis identified novel molecules and biological
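The association result above is reported as an odds ratio. As a reminder of how such a value is obtained from a 2×2 case-control table, here is a minimal sketch with a Woolf (log-based) 95% confidence interval. The counts are entirely hypothetical and are not the study's data, and the abstract does not state which model produced its estimate.

```python
import math


def odds_ratio(a, b, c, d):
    """Odds ratio for a 2x2 table.

    a: cases with the variant      b: cases without the variant
    c: controls with the variant   d: controls without the variant
    """
    return (a * d) / (b * c)


def woolf_ci(a, b, c, d, z=1.96):
    """Approximate confidence interval computed on the log-odds scale."""
    log_or = math.log(odds_ratio(a, b, c, d))
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    return math.exp(log_or - z * se), math.exp(log_or + z * se)


# Hypothetical counts, for illustration only.
a, b, c, d = 20, 80, 30, 70
or_value = odds_ratio(a, b, c, d)
ci_low, ci_high = woolf_ci(a, b, c, d)
print(round(or_value, 3), round(ci_low, 3), round(ci_high, 3))
```

An odds ratio below 1, as in the abstract, means the variant is less frequent among cases than among controls.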
Resistance gene candidates identified by PCR with degenerate oligonucleotide primers map to clusters of resistance genes in lettuce.

Science.gov (United States)

Shen, K A; Meyers, B C; Islam-Faridi, M N; Chin, D B; Stelly, D M; Michelmore, R W

1998-08-01

The recent cloning of genes for resistance against diverse pathogens from a variety of plants has revealed that many share conserved sequence motifs. This provides the possibility of isolating numerous additional resistance genes by polymerase chain reaction (PCR) with degenerate oligonucleotide primers. We amplified resistance gene candidates (RGCs) from lettuce with multiple combinations of primers with low degeneracy designed from motifs in the nucleotide binding sites (NBSs) of RPS2 of Arabidopsis thaliana and N of tobacco. Genomic DNA, cDNA, and bacterial artificial chromosome (BAC) clones were successfully used as templates. Four families of sequences were identified that had the same similarity to each other as to resistance genes from other species. The relationship of the amplified products to resistance genes was evaluated by several sequence and genetic criteria. The amplified products contained open reading frames with additional sequences characteristic of NBSs. Hybridization of RGCs to genomic DNA and to BAC clones revealed large numbers of related sequences. Genetic analysis demonstrated the existence of clustered multigene families for each of the four RGC sequences. This parallels classical genetic data on clustering of disease resistance genes. Two of the four families mapped to known clusters of resistance genes; these two families were therefore studied in greater detail. Additional evidence that these RGCs could be resistance genes was gained by the identification of leucine-rich repeat (LRR) regions in sequences adjoining the NBS similar to those in RPM1 and RPS2 of A. thaliana. Fluorescent in situ hybridization confirmed the clustered genomic distribution of these sequences. The use of PCR with degenerate oligonucleotide primers is therefore an efficient method to identify numerous RGCs in plants.
Identification of apoptosis-related PLZF target genes

International Nuclear Information System (INIS)

Bernardo, Maria Victoria; Yelo, Estefania; Gimeno, Lourdes; Campillo, Jose Antonio; Parrado, Antonio

2007-01-01

The PLZF gene encodes a BTB/POZ-zinc finger-type transcription factor, involved in physiological development, proliferation, differentiation, and apoptosis. In this paper, we investigate proliferation, survival, and gene expression regulation in stable clones from the human haematopoietic K562, DG75, and Jurkat cell lines with inducible expression of PLZF. In Jurkat cells, but not in K562 and DG75 cells, PLZF induced growth suppression and apoptosis in a cell density-dependent manner. Deletion of the BTB/POZ domain of PLZF abrogated growth suppression and apoptosis. PLZF was expressed with a nuclear speckled pattern distinctively in the full-length PLZF-expressing Jurkat clones, suggesting that the nuclear speckled localization is required for PLZF-induced apoptosis. By microarray analysis, we identified that the apoptosis-inducer TP53INP1, ID1, and ID3 genes were upregulated, and the apoptosis-inhibitor TERT gene was downregulated. The identification of apoptosis-related PLZF target genes may have biological and clinical relevance in cancer typified by altered PLZF expression
Transcription profiling and identification of infection-related genes in Phytophthora cactorum.

Science.gov (United States)

Chen, Xiao-Ren; Huang, Shen-Xin; Zhang, Ye; Sheng, Gui-Lin; Zhang, Bo-Yue; Li, Qi-Yuan; Zhu, Feng; Xu, Jing-You

2018-04-01

Phytophthora cactorum, an oomycete pathogen, infects more than 200 plant species within several plant families. To gain insight into the repertoire of the infection-related genes of P. cactorum, Illumina RNA-Seq was used to perform a global transcriptome analysis of three life cycle stages of the pathogen, mycelia (MY), zoospores (ZO) and germinating cysts with germ tubes (GC). From over 9.8 million Illumina reads for each library, 18,402, 18,569 and 19,443 distinct genes were identified for MY, ZO and GC libraries, respectively. Furthermore, the transcriptome difference among MY, ZO and GC stages was investigated. Gene ontology (GO) and KEGG pathway enrichment analyses revealed diverse biological functions and processes. Comparative analysis identified a large number of genes that are associated with specific stages and pathogenicity, including 166 effector genes. Of them, most of RXLR and NLP genes showed induction while the majority of CRN genes were down-regulated in GC, the important pre-infection stage, compared to either MY or ZO. And 14 genes encoding small cysteine-rich (SCR) secretory proteins showed differential expression during the developmental stages and in planta. Ectopic expression in the Solanaceae indicated that SCR113 and one elicitin PcINF1 can trigger cell death on Nicotiana benthamiana, tobacco (N. tabacum) and tomato (Solanum lycopersicum) leaves. Neither conserved domain nor homologues of SCR113 in other organisms can be identified. Collectively, our study provides a comprehensive examination of gene expression across three P. cactorum developmental stages and describes pathogenicity-related genes, all of which will help elucidate the pathogenicity mechanism of this destructive pathogen.
Candidate Essential Genes in Burkholderia cenocepacia J2315 Identified by Genome-Wide TraDIS

KAUST Repository

Wong, Yee-Chin; Abd El Ghany, Moataz; Naeem, Raeece; Lee, Kok-Wei; Tan, Yung-Chie; Pain, Arnab; Nathan, Sheila

2016-01-01

Burkholderia cenocepacia infection often leads to fatal cepacia syndrome in cystic fibrosis patients. However, antibiotic therapy rarely results in complete eradication of the pathogen due to its intrinsic resistance to many clinically available antibiotics. Recent attention has turned to the identification of essential genes as the proteins encoded by these genes may serve as potential targets for development of novel antimicrobials. In this study, we utilized TraDIS (Transposon Directed Insertion-site Sequencing) as a genome-wide screening tool to facilitate the identification of B. cenocepacia genes essential for its growth and viability. A transposon mutant pool consisting of approximately 500,000 mutants was successfully constructed, with more than 400,000 unique transposon insertion sites identified by computational analysis of TraDIS datasets. The saturated library allowed for the identification of 383 genes that were predicted to be essential in B. cenocepacia. We extended the application of TraDIS to identify conditionally essential genes required for in vitro growth and revealed an additional repertoire of 439 genes to be crucial for B. cenocepacia growth under nutrient-depleted conditions. The library of B. cenocepacia mutants can subsequently be subjected to various biologically related conditions to facilitate the discovery of genes involved in niche adaptation as well as pathogenicity and virulence.
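The essentiality call described above can be illustrated with a minimal sketch. It assumes a simplified rule that is not stated in the abstract: a gene is flagged as a candidate essential gene when its insertion index (unique insertion sites divided by gene length) falls below a fixed cutoff. The gene names, coordinates, insertion positions and the cutoff value are all hypothetical.

```python
def insertion_index(unique_sites, gene_start, gene_end):
    """Unique transposon insertion sites per base pair of a gene (1-based, inclusive)."""
    length = gene_end - gene_start + 1
    hits = sum(1 for pos in unique_sites if gene_start <= pos <= gene_end)
    return hits / length


def call_candidate_essential(genes, insertion_sites, threshold):
    """Return genes whose insertion index is below the threshold.

    genes: dict mapping gene name -> (start, end) coordinates.
    insertion_sites: iterable of insertion positions; repeats are collapsed.
    threshold: hypothetical fixed cutoff chosen for illustration only.
    """
    unique_sites = set(insertion_sites)
    return [
        name
        for name, (start, end) in genes.items()
        if insertion_index(unique_sites, start, end) < threshold
    ]


# Toy data: gene2 tolerates almost no insertions, so it is flagged.
toy_genes = {"gene1": (1, 100), "gene2": (101, 200), "gene3": (201, 300)}
toy_sites = [5, 20, 20, 47, 80, 150, 210, 230, 260, 290]
candidates = call_candidate_essential(toy_genes, toy_sites, threshold=0.02)
```

A fixed cutoff keeps the example short; a real analysis would need a data-driven way to choose it, and would have to account for gene regions that tolerate insertions without loss of function.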
Candidate essential genes in Burkholderia cenocepacia J2315 identified by genome-wide TraDIS

Directory of Open Access Journals (Sweden)

Yee-Chin Wong

2016-08-01

Burkholderia cenocepacia infection often leads to fatal cepacia syndrome in cystic fibrosis patients. However, antibiotic therapy rarely results in complete eradication of the pathogen due to its intrinsic resistance to many clinically available antibiotics. Recent attention has turned to the identification of essential genes as the proteins encoded by these genes may serve as potential targets for development of novel antimicrobials. In this study, we utilized TraDIS (Transposon Directed Insertion-site Sequencing) as a genome-wide screening tool to facilitate the identification of B. cenocepacia genes essential for its growth and viability. A transposon mutant pool consisting of approximately 500,000 mutants was successfully constructed, with more than 400,000 unique transposon insertion sites identified by computational analysis of TraDIS datasets. The saturated library allowed for the identification of 383 genes that were predicted to be essential in B. cenocepacia. We extended the application of TraDIS to identify conditionally essential genes required for in vitro growth and revealed an additional repertoire of 439 genes to be crucial for B. cenocepacia growth under nutrient-depleted conditions. The library of B. cenocepacia mutants can subsequently be subjected to various biologically related conditions to facilitate the discovery of genes involved in niche adaptation as well as pathogenicity and virulence.
Transcriptome analysis and anthocyanin-related genes in red leaf lettuce.

Science.gov (United States)

Zhang, Y Z; Xu, S Z; Cheng, Y W; Ya, H Y; Han, J M

2016-01-29

This study aimed to analyze the transcriptome profile of red lettuce and identify the genes involved in anthocyanin accumulation. Red leaf lettuce is a popular vegetable owing to its high anthocyanin content. However, there is limited information available about the genes involved in anthocyanin biosynthesis in this species. In this study, transcriptomes of 15-day-old seedlings and 40-day-old red lettuce leaves were analyzed using an Illumina HiSeq 2500 platform. A total of 10.6 GB of clean data were obtained and de novo assembled into 83,333 unigenes with an N50 of 1067. After annotation against public databases, 51,850 unigene sequences were identified, among which 46,087 were annotated in the NCBI non-redundant protein database, and 41,752 were annotated in the Swiss-Prot database. A total of 9125 unigenes were mapped into 163 pathways using the Kyoto Encyclopedia of Genes and Genomes database. Thirty-four structural genes were found to cover the main steps of the anthocyanin pathway, including chalcone synthase, chalcone isomerase, flavanone 3-hydroxylase, flavonoid 3'-hydroxylase, flavonoid 3',5'-hydroxylase, dihydroflavonol 4-reductase, and anthocyanidin synthase. Seven MYB, three bHLH, and two WD40 genes, considered anthocyanin regulatory genes, were also identified. In addition, 3607 simple sequence repeat (SSR) markers were identified from 2916 unigenes. This research uncovered the transcriptomic characteristics of red leaf lettuce seedlings and mature plants. The identified candidate genes related to anthocyanin biosynthesis and the detected SSRs provide useful tools for future molecular breeding studies.
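For readers unfamiliar with the N50 statistic quoted for the assembly, the following minimal sketch shows how it is commonly computed from a list of sequence lengths. The toy lengths are hypothetical and are not taken from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    contain at least half of the total assembled bases.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy unigene lengths (hypothetical values for illustration).
toy_lengths = [2000, 1500, 1000, 500, 300, 200]
toy_n50 = n50(toy_lengths)
```

In the toy data the total is 5500 bases, and the two longest sequences (2000 + 1500) are the first to reach half of that, so the N50 is the length of the second one.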
Integrated database for identifying candidate genes for Aspergillus flavus resistance in maize.

Science.gov (United States)

Kelley, Rowena Y; Gresham, Cathy; Harper, Jonathan; Bridges, Susan M; Warburton, Marilyn L; Hawkins, Leigh K; Pechanova, Olga; Peethambaran, Bela; Pechan, Tibor; Luthe, Dawn S; Mylroie, J E; Ankala, Arunkanth; Ozkan, Seval; Henry, W B; Williams, W P

2010-10-07

Aspergillus flavus Link:Fr, an opportunistic fungus that produces aflatoxin, is pathogenic to maize and other oilseed crops. Aflatoxin is a potent carcinogen, and its presence markedly reduces the value of grain. Understanding and enhancing host resistance to A. flavus infection and/or subsequent aflatoxin accumulation is generally considered an efficient means of reducing grain losses to aflatoxin. Different proteomic, genomic and genetic studies of maize (Zea mays L.) have generated large data sets with the goal of identifying genes responsible for conferring resistance to A. flavus, or aflatoxin. In order to maximize the usage of different data sets in new studies, including association mapping, we have constructed a relational database with web interface integrating the results of gene expression, proteomic (both gel-based and shotgun), Quantitative Trait Loci (QTL) genetic mapping studies, and sequence data from the literature to facilitate selection of candidate genes for continued investigation. The Corn Fungal Resistance Associated Sequences Database (CFRAS-DB) (http://agbase.msstate.edu/) was created with the main goal of identifying genes important to aflatoxin resistance. CFRAS-DB is implemented using MySQL as the relational database management system running on a Linux server, using an Apache web server, and Perl CGI scripts as the web interface. The database and the associated web-based interface allow researchers to examine many lines of evidence (e.g. microarray, proteomics, QTL studies, SNP data) to assess the potential role of a gene or group of genes in the response of different maize lines to A. flavus infection and subsequent production of aflatoxin by the fungus. CFRAS-DB provides the first opportunity to integrate data pertaining to the problem of A. flavus and aflatoxin resistance in maize in one resource and to support queries across different datasets. The web-based interface gives researchers different query options for mining the database.
Transcriptomic variation among six Arabidopsis thaliana accessions identified several novel genes controlling aluminium tolerance.

Science.gov (United States)

Kusunoki, Kazutaka; Nakano, Yuki; Tanaka, Keisuke; Sakata, Yoichi; Koyama, Hiroyuki; Kobayashi, Yuriko

2017-02-01

Differences in the expression levels of aluminium (Al) tolerance genes are a known determinant of Al tolerance among plant varieties. We combined transcriptomic analysis of six Arabidopsis thaliana accessions with contrasting Al tolerance and a reverse genetic approach to identify Al-tolerance genes responsible for differences in Al tolerance between accession groups. Gene expression variation increased in the signal transduction process under Al stress and in growth-related processes in the absence of stress. Co-expression analysis and promoter single nucleotide polymorphism searching suggested that both trans-acting polymorphisms of Al signal transduction pathway and cis-acting polymorphisms in the promoter sequences caused the variations in gene expression associated with Al tolerance. Compared with the wild type, Al sensitivity increased in T-DNA knockout (KO) lines for five genes, including TARGET OF AVRB OPERATION1 (TAO1) and an unannotated gene (At5g22530). These were identified from 53 Al-inducible genes showing significantly higher expression in tolerant accessions than in sensitive accessions. These results indicate that the difference in transcriptional signalling is partly associated with the natural variation in Al tolerance in Arabidopsis. Our study also demonstrates the feasibility of comparative transcriptome analysis by using natural genetic variation for the identification of genes responsible for Al stress tolerance. © 2016 John Wiley & Sons Ltd.
Gene Co-expression Analysis to Characterize Genes Related to Marbling Trait in Hanwoo (Korean) Cattle.

Science.gov (United States)

Lim, Dajeong; Lee, Seung-Hwan; Kim, Nam-Kuk; Cho, Yong-Min; Chai, Han-Ha; Seong, Hwan-Hoo; Kim, Heebal

2013-01-01

Marbling (intramuscular fat) is an important trait that affects meat quality and is a causal factor determining the price of beef in the Korean beef market. It is a complex trait and has many biological pathways related to muscle and fat. There is a need to identify functional modules or genes related to marbling traits and investigate their relationships through a weighted gene co-expression network analysis based on the system level. Therefore, we investigated the co-expression relationships of genes related to the 'marbling score' trait and systematically analyzed the network topology in Hanwoo (Korean cattle). As a result, we determined 3 modules (gene groups) that showed statistically significant results for marbling score. In particular, one module (denoted as red) has a statistically significant result for marbling score (p = 0.008), intramuscular fat (p = 0.02) and water capacity (p = 0.006). From functional enrichment and relationship analysis of the red module, the pathway hub genes (IL6, CHRNE, RB1, INHBA and NPPA) have a direct interaction relationship and share the biological functions related to fat or muscle, such as adipogenesis or muscle growth. This is the first gene network study with m. longissimus in Hanwoo to observe co-expression patterns in divergent marbling phenotypes. It may provide insights into the functional mechanisms of the marbling trait.
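The core step of a weighted gene co-expression network analysis can be sketched as follows. This is a minimal illustration, not the authors' pipeline: it assumes an unsigned network in which the edge weight between two genes is the absolute Pearson correlation raised to a soft-thresholding power beta, and it ranks genes by connectivity (the sum of their edge weights) as a simple proxy for hub status. The gene names, expression values and beta = 6 are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length profiles."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def soft_threshold_adjacency(expression, beta=6):
    """Unsigned weighted adjacency |cor(g, h)| ** beta for each gene pair."""
    genes = sorted(expression)
    return {
        (g, h): abs(pearson(expression[g], expression[h])) ** beta
        for i, g in enumerate(genes)
        for h in genes[i + 1:]
    }


def connectivity(adjacency):
    """Sum of edge weights per gene; high values suggest hub-like genes."""
    totals = {}
    for (g, h), weight in adjacency.items():
        totals[g] = totals.get(g, 0.0) + weight
        totals[h] = totals.get(h, 0.0) + weight
    return totals


# Toy expression profiles across four samples (hypothetical values).
toy_expression = {
    "geneA": [1, 2, 3, 4],
    "geneB": [2, 4, 6, 8],
    "geneC": [4, 3, 2, 1],
    "geneD": [1, 3, 2, 4],
}
toy_adjacency = soft_threshold_adjacency(toy_expression)
toy_connectivity = connectivity(toy_adjacency)
```

Raising the correlation to a power suppresses weak correlations while keeping strong ones, which is why geneD (correlation 0.8 with geneA) ends up with a much smaller weight than the perfectly correlated pairs.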
Gene Co-expression Analysis to Characterize Genes Related to Marbling Trait in Hanwoo (Korean Cattle)

Directory of Open Access Journals (Sweden)

Dajeong Lim

2013-01-01

Marbling (intramuscular fat) is an important trait that affects meat quality and is a causal factor determining the price of beef in the Korean beef market. It is a complex trait and has many biological pathways related to muscle and fat. There is a need to identify functional modules or genes related to marbling traits and investigate their relationships through a weighted gene co-expression network analysis based on the system level. Therefore, we investigated the co-expression relationships of genes related to the ‘marbling score’ trait and systematically analyzed the network topology in Hanwoo (Korean cattle). As a result, we determined 3 modules (gene groups) that showed statistically significant results for marbling score. In particular, one module (denoted as red) has a statistically significant result for marbling score (p = 0.008), intramuscular fat (p = 0.02) and water capacity (p = 0.006). From functional enrichment and relationship analysis of the red module, the pathway hub genes (IL6, CHRNE, RB1, INHBA and NPPA) have a direct interaction relationship and share the biological functions related to fat or muscle, such as adipogenesis or muscle growth. This is the first gene network study with m. longissimus in Hanwoo to observe co-expression patterns in divergent marbling phenotypes. It may provide insights into the functional mechanisms of the marbling trait.

Intersex related gene expression profiles in clams Scrobicularia plana: Molecular markers and environmental application

International Nuclear Information System (INIS)

Ciocan, Corina M.; Cubero-Leon, Elena; Langston, William J.; Pope, Nick; Cornelius, Keith; Hill, E.M.; Alvarez-Munoz, Diana; Indiveri, Paolo; Lerebours, Adelaide; Minier, Christophe; Rotchell, Jeanette M.

2015-01-01

Highlights:
• Expression of intersex-related genes was analysed in clam gonads sampled from the Channel.
• Genes were differentially expressed at sites with varying levels of intersex and contaminants.
• Correlations between gene expressions, key contaminants and sampling sites were identified.
• No single gene expression studied correlated with intersex incidence.

Abstract: Intersex, the appearance of female characteristics in male gonads, has been identified in several aquatic species. It is a widespread phenomenon in populations of the bivalve, Scrobicularia plana, from the southwest coast of the U.K. Genes previously identified as differentially expressed (ferritin; testicular haploid expressed gene, THEG; proliferating cell nuclear antigen, PCNA; receptor activated protein kinase C, RACK; cytochrome B, CYB; and cytochrome c oxidase 1, COX1) in intersex clams relative to normal male clams were selected for characterisation and an environmental survey of the Channel region. Transcripts were significantly differentially expressed at sites with varying intersex incidence and contaminant burdens. Significant correlations between specific gene expressions, key contaminants and sampling locations have been identified, though no single gene was associated with intersex incidence. The results highlight the difficulty in understanding the intersex phenomenon in molluscs where there is still a lack of knowledge on the control of normal reproduction.
Assembly of inflammation-related genes for pathway-focused genetic analysis.

Directory of Open Access Journals (Sweden)

Matthew J Loza

2007-10-01

Recent identifications of associations between novel variants in inflammation-related genes and several common diseases emphasize the need for systematic evaluations of these genes in disease susceptibility. Considering that many genes are involved in the complex inflammation responses and many genetic variants in these genes have the potential to alter the functions and expression of these genes, we assembled a list of key inflammation-related genes to facilitate the identification of genetic associations of diseases with an inflammation-related etiology. We first reviewed various phases of inflammation responses, including the development of immune cells, sensing of danger, influx of cells to sites of insult, activation and functional responses of immune and non-immune cells, and resolution of the immune response. Assisted by the Ingenuity Pathway Analysis, we then identified 17 functional sub-pathways that are involved in one or multiple phases. This organization would greatly increase the chance of detecting gene-gene interactions by hierarchical clustering of genes with their functional closeness in a pathway. Finally, as an example application, we have developed tagging single nucleotide polymorphism (tSNP) arrays for populations of European and African descent to capture all the common variants of these key inflammation-related genes. Assays of these tSNPs have been designed and assembled into two Affymetrix ParAllele customized chips, one each for European (12,011 SNPs) and African (21,542 SNPs) populations. These tSNPs have greater coverage for these inflammation-related genes compared to the existing genome-wide arrays, particularly in the African population. These tSNP arrays can facilitate systematic evaluation of inflammation pathways in disease susceptibility. For additional applications, other genotyping platforms could also be employed. For existing genome-wide association data, this list of key inflammation-related genes and
Prediction of disease-related genes based on weighted tissue-specific networks by using DNA methylation.

Science.gov (United States)

Li, Min; Zhang, Jiayi; Liu, Qing; Wang, Jianxin; Wu, Fang-Xiang

2014-01-01

Predicting disease-related genes is one of the most important tasks in bioinformatics and systems biology. With the advances in high-throughput techniques, a large number of protein-protein interactions are available, which makes it possible to identify disease-related genes at the network level. However, network-based identification of disease-related genes remains a challenge because considerable false positives still exist in the currently available protein interaction networks (PIN). Considering the fact that the majority of genetic disorders tend to manifest only in a single or a few tissues, we constructed tissue-specific networks (TSN) by integrating PIN and tissue-specific data. We further weighted the constructed tissue-specific network (WTSN) by using DNA methylation, as it plays an irreplaceable role in the development of complex diseases. A PageRank-based method was developed to identify disease-related genes from the constructed networks. To validate the effectiveness of the proposed method, we constructed PIN, weighted PIN (WPIN), TSN and WTSN for colon cancer and leukemia, respectively. The experimental results on colon cancer and leukemia show that the combination of tissue-specific data and DNA methylation can help to identify disease-related genes more accurately. Moreover, the PageRank-based method was effective in predicting disease-related genes in the case studies of colon cancer and leukemia. Tissue-specific data and DNA methylation are two important factors in the study of human diseases. The same method implemented on the WTSN can achieve better results than when implemented on the original PIN, WPIN, or TSN. The PageRank-based method outperforms the degree centrality-based method for identifying disease-related genes from WTSN.
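To make the idea of ranking genes with PageRank on a weighted network concrete, here is a minimal sketch. The abstract does not specify the authors' exact formulation, so this uses the textbook PageRank iteration on an undirected weighted graph, where each node passes its score to neighbours in proportion to edge weight. The gene names, edge weights (standing in for methylation-derived weights) and the damping factor of 0.85 are assumptions made for illustration.

```python
def weighted_pagerank(edges, damping=0.85, iterations=100, tol=1e-10):
    """Rank nodes of an undirected weighted graph by PageRank.

    edges: iterable of (node_u, node_v, positive_weight) tuples.
    Returns a dict mapping node -> score; scores sum to 1.
    """
    neighbors = {}
    for u, v, w in edges:
        neighbors.setdefault(u, {})[v] = w
        neighbors.setdefault(v, {})[u] = w
    nodes = sorted(neighbors)
    n = len(nodes)
    # Total edge weight at each node, used to normalise outgoing score.
    strength = {u: sum(neighbors[u].values()) for u in nodes}
    rank = {u: 1.0 / n for u in nodes}
    for _ in range(iterations):
        new_rank = {}
        for u in nodes:
            incoming = sum(
                rank[v] * w / strength[v] for v, w in neighbors[u].items()
            )
            new_rank[u] = (1 - damping) / n + damping * incoming
        delta = sum(abs(new_rank[u] - rank[u]) for u in nodes)
        rank = new_rank
        if delta < tol:
            break
    return rank


# Toy weighted network (hypothetical genes and weights).
toy_edges = [
    ("g1", "g2", 0.9),
    ("g1", "g3", 0.8),
    ("g1", "g4", 0.7),
    ("g2", "g3", 0.1),
]
toy_scores = weighted_pagerank(toy_edges)
top_gene = max(toy_scores, key=toy_scores.get)
```

Unlike plain degree centrality, the score of a gene here depends on the scores of its neighbours and on how strongly it is connected to them, so a low-weight (less reliable) interaction contributes little.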
Cartilage-selective genes identified in genome-scale analysis of non-cartilage and cartilage gene expression

Directory of Open Access Journals (Sweden)

Cohn Zachary A

2007-06-01

Background: Cartilage plays a fundamental role in the development of the human skeleton. Early in embryogenesis, mesenchymal cells condense and differentiate into chondrocytes to shape the early skeleton. Subsequently, the cartilage anlagen differentiate to form the growth plates, which are responsible for linear bone growth, and the articular chondrocytes, which facilitate joint function. However, despite the multiplicity of roles of cartilage during human fetal life, surprisingly little is known about its transcriptome. To address this, a whole genome microarray expression profile was generated using RNA isolated from 18–22 week human distal femur fetal cartilage and compared with a database of control normal human tissues aggregated at UCLA, termed Celsius. Results: 161 cartilage-selective genes were identified, defined as genes significantly expressed in cartilage with low expression and little variation across a panel of 34 non-cartilage tissues. Among these 161 genes were cartilage-specific genes such as cartilage collagen genes and 25 genes which have been associated with skeletal phenotypes in humans and/or mice. Many of the other cartilage-selective genes do not have established roles in cartilage or are novel, unannotated genes. Quantitative RT-PCR confirmed the unique pattern of gene expression observed by microarray analysis. Conclusion: Defining the gene expression pattern for cartilage has identified new genes that may contribute to human skeletogenesis as well as provided further candidate genes for skeletal dysplasias. The data suggest that fetal cartilage is a complex and transcriptionally active tissue and demonstrate that the set of genes selectively expressed in the tissue has been greatly underestimated.
Detecting Horizontal Gene Transfer between Closely Related Taxa.

Directory of Open Access Journals (Sweden)

Orit Adato

2015-10-01

Horizontal gene transfer (HGT), the transfer of genetic material between organisms, is crucial for genetic innovation and the evolution of genome architecture. Existing HGT detection algorithms rely on a strong phylogenetic signal distinguishing the transferred sequence from ancestral (vertically derived) genes in its recipient genome. Detecting HGT between closely related species or strains is challenging, as the phylogenetic signal is usually weak and the nucleotide composition is normally nearly identical. Nevertheless, detecting HGT between congeneric species or strains is of great importance, especially in clinical microbiology, where understanding the emergence of new virulent and drug-resistant strains is crucial, and often time-sensitive. We developed a novel, self-contained technique named Near HGT, based on the synteny index, to measure the divergence of a gene from its native genomic environment and used it to identify candidate HGT events between closely related strains. The method confirms candidate transferred genes based on the constant relative mutability (CRM). Using CRM, the algorithm assigns a confidence score based on "unusual" sequence divergence. A gene exhibiting exceptional deviations according to both synteny and mutability criteria is considered a validated HGT product. We first applied the technique to a set of three E. coli strains and detected several highly probable horizontally acquired genes. We then compared the method to existing HGT detection tools using a larger strain data set. When combined with additional approaches, our new algorithm provides a richer picture and brings us closer to the goal of detecting all newly acquired genes in a particular strain.
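The synteny criterion can be illustrated with a small sketch. It assumes one common way of defining a synteny index, which may differ in detail from the Near HGT implementation: for a gene present in two genomes, count how many genes are shared between its k-neighbourhoods (the k genes on each side) in the two gene orders. Genomes are treated as circular, and the gene labels, gene orders and k = 2 are hypothetical.

```python
def neighborhood(genome, gene, k):
    """Set of up to 2k genes flanking `gene` in a circular gene order."""
    i = genome.index(gene)
    n = len(genome)
    flank = {genome[(i + d) % n] for d in range(-k, k + 1) if d != 0}
    return flank - {gene}


def synteny_index(genome_a, genome_b, gene, k):
    """Number of genes shared by the k-neighbourhoods of `gene` in two genomes."""
    return len(neighborhood(genome_a, gene, k) & neighborhood(genome_b, gene, k))


# Toy gene orders for two strains; gene "d" sits in a different context in strain_b.
strain_a = ["a", "b", "c", "d", "e", "f", "g", "h"]
strain_b = ["a", "b", "c", "e", "f", "g", "d", "h"]
toy_si = {gene: synteny_index(strain_a, strain_b, gene, k=2) for gene in strain_a}
lowest = min(toy_si, key=toy_si.get)
```

A gene whose index is much lower than that of its neighbours has lost its native genomic context, which is the kind of signal that would then be checked against a sequence-divergence criterion.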
Identification of immune response-related genes in the Chinese oak silkworm, Antheraea pernyi by suppression subtractive hybridization.

Science.gov (United States)

Liu, Qiu-Ning; Zhu, Bao-Jian; Wang, Lei; Wei, Guo-Qing; Dai, Li-Shang; Lin, Kun-Zhang; Sun, Yu; Qiu, Jian-Feng; Fu, Wei-Wei; Liu, Chao-Liang

2013-11-01

Insects possess an innate immune system that responds to invading microorganisms. In this study, a subtractive cDNA library was constructed to screen for immune response-related genes in the fat bodies of Antheraea pernyi (Lepidoptera: Saturniidae) pupae challenged with Escherichia coli. Four hundred putative EST clones were identified by suppression subtractive hybridization (SSH), including 50 immune response-related genes, three cytoskeleton genes, eight cell cycle and apoptosis genes, five respiration and energy metabolism genes, five transport genes, 40 metabolism genes, ten stress response genes, four transcription and translation regulation genes and 77 unknown genes. To verify the reliability of the SSH data, the transcription of a set of randomly selected immune response-related genes was confirmed by semi-quantitative reverse transcription-PCR (RT-PCR) and real-time quantitative reverse transcription-PCR (qRT-PCR). These identified immune response-related genes provide insight into the innate immunity of A. pernyi. Copyright © 2013 Elsevier Inc. All rights reserved.
Regulatory elements of the floral homeotic gene AGAMOUS identified by phylogenetic footprinting and shadowing.

Energy Technology Data Exchange (ETDEWEB)

Hong, R. L., Hamaguchi, L., Busch, M. A., and Weigel, D.

2003-06-01

OAK-B135 In Arabidopsis thaliana, cis-regulatory sequences of the floral homeotic gene AGAMOUS (AG) are located in the second intron. This 3 kb intron contains binding sites for two direct activators of AG, LEAFY (LFY) and WUSCHEL (WUS), along with other putative regulatory elements. We have used phylogenetic footprinting and the related technique of phylogenetic shadowing to identify putative cis-regulatory elements in this intron. Among 29 Brassicaceae, several other motifs, but not the LFY and WUS binding sites previously identified, are largely invariant. Using reporter gene analyses, we tested six of these motifs and found that they are all functionally important for activity of AG regulatory sequences in A. thaliana. Although there is little obvious sequence similarity outside the Brassicaceae, the intron from cucumber AG has at least partial activity in A. thaliana. Our studies underscore the value of the comparative approach as a tool that complements gene-by-gene promoter dissection, but also highlight that sequence-based studies alone are insufficient for a complete identification of cis-regulatory sites.
Epigenetics-related genes in prostate cancer: expression profile in prostate cancer tissues, androgen-sensitive and -insensitive cell lines.

Science.gov (United States)

Shaikhibrahim, Zaki; Lindstrot, Andreas; Ochsenfahrt, Jacqueline; Fuchs, Kerstin; Wernert, Nicolas

2013-01-01

Epigenetic changes have been suggested to drive prostate cancer (PCa) development and progression. Therefore, in this study, we aimed to identify novel epigenetics-related genes in PCa tissues, and to examine their expression in metastatic PCa cell lines. We analyzed the expression of epigenetics-related genes via a clustering analysis based on gene function in moderately and poorly differentiated PCa glands compared to normal glands of the peripheral zone (prostate proper) from PCa patients using Whole Human Genome Oligo Microarrays. Our analysis identified 12 epigenetics-related genes with a more than 2-fold increase or decrease in expression and a p-value ... epigenetics-related genes that we identified in primary PCa tissues may provide further insight into the role that epigenetic changes play in PCa. Moreover, some of the genes that we identified may play important roles in primary PCa and metastasis, in primary PCa only, or in metastasis only. Follow-up studies are required to investigate the functional role of these genes and the role that their expression plays in the outcome and progression of PCa using tissue microarrays.
Weighted gene co-expression network analysis of expression data of monozygotic twins identifies specific modules and hub genes related to BMI

DEFF Research Database (Denmark)

Wang, Weijing; Jiang, Wenjie; Hou, Lin

2017-01-01

BACKGROUND: The therapeutic management of obesity is challenging, hence further elucidating the underlying mechanisms of obesity development and identifying new diagnostic biomarkers and therapeutic targets are urgent and necessary. Here, we performed differential gene expression analysis......) were with a trend of up-regulation in twins with higher BMI when compared to their siblings. Categories of positive regulation of nitric-oxide synthase biosynthetic process, positive regulation of NF-kappa B import into nucleus, and peroxidase activity were significantly enriched within GO database...
Sequence-Based Introgression Mapping Identifies Candidate White Mold Tolerance Genes in Common Bean

Directory of Open Access Journals (Sweden)

Sujan Mamidi

2016-07-01

White mold, caused by the necrotrophic fungus Sclerotinia sclerotiorum (Lib.) de Bary, is a major disease of common bean (Phaseolus vulgaris L.). WM7.1 and WM8.3 are two quantitative trait loci (QTL) with major effects on tolerance to the pathogen. Advanced backcross populations segregating individually for either of the two QTL, and a recombinant inbred (RI) population segregating for both QTL, were used to fine map and confirm the genetic location of the QTL. The QTL intervals were physically mapped using the reference common bean genome sequence, and the physical intervals for each QTL were further confirmed by sequence-based introgression mapping. Using whole-genome sequence data from susceptible and tolerant DNA pools, introgressed regions were identified as those with significantly higher numbers of single-nucleotide polymorphisms (SNPs) relative to the whole genome. By combining the QTL and SNP data, WM7.1 was located to a 660-kb region that contained 41 gene models on the proximal end of chromosome Pv07, while the WM8.3 introgression was narrowed to a 1.36-Mb region containing 70 gene models. The most polymorphic candidate gene in the WM7.1 region encodes a BEACH-domain protein associated with apoptosis. Within the WM8.3 interval, a receptor-like protein with the potential to recognize pathogen effectors was the most polymorphic gene. The use of gene and sequence-based mapping identified two candidate genes whose putative functions are consistent with the current model of pathogenicity.
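The idea of flagging introgressed regions by their elevated SNP counts can be sketched as a simple window scan. The abstract does not state which statistical test was used, so this example substitutes a Poisson upper-tail test against the chromosome-wide mean count per window. The chromosome length, window size, SNP positions and significance level are all hypothetical.

```python
import math


def poisson_upper_tail(k, mean):
    """P(X >= k) for X ~ Poisson(mean)."""
    cumulative = sum(
        math.exp(-mean) * mean ** i / math.factorial(i) for i in range(k)
    )
    return max(0.0, 1.0 - cumulative)


def snp_dense_windows(snp_positions, chrom_length, window, alpha=0.01):
    """Return (start, end, count) for windows with unusually many SNPs.

    Positions are 1-based. A window is reported when the Poisson
    upper-tail probability of its count, given the mean count per
    window, is below alpha (a stand-in test chosen for illustration).
    """
    n_windows = math.ceil(chrom_length / window)
    counts = [0] * n_windows
    for pos in snp_positions:
        counts[(pos - 1) // window] += 1
    mean = len(snp_positions) / n_windows
    return [
        (i * window + 1, min((i + 1) * window, chrom_length), count)
        for i, count in enumerate(counts)
        if poisson_upper_tail(count, mean) < alpha
    ]


# Toy data: one background SNP per 100-bp window plus a dense cluster at 301-312.
background = list(range(50, 1000, 100))
cluster = list(range(301, 313))
dense = snp_dense_windows(background + cluster, chrom_length=1000, window=100)
```

With real pooled-sequencing data the window size would be far larger and multiple-testing correction would be needed; the sketch only shows the shape of the comparison between local and genome-wide SNP density.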
Digital gene expression profiling of flax (Linum usitatissimum L.) stem peel identifies genes enriched in fiber-bearing phloem tissue.

Science.gov (United States)

Guo, Yuan; Qiu, Caisheng; Long, Songhua; Chen, Ping; Hao, Dongmei; Preisner, Marta; Wang, Hui; Wang, Yufu

2017-08-30

To better understand the molecular mechanisms and gene expression characteristics associated with the development of bast fiber cells within flax stem phloem, the gene expression profiles of flax stem peels and leaves were screened using Illumina's Digital Gene Expression (DGE) analysis. Four DGE libraries (2 for stem peel and 2 for leaf), ranging from 6.7 to 9.2 million clean reads, were obtained, which produced 7.0 million and 6.8 million mapped reads for flax stem peel and leaf, respectively. By differential gene expression analysis, a total of 975 genes, of which 708 (73%) have protein-coding annotation, were identified as phloem-enriched genes putatively involved in the processes of polysaccharide and cell wall metabolism. Differentially expressed genes (DEGs) were validated using quantitative RT-PCR; the expression patterns of all nine genes determined by qRT-PCR agreed well with those obtained by sequencing analysis. Cluster and Gene Ontology (GO) analysis revealed that a large number of genes related to the metabolic process, catalytic activity and binding categories were expressed predominantly in the stem peels. The Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis of the phloem-enriched genes suggested approximately 111 biological pathways. The large number of genes and pathways produced from DGE sequencing will expand our understanding of the complex molecular and cellular events in flax bast fiber development and provide a foundation for future studies on fiber development in other bast fiber crops. Copyright © 2017 Elsevier B.V. All rights reserved.
Renal Gene Expression Database (RGED): a relational database of gene expression profiles in kidney disease.

Science.gov (United States)

Zhang, Qingzhou; Yang, Bo; Chen, Xujiao; Xu, Jing; Mei, Changlin; Mao, Zhiguo

2014-01-01

We present a bioinformatics database named Renal Gene Expression Database (RGED), which contains comprehensive gene expression data sets from renal disease research. The web-based interface of RGED allows users to query the gene expression profiles in various kidney-related samples, including renal cell lines, human kidney tissues and murine model kidneys. Researchers can explore certain gene profiles and the relationships between genes of interest, and identify biomarkers or even drug targets in kidney diseases. The aim of this work is to provide a user-friendly utility for the renal disease research community to query expression profiles of genes of their own interest without the requirement of advanced computational skills. The website is implemented in PHP, R, MySQL and Nginx and is freely available from http://rged.wall-eva.net. © The Author(s) 2014. Published by Oxford University Press.
Renal Gene Expression Database (RGED): a relational database of gene expression profiles in kidney disease

Science.gov (United States)

Zhang, Qingzhou; Yang, Bo; Chen, Xujiao; Xu, Jing; Mei, Changlin; Mao, Zhiguo

2014-01-01

We present a bioinformatics database named Renal Gene Expression Database (RGED), which contains comprehensive gene expression data sets from renal disease research. The web-based interface of RGED allows users to query the gene expression profiles in various kidney-related samples, including renal cell lines, human kidney tissues and murine model kidneys. Researchers can explore certain gene profiles and the relationships between genes of interest, and identify biomarkers or even drug targets in kidney diseases. The aim of this work is to provide a user-friendly utility for the renal disease research community to query expression profiles of genes of their own interest without the requirement of advanced computational skills. Availability and implementation: The website is implemented in PHP, R, MySQL and Nginx and is freely available from http://rged.wall-eva.net. Database URL: http://rged.wall-eva.net PMID:25252782
Exome sequencing in 53 sporadic cases of schizophrenia identifies 18 putative candidate genes.

Directory of Open Access Journals (Sweden)

Michel Guipponi

Full Text Available Schizophrenia (SCZ) is a severe, debilitating mental illness which has a significant genetic component. The identification of genetic factors related to SCZ has been challenging and these factors remain largely unknown. To evaluate the contribution of de novo variants (DNVs) to SCZ, we sequenced the exomes of 53 individuals with sporadic SCZ and of their non-affected parents. We identified 49 DNVs, 18 of which were predicted to alter gene function, including 13 damaging missense mutations, 2 conserved splice site mutations, 2 nonsense mutations, and 1 frameshift deletion. The average number of exonic DNVs per proband was 0.88, which corresponds to an exonic point mutation rate of 1.7×10^-8 per nucleotide per generation. The non-synonymous-to-synonymous mutation ratio of 2.06 did not differ from neutral expectations. Overall, this study provides a list of 18 putative candidate genes for sporadic SCZ, and when combined with the results of similar reports, identifies a second proband carrying a non-synonymous DNV in the RGS12 gene.
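The link between 0.88 exonic DNVs per proband and a rate of 1.7×10^-8 per nucleotide per generation can be checked with simple arithmetic. The sketch below assumes the usual convention that the per-base rate is the number of de novo events divided by twice the number of haploid bases surveyed (two parental copies per child). The abstract does not state the formula or the callable exome size, so both are assumptions here and the function names are hypothetical.

```python
def per_base_rate(dnv_per_proband, haploid_bases):
    """De novo rate per nucleotide per generation, assuming a diploid target."""
    return dnv_per_proband / (2 * haploid_bases)

def implied_haploid_bases(dnv_per_proband, rate):
    """Haploid target size that would make the two reported numbers consistent."""
    return dnv_per_proband / (2 * rate)

# Values reported in the abstract.
target = implied_haploid_bases(0.88, 1.7e-8)
print(f"implied callable exome: {target / 1e6:.1f} Mb")
```

Under this convention the two reported figures imply roughly 26 Mb of callable haploid exome sequence, which is a plausible size for an exome capture target.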
An Evolutionary Genomic Approach to Identify Genes Involved in Human Birth Timing

Science.gov (United States)

Orabona, Guilherme; Morgan, Thomas; Haataja, Ritva; Hallman, Mikko; Puttonen, Hilkka; Menon, Ramkumar; Kuczynski, Edward; Norwitz, Errol; Snegovskikh, Victoria; Palotie, Aarno; Fellman, Vineta; DeFranco, Emily A.; Chaudhari, Bimal P.; McGregor, Tracy L.; McElroy, Jude J.; Oetjens, Matthew T.; Teramo, Kari; Borecki, Ingrid; Fay, Justin; Muglia, Louis

2011-01-01

Coordination of fetal maturation with birth timing is essential for mammalian reproduction. In humans, preterm birth is a disorder of profound global health significance. The signals initiating parturition in humans have remained elusive, due to divergence in physiological mechanisms between humans and model organisms typically studied. Because of relatively large human head size and narrow birth canal cross-sectional area compared to other primates, we hypothesized that genes involved in parturition would display accelerated evolution along the human and/or higher primate phylogenetic lineages to decrease the length of gestation and promote delivery of a smaller fetus that transits the birth canal more readily. Further, we tested whether current variation in such accelerated genes contributes to preterm birth risk. Evidence from allometric scaling of gestational age suggests human gestation has been shortened relative to other primates. Consistent with our hypothesis, many genes involved in reproduction show human acceleration in their coding or adjacent noncoding regions. We screened >8,400 SNPs in 150 human accelerated genes in 165 Finnish preterm and 163 control mothers for association with preterm birth. In this cohort, the most significant association was in FSHR, and 8 of the 10 most significant SNPs were in this gene. Further evidence for association of a linkage disequilibrium block of SNPs in FSHR, rs11686474, rs11680730, rs12473870, and rs1247381 was found in African Americans. By considering human acceleration, we identified a novel gene that may be associated with preterm birth, FSHR. We anticipate other human accelerated genes will similarly be associated with preterm birth risk and elucidate essential pathways for human parturition. PMID:21533219
DRUMS: a human disease related unique gene mutation search engine.

Science.gov (United States)

Li, Zuofeng; Liu, Xingnan; Wen, Jingran; Xu, Ye; Zhao, Xin; Li, Xuan; Liu, Lei; Zhang, Xiaoyan

2011-10-01

With the completion of the human genome project and the development of new methods for gene variant detection, the integration of mutation data and its phenotypic consequences has become more important than ever. Among all available resources, locus-specific databases (LSDBs) curate one or more specific genes' mutation data along with high-quality phenotypes. Although some genotype-phenotype data from LSDBs have been integrated into central databases, little effort has been made to integrate all these data by a search engine approach. In this work, we have developed the disease related unique gene mutation search engine (DRUMS), a search engine for human disease related unique gene mutations, as a convenient tool for biologists or physicians to retrieve gene variant and related phenotype information. Gene variant and phenotype information were stored in a gene-centred relational database. Moreover, the relationships between mutations and diseases were indexed by the uniform resource identifier from the LSDB or another central database. By querying DRUMS, users can access the most popular mutation databases under one interface. DRUMS could be treated as a domain-specific search engine. By using web crawling, indexing, and searching technologies, it provides a competitively efficient interface for searching and retrieving mutation data and their relationships to diseases. The present system is freely accessible at http://www.scbit.org/glif/new/drums/index.html. © 2011 Wiley-Liss, Inc.
Transcriptomic analysis identifies genes and pathways related to myrmecophagy in the Malayan pangolin (Manis javanica)

Directory of Open Access Journals (Sweden)

Jing-E Ma

2017-12-01

Full Text Available The Malayan pangolin (Manis javanica) is an unusual, scale-covered, toothless mammal that specializes in myrmecophagy. Due to their threatened status and continuing decline in the wild, concerted efforts have been made to conserve and rescue this species in captivity in China. Maintaining this species in captivity is a significant challenge, partly because little is known of the molecular mechanisms of its digestive system. Here, the first large-scale sequencing analyses of the salivary gland, liver and small intestine transcriptomes of an adult M. javanica genome were performed, and the results were compared with published liver transcriptome profiles for a pregnant M. javanica female. A total of 24,452 transcripts were obtained, among which 22,538 were annotated on the basis of seven databases. In addition, 3,373 new genes were predicted, of which 1,459 were annotated. Several pathways were found to be involved in myrmecophagy, including olfactory transduction, amino sugar and nucleotide sugar metabolism, lipid metabolism, and terpenoid and polyketide metabolism pathways. Many of the annotated transcripts were involved in digestive functions: 997 transcripts were related to sensory perception, 129 were related to digestive enzyme gene families, and 199 were related to molecular transporters. One transcript for an acidic mammalian chitinase was found in the annotated data, and this might be closely related to the unique digestive function of pangolins. These pathways and transcripts are involved in specialization processes related to myrmecophagy (a form of insectivory) and carbohydrate, protein and lipid digestive pathways, probably reflecting adaptations to myrmecophagy. Our study is the first to investigate the molecular mechanisms underlying myrmecophagy in M. javanica, and we hope that our results may play a role in the conservation of this species.
Mutational analysis of EGFR and related signaling pathway genes in lung adenocarcinomas identifies a novel somatic kinase domain mutation in FGFR4.

Directory of Open Access Journals (Sweden)

Jenifer L Marks

2007-05-01

Full Text Available Fifty percent of lung adenocarcinomas harbor somatic mutations in six genes that encode proteins in the EGFR signaling pathway, i.e., EGFR, HER2/ERBB2, HER4/ERBB4, PIK3CA, BRAF, and KRAS. We performed mutational profiling of a large cohort of lung adenocarcinomas to uncover other potential somatic mutations in genes of this signaling pathway that could contribute to lung tumorigenesis. We analyzed genomic DNA from a total of 261 resected, clinically annotated non-small cell lung cancer (NSCLC) specimens. The coding sequences of 39 genes were screened for somatic mutations via high-throughput dideoxynucleotide sequencing of PCR-amplified gene products. Mutations were considered to be somatic only if they were found in an independent tumor-derived PCR product but not in matched normal tissue. Sequencing of 9 Mb of tumor sequence identified 239 putative genetic variants. We further examined 22 variants found in RAS family genes and 135 variants localized to exons encoding the kinase domain of respective proteins. We identified a total of 37 non-synonymous somatic mutations; 36 were found collectively in EGFR, KRAS, BRAF, and PIK3CA. One somatic mutation was a previously unreported mutation in the kinase domain (exon 16) of FGFR4 (Glu681Lys), identified in 1 of 158 tumors. The FGFR4 mutation is analogous to a reported tumor-specific somatic mutation in ERBB2 and is located in the same exon as a previously reported kinase domain mutation in FGFR4 (Pro712Thr) in a lung adenocarcinoma cell line. This study is one of the first comprehensive mutational analyses of major genes in a specific signaling pathway in a sizeable cohort of lung adenocarcinomas. Our results suggest the majority of gain-of-function mutations within kinase genes in the EGFR signaling pathway have already been identified. Our findings also implicate FGFR4 in the pathogenesis of a subset of lung adenocarcinomas.
Identifying Stable Reference Genes for qRT-PCR Normalisation in Gene Expression Studies of Narrow-Leafed Lupin (Lupinus angustifolius L.).

Directory of Open Access Journals (Sweden)

Candy M Taylor

Full Text Available Quantitative Reverse Transcription PCR (qRT-PCR) is currently one of the most popular, high-throughput and sensitive technologies available for quantifying gene expression. Its accurate application depends heavily upon normalisation of gene-of-interest data with reference genes that are uniformly expressed under experimental conditions. The aim of this study was to provide the first validation of reference genes for Lupinus angustifolius (narrow-leafed lupin), a significant grain legume crop, using a selection of seven genes previously trialed as reference genes for the model legume, Medicago truncatula. In a preliminary evaluation, the seven candidate reference genes were assessed on the basis of primer specificity for their respective targeted region, PCR amplification efficiency, and ability to discriminate between cDNA and gDNA. Following this assessment, expression of the three most promising candidates [Ubiquitin C (UBC), Helicase (HEL), and Polypyrimidine tract-binding protein (PTB)] was evaluated using the NormFinder and RefFinder statistical algorithms in two narrow-leafed lupin lines, both with and without vernalisation treatment, and across seven organ types (cotyledons, stem, leaves, shoot apical meristem, flowers, pods and roots) encompassing three developmental stages. UBC was consistently identified as the most stable candidate and has sufficiently uniform expression that it may be used as a sole reference gene under the experimental conditions tested here. However, as organ type and developmental stage were associated with greater variability in relative expression, using UBC and HEL as a pair is recommended to achieve optimal normalisation. These results highlight the importance of rigorously assessing candidate reference genes for each species across a diverse range of organs and developmental stages. With emerging technologies, such as RNAseq, and the completion of valuable transcriptome data sets, it is possible that other
ColoLipidGene: signature of lipid metabolism-related genes to predict prognosis in stage-II colon cancer patients

Science.gov (United States)

Vargas, Teodoro; Moreno-Rubio, Juan; Herranz, Jesús; Cejas, Paloma; Molina, Susana; González-Vallinas, Margarita; Mendiola, Marta; Burgos, Emilio; Aguayo, Cristina; Custodio, Ana B.; Machado, Isidro; Ramos, David; Gironella, Meritxell; Espinosa-Salinas, Isabel; Ramos, Ricardo; Martín-Hernández, Roberto; Risueño, Alberto; De Las Rivas, Javier; Reglero, Guillermo; Yaya, Ricardo; Fernández-Martos, Carlos; Aparicio, Jorge; Maurel, Joan; Feliu, Jaime; de Molina, Ana Ramírez

2015-01-01

Lipid metabolism plays an essential role in carcinogenesis due to the requirements of tumoral cells to sustain increased structural, energetic and biosynthetic precursor demands for cell proliferation. We investigated the association between expression of lipid metabolism-related genes and clinical outcome in intermediate-stage colon cancer patients with the aim of identifying a metabolic profile associated with greater malignancy and increased risk of relapse. The expression profile of 70 lipid metabolism-related genes was determined in 77 patients with stage II colon cancer. Cox regression analyses using c-index methodology were applied to identify a metabolic-related signature associated with prognosis. The metabolic signature was further confirmed in two independent validation sets of 120 patients and additionally, in a group of 264 patients from a public database. The combined analysis of these 4 genes, ABCA1, ACSL1, AGPAT1 and SCD, constitutes a metabolic signature (ColoLipidGene) able to accurately stratify stage II colon cancer patients with 5-fold higher risk of relapse with strong statistical power in the four independent groups of patients. The identification of a group of 4 genes that predict survival in intermediate-stage colon cancer patients allows delineation of a high-risk group that may benefit from adjuvant therapy, and avoids the toxic and unnecessary chemotherapy in patients classified as low-risk group. PMID:25749516
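The c-index mentioned above measures how well a risk score orders patients by time to relapse. As a minimal sketch, assuming Harrell's definition and ignoring tied event times, it can be computed by counting comparable patient pairs: a pair is comparable when the patient with the shorter follow-up actually relapsed, and concordant when that patient also has the higher risk score. The function name and the toy inputs are hypothetical, and the study's own pipeline may differ.

```python
def concordance_index(times, events, risk_scores):
    """Harrell's c-index for right-censored data (tied times are not handled).

    times       : follow-up time per patient
    events      : True/1 if relapse was observed, False/0 if censored
    risk_scores : higher score means higher predicted risk
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue                      # censored patients cannot anchor a pair
        for j in range(n):
            if times[i] < times[j]:       # patient i relapsed before j's last follow-up
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5     # ties in score count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable
```

A value of 0.5 corresponds to random ordering and 1.0 to perfect ordering, which is why the index is convenient for comparing candidate gene signatures of different sizes.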

The Integrative Method Based on the Module-Network for Identifying Driver Genes in Cancer Subtypes

Directory of Open Access Journals (Sweden)

Xinguo Lu

2018-01-01

Full Text Available With advances in next-generation sequencing (NGS) technologies, a large amount of high-throughput genomics data of multiple types is available. A great challenge in exploring cancer progression is to identify the driver genes among the variant genes by analyzing and integrating multiple types of genomics data. Breast cancer is known as a heterogeneous disease. The identification of subtype-specific driver genes is critical to guide the diagnosis, assessment of prognosis and treatment of breast cancer. We developed an integrated framework based on gene expression profiles and copy number variation (CNV) data to identify breast cancer subtype-specific driver genes. In this framework, we employed a statistical machine-learning method to select gene subsets and utilized a module-network analysis method to identify potential candidate driver genes. The final subtype-specific driver genes were acquired by pairwise comparison between subtypes. To validate the specificity of the driver genes, the gene expression data of these genes were used to classify the patient samples with 10-fold cross validation, and enrichment analysis was also conducted on the identified driver genes. The experimental results show that the proposed integrative method can identify potential driver genes and that the classifier built with these genes performed better than classifiers built with genes identified by other methods.
Gene expression patterns combined with network analysis identify hub genes associated with bladder cancer.

Science.gov (United States)

Bi, Dongbin; Ning, Hao; Liu, Shuai; Que, Xinxiang; Ding, Kejia

2015-06-01

To explore molecular mechanisms of bladder cancer (BC), network strategy was used to find biomarkers for early detection and diagnosis. The differentially expressed genes (DEGs) between bladder carcinoma patients and normal subjects were screened using empirical Bayes method of the linear models for microarray data package. Co-expression networks were constructed by differentially co-expressed genes and links. Regulatory impact factors (RIF) metric was used to identify critical transcription factors (TFs). The protein-protein interaction (PPI) networks were constructed by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and clusters were obtained through molecular complex detection (MCODE) algorithm. Centralities analyses for complex networks were performed based on degree, stress and betweenness. Enrichment analyses were performed based on Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Co-expression networks and TFs (based on expression data of global DEGs and DEGs in different stages and grades) were identified. Hub genes of complex networks, such as UBE2C, ACTA2, FABP4, CKS2, FN1 and TOP2A, were also obtained according to analysis of degree. In gene enrichment analyses of global DEGs, cell adhesion, proteinaceous extracellular matrix and extracellular matrix structural constituent were top three GO terms. ECM-receptor interaction, focal adhesion, and cell cycle were significant pathways. Our results provide some potential underlying biomarkers of BC. However, further validation is required and deep studies are needed to elucidate the pathogenesis of BC. Copyright © 2015 Elsevier Ltd. All rights reserved.
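Hub genes in the abstract above are selected by degree, that is, by the number of distinct interaction partners a gene has in the network. A minimal sketch of that ranking follows, assuming the network is available as an undirected edge list. The function names are hypothetical and the example identifiers in the usage note are placeholders, not genes or interactions from the study.

```python
from collections import defaultdict

def degree_centrality(edges):
    """Count distinct neighbours per node in an undirected edge list."""
    neighbours = defaultdict(set)
    for a, b in edges:
        if a == b:
            continue                     # ignore self-loops
        neighbours[a].add(b)
        neighbours[b].add(a)
    return {node: len(ns) for node, ns in neighbours.items()}

def top_hubs(edges, k=3):
    """Return the k highest-degree nodes, ties broken alphabetically."""
    degree = degree_centrality(edges)
    return sorted(degree.items(), key=lambda kv: (-kv[1], kv[0]))[:k]
```

For instance, `top_hubs([("A", "B"), ("A", "C"), ("A", "D"), ("B", "C")], k=2)` ranks the placeholder node `A` first with three partners. Stress and betweenness, the other two measures named in the abstract, depend on shortest paths and need a graph library or a breadth-first search on top of this.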
A search engine to identify pathway genes from expression data on multiple organisms

Directory of Open Access Journals (Sweden)

Zambon Alexander C

2007-05-01

Full Text Available Abstract Background The completion of several genome projects showed that most genes have not yet been characterized, especially in multicellular organisms. Although most genes have unknown functions, a large collection of data is available describing their transcriptional activities under many different experimental conditions. In many cases, the coregulation of a set of genes across a set of conditions can be used to infer roles for genes of unknown function. Results We developed a search engine, the Multiple-Species Gene Recommender (MSGR), which scans gene expression datasets from multiple organisms to identify genes that participate in a genetic pathway. The MSGR takes a query consisting of a list of genes that function together in a genetic pathway from one of six organisms: Homo sapiens, Drosophila melanogaster, Caenorhabditis elegans, Saccharomyces cerevisiae, Arabidopsis thaliana, and Helicobacter pylori. Using a probabilistic method to merge searches, the MSGR identifies genes that are significantly coregulated with the query genes in one or more of those organisms. The MSGR achieves its highest accuracy for many human pathways when searches are combined across species. We describe specific examples in which new genes were identified to be involved in a neuromuscular signaling pathway and a cell-adhesion pathway. Conclusion The search engine can scan large collections of gene expression data for new genes that are significantly coregulated with a pathway of interest. By integrating searches across organisms, the MSGR can identify pathway members whose coregulation is either ancient or newly evolved.
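The core idea of ranking genes by coregulation with a query set can be illustrated in a few lines. The sketch below is not the MSGR algorithm, which uses a probabilistic method and merges searches across species. It substitutes a much simpler single-organism scheme: average the query genes' expression profiles and rank every other gene by its Pearson correlation with that average. All names are hypothetical.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length profiles (0.0 if either is flat)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)

def rank_candidates(expression, query_genes):
    """Rank non-query genes by correlation with the mean query profile.

    expression : dict mapping gene -> list of expression values per condition
    query_genes: genes known to act in the pathway of interest
    """
    n_conditions = len(next(iter(expression.values())))
    centroid = [
        sum(expression[g][i] for g in query_genes) / len(query_genes)
        for i in range(n_conditions)
    ]
    scores = {
        gene: pearson(profile, centroid)
        for gene, profile in expression.items()
        if gene not in query_genes
    }
    return sorted(scores.items(), key=lambda kv: -kv[1])
```

A real search would also select the subset of conditions in which the query genes themselves are coregulated, since a pathway is rarely coordinated across every experiment in a compendium.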
Rapid Communication: MiR-92a as a housekeeping gene for analysis of bovine mastitis-related microRNA in milk.

Science.gov (United States)

Lai, Y C; Fujikawa, T; Ando, T; Kitahara, G; Koiwa, M; Kubota, C; Miura, N

2017-06-01

Our aim was to identify a suitable microRNA housekeeping gene for real-time PCR analysis of bovine mastitis-related microRNA in milk. We identified three microRNAs as housekeeping gene candidates on the basis of previous Solexa sequencing results. Threshold cycle (CT) values for the three candidates did not differ between milk from control cows and milk from mastitis-affected cows. NormFinder software identified one of the candidates as the most stable single housekeeping gene. We evaluated the suitability of the housekeeping gene candidates by using them to assess expression levels of an inflammation-related gene. Regardless of the housekeeping gene candidates used for normalization, relative expression levels of this gene were significantly higher in mastitis-affected samples than in control samples. However, of all the housekeeping genes and gene combinations investigated, normalization with miR-92a alone generated the difference in relative expression between mastitis-affected and control samples with the highest significance. These results suggest that miR-92a is suitable for use as a housekeeping gene for analysis of bovine mastitis-related microRNA in milk.
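Normalizing a target microRNA against a housekeeping gene from threshold cycle values is commonly done with the 2^-ΔΔCT calculation. The sketch below assumes that method and roughly 100% amplification efficiency for both assays. The abstract does not say which formula the authors used, and the function name and the CT values in the usage note are made up for illustration.

```python
def relative_expression(ct_target_sample, ct_ref_sample,
                        ct_target_control, ct_ref_control):
    """Fold change of a target gene by the 2^-ddCT method.

    Each CT is first normalized to the housekeeping (reference) gene measured
    in the same sample, then the test sample is compared with the control.
    """
    delta_sample = ct_target_sample - ct_ref_sample
    delta_control = ct_target_control - ct_ref_control
    return 2.0 ** -(delta_sample - delta_control)
```

With hypothetical values, `relative_expression(24, 20, 26, 20)` returns 4.0: the target crosses threshold two cycles earlier in the test sample once both are normalized, which corresponds to a four-fold increase. The result is only as reliable as the stability of the reference gene, which is the point of the study above.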
Analysis of Pigeon (Columba) Ovary Transcriptomes to Identify Genes Involved in Blue Light Regulation

Science.gov (United States)

Wang, Ying; Ding, Jia-tong; Yang, Hai-ming; Yan, Zheng-jie; Cao, Wei; Li, Yang-bai

2015-01-01

Monochromatic light is widely applied to promote poultry reproductive performance, yet little is currently known regarding the mechanism by which light wavelengths affect pigeon reproduction. Recently, high-throughput sequencing technologies have been used to provide genomic information for solving this problem. In this study, we employed Illumina Hiseq 2000 to identify differentially expressed genes in ovary tissue from pigeons under blue and white light conditions and de novo transcriptome assembly to construct a comprehensive sequence database containing information on the mechanisms of follicle development. A total of 157,774 unigenes (mean length: 790 bp) were obtained by the Trinity program, and 35.83% of these unigenes were matched to genes in a non-redundant protein database. Gene description, gene ontology, and the clustering of orthologous group terms were performed to annotate the transcriptome assembly. Differentially expressed genes between blue and white light conditions included those related to oocyte maturation, hormone biosynthesis, and circadian rhythm. Furthermore, 17,574 SSRs and 533,887 potential SNPs were identified in this transcriptome assembly. This work is the first transcriptome analysis of the Columba ovary using Illumina technology, and the resulting transcriptome and differentially expressed gene data can facilitate further investigations into the molecular mechanism of the effect of blue light on follicle development and reproduction in pigeons and other bird species. PMID:26599806
Analysis of Pigeon (Columba) Ovary Transcriptomes to Identify Genes Involved in Blue Light Regulation.

Directory of Open Access Journals (Sweden)

Ying Wang

Full Text Available Monochromatic light is widely applied to promote poultry reproductive performance, yet little is currently known regarding the mechanism by which light wavelengths affect pigeon reproduction. Recently, high-throughput sequencing technologies have been used to provide genomic information for solving this problem. In this study, we employed Illumina Hiseq 2000 to identify differentially expressed genes in ovary tissue from pigeons under blue and white light conditions and de novo transcriptome assembly to construct a comprehensive sequence database containing information on the mechanisms of follicle development. A total of 157,774 unigenes (mean length: 790 bp) were obtained by the Trinity program, and 35.83% of these unigenes were matched to genes in a non-redundant protein database. Gene description, gene ontology, and the clustering of orthologous group terms were performed to annotate the transcriptome assembly. Differentially expressed genes between blue and white light conditions included those related to oocyte maturation, hormone biosynthesis, and circadian rhythm. Furthermore, 17,574 SSRs and 533,887 potential SNPs were identified in this transcriptome assembly. This work is the first transcriptome analysis of the Columba ovary using Illumina technology, and the resulting transcriptome and differentially expressed gene data can facilitate further investigations into the molecular mechanism of the effect of blue light on follicle development and reproduction in pigeons and other bird species.
Gene expression profile identifies potential biomarkers for human intervertebral disc degeneration.

Science.gov (United States)

Guo, Wei; Zhang, Bin; Li, Yan; Duan, Hui-Quan; Sun, Chao; Xu, Yun-Qiang; Feng, Shi-Qing

2017-12-01

The present study aimed to reveal the potential genes associated with the pathogenesis of intervertebral disc degeneration (IDD) by analyzing microarray data using bioinformatics. Gene expression profiles of two regions of the intervertebral disc were compared between patients with IDD and controls. GSE70362 containing two groups of gene expression profiles, 16 nucleus pulposus (NP) samples from patients with IDD and 8 from controls, and 16 annulus fibrosus (AF) samples from patients with IDD and 8 from controls, was downloaded from the Gene Expression Omnibus database. A total of 93 and 114 differentially expressed genes (DEGs) were identified in NP and AF samples, respectively, using a limma software package for the R programming environment. Gene Ontology (GO) function enrichment analysis was performed to identify the associated biological functions of DEGs in IDD, which indicated that the DEGs may be involved in various processes, including cell adhesion, biological adhesion and extracellular matrix organization. Pathway enrichment analysis using the Kyoto Encyclopedia of Genes and Genomes (KEGG) demonstrated that the identified DEGs were potentially involved in focal adhesion and the p53 signaling pathway. Further analysis revealed that there were 35 common DEGs observed between the two regions (NP and AF), which may be further regulated by 6 clusters of microRNAs (miRNAs) retrieved with WebGestalt. The genes in the DEG‑miRNA regulatory network were annotated using GO function and KEGG pathway enrichment analysis, among which extracellular matrix organization was the most significant disrupted biological process and focal adhesion was the most significant dysregulated pathway. In addition, the result of protein‑protein interaction network modules demonstrated the involvement of inflammatory cytokine interferon signaling in IDD. These findings may not only advance the understanding of the pathogenesis of IDD, but also identify novel potential
Link-based quantitative methods to identify differentially coexpressed genes and gene Pairs

Directory of Open Access Journals (Sweden)

Ye Zhi-Qiang

2011-08-01

Full Text Available Abstract Background Differential coexpression analysis (DCEA) is increasingly used for investigating the global transcriptional mechanisms underlying phenotypic changes. Current DCEA methods mostly adopt a gene connectivity-based strategy to estimate differential coexpression, which is characterized by comparing the numbers of gene neighbors in different coexpression networks. Although it simplifies the calculation, this strategy mixes up the identities of different coexpression neighbors of a gene, and fails to differentiate significant differential coexpression changes from trivial ones. In particular, correlation reversal is easily missed although it probably indicates remarkable biological significance. Results We developed two link-based quantitative methods, DCp and DCe, to identify differentially coexpressed genes and gene pairs (links). By exploiting the quantitative coexpression change of each gene pair in the coexpression networks, both methods proved to be superior to currently popular methods in simulation studies. Re-mining of a publicly available type 2 diabetes (T2D) expression dataset from the perspective of differential coexpression analysis led to discoveries beyond those from differential expression analysis. Conclusions This work pointed out the critical weakness of current popular DCEA methods, and proposed two link-based DCEA algorithms that will contribute to the development of DCEA and help extend it to a broader spectrum.
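The contrast between connectivity-based and link-based scoring can be made concrete with a small example. The sketch below is not the authors' DCp or DCe method. It is a simplified link-based score under stated assumptions: for every gene pair, take the difference between the Pearson correlations in two conditions, then summarize each gene by the root mean square of the differences over all of its links. A pair whose correlation flips from +1 to -1 keeps the same neighbor count but yields the maximum link difference of 2. All names are hypothetical.

```python
import math
from itertools import combinations

def pearson(x, y):
    """Pearson correlation of two equal-length profiles (0.0 if either is flat)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)

def link_differences(cond_a, cond_b):
    """Correlation change per gene pair; both dicts must share the same genes."""
    genes = sorted(cond_a)
    return {
        (g, h): pearson(cond_a[g], cond_a[h]) - pearson(cond_b[g], cond_b[h])
        for g, h in combinations(genes, 2)
    }

def gene_scores(cond_a, cond_b):
    """Root-mean-square correlation change over all links of each gene."""
    per_gene = {g: [] for g in cond_a}
    for (g, h), diff in link_differences(cond_a, cond_b).items():
        per_gene[g].append(diff)
        per_gene[h].append(diff)
    return {
        g: math.sqrt(sum(d * d for d in diffs) / len(diffs))
        for g, diffs in per_gene.items()
        if diffs
    }
```

Because the score is built from signed correlation values rather than thresholded neighbor counts, a reversal contributes more than a correlation that merely weakens, which is the behavior the abstract argues connectivity-based methods miss.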
Citrus plastid-related gene profiling based on expressed sequence tag analyses

Directory of Open Access Journals (Sweden)

Tercilio Calsa Jr.

2007-01-01

Full Text Available Plastid-related sequences, derived from putative nuclear or plastome genes, were searched in a large collection of expressed sequence tags (ESTs) and genomic sequences from the Citrus Biotechnology initiative in Brazil. The identified putative Citrus chloroplast gene sequences were compared to those from Arabidopsis, Eucalyptus and Pinus. Differential expression profiling for plastid-directed nuclear-encoded proteins and photosynthesis-related gene expression variation between Citrus sinensis and Citrus reticulata, when inoculated or not with Xylella fastidiosa, were also analyzed. Presumed Citrus plastome regions were more similar to Eucalyptus. Some putative genes appeared to be preferentially expressed in vegetative tissues (leaves and bark) or in reproductive organs (flowers and fruits). Genes preferentially expressed in fruit and flower may be associated with hypothetical physiological functions. Expression pattern clustering analysis suggested that photosynthesis- and carbon fixation-related genes appeared to be up- or down-regulated in a resistant or susceptible Citrus species after Xylella inoculation in comparison to non-infected controls, generating novel information which may be helpful to develop novel genetic manipulation strategies to control Citrus variegated chlorosis (CVC).
An elm EST database for identifying leaf beetle egg-induced defense genes

Directory of Open Access Journals (Sweden)

Büchel Kerstin

2012-06-01

Full Text Available Abstract Background Plants can defend themselves against herbivorous insects prior to the onset of larval feeding by responding to the eggs laid on their leaves. In the European field elm (Ulmus minor), egg laying by the elm leaf beetle (Xanthogaleruca luteola) activates the emission of volatiles that attract specialised egg parasitoids, which in turn kill the eggs. Little is known about the transcriptional changes that insect eggs trigger in plants and how such indirect defense mechanisms are orchestrated in the context of other biological processes. Results Here we present the first large scale study of egg-induced changes in the transcriptional profile of a tree. Five cDNA libraries were generated from leaves of (i) untreated control elms, and elms treated with (ii) egg laying and feeding by elm leaf beetles, (iii) feeding, (iv) artificial transfer of egg clutches, and (v) methyl jasmonate. A total of 361,196 expressed sequence tags (ESTs) were identified which clustered into 52,823 unique transcripts (Unitrans) and were stored in a database with a public web interface. Among the analyzed Unitrans, 73% could be annotated by homology to known genes in the UniProt (Plant) database, particularly to those from Vitis, Ricinus, Populus and Arabidopsis. Comparative in silico analysis among the different treatments revealed differences in Gene Ontology term abundances. Defense- and stress-related gene transcripts were present in high abundance in leaves after herbivore egg laying, but transcripts involved in photosynthesis showed decreased abundance. Many pathogen-related genes and genes involved in phytohormone signaling were expressed, indicative of jasmonic acid biosynthesis and activation of jasmonic acid responsive genes. Cross-comparisons between different libraries based on expression profiles allowed the identification of genes with a potential relevance in egg-induced defenses, as well as other biological processes, including signal transduction
An elm EST database for identifying leaf beetle egg-induced defense genes.

Science.gov (United States)

Büchel, Kerstin; McDowell, Eric; Nelson, Will; Descour, Anne; Gershenzon, Jonathan; Hilker, Monika; Soderlund, Carol; Gang, David R; Fenning, Trevor; Meiners, Torsten

2012-06-15

Plants can defend themselves against herbivorous insects prior to the onset of larval feeding by responding to the eggs laid on their leaves. In the European field elm (Ulmus minor), egg laying by the elm leaf beetle (Xanthogaleruca luteola) activates the emission of volatiles that attract specialised egg parasitoids, which in turn kill the eggs. Little is known about the transcriptional changes that insect eggs trigger in plants and how such indirect defense mechanisms are orchestrated in the context of other biological processes. Here we present the first large scale study of egg-induced changes in the transcriptional profile of a tree. Five cDNA libraries were generated from leaves of (i) untreated control elms, and elms treated with (ii) egg laying and feeding by elm leaf beetles, (iii) feeding, (iv) artificial transfer of egg clutches, and (v) methyl jasmonate. A total of 361,196 expressed sequence tags (ESTs) were identified, which clustered into 52,823 unique transcripts (Unitrans) and were stored in a database with a public web interface. Among the analyzed Unitrans, 73% could be annotated by homology to known genes in the UniProt (Plant) database, particularly to those from Vitis, Ricinus, Populus and Arabidopsis. Comparative in silico analysis among the different treatments revealed differences in Gene Ontology term abundances. Defense- and stress-related gene transcripts were present in high abundance in leaves after herbivore egg laying, but transcripts involved in photosynthesis showed decreased abundance. Many pathogen-related genes and genes involved in phytohormone signaling were expressed, indicative of jasmonic acid biosynthesis and activation of jasmonic acid responsive genes. Cross-comparisons between different libraries based on expression profiles allowed the identification of genes with a potential relevance in egg-induced defenses, as well as other biological processes, including signal transduction, transport and primary metabolism
A hybrid network-based method for the detection of disease-related genes

Science.gov (United States)

Cui, Ying; Cai, Meng; Dai, Yang; Stanley, H. Eugene

2018-02-01

Detecting disease-related genes is crucial in disease diagnosis and drug design. The accepted view is that neighbors of a disease-causing gene in a molecular network tend to cause the same or similar diseases, and network-based methods have been recently developed to identify novel hereditary disease-genes in available biomedical networks. Despite the steady increase in the discovery of disease-associated genes, there is still a large fraction of disease genes that remains under the tip of the iceberg. In this paper we exploit the topological properties of the protein-protein interaction (PPI) network to detect disease-related genes. We compute, analyze, and compare the topological properties of disease genes with non-disease genes in PPI networks. We also design an improved random forest classifier based on these network topological features, and a cross-validation test confirms that our method performs better than previous similar studies.
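The abstract above does not list which topological properties were used, so the following is only a minimal sketch of the general idea: computing two common per-node features (degree and local clustering coefficient) from an undirected interaction network. The function names and the toy edge list are hypothetical and are not taken from the paper; features like these could then be fed to a classifier such as a random forest.

```python
from itertools import combinations


def build_adjacency(edges):
    """Build an undirected adjacency map, ignoring self-loops."""
    adj = {}
    for u, v in edges:
        if u == v:
            continue
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def clustering_coefficient(adj, node):
    """Fraction of neighbour pairs of `node` that are themselves linked."""
    nbrs = adj[node]
    k = len(nbrs)
    if k < 2:
        return 0.0
    links = sum(1 for a, b in combinations(nbrs, 2) if b in adj[a])
    return 2.0 * links / (k * (k - 1))


def topological_features(edges):
    """Return {node: (degree, clustering coefficient)} for every node."""
    adj = build_adjacency(edges)
    return {n: (len(adj[n]), clustering_coefficient(adj, n)) for n in adj}


# Hypothetical toy network: a triangle A-B-C with a pendant node D on C.
TOY_EDGES = [("A", "B"), ("A", "C"), ("B", "C"), ("C", "D")]

if __name__ == "__main__":
    for node, (deg, cc) in sorted(topological_features(TOY_EDGES).items()):
        print(node, deg, round(cc, 3))
```

In the toy network, node C has the highest degree but a lower clustering coefficient than A or B, which illustrates why several topological features are usually combined rather than used singly.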
A graph-search framework for associating gene identifiers with documents

Directory of Open Access Journals (Sweden)

Cohen William W

2006-10-01

Full Text Available Abstract Background One step in the model organism database curation process is to find, for each article, the identifier of every gene discussed in the article. We consider a relaxation of this problem suitable for semi-automated systems, in which each article is associated with a ranked list of possible gene identifiers, and experimentally compare methods for solving this geneId ranking problem. In addition to baseline approaches based on combining named entity recognition (NER) systems with a "soft dictionary" of gene synonyms, we evaluate a graph-based method which combines the outputs of multiple NER systems, as well as other sources of information, and a learning method for reranking the output of the graph-based method. Results We show that named entity recognition (NER) systems with similar F-measure performance can have significantly different performance when used with a soft dictionary for geneId-ranking. The graph-based approach can outperform any of its component NER systems, even without learning, and learning can further improve the performance of the graph-based ranking approach. Conclusion The utility of a named entity recognition (NER) system for geneId-finding may not be accurately predicted by its entity-level F1 performance, the most common performance measure. GeneId-ranking systems are best implemented by combining several NER systems. With appropriate combination methods, usefully accurate geneId-ranking systems can be constructed based on easily-available resources, without resorting to problem-specific, engineered components.
Deep learning of mutation-gene-drug relations from the literature.

Science.gov (United States)

Lee, Kyubum; Kim, Byounggun; Choi, Yonghwa; Kim, Sunkyu; Shin, Wonho; Lee, Sunwon; Park, Sungjoon; Kim, Seongsoon; Tan, Aik Choon; Kang, Jaewoo

2018-01-25

Molecular biomarkers that can predict drug efficacy in cancer patients are crucial components for the advancement of precision medicine. However, identifying these molecular biomarkers remains a laborious and challenging task. Next-generation sequencing of patients and preclinical models have increasingly led to the identification of novel gene-mutation-drug relations, and these results have been reported and published in the scientific literature. Here, we present two new computational methods that utilize all the PubMed articles as domain specific background knowledge to assist in the extraction and curation of gene-mutation-drug relations from the literature. The first method uses the Biomedical Entity Search Tool (BEST) scoring results as some of the features to train the machine learning classifiers. The second method uses not only the BEST scoring results, but also word vectors in a deep convolutional neural network model that are constructed from and trained on numerous documents such as PubMed abstracts and Google News articles. Using the features obtained from both the BEST search engine scores and word vectors, we extract mutation-gene and mutation-drug relations from the literature using machine learning classifiers such as random forest and deep convolutional neural networks. Our methods achieved better results compared with the state-of-the-art methods. We used our proposed features in a simple machine learning model, and obtained F1-scores of 0.96 and 0.82 for mutation-gene and mutation-drug relation classification, respectively. We also developed a deep learning classification model using convolutional neural networks, BEST scores, and the word embeddings that are pre-trained on PubMed or Google News data. Using deep learning, the classification accuracy improved, and F1-scores of 0.96 and 0.86 were obtained for the mutation-gene and mutation-drug relations, respectively. We believe that our computational methods described in this research could be
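For readers unfamiliar with the F1-scores reported above, here is a minimal sketch of how an F1-score can be computed for relation extraction when predicted and gold-standard relations are treated as sets of pairs. The function name and the toy pairs are illustrative assumptions; the abstract does not describe the authors' evaluation code.

```python
def f1_score(predicted, gold):
    """Harmonic mean of precision and recall over two sets of relations."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    return 2 * precision * recall / (precision + recall)


# Toy example (hypothetical data): three gold mutation-gene pairs,
# of which the extractor recovers two and mislabels the third.
GOLD = {("V600E", "BRAF"), ("T790M", "EGFR"), ("G12D", "KRAS")}
PREDICTED = {("V600E", "BRAF"), ("T790M", "EGFR"), ("G12D", "TP53")}

if __name__ == "__main__":
    print(round(f1_score(PREDICTED, GOLD), 3))
```

Here precision and recall are both 2/3, so the F1-score is also 2/3; the measure penalises both missed relations and spurious ones.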
Genomics and relative expression analysis identifies key genes associated with high female to male flower ratio in Jatropha curcas L.

Science.gov (United States)

Gangwar, Manali; Sood, Hemant; Chauhan, Rajinder Singh

2016-04-01

Jatropha curcas has been projected as a major source of biodiesel due to high seed oil content (42 %). A major roadblock for commercialization of Jatropha-based biodiesel is low seed yield per inflorescence, which is affected by low female to male flower ratio (1:25-30). Molecular dissection of female flower development by analyzing genes involved in phase transitions and floral organ development is, therefore, crucial for increasing seed yield. Expression analysis of 42 genes implicated in floral organ development and sex determination was done at six floral developmental stages of a J. curcas genotype (IC561235) with inherently higher female to male flower ratio (1:8-10). Relative expression analysis of these genes was done on a low ratio genotype. Genes TFL1, SUP, AP1, CRY2, CUC2, CKX1, TAA1 and PIN1 were associated with reproductive phase transition. Further, genes CUC2, TAA1, CKX1 and PIN1 were associated with female flowering, while SUP and CRY2 were associated with female flower transition. Relative expression of these genes with respect to the low female flower ratio genotype showed up to a ~7-fold increase in transcript abundance of SUP, TAA1, CRY2 and CKX1 genes in intermediate buds but not a significant increase (~1.25-fold) in female flowers, thereby suggesting that these genes possibly play a significant role in increased transition towards female flowering by promoting abortion of male flower primordia. The outcome of the study has implications in feedstock improvement of J. curcas through functional validation and eventual utilization of key genes associated with female flowering.
Meta-Analysis of Placental Transcriptome Data Identifies a Novel Molecular Pathway Related to Preeclampsia.

Science.gov (United States)

van Uitert, Miranda; Moerland, Perry D; Enquobahrie, Daniel A; Laivuori, Hannele; van der Post, Joris A M; Ris-Stalpers, Carrie; Afink, Gijs B

2015-01-01

Studies using the placental transcriptome to identify key molecules relevant for preeclampsia are hampered by a relatively small sample size. In addition, they use a variety of bioinformatics and statistical methods, making comparison of findings challenging. To generate a more robust preeclampsia gene expression signature, we performed a meta-analysis on the original data of 11 placenta RNA microarray experiments, representing 139 normotensive and 116 preeclamptic pregnancies. Microarray data were pre-processed and analyzed using standardized bioinformatics and statistical procedures and the effect sizes were combined using an inverse-variance random-effects model. Interactions between genes in the resulting gene expression signature were identified by pathway analysis (Ingenuity Pathway Analysis, Gene Set Enrichment Analysis, Graphite) and protein-protein associations (STRING). This approach has resulted in a comprehensive list of differentially expressed genes that led to a 388-gene meta-signature of preeclamptic placenta. Pathway analysis highlights the involvement of the previously identified hypoxia/HIF1A pathway in the establishment of the preeclamptic gene expression profile, while analysis of protein interaction networks indicates CREBBP/EP300 as a novel element central to the preeclamptic placental transcriptome. In addition, there is an apparent high incidence of preeclampsia in women carrying a child with a mutation in CREBBP/EP300 (Rubinstein-Taybi Syndrome). The 388-gene preeclampsia meta-signature offers a vital starting point for further studies into the relevance of these genes (in particular CREBBP/EP300) and their concomitant pathways as biomarkers or functional molecules in preeclampsia. This will result in a better understanding of the molecular basis of this disease and opens up the opportunity to develop rational therapies targeting the placental dysfunction causal to preeclampsia.
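The abstract above names an inverse-variance random-effects model but does not say which between-study variance estimator was used. As a hedged illustration only, the sketch below pools per-study effect sizes using the DerSimonian-Laird estimator, which is one common choice and is an assumption here; the function name and inputs are hypothetical.

```python
import math


def random_effects_pool(effects, variances):
    """Pool study effects with inverse-variance weights and a
    DerSimonian-Laird estimate of between-study variance (tau^2).

    Returns (pooled effect, standard error, tau^2).
    """
    k = len(effects)
    weights = [1.0 / v for v in variances]
    sum_w = sum(weights)
    # Fixed-effect estimate, needed for Cochran's Q.
    fixed = sum(w * y for w, y in zip(weights, effects)) / sum_w
    q = sum(w * (y - fixed) ** 2 for w, y in zip(weights, effects))
    c = sum_w - sum(w * w for w in weights) / sum_w
    tau2 = max(0.0, (q - (k - 1)) / c) if c > 0 else 0.0
    # Random-effects weights add tau^2 to each within-study variance.
    re_weights = [1.0 / (v + tau2) for v in variances]
    sum_re = sum(re_weights)
    pooled = sum(w * y for w, y in zip(re_weights, effects)) / sum_re
    return pooled, math.sqrt(1.0 / sum_re), tau2


if __name__ == "__main__":
    # Two hypothetical studies with equal precision but different effects.
    print(random_effects_pool([0.0, 2.0], [1.0, 1.0]))
```

When the studies agree closely, tau^2 is estimated as zero and the result reduces to the ordinary fixed-effect inverse-variance average; when they disagree, the extra variance term widens the standard error and evens out the weights.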
Ortholog-based screening and identification of genes related to intracellular survival.

Science.gov (United States)

Yang, Xiaowen; Wang, Jiawei; Bing, Guoxia; Bie, Pengfei; De, Yanyan; Lyu, Yanli; Wu, Qingmin

2018-04-20

Bioinformatics and comparative genomics analysis methods were used to predict unknown pathogen genes based on homology with identified or functionally clustered genes. In this study, the genes of common pathogens were analyzed to screen and identify genes associated with intracellular survival through sequence similarity, phylogenetic tree analysis and the λ-Red recombination system test method. The total 38,952 protein-coding genes of common pathogens were divided into 19,775 clusters. As demonstrated through a COG analysis, information storage and processing genes might play an important role in intracellular survival. Only 19 clusters were present in facultative intracellular pathogens, and not all were present in extracellular pathogens. Construction of a phylogenetic tree selected 18 of these 19 clusters. Comparisons with the DEG database and previous research revealed that seven other clusters are considered essential gene clusters and that seven other clusters are associated with intracellular survival. Moreover, this study confirmed that clusters screened by orthologs with similar function could be replaced with an approved uvrY gene and its orthologs, and the results revealed that the usg gene is associated with intracellular survival. The study improves the current understanding of the characteristics of intracellular pathogens and allows further exploration of the intracellular survival-related gene modules in these pathogens. Copyright © 2018. Published by Elsevier B.V.
Identification of pathogenicity‐related genes in Fusarium oxysporum f. sp. cepae

Science.gov (United States)

Vágány, Viktória; Jackson, Alison C.; Harrison, Richard J.; Rainoni, Alessandro; Clarkson, John P.

2016-01-01

Summary Pathogenic isolates of Fusarium oxysporum, distinguished as formae speciales (f. spp.) on the basis of their host specificity, cause crown rots, root rots and vascular wilts on many important crops worldwide. Fusarium oxysporum f. sp. cepae (FOC) is particularly problematic to onion growers worldwide and is increasing in prevalence in the UK. We characterized 31 F. oxysporum isolates collected from UK onions using pathogenicity tests, sequencing of housekeeping genes and identification of effectors. In onion seedling and bulb tests, 21 isolates were pathogenic and 10 were non‐pathogenic. The molecular characterization of these isolates, and 21 additional isolates comprising other f. spp. and different Fusarium species, was carried out by sequencing three housekeeping genes. A concatenated tree separated the F. oxysporum isolates into six clades, but did not distinguish between pathogenic and non‐pathogenic isolates. Ten putative effectors were identified within FOC, including seven Secreted In Xylem (SIX) genes first reported in F. oxysporum f. sp. lycopersici. Two highly homologous proteins with signal peptides and RxLR motifs (CRX1/CRX2) and a gene with no previously characterized domains (C5) were also identified. The presence/absence of nine of these genes was strongly related to pathogenicity against onion and all were shown to be expressed in planta. Different SIX gene complements were identified in other f. spp., but none were identified in three other Fusarium species from onion. Although the FOC SIX genes had a high level of homology with other f. spp., there were clear differences in sequences which were unique to FOC, whereas CRX1 and C5 genes appear to be largely FOC specific. PMID:26609905
Gene-Based Genome-Wide Association Analysis in European and Asian Populations Identified Novel Genes for Rheumatoid Arthritis.

Directory of Open Access Journals (Sweden)

Hong Zhu

Full Text Available Rheumatoid arthritis (RA) is a complex autoimmune disease. Using a gene-based association research strategy, the present study aims to detect unknown susceptibility to RA and to address the ethnic differences in genetic susceptibility to RA between European and Asian populations. Gene-based association analyses were performed with KGG 2.5 by using publicly available large RA datasets (14,361 RA cases and 43,923 controls of European subjects, 4,873 RA cases and 17,642 controls of Asian subjects). For the newly identified RA-associated genes, gene set enrichment analyses and protein-protein interactions analyses were carried out with DAVID and STRING version 10.0, respectively. Differential expression verification was conducted using 4 GEO datasets. The expression levels of three selected 'highly verified' genes were measured by ELISA among our in-house RA cases and controls. A total of 221 RA-associated genes were newly identified by gene-based association study, including 71 'overlapped', 76 'European-specific' and 74 'Asian-specific' genes. Among them, 105 genes had significant differential expressions between RA patients and healthy controls in at least one dataset, especially for 20 genes including 11 'overlapped' (ABCF1, FLOT1, HLA-F, IER3, TUBB, ZKSCAN4, BTN3A3, HSP90AB1, CUTA, BRD2, HLA-DMA), 5 'European-specific' (PHTF1, RPS18, BAK1, TNFRSF14, SUOX) and 4 'Asian-specific' (RNASET2, HFE, BTN2A2, MAPK13) genes whose differential expressions were significant in at least three datasets. The protein expressions of two selected genes, FLOT1 (P value = 1.70E-02) and HLA-DMA (P value = 4.70E-02), in plasma were significantly different in our in-house samples. Our study identified 221 novel RA-associated genes and especially highlighted the importance of 20 candidate genes on RA. The results addressed ethnic genetic background differences for RA susceptibility between European and Asian populations and detected a long list of overlapped or ethnic specific RA
Genome-wide Analyses Identify KIF5A as a Novel ALS Gene.

Science.gov (United States)

Nicolas, Aude; Kenna, Kevin P; Renton, Alan E; Ticozzi, Nicola; Faghri, Faraz; Chia, Ruth; Dominov, Janice A; Kenna, Brendan J; Nalls, Mike A; Keagle, Pamela; Rivera, Alberto M; van Rheenen, Wouter; Murphy, Natalie A; van Vugt, Joke J F A; Geiger, Joshua T; Van der Spek, Rick A; Pliner, Hannah A; Shankaracharya; Smith, Bradley N; Marangi, Giuseppe; Topp, Simon D; Abramzon, Yevgeniya; Gkazi, Athina Soragia; Eicher, John D; Kenna, Aoife; Mora, Gabriele; Calvo, Andrea; Mazzini, Letizia; Riva, Nilo; Mandrioli, Jessica; Caponnetto, Claudia; Battistini, Stefania; Volanti, Paolo; La Bella, Vincenzo; Conforti, Francesca L; Borghero, Giuseppe; Messina, Sonia; Simone, Isabella L; Trojsi, Francesca; Salvi, Fabrizio; Logullo, Francesco O; D'Alfonso, Sandra; Corrado, Lucia; Capasso, Margherita; Ferrucci, Luigi; Moreno, Cristiane de Araujo Martins; Kamalakaran, Sitharthan; Goldstein, David B; Gitler, Aaron D; Harris, Tim; Myers, Richard M; Phatnani, Hemali; Musunuri, Rajeeva Lochan; Evani, Uday Shankar; Abhyankar, Avinash; Zody, Michael C; Kaye, Julia; Finkbeiner, Steven; Wyman, Stacia K; LeNail, Alex; Lima, Leandro; Fraenkel, Ernest; Svendsen, Clive N; Thompson, Leslie M; Van Eyk, Jennifer E; Berry, James D; Miller, Timothy M; Kolb, Stephen J; Cudkowicz, Merit; Baxi, Emily; Benatar, Michael; Taylor, J Paul; Rampersaud, Evadnie; Wu, Gang; Wuu, Joanne; Lauria, Giuseppe; Verde, Federico; Fogh, Isabella; Tiloca, Cinzia; Comi, Giacomo P; Sorarù, Gianni; Cereda, Cristina; Corcia, Philippe; Laaksovirta, Hannu; Myllykangas, Liisa; Jansson, Lilja; Valori, Miko; Ealing, John; Hamdalla, Hisham; Rollinson, Sara; Pickering-Brown, Stuart; Orrell, Richard W; Sidle, Katie C; Malaspina, Andrea; Hardy, John; Singleton, Andrew B; Johnson, Janel O; Arepalli, Sampath; Sapp, Peter C; McKenna-Yasek, Diane; Polak, Meraida; Asress, Seneshaw; Al-Sarraj, Safa; King, Andrew; Troakes, Claire; Vance, Caroline; de Belleroche, Jacqueline; Baas, Frank; Ten Asbroek, Anneloor L M A; Muñoz-Blanco, José Luis; 
Hernandez, Dena G; Ding, Jinhui; Gibbs, J Raphael; Scholz, Sonja W; Floeter, Mary Kay; Campbell, Roy H; Landi, Francesco; Bowser, Robert; Pulst, Stefan M; Ravits, John M; MacGowan, Daniel J L; Kirby, Janine; Pioro, Erik P; Pamphlett, Roger; Broach, James; Gerhard, Glenn; Dunckley, Travis L; Brady, Christopher B; Kowall, Neil W; Troncoso, Juan C; Le Ber, Isabelle; Mouzat, Kevin; Lumbroso, Serge; Heiman-Patterson, Terry D; Kamel, Freya; Van Den Bosch, Ludo; Baloh, Robert H; Strom, Tim M; Meitinger, Thomas; Shatunov, Aleksey; Van Eijk, Kristel R; de Carvalho, Mamede; Kooyman, Maarten; Middelkoop, Bas; Moisse, Matthieu; McLaughlin, Russell L; Van Es, Michael A; Weber, Markus; Boylan, Kevin B; Van Blitterswijk, Marka; Rademakers, Rosa; Morrison, Karen E; Basak, A Nazli; Mora, Jesús S; Drory, Vivian E; Shaw, Pamela J; Turner, Martin R; Talbot, Kevin; Hardiman, Orla; Williams, Kelly L; Fifita, Jennifer A; Nicholson, Garth A; Blair, Ian P; Rouleau, Guy A; Esteban-Pérez, Jesús; García-Redondo, Alberto; Al-Chalabi, Ammar; Rogaeva, Ekaterina; Zinman, Lorne; Ostrow, Lyle W; Maragakis, Nicholas J; Rothstein, Jeffrey D; Simmons, Zachary; Cooper-Knock, Johnathan; Brice, Alexis; Goutman, Stephen A; Feldman, Eva L; Gibson, Summer B; Taroni, Franco; Ratti, Antonia; Gellera, Cinzia; Van Damme, Philip; Robberecht, Wim; Fratta, Pietro; Sabatelli, Mario; Lunetta, Christian; Ludolph, Albert C; Andersen, Peter M; Weishaupt, Jochen H; Camu, William; Trojanowski, John Q; Van Deerlin, Vivianna M; Brown, Robert H; van den Berg, Leonard H; Veldink, Jan H; Harms, Matthew B; Glass, Jonathan D; Stone, David J; Tienari, Pentti; Silani, Vincenzo; Chiò, Adriano; Shaw, Christopher E; Traynor, Bryan J; Landers, John E

2018-03-21

To identify novel genes associated with ALS, we undertook two lines of investigation. We carried out a genome-wide association study comparing 20,806 ALS cases and 59,804 controls. Independently, we performed a rare variant burden analysis comparing 1,138 index familial ALS cases and 19,494 controls. Through both approaches, we identified kinesin family member 5A (KIF5A) as a novel gene associated with ALS. Interestingly, mutations predominantly in the N-terminal motor domain of KIF5A are causative for two neurodegenerative diseases: hereditary spastic paraplegia (SPG10) and Charcot-Marie-Tooth type 2 (CMT2). In contrast, ALS-associated mutations are primarily located at the C-terminal cargo-binding tail domain and patients harboring loss-of-function mutations displayed an extended survival relative to typical ALS cases. Taken together, these results broaden the phenotype spectrum resulting from mutations in KIF5A and strengthen the role of cytoskeletal defects in the pathogenesis of ALS. Copyright © 2018 Elsevier Inc. All rights reserved.

Identifying Candidate Reprogramming Genes in Mouse Induced Pluripotent Stem Cells.

Science.gov (United States)

Gao, Fang; Li, Jingyu; Zhang, Heng; Yang, Xu; An, Tiezhu

2017-08-01

Factor-based induced reprogramming approaches have tremendous potential for human regenerative medicine, but the efficiencies of these approaches are still low. In this study, we analyzed the global transcriptional profiles of mouse induced pluripotent stem cells (miPSCs) and mouse embryonic stem cells (mESCs) from seven different labs and present here the first successful clustering according to cell type, not by lab of origin. We identified 2131 different expression genes (DEs) as candidate pluripotency-associated genes by comparing mESCs/miPSCs with somatic cells and 720 DEs between miPSCs and mESCs. Interestingly, there was a significant overlap between the two DE sets. Therefore, we defined the overlap DEs as "consensus DEs" including 313 miPSC-specific genes expressed at a higher level in miPSCs versus mESCs and 184 mESC-specific genes in total and reasoned that these may contribute to the differences in pluripotency between mESCs and miPSCs. A classification of "consensus DEs" according to their different expression levels between somatic cells and mESCs/miPSCs shows that 86% of the miPSC-specific genes are more highly expressed in somatic cells, while 73% of mESC-specific genes are highly expressed in mESCs/miPSCs, indicating that the miPSCs have not efficiently silenced the expression pattern of the somatic cells from which they are derived and failed to completely induce the genes with high expression levels in mESCs. We further revealed a strong correlation between oocyte-enriched factors and insufficiently induced mESC-specific genes and identified 11 hub genes via network analysis. In light of these findings, we postulated that these key hub genes might not only drive somatic cell nuclear transfer (SCNT) reprogramming but also augment the efficiency and quality of miPSC reprogramming.
Patterns of genomic variation in the poplar rust fungus Melampsora larici-populina identify pathogenesis-related factors

Directory of Open Access Journals (Sweden)

Antoine Persoons

2014-09-01

Full Text Available Melampsora larici-populina is a fungal pathogen responsible for foliar rust disease on poplar trees, which causes damage to forest plantations worldwide, particularly in Northern Europe. The reference genome of the isolate 98AG31 was previously sequenced using a whole genome shotgun strategy, revealing a large genome of 101 megabases containing 16,399 predicted genes, which included secreted protein genes representing poplar rust candidate effectors. In the present study, the genomes of 15 isolates collected over the past 20 years throughout the French territory, representing distinct virulence profiles, were characterized by massively parallel sequencing to assess genetic variation in the poplar rust fungus. Comparison to the reference genome revealed striking structural variations. Analysis of coverage and sequencing depth identified large missing regions between isolates related to the mating type loci. More than 611,824 single-nucleotide polymorphism (SNP) positions were uncovered overall, indicating a remarkable level of polymorphism. Based on the accumulation of non-synonymous substitutions in coding sequences and the relative frequencies of synonymous and non-synonymous polymorphisms (i.e. PN/PS), we identify candidate genes that may be involved in fungal pathogenesis. Correlation between non-synonymous SNPs in genes encoding secreted proteins and pathotypes of the studied isolates revealed candidate genes potentially related to virulences 1, 6 and 8 of the poplar rust fungus.
In-Silico Integration Approach to Identify a Key miRNA Regulating a Gene Network in Aggressive Prostate Cancer

Science.gov (United States)

Colaprico, Antonio; Bontempi, Gianluca; Castiglioni, Isabella

2018-01-01

Like other cancer diseases, prostate cancer (PC) is caused by the accumulation of genetic alterations in the cells that drives malignant growth. These alterations are revealed by gene profiling and copy number alteration (CNA) analysis. Moreover, recent evidence suggests that also microRNAs have an important role in PC development. Despite efforts to profile PC, the alterations (gene, CNA, and miRNA) and biological processes that correlate with disease development and progression remain partially elusive. Many gene signatures proposed as diagnostic or prognostic tools in cancer poorly overlap. The identification of co-expressed genes, that are functionally related, can identify a core network of genes associated with PC with a better reproducibility. By combining different approaches, including the integration of mRNA expression profiles, CNAs, and miRNA expression levels, we identified a gene signature of four genes overlapping with other published gene signatures and able to distinguish, in silico, high Gleason-scored PC from normal human tissue, which was further enriched to 19 genes by gene co-expression analysis. From the analysis of miRNAs possibly regulating this network, we found that hsa-miR-153 was highly connected to the genes in the network. Our results identify a four-gene signature with diagnostic and prognostic value in PC and suggest an interesting gene network that could play a key regulatory role in PC development and progression. Furthermore, hsa-miR-153, controlling this network, could be a potential biomarker for theranostics in high Gleason-scored PC. PMID:29562723
Exome sequencing identifies three novel candidate genes implicated in intellectual disability.

Directory of Open Access Journals (Sweden)

Zehra Agha

Full Text Available Intellectual disability (ID) is a major health problem mostly with an unknown etiology. Recently exome sequencing of individuals with ID identified novel genes implicated in the disease. Therefore the purpose of the present study was to identify the genetic cause of ID in one syndromic and two non-syndromic Pakistani families. Whole exome of three ID probands was sequenced. Missense variations in two plausible novel genes implicated in autosomal recessive ID were identified: lysine (K)-specific methyltransferase 2B (KMT2B) and zinc finger protein 589 (ZNF589), as well as hedgehog acyltransferase (HHAT) with a de novo mutation with autosomal dominant mode of inheritance. The KMT2B recessive variant is the first report of recessive Kleefstra syndrome-like phenotype. Identification of plausible causative mutations for two recessive and a dominant type of ID, in genes not previously implicated in disease, underscores the large genetic heterogeneity of ID. These results also support the viewpoint that a large number of ID genes converge on a limited number of common networks, i.e. ZNF589 belongs to KRAB-domain zinc-finger proteins previously implicated in ID, HHAT is predicted to affect sonic hedgehog, which is involved in several disorders with ID, and KMT2B associated with syndromic ID fits the epigenetic module underlying the Kleefstra syndromic spectrum. The association of these novel genes in three different Pakistani ID families highlights the importance of screening these genes in more families with similar phenotypes from different populations to confirm the involvement of these genes in pathogenesis of ID.
Machine Learning Leveraging Genomes from Metagenomes Identifies Influential Antibiotic Resistance Genes in the Infant Gut Microbiome

Science.gov (United States)

Olm, Matthew R.; Morowitz, Michael J.

2018-01-01

ABSTRACT Antibiotic resistance in pathogens is extensively studied, and yet little is known about how antibiotic resistance genes of typical gut bacteria influence microbiome dynamics. Here, we leveraged genomes from metagenomes to investigate how genes of the premature infant gut resistome correspond to the ability of bacteria to survive under certain environmental and clinical conditions. We found that formula feeding impacts the resistome. Random forest models corroborated by statistical tests revealed that the gut resistome of formula-fed infants is enriched in class D beta-lactamase genes. Interestingly, Clostridium difficile strains harboring this gene are at higher abundance in formula-fed infants than C. difficile strains lacking this gene. Organisms with genes for major facilitator superfamily drug efflux pumps have higher replication rates under all conditions, even in the absence of antibiotic therapy. Using a machine learning approach, we identified genes that are predictive of an organism’s direction of change in relative abundance after administration of vancomycin and cephalosporin antibiotics. The most accurate results were obtained by reducing annotated genomic data to five principal components classified by boosted decision trees. Among the genes involved in predicting whether an organism increased in relative abundance after treatment are those that encode subclass B2 beta-lactamases and transcriptional regulators of vancomycin resistance. This demonstrates that machine learning applied to genome-resolved metagenomics data can identify key genes for survival after antibiotics treatment and predict how organisms in the gut microbiome will respond to antibiotic administration. IMPORTANCE The process of reconstructing genomes from environmental sequence data (genome-resolved metagenomics) allows unique insight into microbial systems. We apply this technique to investigate how the antibiotic resistance genes of bacteria affect their ability to
Integration of multiple networks and pathways identifies cancer driver genes in pan-cancer analysis.

Science.gov (United States)

Cava, Claudia; Bertoli, Gloria; Colaprico, Antonio; Olsen, Catharina; Bontempi, Gianluca; Castiglioni, Isabella

2018-01-06

Modern high-throughput genomic technologies represent a comprehensive hallmark of molecular changes in pan-cancer studies. Although different cancer gene signatures have been revealed, the mechanism of tumourigenesis has yet to be completely understood. Pathways and networks are important tools to explain the role of genes in functional genomic studies. However, few methods consider the functional non-equal roles of genes in pathways and the complex gene-gene interactions in a network. We present a novel method in pan-cancer analysis that identifies de-regulated genes with a functional role by integrating pathway and network data. A pan-cancer analysis of 7158 tumour/normal samples from 16 cancer types identified 895 genes with a central role in pathways and de-regulated in cancer. Comparing our approach with 15 current tools that identify cancer driver genes, we found that 35.6% of the 895 genes identified by our method have been found as cancer driver genes with at least 2/15 tools. Finally, we applied a machine learning algorithm on 16 independent GEO cancer datasets to validate the diagnostic role of cancer driver genes for each cancer. We obtained a list of the top-ten cancer driver genes for each cancer considered in this study. Our analysis 1) confirmed that there are several known cancer driver genes in common among different types of cancer, 2) highlighted that cancer driver genes are able to regulate crucial pathways.
A 6-gene signature identifies four molecular subgroups of neuroblastoma

Science.gov (United States)

2011-01-01

Background There are currently three postulated genomic subtypes of the childhood tumour neuroblastoma (NB); Type 1, Type 2A, and Type 2B. The most aggressive forms of NB are characterized by amplification of the oncogene MYCN (MNA) and low expression of the favourable marker NTRK1. Recently, mutations or high expression of the familial predisposition gene Anaplastic Lymphoma Kinase (ALK) was associated to unfavourable biology of sporadic NB. Also, various other genes have been linked to NB pathogenesis. Results The present study explores subgroup discrimination by gene expression profiling using three published microarray studies on NB (47 samples). Four distinct clusters were identified by Principal Components Analysis (PCA) in two separate data sets, which could be verified by an unsupervised hierarchical clustering in a third independent data set (101 NB samples) using a set of 74 discriminative genes. The expression signature of six NB-associated genes ALK, BIRC5, CCND1, MYCN, NTRK1, and PHOX2B, significantly discriminated the four clusters (p INSS stage 4 and/or dead of disease, p < 0.05, Fisher's exact test). Conclusions Based on expression profiling we have identified four molecular subgroups of neuroblastoma, which can be distinguished by a 6-gene signature. The fourth subgroup has not been described elsewhere, and efforts are currently made to further investigate this group's specific characteristics. PMID:21492432
Addiction and Reward-related Genes Show Altered Expression in the Postpartum Nucleus Accumbens

Directory of Open Access Journals (Sweden)

Changjiu Zhao

2014-11-01

Motherhood involves a switch in natural rewards, whereby offspring become highly rewarding. Nucleus accumbens (NAC) is a key CNS region for natural rewards and addictions, but to date no study has evaluated on a large scale the events in NAC that underlie the maternal change in natural rewards. In this study we utilized microarray and bioinformatics approaches to evaluate postpartum NAC gene expression changes in mice. Modular Single-set Enrichment Test (MSET) indicated that postpartum (relative to virgin) NAC gene expression profile was significantly enriched for genes related to addiction and reward in 5 of 5 independently curated databases (e.g., Malacards, Phenopedia). Over 100 addiction/reward related genes were identified and these included: Per1, Per2, Arc, Homer2, Creb1, Grm3, Fosb, Gabrb3, Adra2a, Ntrk2, Cry1, Penk, Cartpt, Adcy1, Npy1r, Htr1a, Drd1a, Gria1, and Pdyn. ToppCluster analysis found maternal NAC expression profile to be significantly enriched for genes related to the drug action of nicotine, ketamine, and dronabinol. Pathway analysis indicated postpartum NAC as enriched for RNA processing, CNS development/differentiation, and transcriptional regulation. Weighted Gene Coexpression Network Analysis identified possible networks for transcription factors, including Nr1d1, Per2, Fosb, Egr1, and Nr4a1. The postpartum state involves increased risk for mental health disorders and MSET analysis indicated postpartum NAC to be enriched for genes related to depression, bipolar disorder, and schizophrenia. Mental health related genes included: Fabp7, Grm3, Penk, and Nr1d1. We confirmed via quantitative PCR Nr1d1, Per2, Grm3, Penk, Drd1a, and Pdyn. This study indicates for the first time that postpartum NAC involves large scale gene expression alterations linked to addiction and reward. Because the postpartum state also involves decreased response to drugs, the findings could provide insights into how to mitigate addictions.
[Key effect genes responding to nerve injury identified by gene ontology and computer pattern recognition].

Science.gov (United States)

Pan, Qian; Peng, Jin; Zhou, Xue; Yang, Hao; Zhang, Wei

2012-07-01

To screen out important genes from large gene microarray data sets after nerve injury, we combined the gene ontology (GO) method with computer pattern recognition technology to find key genes responding to nerve injury, and then verified one of the screened-out genes. Data mining and gene ontology analysis of gene chip data GSE26350 were carried out in MATLAB. Cd44 was selected from the screened-out key gene molecular spectrum by comparing the genes' GO terms and their positions on the principal component score map. Function interference was employed to disrupt the normal binding of Cd44 and one of its ligands, chondroitin sulfate C (CSC), to observe neurite extension. Gene ontology analysis showed that the first genes on the score map (marked by red *) were mainly distributed in molecular function GO terms such as molecular transducer activity, receptor activity, and protein binding. Cd44 is one of six effector protein genes, and attracted our attention because of its functional diversity. After different reagents were added to the medium to interfere with the normal binding of CSC and Cd44, CSC's inhibition of neurite extension was relieved to varying degrees. CSC can inhibit neurite extension through binding Cd44 on the neuron membrane. This verifies that important genes in given physiological processes can be identified by gene ontology analysis of gene chip data.
Meta-Analysis of Placental Transcriptome Data Identifies a Novel Molecular Pathway Related to Preeclampsia.

Directory of Open Access Journals (Sweden)

Miranda van Uitert

Studies using the placental transcriptome to identify key molecules relevant for preeclampsia are hampered by a relatively small sample size. In addition, they use a variety of bioinformatics and statistical methods, making comparison of findings challenging. To generate a more robust preeclampsia gene expression signature, we performed a meta-analysis on the original data of 11 placenta RNA microarray experiments, representing 139 normotensive and 116 preeclamptic pregnancies. Microarray data were pre-processed and analyzed using standardized bioinformatics and statistical procedures and the effect sizes were combined using an inverse-variance random-effects model. Interactions between genes in the resulting gene expression signature were identified by pathway analysis (Ingenuity Pathway Analysis, Gene Set Enrichment Analysis, Graphite) and protein-protein associations (STRING). This approach has resulted in a comprehensive list of differentially expressed genes that led to a 388-gene meta-signature of preeclamptic placenta. Pathway analysis highlights the involvement of the previously identified hypoxia/HIF1A pathway in the establishment of the preeclamptic gene expression profile, while analysis of protein interaction networks indicates CREBBP/EP300 as a novel element central to the preeclamptic placental transcriptome. In addition, there is an apparent high incidence of preeclampsia in women carrying a child with a mutation in CREBBP/EP300 (Rubinstein-Taybi Syndrome). The 388-gene preeclampsia meta-signature offers a vital starting point for further studies into the relevance of these genes (in particular CREBBP/EP300) and their concomitant pathways as biomarkers or functional molecules in preeclampsia. This will result in a better understanding of the molecular basis of this disease and opens up the opportunity to develop rational therapies targeting the placental dysfunction causal to preeclampsia.
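The abstract above combines per-study effect sizes with an inverse-variance random-effects model but does not say which between-study variance estimator was used. The following is a minimal sketch that assumes the common DerSimonian-Laird estimator; the function name `random_effects_meta` and the input values are hypothetical and are not taken from the study.

```python
import math


def random_effects_meta(effects, variances):
    """Pool per-study effect sizes for one gene.

    Assumes the DerSimonian-Laird estimator for the between-study
    variance (tau^2); the abstract does not name its estimator.
    Returns (pooled_effect, standard_error, tau2).
    """
    k = len(effects)
    if k < 2 or k != len(variances):
        raise ValueError("need at least two studies with matching variances")

    # Fixed-effect (inverse-variance) weights and pooled estimate.
    w = [1.0 / v for v in variances]
    sw = sum(w)
    fixed = sum(wi * yi for wi, yi in zip(w, effects)) / sw

    # Cochran's Q measures heterogeneity around the fixed-effect estimate.
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, effects))
    c = sw - sum(wi ** 2 for wi in w) / sw
    tau2 = max(0.0, (q - (k - 1)) / c)

    # Random-effects weights add tau^2 to each within-study variance.
    w_star = [1.0 / (v + tau2) for v in variances]
    sws = sum(w_star)
    pooled = sum(wi * yi for wi, yi in zip(w_star, effects)) / sws
    return pooled, math.sqrt(1.0 / sws), tau2


# Hypothetical log fold changes and variances for one gene in three studies.
pooled, se, tau2 = random_effects_meta([0.8, 1.1, 0.4], [0.04, 0.09, 0.05])
```

When the studies agree closely, tau^2 is truncated to zero and the result reduces to the ordinary fixed-effect inverse-variance average.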
Association analysis identifies TLR7 and TLR8 as novel risk genes in asthma and related disorders

DEFF Research Database (Denmark)

Møller-Larsen, Steffen; Nyegaard, Mette; Haagerup, Annette

2008-01-01

the TLR7 and TLR8 genes. METHODS: We investigated the involvement of TLR7 and TLR8 in the aetiology of asthma and related disorders by a family based association analysis of two independently ascertained family samples comprising 540 and 424 individuals from 135 and 100 families, respectively. Ten...
Identification of pathogenicity-related genes in Fusarium oxysporum f. sp. cepae.

Science.gov (United States)

Taylor, Andrew; Vágány, Viktória; Jackson, Alison C; Harrison, Richard J; Rainoni, Alessandro; Clarkson, John P

2016-09-01

Pathogenic isolates of Fusarium oxysporum, distinguished as formae speciales (f. spp.) on the basis of their host specificity, cause crown rots, root rots and vascular wilts on many important crops worldwide. Fusarium oxysporum f. sp. cepae (FOC) is particularly problematic to onion growers worldwide and is increasing in prevalence in the UK. We characterized 31 F. oxysporum isolates collected from UK onions using pathogenicity tests, sequencing of housekeeping genes and identification of effectors. In onion seedling and bulb tests, 21 isolates were pathogenic and 10 were non-pathogenic. The molecular characterization of these isolates, and 21 additional isolates comprising other f. spp. and different Fusarium species, was carried out by sequencing three housekeeping genes. A concatenated tree separated the F. oxysporum isolates into six clades, but did not distinguish between pathogenic and non-pathogenic isolates. Ten putative effectors were identified within FOC, including seven Secreted In Xylem (SIX) genes first reported in F. oxysporum f. sp. lycopersici. Two highly homologous proteins with signal peptides and RxLR motifs (CRX1/CRX2) and a gene with no previously characterized domains (C5) were also identified. The presence/absence of nine of these genes was strongly related to pathogenicity against onion and all were shown to be expressed in planta. Different SIX gene complements were identified in other f. spp., but none were identified in three other Fusarium species from onion. Although the FOC SIX genes had a high level of homology with other f. spp., there were clear differences in sequences which were unique to FOC, whereas CRX1 and C5 genes appear to be largely FOC specific. © 2015 The Authors Molecular Plant Pathology Published by British Society for Plant Pathology and John Wiley & Sons Ltd.
Macular xanthophylls, lipoprotein-related genes, and age-related macular degeneration.

Science.gov (United States)

Koo, Euna; Neuringer, Martha; SanGiovanni, John Paul

2014-07-01

Plant-based macular xanthophylls (MXs; lutein and zeaxanthin) and the lutein metabolite meso-zeaxanthin are the major constituents of macular pigment, a compound concentrated in retinal areas that are responsible for fine-feature visual sensation. There is an unmet need to examine the genetics of factors influencing regulatory mechanisms and metabolic fates of these 3 MXs because they are linked to processes implicated in the pathogenesis of age-related macular degeneration (AMD). In this work we provide an overview of evidence supporting a molecular basis for AMD-MX associations as they may relate to DNA sequence variation in AMD- and lipoprotein-related genes. We recognize a number of emerging research opportunities, barriers, knowledge gaps, and tools offering promise for meaningful investigation and inference in the field. Overviews on AMD- and high-density lipoprotein (HDL)-related genes encoding receptors, transporters, and enzymes affecting or affected by MXs are followed with information on localization of products from these genes to retinal cell types manifesting AMD-related pathophysiology. Evidence on the relation of each gene or gene product with retinal MX response to nutrient intake is discussed. This information is followed by a review of results from mechanistic studies testing gene-disease relations. We then present findings on relations of AMD with DNA sequence variants in MX-associated genes. Our conclusion is that AMD-associated DNA variants that influence the actions and metabolic fates of HDL system constituents should be examined further for concomitant influence on MX absorption, retinal tissue responses to MX intake, and the capacity to modify MX-associated factors and processes implicated in AMD pathogenesis. © 2014 American Society for Nutrition.
Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes.

Science.gov (United States)

Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich

2012-02-01

The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild type-copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information.
Gonad Transcriptome Analysis of the Pacific Oyster Crassostrea gigas Identifies Potential Genes Regulating the Sex Determination and Differentiation Process.

Science.gov (United States)

Yue, Chenyang; Li, Qi; Yu, Hong

2018-04-01

The Pacific oyster Crassostrea gigas is a commercially important bivalve in aquaculture worldwide. C. gigas has a fascinating sexual reproduction system consisting of dioecism, sex change, and occasional hermaphroditism, while knowledge of the molecular mechanisms of sex determination and differentiation is still limited. In this study, the transcriptomes of male and female gonads at different gametogenesis stages were characterized by RNA-seq. Hierarchical clustering based on genes differentially expressed revealed that 1269 genes were expressed specifically in female gonads and 817 genes were expressed increasingly over the course of spermatogenesis. Besides, we identified two and one gene modules related to female and male gonad development, respectively, using weighted gene correlation network analysis (WGCNA). Interestingly, GO and KEGG enrichment analysis showed that neurotransmitter-related terms were significantly enriched in genes related to ovary development, suggesting that the neurotransmitters were likely to regulate female sex differentiation. In addition, two hub genes related to testis development, lncRNA LOC105321313 and Cg-Sh3kbp1, and one hub gene related to ovary development, Cg-Malrd1-like, were firstly investigated. This study points out the role of neurotransmitter and non-coding RNA regulation during gonad development and produces lists of novel relevant candidate genes for further studies. All of these provided valuable information to understand the molecular mechanisms of C. gigas sex determination and differentiation.
Replicon-dependent differentiation of symbiosis-related genes in Sinorhizobium strains nodulating Glycine max.

Science.gov (United States)

Guo, Hui Juan; Wang, En Tao; Zhang, Xing Xing; Li, Qin Qin; Zhang, Yan Ming; Tian, Chang Fu; Chen, Wen Xin

2014-02-01

In order to investigate the genetic differentiation of Sinorhizobium strains nodulating Glycine max and related microevolutionary mechanisms, three housekeeping genes (SMc00019, truA, and thrA) and 16 symbiosis-related genes on the chromosome (7 genes), pSymA (6 genes), and pSymB (3 genes) were analyzed. Five distinct species were identified among the test strains by calculating the average nucleotide identity (ANI) of SMc00019-truA-thrA: Sinorhizobium fredii, Sinorhizobium sojae, Sinorhizobium sp. I, Sinorhizobium sp. II, and Sinorhizobium sp. III. These species assignments were also supported by population genetics and phylogenetic analyses of housekeeping genes and symbiosis-related genes on the chromosome and pSymB. Different levels of genetic differentiation were observed among these species or different replicons. S. sojae was the most divergent from the other test species and was characterized by its low intraspecies diversity and limited geographic distribution. Intergenic recombination dominated the evolution of 19 genes from different replicons. Intraspecies recombination happened frequently in housekeeping genes and symbiosis-related genes on the chromosome and pSymB, whereas pSymA genes showed a clear pattern of lateral-transfer events between different species. Moreover, pSymA genes were characterized by a lower level of polymorphism and recombination than those on the chromosome and pSymB. Taken together, genes from different replicons of rhizobia might be involved in the establishment of symbiosis with legumes, but these symbiosis-related genes might have evolved differently according to their corresponding replicons.
Glucocorticoid Receptor Related Genes: Genotype And Brain Gene Expression Relationships To Suicide And Major Depressive Disorder

Science.gov (United States)

Pantazatos, Spiro P.; Huang, Yung-yu; Rosoklija, Gorazd B.; Dwork, Andrew J.; Burke, Ainsley; Arango, Victoria; Oquendo, Maria A.; Mann, J. John

2016-01-01

Introduction We tested the relationship between genotype, gene expression and suicidal behavior and MDD in live subjects and postmortem samples for three genes, associated with the hypothalamic-pituitary-adrenal axis, suicidal behavior and major depressive disorder (MDD); FK506 binding protein 5 (FKBP5), Spindle and kinetochore-associated protein 2 (SKA2) and Glucocorticoid Receptor (NR3C1). Materials and Methods Single-nucleotide polymorphisms (SNPs) and haplotypes were tested for association with suicidal behavior and MDD in a live (N=277) and a postmortem sample (N=209). RNA-seq was used to examine gene and isoform-level brain expression postmortem (Brodmann Area 9) (N=59). Expression quantitative trait loci (eQTL) relationships were examined using a public database (UK Brain Expression Consortium). Results We identified a haplotype within the FKBP5 gene, present in 47% of the live subjects, that was associated with increased risk of suicide attempt (OR=1.58, t=6.03, p=0.014). Six SNPs on this gene, three SNPs on SKA2 and one near NR3C1 showed before-adjustment association with attempted suicide, and two SNPs of SKA2 with suicide death, but none stayed significant after adjustment for multiple testing. Only the SKA2 SNPs were related to expression in the prefrontal cortex. One NR3C1 transcript had lower expression in suicide relative to non-suicide sudden death cases (b=-0.48, SE=0.12, t=-4.02, adjusted p=0.004). Conclusion We have identified an association of FKBP5 haplotype with risk of suicide attempt and found an association between suicide and altered NR3C1 gene expression in the prefrontal cortex. Our findings further implicate hypothalamic pituitary axis dysfunction in suicidal behavior. PMID:27030168
GLUCOCORTICOID RECEPTOR-RELATED GENES: GENOTYPE AND BRAIN GENE EXPRESSION RELATIONSHIPS TO SUICIDE AND MAJOR DEPRESSIVE DISORDER.

Science.gov (United States)

Yin, Honglei; Galfalvy, Hanga; Pantazatos, Spiro P; Huang, Yung-Yu; Rosoklija, Gorazd B; Dwork, Andrew J; Burke, Ainsley; Arango, Victoria; Oquendo, Maria A; Mann, J John

2016-06-01

We tested the relationship between genotype, gene expression and suicidal behavior and major depressive disorder (MDD) in live subjects and postmortem samples for three genes, associated with the hypothalamic-pituitary-adrenal axis, suicidal behavior, and MDD; FK506-binding protein 5 (FKBP5), Spindle and kinetochore-associated protein 2 (SKA2), and Glucocorticoid Receptor (NR3C1). Single-nucleotide polymorphisms (SNPs) and haplotypes were tested for association with suicidal behavior and MDD in a live (N = 277) and a postmortem sample (N = 209). RNA-seq was used to examine gene and isoform-level brain expression postmortem (Brodmann Area 9; N = 59). Expression quantitative trait loci (eQTL) relationships were examined using a public database (UK Brain Expression Consortium). We identified a haplotype within the FKBP5 gene, present in 47% of the live subjects, which was associated with increased risk of suicide attempt (OR = 1.58, t = 6.03, P = .014). Six SNPs on this gene, three SNPs on SKA2, and one near NR3C1 showed before-adjustment association with attempted suicide, and two SNPs of SKA2 with suicide death, but none stayed significant after adjustment for multiple testing. Only the SKA2 SNPs were related to expression in the prefrontal cortex (pFCTX). One NR3C1 transcript had lower expression in suicide relative to nonsuicide sudden death cases (b = -0.48, SE = 0.12, t = -4.02, adjusted P = .004). We have identified an association of FKBP5 haplotype with risk of suicide attempt and found an association between suicide and altered NR3C1 gene expression in the pFCTX. Our findings further implicate hypothalamic pituitary axis dysfunction in suicidal behavior. © 2016 Wiley Periodicals, Inc.
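The abstract above reports SNP associations that were nominally significant but did not survive adjustment for multiple testing; it does not state which correction was applied. As an illustration only, the sketch below shows two widely used adjustments, Bonferroni and Benjamini-Hochberg. The function names and the p-values are made up for the example and do not come from the study.

```python
def bonferroni(pvalues):
    """Bonferroni adjustment: multiply each p-value by the number of tests."""
    m = len(pvalues)
    return [min(1.0, p * m) for p in pvalues]


def benjamini_hochberg(pvalues):
    """Benjamini-Hochberg adjusted p-values (false discovery rate control)."""
    m = len(pvalues)
    # Indices of the p-values in ascending order.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical nominal p-values for four SNPs (illustrative only).
nominal = [0.01, 0.04, 0.03, 0.5]
bonf = bonferroni(nominal)
bh = benjamini_hochberg(nominal)
# A SNP "stays significant" only if its adjusted p-value is below the threshold.
survivors = [i for i, p in enumerate(bh) if p < 0.05]
```

Bonferroni is the more conservative of the two, so a result that is nominally significant can easily lose significance once many SNPs are tested together.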
Genome-Wide Association Study Identifies NBS-LRR-Encoding Genes Related with Anthracnose and Common Bacterial Blight in the Common Bean.

Science.gov (United States)

Wu, Jing; Zhu, Jifeng; Wang, Lanfen; Wang, Shumin

2017-01-01

Nucleotide-binding site and leucine-rich repeat (NBS-LRR) genes represent the largest and most important disease resistance genes in plants. The genome sequence of the common bean (Phaseolus vulgaris L.) provides valuable data for determining the genomic organization of NBS-LRR genes. However, data on the NBS-LRR genes in the common bean are limited. In total, 178 NBS-LRR-type genes and 145 partial genes (with or without a NBS) located on 11 common bean chromosomes were identified from the genome sequence database. Furthermore, 30 NBS-LRR genes were classified into Toll/interleukin-1 receptor (TIR)-NBS-LRR (TNL) types, and 148 NBS-LRR genes were classified into coiled-coil (CC)-NBS-LRR (CNL) types. Moreover, the phylogenetic tree supported the division of these PvNBS genes into two obvious groups, TNL types and CNL types. We also built expression profiles of NBS genes in response to anthracnose and common bacterial blight using qRT-PCR. Finally, we detected nine disease resistance loci for anthracnose (ANT) and seven for common bacterial blight (CBB) using the developed NBS-SSR markers. Among these loci, NSSR24, NSSR73, and NSSR265 may be located at new regions for ANT resistance, while NSSR65 and NSSR260 may be located at new regions for CBB resistance. Furthermore, we validated NSSR24, NSSR65, NSSR73, NSSR260, and NSSR265 using a new natural population. Our results provide useful information regarding the function of the NBS-LRR proteins and will accelerate the functional genomics and evolutionary studies of NBS-LRR genes in food legumes. NBS-SSR markers represent a wide-reaching resource for molecular breeding in the common bean and other food legumes. Collectively, our results should be of broad interest to bean scientists and breeders.
QTLs for seed vigor-related traits identified in maize seeds germinated under artificial aging conditions.

Science.gov (United States)

Han, Zanping; Ku, Lixia; Zhang, Zhenzhen; Zhang, Jun; Guo, Shulei; Liu, Haiying; Zhao, Ruifang; Ren, Zhenzhen; Zhang, Liangkun; Su, Huihui; Dong, Lei; Chen, Yanhui

2014-01-01

High seed vigor is important for agricultural production due to the associated potential for increased growth and productivity. However, a better understanding of the underlying molecular mechanisms is required because the genetic basis for seed vigor remains unknown. We used single-nucleotide polymorphism (SNP) markers to map quantitative trait loci (QTLs) for four seed vigor traits in two connected recombinant inbred line (RIL) maize populations under four treatment conditions during seed germination. Sixty-five QTLs distributed between the two populations were identified and a meta-analysis was used to integrate genetic maps. Sixty-one initially identified QTLs were integrated into 18 meta-QTLs (mQTLs). Initial QTLs with contribution to phenotypic variation values of R(2)>10% were integrated into mQTLs. Twenty-three candidate genes for association with seed vigor traits coincided with 13 mQTLs. The candidate genes had functions in the glycolytic pathway and in protein metabolism. QTLs with major effects (R(2)>10%) were identified under at least one treatment condition for mQTL2, mQTL3-2, and mQTL3-4. Candidate genes included a calcium-dependent protein kinase gene (302810918) involved in signal transduction that mapped in the mQTL3-2 interval associated with germination energy (GE) and germination percentage (GP), and an hsp20/alpha crystallin family protein gene (At5g51440) that mapped in the mQTL3-4 interval associated with GE and GP. Two initial QTLs with a major effect under at least two treatment conditions were identified for mQTL5-2. A cucumisin-like Ser protease gene (At5g67360) mapped in the mQTL5-2 interval associated with GP. The chromosome regions for mQTL2, mQTL3-2, mQTL3-4, and mQTL5-2 may be hot spots for QTLs related to seed vigor traits. The mQTLs and candidate genes identified in this study provide valuable information for the identification of additional quantitative trait genes.

QTLs for seed vigor-related traits identified in maize seeds germinated under artificial aging conditions.

Directory of Open Access Journals (Sweden)

Zanping Han

High seed vigor is important for agricultural production due to the associated potential for increased growth and productivity. However, a better understanding of the underlying molecular mechanisms is required because the genetic basis for seed vigor remains unknown. We used single-nucleotide polymorphism (SNP) markers to map quantitative trait loci (QTLs) for four seed vigor traits in two connected recombinant inbred line (RIL) maize populations under four treatment conditions during seed germination. Sixty-five QTLs distributed between the two populations were identified and a meta-analysis was used to integrate genetic maps. Sixty-one initially identified QTLs were integrated into 18 meta-QTLs (mQTLs). Initial QTLs with contribution to phenotypic variation values of R(2)>10% were integrated into mQTLs. Twenty-three candidate genes for association with seed vigor traits coincided with 13 mQTLs. The candidate genes had functions in the glycolytic pathway and in protein metabolism. QTLs with major effects (R(2)>10%) were identified under at least one treatment condition for mQTL2, mQTL3-2, and mQTL3-4. Candidate genes included a calcium-dependent protein kinase gene (302810918) involved in signal transduction that mapped in the mQTL3-2 interval associated with germination energy (GE) and germination percentage (GP), and an hsp20/alpha crystallin family protein gene (At5g51440) that mapped in the mQTL3-4 interval associated with GE and GP. Two initial QTLs with a major effect under at least two treatment conditions were identified for mQTL5-2. A cucumisin-like Ser protease gene (At5g67360) mapped in the mQTL5-2 interval associated with GP. The chromosome regions for mQTL2, mQTL3-2, mQTL3-4, and mQTL5-2 may be hot spots for QTLs related to seed vigor traits. The mQTLs and candidate genes identified in this study provide valuable information for the identification of additional quantitative trait genes.
A Systematic Investigation into Aging Related Genes in Brain and Their Relationship with Alzheimer's Disease.

Science.gov (United States)

Meng, Guofeng; Zhong, Xiaoyan; Mei, Hongkang

2016-01-01

Aging, as a complex biological process, is accompanied by the accumulation of functional losses at different levels, which makes age the biggest risk factor for many neurological diseases. Even after decades of investigation, the process of aging is still far from being fully understood, especially at a systematic level. In this study, we identified aging related genes in brain by collecting the ones with sustained and consistent gene expression or DNA methylation changes in the aging process. Gene Ontology functional analysis of these genes suggested that transcriptional regulators were the most affected genes in the aging process. Transcription regulation analysis found that some transcription factors, especially Specificity Protein 1 (SP1), play important roles in regulating aging related gene expression. Module-based functional analysis indicated that these genes are associated with many well-known aging related pathways, supporting the validity of our approach to select aging related genes. Finally, we investigated the roles of aging related genes in Alzheimer's Disease (AD). We found that aging and AD related genes both involved some common pathways, which provides a possible explanation for why aging makes the brain more vulnerable to Alzheimer's Disease.
Clustering approaches to identifying gene expression patterns from DNA microarray data.

Science.gov (United States)

Do, Jin Hwan; Choi, Dong-Kug

2008-04-30

The analysis of microarray data is essential for large amounts of gene expression data. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weakness and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.
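The review above notes that clustering results depend on the algorithm, the dissimilarity metric, the chosen number of clusters and the initial values. A minimal K-means sketch makes these user-selectable choices concrete. It assumes squared Euclidean distance and, purely for reproducibility, seeds the centroids with the first k profiles; both choices, the function names and the toy expression values are assumptions of this example, not details from the review.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two expression profiles."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, iterations=100):
    """Basic K-means; returns (cluster index per point, centroids).

    Centroids are initialised from the first k points (an assumption made
    for determinism); different initial values can give different results.
    """
    centroids = [list(p) for p in points[:k]]
    assignment = [None] * len(points)
    for _ in range(iterations):
        # Assignment step: each profile joins its nearest centroid.
        new_assignment = [
            min(range(k), key=lambda c: sq_dist(p, centroids[c])) for p in points
        ]
        if new_assignment == assignment:
            break  # converged: no profile changed cluster
        assignment = new_assignment
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return assignment, centroids


# Toy expression profiles (genes x 2 conditions), made up for illustration.
profiles = [[0.0, 0.1], [5.0, 5.1], [0.2, 0.0], [5.2, 4.9]]
labels, centers = kmeans(profiles, k=2)
```

Changing k, the distance function or the seeding line is enough to change the grouping on less cleanly separated data, which is the sensitivity the review warns about.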
Integration of mouse and human genome-wide association data identifies KCNIP4 as an asthma gene.

Directory of Open Access Journals (Sweden)

Blanca E Himes

Asthma is a common chronic respiratory disease characterized by airway hyperresponsiveness (AHR). The genetics of asthma have been widely studied in mouse and human, and homologous genomic regions have been associated with mouse AHR and human asthma-related phenotypes. Our goal was to identify asthma-related genes by integrating AHR associations in mouse with human genome-wide association study (GWAS) data. We used Efficient Mixed Model Association (EMMA) analysis to conduct a GWAS of baseline AHR measures from males and females of 31 mouse strains. Genes near or containing SNPs with EMMA p-values <0.001 were selected for further study in human GWAS. The results of the previously reported EVE consortium asthma GWAS meta-analysis consisting of 12,958 diverse North American subjects from 9 study centers were used to select a subset of homologous genes with evidence of association with asthma in humans. Following validation attempts in three human asthma GWAS (i.e., Sepracor/LOCCS/LODO/Illumina, GABRIEL, DAG) and two human AHR GWAS (i.e., SHARP, DAG), the Kv channel interacting protein 4 (KCNIP4) gene was identified as nominally associated with both asthma and AHR at a gene- and SNP-level. In EVE, the smallest KCNIP4 association was at rs6833065 (P-value 2.9e-04), while the strongest associations for Sepracor/LOCCS/LODO/Illumina, GABRIEL, DAG were 1.5e-03, 1.0e-03, 3.1e-03 at rs7664617, rs4697177, rs4696975, respectively. At a SNP level, the strongest association across all asthma GWAS was at rs4697177 (P-value 1.1e-04). The smallest P-values for association with AHR were 2.3e-03 at rs11947661 in SHARP and 2.1e-03 at rs402802 in DAG. Functional studies are required to validate the potential involvement of KCNIP4 in modulating asthma susceptibility and/or AHR. Our results suggest that a useful approach to identify genes associated with human asthma is to leverage mouse AHR association data.
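The gene-selection step in the abstract above (genes near or containing SNPs with p-values below 0.001) can be sketched roughly as follows in Python. The abstract does not say how "near" was defined, so the 50 kb flanking window, the function name and all SNP and gene records below are hypothetical choices made for illustration only.

```python
def genes_near_significant_snps(snps, genes, p_cutoff=0.001, window=50_000):
    """Return sorted names of genes that contain, or lie within `window`
    base pairs of, a SNP whose association p-value is below `p_cutoff`.

    snps  -- iterable of (chromosome, position, p_value)
    genes -- iterable of (name, chromosome, start, end)
    The window size is an assumption; the source does not specify one.
    """
    selected = set()
    for chrom, pos, p_value in snps:
        if p_value >= p_cutoff:
            continue  # SNP does not pass the significance filter
        for name, gene_chrom, start, end in genes:
            if gene_chrom == chrom and start - window <= pos <= end + window:
                selected.add(name)
    return sorted(selected)


# Hypothetical toy records, not data from the study.
toy_snps = [
    ("chr5", 1_020_000, 0.0004),  # significant, 20 kb downstream of gene1
    ("chr5", 3_000_000, 0.0200),  # inside gene2 but not significant
    ("chr7", 510_000, 0.0009),    # significant, inside gene3
]
toy_genes = [
    ("gene1", "chr5", 900_000, 1_000_000),
    ("gene2", "chr5", 2_950_000, 3_050_000),
    ("gene3", "chr7", 500_000, 520_000),
]
candidates = genes_near_significant_snps(toy_snps, toy_genes)
```

Here `candidates` holds gene1 and gene3; gene2 is skipped because its only SNP fails the p-value cutoff.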
Genes Important for Schizosaccharomyces pombe Meiosis Identified Through a Functional Genomics Screen

Science.gov (United States)

Blyth, Julie; Makrantoni, Vasso; Barton, Rachael E.; Spanos, Christos; Rappsilber, Juri; Marston, Adele L.

2018-01-01

Meiosis is a specialized cell division that generates gametes, such as eggs and sperm. Errors in meiosis result in miscarriages and are the leading cause of birth defects; however, the molecular origins of these defects remain unknown. Studies in model organisms are beginning to identify the genes and pathways important for meiosis, but the parts list is still poorly defined. Here we present a comprehensive catalog of genes important for meiosis in the fission yeast, Schizosaccharomyces pombe. Our genome-wide functional screen surveyed all nonessential genes for roles in chromosome segregation and spore formation. Novel genes important at distinct stages of the meiotic chromosome segregation and differentiation program were identified. Preliminary characterization implicated three of these genes in centrosome/spindle pole body, centromere, and cohesion function. Our findings represent a near-complete parts list of genes important for meiosis in fission yeast, providing a valuable resource to advance our molecular understanding of meiosis. PMID:29259000
Targeted/exome sequencing identified mutations in ten Chinese patients diagnosed with Noonan syndrome and related disorders

Directory of Open Access Journals (Sweden)

Shanshan Xu

2017-10-01

Background: Noonan syndrome (NS) and Noonan syndrome with multiple lentigines (NSML) are autosomal dominant developmental disorders. NS and NSML are caused by abnormalities in genes that encode proteins related to the RAS-MAPK pathway, including PTPN11, RAF1, BRAF, and MAP2K. In this study, we diagnosed ten NS or NSML patients via targeted sequencing or whole exome sequencing (TS/WES). Methods: TS/WES was performed to identify mutations in ten Chinese patients who exhibited the following manifestations: potential facial dysmorphisms, short stature, congenital heart defects, and developmental delay. Sanger sequencing was used to confirm the suspected pathological variants in the patients and their family members. Results: TS/WES revealed three mutations in the PTPN11 gene, three mutations in the RAF1 gene, and four mutations in the BRAF gene in the NS and NSML patients who were previously diagnosed based on the abovementioned clinical features. All the identified mutations were determined to be de novo mutations. However, two patients who carried the same mutation in the RAF1 gene presented different clinical features. One patient with multiple lentigines was diagnosed with NSML, while the other patient without lentigines was diagnosed with NS. In addition, a patient who carried a hotspot mutation in the BRAF gene was diagnosed with NS instead of cardiofaciocutaneous syndrome (CFCS). Conclusions: TS/WES has emerged as a useful tool for definitive diagnosis and accurate genetic counseling of atypical cases. In this study, we analyzed ten Chinese patients diagnosed with NS and related disorders and identified their corresponding PTPN11, RAF1, and BRAF mutations. Among the target genes, BRAF showed the same degree of correlation with NS incidence as that of PTPN11 or RAF1.
Gene dosage, expression, and ontology analysis identifies driver genes in the carcinogenesis and chemoradioresistance of cervical cancer.

Directory of Open Access Journals (Sweden)

Malin Lando

2009-11-01

Integrative analysis of gene dosage, expression, and ontology (GO) data was performed to discover driver genes in the carcinogenesis and chemoradioresistance of cervical cancers. Gene dosage and expression profiles of 102 locally advanced cervical cancers were generated by microarray techniques. Fifty-two of these patients were also analyzed with the Illumina expression method to confirm the gene expression results. An independent cohort of 41 patients was used for validation of gene expressions associated with clinical outcome. Statistical analysis identified 29 recurrent gains and losses and 3 losses (on 3p, 13q, 21q) associated with poor outcome after chemoradiotherapy. The intratumor heterogeneity, assessed from the gene dosage profiles, was low for these alterations, showing that they had emerged prior to many other alterations and probably were early events in carcinogenesis. Integration of the alterations with gene expression and GO data identified genes that were regulated by the alterations and revealed five biological processes that were significantly overrepresented among the affected genes: apoptosis, metabolism, macromolecule localization, translation, and transcription. Four genes on 3p (RYBP, GBE1) and 13q (FAM48A, MED4) correlated with outcome at both the gene dosage and expression level and were satisfactorily validated in the independent cohort. These integrated analyses yielded 57 candidate drivers of 24 genetic events, including novel loci responsible for chemoradioresistance. Further mapping of the connections among genetic events, drivers, and biological processes suggested that each individual event stimulates specific processes in carcinogenesis through the coordinated control of multiple genes. The present results may provide novel therapeutic opportunities of both early and advanced stage cervical cancers.
Deep sequencing analysis of the transcriptomes of peanut aerial and subterranean young pods identifies candidate genes related to early embryo abortion.

Science.gov (United States)

Chen, Xiaoping; Zhu, Wei; Azam, Sarwar; Li, Heying; Zhu, Fanghe; Li, Haifen; Hong, Yanbin; Liu, Haiyan; Zhang, Erhua; Wu, Hong; Yu, Shanlin; Zhou, Guiyuan; Li, Shaoxiong; Zhong, Ni; Wen, Shijie; Li, Xingyu; Knapp, Steve J; Ozias-Akins, Peggy; Varshney, Rajeev K; Liang, Xuanqiang

2013-01-01

The failure of peg penetration into the soil leads to seed abortion in peanut. Knowledge of genes involved in these processes is comparatively deficient. Here, we used RNA-seq to gain insights into transcriptomes of aerial and subterranean pods. More than 2 million transcript reads with an average length of 396 bp were generated from one aerial (AP) and two subterranean (SP1 and SP2) pod libraries using pyrosequencing technology. After assembly, sets of 49 632, 49 952 and 50 494 from a total of 74 974 transcript assembly contigs (TACs) were identified in AP, SP1 and SP2, respectively. A clear linear relationship in the gene expression level was observed between these data sets. In brief, 2194 differentially expressed TACs with a 99.0% true-positive rate were identified, among which 859 and 1068 TACs were up-regulated in aerial and subterranean pods, respectively. Functional analysis showed that putative function based on similarity with proteins catalogued in UniProt and gene ontology term classification could be determined for 59 342 (79.2%) and 42 955 (57.3%) TACs, respectively. A total of 2968 TACs were mapped to 174 KEGG pathways, of which 168 were shared by aerial and subterranean transcriptomes. TACs involved in photosynthesis were significantly up-regulated and enriched in the aerial pod. In addition, two senescence-associated genes were identified as significantly up-regulated in the aerial pod, which potentially contribute to embryo abortion in aerial pods, and in turn, to cessation of swelling. The data set generated in this study provides evidence for some functional genes as robust candidates underlying aerial and subterranean pod development and contributes to an elucidation of the evolutionary implications resulting from fruit development under light and dark conditions. © 2012 The Authors Plant Biotechnology Journal © 2012 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd.
Expression and functional assessment of candidate type 2 diabetes susceptibility genes identify four new genes contributing to human insulin secretion

Directory of Open Access Journals (Sweden)

Fatou K. Ndiaye

2017-06-01

Objectives: Genome-wide association studies (GWAS) have identified >100 loci independently contributing to type 2 diabetes (T2D) risk. However, translational implications for precision medicine and for the development of novel treatments have been disappointing, due to poor knowledge of how these loci impact T2D pathophysiology. Here, we aimed to measure the expression of genes located nearby T2D associated signals and to assess their effect on insulin secretion from pancreatic beta cells. Methods: The expression of 104 candidate T2D susceptibility genes was measured in a human multi-tissue panel, through PCR-free expression assay. The effects of the knockdown of beta-cell enriched genes were next investigated on insulin secretion from the human EndoC-βH1 beta-cell line. Finally, we performed RNA-sequencing (RNA-seq) so as to assess the pathways affected by the knockdown of the new genes impacting insulin secretion from EndoC-βH1, and we analyzed the expression of the new genes in mouse models with altered pancreatic beta-cell function. Results: We found that the candidate T2D susceptibility genes' expression is significantly enriched in pancreatic beta cells obtained by laser capture microdissection or sorted by flow cytometry and in EndoC-βH1 cells, but not in insulin sensitive tissues. Furthermore, the knockdown of seven T2D-susceptibility genes (CDKN2A, GCK, HNF4A, KCNK16, SLC30A8, TBC1D4, and TCF19) with already known expression and/or function in beta cells changed insulin secretion, supporting our functional approach. We showed first evidence for a role in insulin secretion of four candidate T2D-susceptibility genes (PRC1, SRR, ZFAND3, and ZFAND6) with no previous knowledge of presence and function in beta cells. RNA-seq in EndoC-βH1 cells with decreased expression of PRC1, SRR, ZFAND6, or ZFAND3 identified specific gene networks related to T2D pathophysiology. Finally, a positive correlation between the expression of Ins2 and the
GESearch: An Interactive GUI Tool for Identifying Gene Expression Signature

Directory of Open Access Journals (Sweden)

Ning Ye

2015-01-01

The huge amount of gene expression data generated by microarray and next-generation sequencing technologies presents challenges for extracting their biological meaning. When searching for co-expressed genes, the data mining process is strongly affected by the choice of algorithm. Thus, it is highly desirable to provide multiple algorithm options in a user-friendly analytical toolkit for exploring gene expression signatures. For this purpose, we developed GESearch, an interactive graphical user interface (GUI) toolkit, which is written in MATLAB and supports a variety of gene expression data files. This analytical toolkit provides four models, including the mean, the regression, the delegate, and the ensemble models, to identify co-expressed genes, and enables users to filter data and to select gene expression patterns by browsing the display window or by importing knowledge-based genes. Subsequently, the utility of this analytical toolkit is demonstrated by analyzing two real-life microarray datasets from cell-cycle experiments. Overall, we have developed an interactive GUI toolkit that allows users to choose among multiple algorithms for analyzing gene expression signatures.
Molecular evolution of candidate genes for crop-related traits in sunflower (Helianthus annuus L.).

Science.gov (United States)

Mandel, Jennifer R; McAssey, Edward V; Nambeesan, Savithri; Garcia-Navarro, Elena; Burke, John M

2014-01-01

Evolutionary analyses aimed at detecting the molecular signature of selection during crop domestication and/or improvement can be used to identify genes or genomic regions of likely agronomic importance. Here, we describe the DNA sequence-based characterization of a pool of candidate genes for crop-related traits in sunflower. These genes, which were identified based on homology to genes of known effect in other study systems, were initially sequenced from a panel of improved lines. All genes that exhibited a paucity of sequence diversity, consistent with the possible effects of selection during the evolution of cultivated sunflower, were then sequenced from a panel of wild sunflower accessions and an outgroup. These data enabled formal tests for the effects of selection in shaping sequence diversity at these loci. When selection was detected, we further sequenced these genes from a panel of primitive landraces, thereby allowing us to investigate the likely timing of selection (i.e., domestication vs. improvement). We ultimately identified seven genes that exhibited the signature of positive selection during either domestication or improvement. Genetic mapping of a subset of these genes revealed co-localization between candidates for genes involved in the determination of flowering time, seed germination, plant growth/development, and branching and QTL that were previously identified for these traits in cultivated × wild sunflower mapping populations.
[Phylogenetic analysis of closely related Leuconostoc citreum species based on partial housekeeping genes].

Science.gov (United States)

Lv, Qiang; Chen, Ming; Xu, Haiyan; Song, Yuqin; Sun, Zhihong; Dan, Tong; Sun, Tiansong

2013-07-04

Using the 16S rRNA, dnaA, murC and pyrG gene sequences, we identified the phylogenetic relationships among closely related Leuconostoc citreum species. Seven Leu. citreum strains originally isolated from sourdough were characterized by PCR amplification of the dnaA, murC and pyrG gene sequences, which were determined to assess their suitability as phylogenetic markers. We then estimated genetic distances and constructed phylogenetic trees from the 16S rRNA gene and the three housekeeping genes mentioned above, combined with the corresponding published sequences. Comparison of the phylogenetic trees showed that the topologies of the three housekeeping gene trees were consistent with that of the 16S rRNA gene tree. The homology among closely related Leu. citreum species differed between the dnaA, murC, pyrG and 16S rRNA gene sequences, ranging from 75.5% to 97.2%, 50.2% to 99.7%, 65.0% to 99.8% and 98.5% to 100%, respectively. The phylogenetic relationships inferred from the three housekeeping genes were highly consistent with those from the 16S rRNA gene sequence, while the genetic distances of the housekeeping genes were much higher than those of the 16S rRNA gene. Consequently, the dnaA, murC and pyrG genes are suitable for the classification and identification of closely related Leu. citreum species.
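The homology ranges quoted in the abstract above are pairwise sequence similarities. The abstract does not state which similarity or distance measure was used, so the following Python sketch shows only the simplest option, percent identity over gap-free aligned positions and the matching uncorrected p-distance. The function names and the two short sequences are hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percent of aligned, gap-free positions at which two sequences match.

    Both inputs must already be aligned to the same length; '-' marks a gap.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # skip columns where either sequence has a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no gap-free positions to compare")
    return 100.0 * matches / compared


def p_distance(seq_a, seq_b):
    """Uncorrected genetic distance: proportion of differing sites."""
    return 1.0 - percent_identity(seq_a, seq_b) / 100.0


# Hypothetical aligned fragments that differ at one of ten sites.
identity = percent_identity("ATGCATGCAT", "ATGCTTGCAT")
distance = p_distance("ATGCATGCAT", "ATGCTTGCAT")
```

A marker with a wider identity range between strains, as reported for the housekeeping genes relative to 16S rRNA, gives larger pairwise distances and therefore more resolution among closely related strains.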
Transcriptome Profiling to Identify Genes Involved in Mesosulfuron-Methyl Resistance in Alopecurus aequalis

Directory of Open Access Journals (Sweden)

Ning Zhao

2017-08-01

Non-target-site resistance (NTSR) to herbicides is a worldwide concern for weed control. However, as the dominant NTSR mechanism in weeds, metabolic resistance is not yet well-characterized at the genetic level. For this study, we have identified a shortawn foxtail (Alopecurus aequalis Sobol.) population displaying both TSR and NTSR to mesosulfuron-methyl and fenoxaprop-P-ethyl, yet the molecular basis for this NTSR remains unclear. To investigate the mechanisms of metabolic resistance, an RNA-Seq transcriptome analysis was used to find candidate genes that may confer metabolic resistance to the herbicide mesosulfuron-methyl in this plant population. The RNA-Seq libraries generated 831,846,736 clean reads. The de novo transcriptome assembly yielded 95,479 unigenes (averaging 944 bp in length) that were assigned putative annotations. Among these, a total of 29,889 unigenes were assigned to 67 GO terms that contained three main categories, and 14,246 unigenes assigned to 32 predicted KEGG metabolic pathways. Global gene expression was measured using the reads generated from the untreated control (CK), water-only control (WCK), and mesosulfuron-methyl treatment (T) of R and susceptible (S). Contigs that showed expression differences between mesosulfuron-methyl-treated R and S biotypes, and between mesosulfuron-methyl-treated, water-treated and untreated R plants were selected for further quantitative real-time PCR (qRT-PCR) validation analyses. Seventeen contigs were consistently highly expressed in the resistant A. aequalis plants, including four cytochrome P450 monooxygenase (CytP450) genes, two glutathione S-transferase (GST) genes, two glucosyltransferase (GT) genes, two ATP-binding cassette (ABC) transporter genes, and seven additional contigs with functional annotations related to oxidation, hydrolysis, and plant stress physiology. These 17 contigs could serve as major candidate genes for contributing to metabolic mesosulfuron-methyl resistance; hence
Suppression subtractive hybridization and comparative expression analysis to identify developmentally regulated genes in filamentous fungi.

Science.gov (United States)

Gesing, Stefan; Schindler, Daniel; Nowrousian, Minou

2013-09-01

Ascomycetes differentiate four major morphological types of fruiting bodies (apothecia, perithecia, pseudothecia and cleistothecia) that are derived from an ancestral fruiting body. Thus, fruiting body differentiation is most likely controlled by a set of common core genes. One way to identify such genes is to search for genes with evolutionary conserved expression patterns. Using suppression subtractive hybridization (SSH), we selected differentially expressed transcripts in Pyronema confluens (Pezizales) by comparing two cDNA libraries specific for sexual and for vegetative development, respectively. The expression patterns of selected genes from both libraries were verified by quantitative real time PCR. Expression of several corresponding homologous genes was found to be conserved in two members of the Sordariales (Sordaria macrospora and Neurospora crassa), a derived group of ascomycetes that is only distantly related to the Pezizales. Knockout studies with N. crassa orthologues of differentially regulated genes revealed a functional role during fruiting body development for the gene NCU05079, encoding a putative MFS peptide transporter. These data indicate conserved gene expression patterns and a functional role of the corresponding genes during fruiting body development; such genes are candidates of choice for further functional analysis. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Identifying the genes of unconventional high temperature superconductors.

Science.gov (United States)

Hu, Jiangping

We elucidate a recently emergent framework in unifying the two families of high temperature (high Tc) superconductors, cuprates and iron-based superconductors. The unification suggests that the latter is simply the counterpart of the former to realize robust extended s-wave pairing symmetries in a square lattice. The unification identifies that the key ingredient (gene) of high Tc superconductors is a quasi two dimensional electronic environment in which the d-orbitals of cations that participate in strong in-plane couplings to the p-orbitals of anions are isolated near Fermi energy. With this gene, the superexchange magnetic interactions mediated by anions could maximize their contributions to superconductivity. Creating the gene requires special arrangements between local electronic structures and crystal lattice structures. The speciality explains why high Tc superconductors are so rare. An explicit prediction is made to realize high Tc superconductivity in Co/Ni-based materials with a quasi two dimensional hexagonal lattice structure formed by trigonal bipyramidal complexes.
'Omics' approaches in tomato aimed at identifying candidate genes ...

African Journals Online (AJOL)

adriana

2013-12-04

Dec 4, 2013 ... importance for human health and nutrition. This species has ... function to genes, proteins and metabolites is still a daunting task. Major challenges ... relation of the expression pattern of genes with the accumulation pattern of ..... M, Gordon JS, Rose, JKC, Martin G, Tanksley SD, Bouzayen M,. Jahn MM ...
A 6-gene signature identifies four molecular subgroups of neuroblastoma

Directory of Open Access Journals (Sweden)

Kogner Per

2011-04-01

Background: There are currently three postulated genomic subtypes of the childhood tumour neuroblastoma (NB): Type 1, Type 2A, and Type 2B. The most aggressive forms of NB are characterized by amplification of the oncogene MYCN (MNA) and low expression of the favourable marker NTRK1. Recently, mutations or high expression of the familial predisposition gene Anaplastic Lymphoma Kinase (ALK) was associated with unfavourable biology of sporadic NB. Also, various other genes have been linked to NB pathogenesis. Results: The present study explores subgroup discrimination by gene expression profiling using three published microarray studies on NB (47 samples). Four distinct clusters were identified by Principal Components Analysis (PCA) in two separate data sets, which could be verified by an unsupervised hierarchical clustering in a third independent data set (101 NB samples) using a set of 74 discriminative genes. The expression signature of six NB-associated genes ALK, BIRC5, CCND1, MYCN, NTRK1, and PHOX2B, significantly discriminated the four clusters (p ALK, BIRC5, and PHOX2B, and was significantly associated with higher tumour stage, poor outcome and poor survival compared to the Type 1-corresponding favourable group (INSS stage 4 and/or dead of disease, p Conclusions: Based on expression profiling we have identified four molecular subgroups of neuroblastoma, which can be distinguished by a 6-gene signature. The fourth subgroup has not been described elsewhere, and efforts are currently made to further investigate this group's specific characteristics.
Microarray analysis and scale-free gene networks identify candidate regulators in drought-stressed roots of loblolly pine (P. taeda L.)

Directory of Open Access Journals (Sweden)

Bordeaux John M

2011-05-01

Background: Global transcriptional analysis of loblolly pine (Pinus taeda L.) is challenging due to limited molecular tools. PtGen2, a 26,496 feature cDNA microarray, was fabricated and used to assess drought-induced gene expression in loblolly pine propagule roots. Statistical analysis of differential expression and weighted gene correlation network analysis were used to identify drought-responsive genes and further characterize the molecular basis of drought tolerance in loblolly pine. Results: Microarrays were used to interrogate root cDNA populations obtained from 12 genotype × treatment combinations (four genotypes, three watering regimes). Comparison of drought-stressed roots with roots from the control treatment identified 2445 genes displaying at least a 1.5-fold expression difference (false discovery rate = 0.01). Genes commonly associated with drought response in pine and other plant species, as well as a number of abiotic and biotic stress-related genes, were up-regulated in drought-stressed roots. Only 76 genes were identified as differentially expressed in drought-recovered roots, indicating that the transcript population can return to the pre-drought state within 48 hours. Gene correlation analysis predicts a scale-free network topology and identifies eleven co-expression modules that ranged in size from 34 to 938 members. Network topological parameters identified a number of central nodes (hubs) including those with significant homology (E-values ≤ 2 × 10^-30) to 9-cis-epoxycarotenoid dioxygenase, zeatin O-glucosyltransferase, and ABA-responsive protein. Identified hubs also include genes that have been associated previously with osmotic stress, phytohormones, enzymes that detoxify reactive oxygen species, and several genes of unknown function. Conclusion: PtGen2 was used to evaluate transcriptome responses in loblolly pine and was leveraged to identify 2445 differentially expressed genes responding to severe drought stress in
Microarray analysis and scale-free gene networks identify candidate regulators in drought-stressed roots of loblolly pine (P. taeda L.)

Science.gov (United States)

2011-01-01

Background: Global transcriptional analysis of loblolly pine (Pinus taeda L.) is challenging due to limited molecular tools. PtGen2, a 26,496 feature cDNA microarray, was fabricated and used to assess drought-induced gene expression in loblolly pine propagule roots. Statistical analysis of differential expression and weighted gene correlation network analysis were used to identify drought-responsive genes and further characterize the molecular basis of drought tolerance in loblolly pine. Results: Microarrays were used to interrogate root cDNA populations obtained from 12 genotype × treatment combinations (four genotypes, three watering regimes). Comparison of drought-stressed roots with roots from the control treatment identified 2445 genes displaying at least a 1.5-fold expression difference (false discovery rate = 0.01). Genes commonly associated with drought response in pine and other plant species, as well as a number of abiotic and biotic stress-related genes, were up-regulated in drought-stressed roots. Only 76 genes were identified as differentially expressed in drought-recovered roots, indicating that the transcript population can return to the pre-drought state within 48 hours. Gene correlation analysis predicts a scale-free network topology and identifies eleven co-expression modules that ranged in size from 34 to 938 members. Network topological parameters identified a number of central nodes (hubs) including those with significant homology (E-values ≤ 2 × 10^-30) to 9-cis-epoxycarotenoid dioxygenase, zeatin O-glucosyltransferase, and ABA-responsive protein. Identified hubs also include genes that have been associated previously with osmotic stress, phytohormones, enzymes that detoxify reactive oxygen species, and several genes of unknown function. Conclusion: PtGen2 was used to evaluate transcriptome responses in loblolly pine and was leveraged to identify 2445 differentially expressed genes responding to severe drought stress in roots. Many of the
Identification, Characterization and Expression Analysis of Cell Wall Related Genes in Sorghum bicolor (L.) Moench, a Food, Fodder and Biofuel Crop

Directory of Open Access Journals (Sweden)

KRISHAN MOHAN RAI

2016-08-01

Biomass based alternative fuels offer a solution to the world’s ever-increasing energy demand. With the ability to produce high biomass in marginal lands with low inputs, sorghum has a great potential to meet second-generation biofuel needs. Despite the importance of the sorghum crop in the biofuel and fodder industries, there is no comprehensive information available on the cell wall related genes and gene families (biosynthetic and modification). It is important to identify the cell wall related genes to understand the cell wall biosynthetic process as well as to facilitate biomass manipulation. Genome-wide analysis using gene family specific Hidden Markov Models of conserved domains identified 520 genes distributed among 20 gene families related to biosynthesis/modification of various cell wall polymers such as cellulose, hemicellulose, pectin and lignin. Chromosomal localization analysis of these genes revealed that about 65% of cell wall related genes were confined to four chromosomes (Chr. 1-4). Further, 53 tandem duplication events involving 146 genes were identified in these gene families, which could be associated with expansion of genes within families in sorghum. Additionally, we identified 137 Simple Sequence Repeats related to 112 genes and target sites for 10 miRNAs in some important families such as cellulose synthase, cellulose synthase-like and laccases. To gain further insight into potential functional roles, expression analysis of these gene families was performed using publicly available data sets in various tissues and under abiotic stress conditions. Expression analysis showed tissue specificity as well as differential expression under abiotic stress conditions. Overall, our study provides comprehensive information on cell wall related gene families in sorghum, which offers a valuable resource to develop strategies for altering biomass composition by plant breeding and genetic engineering approaches.

Identifying human disease genes through cross-species gene mapping of evolutionary conserved processes.

Directory of Open Access Journals (Sweden)

Martin Poot

2011-05-01

Understanding complex networks that modulate development in humans is hampered by genetic and phenotypic heterogeneity within and between populations. Here we present a method that exploits natural variation in highly diverse mouse genetic reference panels in which genetic and environmental factors can be tightly controlled. The aim of our study is to test a cross-species genetic mapping strategy, which compares data of gene mapping in human patients with functional data obtained by QTL mapping in recombinant inbred mouse strains in order to prioritize human disease candidate genes. We exploit evolutionary conservation of developmental phenotypes to discover gene variants that influence brain development in humans. We studied corpus callosum volume in a recombinant inbred mouse panel (C57BL/6J×DBA/2J, BXD strains) using high-field strength MRI technology. We aligned mouse mapping results for this neuro-anatomical phenotype with genetic data from patients with abnormal corpus callosum (ACC) development. From the 61 syndromes which involve an ACC, 51 human candidate genes have been identified. Through interval mapping, we identified a single significant QTL on mouse chromosome 7 for corpus callosum volume with a QTL peak located between 25.5 and 26.7 Mb. Comparing the genes in this mouse QTL region with those associated with human syndromes (involving ACC) and those covered by copy number variations (CNV) yielded a single overlap, namely HNRPU in humans and Hnrpul1 in mice. Further analysis of corpus callosum volume in BXD strains revealed that the corpus callosum was significantly larger in BXD mice with a B genotype at the Hnrpul1 locus than in BXD mice with a D genotype at Hnrpul1 (F = 22.48, p < 9.87 × 10^-5). This approach that exploits highly diverse mouse strains provides an efficient and effective translational bridge to study the etiology of human developmental disorders, such as autism and schizophrenia.
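The genotype comparison reported at the end of the abstract above is summarized by an F statistic. The abstract does not name the test, so the following Python sketch assumes a one-way analysis of variance between the two genotype groups; the function name and the volume values are hypothetical and are not the study's measurements.

```python
def one_way_f(groups):
    """One-way ANOVA F statistic for a list of groups of measurements.

    F = (SS_between / (k - 1)) / (SS_within / (N - k)),
    where k is the number of groups and N the total number of values.
    """
    all_values = [v for group in groups for v in group]
    n_total = len(all_values)
    k = len(groups)
    grand_mean = sum(all_values) / n_total
    means = [sum(group) / len(group) for group in groups]
    # Variation of the group means around the grand mean.
    ss_between = sum(
        len(group) * (mean - grand_mean) ** 2 for group, mean in zip(groups, means)
    )
    # Variation of individual values around their own group mean.
    ss_within = sum(
        (v - mean) ** 2 for group, mean in zip(groups, means) for v in group
    )
    return (ss_between / (k - 1)) / (ss_within / (n_total - k))


# Hypothetical corpus callosum volumes (arbitrary units) by genotype at one locus.
b_genotype = [10.0, 12.0, 14.0]
d_genotype = [4.0, 6.0, 8.0]
f_stat = one_way_f([b_genotype, d_genotype])
```

For these toy values the between-group sum of squares is 54 and the within-group sum of squares is 16, giving F = 54 / (16 / 4) = 13.5. A p-value would then come from the F distribution with (1, 4) degrees of freedom, which this sketch does not compute.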
Bioinformatic analysis of patient-derived ASPS gene expressions and ASPL-TFE3 fusion transcript levels identify potential therapeutic targets.

Directory of Open Access Journals (Sweden)

David G Covell

Full Text Available Gene expression data, collected from ASPS tumors of seven different patients and from one immortalized ASPS cell line (ASPS-1), were analyzed jointly with patient ASPL-TFE3 (t(X;17)(p11;q25)) fusion transcript data to identify disease-specific pathways and their component genes. Data analysis of the pooled patient and ASPS-1 gene expression data, using conventional clustering methods, revealed a relatively small set of pathways and genes characterizing the biology of ASPS. These results could be largely recapitulated using only the gene expression data collected from patient tumor samples. The concordance between expression measures derived from ASPS-1 and both pooled and individual patient tumor data provided a rationale for extending the analysis to include patient ASPL-TFE3 fusion transcript data. A novel linear model was exploited to link gene expressions to fusion transcript data and used to identify a small set of ASPS-specific pathways and their gene expression. Cellular pathways that appear aberrantly regulated in response to the t(X;17)(p11;q25) translocation include the cell cycle and cell adhesion. The identification of pathways and gene subsets characteristic of ASPS supports current therapeutic strategies that target FLT1 and MET, while also proposing additional targeting of genes found in pathways involved in the cell cycle (CHK1), cell adhesion (ARHGD1A), cell division (CDC6), control of meiosis (RAD51L3) and mitosis (BIRC5), and chemokine-related protein tyrosine kinase activity (CCL4).
Transcriptome Analysis of Calcium- and Hormone-Related Gene Expressions during Different Stages of Peanut Pod Development

Science.gov (United States)

Li, Yan; Meng, Jingjing; Yang, Sha; Guo, Feng; Zhang, Jialei; Geng, Yun; Cui, Li; Wan, Shubo; Li, Xinguo

2017-01-01

Peanut is one of the calciphilous plants. Calcium serves as a ubiquitous central hub in a large number of signaling pathways. In the field, free calcium ion (Ca2+)-deficient soil can result in unfilled pods. Four pod stages were analyzed to determine the relationship between Ca2+ excretion and pod development. Peanut shells showed Ca2+ excretion at all four stages; however, both the embryo of Stage 4 (S4) and the red skin of Stage 3 (S3) showed Ca2+ absorbance. These results showed that embryo and red skin of peanut need Ca2+ during development. In order to survey the relationship among calcium, hormone and seed development from gene perspective, we further analyzed the seed transcriptome at Stage 2 (S2), S3, and S4. About 70 million high quality clean reads were generated, which were assembled into 58,147 unigenes. By comparing these three stages, total 4,457 differentially expressed genes were identified. In these genes, 53 Ca2+ related genes, 40 auxin related genes, 15 gibberellin genes, 20 ethylene related genes, 2 abscisic acid related genes, and 7 cytokinin related genes were identified. Additionally, a part of them were validated by qRT-PCR. Most of their expressions changed during the pod development. Since some reports showed that Ca2+ signal transduction pathway is involved in hormone regulation pathway, these results implied that peanut seed development might be regulated by the collaboration of Ca2+ signal transduction pathway and hormone regulation pathway. PMID:28769950
Transcriptome Analysis of Calcium- and Hormone-Related Gene Expressions during Different Stages of Peanut Pod Development

Directory of Open Access Journals (Sweden)

Yan Li

2017-07-01

Full Text Available Peanut is one of the calciphilous plants. Calcium serves as a ubiquitous central hub in a large number of signaling pathways. In the field, free calcium ion (Ca2+)-deficient soil can result in unfilled pods. Four pod stages were analyzed to determine the relationship between Ca2+ excretion and pod development. Peanut shells showed Ca2+ excretion at all four stages; however, both the embryo of Stage 4 (S4) and the red skin of Stage 3 (S3) showed Ca2+ absorbance. These results showed that embryo and red skin of peanut need Ca2+ during development. In order to survey the relationship among calcium, hormone and seed development from gene perspective, we further analyzed the seed transcriptome at Stage 2 (S2), S3, and S4. About 70 million high quality clean reads were generated, which were assembled into 58,147 unigenes. By comparing these three stages, total 4,457 differentially expressed genes were identified. In these genes, 53 Ca2+ related genes, 40 auxin related genes, 15 gibberellin genes, 20 ethylene related genes, 2 abscisic acid related genes, and 7 cytokinin related genes were identified. Additionally, a part of them were validated by qRT-PCR. Most of their expressions changed during the pod development. Since some reports showed that Ca2+ signal transduction pathway is involved in hormone regulation pathway, these results implied that peanut seed development might be regulated by the collaboration of Ca2+ signal transduction pathway and hormone regulation pathway.
Hotspots of missense mutation identify novel neurodevelopmental disorder genes and functional domains

Science.gov (United States)

Geisheker, Madeleine R.; Heymann, Gabriel; Wang, Tianyun; Coe, Bradley P.; Turner, Tychele N.; Stessman, Holly A.F.; Hoekzema, Kendra; Kvarnung, Malin; Shaw, Marie; Friend, Kathryn; Liebelt, Jan; Barnett, Christopher; Thompson, Elizabeth M.; Haan, Eric; Guo, Hui; Anderlid, Britt-Marie; Nordgren, Ann; Lindstrand, Anna; Vandeweyer, Geert; Alberti, Antonino; Avola, Emanuela; Vinci, Mirella; Giusto, Stefania; Pramparo, Tiziano; Pierce, Karen; Nalabolu, Srinivasa; Michaelson, Jacob J.; Sedlacek, Zdenek; Santen, Gijs W.E.; Peeters, Hilde; Hakonarson, Hakon; Courchesne, Eric; Romano, Corrado; Kooy, R. Frank; Bernier, Raphael A.; Nordenskjöld, Magnus; Gecz, Jozef; Xia, Kun; Zweifel, Larry S.; Eichler, Evan E.

2017-01-01

Although de novo missense mutations have been predicted to account for more cases of autism than gene-truncating mutations, most research has focused on the latter. We identified the properties of de novo missense mutations in patients with neurodevelopmental disorders (NDDs) and highlight 35 genes with excess missense mutations. Additionally, 40 amino acid sites were recurrently mutated in 36 genes, and targeted sequencing of 20 sites in 17,689 NDD patients identified 21 new patients with identical missense mutations. One recurrent site (p.Ala636Thr) occurs in a glutamate receptor subunit, GRIA1. This same amino acid substitution in the homologous but distinct mouse glutamate receptor subunit Grid2 is associated with Lurcher ataxia. Phenotypic follow-up in five individuals with GRIA1 mutations shows evidence of specific learning disabilities and autism. Overall, we find significant clustering of de novo mutations in 200 genes, highlighting specific functional domains and synaptic candidate genes important in NDD pathology. PMID:28628100
Identifying the rooted species tree from the distribution of unrooted gene trees under the coalescent.

Science.gov (United States)

Allman, Elizabeth S; Degnan, James H; Rhodes, John A

2011-06-01

Gene trees are evolutionary trees representing the ancestry of genes sampled from multiple populations. Species trees represent populations of individuals-each with many genes-splitting into new populations or species. The coalescent process, which models ancestry of gene copies within populations, is often used to model the probability distribution of gene trees given a fixed species tree. This multispecies coalescent model provides a framework for phylogeneticists to infer species trees from gene trees using maximum likelihood or Bayesian approaches. Because the coalescent models a branching process over time, all trees are typically assumed to be rooted in this setting. Often, however, gene trees inferred by traditional phylogenetic methods are unrooted. We investigate probabilities of unrooted gene trees under the multispecies coalescent model. We show that when there are four species with one gene sampled per species, the distribution of unrooted gene tree topologies identifies the unrooted species tree topology and some, but not all, information in the species tree edges (branch lengths). The location of the root on the species tree is not identifiable in this situation. However, for 5 or more species with one gene sampled per species, we show that the distribution of unrooted gene tree topologies identifies the rooted species tree topology and all its internal branch lengths. The length of any pendant branch leading to a leaf of the species tree is also identifiable for any species from which more than one gene is sampled.
Microarray profiling of mononuclear peripheral blood cells identifies novel candidate genes related to chemoradiation response in rectal cancer.

Directory of Open Access Journals (Sweden)

Pablo Palma

Full Text Available Preoperative chemoradiation significantly improves oncological outcome in locally advanced rectal cancer. However, there is no effective method of predicting tumor response to chemoradiation in these patients. Peripheral blood mononuclear cells have emerged recently as pathology markers of cancer and other diseases, making possible their use as therapy predictors. Furthermore, the importance of the immune response in radiosensitivity of solid organs led us to hypothesize that microarray gene expression profiling of peripheral blood mononuclear cells could identify patients with response to chemoradiation in rectal cancer. Thirty-five patients with locally advanced rectal cancer were recruited initially to perform the study. Peripheral blood samples were obtained before neoadjuvant treatment. RNA was extracted and purified to obtain cDNA and cRNA for hybridization of microarrays included in Human WG CodeLink bioarrays. Quantitative real time PCR was used to validate microarray experiment data. Results were correlated with pathological response, according to Mandard's criteria and final UICC Stage (patients with tumor regression grade 1-2 and downstaging being defined as responders and patients with grade 3-5 and no downstaging as non-responders). Twenty-seven out of 35 patients were finally included in the study. We performed a multiple t-test using Significance Analysis of Microarrays to find those genes differing significantly in expression between responders (n = 11) and non-responders (n = 16) to CRT. The differently expressed genes were: BC 035656.1, CIR, PRDM2, CAPG, FALZ, HLA-DPB2, NUPL2, and ZFP36. The measurement of FALZ (p = 0.029) gene expression level, determined by qRT-PCR, showed statistically significant differences between the two groups. Gene expression profiling reveals novel genes in peripheral blood samples of mononuclear cells that could predict responders and non-responders to chemoradiation in patients with
Cross-species microarray hybridization to identify developmentally regulated genes in the filamentous fungus Sordaria macrospora.

Science.gov (United States)

Nowrousian, Minou; Ringelberg, Carol; Dunlap, Jay C; Loros, Jennifer J; Kück, Ulrich

2005-04-01

The filamentous fungus Sordaria macrospora forms complex three-dimensional fruiting bodies that protect the developing ascospores and ensure their proper discharge. Several regulatory genes essential for fruiting body development were previously isolated by complementation of the sterile mutants pro1, pro11 and pro22. To establish the genetic relationships between these genes and to identify downstream targets, we have conducted cross-species microarray hybridizations using cDNA arrays derived from the closely related fungus Neurospora crassa and RNA probes prepared from wild-type S. macrospora and the three developmental mutants. Of the 1,420 genes which gave a signal with the probes from all the strains used, 172 (12%) were regulated differently in at least one of the three mutants compared to the wild type, and 17 (1.2%) were regulated differently in all three mutant strains. Microarray data were verified by Northern analysis or quantitative real time PCR. Among the genes that are up- or down-regulated in the mutant strains are genes encoding the pheromone precursors, enzymes involved in melanin biosynthesis and a lectin-like protein. Analysis of gene expression in double mutants revealed a complex network of interaction between the pro gene products.
Identifying differential miR and gene consensus patterns in peripheral blood of patients with cardiovascular diseases from literature data.

Science.gov (United States)

Šatrauskienė, Agnė; Navickas, Rokas; Laucevičius, Aleksandras; Huber, Heinrich J

2017-06-30

Numerous recent studies suggest the potential of circulating MicroRNAs (miRs) in peripheral blood samples as diagnostic or prognostic markers for coronary artery disease (CAD), acute coronary syndrome (ACS) and heart failure (HF). However, literature often remains inconclusive as to which markers are most indicative for which of the above diseases. This shortcoming is mainly due to the lack of systematic analyses and absence of information on the functional pathophysiological role of these miRs and their target genes. We here provide an easy-to-use scoring approach to investigate the likelihood of regulation of several miRs and their target genes from literature by identifying consensus patterns of regulation. We therefore have screened over 1000 articles that study mRNA markers in cardiovascular and metabolic diseases, and devised a scoring algorithm to identify consensus means for miR and gene regulation across several studies. We then aimed to identify differential markers between CAD, ACS and HF. We first identified miRs (miR-122, -126, -223, -138 and -370) as commonly regulated within a group of metabolic diseases, while investigating cardiac-related pathologies (CAD, ACS, HF) revealed a decisive role of miR-1, -499, -208b, and -133a. Looking at differential markers between cardiovascular diseases revealed miR-1, miR-208a and miR-133a to distinguish ACS and CAD from HF. Relating differentially expressed miRs to their putative gene targets using MirTarBase, we further identified HCN2/4 and LASP1 as potential markers of CAD and ACS, but not in HF. Likewise, BLC-2 was found oppositely regulated between CAD and HF. Interestingly, while studying overlap in target genes between CAD, ACS and HF revealed little similarity, mapping these genes to gene ontology terms revealed a surprising similarity between CAD and ACS compared to HF. We conclude that our analysis using gene and miR scores allows the extraction of meaningful markers and the elucidation
Sex-related differences in gene expression in human skeletal muscle.

Directory of Open Access Journals (Sweden)

Stephen Welle

2008-01-01

Full Text Available There is sexual dimorphism of skeletal muscle, the most obvious feature being the larger muscle mass of men. The molecular basis for this difference has not been clearly defined. To identify genes that might contribute to the relatively greater muscularity of men, we compared skeletal muscle gene expression profiles of 15 normal men and 15 normal women by using comprehensive oligonucleotide microarrays. Although there were sex-related differences in expression of several hundred genes, very few of the differentially expressed genes have functions that are obvious candidates for explaining the larger muscle mass of men. The men tended to have higher expression of genes encoding mitochondrial proteins, ribosomal proteins, and a few translation initiation factors. The women had >2-fold greater expression than the men (P<0.0001) of two genes that encode proteins in growth factor pathways known to be important in regulating muscle mass: growth factor receptor-bound 10 (GRB10) and activin A receptor IIB (ACVR2B). GRB10 encodes a protein that inhibits insulin-like growth factor-1 (IGF-1) signaling. ACVR2B encodes a myostatin receptor. Quantitative RT-PCR confirmed higher expression of GRB10 and ACVR2B genes in these women. In an independent microarray study of 10 men and 9 women with facioscapulohumeral dystrophy, women had higher expression of GRB10 (2.7-fold, P<0.001) and ACVR2B (1.7-fold, P<0.03). If these sex-related differences in mRNA expression lead to reduced IGF-1 activity and increased myostatin activity, they could contribute to the sex difference in muscle size.
Macular xanthophylls, lipoprotein-related genes, and age-related macular degeneration

Science.gov (United States)

Koo, Euna; Neuringer, Martha; SanGiovanni, John Paul

2014-01-01

Plant-based macular xanthophylls (MXs; lutein and zeaxanthin) and the lutein metabolite meso-zeaxanthin are the major constituents of macular pigment, a compound concentrated in retinal areas that are responsible for fine-feature visual sensation. There is an unmet need to examine the genetics of factors influencing regulatory mechanisms and metabolic fates of these 3 MXs because they are linked to processes implicated in the pathogenesis of age-related macular degeneration (AMD). In this work we provide an overview of evidence supporting a molecular basis for AMD-MX associations as they may relate to DNA sequence variation in AMD- and lipoprotein-related genes. We recognize a number of emerging research opportunities, barriers, knowledge gaps, and tools offering promise for meaningful investigation and inference in the field. Overviews on AMD- and high-density lipoprotein (HDL)–related genes encoding receptors, transporters, and enzymes affecting or affected by MXs are followed with information on localization of products from these genes to retinal cell types manifesting AMD-related pathophysiology. Evidence on the relation of each gene or gene product with retinal MX response to nutrient intake is discussed. This information is followed by a review of results from mechanistic studies testing gene-disease relations. We then present findings on relations of AMD with DNA sequence variants in MX-associated genes. Our conclusion is that AMD-associated DNA variants that influence the actions and metabolic fates of HDL system constituents should be examined further for concomitant influence on MX absorption, retinal tissue responses to MX intake, and the capacity to modify MX-associated factors and processes implicated in AMD pathogenesis. PMID:24829491
Sparse canonical correlation analysis for identifying, connecting and completing gene-expression networks

NARCIS (Netherlands)

Waaijenborg, S.; Zwinderman, A.H.

2009-01-01

ABSTRACT: BACKGROUND: We generalized penalized canonical correlation analysis for analyzing microarray gene-expression measurements for checking completeness of known metabolic pathways and identifying candidate genes for incorporation in the pathway. We used Wold's method for calculation of the
Genome-Wide Association Study Identifying Candidate Genes Influencing Important Agronomic Traits of Flax (Linum usitatissimum L.) Using SLAF-seq.

Science.gov (United States)

Xie, Dongwei; Dai, Zhigang; Yang, Zemao; Sun, Jian; Zhao, Debao; Yang, Xue; Zhang, Liguo; Tang, Qing; Su, Jianguang

2017-01-01

Flax ( Linum usitatissimum L.) is an important cash crop, and its agronomic traits directly affect yield and quality. Molecular studies on flax remain inadequate because relatively few flax genes have been associated with agronomic traits or have been identified as having potential applications. To identify markers and candidate genes that can potentially be used for genetic improvement of crucial agronomic traits, we examined 224 specimens of core flax germplasm; specifically, phenotypic data for key traits, including plant height, technical length, number of branches, number of fruits, and 1000-grain weight were investigated under three environmental conditions before specific-locus amplified fragment sequencing (SLAF-seq) was employed to perform a genome-wide association study (GWAS) for these five agronomic traits. Subsequently, the results were used to screen single nucleotide polymorphism (SNP) loci and candidate genes that exhibited a significant correlation with the important agronomic traits. Our analyses identified a total of 42 SNP loci that showed significant correlations with the five important agronomic flax traits. Next, candidate genes were screened in the 10 kb zone of each of the 42 SNP loci. These SNP loci were then analyzed by a more stringent screening via co-identification using both a general linear model (GLM) and a mixed linear model (MLM) as well as co-occurrences in at least two of the three environments, whereby 15 final candidate genes were obtained. Based on these results, we determined that UGT and PL are candidate genes for plant height, GRAS and XTH are candidate genes for the number of branches, Contig1437 and LU0019C12 are candidate genes for the number of fruits, and PHO1 is a candidate gene for the 1000-seed weight. We propose that the identified SNP loci and corresponding candidate genes might serve as a biological basis for improving crucial agronomic flax traits.
Genome-Wide Association Study Identifying Candidate Genes Influencing Important Agronomic Traits of Flax (Linum usitatissimum L.) Using SLAF-seq

Directory of Open Access Journals (Sweden)

Dongwei Xie

2018-01-01

Full Text Available Flax (Linum usitatissimum L.) is an important cash crop, and its agronomic traits directly affect yield and quality. Molecular studies on flax remain inadequate because relatively few flax genes have been associated with agronomic traits or have been identified as having potential applications. To identify markers and candidate genes that can potentially be used for genetic improvement of crucial agronomic traits, we examined 224 specimens of core flax germplasm; specifically, phenotypic data for key traits, including plant height, technical length, number of branches, number of fruits, and 1000-grain weight were investigated under three environmental conditions before specific-locus amplified fragment sequencing (SLAF-seq) was employed to perform a genome-wide association study (GWAS) for these five agronomic traits. Subsequently, the results were used to screen single nucleotide polymorphism (SNP) loci and candidate genes that exhibited a significant correlation with the important agronomic traits. Our analyses identified a total of 42 SNP loci that showed significant correlations with the five important agronomic flax traits. Next, candidate genes were screened in the 10 kb zone of each of the 42 SNP loci. These SNP loci were then analyzed by a more stringent screening via co-identification using both a general linear model (GLM) and a mixed linear model (MLM) as well as co-occurrences in at least two of the three environments, whereby 15 final candidate genes were obtained. Based on these results, we determined that UGT and PL are candidate genes for plant height, GRAS and XTH are candidate genes for the number of branches, Contig1437 and LU0019C12 are candidate genes for the number of fruits, and PHO1 is a candidate gene for the 1000-seed weight. We propose that the identified SNP loci and corresponding candidate genes might serve as a biological basis for improving crucial agronomic flax traits.
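The stringent screening step described in the flax abstract (co-identification by both GLM and MLM, plus occurrence in at least two of three environments) can be illustrated with a small filter. This is only a sketch under stated assumptions: the abstract does not say exactly how the two criteria were combined, so the rule below (significant under both models, and seen in at least two environments overall) is one possible reading, and the function name, SNP names and environment labels are all hypothetical.

```python
from collections import defaultdict

def co_identified_snps(hits, min_envs=2):
    """Return SNPs significant under both GLM and MLM and seen in >= min_envs environments.

    hits: iterable of (snp, model, environment) tuples, one per significant association.
    The combination rule is an assumption made for illustration only.
    """
    envs = defaultdict(lambda: defaultdict(set))
    for snp, model, env in hits:
        envs[snp][model].add(env)

    kept = []
    for snp, by_model in envs.items():
        in_both_models = bool(by_model["GLM"]) and bool(by_model["MLM"])
        all_envs = by_model["GLM"] | by_model["MLM"]
        if in_both_models and len(all_envs) >= min_envs:
            kept.append(snp)
    return sorted(kept)

# Hypothetical toy data: snp_A passes, snp_B lacks MLM support, snp_C has one environment.
hits = [
    ("snp_A", "GLM", "env1"), ("snp_A", "MLM", "env1"), ("snp_A", "MLM", "env2"),
    ("snp_B", "GLM", "env1"), ("snp_B", "GLM", "env3"),
    ("snp_C", "GLM", "env2"), ("snp_C", "MLM", "env2"),
]
print(co_identified_snps(hits))
```

Lowering `min_envs` to 1 would also admit `snp_C`, which shows how much the environment criterion alone narrows the candidate list in this toy case.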
G-NEST: a gene neighborhood scoring tool to identify co-conserved, co-expressed genes

Directory of Open Access Journals (Sweden)

Lemay Danielle G

2012-09-01

Full Text Available Background: In previous studies, gene neighborhoods (spatial clusters of co-expressed genes in the genome) have been defined using arbitrary rules such as requiring adjacency, a minimum number of genes, a fixed window size, or a minimum expression level. In the current study, we developed a Gene Neighborhood Scoring Tool (G-NEST), which combines genomic location, gene expression, and evolutionary sequence conservation data to score putative gene neighborhoods across all possible window sizes simultaneously. Results: Using G-NEST on atlases of mouse and human tissue expression data, we found that large neighborhoods of ten or more genes are extremely rare in mammalian genomes. When they do occur, neighborhoods are typically composed of families of related genes. Both the highest scoring and the largest neighborhoods in mammalian genomes are formed by tandem gene duplication. Mammalian gene neighborhoods contain highly and variably expressed genes. Co-localized noisy gene pairs exhibit lower evolutionary conservation of their adjacent genome locations, suggesting that their shared transcriptional background may be disadvantageous. Genes that are essential to mammalian survival and reproduction are less likely to occur in neighborhoods, although neighborhoods are enriched with genes that function in mitosis. We also found that gene orientation and protein-protein interactions are partially responsible for maintenance of gene neighborhoods. Conclusions: Our experiments using G-NEST confirm that tandem gene duplication is the primary driver of non-random gene order in mammalian genomes. Non-essentiality, co-functionality, gene orientation, and protein-protein interactions are additional forces that maintain gene neighborhoods, especially those formed by tandem duplicates. We expect G-NEST to be useful for other applications such as the identification of core regulatory modules, common transcriptional backgrounds, and chromatin domains. The
The prediction of candidate genes for cervix related cancer through gene ontology and graph theoretical approach.

Science.gov (United States)

Hindumathi, V; Kranthi, T; Rao, S B; Manimaran, P

2014-06-01

With rapidly changing technology, prediction of candidate genes has become an indispensable task in recent years, mainly in the field of biological research. The empirical methods for candidate gene prioritization that help to explore the potential pathway between genetic determinants and complex diseases are highly cumbersome and labor intensive. In such a scenario, predicting potential targets for a disease state through in silico approaches is of interest to researchers. The prodigious availability of protein interaction data coupled with gene annotation eases the accurate determination of disease-specific candidate genes. In our work we have prioritized the cervix related cancer candidate genes by employing the approach of Csaba Ortutay and his co-workers, which identifies candidate genes through graph theoretical centrality measures and gene ontology. With the advantage of the human protein interaction data, cervical cancer gene sets and the ontological terms, we were able to predict 15 novel candidates for cervical carcinogenesis. The disease relevance of the anticipated candidate genes was corroborated through a literature survey. Also, the presence of drugs for these candidates was detected through the Therapeutic Target Database (TTD) and DrugMap Central (DMC), which affirms that they may serve as potential drug targets for cervical cancer.
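To make the idea of ranking genes by graph theoretical centrality concrete, here is a minimal sketch that computes degree and closeness centrality on a toy protein interaction graph and sorts the genes by them. It is not the method of the cited study, whose exact measures and gene ontology filtering are not given in the abstract; the gene labels (G1 to G5), the edges and the ranking rule are hypothetical, and the graph is assumed to be connected and unweighted.

```python
from collections import deque

def degree_centrality(adj):
    """Fraction of the other nodes that each node is directly connected to."""
    n = len(adj)
    return {v: len(neighbors) / (n - 1) for v, neighbors in adj.items()}

def closeness_centrality(adj):
    """Reachable-node count divided by the sum of shortest-path lengths (BFS)."""
    result = {}
    for source in adj:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for w in adj[u]:
                if w not in dist:
                    dist[w] = dist[u] + 1
                    queue.append(w)
        total = sum(dist.values())
        result[source] = (len(dist) - 1) / total if total else 0.0
    return result

# Hypothetical undirected interaction edges between placeholder genes.
edges = [("G1", "G2"), ("G1", "G3"), ("G1", "G4"), ("G4", "G5")]
adj = {}
for a, b in edges:
    adj.setdefault(a, set()).add(b)
    adj.setdefault(b, set()).add(a)

deg = degree_centrality(adj)
clo = closeness_centrality(adj)
# Rank by degree first, then closeness; this tie-breaking rule is an illustrative choice.
ranked = sorted(adj, key=lambda g: (deg[g], clo[g]), reverse=True)
print(ranked[:2])
```

In a real analysis the top-ranked nodes would still need to be filtered by annotation (for example shared gene ontology terms with known disease genes) before being called candidates.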
Identifying miRNA and gene modules of colon cancer associated with pathological stage by weighted gene co-expression network analysis

Directory of Open Access Journals (Sweden)

Zhou X

2018-05-01

characterize the results of WGCNA. Results: Two gene modules (Gmagenta and Ggreen) and one miRNA module were associated with the pathological stage. Six hub genes (COL1A2, THBS2, BGN, COL1A1, TAGLN and DACT3) were related to prognosis and validated to be associated with the pathological stage. Five hub miRNAs were identified to be related to prognosis (hsa-miR-125b-5p, hsa-miR-145-5p, hsa-let-7c-5p, hsa-miR-218-5p and hsa-miR-125b-2-3p). A total of 18 hub genes and seven hub miRNAs were predominantly expressed in tumor stroma. Proteoglycans in cancer, focal adhesion, extracellular matrix (ECM)–receptor interaction and so on were common pathways of the three modules. Hsa-let-7c-5p was located at the core of miRNA–gene network. Conclusion: These findings help to advance the understanding of tumor stroma in the progression of CAC and provide prognostic biomarkers as well as therapeutic targets. Keywords: colon adenocarcinoma, weighted gene co-expression network analysis, differentially expressed genes, differentially expressed miRNA, tumor stroma
Identifying genes that mediate anthracyline toxicity in immune cells

Directory of Open Access Journals (Sweden)

Amber eFrick

2015-04-01

Full Text Available The role of the immune system in response to chemotherapeutic agents remains elusive. The interpatient variability observed in immune and chemotherapeutic cytotoxic responses is likely, at least in part, due to complex genetic differences. Through the use of a panel of genetically diverse mouse inbred strains, we developed a drug screening platform aimed at identifying genes underlying these chemotherapeutic cytotoxic effects on immune cells. Using genome-wide association studies (GWAS), we identified four genome-wide significant quantitative trait loci (QTL) that contributed to the sensitivity to doxorubicin and idarubicin in immune cells. Of particular interest, a locus on chromosome 16 was significantly associated with cell viability following idarubicin administration (p = 5.01 × 10^-8). Within this QTL lies App, which encodes amyloid beta precursor protein. Comparison of dose-response curves verified that T-cells in App knockout mice were more sensitive to idarubicin than those of C57BL/6J control mice (p < 0.05). In conclusion, the cellular screening approach coupled with GWAS led to the identification and subsequent validation of a gene involved in T-cell viability after idarubicin treatment. Previous studies have suggested a role for App in in vitro and in vivo cytotoxicity to anticancer agents; the overexpression of App enhances resistance, while the knockdown of this gene is deleterious to cell viability. Thus, further investigations should include performing mechanistic studies, validating additional genes from the GWAS, including Ppfia1 and Ppfibp1, and ultimately translating the findings to in vivo and human studies.
Identification of immunity related genes to study the Physalis peruviana--Fusarium oxysporum pathosystem.

Science.gov (United States)

Enciso-Rodríguez, Felix E; González, Carolina; Rodríguez, Edwin A; López, Camilo E; Landsman, David; Barrero, Luz Stella; Mariño-Ramírez, Leonardo

2013-01-01

The Cape gooseberry (Physalis peruviana L.) is an Andean exotic fruit with high nutritional value and appealing medicinal properties. However, its cultivation faces important phytosanitary problems mainly due to pathogens like Fusarium oxysporum, Cercospora physalidis and Alternaria spp. Here we used the Cape gooseberry foliar transcriptome to search for proteins that encode conserved domains related to plant immunity including: NBS (Nucleotide Binding Site), CC (Coiled-Coil), TIR (Toll/Interleukin-1 Receptor). We identified 74 immunity related gene candidates in P. peruviana which have the typical resistance gene (R-gene) architecture, 17 Receptor like kinase (RLKs) candidates related to PAMP-Triggered Immunity (PTI), eight (TIR-NBS-LRR, or TNL) and nine (CC-NBS-LRR, or CNL) candidates related to Effector-Triggered Immunity (ETI) genes among others. These candidate genes were categorized by molecular function (98%), biological process (85%) and cellular component (79%) using gene ontology. Some of the most interesting predicted roles were those associated with binding and transferase activity. We designed 94 primer pairs from the 74 immunity-related genes (IRGs) to amplify the corresponding genomic regions on six genotypes that included resistant and susceptible materials. From these, we selected 17 single band amplicons and sequenced them in 14 F. oxysporum resistant and susceptible genotypes. Sequence polymorphisms were analyzed through preliminary candidate gene association, which allowed the detection of one SNP at the PpIRG-63 marker revealing a nonsynonymous mutation in the predicted LRR domain suggesting functional roles for resistance.
Identification of immunity related genes to study the Physalis peruviana--Fusarium oxysporum pathosystem.

Directory of Open Access Journals (Sweden)

Felix E Enciso-Rodríguez

The Cape gooseberry (Physalis peruviana L.) is an Andean exotic fruit with high nutritional value and appealing medicinal properties. However, its cultivation faces important phytosanitary problems mainly due to pathogens like Fusarium oxysporum, Cercospora physalidis and Alternaria spp. Here we used the Cape gooseberry foliar transcriptome to search for proteins containing conserved domains related to plant immunity, including NBS (Nucleotide Binding Site), CC (Coiled-Coil) and TIR (Toll/Interleukin-1 Receptor). We identified 74 immunity-related gene candidates in P. peruviana which have the typical resistance gene (R-gene) architecture, 17 receptor-like kinase (RLK) candidates related to PAMP-Triggered Immunity (PTI), and eight TIR-NBS-LRR (TNL) and nine CC-NBS-LRR (CNL) candidates related to Effector-Triggered Immunity (ETI), among others. These candidate genes were categorized by molecular function (98%), biological process (85%) and cellular component (79%) using gene ontology. Some of the most interesting predicted roles were those associated with binding and transferase activity. We designed 94 primer pairs from the 74 immunity-related genes (IRGs) to amplify the corresponding genomic regions in six genotypes that included resistant and susceptible materials. From these, we selected 17 single-band amplicons and sequenced them in 14 F. oxysporum resistant and susceptible genotypes. Sequence polymorphisms were analyzed through preliminary candidate gene association, which allowed the detection of one SNP at the PpIRG-63 marker, revealing a nonsynonymous mutation in the predicted LRR domain and suggesting a functional role in resistance.

Recurrent targeted genes of hepatitis B virus in the liver cancer genomes identified by a next-generation sequencing-based approach.

Directory of Open Access Journals (Sweden)

Dong Ding

Integration of the viral DNA into host chromosomes was found in most of the hepatitis B virus (HBV)-related hepatocellular carcinomas (HCCs). Here we devised a massive anchored parallel sequencing (MAPS) method using next-generation sequencing to isolate and sequence HBV integrants. Applying MAPS to 40 pairs of HBV-related HCC tissues (cancer and adjacent tissues), we identified 296 HBV integration events corresponding to 286 unique integration sites (UISs) with precise HBV-Human DNA junctions. HBV integration favored chromosome 17 and preferentially integrated into human transcript units. HBV targeted genes were enriched in GO terms: cAMP metabolic processes, T cell differentiation and activation, TGF beta receptor pathway, ncRNA catabolic process, and dsRNA fragmentation and cellular response to dsRNA. The HBV targeted genes include 7 genes (PTPRJ, CNTN6, IL12B, MYOM1, FNDC3B, LRFN2, FN1) containing IPR003961 (Fibronectin, type III domain), 7 genes (NRG3, MASP2, NELL1, LRP1B, ADAM21, NRXN1, FN1) containing IPR013032 (EGF-like region, conserved site), and three genes (PDE7A, PDE4B, PDE11A) containing IPR002073 (3', 5'-cyclic-nucleotide phosphodiesterase). Enriched pathways include hsa04512 (ECM-receptor interaction), hsa04510 (Focal adhesion), and hsa04012 (ErbB signaling pathway). Fewer integration events were found in cancers compared to cancer-adjacent tissues, suggesting a clonal expansion model in HCC development. Finally, we identified 8 genes that were recurrent target genes by HBV integration including fibronectin 1 (FN1) and telomerase reverse transcriptase (TERT1), two known recurrent target genes, and additional novel target genes such as SMAD family member 5 (SMAD5), phosphatase and actin regulator 4 (PHACTR4), and RNA binding protein fox-1 homolog (C. elegans) 1 (RBFOX1). Integrating analysis with recently published whole-genome sequencing analysis, we identified 14 additional recurrent HBV target genes, greatly expanding the HBV recurrent target list
Candidate chemosensory genes identified in the endoparasitoid Meteorus pulchricornis (Hymenoptera: Braconidae) by antennal transcriptome analysis.

Science.gov (United States)

Sheng, Sheng; Liao, Cheng-Wu; Zheng, Yu; Zhou, Yu; Xu, Yan; Song, Wen-Miao; He, Peng; Zhang, Jian; Wu, Fu-An

2017-06-01

Meteorus pulchricornis is an endoparasitoid wasp which attacks the larvae of various lepidopteran pests. We present the first antennal transcriptome dataset for M. pulchricornis. A total of 48,845,072 clean reads were obtained and 34,967 unigenes were assembled. Of these, 15,458 unigenes showed a significant similarity (E-value < 10^-5) to known proteins in the NCBI non-redundant protein database. Gene ontology (GO) and cluster of orthologous groups (COG) analyses were used to classify the functions of M. pulchricornis antennae genes. We identified 16 putative odorant-binding protein (OBP) genes, eight chemosensory protein (CSP) genes, 99 olfactory receptor (OR) genes, 19 ionotropic receptor (IR) genes and one sensory neuron membrane protein (SNMP) gene. BLASTx best hit results and phylogenetic analysis both indicated that these chemosensory genes were most closely related to those found in other hymenopteran species. Real-time quantitative PCR assays showed that 14 MpulOBP genes were antennae-specific. Of these, MpulOBP6, MpulOBP9, MpulOBP10, MpulOBP12, MpulOBP15 and MpulOBP16 were found to have greater expression in the antennae than in other body parts, while MpulOBP2 and MpulOBP3 were expressed predominately in the legs and abdomens, respectively. These results might provide a foundation for future studies of olfactory genes and chemoreception in M. pulchricornis. Copyright © 2017 Elsevier Inc. All rights reserved.
Integrating genome-wide association study and expression quantitative trait loci data identifies multiple genes and gene set associated with neuroticism.

Science.gov (United States)

Fan, Qianrui; Wang, Wenyu; Hao, Jingcan; He, Awen; Wen, Yan; Guo, Xiong; Wu, Cuiyan; Ning, Yujie; Wang, Xi; Wang, Sen; Zhang, Feng

2017-08-01

Neuroticism is a fundamental personality trait with a significant genetic determinant. To identify novel susceptibility genes for neuroticism, we conducted an integrative analysis of genomic and transcriptomic data from a genome-wide association study (GWAS) and an expression quantitative trait locus (eQTL) study. GWAS summary data were derived from published studies of neuroticism, involving a total of 170,906 subjects. An eQTL dataset containing 927,753 eQTLs was obtained from an eQTL meta-analysis of 5311 samples. Integrative analysis of GWAS and eQTL data was conducted with summary data-based Mendelian randomization (SMR) analysis software. To identify neuroticism-associated gene sets, the SMR analysis results were further subjected to gene set enrichment analysis (GSEA). The gene set annotation dataset (containing 13,311 annotated gene sets) of the GSEA Molecular Signatures Database was used. SMR single gene analysis identified 6 significant genes for neuroticism, including MSRA (p value = 2.27 × 10^-10), MGC57346 (p value = 6.92 × 10^-7), BLK (p value = 1.01 × 10^-6), XKR6 (p value = 1.11 × 10^-6), C17ORF69 (p value = 1.12 × 10^-6) and KIAA1267 (p value = 4.00 × 10^-6). Gene set enrichment analysis observed a significant association for the Chr8p23 gene set (false discovery rate = 0.033). Our results provide novel clues for the genetic mechanism studies of neuroticism. Copyright © 2017. Published by Elsevier Inc.
Discovering implicit entity relation with the gene-citation-gene network.

Directory of Open Access Journals (Sweden)

Min Song

In this paper, we apply the entitymetrics model to our constructed Gene-Citation-Gene (GCG) network. Based on the premise that there is a hidden, but plausible, relationship between an entity in one article and an entity in its citing article, we constructed a GCG network of gene pairs implicitly connected through citation. We compare the performance of this GCG network to a gene-gene (GG) network constructed over the same corpus but which uses gene pairs explicitly connected through traditional co-occurrence. Using 331,411 MEDLINE abstracts collected from 18,323 seed articles and their references, we identify 25 gene pairs. A comparison of these pairs with interactions found in BioGRID reveals that 96% of the gene pairs in the GCG network have known interactions. We measure network performance using degree, weighted degree, closeness, betweenness centrality and PageRank. Combining all measures, we find the GCG network has more gene pairs, but a lower matching rate than the GG network. However, combining top ranked genes in both networks produces a matching rate of 35.53%. By visualizing both the GG and GCG networks, we find that cancer is the most dominant disease associated with the genes in both networks. Overall, the study indicates that the GCG network can be useful for detecting gene interaction in an implicit manner.
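Two of the network measures named in the abstract above, degree and weighted degree, can be illustrated on a small co-occurrence (GG-style) network. The following is a minimal sketch only: the toy corpus, the function names and the choice to weight an edge by the number of shared documents are assumptions made for illustration, not details taken from the study.

```python
from collections import Counter, defaultdict
from itertools import combinations


def cooccurrence_edges(docs):
    """Count how often each unordered gene pair appears in the same document."""
    edges = Counter()
    for genes in docs:
        # sorted(set(...)) gives each pair a canonical order and ignores repeats
        for a, b in combinations(sorted(set(genes)), 2):
            edges[(a, b)] += 1
    return edges


def degree_measures(edges):
    """Return (degree, weighted degree) per gene for weighted undirected edges."""
    degree = defaultdict(int)
    weighted = defaultdict(int)
    for (a, b), weight in edges.items():
        for gene in (a, b):
            degree[gene] += 1         # number of distinct neighbours
            weighted[gene] += weight  # sum of edge weights
    return dict(degree), dict(weighted)


# Hypothetical toy corpus: each inner list holds gene symbols found in one abstract.
docs = [["TP53", "BRCA1"], ["TP53", "BRCA1", "PTEN"], ["PTEN", "TP53"]]
edges = cooccurrence_edges(docs)
degree, weighted = degree_measures(edges)
```

In this toy network every gene has two neighbours, so degree alone does not separate them, while weighted degree ranks TP53 highest because its pairs recur across documents.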
MethylMix 2.0: an R package for identifying DNA methylation genes.

Science.gov (United States)

Cedoz, Pierre-Louis; Prunello, Marcos; Brennan, Kevin; Gevaert, Olivier

2018-04-14

DNA methylation is an important mechanism regulating gene transcription, and its role in carcinogenesis has been extensively studied. Hyper- and hypomethylation of genes is a major mechanism of gene expression deregulation in a wide range of diseases. At the same time, high-throughput DNA methylation assays have been developed, generating vast amounts of genome-wide DNA methylation measurements. We developed MethylMix, an algorithm implemented in R to identify disease-specific hyper- and hypomethylated genes. Here we present a new version of MethylMix that automates the construction of DNA methylation and gene expression datasets from The Cancer Genome Atlas (TCGA). More precisely, MethylMix 2.0 incorporates two major updates: the automated downloading of DNA methylation and gene expression datasets from TCGA and the automated preprocessing of such datasets: value imputation, batch correction and CpG site clustering within each gene. The resulting datasets can subsequently be analyzed with MethylMix to identify transcriptionally predictive methylation states. We show that the Differential Methylation Values created by MethylMix can be used for cancer subtyping. olivier.gevaert@stanford.edu. https://bioconductor.org/packages/release/bioc/manuals/MethylMix/man/MethylMix.pdf. MethylMix 2.0 was implemented as an R package and is available in Bioconductor.
Overexpression screens identify conserved dosage chromosome instability genes in yeast and human cancer

Science.gov (United States)

Duffy, Supipi; Fam, Hok Khim; Wang, Yi Kan; Styles, Erin B.; Kim, Jung-Hyun; Ang, J. Sidney; Singh, Tejomayee; Larionov, Vladimir; Shah, Sohrab P.; Andrews, Brenda; Boerkoel, Cornelius F.; Hieter, Philip

2016-01-01

Somatic copy number amplification and gene overexpression are common features of many cancers. To determine the role of gene overexpression on chromosome instability (CIN), we performed genome-wide screens in the budding yeast for yeast genes that cause CIN when overexpressed, a phenotype we refer to as dosage CIN (dCIN), and identified 245 dCIN genes. This catalog of genes reveals human orthologs known to be recurrently overexpressed and/or amplified in tumors. We show that two genes, TDP1, a tyrosyl-DNA-phosphodiesterase, and TAF12, an RNA polymerase II TATA-box binding factor, cause CIN when overexpressed in human cells. Rhabdomyosarcoma lines with elevated human Tdp1 levels also exhibit CIN that can be partially rescued by siRNA-mediated knockdown of TDP1. Overexpression of dCIN genes represents a genetic vulnerability that could be leveraged for selective killing of cancer cells through targeting of an unlinked synthetic dosage lethal (SDL) partner. Using SDL screens in yeast, we identified a set of genes that when deleted specifically kill cells with high levels of Tdp1. One gene was the histone deacetylase RPD3, for which there are known inhibitors. Both HT1080 cells overexpressing hTDP1 and rhabdomyosarcoma cells with elevated levels of hTdp1 were more sensitive to histone deacetylase inhibitors valproic acid (VPA) and trichostatin A (TSA), recapitulating the SDL interaction in human cells and suggesting VPA and TSA as potential therapeutic agents for tumors with elevated levels of hTdp1. The catalog of dCIN genes presented here provides a candidate list to identify genes that cause CIN when overexpressed in cancer, which can then be leveraged through SDL to selectively target tumors. PMID:27551064
A systems approach identifies networks and genes linking sleep and stress: implications for neuropsychiatric disorders.

Science.gov (United States)

Jiang, Peng; Scarpa, Joseph R; Fitzpatrick, Karrie; Losic, Bojan; Gao, Vance D; Hao, Ke; Summa, Keith C; Yang, He S; Zhang, Bin; Allada, Ravi; Vitaterna, Martha H; Turek, Fred W; Kasarskis, Andrew

2015-05-05

Sleep dysfunction and stress susceptibility are comorbid complex traits that often precede and predispose patients to a variety of neuropsychiatric diseases. Here, we demonstrate multilevel organizations of genetic landscape, candidate genes, and molecular networks associated with 328 stress and sleep traits in a chronically stressed population of 338 (C57BL/6J × A/J) F2 mice. We constructed striatal gene co-expression networks, revealing functionally and cell-type-specific gene co-regulations important for stress and sleep. Using a composite ranking system, we identified network modules most relevant for 15 independent phenotypic categories, highlighting a mitochondria/synaptic module that links sleep and stress. The key network regulators of this module are overrepresented with genes implicated in neuropsychiatric diseases. Our work suggests that the interplay among sleep, stress, and neuropathology emerges from genetic influences on gene expression and their collective organization through complex molecular networks, providing a framework for interrogating the mechanisms underlying sleep, stress susceptibility, and related neuropsychiatric disorders. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Integrating mean and variance heterogeneities to identify differentially expressed genes.

Science.gov (United States)

Ouyang, Weiwei; An, Qiang; Zhao, Jinying; Qin, Huaizhen

2016-12-06

In functional genomics studies, tests on mean heterogeneity have been widely employed to identify differentially expressed genes with distinct mean expression levels under different experimental conditions. Variance heterogeneity (i.e., the difference between condition-specific variances) of gene expression levels is simply neglected or calibrated for as an impediment. The mean heterogeneity in the expression level of a gene reflects one aspect of its distribution alteration, and variance heterogeneity induced by condition change may reflect another aspect. Change in condition may alter both the mean and some higher-order characteristics of the distributions of expression levels of susceptible genes. In this report, we put forth the concept of mean-variance differentially expressed (MVDE) genes, whose expression means and variances are sensitive to the change in experimental condition. We mathematically proved the null independence of existing mean heterogeneity tests and variance heterogeneity tests. Based on this independence, we proposed an integrative mean-variance test (IMVT) to combine gene-wise mean heterogeneity and variance heterogeneity induced by condition change. The IMVT outperformed its competitors under comprehensive simulations of normality and Laplace settings. For moderate samples, the IMVT controlled type I error rates well, as did the existing mean heterogeneity tests (i.e., the Welch t test (WT) and the moderated Welch t test (MWT)) and the procedure of separate tests on mean and variance heterogeneities (SMVT), but the likelihood ratio test (LRT) severely inflated type I error rates. In the presence of variance heterogeneity, the IMVT appeared noticeably more powerful than all the valid mean heterogeneity tests. Application to the gene profiles of peripheral circulating B raised solid evidence of informative variance heterogeneity. After adjusting for background data structure, the IMVT replicated previous discoveries and identified novel experiment
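The abstract above relies on the null independence of the mean test and the variance test to justify combining them, but it does not state the combination rule. As an illustration only, the sketch below uses Fisher's method, a standard way to merge independent p-values; it is a stand-in chosen for this example and should not be read as the IMVT itself. The function name and the input p-values are hypothetical.

```python
import math


def fisher_combine_two(p_mean, p_var):
    """Combine two independent p-values with Fisher's method (chi-square, 4 df)."""
    if not (0.0 < p_mean <= 1.0 and 0.0 < p_var <= 1.0):
        raise ValueError("p-values must lie in (0, 1]")
    product = p_mean * p_var
    # Under the null, X = -2 * ln(p_mean * p_var) follows chi-square with 4 df.
    # Its survival function exp(-x/2) * (1 + x/2) simplifies to the line below.
    return product * (1.0 - math.log(product))


# Hypothetical per-gene p-values from a mean test and a variance test.
combined = fisher_combine_two(0.05, 0.05)
```

With exactly two p-values the chi-square tail has a closed form, so no statistics library is needed; a gene with moderate evidence on both tests receives a smaller combined p-value than either test gives alone.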
Role of G-protein-coupled receptor-related genes in insecticide resistance of the mosquito, Culex quinquefasciatus.

Science.gov (United States)

Li, Ting; Liu, Lena; Zhang, Lee; Liu, Nannan

2014-09-29

G-protein-coupled receptors (GPCRs) regulate signal transduction pathways and play diverse and pivotal roles in the physiology of insects; however, the precise function of GPCRs in insecticide resistance remains unclear. Using quantitative RT-PCR and functional genomic methods, we, for the first time, explored the function of GPCRs and GPCR-related genes in insecticide resistance of the mosquito Culex quinquefasciatus. A comparison of the expression of 115 GPCR-related genes at a whole genome level between resistant and susceptible Culex mosquitoes identified one and three GPCR-related genes that were up-regulated in the highly resistant Culex mosquito strains HAmCq(G8) and MAmCq(G6), respectively. To characterize the function of these up-regulated GPCR-related genes in resistance, the genes were knocked down in HAmCq(G8) and MAmCq(G6) using the RNAi technique. Knockdown of these four GPCR-related genes not only decreased resistance of the mosquitoes to permethrin but also repressed the expression of four insecticide resistance-related P450 genes, suggesting that the role of GPCR-related genes in resistance involves the regulation of resistance P450 gene expression. These results help in understanding the molecular regulation of resistance development in Cx. quinquefasciatus.
Systems Genetics Analysis to Identify the Genetic Modulation of a Glaucoma-Associated Gene.

Science.gov (United States)

Chintalapudi, Sumana R; Jablonski, Monica M

2017-01-01

Loss of retinal ganglion cells (RGCs) is one of the hallmarks of retinal neurodegenerative diseases, glaucoma being one of the most common. Recently, γ-synuclein (SNCG) was shown to be highly expressed in the somas and axons of RGCs. In various mouse models of glaucoma, downregulation of Sncg gene expression correlates with RGC loss. To investigate the regulation of Sncg in RGCs, we used a systems genetics approach to identify a gene that modulates the expression of Sncg, followed by confirmatory studies in both healthy and diseased retinas. We found that chromosome 1 harbors an eQTL that modulates the expression of Sncg in the mouse retina and identified Pfdn2 as the candidate upstream modulator of Sncg expression. Downregulation of Pfdn2 in enriched RGCs causes a concomitant reduction in Sncg. In this chapter, we describe our strategy and methods for identifying and confirming a genetic modulation of a glaucoma-associated gene. A similar method can be applied to other genes expressed in other tissues.
Genome-wide methylation analysis identifies a core set of hypermethylated genes in CIMP-H colorectal cancer.

Science.gov (United States)

McInnes, Tyler; Zou, Donghui; Rao, Dasari S; Munro, Francesca M; Phillips, Vicky L; McCall, John L; Black, Michael A; Reeve, Anthony E; Guilford, Parry J

2017-03-28

Aberrant DNA methylation profiles are a characteristic of all known cancer types, epitomized by the CpG island methylator phenotype (CIMP) in colorectal cancer (CRC). Hypermethylation has been observed at CpG islands throughout the genome, but it is unclear which factors determine whether an individual island becomes methylated in cancer. DNA methylation in CRC was analysed using the Illumina HumanMethylation450K array. Differentially methylated loci were identified using Significance Analysis of Microarrays (SAM) and the Wilcoxon Signed Rank (WSR) test. Unsupervised hierarchical clustering was used to identify methylation subtypes in CRC. In this study we characterized the DNA methylation profiles of 94 CRC tissues and their matched normal counterparts. Consistent with previous studies, unsupervised hierarchical clustering of genome-wide methylation data identified three subtypes within the tumour samples, designated CIMP-H, CIMP-L and CIMP-N, that showed high, low and very low methylation levels, respectively. Differential methylation between normal and tumour samples was analysed at the individual CpG level, and at the gene level. The distribution of hypermethylation in CIMP-N tumours showed high inter-tumour variability and appeared to be highly stochastic in nature, whereas CIMP-H tumours exhibited consistent hypermethylation at a subset of genes, in addition to a highly variable background of hypermethylated genes. EYA4, TFPI2 and TLX1 were hypermethylated in more than 90% of all tumours examined. One hundred thirty-two genes were hypermethylated in 100% of CIMP-H tumours studied and these were highly enriched for functions relating to skeletal system development (Bonferroni adjusted p value = 2.88E-15), segment specification (adjusted p value = 9.62E-11), embryonic development (adjusted p value = 1.52E-04), mesoderm development (adjusted p value = 1.14E-20), and ectoderm development (adjusted p value = 7.94E-16). Our genome-wide characterization of DNA
Finding Combination of Features from Promoter Regions for Ovarian Cancer-related Gene Group Classification

KAUST Repository

Olayan, Rawan S.

2012-01-01

In classification problems, it is always important to choose a suitable combination of features for the classifiers to use. Generating the right combination of features usually results in good classifiers. When the problem is not well understood, data items are usually described by many features in the hope that some of these may be the relevant or most relevant ones. In this study, we focus on one such problem related to genes implicated in ovarian cancer (OC). We try to recognize two important OC-related gene groups: oncogenes, which support the development and progression of OC, and oncosuppressors, which oppose such tendencies. For this, we use the properties of the promoters of these genes. We identified potential “regulatory features” that characterize the promoters of OC-related oncogenes and oncosuppressors. In our study, we used 211 oncogenes and 39 oncosuppressors. For these, we identified 538 characteristic sequence motifs from their promoters. Promoters are annotated by these motifs, and the derived feature vectors are used to develop classification models. We compared a number of classification models in their ability to distinguish oncogenes from oncosuppressors. Based on 10-fold cross-validation, the resultant model was able to separate the two classes with sensitivity of 96% and specificity of 100% with the complete set of features. Moreover, we developed another recognition model where we attempted to distinguish oncogenes and oncosuppressors as one group from other OC-related genes. That model achieved accuracy of 82%. We believe that the results of this study will help in discovering other OC-related oncogenes and oncosuppressors not identified as yet.
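The sensitivity and specificity figures quoted above follow from the standard confusion-matrix definitions. The sketch below shows the arithmetic only; the counts are hypothetical values picked to reproduce 96% and 100%, not the study's confusion matrix, and the abstract does not say which class was treated as positive.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from confusion-matrix counts."""
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("each class needs at least one example")
    sensitivity = tp / (tp + fn)  # fraction of true positives recovered
    specificity = tn / (tn + fp)  # fraction of true negatives recovered
    return sensitivity, specificity


# Hypothetical counts for illustration; NOT taken from the study.
sens, spec = sensitivity_specificity(tp=48, fn=2, tn=30, fp=0)
```

Because the two classes in the study are strongly imbalanced (211 versus 39 genes), reporting both measures is more informative than accuracy alone.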
A stochastic model for identifying differential gene pair co-expression patterns in prostate cancer progression

Directory of Open Access Journals (Sweden)

Mao Yu

2009-07-01

Background: The identification of differential gene co-expression patterns between cancer stages is a newly developing method to reveal the underlying molecular mechanisms of carcinogenesis. Most research on this subject lacks an algorithm for assessing statistical significance in relation to cancer progression, which makes it difficult to identify precise gene pairs correlated with cancer progression. Results: In this investigation we studied gene pair co-expression change by using a stochastic process model to approximate the underlying dynamic procedure of the co-expression change during cancer progression. We also present a novel analytical method named 'Stochastic process model for Identifying differentially co-expressed Gene pair' (SIG) method. This method was applied to two well-known prostate cancer data sets: hormone sensitive versus hormone resistant, and healthy versus cancerous. From these data sets, 428,582 gene pairs and 303,992 gene pairs were identified, respectively. Afterwards, we applied to the same data sets two current statistical methods that were developed to identify gene pair differential co-expression but do not consider cancer progression in their algorithms. We then compared these results from three different perspectives: progression analysis, gene pair identification effectiveness analysis, and pathway enrichment analysis. Statistical measures were used to quantify quality and performance from these perspectives. They included Re-identification Scale (RS) and Progression Score (PS) in progression analysis, True Positive Rate (TPR) in gene pair analysis, and Pathway Enrichment Score (PES) in pathway analysis. Our results show small values of RS and large values of PS, TPR, and PES, suggesting that gene pairs identified by the SIG method are highly correlated with cancer progression and highly enriched in disease-specific pathways. From
Genome-wide analysis of pain-, nerve- and neurotrophin-related gene expression in the degenerating human annulus

Science.gov (United States)

2012-01-01

Background: In spite of its high clinical relevance, the relationship between disc degeneration and low back pain is still not well understood. Recent studies have shown that genome-wide gene expression studies utilizing ontology searches provide an efficient and valuable methodology for identification of clinically relevant genes. Here we use this approach in analysis of pain-, nerve-, and neurotrophin-related gene expression patterns in specimens of human disc tissue. Control, non-herniated clinical, and herniated clinical specimens of human annulus tissue were studied following Institutional Review Board approval. Results: Analyses were performed on more degenerated (Thompson grade IV and V) discs vs. less degenerated discs (grades I-III), on surgically operated discs vs. control discs, and on herniated vs. control discs. Analyses of more degenerated vs. less degenerated discs identified significant upregulation of well-recognized pain-related genes (bradykinin receptor B1, calcitonin gene-related peptide and catechol-O-methyltransferase). Nerve growth factor was significantly upregulated in surgical vs. control and in herniated vs. control discs. All three analyses also found significant changes in numerous proinflammatory cytokine- and chemokine-related genes. Nerve, neurotrophin and pain-ontology searches identified many matrix, signaling and functional genes which have known importance in the disc. Immunohistochemistry was utilized to confirm the presence of calcitonin gene-related peptide, catechol-O-methyltransferase and bradykinin receptor B1 at the protein level in the human annulus. Conclusions: Findings point to the utility of microarray analyses in identification of pain-, neurotrophin- and nerve-related genes in the disc, and point to the importance of future work exploring functional interactions between nerve and disc cells in vitro and in vivo. Nerve, pain and neurotrophin ontology searches identified numerous changes in proinflammatory cytokines and
Next-generation sequencing to identify candidate genes and develop diagnostic markers for a novel Phytophthora resistance gene, RpsHC18, in soybean.

Science.gov (United States)

Zhong, Chao; Sun, Suli; Li, Yinping; Duan, Canxing; Zhu, Zhendong

2018-03-01

A novel Phytophthora sojae resistance gene RpsHC18 was identified and finely mapped on soybean chromosome 3. Two NBS-LRR candidate genes were identified and two diagnostic markers of RpsHC18 were developed. Phytophthora root rot caused by Phytophthora sojae is a destructive disease of soybean. The most effective disease-control strategy is to deploy resistant cultivars carrying Phytophthora-resistant Rps genes. The soybean cultivar Huachun 18 has a broad and distinct resistance spectrum to 12 P. sojae isolates. Quantitative trait loci sequencing (QTL-seq), based on the whole-genome resequencing (WGRS) of two extreme resistant and susceptible phenotype bulks from an F2:3 population, was performed, and one 767-kb genomic region with ΔSNP-index ≥ 0.9 on chromosome 3 was identified as the RpsHC18 candidate region in Huachun 18. The candidate region was reduced to a 146-kb region by fine mapping. Nonsynonymous SNP and haplotype analyses were carried out in the 146-kb region among ten soybean genotypes using WGRS. Four specific nonsynonymous SNPs were identified in two nucleotide-binding site-leucine-rich repeat (NBS-LRR) genes, RpsHC18-NBL1 and RpsHC18-NBL2, which were considered to be the candidate genes. Finally, one specific SNP marker in each candidate gene was successfully developed using a tetra-primer ARMS-PCR assay, and the two markers were verified to be specific for RpsHC18 and to effectively distinguish other known Rps genes. In this study, we applied an integrated genomic-based strategy combining WGRS with traditional genetic mapping to identify RpsHC18 candidate genes and develop diagnostic markers. These results suggest that next-generation sequencing is a precise, rapid and cost-effective way to identify candidate genes and develop diagnostic markers, and it can accelerate Rps gene cloning and marker-assisted selection for breeding of P. sojae-resistant soybean cultivars.
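The ΔSNP-index criterion mentioned above can be made concrete with a small calculation. In QTL-seq, the SNP-index of a site in one bulk is usually the fraction of reads carrying the non-reference allele, and the ΔSNP-index is the difference between the two bulks. The sketch below assumes that definition; the subtraction order (resistant minus susceptible), the read counts, the positions and the function names are hypothetical and not taken from the study.

```python
def snp_index(alt_reads, total_reads):
    """Fraction of reads at a site that carry the non-reference allele."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return alt_reads / total_reads


def delta_snp_index(resistant, susceptible):
    """Difference between bulk SNP-indices; each argument is (alt_reads, total_reads)."""
    return snp_index(*resistant) - snp_index(*susceptible)


def candidate_sites(sites, threshold=0.9):
    """Return positions whose delta SNP-index meets the threshold."""
    return [pos for pos, res, sus in sites if delta_snp_index(res, sus) >= threshold]


# Hypothetical read counts: (position, resistant bulk, susceptible bulk).
sites = [
    (1000, (20, 20), (1, 20)),   # 1.00 - 0.05 = 0.95, kept
    (2000, (10, 20), (9, 20)),   # 0.50 - 0.45 = 0.05, dropped
    (3000, (19, 20), (0, 20)),   # 0.95 - 0.00 = 0.95, kept
]
selected = candidate_sites(sites)
```

A real analysis would typically average the index over sliding windows and filter on read depth before applying a cutoff; those steps are omitted here.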
A 6-gene signature identifies four molecular subgroups of neuroblastoma

LENUS (Irish Health Repository)

Abel, Frida

2011-04-14

Background: There are currently three postulated genomic subtypes of the childhood tumour neuroblastoma (NB): Type 1, Type 2A, and Type 2B. The most aggressive forms of NB are characterized by amplification of the oncogene MYCN (MNA) and low expression of the favourable marker NTRK1. Recently, mutations or high expression of the familial predisposition gene Anaplastic Lymphoma Kinase (ALK) were associated with unfavourable biology of sporadic NB. Also, various other genes have been linked to NB pathogenesis. Results: The present study explores subgroup discrimination by gene expression profiling using three published microarray studies on NB (47 samples). Four distinct clusters were identified by Principal Components Analysis (PCA) in two separate data sets, which could be verified by an unsupervised hierarchical clustering in a third independent data set (101 NB samples) using a set of 74 discriminative genes. The expression signature of six NB-associated genes, ALK, BIRC5, CCND1, MYCN, NTRK1, and PHOX2B, significantly discriminated the four clusters (p < 0.05, one-way ANOVA test). PCA clusters p1, p2, and p3 were found to correspond well to the postulated subtypes 1, 2A, and 2B, respectively. Remarkably, a fourth novel cluster was detected in all three independent data sets. This cluster comprised mainly 11q-deleted MNA-negative tumours with low expression of ALK, BIRC5, and PHOX2B, and was significantly associated with higher tumour stage, poor outcome and poor survival compared to the Type 1-corresponding favourable group (INSS stage 4 and/or dead of disease, p < 0.05, Fisher's exact test). Conclusions: Based on expression profiling we have identified four molecular subgroups of neuroblastoma, which can be distinguished by a 6-gene signature. The fourth subgroup has not been described elsewhere, and efforts are currently being made to further investigate this group's specific characteristics.
Gene expression meta-analysis identifies metastatic pathways and transcription factors in breast cancer

International Nuclear Information System (INIS)

Thomassen, Mads; Tan, Qihua; Kruse, Torben A

2008-01-01

Metastasis is believed to progress in several steps involving different pathways, but the determination and understanding of these mechanisms is still fragmentary. Microarray analysis of gene expression patterns in breast tumors has been used to predict outcome in recent studies. Besides classification of outcome, these global expression patterns may reflect biological mechanisms involved in metastasis of breast cancer. Our purpose has been to investigate pathways and transcription factors involved in metastasis by use of gene expression data sets. We have analyzed 8 publicly available gene expression data sets. A global approach, 'gene set enrichment analysis', as well as an approach focusing on a subset of significantly differently regulated genes, GenMAPP, has been applied to rank pathway gene sets according to differential regulation in metastasizing tumors compared to non-metastasizing tumors. Meta-analysis has been used to determine overrepresentation of pathways and transcription factor targets concordantly deregulated in metastasizing breast tumors in several data sets. The major findings are up-regulation of cell cycle pathways and a metabolic shift towards glucose metabolism reflected in several pathways in metastasizing tumors. Growth factor pathways seem to play dual roles; EGF and PDGF pathways are decreased, while VEGF and sex-hormone pathways are increased in tumors that metastasize. Furthermore, migration, proteasome, immune system, angiogenesis, DNA repair and several signal transduction pathways are associated with metastasis. Finally, several transcription factors, e.g., E2F, NFY, and YY1, are identified as being involved in metastasis. By pathway meta-analysis many biological mechanisms beyond major characteristics such as proliferation are identified. Transcription factor analysis identifies a number of key factors that support central pathways. Several previously proposed treatment targets are identified and several new pathways that may
Systems approach identifies an organic nitrogen-responsive gene network that is regulated by the master clock control gene CCA1.

Science.gov (United States)

Gutiérrez, Rodrigo A; Stokes, Trevor L; Thum, Karen; Xu, Xiaodong; Obertello, Mariana; Katari, Manpreet S; Tanurdzic, Milos; Dean, Alexis; Nero, Damion C; McClung, C Robertson; Coruzzi, Gloria M

2008-03-25

Understanding how nutrients affect gene expression will help us to understand the mechanisms controlling plant growth and development as a function of nutrient availability. Nitrate has been shown to serve as a signal for the control of gene expression in Arabidopsis. There is also evidence, on a gene-by-gene basis, that downstream products of nitrogen (N) assimilation such as glutamate (Glu) or glutamine (Gln) might serve as signals of organic N status that in turn regulate gene expression. To identify genome-wide responses to such organic N signals, Arabidopsis seedlings were transiently treated with ammonium nitrate in the presence or absence of MSX, an inhibitor of glutamine synthetase, resulting in a block of Glu/Gln synthesis. Genes that responded to organic N were identified as those whose response to ammonium nitrate treatment was blocked in the presence of MSX. We showed that some genes previously identified to be regulated by nitrate are under the control of an organic N-metabolite. Using an integrated network model of molecular interactions, we uncovered a subnetwork regulated by organic N that included CCA1 and target genes involved in N-assimilation. We validated some of the predicted interactions and showed that regulation of the master clock control gene CCA1 by Glu or a Glu-derived metabolite in turn regulates the expression of key N-assimilatory genes. Phase response curve analysis shows that distinct N-metabolites can advance or delay the CCA1 phase. Regulation of CCA1 by organic N signals may represent a novel input mechanism for N-nutrients to affect plant circadian clock function.
Bioinformatics analysis identify novel OB fold protein coding genes in C. elegans.

Directory of Open Access Journals (Sweden)

Daryanaz Dargahi

BACKGROUND: The C. elegans genome has been extensively annotated by the WormBase consortium, which uses state-of-the-art bioinformatics pipelines, functional genomics and manual curation approaches. As a result, the identification of novel genes in silico in this model organism is becoming more challenging, requiring new approaches. The oligonucleotide-oligosaccharide binding (OB) fold is a highly divergent protein family, in which protein sequences, in spite of having the same fold, share very little sequence identity (5-25%). Therefore, evidence from sequence-based annotation may not be sufficient to identify all the members of this family. In C. elegans, the number of OB-fold proteins reported is remarkably low (n=46) compared to other evolutionarily related eukaryotes, such as yeast S. cerevisiae (n=344) or fruit fly D. melanogaster (n=84). Gene loss during evolution, or differences in the level of annotation for this protein family, may explain these discrepancies. METHODOLOGY/PRINCIPAL FINDINGS: This study examines the possibility that novel OB-fold coding genes exist in the worm. We developed a bioinformatics approach that uses the most sensitive sequence-sequence, sequence-profile and profile-profile similarity search methods followed by 3D-structure prediction as a filtering step to eliminate false positive candidate sequences. We have predicted 18 coding genes containing the OB-fold that, remarkably, have been partially characterized in C. elegans. CONCLUSIONS/SIGNIFICANCE: This study raises the possibility that the annotation of highly divergent protein fold families can be improved in C. elegans. Similar strategies could be implemented for large-scale analysis by the WormBase consortium when novel versions of the genome sequence of C. elegans, or other evolutionarily related species, are released. This approach is of general interest to the scientific community since it can be used to annotate any genome.

Identification of genes related to Paulownia witches' broom by AFLP and MSAP.

Science.gov (United States)

Cao, Xibing; Fan, Guoqiang; Deng, Minjie; Zhao, Zhenli; Dong, Yanpeng

2014-08-21

DNA methylation is believed to play important roles in regulating gene expression in plant growth and development. Paulownia witches' broom (PaWB) infection has been reported to be related to gene expression changes in paulownia plantlets. To determine whether DNA methylation is associated with gene expression changes in response to phytoplasma, we investigated variations in genomic DNA sequence and methylation in PaWB plantlets treated with methyl methane sulfonate (MMS) using amplified fragment length polymorphism (AFLP) and methylation-sensitive amplification polymorphism (MSAP) techniques, respectively. The results indicated that PaWB seedlings recovered a normal morphology after treatment with more than 15 mg·L⁻¹ MMS. PaWB infection did not cause changes of the paulownia DNA sequence at the AFLP level; however, DNA methylation levels and patterns were altered. Quantitative real-time PCR (qRT-PCR) showed that three of the methylated genes were up-regulated and three were down-regulated in the MMS-treated PaWB plantlets that had regained healthy morphology. These six genes might be involved in transcriptional regulation, plant defense, signal transduction and energy. The possible roles of these genes in PaWB are discussed. The results showed that changes of DNA methylation altered gene expression levels, and that MSAP might help identify genes related to PaWB.
Identification of Genes Related to Paulownia Witches’ Broom by AFLP and MSAP

Science.gov (United States)

Cao, Xibing; Fan, Guoqiang; Deng, Minjie; Zhao, Zhenli; Dong, Yanpeng

2014-01-01

DNA methylation is believed to play important roles in regulating gene expression in plant growth and development. Paulownia witches’ broom (PaWB) infection has been reported to be related to gene expression changes in paulownia plantlets. To determine whether DNA methylation is associated with gene expression changes in response to phytoplasma, we investigated variations in genomic DNA sequence and methylation in PaWB plantlets treated with methyl methane sulfonate (MMS) using amplified fragment length polymorphism (AFLP) and methylation-sensitive amplification polymorphism (MSAP) techniques, respectively. The results indicated that PaWB seedlings recovered a normal morphology after treatment with more than 15 mg·L⁻¹ MMS. PaWB infection did not cause changes of the paulownia DNA sequence at the AFLP level; however, DNA methylation levels and patterns were altered. Quantitative real-time PCR (qRT-PCR) showed that three of the methylated genes were up-regulated and three were down-regulated in the MMS-treated PaWB plantlets that had regained healthy morphology. These six genes might be involved in transcriptional regulation, plant defense, signal transduction and energy. The possible roles of these genes in PaWB are discussed. The results showed that changes of DNA methylation altered gene expression levels, and that MSAP might help identify genes related to PaWB. PMID:25196603
A cross-species genetic analysis identifies candidate genes for mouse anxiety and human bipolar disorder

Directory of Open Access Journals (Sweden)

David G Ashbrook

2015-07-01

Bipolar disorder (BD) is a significant neuropsychiatric disorder with a lifetime prevalence of ~1%. To identify genetic variants underlying BD, genome-wide association studies (GWAS) have been carried out. While many variants of small effect associated with BD have been identified, few have yet been confirmed, partly because of the low power of GWAS due to multiple comparisons being made. Complementary mapping studies using murine models have identified genetic variants for behavioral traits linked to BD, often with high power, but these identified regions often contain too many genes for clear identification of candidate genes. In the current study we have aligned human BD GWAS results and mouse linkage studies to help define and evaluate candidate genes linked to BD, seeking to combine the power of the mouse mapping with the precision of GWAS. We use quantitative trait mapping for open field test and elevated zero maze data in the largest mammalian model system, the BXD recombinant inbred mouse population, to identify genomic regions associated with these BD-like phenotypes. We then investigate these regions in whole genome data from the Psychiatric Genomics Consortium’s bipolar disorder GWAS to identify candidate genes associated with BD. Finally, we establish the biological relevance and pathways of these genes in a comprehensive systems genetics analysis. We identify four genes associated with both mouse anxiety and human BD. While TNR is a novel candidate for BD, we can confirm previously suggested associations with CMYA5, MCTP1 and RXRG. A cross-species, systems genetics analysis shows that MCTP1, RXRG and TNR coexpress with genes linked to psychiatric disorders and identifies the striatum as a potential site of action. CMYA5, MCTP1, RXRG and TNR are associated with mouse anxiety and human BD. We hypothesize that MCTP1, RXRG and TNR influence intercellular signaling in the striatum.
A systems genetics approach identifies genes and pathways for type 2 diabetes in human islets

DEFF Research Database (Denmark)

Taneera, Jalal; Lang, Stefan; Sharma, Amitabh

2012-01-01

Close to 50 genetic loci have been associated with type 2 diabetes (T2D), but they explain only 15% of the heritability. In an attempt to identify additional T2D genes, we analyzed global gene expression in human islets from 63 donors. Using 48 genes located near T2D risk variants, we identified ...
Analysis of SOX10 mutations identified in Waardenburg-Hirschsprung patients: Differential effects on target gene regulation.

Science.gov (United States)

Chan, Kwok Keung; Wong, Corinne Kung Yen; Lui, Vincent Chi Hang; Tam, Paul Kwong Hang; Sham, Mai Har

2003-10-15

SOX10 is a member of the SOX gene family related by homology to the high-mobility group (HMG) box region of the testis-determining gene SRY. Mutations of the transcription factor gene SOX10 lead to Waardenburg-Hirschsprung syndrome (Waardenburg-Shah syndrome, WS4) in humans. A number of SOX10 mutations have been identified in WS4 patients who suffer from different extents of intestinal aganglionosis, pigmentation, and hearing abnormalities. Some patients also exhibit signs of myelination deficiency in the central and peripheral nervous systems. Although the molecular bases for the wide range of symptoms displayed by the patients are still not clearly understood, a few target genes for SOX10 have been identified. We have analyzed the impact of six different SOX10 mutations on the activation of SOX10 target genes by yeast one-hybrid and mammalian cell transfection assays. To investigate the transactivation activities of the mutant proteins, three different SOX target binding sites were introduced into luciferase reporter gene constructs and examined in our series of transfection assays: consensus HMG domain protein binding sites; SOX10 binding sites identified in the RET promoter; and Sox10 binding sites identified in the P0 promoter. We found that the same mutation could have different transactivation activities when tested with different target binding sites and in different cell lines. The differential transactivation activities of the SOX10 mutants appeared to correlate with the intestinal and/or neurological symptoms presented in the patients. Among the six mutant SOX10 proteins tested, much reduced transactivation activities were observed when tested on the SOX10 binding sites from the RET promoter. Of the two similar mutations X467K and 1400del12, only the 1400del12 mutant protein exhibited an increase of transactivation through the P0 promoter. While the lack of normal SOX10 mediated activation of RET transcription may lead to intestinal aganglionosis
Evolutionary Inference across Eukaryotes Identifies Specific Pressures Favoring Mitochondrial Gene Retention.

Science.gov (United States)

Johnston, Iain G; Williams, Ben P

2016-02-24

Since their endosymbiotic origin, mitochondria have lost most of their genes. Although many selective mechanisms underlying the evolution of mitochondrial genomes have been proposed, a data-driven exploration of these hypotheses is lacking, and a quantitatively supported consensus remains absent. We developed HyperTraPS, a methodology coupling stochastic modeling with Bayesian inference, to identify the ordering of evolutionary events and suggest their causes. Using 2015 complete mitochondrial genomes, we inferred evolutionary trajectories of mtDNA gene loss across the eukaryotic tree of life. We find that proteins comprising the structural cores of the electron transport chain are preferentially encoded within mitochondrial genomes across eukaryotes. A combination of high GC content and high protein hydrophobicity is required to explain patterns of mtDNA gene retention; a model that accounts for these selective pressures can also predict the success of artificial gene transfer experiments in vivo. This work provides a general method for data-driven inference of the ordering of evolutionary and progressive events, here identifying the distinct features shaping mitochondrial genomes of present-day species. Copyright © 2016 Elsevier Inc. All rights reserved.
'Omics' approaches in tomato aimed at identifying candidate genes ...

African Journals Online (AJOL)

adriana

2013-12-04

... approaches could be combined in order to identify candidate genes for the genetic control of ascorbic ... applied to other traits under the complex control of many ... Engineering increased vitamin C levels in ...
Gene expression differences between Noccaea caerulescens ecotypes help to identify candidate genes for metal phytoremediation.

Science.gov (United States)

Halimaa, Pauliina; Lin, Ya-Fen; Ahonen, Viivi H; Blande, Daniel; Clemens, Stephan; Gyenesei, Attila; Häikiö, Elina; Kärenlampi, Sirpa O; Laiho, Asta; Aarts, Mark G M; Pursiheimo, Juha-Pekka; Schat, Henk; Schmidt, Holger; Tuomainen, Marjo H; Tervahauta, Arja I

2014-03-18

Populations of Noccaea caerulescens show tremendous differences in their capacity to hyperaccumulate and hypertolerate metals. To explore the differences that could contribute to these traits, we undertook SOLiD high-throughput sequencing of the root transcriptomes of three phenotypically well-characterized N. caerulescens accessions, i.e., Ganges, La Calamine, and Monte Prinzera. Genes with possible contribution to zinc, cadmium, and nickel hyperaccumulation and hypertolerance were predicted. The most significant differences between the accessions were related to metal ion (di-, trivalent inorganic cation) transmembrane transporter activity, iron and calcium ion binding, (inorganic) anion transmembrane transporter activity, and antioxidant activity. Analysis of correlation between the expression profile of each gene and the metal-related characteristics of the accessions disclosed both previously characterized (HMA4, HMA3) and new candidate genes (e.g., for nickel IRT1, ZIP10, and PDF2.3) as possible contributors to the hyperaccumulation/tolerance phenotype. A number of unknown Noccaea-specific transcripts also showed correlation with Zn²⁺, Cd²⁺, or Ni²⁺ hyperaccumulation/tolerance. This study shows that N. caerulescens populations have evolved great diversity in the expression of metal-related genes, facilitating adaptation to various metalliferous soils. The information will be helpful in the development of improved plants for metal phytoremediation.
Integrative multi-platform meta-analysis of gene expression profiles in pancreatic ductal adenocarcinoma patients for identifying novel diagnostic biomarkers.

Science.gov (United States)

Irigoyen, Antonio; Jimenez-Luna, Cristina; Benavides, Manuel; Caba, Octavio; Gallego, Javier; Ortuño, Francisco Manuel; Guillen-Ponce, Carmen; Rojas, Ignacio; Aranda, Enrique; Torres, Carolina; Prados, Jose

2018-01-01

Applying differentially expressed genes (DEGs) to identify feasible biomarkers in diseases can be a hard task when working with heterogeneous datasets. Expression data are strongly influenced by technology, sample preparation processes, and/or labeling methods. The proliferation of different microarray platforms for measuring gene expression increases the need to develop models able to compare their results, especially when different technologies can lead to signal values that vary greatly. Integrative meta-analysis can significantly improve the reliability and robustness of DEG detection. The objective of this work was to develop an integrative approach for identifying potential cancer biomarkers by integrating gene expression data from two different platforms. Pancreatic ductal adenocarcinoma (PDAC), where there is an urgent need to find new biomarkers due to its late diagnosis, is an ideal candidate for testing this technology. Expression data from two different datasets, namely Affymetrix and Illumina (18 and 36 PDAC patients, respectively), as well as from 18 healthy controls, were used for this study. A meta-analysis based on an empirical Bayesian methodology (ComBat) was then proposed to integrate these datasets. DEGs were finally identified from the integrated data by using the statistical programming language R. After our integrative meta-analysis, 5 genes were commonly identified within the individual analyses of the independent datasets. In addition, 28 novel genes that were not reported by the individual analyses ('gained' genes) were discovered. Several of these gained genes have already been related to other gastroenterological tumors. The proposed integrative meta-analysis has revealed novel DEGs that may play an important role in PDAC and could be potential biomarkers for diagnosing the disease.
Coalitional game theory as a promising approach to identify candidate autism genes.

Science.gov (United States)

Gupta, Anika; Sun, Min Woo; Paskov, Kelley Marie; Stockham, Nate Tyler; Jung, Jae-Yoon; Wall, Dennis Paul

2018-01-01

Despite mounting evidence for the strong role of genetics in the phenotypic manifestation of Autism Spectrum Disorder (ASD), the specific genes responsible for the variable forms of ASD remain undefined. ASD may be best explained by a combinatorial genetic model with varying epistatic interactions across many small effect mutations. Coalitional or cooperative game theory is a technique that studies the combined effects of groups of players, known as coalitions, seeking to identify players who tend to improve the performance (the relationship to a specific disease phenotype) of any coalition they join. This method has been previously shown to boost biologically informative signal in gene expression data but to date has not been applied to the search for cooperative mutations among putative ASD genes. We describe our approach to highlight genes relevant to ASD using coalitional game theory on alteration data of 1,965 fully sequenced genomes from 756 multiplex families. Alterations were encoded into binary matrices for ASD (case) and unaffected (control) samples, indicating likely gene-disrupting, inherited mutations in altered genes. To determine individual gene contributions given an ASD phenotype, a "player" metric, referred to as the Shapley value, was calculated for each gene in the case and control cohorts. Sixty-seven genes were found to have significantly elevated player scores and likely represent significant contributors to the genetic coordination underlying ASD. Using network and cross-study analysis, we found that these genes are involved in biological pathways known to be affected in the autism cases and that a subset directly interact with several genes known to have strong associations to autism. These findings suggest that coalitional game theory can be applied to large-scale genomic data to identify hidden yet influential players in complex polygenic disorders such as autism.
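The Shapley value mentioned in the abstract above can be illustrated with a small sketch. This is not the authors' pipeline: the abstract does not state the characteristic function they used, so the function covered_cases, the gene names geneA/geneB/geneC and the four toy samples below are hypothetical and chosen only to show how a gene's average marginal contribution across all coalitions is computed.

```python
from itertools import combinations
from math import factorial


def shapley_values(players, value):
    """Exact Shapley value of each player for a coalition value function.

    `value` maps a frozenset of players to a number. The cost is
    exponential in the number of players, so this suits toy inputs only.
    """
    n = len(players)
    result = {}
    for p in players:
        others = [q for q in players if q != p]
        total = 0.0
        for k in range(n):
            # Weight of a coalition of size k that p joins: k!(n-k-1)!/n!
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                s = frozenset(subset)
                # Marginal contribution of p when joining coalition s
                total += weight * (value(s | {p}) - value(s))
        result[p] = total
    return result


# Hypothetical toy data: each case sample is the set of its altered genes.
CASES = [
    {"geneA", "geneB"},
    {"geneA"},
    {"geneB", "geneC"},
    {"geneA", "geneC"},
]


def covered_cases(coalition):
    """Assumed value function: number of case samples that carry an
    alteration in at least one gene of the coalition."""
    return sum(1 for sample in CASES if sample & coalition)


values = shapley_values(["geneA", "geneB", "geneC"], covered_cases)
for gene, score in values.items():
    print(gene, round(score, 3))
```

In this toy game the scores sum to the value of the full coalition (all four samples covered), which is the efficiency property of the Shapley value. Exact enumeration is only feasible for a handful of genes; genome-scale analyses would need a closed-form or sampled approximation.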
Transcriptional Profiling of Whole Blood Identifies a Unique 5-Gene Signature for Myelofibrosis and Imminent Myelofibrosis Transformation

DEFF Research Database (Denmark)

Hasselbalch, Hans Carl; Skov, Vibe; Stauffer Larsen, Thomas

2014-01-01

Identifying a distinct gene signature for myelofibrosis may yield novel information of the genes, which are responsible for progression of essential thrombocythemia and polycythemia vera towards myelofibrosis. We aimed at identifying a simple gene signature - composed of a few genes - which were...
The FUN of identifying gene function in bacterial pathogens; insights from Salmonella functional genomics.

Science.gov (United States)

Hammarlöf, Disa L; Canals, Rocío; Hinton, Jay C D

2013-10-01

The availability of thousands of genome sequences of bacterial pathogens poses a particular challenge because each genome contains hundreds of genes of unknown function (FUN). How can we easily discover which FUN genes encode important virulence factors? One solution is to combine two different functional genomic approaches. First, transcriptomics identifies bacterial FUN genes that show differential expression during the process of mammalian infection. Second, global mutagenesis identifies individual FUN genes that the pathogen requires to cause disease. The intersection of these datasets can reveal a small set of candidate genes most likely to encode novel virulence attributes. We demonstrate this approach with the Salmonella infection model, and propose that a similar strategy could be used for other bacterial pathogens. Copyright © 2013 Elsevier Ltd. All rights reserved.
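The intersection step described in the abstract above can be sketched as a set operation. The gene identifiers and both input sets below are hypothetical placeholders, not data from the study; in practice each set would come from a transcriptomic screen and a global mutagenesis screen, respectively.

```python
# Hypothetical FUN genes showing differential expression during infection
# (would come from a transcriptomics experiment).
differentially_expressed = {"fun1", "fun2", "fun3", "fun5"}

# Hypothetical FUN genes required to cause disease
# (would come from a global mutagenesis screen).
required_for_infection = {"fun2", "fun4", "fun5"}

# Candidate virulence genes: those supported by both lines of evidence.
candidates = sorted(differentially_expressed & required_for_infection)
print(candidates)
```

Requiring support from both screens trades sensitivity for specificity: genes seen in only one dataset are dropped, which is what keeps the candidate list small.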
Gene expression signature analysis identifies vorinostat as a candidate therapy for gastric cancer.

Directory of Open Access Journals (Sweden)

Sofie Claerhout

Gastric cancer continues to be one of the deadliest cancers in the world, and the identification of new drugs targeting this type of cancer is therefore of significant importance. The purpose of this study was to identify and validate a therapeutic agent which might improve the outcomes for gastric cancer patients in the future. Using microarray technology, we generated a gene expression profile of human gastric cancer-specific genes from human gastric cancer tissue samples. We used this profile in the Broad Institute's Connectivity Map analysis to identify candidate therapeutic compounds for gastric cancer. We found the histone deacetylase inhibitor vorinostat as the lead compound and thus a potential therapeutic drug for gastric cancer. Vorinostat induced both apoptosis and autophagy in gastric cancer cell lines. Pharmacological and genetic inhibition of autophagy, however, increased the therapeutic efficacy of vorinostat, indicating that a combination of vorinostat with autophagy inhibitors may be therapeutically more beneficial. Moreover, gene expression analysis of gastric cancer identified a collection of genes (ITGB5, TYMS, MYB, APOC1, CBX5, PLA2G2A, and KIF20A) whose expression was elevated in gastric tumor tissue and downregulated more than 2-fold by vorinostat treatment in gastric cancer cell lines. In contrast, SCGB2A1, TCN1, CFD, APLP1, and NQO1 manifested a reversed pattern. We showed that analysis of gene expression signatures may represent an emerging approach to discover therapeutic agents for gastric cancer, such as vorinostat. The observation of altered gene expression after vorinostat treatment may provide the clue to identify the molecular mechanism of vorinostat and those patients likely to benefit from vorinostat treatment.
Analyzing the genes related to Alzheimer's disease via a network and pathway-based approach.

Science.gov (United States)

Hu, Yan-Shi; Xin, Juncai; Hu, Ying; Zhang, Lei; Wang, Ju

2017-04-27

Our understanding of the molecular mechanisms underlying Alzheimer's disease (AD) remains incomplete. Previous studies have revealed that genetic factors provide a significant contribution to the pathogenesis and development of AD. In the past years, numerous genes implicated in this disease have been identified via genetic association studies on candidate genes or at the genome-wide level. However, in many cases, the roles of these genes and their interactions in AD are still unclear. A comprehensive and systematic analysis focusing on the biological function and interactions of these genes in the context of AD will therefore provide valuable insights to understand the molecular features of the disease. In this study, we collected genes potentially associated with AD by screening publications on genetic association studies deposited in PubMed. The major biological themes linked with these genes were then revealed by function and biochemical pathway enrichment analysis, and the relation between the pathways was explored by pathway crosstalk analysis. Furthermore, the network features of these AD-related genes were analyzed in the context of human interactome and an AD-specific network was inferred using the Steiner minimal tree algorithm. We compiled 430 human genes reported to be associated with AD from 823 publications. Biological theme analysis indicated that the biological processes and biochemical pathways related to neurodevelopment, metabolism, cell growth and/or survival, and immunology were enriched in these genes. Pathway crosstalk analysis then revealed that the significantly enriched pathways could be grouped into three interlinked modules (a neuronal and metabolic module, a cell growth/survival and neuroendocrine pathway module, and an immune response-related module), indicating an AD-specific immune-endocrine-neuronal regulatory network. Furthermore, an AD-specific protein network was inferred and novel genes potentially associated with AD were identified. By
Investigation of de novo unique differentially expressed genes related to evolution in exercise response during domestication in Thoroughbred race horses.

Directory of Open Access Journals (Sweden)

Woncheoul Park

Previous studies of horse RNA-seq were performed by mapping sequence reads to the reference genome during transcriptome analysis. However, in this study, we focused on two main ideas. First, differentially expressed genes (DEGs) were identified by de novo-based analysis (DBA) in RNA-seq data from six Thoroughbreds before and after exercise, hereafter referred to as "de novo unique differentially expressed genes" (DUDEG). Second, by integrating both conventional DEGs and genes identified as being selected for during domestication of Thoroughbred and Jeju pony from whole genome re-sequencing (WGS) data, we give a new concept to the definition of DEG. We identified 1,034 and 567 DUDEGs in skeletal muscle and blood, respectively. DUDEGs in skeletal muscle were significantly related to exercise-induced stress biological process gene ontology (BP-GO) terms: 'immune system process'; 'response to stimulus'; and, 'death' and KEGG pathways: 'JAK-STAT signaling pathway'; 'MAPK signaling pathway'; 'regulation of actin cytoskeleton'; and, 'p53 signaling pathway'. In addition, we found TIMELESS, EIF4A3 and ZNF592 in blood and CHMP4C and FOXO3 in skeletal muscle, to be in common between DUDEGs and selected genes identified by evolutionary statistics such as FST and Cross Population Extended Haplotype Homozygosity (XP-EHH). Moreover, in Thoroughbreds, three out of five genes (CHMP4C, EIF4A3 and FOXO3) related to exercise response showed relatively low nucleotide diversity compared to the Jeju pony. DUDEGs are not only conceptually new DEGs that cannot be attained from reference-based analysis (RBA) but also support previous RBA results related to exercise in Thoroughbred. In summary, three exercise-related genes which were selected for during domestication in the evolutionary history of Thoroughbred were identified as conceptually new DEGs in this study.
Investigation of de novo unique differentially expressed genes related to evolution in exercise response during domestication in Thoroughbred race horses.

Science.gov (United States)

Park, Woncheoul; Kim, Jaemin; Kim, Hyeon Jeong; Choi, JaeYoung; Park, Jeong-Woong; Cho, Hyun-Woo; Kim, Byeong-Woo; Park, Myung Hum; Shin, Teak-Soon; Cho, Seong-Keun; Park, Jun-Kyu; Kim, Heebal; Hwang, Jae Yeon; Lee, Chang-Kyu; Lee, Hak-Kyo; Cho, Seoae; Cho, Byung-Wook

2014-01-01

Previous studies of horse RNA-seq were performed by mapping sequence reads to the reference genome during transcriptome analysis. However, in this study, we focused on two main ideas. First, differentially expressed genes (DEGs) were identified by de novo-based analysis (DBA) in RNA-seq data from six Thoroughbreds before and after exercise, hereafter referred to as "de novo unique differentially expressed genes" (DUDEG). Second, by integrating both conventional DEGs and genes identified as being selected for during domestication of Thoroughbred and Jeju pony from whole genome re-sequencing (WGS) data, we give a new concept to the definition of DEG. We identified 1,034 and 567 DUDEGs in skeletal muscle and blood, respectively. DUDEGs in skeletal muscle were significantly related to exercise-induced stress biological process gene ontology (BP-GO) terms: 'immune system process'; 'response to stimulus'; and, 'death' and KEGG pathways: 'JAK-STAT signaling pathway'; 'MAPK signaling pathway'; 'regulation of actin cytoskeleton'; and, 'p53 signaling pathway'. In addition, we found TIMELESS, EIF4A3 and ZNF592 in blood and CHMP4C and FOXO3 in skeletal muscle, to be in common between DUDEGs and selected genes identified by evolutionary statistics such as FST and Cross Population Extended Haplotype Homozygosity (XP-EHH). Moreover, in Thoroughbreds, three out of five genes (CHMP4C, EIF4A3 and FOXO3) related to exercise response showed relatively low nucleotide diversity compared to the Jeju pony. DUDEGs are not only conceptually new DEGs that cannot be attained from reference-based analysis (RBA) but also support previous RBA results related to exercise in Thoroughbred. In summary, three exercise-related genes which were selected for during domestication in the evolutionary history of Thoroughbred were identified as conceptually new DEGs in this study.
Identification of Immunity Related Genes to Study the Physalis peruviana – Fusarium oxysporum Pathosystem

Science.gov (United States)

Enciso-Rodríguez, Felix E.; González, Carolina; Rodríguez, Edwin A.; López, Camilo E.; Landsman, David; Barrero, Luz Stella; Mariño-Ramírez, Leonardo

2013-01-01

The Cape gooseberry (Physalis peruviana L.) is an Andean exotic fruit with high nutritional value and appealing medicinal properties. However, its cultivation faces important phytosanitary problems mainly due to pathogens like Fusarium oxysporum, Cercospora physalidis and Alternaria spp. Here we used the Cape gooseberry foliar transcriptome to search for proteins that encode conserved domains related to plant immunity including: NBS (Nucleotide Binding Site), CC (Coiled-Coil), TIR (Toll/Interleukin-1 Receptor). We identified 74 immunity-related gene candidates in P. peruviana which have the typical resistance gene (R-gene) architecture, including 17 receptor-like kinase (RLK) candidates related to PAMP-Triggered Immunity (PTI), and eight TIR-NBS-LRR (TNL) and nine CC-NBS-LRR (CNL) candidates related to Effector-Triggered Immunity (ETI) genes, among others. These candidate genes were categorized by molecular function (98%), biological process (85%) and cellular component (79%) using gene ontology. Some of the most interesting predicted roles were those associated with binding and transferase activity. We designed 94 primer pairs from the 74 immunity-related genes (IRGs) to amplify the corresponding genomic regions on six genotypes that included resistant and susceptible materials. From these, we selected 17 single-band amplicons and sequenced them in 14 F. oxysporum resistant and susceptible genotypes. Sequence polymorphisms were analyzed through preliminary candidate gene association, which allowed the detection of one SNP at the PpIRG-63 marker revealing a nonsynonymous mutation in the predicted LRR domain, suggesting functional roles for resistance. PMID:23844210
CAsubtype: An R Package to Identify Gene Sets Predictive of Cancer Subtypes and Clinical Outcomes.

Science.gov (United States)

Kong, Hualei; Tong, Pan; Zhao, Xiaodong; Sun, Jielin; Li, Hua

2018-03-01

In the past decade, molecular classification of cancer has gained high popularity owing to its high predictive power on clinical outcomes as compared with traditional methods commonly used in clinical practice. In particular, using gene expression profiles, recent studies have successfully identified a number of gene sets for the delineation of cancer subtypes that are associated with distinct prognosis. However, identification of such gene sets remains a laborious task due to the lack of tools with flexibility, integration and ease of use. To reduce the burden, we have developed an R package, CAsubtype, to efficiently identify gene sets predictive of cancer subtypes and clinical outcomes. By integrating more than 13,000 annotated gene sets, CAsubtype provides a comprehensive repertoire of candidates for new cancer subtype identification. For easy data access, CAsubtype further includes the gene expression and clinical data of more than 2000 cancer patients from TCGA. CAsubtype first employs principal component analysis to identify gene sets (from user-provided or package-integrated ones) with robust principal components representing significantly large variation between cancer samples. Based on these principal components, CAsubtype visualizes the sample distribution in low-dimensional space for better understanding of the distinction between samples and classifies samples into subgroups with prevalent clustering algorithms. Finally, CAsubtype performs survival analysis to compare the clinical outcomes between the identified subgroups, assessing their clinical value as potentially novel cancer subtypes. In conclusion, CAsubtype is a flexible and well-integrated tool in the R environment to identify gene sets for cancer subtype identification and clinical outcome prediction. Its simple R commands and comprehensive data sets enable efficient examination of the clinical value of any given gene set, thus facilitating hypothesis generating and testing in biological and
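The workflow described in this abstract (principal component analysis on a gene set, then clustering samples on the resulting components) can be illustrated with a small sketch. This is not the CAsubtype implementation or its R interface: the function names, the power-iteration PCA, the one-dimensional two-means split and the toy expression values are all assumptions chosen for illustration, and the survival-analysis step is omitted.

```python
import random

def center_columns(matrix):
    """Subtract each gene's mean so components reflect variation between samples."""
    n = len(matrix)
    means = [sum(col) / n for col in zip(*matrix)]
    return [[v - m for v, m in zip(row, means)] for row in matrix]

def first_pc_scores(matrix, iterations=200, seed=0):
    """Score each sample (row) on the first principal component via power iteration."""
    x = center_columns(matrix)
    n_genes = len(x[0])
    rng = random.Random(seed)
    w = [rng.random() + 0.1 for _ in range(n_genes)]  # arbitrary positive start vector
    for _ in range(iterations):
        scores = [sum(a * b for a, b in zip(row, w)) for row in x]                  # X w
        w = [sum(s * row[j] for s, row in zip(scores, x)) for j in range(n_genes)]  # X^T (X w)
        norm = sum(v * v for v in w) ** 0.5
        if norm == 0:
            return [0.0] * len(x)  # no variation in the data at all
        w = [v / norm for v in w]
    return [sum(a * b for a, b in zip(row, w)) for row in x]

def two_means_1d(scores, iterations=50):
    """Split samples into two subgroups by k-means (k = 2) on a single score."""
    lo, hi = min(scores), max(scores)
    labels = [0] * len(scores)
    for _ in range(iterations):
        labels = [0 if abs(s - lo) <= abs(s - hi) else 1 for s in scores]
        group0 = [s for s, l in zip(scores, labels) if l == 0]
        group1 = [s for s, l in zip(scores, labels) if l == 1]
        if group0:
            lo = sum(group0) / len(group0)
        if group1:
            hi = sum(group1) / len(group1)
    return labels

# Toy matrix (hypothetical values): rows are samples, columns are genes in one gene set.
toy_expression = [
    [1.0, 2.0, 5.0], [1.2, 1.9, 5.1], [0.9, 2.1, 4.9],
    [4.0, 6.0, 5.0], [4.1, 5.8, 5.2], [3.9, 6.1, 4.8],
]
toy_labels = two_means_1d(first_pc_scores(toy_expression))
```

In a real analysis the subgroup labels would then be compared for clinical outcome, for example with a log-rank test, which this sketch leaves out.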
GTI: a novel algorithm for identifying outlier gene expression profiles from integrated microarray datasets.

Directory of Open Access Journals (Sweden)

John Patrick Mpindi

BACKGROUND: Meta-analysis of gene expression microarray datasets presents significant challenges for statistical analysis. We developed and validated a new bioinformatic method for the identification of genes upregulated in subsets of samples of a given tumour type ('outlier genes'), a hallmark of potential oncogenes. METHODOLOGY: A new statistical method (the gene tissue index, GTI) was developed by modifying and adapting algorithms originally developed for statistical problems in economics. We compared the potential of the GTI to detect outlier genes in meta-datasets with four previously defined statistical methods, COPA, the OS statistic, the t-test and ORT, using simulated data. We demonstrated that the GTI performed equally well to existing methods in a single study simulation. Next, we evaluated the performance of the GTI in the analysis of combined Affymetrix gene expression data from several published studies covering 392 normal samples of tissue from the central nervous system, 74 astrocytomas, and 353 glioblastomas. According to the results, the GTI was better able than most of the previous methods to identify known oncogenic outlier genes. In addition, the GTI identified 29 novel outlier genes in glioblastomas, including TYMS and CDKN2A. The over-expression of these genes was validated in vivo by immunohistochemical staining data from clinical glioblastoma samples. Immunohistochemical data were available for 65% (19 of 29) of these genes, and 17 of these 19 genes (90%) showed a typical outlier staining pattern. Furthermore, raltitrexed, a specific inhibitor of TYMS used in the therapy of tumour types other than glioblastoma, also effectively blocked cell proliferation in glioblastoma cell lines, thus highlighting this outlier gene candidate as a potential therapeutic target. CONCLUSIONS/SIGNIFICANCE: Taken together, these results support the GTI as a novel approach to identify potential oncogene outliers and drug targets. The algorithm is
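The abstract does not define the GTI itself, so the sketch below illustrates only the general idea of an outlier-gene score, using COPA (one of the comparison methods named above) as it is commonly described: expression values are centred on the median, scaled by the median absolute deviation, and a high percentile of the transformed values is taken as the score. The function name, the nearest-rank percentile rule and the toy values are assumptions for illustration, not the authors' code.

```python
import statistics

def copa_score(values, quantile=0.90):
    """COPA-style outlier score for one gene across samples (illustrative sketch)."""
    med = statistics.median(values)
    mad = statistics.median([abs(v - med) for v in values])
    if mad == 0:
        return 0.0  # no spread around the median, so no usable scale
    transformed = sorted((v - med) / mad for v in values)
    # Simple nearest-rank percentile; other percentile conventions would also work.
    idx = min(len(transformed) - 1, int(quantile * len(transformed)))
    return transformed[idx]

# Hypothetical expression values: 2 of 10 samples strongly over-express the gene.
outlier_gene = [1.0, 1.1, 0.9, 1.0, 1.2, 0.8, 1.0, 1.1, 6.0, 7.0]
# Hypothetical gene with uniform expression across samples.
flat_gene = [1.0, 1.1, 0.9, 1.0, 1.2, 0.8, 1.0, 1.1, 1.0, 0.9]

outlier_score = copa_score(outlier_gene)
flat_score = copa_score(flat_gene)
```

A gene over-expressed in only a subset of samples receives a much larger score than a uniformly expressed gene, which is the property an outlier statistic is meant to capture.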
A translational systems biology approach in both animals and humans identifies a functionally related module of accumbal genes involved in the regulation of reward processing and binge drinking in males.

Science.gov (United States)

Stacey, David; Lourdusamy, Anbarasu; Ruggeri, Barbara; Maroteaux, Matthieu; Jia, Tianye; Cattrell, Anna; Nymberg, Charlotte; Banaschewski, Tobias; Bhattacharyya, Sohinee; Band, Hamid; Barker, Gareth; Bokde, Arun; Buchel, Christian; Carvalho, Fabiana; Conrod, Patricia; Desrivieres, Sylvane; Easton, Alanna; Fauth-Buehler, Mira; Fernandez-Medarde, Alberto; Flor, Herta; Frouin, Vincent; Gallinat, Jurgen; Garavanh, Hugh; Heinz, Andreas; Ittermann, Bernd; Lathrop, Mark; Lawrence, Claire; Loth, Eva; Mann, Karl; Martinot, Jean-Luc; Nees, Frauke; Paus, Tomas; Pausova, Zdenka; Rietschel, Marcella; Rotter, Andrea; Santos, Eugenio; Smolka, Michael; Sommer, Wolfgang; Mameli, Manuel; Spanagel, Rainer; Girault, Jean-Antoine; Mueller, Christian; Schumann, Gunter

2016-04-01

The mesolimbic dopamine system, composed primarily of dopaminergic neurons in the ventral tegmental area that project to striatal structures, is considered to be the key mediator of reinforcement-related mechanisms in the brain. Prompted by a genome-wide association meta-analysis implicating the Ras-specific guanine nucleotide-releasing factor 2 (RASGRF2) gene in the regulation of alcohol intake in men, we have recently shown that male Rasgrf2(-/-) mice exhibit reduced ethanol intake and preference accompanied by a perturbed mesolimbic dopamine system. We therefore propose that these mice represent a valid model to further elucidate the precise genes and mechanisms regulating mesolimbic dopamine functioning. Transcriptomic data from the nucleus accumbens (NAcc) of male Rasgrf2(-/-) mice and wild-type controls were analyzed by weighted gene coexpression network analysis (WGCNA). We performed follow-up genetic association tests in humans using a sample of male adolescents from the IMAGEN study characterized for binge drinking (n = 905) and ventral striatal activation during an fMRI reward task (n = 608). The WGCNA analyses using accumbal transcriptomic data revealed 37 distinct "modules," or functionally related groups of genes. Two of these modules were significantly associated with Rasgrf2 knockout status: M5 (p reward task (pempirical < 0.001). It was not possible to determine the extent to which the M5 module was dysregulated in Rasgrf2(-/-) mice by perturbed mesolimbic dopamine signalling or by the loss of Rasgrf2 function in the NAcc. Taken together, our findings indicate that the accumbal M5 module, initially identified as being dysregulated in male Rasgrf2(-/-) mice, is also relevant for human alcohol-related phenotypes potentially through the modulation of reinforcement mechanisms in the NAcc. We therefore propose that the genes comprising this module represent important candidates for further elucidation within the context of alcohol-related phenotypes.

A comparative genomics screen identifies a Sinorhizobium meliloti 1021 sodM-like gene strongly expressed within host plant nodules

Directory of Open Access Journals (Sweden)

Queiroux Clothilde

2012-05-01

Background: We have used the genomic data in the Integrated Microbial Genomes system of the Department of Energy’s Joint Genome Institute to make predictions about rhizobial open reading frames (ORFs) that play a role in nodulation of host plants. The genomic data were screened by searching for ORFs conserved in α-proteobacterial rhizobia, but not conserved in closely-related non-nitrogen-fixing α-proteobacteria. Results: Using this approach, we identified many genes known to be involved in nodulation or nitrogen fixation, as well as several new candidate genes. We knocked out selected new genes and assayed for the presence of nodulation phenotypes and/or nodule-specific expression. One of these genes, SMc00911, is strongly expressed by bacterial cells within host plant nodules, but is expressed minimally by free-living bacterial cells. A strain carrying an insertion mutation in SMc00911 is not defective in the symbiosis with host plants, but in contrast to expectations, this mutant strain is able to out-compete the S. meliloti 1021 wild type strain for nodule occupancy in co-inoculation experiments. The SMc00911 ORF is predicted to encode a “SodM-like” (superoxide dismutase-like) protein containing a rhodanese sulfurtransferase domain at the N-terminus and a chromate-resistance superfamily domain at the C-terminus. Several other ORFs (SMb20360, SMc01562, SMc01266, SMc03964, and the SMc01424-22 operon) identified in the screen are expressed at a moderate level by bacteria within nodules, but not by free-living bacteria. Conclusions: Based on the analysis of ORFs identified in this study, we conclude that this comparative genomics approach can identify rhizobial genes involved in the nitrogen-fixing symbiosis with host plants, although none of the newly identified genes were found to be essential for this process.
Cross-species global and subset gene expression profiling identifies genes involved in prostate cancer response to selenium

Directory of Open Access Journals (Sweden)

Dhir Rajiv

2004-08-01

Background: Gene expression technologies have the ability to generate vast amounts of data, yet there often remain only limited resources for subsequent validation studies. This necessitates the ability to perform sorting and prioritization of the output data. Previously described methodologies have used functional pathways or transcriptional regulatory grouping to sort genes for further study. In this paper we demonstrate a comparative genomics based method to leverage data from animal models to prioritize genes for validation. This approach allows one to develop a disease-based focus for the prioritization of gene data, a process that is essential for systems that lack significant functional pathway data yet have defined animal models. This method is made possible through the use of highly controlled spotted cDNA slide production and the use of comparative bioinformatics databases without the use of cross-species slide hybridizations. Results: Using gene expression profiling, we have demonstrated similar whole-transcriptome gene expression patterns in prostate cancer cells from human and rat prostate cancer cell lines, both at baseline expression levels and after treatment with physiologic concentrations of the proposed chemopreventive agent selenium. Using both the human PC3 and rat PAII prostate cancer cell lines, we have gone on to identify a subset of one hundred and fifty-four genes that demonstrate a similar level of differential expression in response to selenium treatment in both species. Further analysis and data mining for two genes, the Insulin-like Growth Factor Binding Protein 3 and Retinoic X Receptor alpha, demonstrates an association with prostate cancer, functional pathway links, and protein-protein interactions that make these genes prime candidates for explaining the mechanism of selenium's chemopreventive effect in prostate cancer. These genes are subsequently validated by western blots showing selenium-based induction and using
A large-scale RNA interference screen identifies genes that regulate autophagy at different stages

DEFF Research Database (Denmark)

Guo, Sujuan; Pridham, Kevin J; Virbasius, Ching-Man

2018-01-01

Dysregulated autophagy is central to the pathogenesis and therapeutic development of cancer. However, how autophagy is regulated in cancer is not well understood and genes that modulate cancer autophagy are not fully defined. To gain more insights into autophagy regulation in cancer, we performed...... with fluorescence-activated cell sorting, we successfully isolated autophagic K562 cells where we identified 336 short hairpin RNAs. After candidate validation using Cyto-ID fluorescence spectrophotometry, LC3B immunoblotting, and quantitative RT-PCR, 82 genes were identified as autophagy-regulating genes. 20 genes...... have been reported previously and the remaining 62 candidates are novel autophagy mediators. Bioinformatic analyses revealed that most candidate genes were involved in molecular pathways regulating autophagy, rather than directly participating in the autophagy process. Further autophagy flux assays...
Identification of Genes Related to Paulownia Witches’ Broom by AFLP and MSAP

Directory of Open Access Journals (Sweden)

Xibing Cao

2014-08-01

DNA methylation is believed to play important roles in regulating gene expression in plant growth and development. Paulownia witches’ broom (PaWB) infection has been reported to be related to gene expression changes in paulownia plantlets. To determine whether DNA methylation is associated with gene expression changes in response to phytoplasma, we investigated variations in genomic DNA sequence and methylation in PaWB plantlets treated with methyl methane sulfonate (MMS) using amplified fragment length polymorphism (AFLP) and methylation-sensitive amplification polymorphism (MSAP) techniques, respectively. The results indicated that PaWB seedlings recovered a normal morphology after treatment with more than 15 mg·L^-1 MMS. PaWB infection did not cause changes of the paulownia DNA sequence at the AFLP level; however, DNA methylation levels and patterns were altered. Quantitative real-time PCR (qRT-PCR) showed that three of the methylated genes were up-regulated and three were down-regulated in the MMS-treated PaWB plantlets that had regained healthy morphology. These six genes might be involved in transcriptional regulation, plant defense, signal transduction and energy. The possible roles of these genes in PaWB are discussed. The results showed that changes of DNA methylation altered gene expression levels, and that MSAP might help identify genes related to PaWB.
Novel gene function revealed by mouse mutagenesis screens for models of age-related disease.

Science.gov (United States)

Potter, Paul K; Bowl, Michael R; Jeyarajan, Prashanthini; Wisby, Laura; Blease, Andrew; Goldsworthy, Michelle E; Simon, Michelle M; Greenaway, Simon; Michel, Vincent; Barnard, Alun; Aguilar, Carlos; Agnew, Thomas; Banks, Gareth; Blake, Andrew; Chessum, Lauren; Dorning, Joanne; Falcone, Sara; Goosey, Laurence; Harris, Shelley; Haynes, Andy; Heise, Ines; Hillier, Rosie; Hough, Tertius; Hoslin, Angela; Hutchison, Marie; King, Ruairidh; Kumar, Saumya; Lad, Heena V; Law, Gemma; MacLaren, Robert E; Morse, Susan; Nicol, Thomas; Parker, Andrew; Pickford, Karen; Sethi, Siddharth; Starbuck, Becky; Stelma, Femke; Cheeseman, Michael; Cross, Sally H; Foster, Russell G; Jackson, Ian J; Peirson, Stuart N; Thakker, Rajesh V; Vincent, Tonia; Scudamore, Cheryl; Wells, Sara; El-Amraoui, Aziz; Petit, Christine; Acevedo-Arozena, Abraham; Nolan, Patrick M; Cox, Roger; Mallon, Anne-Marie; Brown, Steve D M

2016-08-18

Determining the genetic bases of age-related disease remains a major challenge requiring a spectrum of approaches from human and clinical genetics to the utilization of model organism studies. Here we report a large-scale genetic screen in mice employing a phenotype-driven discovery platform to identify mutations resulting in age-related disease, both late-onset and progressive. We have utilized N-ethyl-N-nitrosourea mutagenesis to generate pedigrees of mutagenized mice that were subject to recurrent screens for mutant phenotypes as the mice aged. In total, we identify 105 distinct mutant lines from 157 pedigrees analysed, out of which 27 are late-onset phenotypes across a range of physiological systems. Using whole-genome sequencing we uncover the underlying genes for 44 of these mutant phenotypes, including 12 late-onset phenotypes. These genes reveal a number of novel pathways involved with age-related disease. We illustrate our findings by the recovery and characterization of a novel mouse model of age-related hearing loss.
X-exome sequencing of 405 unresolved families identifies seven novel intellectual disability genes.

Science.gov (United States)

Hu, H; Haas, S A; Chelly, J; Van Esch, H; Raynaud, M; de Brouwer, A P M; Weinert, S; Froyen, G; Frints, S G M; Laumonnier, F; Zemojtel, T; Love, M I; Richard, H; Emde, A-K; Bienek, M; Jensen, C; Hambrock, M; Fischer, U; Langnick, C; Feldkamp, M; Wissink-Lindhout, W; Lebrun, N; Castelnau, L; Rucci, J; Montjean, R; Dorseuil, O; Billuart, P; Stuhlmann, T; Shaw, M; Corbett, M A; Gardner, A; Willis-Owen, S; Tan, C; Friend, K L; Belet, S; van Roozendaal, K E P; Jimenez-Pocquet, M; Moizard, M-P; Ronce, N; Sun, R; O'Keeffe, S; Chenna, R; van Bömmel, A; Göke, J; Hackett, A; Field, M; Christie, L; Boyle, J; Haan, E; Nelson, J; Turner, G; Baynam, G; Gillessen-Kaesbach, G; Müller, U; Steinberger, D; Budny, B; Badura-Stronka, M; Latos-Bieleńska, A; Ousager, L B; Wieacker, P; Rodríguez Criado, G; Bondeson, M-L; Annerén, G; Dufke, A; Cohen, M; Van Maldergem, L; Vincent-Delorme, C; Echenne, B; Simon-Bouy, B; Kleefstra, T; Willemsen, M; Fryns, J-P; Devriendt, K; Ullmann, R; Vingron, M; Wrogemann, K; Wienker, T F; Tzschach, A; van Bokhoven, H; Gecz, J; Jentsch, T J; Chen, W; Ropers, H-H; Kalscheuer, V M

2016-01-01

X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes or loci are yet to be identified. Here, we have investigated 405 unresolved families with XLID. We employed massively parallel sequencing of all X-chromosome exons in the index males. The majority of these males were previously tested negative for copy number variations and for mutations in a subset of known XLID genes by Sanger sequencing. In total, 745 X-chromosomal genes were screened. After stringent filtering, a total of 1297 non-recurrent exonic variants remained for prioritization. Co-segregation analysis of potential clinically relevant changes revealed that 80 families (20%) carried pathogenic variants in established XLID genes. In 19 families, we detected likely causative protein truncating and missense variants in 7 novel and validated XLID genes (CLCN4, CNKSR2, FRMPD4, KLHL15, LAS1L, RLIM and USP27X) and potentially deleterious variants in 2 novel candidate XLID genes (CDK16 and TAF1). We show that the CLCN4 and CNKSR2 variants impair protein functions as indicated by electrophysiological studies and altered differentiation of cultured primary neurons from Clcn4(-/-) mice or after mRNA knock-down. The newly identified and candidate XLID proteins belong to pathways and networks with established roles in cognitive function and intellectual disability in particular. We suggest that systematic sequencing of all X-chromosomal genes in a cohort of patients with genetic evidence for X-chromosome locus involvement may resolve up to 58% of Fragile X-negative cases.
Prediction potential of candidate biomarker sets identified and validated on gene expression data from multiple datasets

Directory of Open Access Journals (Sweden)

Karacali Bilge

2007-10-01

Background: Independently derived expression profiles of the same biological condition often have few genes in common. In this study, we created populations of expression profiles from publicly available microarray datasets of cancer (breast, lymphoma and renal) samples linked to clinical information with an iterative machine learning algorithm. ROC curves were used to assess the prediction error of each profile for classification. We compared the prediction error of profiles correlated with molecular phenotype against profiles correlated with relapse-free status. Prediction errors of profiles identified with supervised univariate feature selection algorithms were compared to those of profiles selected randomly from (a) all genes on the microarray platform and (b) a list of known disease-related genes (a priori selection). We also determined the relevance of expression profiles on test arrays from independent datasets, measured on either the same or different microarray platforms. Results: Highly discriminative expression profiles were produced on both simulated gene expression data and expression data from breast cancer and lymphoma datasets on the basis of ER and BCL-6 expression, respectively. Use of relapse-free status to identify profiles for prognosis prediction resulted in poorly discriminative decision rules. Supervised feature selection resulted in more accurate classifications than random or a priori selection; however, the difference in prediction error decreased as the number of features increased. These results held when decision rules were applied across datasets to samples profiled on the same microarray platform. Conclusion: Our results show that many gene sets predict molecular phenotypes accurately. Given this, expression profiles identified using different training datasets should be expected to show little agreement. In addition, we demonstrate the difficulty in predicting relapse directly from microarray data using supervised machine
POSSIBLE RELATED FUNCTIONS OF THE NON-HOMOLOGOUS CO-REGULATED GENE PAIR PDCD10 AND SERPINI1

Directory of Open Access Journals (Sweden)

Concetta Scimone

2017-04-01

Gene expression in mammals is a very finely controlled mechanism, and bidirectional promoters can be considered one of the most compelling examples of the accuracy of gene expression coordination. As recently reported, a bidirectional promoter regulates the expression of the PDCD10 (whose mutations cause familial Cerebral Cavernous Malformations (CCMs)) and SERPINI1 gene pair, even though they are non-homologous genes. The aim of this study was to identify any potential common roles of these two coregulated genes. An in-silico approach was used to identify functional correlations, using the BioGraph, IPA® and Cytoscape tools and the KEGG pathway database. The results obtained show that PDCD10 and SERPINI1 may co-regulate some cellular processes, particularly those related to focal adhesion maintenance. All common pathways identified for PDCD10 and SERPINI1 are closely associated with the pathogenic characteristics of CCMs; we thus hypothesize that genes involved in these networks may contribute to the development of CCMs.
Gene Expression Signature Analysis Identifies Vorinostat as a Candidate Therapy for Gastric Cancer

Science.gov (United States)

Choi, Woonyoung; Park, Yun-Yong; Kim, KyoungHyun; Kim, Sang-Bae; Lee, Ju-Seog; Mills, Gordon B.; Cho, Jae Yong

2011-01-01

Background: Gastric cancer continues to be one of the deadliest cancers in the world; identification of new drugs targeting this type of cancer is therefore of significant importance. The purpose of this study was to identify and validate a therapeutic agent which might improve the outcomes for gastric cancer patients in the future. Methodology/Principal Findings: Using microarray technology, we generated a gene expression profile of human gastric cancer-specific genes from human gastric cancer tissue samples. We used this profile in the Broad Institute's Connectivity Map analysis to identify candidate therapeutic compounds for gastric cancer. We identified the histone deacetylase inhibitor vorinostat as the lead compound and thus a potential therapeutic drug for gastric cancer. Vorinostat induced both apoptosis and autophagy in gastric cancer cell lines. Pharmacological and genetic inhibition of autophagy, however, increased the therapeutic efficacy of vorinostat, indicating that a combination of vorinostat with autophagy inhibitors may be therapeutically more beneficial. Moreover, gene expression analysis of gastric cancer identified a collection of genes (ITGB5, TYMS, MYB, APOC1, CBX5, PLA2G2A, and KIF20A) whose expression was elevated in gastric tumor tissue and downregulated more than 2-fold by vorinostat treatment in gastric cancer cell lines. In contrast, SCGB2A1, TCN1, CFD, APLP1, and NQO1 manifested a reversed pattern. Conclusions/Significance: We showed that analysis of gene expression signatures may represent an emerging approach to discover therapeutic agents for gastric cancer, such as vorinostat. The observation of altered gene expression after vorinostat treatment may provide the clue to identify the molecular mechanism of vorinostat and those patients likely to benefit from vorinostat treatment. PMID:21931799
Comprehensive Analysis of Gene Expression Profiles of Sepsis-Induced Multiorgan Failure Identified Its Valuable Biomarkers.

Science.gov (United States)

Wang, Yumei; Yin, Xiaoling; Yang, Fang

2018-02-01

Sepsis is an inflammation-related disease, and severe sepsis can induce multiorgan dysfunction, which is the most common cause of death of patients in noncoronary intensive care units. Progress in novel therapeutic strategies has proven to be of little impact on the mortality of severe sepsis, and unfortunately, its mechanisms still remain poorly understood. In this study, we analyzed gene expression profiles of severe sepsis with failure of lung, kidney, and liver for the identification of potential biomarkers. We first downloaded the gene expression profiles from the Gene Expression Omnibus and performed preprocessing of raw microarray data sets and identification of differentially expressed genes (DEGs) through the R programming software; then, significantly enriched functions of DEGs in lung, kidney, and liver failure sepsis samples were obtained from the Database for Annotation, Visualization, and Integrated Discovery; finally, a protein-protein interaction network was constructed for DEGs based on the STRING database, and network modules were also obtained through the MCODE cluster method. As a result, lung failure sepsis had the highest number of DEGs (859), whereas the numbers of DEGs in kidney and liver failure sepsis samples were 178 and 175, respectively. In addition, 17 overlaps were obtained among the three lists of DEGs. Biological processes related to immune and inflammatory response were found to be significantly enriched in DEGs. Network and module analysis identified four gene clusters in which all or most of the genes were upregulated. The expression changes of Icam1 and Socs3 were further validated through quantitative PCR analysis. This study should shed light on the development of sepsis and provide potential therapeutic targets for sepsis-induced multiorgan failure.
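The overlap step mentioned in this abstract (genes shared by the lung, kidney and liver DEG lists) amounts to a set intersection. The sketch below shows that operation only; the function name and the placeholder gene identifiers are hypothetical and do not reproduce the study's gene lists.

```python
def shared_degs(*deg_lists):
    """Return the genes present in every DEG list, sorted for stable output."""
    sets = [set(genes) for genes in deg_lists]
    if not sets:
        return []
    return sorted(set.intersection(*sets))

# Placeholder identifiers (hypothetical), standing in for organ-specific DEG lists.
lung_degs = ["GeneA", "GeneB", "GeneC", "GeneD"]
kidney_degs = ["GeneB", "GeneC", "GeneE"]
liver_degs = ["GeneC", "GeneB", "GeneF"]

overlap = shared_degs(lung_degs, kidney_degs, liver_degs)
```

With real data, each list would come from the differential expression analysis of one organ-failure group.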
Identifying candidate driver genes by integrative ovarian cancer genomics data

Science.gov (United States)

Lu, Xinguo; Lu, Jibo

2017-08-01

Integrative analysis of molecular mechanics underlying cancer can distinguish interactions that cannot be revealed based on one kind of data for the appropriate diagnosis and treatment of cancer patients. Tumor samples exhibit heterogeneity in omics data, such as somatic mutations, copy number variations (CNVs), gene expression profiles and so on. In this paper, we combined gene co-expression modules and mutation modulators separately in tumor patients to obtain the candidate driver genes for resistant and sensitive tumors from the heterogeneous data. The final list of modulators identified is well known in biological processes associated with ovarian cancer, and includes CCL17, CACTIN, CCL16, CCL22, APOB, KDF1, CCL11, HNF1B, LRG1, MED1 and so on, which can help to facilitate the discovery of biomarkers, molecular diagnostics, and drug discovery.
Common Mechanisms Underlying Refractive Error Identified in Functional Analysis of Gene Lists From Genome-Wide Association Study Results in 2 European British Cohorts

Science.gov (United States)

Hysi, Pirro G.; Mahroo, Omar A.; Cumberland, Phillippa; Wojciechowski, Robert; Williams, Katie M.; Young, Terri L.; Mackey, David A.; Rahi, Jugnoo S.; Hammond, Christopher J.

2014-01-01

IMPORTANCE To date, relatively few genes responsible for a fraction of heritability have been identified by means of large genetic association studies of refractive error. OBJECTIVE To explore the genetic mechanisms that lead to refractive error in the general population. DESIGN, SETTING, AND PARTICIPANTS Genome-wide association studies were carried out in 2 British population-based independent cohorts (N = 5928 participants) to identify genes moderately associated with refractive error. MAIN OUTCOMES AND MEASURES Enrichment analyses were used to identify sets of genes overrepresented in both cohorts. Enriched groups of genes were compared between both participating cohorts as a further measure against random noise. RESULTS Groups of genes enriched at highly significant statistical levels were remarkably consistent in both cohorts. In particular, these results indicated that plasma membrane (P = 7.64 × 10^-30), cell-cell adhesion (P = 2.42 × 10^-18), synaptic transmission (P = 2.70 × 10^-14), calcium ion binding (P = 3.55 × 10^-15), and cation channel activity (P = 2.77 × 10^-14) were significantly overrepresented in relation to refractive error. CONCLUSIONS AND RELEVANCE These findings provide evidence that development of refractive error in the general population is related to the intensity of photosignal transduced from the retina, which may have implications for future interventions to minimize this disorder. Pathways connected to the procession of the nerve impulse are major mechanisms involved in the development of refractive error in populations of European origin. PMID:24264139
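The abstract does not say which enrichment statistic was used. As a generic illustration of how over-representation of a gene set can be quantified, the sketch below computes a one-sided hypergeometric tail probability, a common choice for this kind of analysis. The function name, parameter names and example counts are assumptions and are unrelated to the study's data.

```python
from math import comb

def enrichment_p_value(total_genes, set_size, selected, overlap):
    """P(X >= overlap) for X ~ Hypergeometric(total_genes, set_size, selected).

    total_genes: number of genes in the background
    set_size:    number of background genes annotated to the gene set
    selected:    number of genes in the associated (selected) list
    overlap:     number of selected genes that fall in the gene set
    """
    upper = min(set_size, selected)
    favourable = sum(
        comb(set_size, i) * comb(total_genes - set_size, selected - i)
        for i in range(overlap, upper + 1)
    )
    return favourable / comb(total_genes, selected)

# Hypothetical counts: 10 background genes, 5 in the set, 5 selected, all 5 overlap.
p_all_overlap = enrichment_p_value(10, 5, 5, 5)
# With a required overlap of 0, every outcome qualifies, so the probability is 1.
p_trivial = enrichment_p_value(10, 5, 5, 0)
```

Very small P values, such as those reported in the abstract, arise when the observed overlap is far larger than expected by chance for the given set sizes.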
Gene-environment interaction involving recently identified colorectal cancer susceptibility loci

Science.gov (United States)

Kantor, Elizabeth D.; Hutter, Carolyn M.; Minnier, Jessica; Berndt, Sonja I.; Brenner, Hermann; Caan, Bette J.; Campbell, Peter T.; Carlson, Christopher S.; Casey, Graham; Chan, Andrew T.; Chang-Claude, Jenny; Chanock, Stephen J.; Cotterchio, Michelle; Du, Mengmeng; Duggan, David; Fuchs, Charles S.; Giovannucci, Edward L.; Gong, Jian; Harrison, Tabitha A.; Hayes, Richard B.; Henderson, Brian E.; Hoffmeister, Michael; Hopper, John L.; Jenkins, Mark A.; Jiao, Shuo; Kolonel, Laurence N.; Le Marchand, Loic; Lemire, Mathieu; Ma, Jing; Newcomb, Polly A.; Ochs-Balcom, Heather M.; Pflugeisen, Bethann M.; Potter, John D.; Rudolph, Anja; Schoen, Robert E.; Seminara, Daniela; Slattery, Martha L.; Stelling, Deanna L.; Thomas, Fridtjof; Thornquist, Mark; Ulrich, Cornelia M.; Warnick, Greg S.; Zanke, Brent W.; Peters, Ulrike; Hsu, Li; White, Emily

2014-01-01

BACKGROUND Genome-wide association studies have identified several single nucleotide polymorphisms (SNPs) that are associated with risk of colorectal cancer (CRC). Prior research has evaluated the presence of gene-environment interaction involving the first 10 identified susceptibility loci, but little work has been conducted on interaction involving SNPs at recently identified susceptibility loci, including: rs10911251, rs6691170, rs6687758, rs11903757, rs10936599, rs647161, rs1321311, rs719725, rs1665650, rs3824999, rs7136702, rs11169552, rs59336, rs3217810, rs4925386, and rs2423279. METHODS Data on 9160 cases and 9280 controls from the Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO) and Colon Cancer Family Registry (CCFR) were used to evaluate the presence of interaction involving the above-listed SNPs and sex, body mass index (BMI), alcohol consumption, smoking, aspirin use, post-menopausal hormone (PMH) use, as well as intake of dietary calcium, dietary fiber, dietary folate, red meat, processed meat, fruit, and vegetables. Interaction was evaluated using a fixed-effects meta-analysis of an efficient Empirical Bayes estimator, and permutation was used to account for multiple comparisons. RESULTS None of the permutation-adjusted p-values reached statistical significance. CONCLUSIONS The associations between recently identified genetic susceptibility loci and CRC are not strongly modified by sex, BMI, alcohol, smoking, aspirin, PMH use, and various dietary factors. IMPACT Results suggest no evidence of strong gene-environment interactions involving the recently identified 16 susceptibility loci for CRC taken one at a time. PMID:24994789
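The pooling step of a fixed-effects meta-analysis can be illustrated with inverse-variance weighting. This is a sketch of the generic technique only: it does not reproduce the Empirical Bayes interaction estimator or the permutation adjustment mentioned in the abstract, and the per-study numbers are made up.

```python
from math import sqrt


def fixed_effects(estimates, std_errors):
    """Inverse-variance weighted pooled estimate and its standard error."""
    weights = [1.0 / se ** 2 for se in std_errors]
    total_weight = sum(weights)
    pooled = sum(w * e for w, e in zip(weights, estimates)) / total_weight
    pooled_se = sqrt(1.0 / total_weight)
    return pooled, pooled_se


# Hypothetical per-study interaction estimates (log odds ratios) and standard errors.
pooled, pooled_se = fixed_effects([0.2, 0.4], [0.1, 0.1])
print(pooled, pooled_se)
```

Studies with smaller standard errors receive larger weights, so the pooled estimate is dominated by the most precise studies.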
Genome-wide siRNA-based functional genomics of pigmentation identifies novel genes and pathways that impact melanogenesis in human cells.

Directory of Open Access Journals (Sweden)

Anand K Ganesan

2008-12-01

Full Text Available Melanin protects the skin and eyes from the harmful effects of UV irradiation, protects neural cells from toxic insults, and is required for sound conduction in the inner ear. Aberrant regulation of melanogenesis underlies skin disorders (melasma and vitiligo), neurologic disorders (Parkinson's disease), auditory disorders (Waardenburg's syndrome), and ophthalmologic disorders (age related macular degeneration). Much of the core synthetic machinery driving melanin production has been identified; however, the spectrum of gene products participating in melanogenesis in different physiological niches is poorly understood. Functional genomics based on RNA-mediated interference (RNAi) provides the opportunity to derive unbiased comprehensive collections of pharmaceutically tractable single gene targets supporting melanin production. In this study, we have combined a high-throughput, cell-based, one-well/one-gene screening platform with a genome-wide arrayed synthetic library of chemically synthesized, small interfering RNAs to identify novel biological pathways that govern melanin biogenesis in human melanocytes. Ninety-two novel genes that support pigment production were identified with a low false discovery rate. Secondary validation and preliminary mechanistic studies identified a large panel of targets that converge on tyrosinase expression and stability. Small molecule inhibition of a family of gene products in this class was sufficient to impair chronic tyrosinase expression in pigmented melanoma cells and UV-induced tyrosinase expression in primary melanocytes. Isolation of molecular machinery known to support autophagosome biosynthesis from this screen, together with in vitro and in vivo validation, exposed a close functional relationship between melanogenesis and autophagy. In summary, these studies illustrate the power of RNAi-based functional genomics to identify novel genes, pathways, and pharmacologic agents that impact a biological phenotype
Identification of Immunity-Related Genes in Dialeurodes citri against Entomopathogenic Fungus Lecanicillium attenuatum by RNA-Seq Analysis.

Directory of Open Access Journals (Sweden)

Shijiang Yu

Full Text Available Dialeurodes citri is a major pest in citrus producing areas, and large-scale outbreaks have occurred increasingly often in recent years. Lecanicillium attenuatum is an important entomopathogenic fungus that can parasitize and kill D. citri. We separated the fungus from corpses of D. citri larvae. However, the sound immune defense system of pests makes infection by an entomopathogenic fungus difficult. Here we used RNA sequencing technology (RNA-Seq) to build a transcriptome database for D. citri and performed digital gene expression profiling to screen genes that act in the immune defense of D. citri larvae infected with a pathogenic fungus. De novo assembly generated 84,733 unigenes with mean length of 772 nt. All unigenes were searched against GO, Nr, Swiss-Prot, COG, and KEGG databases and a total of 28,190 (33.3%) unigenes were annotated. We identified 129 immunity-related unigenes in transcriptome database that were related to pattern recognition receptors, information transduction factors and response factors. From the digital gene expression profile, we identified 441 unigenes that were differentially expressed in D. citri infected with L. attenuatum. Through calculated Log2Ratio values, we identified genes for which fold changes in expression were obvious, including cuticle protein, vitellogenin, cathepsin, prophenoloxidase, clip-domain serine protease, lysozyme, and others. Subsequent quantitative real-time polymerase chain reaction analysis verified the results. The identified genes may serve as target genes for microbial control of D. citri.
Identification of Immunity-Related Genes in Dialeurodes citri against Entomopathogenic Fungus Lecanicillium attenuatum by RNA-Seq Analysis.

Science.gov (United States)

Yu, Shijiang; Ding, Lili; Luo, Ren; Li, Xiaojiao; Yang, Juan; Liu, Haoqiang; Cong, Lin; Ran, Chun

2016-01-01

Dialeurodes citri is a major pest in citrus producing areas, and large-scale outbreaks have occurred increasingly often in recent years. Lecanicillium attenuatum is an important entomopathogenic fungus that can parasitize and kill D. citri. We separated the fungus from corpses of D. citri larvae. However, the sound immune defense system of pests makes infection by an entomopathogenic fungus difficult. Here we used RNA sequencing technology (RNA-Seq) to build a transcriptome database for D. citri and performed digital gene expression profiling to screen genes that act in the immune defense of D. citri larvae infected with a pathogenic fungus. De novo assembly generated 84,733 unigenes with mean length of 772 nt. All unigenes were searched against GO, Nr, Swiss-Prot, COG, and KEGG databases and a total of 28,190 (33.3%) unigenes were annotated. We identified 129 immunity-related unigenes in transcriptome database that were related to pattern recognition receptors, information transduction factors and response factors. From the digital gene expression profile, we identified 441 unigenes that were differentially expressed in D. citri infected with L. attenuatum. Through calculated Log2Ratio values, we identified genes for which fold changes in expression were obvious, including cuticle protein, vitellogenin, cathepsin, prophenoloxidase, clip-domain serine protease, lysozyme, and others. Subsequent quantitative real-time polymerase chain reaction analysis verified the results. The identified genes may serve as target genes for microbial control of D. citri.
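The Log2Ratio screen described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' pipeline: the pseudocount, the threshold of 1.0 (a two-fold change) and the unigene labels are all hypothetical, and a real digital gene expression analysis would also apply a significance test.

```python
from math import log2


def log2_ratio(treated, control, pseudocount=1.0):
    """log2 fold change of treated over control; the pseudocount avoids division by zero."""
    return log2((treated + pseudocount) / (control + pseudocount))


def select_changed(expression, threshold=1.0):
    """Keep genes whose absolute log2 ratio meets the threshold.

    expression maps a gene label to a (control, treated) pair of normalized counts.
    """
    changed = {}
    for gene, (control, treated) in expression.items():
        ratio = log2_ratio(treated, control)
        if abs(ratio) >= threshold:
            changed[gene] = ratio
    return changed


# Hypothetical normalized counts: (uninfected, infected).
counts = {"unigene_1": (1.0, 7.0), "unigene_2": (3.0, 3.0), "unigene_3": (7.0, 1.0)}
print(select_changed(counts))
```

Positive values indicate higher expression in infected larvae and negative values indicate lower expression.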
Gene activated by growth factors is related to the oncogene v-jun

International Nuclear Information System (INIS)

Ryder, K.; Lau, L.F.; Nathans, D.

1988-01-01

The authors have recently identified by cDNA cloning a set of genes that are rapidly activated in cultured mouse cells by protein growth factors. Here they report that the nucleotide sequence of a cDNA (clone 465) derived from one of these immediate early genes (hereafter called jun-B) encodes a protein homologous to that encoded by the avian sarcoma virus 17 oncogene v-jun. Homology between the jun-B and v-jun proteins is in two regions: one near the N terminus and the other at the C terminus. The latter sequence was shown to have regions of sequence similarity to the DNA-binding domain of the yeast transcriptional regulatory protein GCN4 and to the oncogenic protein fos. Southern blots of human, mouse, and chicken DNA demonstrate that jun-B and c-jun are different genes and that there may be other vertebrate genes related to jun-B and c-jun. These findings suggest that there is a jun family of genes encoding related transcriptional regulatory proteins. The jun-B protein, and perhaps other members of the jun family, may play a role in regulating the genomic response to growth factors
Leveraging Comparative Genomics to Identify and Functionally Characterize Genes Associated with Sperm Phenotypes in Python bivittatus (Burmese Python)

Directory of Open Access Journals (Sweden)

Kristopher J. L. Irizarry

2016-01-01

Full Text Available Comparative genomics approaches provide a means of leveraging functional genomics information from a highly annotated model organism’s genome (such as the mouse genome) in order to make physiological inferences about the role of genes and proteins in a less characterized organism’s genome (such as the Burmese python). We employed a comparative genomics approach to produce the functional annotation of Python bivittatus genes encoding proteins associated with sperm phenotypes. We identify 129 gene-phenotype relationships in the python which are implicated in 10 specific sperm phenotypes. Results obtained through our systematic analysis identified subsets of python genes exhibiting associations with gene ontology annotation terms. Functional annotation data was represented in a semantic scatter plot. Together, these newly annotated Python bivittatus genome resources provide a high resolution framework from which the biology relating to reptile spermatogenesis, fertility, and reproduction can be further investigated. Applications of our research include (1) production of genetic diagnostics for assessing fertility in domestic and wild reptiles; (2) enhanced assisted reproduction technology for endangered and captive reptiles; and (3) novel molecular targets for biotechnology-based approaches aimed at reducing fertility and reproduction of invasive reptiles. Additional enhancements to reptile genomic resources will further enhance their value.
Identification of pathogenic genes and upstream regulators in age-related macular degeneration.

Science.gov (United States)

Zhao, Bin; Wang, Mengya; Xu, Jing; Li, Min; Yu, Yuhui

2017-06-26

Age-related macular degeneration (AMD) is the leading cause of irreversible blindness in older individuals. Our study aims to identify the key genes and upstream regulators in AMD. To screen pathogenic genes of AMD, an integrated analysis was performed by using the microarray datasets in AMD derived from the Gene Expression Omnibus (GEO) database. The functional annotation and potential pathways of differentially expressed genes (DEGs) were further discovered by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis. We constructed the AMD-specific transcriptional regulatory network to find the crucial transcriptional factors (TFs) which target the DEGs in AMD. Quantitative real time polymerase chain reaction (qRT-PCR) was performed to verify the DEGs and TFs obtained by integrated analysis. From two GEO datasets obtained, we identified 1280 DEGs (730 up-regulated and 550 down-regulated genes) between AMD and normal control (NC). In the KEGG analysis, steroid biosynthesis was a significantly enriched pathway for DEGs. The expression of 8 genes (TNC, GRP, TRAF6, ADAMTS5, GPX3, FAP, DHCR7 and FDFT1) was detected. Except for TNC and GPX3, the other 6 genes showed the same expression pattern in qRT-PCR as in our integrated analysis. The dysregulation of these eight genes may be involved in the process of AMD. Two crucial transcription factors (c-rel and myogenin) were concluded to play a role in AMD. In particular, myogenin was associated with AMD by regulating TNC, GRP and FAP. Our findings can contribute to developing new potential biomarkers, revealing the underlying pathogenesis, and identifying new therapeutic targets for AMD.

Transcriptional profiling of whole blood identifies a unique 5-gene signature for myelofibrosis and imminent myelofibrosis transformation.

Directory of Open Access Journals (Sweden)

Hans Carl Hasselbalch

Full Text Available Identifying a distinct gene signature for myelofibrosis may yield novel information of the genes, which are responsible for progression of essential thrombocythemia and polycythemia vera towards myelofibrosis. We aimed at identifying a simple gene signature - composed of a few genes - which were selectively and highly deregulated in myelofibrosis patients. Gene expression microarray studies have been performed on whole blood from 69 patients with myeloproliferative neoplasms. Amongst the top-20 of the most upregulated genes in PMF compared to controls, we identified 5 genes (DEFA4, ELA2, OLFM4, CTSG, and AZU1), which were highly significantly deregulated in PMF only. None of these genes were significantly regulated in ET and PV patients. However, hierarchical cluster analysis showed that these genes were also highly expressed in a subset of patients with ET (n = 1) and PV (n = 4) transforming towards myelofibrosis and/or being featured by an aggressive phenotype. We have identified a simple 5-gene signature, which is uniquely and highly significantly deregulated in patients in transitional stages of ET and PV towards myelofibrosis and in patients with PMF only. Some of these genes are considered to be responsible for the derangement of bone marrow stroma in myelofibrosis. Accordingly, this gene-signature may reflect key processes in the pathogenesis and pathophysiology of myelofibrosis development.
New Mutation Identified in the SRY Gene High Mobility Group (HMG)

Directory of Open Access Journals (Sweden)

Feride İffet Şahin

2013-06-01

Full Text Available Mutations in the SRY gene prevent the differentiation of the fetal gonads into testes and cause a female phenotype to develop; as a result, sex reversal and pure gonadal dysgenesis (Swyer syndrome) can occur. Different types of mutations identified in the SRY gene are responsible for 15% of gonadal dysgenesis cases. In this study, we report a new mutation (R132P) in the High Mobility Group (HMG) region of the SRY gene, detected in a patient with primary amenorrhea who has a 46,XY karyotype. This mutation leads to replacement of the polar and basic arginine with a nonpolar hydrophobic proline residue at amino acid 132 in the nuclear localization signal region of the protein. With this case report we want to emphasize the genetic approach to patients with gonadal dysgenesis. If a Y chromosome is detected during cytogenetic analysis, revealing the presence of the SRY gene and identifying mutations in this gene by sequencing analysis become important.
A Shortest-Path-Based Method for the Analysis and Prediction of Fruit-Related Genes in Arabidopsis thaliana.

Science.gov (United States)

Zhu, Liucun; Zhang, Yu-Hang; Su, Fangchu; Chen, Lei; Huang, Tao; Cai, Yu-Dong

2016-01-01

Biologically, fruits are defined as seed-bearing reproductive structures in angiosperms that develop from the ovary. The fertilization, development and maturation of fruits are crucial for plant reproduction and are precisely regulated by intrinsic genetic regulatory factors. In this study, we used Arabidopsis thaliana as a model organism and attempted to identify novel genes related to fruit-associated biological processes. Specifically, using validated genes, we applied a shortest-path-based method to identify several novel genes in a large network constructed using the protein-protein interactions observed in Arabidopsis thaliana. The described analyses indicate that several of the discovered genes are associated with fruit fertilization, development and maturation in Arabidopsis thaliana.
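The idea of a shortest-path-based search can be sketched with a breadth-first search over an unweighted interaction network: genes lying on shortest paths between pairs of validated genes become candidates. This is an assumption-laden illustration, not the published method. It follows one shortest path per pair rather than all of them, ignores interaction confidence scores, and uses made-up gene labels.

```python
from collections import deque
from itertools import combinations


def shortest_path(graph, start, goal):
    """Return one shortest path from start to goal as a list, or None if unreachable."""
    if start == goal:
        return [start]
    parents = {start: None}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for neighbor in graph.get(node, ()):
            if neighbor not in parents:
                parents[neighbor] = node
                if neighbor == goal:
                    # Walk back through the parent links to rebuild the path.
                    path = [goal]
                    while parents[path[-1]] is not None:
                        path.append(parents[path[-1]])
                    return path[::-1]
                queue.append(neighbor)
    return None


def candidate_genes(graph, seeds):
    """Count how often each non-seed gene lies inside a shortest path between two seeds."""
    counts = {}
    for a, b in combinations(sorted(seeds), 2):
        path = shortest_path(graph, a, b)
        if path is None:
            continue
        for gene in path[1:-1]:
            if gene not in seeds:
                counts[gene] = counts.get(gene, 0) + 1
    return counts


# Hypothetical interaction network given as adjacency lists.
network = {
    "A": ["X"],
    "B": ["X"],
    "C": ["X"],
    "X": ["A", "B", "C"],
    "D": [],
}
print(candidate_genes(network, {"A", "B", "C"}))
```

Genes with high counts sit between many validated genes in the network, which is the kind of evidence such methods use to rank candidates.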
Selection on plant male function genes identifies candidates for reproductive isolation of yellow monkeyflowers.

Directory of Open Access Journals (Sweden)

Jan E Aagaard

Full Text Available Understanding the genetic basis of reproductive isolation promises insight into speciation and the origins of biological diversity. While progress has been made in identifying genes underlying barriers to reproduction that function after fertilization (post-zygotic isolation), we know much less about earlier acting pre-zygotic barriers. Of particular interest are barriers involved in mating and fertilization that can evolve extremely rapidly under sexual selection, suggesting they may play a prominent role in the initial stages of reproductive isolation. A significant challenge to the field of speciation genetics is developing new approaches for identification of candidate genes underlying these barriers, particularly among non-traditional model systems. We employ powerful proteomic and genomic strategies to study the genetic basis of conspecific pollen precedence, an important component of pre-zygotic reproductive isolation among yellow monkeyflowers (Mimulus spp.) resulting from male pollen competition. We use isotopic labeling in combination with shotgun proteomics to identify more than 2,000 male function (pollen tube) proteins within maternal reproductive structures (styles) of M. guttatus flowers where pollen competition occurs. We then sequence array-captured pollen tube exomes from a large outcrossing population of M. guttatus, and identify those genes with evidence of selective sweeps or balancing selection consistent with their role in pollen competition. We also test for evidence of positive selection on these genes more broadly across yellow monkeyflowers, because a signal of adaptive divergence is a common feature of genes causing reproductive isolation. Together the molecular evolution studies identify 159 pollen tube proteins that are candidate genes for conspecific pollen precedence. Our work demonstrates how powerful proteomic and genomic tools can be readily adapted to non-traditional model systems, allowing for genome-wide screens
Comparative transcriptional profiling of the axolotl limb identifies a tripartite regeneration-specific gene program.

Directory of Open Access Journals (Sweden)

Dunja Knapp

Full Text Available Understanding how the limb blastema is established after the initial wound healing response is an important aspect of regeneration research. Here we performed parallel expression profile time courses of healing lateral wounds versus amputated limbs in axolotl. This comparison between wound healing and regeneration allowed us to identify amputation-specific genes. By clustering the expression profiles of these samples, we could detect three distinguishable phases of gene expression - early wound healing followed by a transition-phase leading to establishment of the limb development program, which correspond to the three phases of limb regeneration that had been defined by morphological criteria. By focusing on the transition-phase, we identified 93 strictly amputation-associated genes, many of which are implicated in oxidative-stress response, chromatin modification, epithelial development or limb development. We further classified the genes based on whether they were or were not significantly expressed in the developing limb bud. The specific localization of 53 selected candidates within the blastema was investigated by in situ hybridization. In summary, we identified a set of genes that are expressed specifically during regeneration and are, therefore, likely candidates for the regulation of blastema formation.
Glycosylation-related gene expression in HT29-MTX-E12 cells upon infection by Helicobacter pylori.

Science.gov (United States)

Cairns, Michael T; Gupta, Ananya; Naughton, Julie A; Kane, Marian; Clyne, Marguerite; Joshi, Lokesh

2017-10-07

To identify glycosylation-related genes in the HT29 derivative cell line, HT29-MTX-E12, showing differential expression on infection with Helicobacter pylori (H. pylori). Polarised HT29-MTX-E12 cells were infected for 24 h with H. pylori strain 26695. After infection RNA was isolated from both infected and non-infected host cells. Sufficient infections were carried out to provide triplicate samples for microarray analysis and for qRT-PCR analysis. RNA was isolated and hybridised to Affymetrix arrays. Analysis of microarray data identified genes significantly differentially expressed upon infection. Genes were grouped into gene ontology functional categories. Selected genes associated with host glycan structure (glycosyltransferases, hydrolases, lectins, mucins) were validated by real-time qRT-PCR analysis. Infection of host cells was confirmed by the isolation of live bacteria after 24 h incubation and by PCR amplification of bacteria-specific genes from the host cell RNA. H. pylori do not survive incubation under the adopted culture conditions unless they associate with the adherent mucus layer of the host cell. Microarray analysis identified a total of 276 genes that were significantly differentially expressed (P < 0.05) upon H. pylori infection and where the fold change in expression was greater than 2. Six of these genes are involved in glycosylation-related processes. Real-time qRT-PCR demonstrated significant downregulation (1.8-fold, P < 0.05) of the mucin MUC20. REG4 was heavily expressed and significantly downregulated (3.1-fold, P < 0.05) upon infection. Gene ontology analysis was consistent with previous studies on H. pylori infection. Gene expression data suggest that infection with H. pylori causes a decrease in glycan synthesis, resulting in shorter and simpler glycan structures.
Validation of commonly used reference genes for sleep-related gene expression studies

Directory of Open Access Journals (Sweden)

Castro Rosa MRPS

2009-05-01

Full Text Available Abstract Background Sleep is a restorative process and is essential for maintenance of mental and physical health. In an attempt to understand the complexity of sleep, multidisciplinary strategies, including genetic approaches, have been applied to sleep research. Although quantitative real time PCR has been used in previous sleep-related gene expression studies, proper validation of reference genes is currently lacking. Thus, we examined the effect of total or paradoxical sleep deprivation (TSD or PSD) on the expression stability of the following frequently used reference genes in brain and blood: beta-actin (b-actin), beta-2-microglobulin (B2M), glyceraldehyde-3-phosphate dehydrogenase (GAPDH), and hypoxanthine guanine phosphoribosyl transferase (HPRT). Results Neither TSD nor PSD affected the expression stability of any of the tested genes in either tissue, indicating that b-actin, B2M, GAPDH and HPRT are appropriate reference genes for sleep-related gene expression studies. In order to further verify these results, the relative expression of brain derived neurotrophic factor (BDNF) and glycerol-3-phosphate dehydrogenase 1 (GPD1) was evaluated in brain and blood, respectively. Normalization with each of the four reference genes produced a similar pattern of expression in control and sleep deprived rats, but subtle differences in the magnitude of expression fold change were observed which might affect the statistical significance. Conclusion This study demonstrated that sleep deprivation does not alter the expression stability of commonly used reference genes in brain and blood. Nonetheless, the use of multiple reference genes in quantitative RT-PCR is required for accurate results.
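Normalizing a target gene against several reference genes can be sketched with the widely used 2^-ΔΔCt calculation. The abstract does not state which quantification model was used, so this is only an illustration that assumes ideal amplification efficiency (a doubling per cycle). The Ct values are invented.

```python
def reference_ct(reference_cts):
    """Mean Ct across reference genes.

    Averaging Ct values is equivalent to taking the geometric mean of the
    underlying expression quantities, because Ct is on a log2 scale.
    """
    return sum(reference_cts) / len(reference_cts)


def fold_change(target_treated, refs_treated, target_control, refs_control):
    """Relative expression of the target gene, treated versus control (2^-ddCt)."""
    delta_treated = target_treated - reference_ct(refs_treated)
    delta_control = target_control - reference_ct(refs_control)
    return 2.0 ** -(delta_treated - delta_control)


# Hypothetical Ct values: one target gene and two reference genes per group.
print(fold_change(24.0, [18.0, 20.0], 25.0, [18.0, 20.0]))
```

Using the mean of several reference genes makes the result less sensitive to a shift in any single one, which matches the abstract's recommendation to use multiple reference genes.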
Current Status and Challenges in Identifying Disease Resistance Genes in Brassica napus

Directory of Open Access Journals (Sweden)

Ting Xiang Neik

2017-11-01

Full Text Available Brassica napus is an economically important crop across different continents including temperate and subtropical regions in Europe, Canada, South Asia, China and Australia. Its widespread cultivation also brings setbacks as it plays host to fungal, oomycete and chytrid pathogens that can lead to serious yield loss. For sustainable crop production, identification of resistance (R) genes in B. napus has become of critical importance. In this review, we discuss four key pathogens affecting Brassica crops: Clubroot (Plasmodiophora brassicae), Blackleg (Leptosphaeria maculans and L. biglobosa), Sclerotinia Stem Rot (Sclerotinia sclerotiorum), and Downy Mildew (Hyaloperonospora parasitica). We first review current studies covering prevalence of these pathogens on Brassica crops and highlight the R genes and QTL that have been identified from Brassica species against these pathogens. Insights into the relationships between the pathogen and its Brassica host, the unique host resistance mechanisms and how these affect resistance outcomes is also presented. We discuss challenges in identification and deployment of R genes in B. napus in relation to highly specific genetic interactions between host subpopulations and pathogen pathotypes and emphasize the need for common or shared techniques and research materials or tighter collaboration between researchers to reconcile the inconsistencies in the research outcomes. Using current genomics tools, we provide examples of how characterization and cloning of R genes in B. napus can be carried out more effectively. Lastly, we put forward strategies to breed resistant cultivars through introgressions supported by genomic approaches and suggest prospects that can be implemented in the future for a better, pathogen-resistant B. napus.
Current Status and Challenges in Identifying Disease Resistance Genes in Brassica napus

Science.gov (United States)

Neik, Ting Xiang; Barbetti, Martin J.; Batley, Jacqueline

2017-01-01

Brassica napus is an economically important crop across different continents including temperate and subtropical regions in Europe, Canada, South Asia, China and Australia. Its widespread cultivation also brings setbacks as it plays host to fungal, oomycete and chytrid pathogens that can lead to serious yield loss. For sustainable crop production, identification of resistance (R) genes in B. napus has become of critical importance. In this review, we discuss four key pathogens affecting Brassica crops: Clubroot (Plasmodiophora brassicae), Blackleg (Leptosphaeria maculans and L. biglobosa), Sclerotinia Stem Rot (Sclerotinia sclerotiorum), and Downy Mildew (Hyaloperonospora parasitica). We first review current studies covering prevalence of these pathogens on Brassica crops and highlight the R genes and QTL that have been identified from Brassica species against these pathogens. Insights into the relationships between the pathogen and its Brassica host, the unique host resistance mechanisms and how these affect resistance outcomes is also presented. We discuss challenges in identification and deployment of R genes in B. napus in relation to highly specific genetic interactions between host subpopulations and pathogen pathotypes and emphasize the need for common or shared techniques and research materials or tighter collaboration between researchers to reconcile the inconsistencies in the research outcomes. Using current genomics tools, we provide examples of how characterization and cloning of R genes in B. napus can be carried out more effectively. Lastly, we put forward strategies to breed resistant cultivars through introgressions supported by genomic approaches and suggest prospects that can be implemented in the future for a better, pathogen-resistant B. napus. PMID:29163558
Cloning arbuscule-related genes from mycorrhizas

DEFF Research Database (Denmark)

Burleigh, Stephen

2000-01-01

Until recently little was known about the identity of the genes expressed in the arbuscules of mycorrhizas, due in part to problems associated with cloning genes from the tissues of an obligate symbiont. However, the combination of advanced molecular techniques, innovative use of the materials...... available and fortuitous cloning has resulted in the recent identification of a number of arbuscule-related genes. This article provides a brief summary of the genes involved in arbuscule development, function and regulation, and the techniques used to study them. Molecular techniques include differential...
Calcitonin gene-related peptide and pain

DEFF Research Database (Denmark)

Schou, Wendy Sophie; Ashina, Sait; Amin, Faisal Mohammad

2017-01-01

BACKGROUND: Calcitonin gene-related peptide (CGRP) is widely distributed in nociceptive pathways in human peripheral and central nervous system and its receptors are also expressed in pain pathways. CGRP is involved in migraine pathophysiology but its role in non-headache pain has not been... clarified. METHODS: We performed a systematic literature search on PubMed, Embase and ClinicalTrials.gov for articles on CGRP and non-headache pain covering human studies including experimental studies and randomized clinical trials. RESULTS: The literature search identified 375 citations of which 50... and cerebrospinal fluid in subjects with musculoskeletal pain. A randomized clinical trial on monoclonal antibody, which selectively binds to and inhibits the activity of CGRP (galcanezumab) in patients with osteoarthritis knee pain, failed to demonstrate improvement of pain compared with placebo. No studies...
Gene expression markers of age-related inflammation in two human cohorts.

Science.gov (United States)

Pilling, Luke C; Joehanes, Roby; Melzer, David; Harries, Lorna W; Henley, William; Dupuis, Josée; Lin, Honghuang; Mitchell, Marcus; Hernandez, Dena; Ying, Sai-Xia; Lunetta, Kathryn L; Benjamin, Emelia J; Singleton, Andrew; Levy, Daniel; Munson, Peter; Murabito, Joanne M; Ferrucci, Luigi

2015-10-01

Chronically elevated circulating inflammatory markers are common in older persons but mechanisms are unclear. Many blood transcripts (>800 genes) are associated with interleukin-6 protein levels (IL6) independent of age. We aimed to identify gene transcripts statistically mediating, as drivers or responders, the increasing levels of IL6 protein in blood at older ages. Blood derived in-vivo RNA from the Framingham Heart Study (FHS, n=2422, ages 40-92 yrs) and InCHIANTI study (n=694, ages 30-104 yrs), with Affymetrix and Illumina expression arrays respectively (>17,000 genes tested), were tested for statistical mediation of the age-IL6 association using resampling techniques, adjusted for confounders and multiple testing. In FHS, IL6 expression was not associated with IL6 protein levels in blood. 102 genes (0.6% of 17,324 expressed) statistically mediated the age-IL6 association of which 25 replicated in InCHIANTI (including 5 of the 10 largest effect genes). The largest effect gene (SLC4A10, coding for NCBE, a sodium bicarbonate transporter) mediated 19% (adjusted CI 8.9 to 34.1%) and replicated by PCR in InCHIANTI (n=194, 35.6% mediated, p=0.01). Other replicated mediators included PRF1 (perforin, a cytolytic protein in cytotoxic T lymphocytes and NK cells) and IL1B (Interleukin 1 beta): few other cytokines were significant mediators. This transcriptome-wide study on human blood identified a small distinct set of genes that statistically mediate the age-IL6 association. Findings are robust across two cohorts and different expression technologies. Raised IL6 levels may not derive from circulating white cells in age related inflammation. Published by Elsevier Inc.
Genome-wide gene expression dataset used to identify potential therapeutic targets in androgenetic alopecia

Directory of Open Access Journals (Sweden)

R. Dey-Rao

2017-08-01

The microarray dataset attached to this report is related to the research article with the title: “A genomic approach to susceptibility and pathogenesis leads to identifying potential novel therapeutic targets in androgenetic alopecia” (Dey-Rao and Sinha, 2017 [1]). Male-pattern hair loss that is induced by androgens (testosterone) in genetically predisposed individuals is known as androgenetic alopecia (AGA). The raw dataset is being made publicly available to enable critical and/or extended analyses. Our related research paper utilizes the attached raw dataset for genome-wide gene-expression associated investigations. Combined with several in silico bioinformatics-based analyses we were able to delineate five strategic molecular elements as potential novel targets towards future AGA-therapy.
Amygdala-enriched genes identified by microarray technology are restricted to specific amygdaloid subnuclei

OpenAIRE

Zirlinger, M.; Kreiman, Gabriel; Anderson, D. J.

2001-01-01

Microarray technology represents a potentially powerful method for identifying cell type- and regionally restricted genes expressed in the brain. Here we have combined a microarray analysis of differential gene expression among five selected brain regions, including the amygdala, cerebellum, hippocampus, olfactory bulb, and periaqueductal gray, with in situ hybridization. On average, 0.3% of the 34,000 genes interrogated were highly enriched in each of the five regions...
Back to the sea twice: identifying candidate plant genes for molecular evolution to marine life

Directory of Open Access Journals (Sweden)

Reusch Thorsten BH

2011-01-01

Background: Seagrasses are a polyphyletic group of monocotyledonous angiosperms that have adapted to a completely submerged lifestyle in marine waters. Here, we exploit two collections of expressed sequence tags (ESTs) of two wide-spread and ecologically important seagrass species, the Mediterranean seagrass Posidonia oceanica (L.) Delile and the eelgrass Zostera marina L., which have independently evolved from aquatic ancestors. This replicated, yet independent evolutionary history facilitates the identification of traits that may have evolved in parallel and are possible instrumental candidates for adaptation to a marine habitat. Results: In our study, we provide the first quantitative perspective on molecular adaptations in two seagrass species. By constructing orthologous gene clusters shared between two seagrasses (Z. marina and P. oceanica) and eight distantly related terrestrial angiosperm species, 51 genes could be identified with detection of positive selection along the seagrass branches of the phylogenetic tree. Characterization of these positively selected genes using KEGG pathways and the Gene Ontology uncovered that these genes are mostly involved in translation, metabolism, and photosynthesis. Conclusions: These results provide first insights into which seagrass genes have diverged from their terrestrial counterparts via an initial aquatic stage characteristic of the order and to the derived fully-marine stage characteristic of seagrasses. We discuss how adaptive changes in these processes may have contributed to the evolution towards an aquatic and marine existence.
Back to the sea twice: identifying candidate plant genes for molecular evolution to marine life.

Science.gov (United States)

Wissler, Lothar; Codoñer, Francisco M; Gu, Jenny; Reusch, Thorsten B H; Olsen, Jeanine L; Procaccini, Gabriele; Bornberg-Bauer, Erich

2011-01-12

Seagrasses are a polyphyletic group of monocotyledonous angiosperms that have adapted to a completely submerged lifestyle in marine waters. Here, we exploit two collections of expressed sequence tags (ESTs) of two wide-spread and ecologically important seagrass species, the Mediterranean seagrass Posidonia oceanica (L.) Delile and the eelgrass Zostera marina L., which have independently evolved from aquatic ancestors. This replicated, yet independent evolutionary history facilitates the identification of traits that may have evolved in parallel and are possible instrumental candidates for adaptation to a marine habitat. In our study, we provide the first quantitative perspective on molecular adaptations in two seagrass species. By constructing orthologous gene clusters shared between two seagrasses (Z. marina and P. oceanica) and eight distantly related terrestrial angiosperm species, 51 genes could be identified with detection of positive selection along the seagrass branches of the phylogenetic tree. Characterization of these positively selected genes using KEGG pathways and the Gene Ontology uncovered that these genes are mostly involved in translation, metabolism, and photosynthesis. These results provide first insights into which seagrass genes have diverged from their terrestrial counterparts via an initial aquatic stage characteristic of the order and to the derived fully-marine stage characteristic of seagrasses. We discuss how adaptive changes in these processes may have contributed to the evolution towards an aquatic and marine existence.
X-exome sequencing of 405 unresolved families identifies seven novel intellectual disability genes

NARCIS (Netherlands)

Hu, H; Haas, S.A.; Chelly, J.; Esch, H. Van; Raynaud, M.; Brouwer, A.P. de; Weinert, S.; Froyen, G.; Frints, S.G.; Laumonnier, F.; Zemojtel, T.; Love, M.I.; Richard, H.; Emde, A.K.; Bienek, M.; Jensen, C.; Hambrock, M.; Fischer, U.; Langnick, C.; Feldkamp, M.; Wissink-Lindhout, W.; Lebrun, N.; Castelnau, L.; Rucci, J.; Montjean, R.; Dorseuil, O.; Billuart, P.; Stuhlmann, T.; Shaw, M.; Corbett, M.A.; Gardner, A.; Willis-Owen, S.; Tan, C.; Friend, K.L.; Belet, S.; Roozendaal, K.E. van; Jimenez-Pocquet, M.; Moizard, M.P.; Ronce, N.; Sun, R.; O'Keeffe, S.; Chenna, R.; Bommel, A. van; Goke, J.; Hackett, A.; Field, M.; Christie, L.; Boyle, J.; Haan, E.; Nelson, J.; Turner, G.; Baynam, G.; Gillessen-Kaesbach, G.; Muller, U.; Steinberger, D.; Budny, B.; Badura-Stronka, M.; Latos-Bielenska, A.; Ousager, L.B.; Wieacker, P.; Rodriguez Criado, G.; Bondeson, M.L.; Anneren, G.; Dufke, A.; Cohen, M.; Maldergem, L. Van; Vincent-Delorme, C.; Echenne, B.; Simon-Bouy, B.; Kleefstra, T.; Willemsen, M.H.; Fryns, J.P.; Devriendt, K.; Ullmann, R.; Vingron, M.; Wrogemann, K.; Wienker, T.F.; Tzschach, A.; Bokhoven, H. van; Gecz, J.; Jentsch, T.J.; Chen, W.; Ropers, H.H.; Kalscheuer, V.M.

2016-01-01

X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes or
X-exome sequencing of 405 unresolved families identifies seven novel intellectual disability genes

DEFF Research Database (Denmark)

Hu, H; Haas, S A; Chelly, J

2016-01-01

X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes...
Calcitonin gene-related peptide and calcitonin in man

International Nuclear Information System (INIS)

Fischer, J.A.; Henke, H.; Petermann, J.B.; Tschopp, F.A.

1985-01-01

Calcitonin gene-related peptide has been identified in the human brain, spinal cord, pituitary and thyroid glands as assessed by RIA and RRA. An immunoreactive and receptoractive peak coeluting with synthetic hCGRP on gel permeation chromatography and HPLC has been recognized. The levels measured by RRA are generally higher than those by RIA. Different characteristics of hCGRP and sCT binding sites and the distinct regional distribution evaluated with membranes and receptor autoradiography indicate separate receptors of the two peptides. Our results suggest different physiological roles of CGRP and CT in the central nervous system which remain to be discovered. (Auth.)
Construction of an integrated gene regulatory network link to stress-related immune system in cattle.

Science.gov (United States)

Behdani, Elham; Bakhtiarizadeh, Mohammad Reza

2017-10-01

The immune system is an important biological system that is negatively impacted by stress. This study constructed an integrated regulatory network to enhance our understanding of the regulatory gene network used in the stress-related immune system. Module inference was used to construct modules of co-expressed genes with bovine leukocyte RNA-Seq data. Transcription factors (TFs) were then assigned to these modules using Lemon-Tree algorithms. In addition, the TFs assigned to each module were confirmed using the promoter analysis and protein-protein interactions data. Therefore, our integrated method identified three TFs which include one TF that is previously known to be involved in immune response (MYBL2) and two TFs (E2F8 and FOXS1) that had not been recognized previously and were identified for the first time in this study as novel regulatory candidates in immune response. This study provides valuable insights on the regulatory programs of genes involved in the stress-related immune system.

Discovery of genes related to insecticide resistance in Bactrocera dorsalis by functional genomic analysis of a de novo assembled transcriptome.

Science.gov (United States)

Hsu, Ju-Chun; Chien, Ting-Ying; Hu, Chia-Cheng; Chen, Mei-Ju May; Wu, Wen-Jer; Feng, Hai-Tung; Haymer, David S; Chen, Chien-Yu

2012-01-01

Insecticide resistance has recently become a critical concern for control of many insect pest species. Genome sequencing and global quantization of gene expression through analysis of the transcriptome can provide useful information relevant to this challenging problem. The oriental fruit fly, Bactrocera dorsalis, is one of the world's most destructive agricultural pests, and recently it has been used as a target for studies of genetic mechanisms related to insecticide resistance. However, prior to this study, the molecular data available for this species was largely limited to genes identified through homology. To provide a broader pool of gene sequences of potential interest with regard to insecticide resistance, this study uses whole transcriptome analysis developed through de novo assembly of short reads generated by next-generation sequencing (NGS). The transcriptome of B. dorsalis was initially constructed using Illumina's Solexa sequencing technology. Qualified reads were assembled into contigs and potential splicing variants (isotigs). A total of 29,067 isotigs have putative homologues in the non-redundant (nr) protein database from NCBI, and 11,073 of these correspond to distinct D. melanogaster proteins in the RefSeq database. Approximately 5,546 isotigs contain coding sequences that are at least 80% complete and appear to represent B. dorsalis genes. We observed a strong correlation between the completeness of the assembled sequences and the expression intensity of the transcripts. The assembled sequences were also used to identify large numbers of genes potentially belonging to families related to insecticide resistance. A total of 90 P450-, 42 GST- and 37 COE-related genes, representing three major enzyme families involved in insecticide metabolism and resistance, were identified. In addition, 36 isotigs were discovered to contain target site sequences related to four classes of resistance genes. Identified sequence motifs were also analyzed to
Two novel antimicrobial defensins from rice identified by gene coexpression network analyses.

Science.gov (United States)

Tantong, Supaluk; Pringsulaka, Onanong; Weerawanich, Kamonwan; Meeprasert, Arthitaya; Rungrotmongkol, Thanyada; Sarnthima, Rakrudee; Roytrakul, Sittiruk; Sirikantaramas, Supaart

2016-10-01

Defensins form an antimicrobial peptides (AMP) family, and have been widely studied in various plants because of their considerable inhibitory functions. However, their roles in rice (Oryza sativa L.) have not been characterized, even though rice is one of the most important staple crops that is susceptible to damaging infections. Additionally, a previous study identified 598 rice genes encoding cysteine-rich peptides, suggesting there are several uncharacterized AMPs in rice. We performed in silico gene expression and coexpression network analyses of all genes encoding defensin and defensin-like peptides, and determined that OsDEF7 and OsDEF8 are coexpressed with pathogen-responsive genes. Recombinant OsDEF7 and OsDEF8 could form homodimers. They inhibited the growth of the bacteria Xanthomonas oryzae pv. oryzae, X. oryzae pv. oryzicola, and Erwinia carotovora subsp. atroseptica with minimum inhibitory concentration (MIC) ranging from 0.6 to 63μg/mL. However, these OsDEFs are weakly active against the phytopathogenic fungi Helminthosporium oryzae and Fusarium oxysporum f.sp. cubense. This study describes a useful method for identifying potential plant AMPs with biological activities. Copyright © 2016 Elsevier Inc. All rights reserved.
Comparative analysis of the full genome of Helicobacter pylori isolate Sahul64 identifies genes of high divergence.

Science.gov (United States)

Lu, Wei; Wise, Michael J; Tay, Chin Yen; Windsor, Helen M; Marshall, Barry J; Peacock, Christopher; Perkins, Tim

2014-03-01

Isolates of Helicobacter pylori can be classified phylogeographically. High genetic diversity and rapid microevolution are a hallmark of H. pylori genomes, a phenomenon that is proposed to play a functional role in persistence and colonization of diverse human populations. To provide further genomic evidence in the lineage of H. pylori and to further characterize diverse strains of this pathogen in different human populations, we report the finished genome sequence of Sahul64, an H. pylori strain isolated from an indigenous Australian. Our analysis identified genes that were highly divergent compared to the 38 publicly available genomes, which include genes involved in the biosynthesis and modification of lipopolysaccharide, putative prophage genes, restriction modification components, and hypothetical genes. Furthermore, the virulence-associated vacA locus is a pseudogene and the cag pathogenicity island (cagPAI) is not present. However, the genome does contain a gene cluster associated with pathogenicity, including dupA. Our analysis found that with the addition of Sahul64 to the 38 genomes, the core genome content of H. pylori is reduced by approximately 14% (∼170 genes) and the pan-genome has expanded from 2,070 to 2,238 genes. We have identified three putative horizontally acquired regions, including one that is likely to have been acquired from the closely related Helicobacter cetorum prior to speciation. Our results suggest that Sahul64, with the absence of cagPAI, highly divergent cell envelope proteins, and a predicted nontransportable VacA protein, could be more highly adapted to ancient indigenous Australian people but with lower virulence potential compared to other sequenced and cagPAI-positive H. pylori strains.
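The core-genome and pan-genome figures quoted above follow from simple set operations once each genome is reduced to a set of gene-family identifiers. The sketch below is a toy illustration under that assumption; the strain names and gene identifiers are hypothetical, and real analyses first need an ortholog-clustering step that is not shown.

```python
def core_and_pan_genome(genomes):
    """Return (core, pan) for a mapping of strain name -> set of gene families.

    The core genome is the intersection of all gene sets (genes present in
    every strain); the pan-genome is their union (genes present in any strain).
    """
    gene_sets = [set(genes) for genes in genomes.values()]
    core = set.intersection(*gene_sets)
    pan = set.union(*gene_sets)
    return core, pan

# Hypothetical toy data: two reference strains, then a divergent new strain.
reference = {
    "strainA": {"g1", "g2", "g3", "g4"},
    "strainB": {"g1", "g2", "g3", "g5"},
}
core_before, pan_before = core_and_pan_genome(reference)

with_new = dict(reference, newStrain={"g1", "g2", "g6"})
core_after, pan_after = core_and_pan_genome(with_new)

# Adding a genome can only shrink (or preserve) the core and grow (or
# preserve) the pan-genome, which is the pattern the abstract reports.
core_lost = core_before - core_after
pan_gained = pan_after - pan_before
```

Because intersection and union are monotone in this way, a single highly divergent isolate can noticeably reduce the core genome while adding strain-specific genes to the pan-genome.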
Whole genome-wide transcript profiling to identify differentially expressed genes associated with seed field emergence in two soybean low phytate mutants.

Science.gov (United States)

Yuan, Fengjie; Yu, Xiaomin; Dong, Dekun; Yang, Qinghua; Fu, Xujun; Zhu, Shenlong; Zhu, Danhua

2017-01-18

Seed germination is important to soybean (Glycine max) growth and development, ultimately affecting soybean yield. A lower seed field emergence has been the main hindrance for breeding soybeans low in phytate. Although this reduction could be overcome by additional breeding and selection, the mechanisms of seed germination in different low phytate mutants remain unknown. In this study, we performed a comparative transcript analysis of two low phytate soybean mutants (TW-1 and TW-1-M), which have the same mutation, a 2 bp deletion in GmMIPS1, but show a significant difference in seed field emergence: that of TW-1-M was higher than that of TW-1. Numerous genes analyzed by RNA-Seq showed markedly different expression levels between TW-1-M and TW-1 mutants. Approximately 30,000-35,000 read-mapped genes and ~21,000-25,000 expressed genes were identified for each library. There were ~3,900-9,200 differentially expressed genes (DEGs) in each contrast library, and the number of up-regulated genes was similar to the number of down-regulated genes in the mutants TW-1 and TW-1-M. Gene ontology functional categories of DEGs indicated that the ethylene-mediated signaling pathway, the abscisic acid-mediated signaling pathway, response to hormone, ethylene biosynthetic process, ethylene metabolic process, regulation of hormone levels, oxidation-reduction process, regulation of flavonoid biosynthetic process and regulation of abscisic acid-activated signaling pathway had high correlations with seed germination. In total, 2,457 DEGs involved in the above functional categories were identified. Twenty-two genes with 20 biological functions were the most highly up/down-regulated (absolute value Log2FC >5) in the high field emergence mutant TW-1-M and were related to metabolic or signaling pathways. Fifty-seven genes with 36 biological functions had the greatest expression abundance (FRPM >100) in germination-related pathways. Seed germination in the soybean low phytate mutants is a very complex process
Metagenomes reveal microbial structures, functional potentials, and biofouling-related genes in a membrane bioreactor.

Science.gov (United States)

Ma, Jinxing; Wang, Zhiwei; Li, Huan; Park, Hee-Deung; Wu, Zhichao

2016-06-01

Metagenomic sequencing was used to investigate the microbial structures, functional potentials, and biofouling-related genes in a membrane bioreactor (MBR). The results showed that the microbial community in the MBR was highly diverse. Notably, function analysis of the dominant genera indicated that common genes from different phylotypes were identified for important functional potentials with the observation of variation of abundances of genes in a certain taxon (e.g., Dechloromonas). Despite maintaining similar metabolic functional potentials with a parallel full-scale conventional activated sludge (CAS) system due to treating the identical wastewater, the MBR had more abundant nitrification-related bacteria and coding genes of ammonia monooxygenase, which could well explain its excellent ammonia removal in the low-temperature period. Furthermore, according to quantification of the genes involved in exopolysaccharide and extracellular polymeric substance (EPS) protein metabolism, the MBR did not show a much different potential in producing EPS compared to the CAS system, and bacteria from the membrane biofilm had lower abundances of genes associated with EPS biosynthesis and transport compared to the activated sludge in the MBR.
Exome Sequencing and Linkage Analysis Identified Novel Candidate Genes in Recessive Intellectual Disability Associated with Ataxia.

Science.gov (United States)

Jazayeri, Roshanak; Hu, Hao; Fattahi, Zohreh; Musante, Luciana; Abedini, Seyedeh Sedigheh; Hosseini, Masoumeh; Wienker, Thomas F; Ropers, Hans Hilger; Najmabadi, Hossein; Kahrizi, Kimia

2015-10-01

Intellectual disability (ID) is a neuro-developmental disorder which causes considerable socio-economic problems. Some ID individuals are also affected by ataxia, and the condition includes different mutations affecting several genes. We used whole exome sequencing (WES) in combination with homozygosity mapping (HM) to identify the genetic defects in five consanguineous families among our cohort study, with two affected children with ID and ataxia as major clinical symptoms. We identified three novel candidate genes, RIPPLY1, MRPL10, SNX14, and a new mutation in known gene SURF1. All are autosomal genes, except RIPPLY1, which is located on the X chromosome. Two are housekeeping genes, implicated in transcription and translation regulation and intracellular trafficking, and two encode mitochondrial proteins. The pathogenesis of these variants was evaluated by mutation classification, bioinformatic methods, review of medical and biological relevance, co-segregation studies in the particular family, and a normal population study. Linkage analysis and exome sequencing of a small number of affected family members is a powerful new technique which can be used to decrease the number of candidate genes in heterogenic disorders such as ID, and may even identify the responsible gene(s).
High-Throughput Screening to Identify Regulators of Meiosis-Specific Gene Expression in Saccharomyces cerevisiae.

Science.gov (United States)

Kassir, Yona

2017-01-01

Meiosis and gamete formation are processes that are essential for sexual reproduction in all eukaryotic organisms. Multiple intracellular and extracellular signals feed into pathways that converge on transcription factors that induce the expression of meiosis-specific genes. Once triggered, the meiosis-specific gene expression program proceeds in a cascade that drives progress through the events of meiosis and gamete formation. Meiosis-specific gene expression is tightly controlled by a balance of positive and negative regulatory factors that respond to a plethora of signaling pathways. The budding yeast Saccharomyces cerevisiae has proven to be an outstanding model for the dissection of gametogenesis owing to the sophisticated genetic manipulations that can be performed with the cells. It is possible to use a variety of selection and screening methods to identify genes and their functions. High-throughput screening technology has been developed to allow an array of all viable yeast gene deletion mutants to be screened for phenotypes and for regulators of gene expression. This chapter describes a protocol that has been used to screen a library of homozygous diploid yeast deletion strains to identify regulators of the meiosis-specific IME1 gene.
Dose-related gene expression changes in forebrain following acute, low-level chlorpyrifos exposure in neonatal rats

International Nuclear Information System (INIS)

Ray, Anamika; Liu Jing; Ayoubi, Patricia; Pope, Carey

2010-01-01

Chlorpyrifos (CPF) is a widely used organophosphorus insecticide (OP) and putative developmental neurotoxicant in humans. The acute toxicity of CPF is elicited by acetylcholinesterase (AChE) inhibition. We characterized dose-related (0.1, 0.5, 1 and 2 mg/kg) gene expression profiles and changes in cell signaling pathways 24 h following acute CPF exposure in 7-day-old rats. Microarray experiments indicated that approximately 9% of the 44,000 genes were differentially expressed following either one of the four CPF dosages studied (546, 505, 522, and 3,066 genes with 0.1, 0.5, 1.0 and 2.0 mg/kg CPF). Genes were grouped according to dose-related expression patterns using K-means clustering while gene networks and canonical pathways were evaluated using Ingenuity Pathway Analysis®. Twenty clusters were identified and differential expression of selected genes was verified by RT-PCR. The four largest clusters (each containing from 276 to 905 genes) constituted over 50% of all differentially expressed genes and exhibited up-regulation following exposure to the highest dosage (2 mg/kg CPF). The total number of gene networks affected by CPF also rose sharply with the highest dosage of CPF (18, 16, 18 and 50 with 0.1, 0.5, 1 and 2 mg/kg CPF). Forebrain cholinesterase (ChE) activity was significantly reduced (26%) only in the highest dosage group. Based on magnitude of dose-related changes in differentially expressed genes, relative numbers of gene clusters and signaling networks affected, and forebrain ChE inhibition only at 2 mg/kg CPF, we focused subsequent analyses on this treatment group. Six canonical pathways were identified that were significantly affected by 2 mg/kg CPF (MAPK, oxidative stress, NFκB, mitochondrial dysfunction, arylhydrocarbon receptor and adrenergic receptor signaling). Evaluation of different cellular functions of the differentially expressed genes suggested changes related to olfactory receptors, cell adhesion/migration, synapse
Caste-, sex-, and age-dependent expression of immune-related genes in a Japanese subterranean termite, Reticulitermes speratus.

Directory of Open Access Journals (Sweden)

Yuki Mitaka

Insects protect themselves from microbial infections through innate immune responses, including pathogen recognition, phagocytosis, the activation of proteolytic cascades, and the synthesis of antimicrobial peptides. Termites, eusocial insects inhabiting microbe-rich wood, live in closely-related family groups that are susceptible to shared pathogen infections. To resist pathogenic infection, termite families have evolved diverse immune adaptations at both individual and societal levels, and a strategy of trade-offs between reproduction and immunity has been suggested. Although termite immune-inducible genes have been identified, few studies have investigated the differential expression of these genes between reproductive and neuter castes, and between sexes in each caste. In this study, we compared the expression levels of immune-related genes among castes, sexes, and ages in a Japanese subterranean termite, Reticulitermes speratus. Using RNA-seq, we found 197 immune-related genes, including 40 pattern recognition proteins, 97 signalling proteins, and 60 effectors. Among these genes, 174 showed differential expression among castes. Comparing expression levels between males and females in each caste, we found sexually dimorphic expression of immune-related genes not only in reproductive castes, but also in neuter castes. Moreover, we identified age-related differential expression of 162 genes in male and/or female reproductives. In addition, although R. speratus is known to use the antibacterial peptide C-type lysozyme as an egg recognition pheromone, we determined that R. speratus has not only C-type, but also P-type and I-type lysozymes, as do other termite species. Our transcriptomic analyses revealed immune response plasticity among all castes, and sex-biased expression of immune genes even in neuter castes, suggesting a sexual division of labor in the immune system of R. speratus. This study heightens the understanding of the evolution of
New ALS-Related Genes Expand the Spectrum Paradigm of Amyotrophic Lateral Sclerosis.

Science.gov (United States)

Sabatelli, Mario; Marangi, Giuseppe; Conte, Amelia; Tasca, Giorgio; Zollino, Marcella; Lattante, Serena

2016-03-01

Amyotrophic Lateral Sclerosis (ALS) is characterized by the degeneration of upper and lower motor neurons. Clinical heterogeneity is a well-recognized feature of the disease, as age of onset, site of onset and the duration of the disease can vary greatly among patients. A number of genes have been identified and associated with familial and sporadic forms of ALS, but the majority of cases remain unexplained. Recent breakthrough discoveries have demonstrated that clinical manifestations associated with ALS-related genes are not circumscribed to motor neuron involvement. In this view, ALS appears to be linked to different conditions over a continuum or spectrum in which overlapping phenotypes may be identified. In this review, we aim to examine the increasing number of spectra, including ALS/Frontotemporal Dementia and ALS/Myopathies spectra. Considering all these neurodegenerative disorders as different phenotypes of the same spectrum can help to identify common pathological pathways and consequently new therapeutic targets in these incurable diseases. © 2016 International Society of Neuropathology.
Integrative analysis of a cross-loci regulation network identifies App as a gene regulating insulin secretion from pancreatic islets.

Directory of Open Access Journals (Sweden)

Zhidong Tu

Complex diseases result from molecular changes induced by multiple genetic factors and the environment. To derive a systems view of how genetic loci interact in the context of tissue-specific molecular networks, we constructed an F2 intercross comprised of >500 mice from diabetes-resistant (B6) and diabetes-susceptible (BTBR) mouse strains made genetically obese by the Leptin(ob/ob) mutation (Lep(ob)). High-density genotypes, diabetes-related clinical traits, and whole-transcriptome expression profiling in five tissues (white adipose, liver, pancreatic islets, hypothalamus, and gastrocnemius muscle) were determined for all mice. We performed an integrative analysis to investigate the inter-relationship among genetic factors, expression traits, and plasma insulin, a hallmark diabetes trait. Among five tissues under study, there are extensive protein-protein interactions between genes responding to different loci in adipose and pancreatic islets that potentially jointly participated in the regulation of plasma insulin. We developed a novel ranking scheme based on cross-loci protein-protein network topology and gene expression to assess each gene's potential to regulate plasma insulin. Unique candidate genes were identified in adipose tissue and islets. In islets, the Alzheimer's gene App was identified as a top candidate regulator. Islets from 17-week-old, but not 10-week-old, App knockout mice showed increased insulin secretion in response to glucose or a membrane-permeant cAMP analog, in agreement with the predictions of the network model. Our result provides a novel hypothesis on the mechanism for the connection between two aging-related diseases: Alzheimer's disease and type 2 diabetes.
Identifying noncoding risk variants using disease-relevant gene regulatory networks.

Science.gov (United States)

Gao, Long; Uzun, Yasin; Gao, Peng; He, Bing; Ma, Xiaoke; Wang, Jiahui; Han, Shizhong; Tan, Kai

2018-02-16

Identifying noncoding risk variants remains a challenging task. Because noncoding variants exert their effects in the context of a gene regulatory network (GRN), we hypothesize that explicit use of disease-relevant GRNs can significantly improve the inference accuracy of noncoding risk variants. We describe Annotation of Regulatory Variants using Integrated Networks (ARVIN), a general computational framework for predicting causal noncoding variants. It employs a set of novel regulatory network-based features, combined with sequence-based features to infer noncoding risk variants. Using known causal variants in gene promoters and enhancers in a number of diseases, we show ARVIN outperforms state-of-the-art methods that use sequence-based features alone. Additional experimental validation using reporter assay further demonstrates the accuracy of ARVIN. Application of ARVIN to seven autoimmune diseases provides a holistic view of the gene subnetwork perturbed by the combinatorial action of the entire set of risk noncoding mutations.
A method to identify differential expression profiles of time-course gene data with Fourier transformation.

Science.gov (United States)

Kim, Jaehee; Ogden, Robert Todd; Kim, Haseong

2013-10-18

Time course gene expression experiments are an increasingly popular method for exploring biological processes. Temporal gene expression profiles provide an important characterization of gene function, as biological systems are both developmental and dynamic. With such data it is possible to study gene expression changes over time and thereby to detect differential genes. Much of the early work on analyzing time series expression data relied on methods developed originally for static data and thus there is a need for improved methodology. Since time series expression is a temporal process, its unique features such as autocorrelation between successive points should be incorporated into the analysis. This work aims to identify genes that show different gene expression profiles across time. We propose a statistical procedure to discover gene groups with similar profiles using a nonparametric representation that accounts for the autocorrelation in the data. In particular, we first represent each profile in terms of a Fourier basis, and then we screen out genes that are not differentially expressed based on the Fourier coefficients. Finally, we cluster the remaining gene profiles using a model-based approach in the Fourier domain. We evaluate the screening results in terms of sensitivity, specificity, FDR and FNR, compare with the Gaussian process regression screening in a simulation study and illustrate the results by application to yeast cell-cycle microarray expression data with alpha-factor synchronization. The key elements of the proposed methodology are: (i) representation of gene profiles in the Fourier domain; (ii) automatic screening of genes based on the Fourier coefficients and taking into account autocorrelation in the data, while controlling the false discovery rate (FDR); (iii) model-based clustering of the remaining gene profiles. Using this method, we identified a set of cell-cycle-regulated time-course yeast genes. The proposed method is general and can be
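Two ingredients named in the record above, a Fourier-basis representation of each profile and screening under false discovery rate control, can be sketched generically. The code below is an illustration under stated assumptions, not the authors' procedure: it assumes equally spaced time points, uses the standard Benjamini-Hochberg step-up rule as a stand-in for the FDR control, and treats the per-gene p-values as given (the test that would produce them from the coefficients is not shown). All function names are hypothetical.

```python
import math

def fourier_coefficients(profile, n_harmonics):
    """Cosine/sine coefficients (a_k, b_k) for harmonics k = 1..n_harmonics.

    Assumes equally spaced time points and n_harmonics < len(profile) / 2.
    """
    n = len(profile)
    coeffs = []
    for k in range(1, n_harmonics + 1):
        a_k = 2.0 / n * sum(v * math.cos(2 * math.pi * k * t / n)
                            for t, v in enumerate(profile))
        b_k = 2.0 / n * sum(v * math.sin(2 * math.pi * k * t / n)
                            for t, v in enumerate(profile))
        coeffs.append((a_k, b_k))
    return coeffs

def benjamini_hochberg(pvalues, q=0.05):
    """Indices of hypotheses rejected by the Benjamini-Hochberg procedure.

    Finds the largest rank r with p_(r) <= q * r / m and rejects the r
    smallest p-values.
    """
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    cutoff = 0
    for rank, i in enumerate(order, start=1):
        if pvalues[i] <= q * rank / m:
            cutoff = rank
    return sorted(order[:cutoff])

# A pure first-harmonic profile sampled at 8 time points (synthetic).
profile = [math.cos(2 * math.pi * t / 8) for t in range(8)]
coeffs = fourier_coefficients(profile, n_harmonics=2)

# Hypothetical per-gene p-values from some test on the coefficients.
kept = benjamini_hochberg([0.001, 0.04, 0.03, 0.5], q=0.05)
```

Genes whose indices are returned by `benjamini_hochberg` would be carried forward to the clustering step; a gene with a flat profile has near-zero coefficients at every harmonic and would typically be screened out.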
Gene-set analysis based on the pharmacological profiles of drugs to identify repurposing opportunities in schizophrenia.

Science.gov (United States)

de Jong, Simone; Vidler, Lewis R; Mokrab, Younes; Collier, David A; Breen, Gerome

2016-08-01

Genome-wide association studies (GWAS) have identified thousands of novel genetic associations for complex genetic disorders, leading to the identification of potential pharmacological targets for novel drug development. In schizophrenia, 108 conservatively defined loci that meet genome-wide significance have been identified and hundreds of additional sub-threshold associations harbour information on the genetic aetiology of the disorder. In the present study, we used gene-set analysis based on the known binding targets of chemical compounds to identify the 'drug pathways' most strongly associated with schizophrenia-associated genes, with the aim of identifying potential drug repositioning opportunities and clues for novel treatment paradigms, especially in multi-target drug development. We compiled 9389 gene sets (2496 with unique gene content) and interrogated gene-based p-values from the PGC2-SCZ analysis. Although no single drug exceeded experiment wide significance (corrected pneratinib. This is a proof of principle analysis showing the potential utility of GWAS data of schizophrenia for the direct identification of candidate drugs and molecules that show polypharmacy. © The Author(s) 2016.
Identifying Genes Controlling Ferulate Cross-Linking Formation in Grass Cell Walls

Energy Technology Data Exchange (ETDEWEB)

de O. Buanafina, Marcia Maria [Pennsylvania State Univ., University Park, PA (United States)]

2013-10-16

This proposal focuses on cell wall feruloylation and our long term goal is to identify and isolate novel genes controlling feruloylation and to characterize the phenotype of mutants in this pathway, with a spotlight on cell wall properties.
A Generally Applicable Translational Strategy Identifies S100A4 as a Candidate Gene in Allergy

DEFF Research Database (Denmark)

Bruhn, Sören; Fang, Yu; Barrenäs, Fredrik

2014-01-01

The identification of diagnostic markers and therapeutic candidate genes in common diseases is complicated by the involvement of thousands of genes. We hypothesized that genes co-regulated with a key gene in allergy, IL13, would form a module that could help to identify candidate genes. We identi...
Utility and Limitations of Using Gene Expression Data to Identify Functional Associations.

Directory of Open Access Journals (Sweden)

Sahra Uygun

2016-12-01

Gene co-expression has been widely used to hypothesize gene function through guilt-by-association. However, it is not clear to what degree co-expression is informative, whether it can be applied to genes involved in different biological processes, and how the type of dataset impacts inferences about gene functions. Here our goal is to assess the utility and limitations of using co-expression as a criterion to recover functional associations between genes. By determining the percentage of gene pairs in a metabolic pathway with significant expression correlation, we found that many genes in the same pathway do not have similar transcript profiles and that the choice of dataset, annotation quality, gene function, expression similarity measure, and clustering approach significantly impacts the ability to recover functional associations between genes, using Arabidopsis thaliana as an example. Some datasets are more informative in capturing coordinated expression profiles and larger data sets are not always better. In addition, to recover the maximum number of known pathways and identify candidate genes with similar functions, it is important to explore rather exhaustively multiple dataset combinations, similarity measures, clustering algorithms and parameters. Finally, we validated the biological relevance of co-expression cluster memberships with an independent phenomics dataset and found that genes that consistently cluster with leucine degradation genes tend to have similar leucine levels in mutants. This study provides a framework for obtaining gene functional associations by maximizing the information that can be obtained from gene expression datasets.
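The "percentage of gene pairs in a pathway with significant expression correlation" can be sketched as follows. This is a minimal illustration, assuming Pearson correlation and a fixed cutoff in place of a significance test; the gene names, the toy profiles, and the 0.8 threshold are hypothetical and do not come from the study.

```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson correlation of two equal-length profiles (non-zero variance assumed)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def fraction_coexpressed(profiles, threshold=0.8):
    """Fraction of gene pairs whose correlation is at least `threshold`.

    `profiles` maps a gene name to its expression values across samples.
    The threshold is an assumed stand-in for a proper significance test.
    """
    pairs = list(combinations(sorted(profiles), 2))
    hits = sum(
        1 for g, h in pairs if pearson(profiles[g], profiles[h]) >= threshold
    )
    return hits / len(pairs)


# Hypothetical three-gene pathway: only g1 and g2 rise and fall together.
pathway = {
    "g1": [1.0, 2.0, 3.0, 4.0],
    "g2": [2.0, 4.0, 6.0, 8.0],
    "g3": [4.0, 3.0, 2.0, 1.0],
}
print(fraction_coexpressed(pathway))
```

Swapping the similarity measure or the threshold changes the recovered fraction, which is one way to see why the abstract stresses exploring several measures and parameters.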
Transcriptome analysis of the Cryptocaryon irritans tomont stage identifies potential genes for the detection and control of cryptocaryonosis

Directory of Open Access Journals (Sweden)

Wan Kiew-Lian

2010-01-01

Background: Cryptocaryon irritans is a parasitic ciliate that causes cryptocaryonosis (white spot disease) in marine fish. Diagnosis of cryptocaryonosis often depends on the appearance of white spots on the surface of the fish, which are usually visible only during later stages of the disease. Identifying suitable biomarkers of this parasite would aid the development of diagnostic tools and control strategies for C. irritans. The C. irritans genome is virtually unexplored; therefore, we generated and analyzed expressed sequence tags (ESTs) of the parasite to identify genes that encode surface proteins, excretory/secretory proteins and repeat-containing proteins. Results: ESTs were generated from a cDNA library of C. irritans tomonts isolated from infected Asian sea bass, Lates calcarifer. Clustering of the 5356 ESTs produced 2659 unique transcripts (UTs) containing 1989 singletons and 670 consensi. BLAST analysis showed that 74% of the UTs had significant similarity (E-value -5) to sequences that are currently available in the GenBank database, with more than 15% of the significant hits showing unknown function. Forty percent of the UTs had significant similarity to ciliates from the genera Tetrahymena and Paramecium. Comparative gene family analysis with related taxa showed that many protein families are conserved among the protozoans. Based on gene ontology annotation, functional groups were successfully assigned to 790 UTs. Genes encoding excretory/secretory proteins and membrane and membrane-associated proteins were identified because these proteins often function as antigens and are good antibody targets. A total of 481 UTs were classified as encoding membrane proteins, 54 were classified as encoding membrane-bound proteins, and 155 were found to contain excretory/secretory protein-coding sequences. Amino acid repeat-containing proteins and GPI-anchored proteins were also identified as potential candidates for the development of
Gene Network for Identifying the Entropy Changes of Different Modules in Pediatric Sepsis

Directory of Open Access Journals (Sweden)

Jing Yang

2016-12-01

Background/Aims: Pediatric sepsis is a disease that threatens the lives of children. The incidence of pediatric sepsis is higher in developing countries for various reasons, such as insufficient immunization and nutrition, and water and air pollution. Exploring potential genes via different methods is important for the prevention and treatment of pediatric sepsis. This study aimed to identify potential genes associated with pediatric sepsis using gene network and entropy analysis. Methods: The mRNA expression in blood samples collected from 20 septic children and 30 healthy controls was quantified using the Affymetrix HG-U133A microarray. Two condition-specific protein-protein interaction networks (PINs), one for the healthy controls and the other for the children with sepsis, were deduced by combining the fundamental human PINs with gene expression profiles in the two phenotypes. Subsequently, distinct modules from the two conditional networks were extracted by adopting a maximal clique-merging approach. Delta entropy (ΔS) was calculated between sepsis and control modules. Results: Key genes displaying changes in gene composition were identified by matching the control and sepsis modules. Two objective modules were obtained, in which the ribosomal proteins RPL4 and RPL9, as well as TOP2A, were considered the probable key genes differentiating sepsis from healthy controls. Conclusion: According to previous reports and this work, TOP2A is a potential gene therapy target for pediatric sepsis. The relationship between pediatric sepsis and RPL4 and RPL9 needs further investigation.
Common genetic variation in six lipid-related and statin-related genes, statin use and risk of incident nonfatal myocardial infarction and stroke.

Science.gov (United States)

Hindorff, Lucia A; Lemaitre, Rozenn N; Smith, Nicholas L; Bis, Joshua C; Marciante, Kristin D; Rice, Kenneth M; Lumley, Thomas; Enquobahrie, Daniel A; Li, Guo; Heckbert, Susan R; Psaty, Bruce M

2008-08-01

Genetic polymorphisms are associated with lipid-lowering response to statins, but generalizability to disease endpoints is unclear. The association between 82 common single nucleotide polymorphisms (SNPs) in six lipid-related or statin-related genes (ABCB1, CETP, HMGCR, LDLR, LIPC, NOS3) and incident nonfatal myocardial infarction (MI) and ischemic stroke was analyzed according to current statin use and overall in a population-based case-control study (856 MI, 368 stroke, 2686 controls). Common SNPs were chosen from resequencing data using pairwise linkage disequilibrium. Gene-level analyses (testing global association within a gene) and SNP-level analyses (comparing the number of observed vs. expected associations across all genes) were performed using logistic regression, setting nominal statistical significance at P value of less than 0.05. No gene-level interactions with statin use on MI or stroke were identified. Across all genes, two SNP-statin interactions on MI were observed (one ABCB1, one LIPC) and five interactions on stroke (one CETP, four LIPC). The strongest SNP-statin interaction was for synonymous CETP SNP rs5883 on stroke (P=0.008). Gene-level associations were present for LIPC and MI (P=0.026), but not other genes or outcomes. SNP-level associations included three SNPs with MI (one LDLR, two LIPC) and two SNPs with stroke (one CETP, one LDLR). The number of observed SNP associations was no greater than expected by chance. Several potential novel associations or interactions of SNPs in ABCB1, CETP, LDLR, and LIPC with MI and stroke were identified; however, our results should be regarded as hypothesis generating until corroborated by other studies.
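The statement that the observed SNP associations were "no greater than expected by chance" can be made concrete with a small calculation. The sketch below assumes, for illustration only, that all 82 SNPs were tested once per outcome at the nominal 0.05 level and that the tests are independent; the SNPs were selected using linkage disequilibrium, so independence is an approximation, and the abstract does not describe the authors' exact procedure.

```python
from math import comb


def expected_false_positives(n_tests, alpha):
    """Expected number of nominally significant tests when every null is true."""
    return n_tests * alpha


def prob_at_least(k, n_tests, alpha):
    """Binomial probability of k or more nominal hits among n_tests null tests."""
    return sum(
        comb(n_tests, i) * alpha ** i * (1.0 - alpha) ** (n_tests - i)
        for i in range(k, n_tests + 1)
    )


n_snps, alpha = 82, 0.05  # figures taken from the abstract
print(expected_false_positives(n_snps, alpha))  # about 4 hits expected by chance
print(prob_at_least(3, n_snps, alpha))  # chance of seeing 3 or more hits (MI)
print(prob_at_least(2, n_snps, alpha))  # chance of seeing 2 or more hits (stroke)
```

Under these assumptions roughly four nominal hits per outcome would be expected from chance alone, so three associations with MI and two with stroke do not stand out.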

MeInfoText 2.0: gene methylation and cancer relation extraction from biomedical literature

Directory of Open Access Journals (Sweden)

Fang Yu-Ching

2011-12-01

Background: DNA methylation is regarded as a potential biomarker in the diagnosis and treatment of cancer. The relations between aberrant gene methylation and cancer development have been identified by a number of recent scientific studies. In a previous work, we used co-occurrences to mine those associations and compiled the MeInfoText 1.0 database. To reduce the amount of manual curation and improve the accuracy of relation extraction, we have now developed MeInfoText 2.0, which uses a machine learning-based approach to extract gene methylation-cancer relations. Description: Two maximum entropy models are trained to predict if aberrant gene methylation is related to any type of cancer mentioned in the literature. After evaluation based on 10-fold cross-validation, the average precision/recall rates of the two models are 94.7/90.1% and 91.8/90%, respectively. MeInfoText 2.0 provides the gene methylation profiles of different types of human cancer. The extracted relations with maximum probability, evidence sentences, and specific gene information are also retrievable. The database is available at http://bws.iis.sinica.edu.tw:8081/MeInfoText2/. Conclusion: The previous version, MeInfoText, was developed by using association rules, whereas MeInfoText 2.0 is based on a new framework that combines machine learning, dictionary lookup and pattern matching for epigenetics information extraction. The results of experiments show that MeInfoText 2.0 outperforms existing tools in many respects. To the best of our knowledge, this is the first study that uses a hybrid approach to extract gene methylation-cancer relations. It is also the first attempt to develop a gene methylation and cancer relation corpus.
AbMiner: A bioinformatic resource on available monoclonal antibodies and corresponding gene identifiers for genomic, proteomic, and immunologic studies

Directory of Open Access Journals (Sweden)

Shankavaram Uma

2006-04-01

Background: Monoclonal antibodies are used extensively throughout the biomedical sciences for detection of antigens, either in vitro or in vivo. We, for example, have used them for quantitation of proteins on "reverse-phase" protein lysate arrays. For those studies, we quality-controlled > 600 available monoclonal antibodies and also needed to develop precise information on the genes that encode their antigens. Translation among the various protein and gene identifier types proved non-trivial because of one-to-many and many-to-one relationships. To organize the antibody, protein, and gene information, we initially developed a relational database in Filemaker for our own use. When it became apparent that the information would be useful to many other researchers faced with the need to choose or characterize antibodies, we developed it further as AbMiner, a fully relational web-based database under MySQL, programmed in Java. Description: AbMiner is a user-friendly, web-based relational database of information on > 600 commercially available antibodies that we validated by Western blot for protein microarray studies. It includes many types of information on the antibody, the immunogen, the vendor, the antigen, and the antigen's gene. Multiple gene and protein identifier types provide links to corresponding entries in a variety of other public databases, including resources for phosphorylation-specific antibodies. AbMiner also includes our quality-control data against a pool of 60 diverse cancer cell types (the NCI-60) and also protein expression levels for the NCI-60 cells measured using our high-density "reverse-phase" protein lysate microarrays for a selection of the listed antibodies. Some other available database resources give information on antibody specificity for one or a couple of cell types. In contrast, the data in AbMiner indicate specificity with respect to the antigens in a pool of 60 diverse cell types from nine different
AbMiner: a bioinformatic resource on available monoclonal antibodies and corresponding gene identifiers for genomic, proteomic, and immunologic studies.

Science.gov (United States)

Major, Sylvia M; Nishizuka, Satoshi; Morita, Daisaku; Rowland, Rick; Sunshine, Margot; Shankavaram, Uma; Washburn, Frank; Asin, Daniel; Kouros-Mehr, Hosein; Kane, David; Weinstein, John N

2006-04-06

Monoclonal antibodies are used extensively throughout the biomedical sciences for detection of antigens, either in vitro or in vivo. We, for example, have used them for quantitation of proteins on "reverse-phase" protein lysate arrays. For those studies, we quality-controlled > 600 available monoclonal antibodies and also needed to develop precise information on the genes that encode their antigens. Translation among the various protein and gene identifier types proved non-trivial because of one-to-many and many-to-one relationships. To organize the antibody, protein, and gene information, we initially developed a relational database in Filemaker for our own use. When it became apparent that the information would be useful to many other researchers faced with the need to choose or characterize antibodies, we developed it further as AbMiner, a fully relational web-based database under MySQL, programmed in Java. AbMiner is a user-friendly, web-based relational database of information on > 600 commercially available antibodies that we validated by Western blot for protein microarray studies. It includes many types of information on the antibody, the immunogen, the vendor, the antigen, and the antigen's gene. Multiple gene and protein identifier types provide links to corresponding entries in a variety of other public databases, including resources for phosphorylation-specific antibodies. AbMiner also includes our quality-control data against a pool of 60 diverse cancer cell types (the NCI-60) and also protein expression levels for the NCI-60 cells measured using our high-density "reverse-phase" protein lysate microarrays for a selection of the listed antibodies. Some other available database resources give information on antibody specificity for one or a couple of cell types. In contrast, the data in AbMiner indicate specificity with respect to the antigens in a pool of 60 diverse cell types from nine different tissues of origin. AbMiner is a relational database that
Quantifying The Relative Importance Of Phylogeny And Environmental Preferences As Drivers Of Gene Content In Prokaryotic Microorganisms

Directory of Open Access Journals (Sweden)

Javier Tamames

2016-03-01

Two complementary forces shape microbial genomes: vertical inheritance of genes by phylogenetic descent, and acquisition of new genes related to adaptation to particular habitats and lifestyles. Quantification of the relative importance of each driving force proved difficult. We determined the contribution of each factor, and identified particular genes or biochemical/cellular processes linked to environmental preferences (i.e., propensity of a taxon to live in particular habitats). Three types of data were confronted: [i] complete genomes, which provide gene content of different taxa; [ii] phylogenetic information, via alignment of 16S rRNA sequences, which allowed determination of the distance between taxa, and [iii] distribution of species in environments via 16S rRNA sampling experiments, reflecting environmental preferences of different taxa. The combination of these three datasets made it possible to describe and quantify the relationships among them. We found that, although phylogenetic descent was responsible for shaping most genomes, a discernible part of the latter was correlated to environmental adaptations. Particular families of genes were identified as environmental markers, as supported by direct studies such as metagenomic sequencing. These genes are likely important for adaptation of bacteria to particular conditions or habitats, such as carbohydrate or glycan metabolism genes being linked to host-associated environments.
Sex steroid-related candidate genes in psychiatric disorders.

Science.gov (United States)

Westberg, Lars; Eriksson, Elias

2008-07-01

Sex steroids readily pass the blood-brain barrier, and receptors for them are abundant in brain areas important for the regulation of emotions, cognition and behaviour. Animal experiments have revealed both important early effects of these hormones on brain development and their ongoing influence on brain morphology and neurotransmission in the adult organism. The important effects of sex steroids on human behaviour are illustrated by, for example, the effect of reduced levels of these hormones on sexual drive and conditions such as premenstrual dysphoric disorder, perimenopausal dysphoria, postpartum depression, postpartum psychosis, dysphoria induced by oral contraceptives or hormonal replacement therapy and anabolic steroid-induced aggression. The fact that men and women (as groups) differ with respect to the prevalence of several psychiatric disorders, certain aspects of cognitive function and certain personality traits may possibly also reflect an influence of sex steroids on human behaviour. The heritability of most behavioural traits, including personality, cognitive abilities and susceptibility to psychiatric illness, is considerable, but as yet, only few genes of definite importance in this context have been identified. Given the important role of sex steroids for brain function, it is unfortunate that relatively few studies so far have addressed the possible influence of sex steroid-related genes on interindividual differences with respect to personality, cognition and susceptibility to psychiatric disorders. To facilitate further research in this area, this review provides information on several such genes and summarizes what is currently known with respect to their possible influence on brain function.
The relation of serotonin-related gene and COMT gene polymorphisms with criminal behavior in schizophrenic disorder.

Science.gov (United States)

Koh, Kyung Bong; Choi, Eun Hee; Lee, Young-joon; Han, Mooyoung; Choi, Sang-Sup; Kim, So Won; Lee, Min Goo

2012-02-01

It has been suggested that patients with schizophrenia might be involved in criminal behavior, such as homicidal and violent behavior. However, the relationship between criminal behavior and genes in patients with schizophrenia has not been clearly elucidated. The objective of this study was to examine the relation between criminal behavior and serotonin-related gene or catechol-O-methyltransferase (COMT) gene polymorphisms in patients with schizophrenia. Serotonin-related and COMT polymorphic markers were assessed by using single nucleotide polymorphism (SNP) genotyping. Ninety-nine crime-related inpatients with schizophrenia (57 homicidal and 42 nonhomicidal violent) and 133 healthy subjects were enrolled between October 2005 and May 2008. Diagnoses were made according to the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV) criteria. The genotype frequencies of tryptophan hydroxylase-1 (TPH1) A218C and COMT V158M were compared between groups. The TPH1 CC genotype had 2.7-fold higher odds of crime-related schizophrenia compared with A-carrier genotype after the analysis was controlled for sex and age (OR, 2.69; 95% CI, 1.22 - 5.91; P = .01). In addition, the TPH1 CC genotype had 3.4-fold higher odds of homicidal schizophrenia compared with A-carrier genotype after the analysis was controlled for sex and age (OR, 3.38; 95% CI, 1.40 - 8.18; P = .007). However, no significant differences were found in the frequencies of genotype of COMT polymorphism between criminal schizophrenics and healthy subjects, nor were any significant differences found between nonhomicidal schizophrenics and healthy subjects. These results indicate that the TPH1 CC recessive genotype is likely to be a genetic risk factor for criminal behavior, especially homicidal behavior in patients with schizophrenia. However, COMT gene polymorphisms were not associated with criminal behavior in schizophrenic patients. © Copyright 2012 Physicians Postgraduate Press, Inc.
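The reported odds ratios and confidence intervals can be checked for internal consistency. The sketch below assumes the intervals are Wald-type 95% intervals that are symmetric on the log-odds scale, which is typical of logistic regression output but is not stated in the abstract; the function names are illustrative.

```python
import math


def log_scale_midpoint(lower, upper):
    """Point estimate implied by a CI that is symmetric on the log scale."""
    return math.exp((math.log(lower) + math.log(upper)) / 2.0)


def log_or_standard_error(lower, upper, z=1.96):
    """Standard error of log(OR) implied by a Wald interval with critical value z."""
    return (math.log(upper) - math.log(lower)) / (2.0 * z)


# Intervals quoted in the abstract for the TPH1 CC genotype.
for label, lower, upper in [
    ("crime-related", 1.22, 5.91),
    ("homicidal", 1.40, 8.18),
]:
    print(label, log_scale_midpoint(lower, upper), log_or_standard_error(lower, upper))
```

The midpoints recovered this way agree with the reported odds ratios of 2.69 and 3.38 to within rounding, which supports reading the intervals as symmetric on the log scale.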
Genome-wide analysis of cell wall-related genes in Tuber melanosporum.

Science.gov (United States)

Balestrini, Raffaella; Sillo, Fabiano; Kohler, Annegret; Schneider, Georg; Faccio, Antonella; Tisserant, Emilie; Martin, Francis; Bonfante, Paola

2012-06-01

A genome-wide inventory of proteins involved in cell wall synthesis and remodeling has been obtained by taking advantage of the recently released genome sequence of the ectomycorrhizal Tuber melanosporum black truffle. Genes that encode cell wall biosynthetic enzymes, enzymes involved in cell wall polysaccharide synthesis or modification, GPI-anchored proteins and other cell wall proteins were identified in the black truffle genome. As a second step, array data were validated and the symbiotic stage was chosen as the main focus. Quantitative RT-PCR experiments were performed on 29 selected genes to verify their expression during ectomycorrhizal formation. The results confirmed the array data, and this suggests that cell wall-related genes are required for morphogenetic transition from mycelium growth to the ectomycorrhizal branched hyphae. Labeling experiments were also performed on T. melanosporum mycelium and ectomycorrhizae to localize cell wall components.
Phase analysis of circadian-related genes in two tissues

Directory of Open Access Journals (Sweden)

Li Leping

2006-02-01

Background: Recent circadian clock studies using gene expression microarray in two different tissues of mouse have revealed not all circadian-related genes are synchronized in phase or peak expression times across tissues in vivo. Instead, some circadian-related genes may be delayed by 4–8 hrs in peak expression in one tissue relative to the other. These interesting biological observations prompt a statistical question regarding how to distinguish the synchronized genes from genes that are systematically lagged in phase/peak expression time across two tissues. Results: We propose a set of techniques from circular statistics to analyze phase angles of circadian-related genes in two tissues. We first estimate the phases of a cycling gene separately in each tissue, which are then used to estimate the paired angular difference of the phase angles of the gene in the two tissues. These differences are modeled as a mixture of two von Mises distributions which enables us to cluster genes into two groups; one group having synchronized transcripts with the same phase in the two tissues, the other containing transcripts with a discrepancy in phase between the two tissues. For each cluster of genes we assess the association of phases across the tissue types using circular-circular regression. We also develop a bootstrap methodology based on a circular-circular regression model to evaluate the improvement in fit provided by allowing two components versus a one-component von-Mises model. Conclusion: We applied our proposed methodologies to the circadian-related genes common to heart and liver tissues in Storch et al. [2], and found that an estimated 80% of circadian-related transcripts common to heart and liver tissues were synchronized in phase, and the other 20% of transcripts were lagged about 8 hours in liver relative to heart. The bootstrap p-value for being one cluster is 0.063, which suggests the possibility of two clusters. Our methodologies can
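The paired angular difference at the centre of this analysis has to respect the 24-hour wrap-around, which ordinary subtraction does not. The sketch below shows one way to compute it, together with a circular mean; it covers only this preprocessing step, the function names and the example phases are hypothetical, and the von Mises mixture and circular-circular regression described in the abstract are not implemented.

```python
import math


def phase_difference_hours(phase_a, phase_b, period=24.0):
    """Signed lag of phase_b behind phase_a, wrapped into [-period/2, period/2)."""
    return (phase_b - phase_a + period / 2.0) % period - period / 2.0


def circular_mean_hours(phases, period=24.0):
    """Mean direction of clock times, returned in [0, period]."""
    angles = [2.0 * math.pi * p / period for p in phases]
    sin_sum = sum(math.sin(a) for a in angles)
    cos_sum = sum(math.cos(a) for a in angles)
    return (math.atan2(sin_sum, cos_sum) * period / (2.0 * math.pi)) % period


# Hypothetical peak times (hours) for one transcript in two tissues.
print(phase_difference_hours(22.0, 2.0))  # peak moves from 22:00 to 02:00
print(phase_difference_hours(6.0, 14.0))  # peak moves from 06:00 to 14:00
print(circular_mean_hours([23.0, 1.0]))  # averages near midnight, not noon
```

Differences computed this way are the quantities that would be passed to a two-component mixture, with one component centred near zero for synchronized transcripts and one near the lag for delayed transcripts.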
Gene expression profiling in Entamoeba histolytica identifies key components in iron uptake and metabolism.

Directory of Open Access Journals (Sweden)

Nora Adriana Hernández-Cuevas

Entamoeba histolytica is an ameboid parasite that causes colonic dysentery and liver abscesses in humans. The parasite encounters dramatic changes in iron concentration during its invasion of the host, with relatively low levels in the intestinal lumen and then relatively high levels in the blood and liver. The liver notably contains sources of iron; therefore, the parasite's ability to use these sources might be relevant to its survival in the liver and thus the pathogenesis of liver abscesses. The objective of the present study was to identify factors involved in iron uptake, use and storage in E. histolytica. We compared the respective transcriptomes of E. histolytica trophozoites grown in normal medium (containing around 169 µM iron), low-iron medium (around 123 µM iron), iron-deficient medium (around 91 µM iron), and iron-deficient medium replenished with hemoglobin. The differentially expressed genes included those coding for the ATP-binding cassette transporters and major facilitator transporters (which share homology with bacterial siderophores and heme transporters) and genes involved in heme biosynthesis and degradation. Iron deficiency was associated with increased transcription of genes encoding a subset of cell signaling molecules, some of which have previously been linked to adaptation to the intestinal environment and virulence. The present study is the first to have assessed the transcriptome of E. histolytica grown under various iron concentrations. Our results provide insights into the pathways involved in iron uptake and metabolism in this parasite.
Gene expression profiling in Entamoeba histolytica identifies key components in iron uptake and metabolism.

Science.gov (United States)

Hernández-Cuevas, Nora Adriana; Weber, Christian; Hon, Chung-Chau; Guillen, Nancy

2014-01-01

Entamoeba histolytica is an ameboid parasite that causes colonic dysentery and liver abscesses in humans. The parasite encounters dramatic changes in iron concentration during its invasion of the host, with relatively low levels in the intestinal lumen and then relatively high levels in the blood and liver. The liver notably contains sources of iron; therefore, the parasite's ability to use these sources might be relevant to its survival in the liver and thus the pathogenesis of liver abscesses. The objective of the present study was to identify factors involved in iron uptake, use and storage in E. histolytica. We compared the respective transcriptomes of E. histolytica trophozoites grown in normal medium (containing around 169 µM iron), low-iron medium (around 123 µM iron), iron-deficient medium (around 91 µM iron), and iron-deficient medium replenished with hemoglobin. The differentially expressed genes included those coding for the ATP-binding cassette transporters and major facilitator transporters (which share homology with bacterial siderophores and heme transporters) and genes involved in heme biosynthesis and degradation. Iron deficiency was associated with increased transcription of genes encoding a subset of cell signaling molecules, some of which have previously been linked to adaptation to the intestinal environment and virulence. The present study is the first to have assessed the transcriptome of E. histolytica grown under various iron concentrations. Our results provide insights into the pathways involved in iron uptake and metabolism in this parasite.
Susceptible genes and molecular pathways related to heavy ion irradiation in oral squamous cell carcinoma cells

International Nuclear Information System (INIS)

Fushimi, Kazuaki; Uzawa, Katsuhiro; Ishigami, Takashi; Yamamoto, Nobuharu; Kawata, Tetsuya; Shibahara, Takahiko; Ito, Hisao; Mizoe, Jun-etsu; Tsujii, Hirohiko; Tanzawa, Hideki

2008-01-01

Background and purpose: Heavy ion beams are high linear energy transfer (LET) radiation characterized by a higher relative biologic effectiveness than low LET radiation. The aim of the current study was to determine the difference of gene expression between heavy ion beams and X-rays in oral squamous cell carcinoma (OSCC)-derived cells. Materials and methods: The OSCC cells were irradiated with accelerated carbon or neon ion irradiation or X-rays using three different doses. We sought to identify genes the expression of which is affected by carbon and neon ion irradiation using Affymetrix GeneChip analysis. The identified genes were analyzed using the Ingenuity Pathway Analysis Tool to investigate the functional network and gene ontology. Changes in mRNA expression in the genes were assessed by real-time quantitative reverse transcriptase-polymerase chain reaction (qRT-PCR). Results: The microarray analysis identified 84 genes that were modulated by carbon and neon ion irradiation at all doses in OSCC cells. Among the genes, three genes (TGFBR2, SMURF2, and BMP7) and two genes (CCND1 and E2F3), respectively, were found to be involved in the transforming growth factor β-signaling pathway and cell cycle:G1/S checkpoint regulation pathway. The qRT-PCR data from the five genes after heavy ion irradiation were consistent with the microarray data (P < 0.01). Conclusion: Our findings should serve as a basis for global characterization of radiation-regulated genes and pathways in heavy ion-irradiated OSCC
Identifying time-delayed gene regulatory networks via an evolvable hierarchical recurrent neural network.

Science.gov (United States)

Kordmahalleh, Mina Moradi; Sefidmazgi, Mohammad Gorji; Harrison, Scott H; Homaifar, Abdollah

2017-01-01

The modeling of genetic interactions within a cell is crucial for a basic understanding of physiology and for applied areas such as drug design. Interactions in gene regulatory networks (GRNs) include effects of transcription factors, repressors, small metabolites, and microRNA species. In addition, the effects of regulatory interactions are not always simultaneous, but can occur after a finite time delay, or as a combined outcome of simultaneous and time delayed interactions. Powerful biotechnologies have been rapidly and successfully measuring levels of genetic expression to illuminate different states of biological systems. This has led to an ensuing challenge to improve the identification of specific regulatory mechanisms through regulatory network reconstructions. Solutions to this challenge will ultimately help to spur forward efforts based on the usage of regulatory network reconstructions in systems biology applications. We have developed a hierarchical recurrent neural network (HRNN) that identifies time-delayed gene interactions using time-course data. A customized genetic algorithm (GA) was used to optimize hierarchical connectivity of regulatory genes and a target gene. The proposed design provides a non-fully connected network with the flexibility of using recurrent connections inside the network. These features and the non-linearity of the HRNN facilitate the process of identifying temporal patterns of a GRN. Our HRNN method was implemented with the Python language. It was first evaluated on simulated data representing linear and nonlinear time-delayed gene-gene interaction models across a range of network sizes and variances of noise. We then further demonstrated the capability of our method in reconstructing GRNs of the Saccharomyces cerevisiae synthetic network for in vivo benchmarking of reverse-engineering and modeling approaches (IRMA). We compared the performance of our method to TD-ARACNE, HCC-CLINDE, TSNI and ebdbNet across different network
Apoptosis related genes expressed in cultured Fallopian tube epithelial cells infected in vitro with Neisseria gonorrhoeae

Directory of Open Access Journals (Sweden)

PAZ A REYES

2007-01-01

Background: Infection of the Fallopian tubes (FT) by Neisseria gonorrhoeae (Ngo) can lead to acute salpingitis, an inflammatory condition resulting in damage primarily to the ciliated cells, with loss of ciliary activity and sloughing of the cells from the epithelium. Recently, we have shown that Ngo infection induced apoptosis in FT epithelium cells by a TNF-alpha dependent mechanism that could contribute to the cell and tissue damage observed in gonococcal salpingitis. Aim: To investigate the apoptosis-related genes expressed during apoptosis induction in cultured FT epithelial cells infected in vitro by Ngo. Materials and Methods: In the current study, we used cDNA macroarrays and real time PCR to identify and determine the expression levels of apoptosis related genes during the in vitro gonococci infection of FT epithelial cells. Results: Significant apoptosis was induced following infection with Ngo. Macroarray analysis identified the expression of multiple genes of the TNF receptor family (TNFRSF1B, -4, -6, -10A, -10B and -10D) and the Bcl-2 family (BAK1, BAX, BLK, HRK and MCL-1) without differences between controls and infected cells. This lack of difference was confirmed by RT-PCR of BAX, Bcl-2, TNFRSF1A (TNFR-I) and TNFRSF1B (TNFR-II). Conclusion: Several genes related to apoptosis are expressed in primary cultures of epithelial cells of the human Fallopian tube. Infection with Ngo induces apoptosis without changes in the pattern of gene expression of several apoptosis-related genes. Results strongly suggest that Ngo regulates apoptosis in the FT by post-transcriptional mechanisms that need to be further addressed.
Analysis of pan-genome to identify the core genes and essential genes of Brucella spp.

Science.gov (United States)

Yang, Xiaowen; Li, Yajie; Zang, Juan; Li, Yexia; Bie, Pengfei; Lu, Yanli; Wu, Qingmin

2016-04-01

Brucella spp. are facultative intracellular pathogens that cause a contagious zoonotic disease, which can result in abortion or sterility in susceptible animal hosts and in grave, debilitating illness in humans. To decipher the survival mechanism of Brucella spp. in vivo, 42 complete Brucella genomes from NCBI were analyzed to determine the composition and function of the pan-genome and core genome. The results showed that the 132,143 protein-coding genes in these genomes were divided into 5369 clusters. Among these, 1710 clusters were associated with the core genome, 1182 clusters with strain-specific genes and 2477 clusters with the dispensable genome. COG analysis indicated that 44% of the core genes were devoted to metabolism, mainly energy production and conversion (COG category C) and amino acid transport and metabolism (COG category E). Meanwhile, approximately 35% of the core genes were under positive selection. In addition, 1252 potential essential genes were predicted in the core genome by comparison with a prokaryote database of essential genes. The results suggested that the core genes in Brucella genomes are relatively conserved, and that energy and amino acid metabolism play a more important role in the growth and reproduction of Brucella spp. This study might help us to better understand the mechanisms of persistent Brucella infection and provide some clues for further exploring the gene modules of intracellular survival in Brucella spp.
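The three-way split of gene clusters described in this abstract can be illustrated with a minimal sketch. It assumes the common convention that a cluster present in every genome is core, one present in a single genome is strain-specific, and everything in between is dispensable; the abstract does not state the exact thresholds the authors used. The function name, the toy genomes and the cluster names are hypothetical.

```python
from typing import Dict, Set


def classify_clusters(presence: Dict[str, Set[str]], n_genomes: int) -> Dict[str, str]:
    """Label each gene cluster as core, dispensable, or strain-specific.

    `presence` maps a cluster name to the set of genomes that carry it.
    The thresholds are an assumed convention, not taken from the study.
    """
    labels = {}
    for cluster, genomes in presence.items():
        if len(genomes) == n_genomes:
            labels[cluster] = "core"             # found in every genome
        elif len(genomes) == 1:
            labels[cluster] = "strain-specific"  # found in exactly one genome
        else:
            labels[cluster] = "dispensable"      # found in some, but not all
    return labels


# Toy example with three hypothetical genomes (not data from the study).
toy = {
    "cluster_a": {"g1", "g2", "g3"},
    "cluster_b": {"g1", "g3"},
    "cluster_c": {"g2"},
}
toy_labels = classify_clusters(toy, n_genomes=3)

# The counts reported in the abstract partition the full set of clusters.
reported = {"core": 1710, "strain-specific": 1182, "dispensable": 2477}
reported_total = sum(reported.values())
```

The last two lines only check the arithmetic of the abstract: the three reported categories add up to the 5369 clusters.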
Plasticity-Related Gene Expression During Eszopiclone-Induced Sleep.

Science.gov (United States)

Gerashchenko, Dmitry; Pasumarthi, Ravi K; Kilduff, Thomas S

2017-07-01

Experimental evidence suggests that restorative processes depend on synaptic plasticity changes in the brain during sleep. We used the expression of plasticity-related genes to assess synaptic plasticity changes during drug-induced sleep. We first characterized sleep induced by eszopiclone in mice during baseline conditions and during the recovery from sleep deprivation. We then compared the expression of 18 genes and two miRNAs critically involved in synaptic plasticity in these mice. Gene expression was assessed in the cerebral cortex and hippocampus by the TaqMan reverse transcription polymerase chain reaction and correlated with sleep parameters. Eszopiclone reduced the latency to nonrapid eye movement (NREM) sleep and increased NREM sleep amounts. Eszopiclone had no effect on slow wave activity (SWA) during baseline conditions but reduced the SWA increase during recovery sleep (RS) after sleep deprivation. Gene expression analyses revealed three distinct patterns: (1) four genes had higher expression either in the cortex or hippocampus in the group of mice with increased amounts of wakefulness; (2) a large proportion of plasticity-related genes (7 out of 18 genes) had higher expression during RS in the cortex but not in the hippocampus; and (3) six genes and the two miRNAs showed no significant changes across conditions. Even at a relatively high dose (20 mg/kg), eszopiclone did not reduce the expression of plasticity-related genes during RS period in the cortex. These results indicate that gene expression associated with synaptic plasticity occurs in the cortex in the presence of a hypnotic medication. © Sleep Research Society 2017. Published by Oxford University Press on behalf of the Sleep Research Society. All rights reserved. For permissions, please e-mail journals.permissions@oup.com.
Gene expression profiling of prostate tissue identifies chromatin regulation as a potential link between obesity and lethal prostate cancer.

Science.gov (United States)

Ebot, Ericka M; Gerke, Travis; Labbé, David P; Sinnott, Jennifer A; Zadra, Giorgia; Rider, Jennifer R; Tyekucheva, Svitlana; Wilson, Kathryn M; Kelly, Rachel S; Shui, Irene M; Loda, Massimo; Kantoff, Philip W; Finn, Stephen; Vander Heiden, Matthew G; Brown, Myles; Giovannucci, Edward L; Mucci, Lorelei A

2017-11-01

Obese men are at higher risk of advanced prostate cancer and cancer-specific mortality; however, the biology underlying this association remains unclear. This study examined gene expression profiles of prostate tissue to identify biological processes differentially expressed by obesity status and lethal prostate cancer. Gene expression profiling was performed on tumor (n = 402) and adjacent normal (n = 200) prostate tissue from participants in 2 prospective cohorts who had been diagnosed with prostate cancer from 1982 to 2005. Body mass index (BMI) was calculated from the questionnaire immediately preceding cancer diagnosis. Men were followed for metastases or prostate cancer-specific death (lethal disease) through 2011. Gene Ontology biological processes differentially expressed by BMI were identified using gene set enrichment analysis. Pathway scores were computed by averaging the signal intensities of member genes. Odds ratios (ORs) for lethal prostate cancer were estimated with logistic regression. Among 402 men, 48% were healthy weight, 31% were overweight, and 21% were very overweight/obese. Fifteen gene sets were enriched in tumor tissue, but not normal tissue, of very overweight/obese men versus healthy-weight men; 5 of these were related to chromatin modification and remodeling (false-discovery rate 7, 41% vs 17%; P = 2 × 10^-4) and an increased risk of lethal disease that was independent of grade and stage (OR, 5.26; 95% confidence interval, 2.37-12.25). This study improves our understanding of the biology of aggressive prostate cancer and identifies a potential mechanistic link between obesity and prostate cancer death that warrants further study. Cancer 2017;123:4130-4138. © 2017 American Cancer Society.
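The pathway score mentioned in this abstract (the average signal intensity of a gene set's member genes) can be sketched as follows. This is an illustration under stated assumptions, not the authors' pipeline: the function name and gene identifiers are hypothetical, and skipping member genes that were not measured is a choice made here for the sketch.

```python
from statistics import mean
from typing import Dict, List


def pathway_score(intensities: Dict[str, float], members: List[str]) -> float:
    """Average the signal intensities of a pathway's measured member genes.

    Member genes missing from `intensities` are skipped (an assumption of
    this sketch; the abstract does not say how missing genes were handled).
    """
    measured = [intensities[gene] for gene in members if gene in intensities]
    if not measured:
        raise ValueError("no member gene of this pathway was measured")
    return mean(measured)


# Hypothetical intensities for one sample (not data from the study).
sample = {"GENE_A": 8.0, "GENE_B": 10.0, "GENE_C": 6.0, "GENE_D": 3.0}
score = pathway_score(sample, ["GENE_A", "GENE_B", "GENE_C"])
```

In a study like this one, a score of this kind would be computed per sample and per gene set and then used as a predictor in the logistic regression for lethal disease.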
Sugarcane genes related to mitochondrial function

Directory of Open Access Journals (Sweden)

Fonseca Ghislaine V.

2001-01-01

Mitochondria function as metabolic powerhouses by generating energy through oxidative phosphorylation and have become the focus of renewed interest due to progress in understanding the subtleties of their biogenesis and the discovery of the important roles which these organelles play in senescence, cell death and the assembly of iron-sulfur (Fe/S) centers. Using proteins from the yeast Saccharomyces cerevisiae, Homo sapiens and Arabidopsis thaliana we searched the sugarcane expressed sequence tag (SUCEST) database for the presence of expressed sequence tags (ESTs) with similarity to nuclear genes related to mitochondrial functions. Starting with 869 protein sequences, we searched for sugarcane EST counterparts to these proteins using the basic local alignment search tool TBLASTN similarity searching program run against 260,781 sugarcane ESTs contained in 81,223 clusters. We were able to recover 367 clusters likely to represent sugarcane orthologues of the corresponding genes from S. cerevisiae, H. sapiens and A. thaliana with E-value <= 10^-10. Gene products belonging to all functional categories related to mitochondrial functions were found and this allowed us to produce an overview of the nuclear genes required for sugarcane mitochondrial biogenesis and function as well as providing a starting point for detailed analysis of sugarcane gene structure and physiology.
Robust Nonnegative Matrix Factorization via Joint Graph Laplacian and Discriminative Information for Identifying Differentially Expressed Genes

Directory of Open Access Journals (Sweden)

Ling-Yun Dai

2017-01-01

Differential expression plays an important role in cancer diagnosis and classification. In recent years, many methods have been used to identify differentially expressed genes. However, the recognition rate and reliability of gene selection still need to be improved. In this paper, a novel constrained method named robust nonnegative matrix factorization via joint graph Laplacian and discriminative information (GLD-RNMF) is proposed for identifying differentially expressed genes, in which manifold learning and the discriminative label information are incorporated into the traditional nonnegative matrix factorization model to train the objective matrix. Specifically, L2,1-norm minimization is enforced on both the error function and the regularization term, which makes the model robust to outliers and noise in gene data. Furthermore, the multiplicative update rules and the details of the convergence proof are shown for the new model. The experimental results on two publicly available cancer datasets demonstrate that GLD-RNMF is an effective method for identifying differentially expressed genes.
Integration of molecular biology tools for identifying promoters and genes abundantly expressed in flowers of Oncidium Gower Ramsey

Directory of Open Access Journals (Sweden)

Tung Shu-Yun

2011-04-01

Background: Orchids comprise one of the largest families of flowering plants and generate commercially important flowers. However, model plants, such as Arabidopsis thaliana do not contain all plant genes, and agronomic and horticulturally important genera and species must be individually studied. Results: Several molecular biology tools were used to isolate flower-specific gene promoters from Oncidium 'Gower Ramsey' (Onc. GR). A cDNA library of reproductive tissues was used to construct a microarray in order to compare gene expression in flowers and leaves. Five genes were highly expressed in flower tissues, and the subcellular locations of the corresponding proteins were identified using lip transient transformation with fluorescent protein-fusion constructs. BAC clones of the 5 genes, together with 7 previously published flower- and reproductive growth-specific genes in Onc. GR, were identified for cloning of their promoter regions. Interestingly, 3 of the 5 novel flower-abundant genes were putative trypsin inhibitor (TI) genes (OnTI1, OnTI2 and OnTI3), which were tandemly duplicated in the same BAC clone. Their promoters were identified using transient GUS reporter gene transformation and stable A. thaliana transformation analyses. Conclusions: By combining cDNA microarray, BAC library, and bombardment assay techniques, we successfully identified flower-directed orchid genes and promoters.
Candidate gene linkage approach to identify DNA variants that predispose to preterm birth

DEFF Research Database (Denmark)

Bream, Elise N A; Leppellere, Cara R; Cooper, Margaret E

2013-01-01

Background: The aim of this study was to identify genetic variants contributing to preterm birth (PTB) using a linkage candidate gene approach. Methods: We studied 99 single-nucleotide polymorphisms (SNPs) for 33 genes in 257 families with PTBs segregating. Nonparametric and parametric analyses were...... through the infant and/or the mother in the etiology of PTB....

Transcriptome analysis of recurrently deregulated genes across multiple cancers identifies new pan-cancer biomarkers

DEFF Research Database (Denmark)

Kaczkowski, Bogumil; Tanaka, Yuji; Kawaji, Hideya

2016-01-01

Genes that are commonly deregulated in cancer are clinically attractive as candidate pan-diagnostic markers and therapeutic targets. To globally identify such targets, we compared Cap Analysis of Gene Expression (CAGE) profiles from 225 different cancer cell lines and 339 corresponding primary cell...
A database of annotated promoters of genes associated with common respiratory and related diseases

KAUST Repository

Chowdhary, Rajesh

2012-07-01

Many genes have been implicated in the pathogenesis of common respiratory and related diseases (RRDs), yet the underlying mechanisms are largely unknown. Differential gene expression patterns in diseased and healthy individuals suggest that RRDs affect or are affected by modified transcription regulation programs. It is thus crucial to characterize implicated genes in terms of transcriptional regulation. For this purpose, we conducted a promoter analysis of genes associated with 11 common RRDs including allergic rhinitis, asthma, bronchiectasis, bronchiolitis, bronchitis, chronic obstructive pulmonary disease, cystic fibrosis, emphysema, eczema, psoriasis, and urticaria, many of which are thought to be genetically related. The objective of the present study was to obtain deeper insight into the transcriptional regulation of these disease-associated genes by annotating their promoter regions with transcription factors (TFs) and TF binding sites (TFBSs). We discovered many TFs that are significantly enriched in the target disease groups including associations that have been documented in the literature. We also identified a number of putative TFs/TFBSs that appear to be novel. The results of our analysis are provided in an online database that is freely accessible to researchers at http://www.respiratorygenomics.com. Promoter-associated TFBS information and related genomic features, such as histone modification sites, microsatellites, CpG islands, and SNPs, are graphically summarized in the database. Users can compare and contrast underlying mechanisms of specific RRDs relative to candidate genes, TFs, gene ontology terms, micro-RNAs, and biological pathways for the conduct of metaanalyses. This database represents a novel, useful resource for RRD researchers. Copyright © 2012 by the American Thoracic Society.
A database of annotated promoters of genes associated with common respiratory and related diseases

KAUST Repository

Chowdhary, Rajesh; Tan, Sinlam; Pavesi, Giulio; Jin, Gg; Dong, Difeng; Mathur, Sameer K.; Burkart, Arthur; Narang, Vipin; Glurich, Ingrid E.; Raby, Benjamin A.; Weiss, Scott T.; Limsoon, Wong; Liu, Jun; Bajic, Vladimir B.

2012-01-01

Many genes have been implicated in the pathogenesis of common respiratory and related diseases (RRDs), yet the underlying mechanisms are largely unknown. Differential gene expression patterns in diseased and healthy individuals suggest that RRDs affect or are affected by modified transcription regulation programs. It is thus crucial to characterize implicated genes in terms of transcriptional regulation. For this purpose, we conducted a promoter analysis of genes associated with 11 common RRDs including allergic rhinitis, asthma, bronchiectasis, bronchiolitis, bronchitis, chronic obstructive pulmonary disease, cystic fibrosis, emphysema, eczema, psoriasis, and urticaria, many of which are thought to be genetically related. The objective of the present study was to obtain deeper insight into the transcriptional regulation of these disease-associated genes by annotating their promoter regions with transcription factors (TFs) and TF binding sites (TFBSs). We discovered many TFs that are significantly enriched in the target disease groups including associations that have been documented in the literature. We also identified a number of putative TFs/TFBSs that appear to be novel. The results of our analysis are provided in an online database that is freely accessible to researchers at http://www.respiratorygenomics.com. Promoter-associated TFBS information and related genomic features, such as histone modification sites, microsatellites, CpG islands, and SNPs, are graphically summarized in the database. Users can compare and contrast underlying mechanisms of specific RRDs relative to candidate genes, TFs, gene ontology terms, micro-RNAs, and biological pathways for the conduct of metaanalyses. This database represents a novel, useful resource for RRD researchers. Copyright © 2012 by the American Thoracic Society.
Hypomethylation and Aberrant Expression of the Glioma Pathogenesis-Related 1 Gene in Wilms Tumors

Directory of Open Access Journals (Sweden)

Laxmi Chilukamarri

2007-11-01

Wilms tumors (WTs) have a complex etiology, displaying genetic and epigenetic changes, including loss of imprinting (LOI) and tumor suppressor gene silencing. To identify new regions of epigenetic perturbation in WTs, we screened kidney and tumor DNA using CpG island (CGI) tags associated with cancer-specific DNA methylation changes. One such tag corresponded to a paralog of the glioma pathogenesis-related 1/related to testis-specific, vespid, and pathogenesis proteins 1 (GLIPR1/RTVP-1) gene, previously reported to be a tumor-suppressor gene silenced by hypermethylation in prostate cancer. Here we report methylation analysis of the GLIPR1/RTVP-1 gene in WTs and normal fetal and pediatric kidneys. Hypomethylation of the GLIPR1/RTVP-1 5'-region in WTs relative to normal tissue is observed in 21/24 (87.5%) of WTs analyzed. Quantitative analysis of GLIPR1/RTVP-1 expression in 24 WTs showed elevated transcript levels in 16/24 WTs (67%), with 12 WTs displaying in excess of 20-fold overexpression relative to fetal kidney (FK) control samples. Immunohistochemical analysis of FK and WT corroborates the RNA expression data and reveals high GLIPR1/RTVP-1 in WT blastemal cells together with variable levels in stromal and epithelial components. Hypomethylation is also evident in the WT precursor lesions and nephrogenic rests (NRs), supporting a role for GLIPR1/RTVP-1 deregulation early in Wilms tumorigenesis. Our data show that, in addition to gene dosage changes arising from LOI and hypermethylation-induced gene silencing, gene activation resulting from hypomethylation is also prevalent in WTs.
Generation and Analysis of Expressed Sequence Tags (ESTs) from Halophyte Atriplex canescens to Explore Salt-Responsive Related Genes

Directory of Open Access Journals (Sweden)

Jingtao Li

2014-06-01

Little information is available on gene expression profiling of the halophyte A. canescens. To elucidate the molecular mechanism for stress tolerance in A. canescens, a full-length complementary DNA library was generated from A. canescens exposed to 400 mM NaCl, and provided 343 high-quality ESTs. In an evaluation of 343 valid EST sequences in the cDNA library, 197 unigenes were assembled, among which 190 unigenes (83.1% ESTs) were identified according to their significant similarities with proteins of known functions. All 343 EST sequences have been deposited in the dbEST GenBank under accession numbers JZ535802 to JZ536144. According to Arabidopsis MIPS functional category and GO classifications, we identified 193 unigenes of the 311 annotated ESTs, representing 72 non-redundant unigenes sharing similarities with genes related to the defense response. The sets of ESTs obtained provide a rich genetic resource, and 17 up-regulated genes related to salt stress resistance were identified by qRT-PCR. Six of these genes may contribute crucially to earlier and later stage salt stress resistance. Additionally, among the 343 unigene sequences, 22 simple sequence repeats (SSRs) were also identified, contributing to the study of A. canescens resources.
Gene-Environment Interactions of Circadian-Related Genes for Cardiometabolic Traits

DEFF Research Database (Denmark)

Dashti, Hassan S; Follis, Jack L; Smith, Caren E

2015-01-01

OBJECTIVE: Common circadian-related gene variants associate with increased risk for metabolic alterations including type 2 diabetes. However, little is known about whether diet and sleep could modify associations between circadian-related variants (CLOCK-rs1801260, CRY2-rs11605924, MTNR1B-rs13871...
Gene-environment interactions of circadian-related genes for cardiometabolic traits

Science.gov (United States)

Objective: Common circadian-related gene variants associate with increased risk for metabolic alterations including type 2 diabetes. However, little is known about whether diet and sleep could modify associations between circadian-related variants (CLOCK-rs1801260, CRY2-rs11605924, MTNR1B-rs1387153,...
Use of an activated beta-catenin to identify Wnt pathway target genes in Caenorhabditis elegans, including a subset of collagen genes expressed in late larval development.

Science.gov (United States)

Jackson, Belinda M; Abete-Luzi, Patricia; Krause, Michael W; Eisenmann, David M

2014-04-16

The Wnt signaling pathway plays a fundamental role during metazoan development, where it regulates diverse processes, including cell fate specification, cell migration, and stem cell renewal. Activation of the beta-catenin-dependent/canonical Wnt pathway up-regulates expression of Wnt target genes to mediate a cellular response. In the nematode Caenorhabditis elegans, a canonical Wnt signaling pathway regulates several processes during larval development; however, few target genes of this pathway have been identified. To address this deficit, we used a novel approach of conditionally activating Wnt signaling during a defined stage of larval life by overexpressing an activated beta-catenin protein, then used microarray analysis to identify genes showing altered expression compared with control animals. We identified 166 differentially expressed genes, of which 104 were up-regulated. A subset of the up-regulated genes was shown to have altered expression in mutants with decreased or increased Wnt signaling; we consider these genes to be bona fide C. elegans Wnt pathway targets. Among these was a group of six genes, including the cuticular collagen genes bli-1, col-38, col-49, and col-71. These genes show a peak of expression in the mid L4 stage during normal development, suggesting a role in adult cuticle formation. Consistent with this finding, reduction of function for several of the genes causes phenotypes suggestive of defects in cuticle function or integrity. Therefore, this work has identified a large number of putative Wnt pathway target genes during larval life, including a small subset of Wnt-regulated collagen genes that may function in synthesis of the adult cuticle.
Previously unidentified changes in renal cell carcinoma gene expression identified by parametric analysis of microarray data

International Nuclear Information System (INIS)

Lenburg, Marc E; Liou, Louis S; Gerry, Norman P; Frampton, Garrett M; Cohen, Herbert T; Christman, Michael F

2003-01-01

Renal cell carcinoma is a common malignancy that often presents as a metastatic disease for which there are no effective treatments. To gain insights into the mechanism of renal cell carcinogenesis, a number of genome-wide expression profiling studies have been performed. Surprisingly, there is very poor agreement among these studies as to which genes are differentially regulated. To better understand this lack of agreement we profiled renal cell tumor gene expression using genome-wide microarrays (45,000 probe sets) and compared our analysis to previous microarray studies. We hybridized total RNA isolated from renal cell tumors and adjacent normal tissue to Affymetrix U133A and U133B arrays. We removed samples with technical defects and removed probe sets that failed to exhibit sequence-specific hybridization in any of the samples. We detected differential gene expression in the resulting dataset with parametric methods and identified keywords that are overrepresented in the differentially expressed genes with the Fisher exact test. We identify 1,234 genes that are more than three-fold changed in renal tumors by t-test, 800 of which have not been previously reported to be altered in renal cell tumors. Of the only 37 genes that have been identified as being differentially expressed in three or more of five previous microarray studies of renal tumor gene expression, our analysis finds 33 of these genes (89%). A key to the sensitivity and power of our analysis is filtering out defective samples and genes that are not reliably detected. The widespread use of sample-wise voting schemes for detecting differential expression that do not control for false positives likely accounts for the poor overlap among previous studies. Among the many genes we identified using parametric methods that were not previously reported as being differentially expressed in renal cell tumors are several oncogenes and tumor suppressor genes that likely play important roles in renal cell
Gene co-expression analysis identifies gene clusters associated with isotropic and polarized growth in Aspergillus fumigatus conidia.

Science.gov (United States)

Baltussen, Tim J H; Coolen, Jordy P M; Zoll, Jan; Verweij, Paul E; Melchers, Willem J G

2018-04-26

Aspergillus fumigatus is a saprophytic fungus that extensively produces conidia. These microscopic asexually reproductive structures are small enough to reach the lungs. Germination of conidia followed by hyphal growth inside human lungs is a key step in the establishment of infection in immunocompromised patients. RNA-Seq was used to analyze the transcriptome of dormant and germinating A. fumigatus conidia. Construction of a gene co-expression network revealed four gene clusters (modules), each correlated with a growth phase (dormant, isotropic growth, polarized growth). Transcript levels of genes encoding secondary metabolites were high in dormant conidia. During isotropic growth, transcript levels of genes involved in cell wall modifications increased. Two modules encoding growth and cell cycle/DNA processing were associated with polarized growth. In addition, the co-expression network was used to identify highly connected intermodular hub genes. These genes may have a pivotal role in the respective module and could therefore be compelling therapeutic targets. Generally, cell wall remodeling is an important process during isotropic and polarized growth, characterized by an increase of transcripts coding for hyphal growth and cell cycle/DNA processing when polarized growth is initiated. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Use of tiling array data and RNA secondary structure predictions to identify noncoding RNA genes

DEFF Research Database (Denmark)

Weile, Christian; Gardner, Paul P; Hedegaard, Mads M

2007-01-01

neuroblastoma cell line SK-N-AS. Using this strategy, we identify thousands of human candidate RNA genes. To further verify the expression of these genes, we focused on candidate genes that had stable hairpin structures or a high level of covariance. Using northern blotting, we verify the expression of 2 out...
RNA-Seq analysis identifies key genes associated with haustorial development in the root hemiparasite Santalum album

Directory of Open Access Journals (Sweden)

Xinhua Zhang

2015-09-01

Santalum album (sandalwood) is one of the economically important plant species in the Santalaceae for its production of highly valued perfume oils. Sandalwood is also a hemiparasitic tree that obtains some of its water and simple nutrients by tapping into other plants through haustoria, which are highly specialized organs in parasitic angiosperms. However, an understanding of the molecular mechanisms involved in haustorium development is limited. In this study, RNA sequencing (RNA-seq) analyses were performed to identify changes in gene expression and metabolic pathways associated with the development of the S. album haustorium. A total of 56,011 non-redundant contigs with a mean contig size of 618 bp were obtained by de novo assembly of the transcriptome of haustoria and non-haustorial seedling roots. A substantial number of the identified differentially expressed genes were involved in cell wall metabolism and protein metabolism, as well as mitochondrial electron transport functions. Phytohormone-mediated regulation might play an important role during haustorial development. Especially, auxin signaling is likely to be essential for haustorial initiation, and genes related to cytokinin and gibberellin biosynthesis and metabolism are involved in haustorial development. Our results suggest that genes encoding nodulin-like proteins may be important for haustorial morphogenesis in S. album. The obtained sequence data will become a rich resource for future research in this interesting species. This information improves our understanding of haustorium development in root hemiparasitic species and will allow further exploration of the detailed molecular mechanisms underlying plant parasitism.
Integration of TP53, DREAM, MMB-FOXM1 and RB-E2F target gene analyses identifies cell cycle gene regulatory networks.

Science.gov (United States)

Fischer, Martin; Grossmann, Patrick; Padi, Megha; DeCaprio, James A

2016-07-27

Cell cycle (CC) and TP53 regulatory networks are frequently deregulated in cancer. While numerous genome-wide studies of TP53 and CC-regulated genes have been performed, significant variation between studies has made it difficult to assess regulation of any given gene of interest. To overcome the limitation of individual studies, we developed a meta-analysis approach to identify high confidence target genes that reflect their frequency of identification in independent datasets. Gene regulatory networks were generated by comparing differential expression of TP53 and CC-regulated genes with chromatin immunoprecipitation studies for TP53, RB1, E2F, DREAM, B-MYB, FOXM1 and MuvB. RNA-seq data from p21-null cells revealed that gene downregulation by TP53 generally requires p21 (CDKN1A). Genes downregulated by TP53 were also identified as CC genes bound by the DREAM complex. The transcription factors RB, E2F1 and E2F7 bind to a subset of DREAM target genes that function in G1/S of the CC while B-MYB, FOXM1 and MuvB control G2/M gene expression. Our approach yields high confidence ranked target gene maps for TP53, DREAM, MMB-FOXM1 and RB-E2F and enables prediction and distinction of CC regulation. A web-based atlas at www.targetgenereg.org enables assessing the regulation of any human gene of interest. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
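The core idea of this meta-analysis, ranking candidate target genes by how often independent datasets identify them, can be sketched in a few lines. This is a simplified illustration, not the published scoring scheme: the function name, the study labels and the gene identifiers are hypothetical, and ties are broken alphabetically purely to make the output deterministic.

```python
from collections import Counter
from typing import Dict, List, Set, Tuple


def rank_by_support(datasets: Dict[str, Set[str]]) -> List[Tuple[str, int]]:
    """Rank genes by the number of independent datasets that report them."""
    # Each dataset contributes at most one vote per gene because it is a set.
    counts = Counter(gene for genes in datasets.values() for gene in genes)
    # Highest support first; alphabetical order breaks ties.
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical target-gene calls from three studies (not data from the paper).
toy_datasets = {
    "study_1": {"GENE_A", "GENE_B", "GENE_C"},
    "study_2": {"GENE_A", "GENE_B"},
    "study_3": {"GENE_A", "GENE_D"},
}
ranking = rank_by_support(toy_datasets)
```

A gene reported by all three toy studies ranks first, which mirrors the abstract's notion of "high confidence" targets as those recovered repeatedly across datasets.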
De novo characterization of fall dormant and nondormant alfalfa (Medicago sativa L.) leaf transcriptome and identification of candidate genes related to fall dormancy.

Science.gov (United States)

Zhang, Senhao; Shi, Yinghua; Cheng, Ningning; Du, Hongqi; Fan, Wenna; Wang, Chengzhang

2015-01-01

Alfalfa (Medicago sativa L.) is one of the most widely cultivated perennial forage legumes worldwide. Fall dormancy is an adaptive character related to the biomass production and winter survival in alfalfa. The physiological, biochemical and molecular mechanisms causing fall dormancy and the related genes have not been well studied. In this study, we sequenced two standard varieties of alfalfa (dormant and non-dormant) at two time points and generated approximately 160 million high quality paired-end sequence reads using sequencing by synthesis (SBS) technology. The de novo transcriptome assembly generated a set of 192,875 transcripts with an average length of 856 bp representing about 165.1 Mb of the alfalfa leaf transcriptome. After assembly, 111,062 (57.6%) transcripts were annotated against the NCBI non-redundant database. A total of 30,165 (15.6%) transcripts were mapped to 323 Kyoto Encyclopedia of Genes and Genomes pathways. We also identified 41,973 simple sequence repeats, which can be used to generate markers for alfalfa, and 1,541 transcription factors were identified across 1,350 transcripts. Gene expression was compared between dormant and non-dormant alfalfa at different time points, and we identified several differentially expressed genes potentially related to fall dormancy. The Gene Ontology and pathway information was also identified. We sequenced and assembled the leaf transcriptome of alfalfa related to fall dormancy, and also identified some genes of interest involved in the fall dormancy mechanism. Thus, our research focused on studying fall dormancy in alfalfa through transcriptome sequencing. The sequencing and gene expression data generated in this study may be used further to elucidate the complete mechanisms governing fall dormancy in alfalfa.
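The assembly statistics quoted in this abstract are mutually consistent, which a short Python check can confirm. The variable names are illustrative, and 1 Mb is assumed to mean 10^6 bp.

```python
# Figures taken from the abstract above.
total_transcripts = 192_875
mean_length_bp = 856
annotated_transcripts = 111_062
kegg_mapped_transcripts = 30_165

# Assumption: 1 Mb = 1,000,000 bp.
assembly_size_mb = total_transcripts * mean_length_bp / 1e6
annotated_pct = 100 * annotated_transcripts / total_transcripts
kegg_pct = 100 * kegg_mapped_transcripts / total_transcripts

print(f"Assembly size: {assembly_size_mb:.1f} Mb")
print(f"Annotated against the non-redundant database: {annotated_pct:.1f}%")
print(f"Mapped to KEGG pathways: {kegg_pct:.1f}%")
```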
Single nucleotide polymorphisms (SNPs) in coding regions of canine dopamine- and serotonin-related genes

Directory of Open Access Journals (Sweden)

Lingaas Frode

2008-01-01

Background: Polymorphisms in genes of regulating enzymes, transporters and receptors of the neurotransmitters of the central nervous system have been associated with altered behaviour, and single nucleotide polymorphisms (SNPs) represent the most frequent type of genetic variation. The serotonin and dopamine signalling systems have a central influence on different behavioural phenotypes, both of invertebrates and vertebrates, and this study was undertaken in order to explore genetic variation that may be associated with variation in behaviour. Results: Single nucleotide polymorphisms in canine genes related to behaviour were identified by individually sequencing eight dogs (Canis familiaris) of different breeds. Eighteen genes from the dopamine and the serotonin systems were screened, revealing 34 SNPs distributed in 14 of the 18 selected genes. A total of 24,895 bp of coding sequence was sequenced, yielding an average frequency of one SNP per 732 bp (1/732). A total of 11 non-synonymous SNPs (nsSNPs), which may be involved in alteration of protein function, were detected. Of these 11 nsSNPs, six resulted in a substitution of amino acid residue with concomitant change in structural parameters. Conclusion: We have identified a number of coding SNPs in behaviour-related genes, several of which change the amino acids of the proteins. Some of the canine SNPs exist in codons that are evolutionarily conserved between five compared species, and predictions indicate that they may have a functional effect on the protein. The reported coding SNP frequency of the studied genes falls within the range of SNP frequencies reported earlier in the dog and other mammalian species. Novel SNPs are presented and the results show a significant genetic variation in expressed sequences in this group of genes. The results can contribute to an improved understanding of the genetics of behaviour.
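The reported SNP frequency follows directly from the counts given in the abstract, as this small Python calculation shows. The variable names are illustrative only.

```python
# Counts taken from the abstract above.
coding_sequence_bp = 24_895
snp_count = 34
genes_screened = 18
genes_with_snps = 14
nonsynonymous_snps = 11

# Average spacing between SNPs in the sequenced coding regions.
bp_per_snp = coding_sequence_bp / snp_count
share_of_genes_with_snps = genes_with_snps / genes_screened
share_nonsynonymous = nonsynonymous_snps / snp_count

print(f"One SNP per {bp_per_snp:.0f} bp")
print(f"Genes with at least one SNP: {share_of_genes_with_snps:.0%}")
print(f"Non-synonymous share of SNPs: {share_nonsynonymous:.0%}")
```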
Microarray analysis identified Puccinia striiformis f. sp. tritici genes involved in infection and sporulation.

Science.gov (United States)

Puccinia striiformis f. sp. tritici (Pst) causes stripe rust, one of the most important diseases of wheat worldwide. To identify Pst genes involved in infection and sporulation, a custom oligonucleotide Genechip was made using sequences of 442 genes selected from Pst cDNA libraries. Microarray analy...
Transcriptome analysis of Spodoptera frugiperda Sf9 cells reveals putative apoptosis-related genes and a preliminary apoptosis mechanism induced by azadirachtin.

Science.gov (United States)

Shu, Benshui; Zhang, Jingjing; Sethuraman, Veeran; Cui, Gaofeng; Yi, Xin; Zhong, Guohua

2017-10-16

As an important botanical pesticide, azadirachtin demonstrates broad insecticidal activity against many agricultural pests. The results of a previous study indicated the toxicity and apoptosis induction of azadirachtin in Spodoptera frugiperda Sf9 cells. However, the lack of genomic data has hindered a deeper investigation of apoptosis in Sf9 cells at a molecular level. In the present study, complete transcriptome data for the Sf9 cell line were generated using Illumina sequencing technology, and 97 putative apoptosis-related genes were identified through BLAST and KEGG orthologue annotations. Fragments of potential candidate apoptosis-related genes were cloned, and the mRNA expression patterns of ten identified genes regulated by azadirachtin were examined using qRT-PCR. Furthermore, Western blot analysis showed that six putative apoptosis-related proteins were upregulated after treatment with azadirachtin, while the protein Bcl-2 was downregulated. These data suggested that both intrinsic and extrinsic apoptotic signal pathways comprising the identified potential apoptosis-related genes were potentially active in S. frugiperda. In addition, the preliminary results revealed that caspase-dependent or caspase-independent apoptotic pathways could function in azadirachtin-induced apoptosis in Sf9 cells.
Integrating Diverse Types of Genomic Data to Identify Genes that Underlie Adverse Pregnancy Phenotypes.

Directory of Open Access Journals (Sweden)

Jibril Hirbo

Progress in understanding complex genetic diseases has been bolstered by synthetic approaches that overlay diverse data types and analyses to identify functionally important genes. Pre-term birth (PTB), a major complication of pregnancy, is a leading cause of infant mortality worldwide. A major obstacle in addressing PTB is that the mechanisms controlling parturition and birth timing remain poorly understood. Integrative approaches that overlay datasets derived from comparative genomics with function-derived ones have potential to advance our understanding of the genetics of birth timing, and thus provide insights into the genes that may contribute to PTB. We intersected data from fast evolving coding and non-coding gene regions in the human and primate lineage with data from genes expressed in the placenta, from genes that show enriched expression only in the placenta, as well as from genes that are differentially expressed in four distinct PTB clinical subtypes. A large fraction of genes that are expressed in placenta and differentially expressed in PTB clinical subtypes (23-34%) are fast evolving, and are associated with functions that include adhesion, neurodevelopmental and immune processes. Functional categories of genes that express fast evolution in coding regions differ from those linked to fast evolution in non-coding regions. Finally, there is a surprising lack of overlap between fast evolving genes that are differentially expressed in four PTB clinical subtypes. Integrative approaches, especially those that incorporate evolutionary perspectives, can be successful in identifying potential genetic contributions to complex genetic diseases, such as PTB.
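Intersecting gene lists of this kind reduces to set operations. The Python sketch below is only an illustration under stated assumptions: the gene identifiers (`GENE_A` and so on) and the function `fraction_fast_evolving` are hypothetical and do not come from the study.

```python
# Hypothetical gene sets; the identifiers are placeholders for illustration.
fast_evolving = {"GENE_A", "GENE_B", "GENE_C", "GENE_D"}
placenta_expressed = {"GENE_B", "GENE_C", "GENE_E", "GENE_F"}
differentially_expressed = {"GENE_C", "GENE_E", "GENE_G"}


def fraction_fast_evolving(genes, fast_set):
    """Return the share of `genes` that also appear in `fast_set`."""
    if not genes:
        return 0.0
    return len(genes & fast_set) / len(genes)


# Genes that are both placenta-expressed and differentially expressed.
candidates = placenta_expressed & differentially_expressed
overlap = candidates & fast_evolving
share = fraction_fast_evolving(candidates, fast_evolving)

print(f"Candidate genes: {sorted(candidates)}")
print(f"Fast evolving among candidates: {sorted(overlap)} ({share:.0%})")
```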
Analysis of the WUSCHEL-RELATED HOMEOBOX gene family in Pinus pinaster: New insights into the gene family evolution.

Science.gov (United States)

Alvarez, José M; Bueno, Natalia; Cañas, Rafael A; Avila, Concepción; Cánovas, Francisco M; Ordás, Ricardo J

2018-02-01

WUSCHEL-RELATED HOMEOBOX (WOX) genes are key players controlling stem cells in plants and can be divided into three clades according to the time of their appearance during plant evolution. Our knowledge of stem cell function in vascular plants other than angiosperms is limited; they separated from gymnosperms ca. 300 million years ago and their patterning during embryogenesis differs significantly. For this reason, we have used the model gymnosperm Pinus pinaster to identify WOX genes and perform a thorough analysis of their gene expression patterns. Using transcriptomic data from a comprehensive range of tissues and stages of development we have shown three major outcomes: that the P. pinaster genome encodes at least fourteen members of the WOX family spanning all the major clades, that the genome of gymnosperms contains a WOX gene with no homologues in angiosperms representing a transitional stage between intermediate- and WUS-clade proteins, and that we can detect discrete WUS and WOX5 transcripts for the first time in a gymnosperm. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
Unpredictable neonatal stress enhances adult anxiety and alters amygdala gene expression related to serotonin and GABA.

Science.gov (United States)

Sarro, E C; Sullivan, R M; Barr, G

2014-01-31

Anxiety-related disorders are among the most common psychiatric illnesses, thought to have both genetic and environmental causes. Early-life trauma, such as abuse from a caregiver, can be predictable or unpredictable, each resulting in increased prevalence and severity of a unique set of disorders. In this study, we examined the influence of early unpredictable trauma on both the behavioral expression of adult anxiety and gene expression within the amygdala. Neonatal rats were exposed to unpaired odor-shock conditioning for 5 days, which produces deficits in adult behavior and amygdala dysfunction. In adulthood, we used the Light/Dark box test to measure anxiety-related behaviors, measuring the latency to enter the lit area and quantified urination and defecation. The amygdala was then dissected and a microarray analysis was performed to examine changes in gene expression. Animals that had received early unpredictable trauma displayed significantly longer latencies to enter the lit area and more defecation and urination. The microarray analysis revealed over-represented genes related to learning and memory, synaptic transmission and trans-membrane transport. Gene ontology and pathway analysis identified highly represented disease states related to anxiety phenotypes, including social anxiety, obsessive-compulsive disorders, post-traumatic stress disorder and bipolar disorder. Addiction-related genes were also overrepresented in this analysis. Unpredictable shock during early development increased anxiety-like behaviors in adulthood with concomitant changes in genes related to neurotransmission, resulting in gene expression patterns similar to anxiety-related psychiatric disorders. Copyright © 2013 IBRO. Published by Elsevier Ltd. All rights reserved.

Comparative Transcriptomics to Identify Novel Genes and Pathways in Dinoflagellates

Science.gov (United States)

Ryan, D.

2016-02-01

The unarmored dinoflagellate Karenia brevis is among the most prominent harmful, bloom-forming phytoplankton species in the Gulf of Mexico. During blooms, the polyketides PbTx-1 and PbTx-2 (brevetoxins) are produced by K. brevis. Brevetoxins negatively impact human health and the Gulf shellfish harvest. However, the genes underlying brevetoxin synthesis are currently unknown. Because the K. brevis genome is extremely large (1 × 10^11 base pairs long), and with a high proportion of repetitive, non-coding DNA, it has not been sequenced. In fact, large, repetitive genomes are common among the dinoflagellate group. High-throughput RNA sequencing technology enabled us to assemble Karenia transcriptomes de novo and investigate potential genes in the brevetoxin pathway through comparative transcriptomics. The brevetoxin profile varies among K. brevis clonal cultures. For example, well-documented Wilson-CCFWC268 typically produces 8-10 pg PbTx per cell, whereas SP1 produces differences in gene expression. Of the 85,000 transcripts in the K. brevis transcriptome, 4,600 transcripts, including novel unannotated orthologs and putative polyketide synthases (PKSs), were only expressed by brevetoxin-producing K. brevis and K. papilionacea, not K. mikimotoi. Examination of gene expression between the typical- and low-toxin Wilson clones identified about 3,500 genes with significantly different expression levels, including 2 putative PKSs. One of the 2 PKSs was only found in the brevetoxin-producing Karenia species. These transcriptomes could not have been characterized without high-throughput RNA sequencing.
Transcriptome sequencing in pediatric acute lymphoblastic leukemia identifies fusion genes associated with distinct DNA methylation profiles

Directory of Open Access Journals (Sweden)

Yanara Marincevic-Zuniga

2017-08-01

Background: Structural chromosomal rearrangements that lead to expressed fusion genes are a hallmark of acute lymphoblastic leukemia (ALL). In this study, we performed transcriptome sequencing of 134 primary ALL patient samples to comprehensively detect fusion transcripts. Methods: We combined fusion gene detection with genome-wide DNA methylation analysis, gene expression profiling, and targeted sequencing to determine molecular signatures of emerging ALL subtypes. Results: We identified 64 unique fusion events distributed among 80 individual patients, of which over 50% have not previously been reported in ALL. Although the majority of the fusion genes were found only in a single patient, we identified several recurrent fusion gene families defined by promiscuous fusion gene partners, such as ETV6, RUNX1, PAX5, and ZNF384, or recurrent fusion genes, such as DUX4-IGH. Our data show that patients harboring these fusion genes displayed characteristic genome-wide DNA methylation and gene expression signatures in addition to distinct patterns in single nucleotide variants and recurrent copy number alterations. Conclusion: Our study delineates the fusion gene landscape in pediatric ALL, including both known and novel fusion genes, and highlights fusion gene families with shared molecular etiologies, which may provide additional information for prognosis and therapeutic options in the future.
Common mutations identified in the MLH1 gene in familial Lynch syndrome

Directory of Open Access Journals (Sweden)

Jisha Elias

2017-12-01

In this study we identified three families with Lynch syndrome from a rural cancer center in western India (KCHRC, Goraj, Gujarat), where 70-75 CRC patients are seen annually. DNA isolated from the blood of consenting family members of all three families (8-10 members/family) was subjected to NGS on an Illumina HiSeq 4000 platform. We identified unique mutations in the MLH1 gene in all three HNPCC families. Two of the three unrelated families shared a common mutation (154delA and 156delA). In total, 8 members of one family were identified as carriers of the 156delA mutation, of whom 5 were unaffected and 3 were affected (age of onset: 1 member <30 yr and 2 members >40 yr). The family with the 154delA mutation had 2 affected members (>40 yr) carrying the mutation. The LYS618DEL mutation, found in 8 members of the third family, was carried by both affected and unaffected members. Thus the common mutations identified in the MLH1 gene in two unrelated families conferred a high risk for Lynch syndrome, especially above the age of 40.
Foxtail millet NF-Y families: genome-wide survey and evolution analyses identified two functional genes important in abiotic stresses

Directory of Open Access Journals (Sweden)

Zhi-Juan Feng

2015-12-01

It was reported that Nuclear Factor Y (NF-Y) genes were involved in abiotic stress in plants. Foxtail millet (Setaria italica), an elite stress tolerant crop, provided an impetus for the investigation of the NF-Y families in abiotic responses. In the present study, a total of 39 NF-Y genes were identified in foxtail millet. Synteny analyses suggested that foxtail millet NF-Y genes had experienced rapid expansion and strong purifying selection during the process of plant evolution. De novo transcriptome assembly of foxtail millet revealed 11 drought up-regulated NF-Y genes. SiNF-YA1 and SiNF-YB8 were highly activated in leaves and/or roots by drought and salt stresses. Abscisic acid (ABA) and H2O2 played positive roles in the induction of SiNF-YA1 and SiNF-YB8 under stress treatments. Transient luciferase (LUC) expression assays revealed that SiNF-YA1 and SiNF-YB8 could activate the LUC gene driven by the tobacco (Nicotiana tabacum) NtERD10, NtLEA5, NtCAT, NtSOD or NtPOD promoter under normal or stress conditions. Overexpression of SiNF-YA1 enhanced drought and salt tolerance by activating stress-related genes NtERD10 and NtCAT1 and by maintaining relatively stable relative water content (RWC) and contents of chlorophyll, superoxide dismutase (SOD), peroxidase (POD), catalase (CAT) and malondialdehyde (MDA) in transgenic lines under stresses. SiNF-YB8 regulated expression of NtSOD, NtPOD, NtLEA5 and NtERD10 and conferred relatively high RWC and chlorophyll contents and low MDA content, resulting in drought and osmotic tolerance in transgenic lines under stresses. Therefore, SiNF-YA1 and SiNF-YB8 could activate stress-related genes and improve physiological traits, resulting in tolerance to abiotic stresses in plants. All these results will facilitate functional characterization of foxtail millet NF-Ys in future studies.
MiR-210 disturbs mitotic progression through regulating a group of mitosis-related genes

OpenAIRE

He, Jie; Wu, Jiangbin; Xu, Naihan; Xie, Weidong; Li, Mengnan; Li, Jianna; Jiang, Yuyang; Yang, Burton B.; Zhang, Yaou

2012-01-01

MiR-210 is up-regulated in multiple cancer types but its function is disputable and further investigation is necessary. Using a bioinformatics approach, we identified the putative target genes of miR-210 in hypoxia-induced CNE cells from genome-wide scale. Two functional gene groups related to cell cycle and RNA processing were recognized as the major targets of miR-210. Here, we investigated the molecular mechanism and biological consequence of miR-210 in cell cycle regulation, particularly ...
Candidate luminal B breast cancer genes identified by genome, gene expression and DNA methylation profiling.

Directory of Open Access Journals (Sweden)

Stéphanie Cornen

Breast cancers (BCs) of the luminal B subtype are estrogen receptor-positive (ER+), highly proliferative, resistant to standard therapies and have a poor prognosis. To better understand this subtype we compared DNA copy number aberrations (CNAs), DNA promoter methylation, gene expression profiles, and somatic mutations in nine selected genes, in 32 luminal B tumors with those observed in 156 BCs of the other molecular subtypes. Frequent CNAs included 8p11-p12 and 11q13.1-q13.2 amplifications, 7q11.22-q34, 8q21.12-q24.23, 12p12.3-p13.1, 12q13.11-q24.11, 14q21.1-q23.1, 17q11.1-q25.1, 20q11.23-q13.33 gains and 6q14.1-q24.2, 9p21.3-p24.3, 9q21.2, 18p11.31-p11.32 losses. A total of 237 and 101 luminal B-specific candidate oncogenes and tumor suppressor genes (TSGs) presented a deregulated expression in relation with their CNAs, including 11 genes previously reported associated with endocrine resistance. Interestingly, 88% of the potential TSGs are located within chromosome arm 6q, and seven candidate oncogenes are potential therapeutic targets. A total of 100 candidate oncogenes were validated in a public series of 5,765 BCs and the overexpression of 67 of these was associated with poor survival in luminal tumors. Twenty-four genes presented a deregulated expression in relation with a high DNA methylation level. FOXO3, PIK3CA and TP53 were the most frequently mutated genes among the nine tested. In a meta-analysis of next-generation sequencing data in 875 BCs, KCNB2 mutations were associated with luminal B cases while candidate TSGs MDN1 (6q15) and UTRN (6q24) were mutated in this subtype. In conclusion, we have reported luminal B candidate genes that may play a role in the development and/or hormone resistance of this aggressive subtype.
Integrated microarray and ChIP analysis identifies multiple Foxa2 dependent target genes in the notochord.

Science.gov (United States)

Tamplin, Owen J; Cox, Brian J; Rossant, Janet

2011-12-15

The node and notochord are key tissues required for patterning of the vertebrate body plan. Understanding the gene regulatory network that drives their formation and function is therefore important. Foxa2 is a key transcription factor at the top of this genetic hierarchy and finding its targets will help us to better understand node and notochord development. We performed an extensive microarray-based gene expression screen using sorted embryonic notochord cells to identify early notochord-enriched genes. We validated their specificity to the node and notochord by whole mount in situ hybridization. This provides the largest available resource of notochord-expressed genes, and therefore candidate Foxa2 target genes in the notochord. Using existing Foxa2 ChIP-seq data from adult liver, we were able to identify a set of genes expressed in the notochord that had associated regions of Foxa2-bound chromatin. Given that Foxa2 is a pioneer transcription factor, we reasoned that these sites might represent notochord-specific enhancers. Candidate Foxa2-bound regions were tested for notochord specific enhancer function in a zebrafish reporter assay and 7 novel notochord enhancers were identified. Importantly, sequence conservation or predictive models could not have readily identified these regions. Mutation of putative Foxa2 binding elements in two of these novel enhancers abrogated reporter expression and confirmed their Foxa2 dependence. The combination of highly specific gene expression profiling and genome-wide ChIP analysis is a powerful means of understanding developmental pathways, even for small cell populations such as the notochord. Copyright © 2011 Elsevier Inc. All rights reserved.
Analysis of Epidermal Growth Factor Receptor Related Gene Expression Changes in a Cellular and Animal Model of Parkinson’s Disease

Directory of Open Access Journals (Sweden)

In-Su Kim

2017-02-01

We employed transcriptome analysis of epidermal growth factor receptor related gene expression changes in cellular and animal models of Parkinson’s disease (PD). We used a well-known Parkinsonian toxin 1-methyl-4-phenylpyridine (MPP+) to induce neuronal apoptosis in the human neuroblastoma SH-SY5Y cell line. The MPP+-treatment of SH-SY5Y cells was capable of inducing neuro-apoptosis, but it remains unclear what kinds of transcriptional genes are affected by MPP+ toxicity. Therefore the pathways that were significantly perturbed in MPP+ treated human neuroblastoma SH-SY5Y cells were identified based on genome-wide gene expression data at two time points (24 and 48 h). We found that the Epidermal Growth Factor Receptor (EGFR) pathway-related genes showed significantly differential expression at all time points. The EGFR pathway has been linked to diverse cellular events such as proliferation, differentiation, and apoptosis. Further, to evaluate the functional significance of the altered EGFR related gene expression observed in MPP+-treated SH-SY5Y cells, the EGFR related GJB2 (Cx26) gene expression was analyzed in an MPP+-intoxicated animal PD model. Our findings identify that the EGFR signaling pathway and its related genes, such as Cx26, might play a significant role in dopaminergic (DAergic) neuronal cell death during the process of neuro-apoptosis and therefore can be focused on as potential targets for therapeutic intervention.
Gene methylation profiles of normal mucosa, and benign and malignant colorectal tumors identify early onset markers

Directory of Open Access Journals (Sweden)

Vatn Morten

2008-12-01

Background: Multiple epigenetic and genetic changes have been reported in colorectal tumors, but few of these have clinical impact. This study aims to pinpoint epigenetic markers that can discriminate between non-malignant and malignant tissue from the large bowel, i.e. markers with diagnostic potential. The methylation status of eleven genes (ADAMTS1, CDKN2A, CRABP1, HOXA9, MAL, MGMT, MLH1, NR3C1, PTEN, RUNX3, and SCGB3A1) was determined in 154 tissue samples including normal mucosa, adenomas, and carcinomas of the colorectum. The gene-specific and widespread methylation status among the carcinomas was related to patient gender and age, and microsatellite instability status. Possible CIMP tumors were identified by comparing the methylation profile with microsatellite instability (MSI), BRAF-, KRAS-, and TP53 mutation status. Results: The mean number of methylated genes per sample was 0.4 in normal colon mucosa from tumor-free individuals, 1.2 in mucosa from cancerous bowels, 2.2 in adenomas, and 3.9 in carcinomas. Widespread methylation was found in both adenomas and carcinomas. The promoters of ADAMTS1, MAL, and MGMT were frequently methylated in benign samples as well as in malignant tumors, independent of microsatellite instability. In contrast, normal mucosa samples taken from bowels without tumor were rarely methylated for the same genes. Hypermethylated CRABP1, MLH1, NR3C1, RUNX3, and SCGB3A1 were shown to be identifiers of carcinomas with microsatellite instability. In agreement with the CIMP concept, MSI and mutated BRAF were associated with samples harboring hypermethylation of several target genes. Conclusion: Methylated ADAMTS1, MGMT, and MAL are suitable as markers for early tumor detection.
Systems Biology-Based Investigation of Cellular Antiviral Drug Targets Identified by Gene-Trap Insertional Mutagenesis.

Directory of Open Access Journals (Sweden)

Feixiong Cheng

2016-09-01

Viruses require host cellular factors for successful replication. A comprehensive systems-level investigation of the virus-host interactome is critical for understanding the roles of host factors with the end goal of discovering new druggable antiviral targets. Gene-trap insertional mutagenesis is a high-throughput forward genetics approach to randomly disrupt (trap) host genes and discover host genes that are essential for viral replication, but not for host cell survival. In this study, we used libraries of randomly mutagenized cells to discover cellular genes that are essential for the replication of 10 distinct cytotoxic mammalian viruses, 1 gram-negative bacterium, and 5 toxins. We herein reported 712 candidate cellular genes, characterizing distinct topological network and evolutionary signatures, and occupying central hubs in the human interactome. Cell cycle phase-specific network analysis showed that host cell cycle programs played critical roles during viral replication (e.g. MYC and TAF4 regulating G0/1 phase). Moreover, the viral perturbation of host cellular networks reflected disease etiology in that host genes (e.g. CTCF, RHOA, and CDKN1B) identified were frequently essential and significantly associated with Mendelian and orphan diseases, or somatic mutations in cancer. A computational drug repositioning framework incorporating drug-gene signatures from the Connectivity Map into the virus-host interactome identified 110 putative druggable antiviral targets and prioritized several existing drugs (e.g. ajmaline) that may have potential for antiviral indications (e.g. anti-Ebola). In summary, this work provides a powerful methodology with a tight integration of gene-trap insertional mutagenesis testing and systems biology to identify new antiviral targets and drugs for the development of broadly acting and targeted clinical antiviral therapeutics.
Transcriptomics Analysis of Crassostrea hongkongensis for the Discovery of Reproduction-Related Genes

Science.gov (United States)

Tong, Ying; Zhang, Yang; Huang, Jiaomei; Xiao, Shu; Zhang, Yuehuan; Li, Jun; Chen, Jinhui; Yu, Ziniu

2015-01-01

Background: The reproductive mechanisms of mollusk species have been interesting targets in biological research because of the diverse reproductive strategies observed in this phylum. These species have also been studied for the development of fishery technologies in molluscan aquaculture. Although the molecular mechanisms underlying the reproductive process have been well studied in animal models, the relevant information from mollusks remains limited, particularly in species of great commercial interest. Crassostrea hongkongensis is the dominant oyster species that is distributed along the coast of the South China Sea and little genomic information on this species is available. Currently, high-throughput sequencing techniques have been widely used for investigating the basis of physiological processes and facilitating the establishment of adequate genetic selection programs. Results: The C. hongkongensis transcriptome included a total of 1,595,855 reads, which were generated by 454 sequencing and were assembled into 41,472 contigs using de novo methods. Contigs were clustered into 33,920 isotigs and further grouped into 22,829 isogroups. Approximately 77.6% of the isogroups were successfully annotated by the Nr database. More than 1,910 genes were identified as being related to reproduction. Some key genes involved in germline development, sex determination and differentiation were identified for the first time in C. hongkongensis (nanos, piwi, ATRX, FoxL2, β-catenin, etc.). Gene expression analysis indicated that vasa, nanos, piwi, ATRX, FoxL2, β-catenin and SRD5A1 were highly or specifically expressed in C. hongkongensis gonads. Additionally, 94,056 single nucleotide polymorphisms (SNPs) and 1,699 simple sequence repeats (SSRs) were compiled. Conclusions: Our study significantly increased C. hongkongensis genomic information based on transcriptomics analysis. The group of reproduction-related genes identified in the present study constitutes a new tool for research
Transcriptomics Analysis of Crassostrea hongkongensis for the Discovery of Reproduction-Related Genes.

Directory of Open Access Journals (Sweden)

Ying Tong

The reproductive mechanisms of mollusk species have been interesting targets in biological research because of the diverse reproductive strategies observed in this phylum. These species have also been studied for the development of fishery technologies in molluscan aquaculture. Although the molecular mechanisms underlying the reproductive process have been well studied in animal models, the relevant information from mollusks remains limited, particularly in species of great commercial interest. Crassostrea hongkongensis is the dominant oyster species that is distributed along the coast of the South China Sea and little genomic information on this species is available. Currently, high-throughput sequencing techniques have been widely used for investigating the basis of physiological processes and facilitating the establishment of adequate genetic selection programs. The C. hongkongensis transcriptome included a total of 1,595,855 reads, which were generated by 454 sequencing and were assembled into 41,472 contigs using de novo methods. Contigs were clustered into 33,920 isotigs and further grouped into 22,829 isogroups. Approximately 77.6% of the isogroups were successfully annotated by the Nr database. More than 1,910 genes were identified as being related to reproduction. Some key genes involved in germline development, sex determination and differentiation were identified for the first time in C. hongkongensis (nanos, piwi, ATRX, FoxL2, β-catenin, etc.). Gene expression analysis indicated that vasa, nanos, piwi, ATRX, FoxL2, β-catenin and SRD5A1 were highly or specifically expressed in C. hongkongensis gonads. Additionally, 94,056 single nucleotide polymorphisms (SNPs) and 1,699 simple sequence repeats (SSRs) were compiled. Our study significantly increased C. hongkongensis genomic information based on transcriptomics analysis. The group of reproduction-related genes identified in the present study constitutes a new tool for research on bivalve
Gene expression in triple-negative breast cancer in relation to survival.

Science.gov (United States)

Wang, Shuyang; Beeghly-Fadiel, Alicia; Cai, Qiuyin; Cai, Hui; Guo, Xingyi; Shi, Liang; Wu, Jie; Ye, Fei; Qiu, Qingchao; Zheng, Ying; Zheng, Wei; Bao, Ping-Ping; Shu, Xiao-Ou

2018-05-10

The identification of biomarkers related to the prognosis of triple-negative breast cancer (TNBC) is critically important for improved understanding of the biology that drives TNBC progression. We evaluated gene expression in total RNA isolated from formalin-fixed paraffin-embedded tumor samples using the NanoString nCounter assay for 469 TNBC cases from the Shanghai Breast Cancer Survival Study. We used Cox regression to quantify Hazard Ratios (HR) and corresponding confidence intervals (CI) for overall survival (OS) and disease-free survival (DFS) in models that included adjustment for breast cancer intrinsic subtype. Of 302 genes in our discovery analysis, 22 were further evaluated in relation to OS among 134 TNBC cases from the Nashville Breast Health Study and the Southern Community Cohort Study; 16 genes were further evaluated in relation to DFS in 335 TNBC cases from four gene expression omnibus datasets. Fixed-effect meta-analysis was used to combine results across data sources. Twofold higher expression of EOMES (HR 0.90, 95% CI 0.83-0.97), RASGRP1 (HR 0.89, 95% CI 0.82-0.97), and SOD2 (HR 0.80, 95% CI 0.66-0.96) was associated with better OS. Twofold higher expression of EOMES (HR 0.89, 95% CI 0.81-0.97) and RASGRP1 (HR 0.87, 95% CI 0.81-0.95) was also associated with better DFS. On the contrary, a doubling of FA2H (HR 1.14, 95% CI 1.06-1.22) and GSPT1 (HR 1.33, 95% CI 1.14-1.55) expression was associated with shorter DFS. We identified five genes (EOMES, FA2H, GSPT1, RASGRP1, and SOD2) that may serve as potential prognostic biomarkers and/or therapeutic targets for TNBC.
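The fixed-effect meta-analysis mentioned above can be illustrated with a minimal sketch, assuming the common inverse-variance approach on the log hazard-ratio scale; the abstract does not state which weighting scheme was used. The function name `fixed_effect_meta` and the example inputs are hypothetical and are not values from the study.

```python
import math

Z_95 = 1.959963984540054  # 97.5th percentile of the standard normal


def fixed_effect_meta(estimates):
    """Pool hazard ratios by inverse-variance weighting on the log scale.

    `estimates` is a list of (hr, ci_low, ci_high) tuples with 95% CIs.
    Returns the pooled (hr, ci_low, ci_high).
    Assumption: each CI is symmetric around log(hr), as in Cox regression.
    """
    weights = []
    weighted_logs = []
    for hr, ci_low, ci_high in estimates:
        # Recover the standard error of log(HR) from the CI width.
        se = (math.log(ci_high) - math.log(ci_low)) / (2 * Z_95)
        w = 1.0 / se ** 2
        weights.append(w)
        weighted_logs.append(w * math.log(hr))
    pooled_log = sum(weighted_logs) / sum(weights)
    pooled_se = math.sqrt(1.0 / sum(weights))
    return (
        math.exp(pooled_log),
        math.exp(pooled_log - Z_95 * pooled_se),
        math.exp(pooled_log + Z_95 * pooled_se),
    )


# Illustrative inputs only (hypothetical data sources, not study results).
pooled_hr, pooled_low, pooled_high = fixed_effect_meta(
    [(0.8, 0.64, 1.0), (0.8, 0.64, 1.0)]
)
```

Pooling two identical estimates leaves the hazard ratio unchanged and narrows the confidence interval, which is the expected behavior when data sources agree.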
An Integrative Analysis to Identify Driver Genes in Esophageal Squamous Cell Carcinoma.

Directory of Open Access Journals (Sweden)

Genta Sawada

Full Text Available Few driver genes have been well established in esophageal squamous cell carcinoma (ESCC). Identification of the genomic aberrations that contribute to changes in gene expression profiles can be used to predict driver genes. We searched for driver genes in ESCC by integrative analysis of gene expression microarray profiles and copy number data. To narrow down candidate genes, we performed survival analysis on expression data and tested the genetic vulnerability of each gene using public RNAi screening data. We confirmed the results by performing RNAi experiments and evaluating the clinical relevance of candidate genes in an independent ESCC cohort. We found 10 significantly recurrent copy number alterations accompanying gene expression changes, including loci 11q13.2, 7p11.2, 3q26.33, and 17q12, which harbored CCND1, EGFR, SOX2, and ERBB2, respectively. Analysis of survival data and RNAi screening data suggested that GRB7, located on 17q12, was a driver gene in ESCC. In ESCC cell lines harboring 17q12 amplification, knockdown of GRB7 reduced the proliferation, migration, and invasion capacities of cells. Moreover, siRNA targeting GRB7 had a synergistic inhibitory effect when combined with trastuzumab, an anti-ERBB2 antibody. Survival analysis of the independent cohort also showed that high GRB7 expression was associated with poor prognosis in ESCC. Our integrative analysis provided important insights into ESCC pathogenesis. We identified GRB7 as a novel ESCC driver gene and potential new therapeutic target.
Genomic Prediction and Association Mapping of Curd-Related Traits in Gene Bank Accessions of Cauliflower.

Science.gov (United States)

Thorwarth, Patrick; Yousef, Eltohamy A A; Schmid, Karl J

2018-02-02

Genetic resources are an important source of genetic variation for plant breeding. Genome-wide association studies (GWAS) and genomic prediction greatly facilitate the analysis and utilization of useful genetic diversity for improving complex phenotypic traits in crop plants. We explored the potential of GWAS and genomic prediction for improving curd-related traits in cauliflower (Brassica oleracea var. botrytis) by combining 174 randomly selected cauliflower gene bank accessions from two different gene banks. The collection was genotyped with genotyping-by-sequencing (GBS) and phenotyped for six curd-related traits at two locations and three growing seasons. A GWAS analysis based on 120,693 single-nucleotide polymorphisms identified a total of 24 significant associations for curd-related traits. The potential for genomic prediction was assessed with a genomic best linear unbiased prediction model and BayesB. Prediction abilities ranged from 0.10 to 0.66 for different traits and did not differ between prediction methods. Imputation of missing genotypes only slightly improved prediction ability. Our results demonstrate that GWAS and genomic prediction in combination with GBS and phenotyping of highly heritable traits can be used to identify useful quantitative trait loci and genotypes among genetically diverse gene bank material for subsequent utilization as genetic resources in cauliflower breeding. Copyright © 2018 Thorwarth et al.
Genomic Prediction and Association Mapping of Curd-Related Traits in Gene Bank Accessions of Cauliflower

Directory of Open Access Journals (Sweden)

Patrick Thorwarth

2018-02-01

Full Text Available Genetic resources are an important source of genetic variation for plant breeding. Genome-wide association studies (GWAS) and genomic prediction greatly facilitate the analysis and utilization of useful genetic diversity for improving complex phenotypic traits in crop plants. We explored the potential of GWAS and genomic prediction for improving curd-related traits in cauliflower (Brassica oleracea var. botrytis) by combining 174 randomly selected cauliflower gene bank accessions from two different gene banks. The collection was genotyped with genotyping-by-sequencing (GBS) and phenotyped for six curd-related traits at two locations and three growing seasons. A GWAS analysis based on 120,693 single-nucleotide polymorphisms identified a total of 24 significant associations for curd-related traits. The potential for genomic prediction was assessed with a genomic best linear unbiased prediction model and BayesB. Prediction abilities ranged from 0.10 to 0.66 for different traits and did not differ between prediction methods. Imputation of missing genotypes only slightly improved prediction ability. Our results demonstrate that GWAS and genomic prediction in combination with GBS and phenotyping of highly heritable traits can be used to identify useful quantitative trait loci and genotypes among genetically diverse gene bank material for subsequent utilization as genetic resources in cauliflower breeding.
Relative expression of genes related with cold tolerance in ...

African Journals Online (AJOL)

Low temperature is one of the main abiotic stresses affecting rice yield in Chile. Alterations in phenology and physiology of the crop are observed after a cold event. The objective of this work was to study the relative expression of genes related with cold stress in Chilean cultivars of rice. For this, we analyzed the expression ...
State-related alterations of gene expression in bipolar disorder

DEFF Research Database (Denmark)

Munkholm, Klaus; Vinberg, Maj; Berk, Michael

2012-01-01

Munkholm K, Vinberg M, Berk M, Kessing LV. State-related alterations of gene expression in bipolar disorder: a systematic review. Bipolar Disord 2012: 14: 684-696. © 2012 The Authors. Journal compilation © 2012 John Wiley & Sons A/S. Objective: Alterations in gene expression in bipolar disorder have been found in numerous studies. It is unclear whether such alterations are related to specific mood states. As a biphasic disorder, mood state-related alterations in gene expression have the potential to point to markers of disease activity, and trait-related alterations might indicate vulnerability pathways. This review therefore evaluated the evidence for whether gene expression in bipolar disorder is state or trait related. Methods: A systematic review, using the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guideline for reporting systematic reviews, based...
Linking the salt transcriptome with physiological responses of a salt-resistant Populus species as a strategy to identify genes important for stress acclimation.

Science.gov (United States)

Brinker, Monika; Brosché, Mikael; Vinocur, Basia; Abo-Ogiala, Atef; Fayyaz, Payam; Janz, Dennis; Ottow, Eric A; Cullmann, Andreas D; Saborowski, Joachim; Kangasjärvi, Jaakko; Altman, Arie; Polle, Andrea

2010-12-01

To investigate early salt acclimation mechanisms in a salt-tolerant poplar species (Populus euphratica), the kinetics of molecular, metabolic, and physiological changes during a 24-h salt exposure were measured. Three distinct phases of salt stress were identified by analyses of the osmotic pressure and the shoot water potential: dehydration, salt accumulation, and osmotic restoration associated with ionic stress. The duration and intensity of these phases differed between leaves and roots. Transcriptome analysis using P. euphratica-specific microarrays revealed clusters of coexpressed genes in these phases, with only 3% overlapping salt-responsive genes in leaves and roots. Acclimation of cellular metabolism to high salt concentrations involved remodeling of amino acid and protein biosynthesis and increased expression of molecular chaperones (dehydrins, osmotin). Leaves suffered initially from dehydration, which resulted in changes in transcript levels of mitochondrial and photosynthetic genes, indicating adjustment of energy metabolism. Initially, decreases in stress-related genes were found, whereas increases occurred only when leaves had restored the osmotic balance by salt accumulation. Comparative in silico analysis of the poplar stress regulon with Arabidopsis (Arabidopsis thaliana) orthologs was used as a strategy to reduce the number of candidate genes for functional analysis. Analysis of Arabidopsis knockout lines identified a lipocalin-like gene (AtTIL) and a gene encoding a protein with previously unknown functions (AtSIS) to play roles in salt tolerance. In conclusion, by dissecting the stress transcriptome of tolerant species, novel genes important for salt endurance can be identified.
Genome-wide strategies identify downstream target genes of chick connective tissue-associated transcription factors.

Science.gov (United States)

Orgeur, Mickael; Martens, Marvin; Leonte, Georgeta; Nassari, Sonya; Bonnin, Marie-Ange; Börno, Stefan T; Timmermann, Bernd; Hecht, Jochen; Duprez, Delphine; Stricker, Sigmar

2018-03-29

Connective tissues support organs and play crucial roles in development, homeostasis and fibrosis, yet our understanding of their formation is still limited. To gain insight into the molecular mechanisms of connective tissue specification, we selected five zinc-finger transcription factors - OSR1, OSR2, EGR1, KLF2 and KLF4 - based on their expression patterns and/or known involvement in connective tissue subtype differentiation. RNA-seq and ChIP-seq profiling of chick limb micromass cultures revealed a set of common genes regulated by all five transcription factors, which we describe as a connective tissue core expression set. This common core was enriched with genes associated with axon guidance and myofibroblast signature, including fibrosis-related genes. In addition, each transcription factor regulated a specific set of signalling molecules and extracellular matrix components. This suggests a concept whereby local molecular niches can be created by the expression of specific transcription factors impinging on the specification of local microenvironments. The regulatory network established here identifies common and distinct molecular signatures of limb connective tissue subtypes, provides novel insight into the signalling pathways governing connective tissue specification, and serves as a resource for connective tissue development. © 2018. Published by The Company of Biologists Ltd.

A stratified transcriptomics analysis of polygenic fat and lean mouse adipose tissues identifies novel candidate obesity genes.

Directory of Open Access Journals (Sweden)

Nicholas M Morton

Full Text Available Obesity and metabolic syndrome result from a complex interaction between genetic and environmental factors. In addition to brain-regulated processes, recent genome-wide association studies have indicated that genes highly expressed in adipose tissue affect the distribution and function of fat and thus contribute to obesity. Using a stratified transcriptome gene enrichment approach, we attempted to identify adipose tissue-specific obesity genes in the unique polygenic Fat (F) mouse strain generated by selective breeding over 60 generations for divergent adiposity from a comparator Lean (L) strain. To enrich for adipose tissue obesity genes, a 'snap-shot' pooled-sample transcriptome comparison of key fat depots and non-adipose tissues (muscle, liver, kidney) was performed. Known obesity quantitative trait loci (QTL) information for the model allowed us to further filter genes for increased likelihood of being causal or secondary for obesity. This successfully identified several genes previously linked to obesity (C1qr1, and Np3r) as positional QTL candidate genes elevated specifically in F line adipose tissue. A number of novel obesity candidate genes were also identified (Thbs1, Ppp1r3d, Tmepai, Trp53inp2, Ttc7b, Tuba1a, Fgf13, Fmr) that have inferred roles in fat cell function. Quantitative microarray analysis was then applied to the most phenotypically divergent adipose depot after exaggerating F and L strain differences with chronic high fat feeding, which revealed a distinct gene expression profile of line, fat depot and diet-responsive inflammatory, angiogenic and metabolic pathways. Selected candidate genes Npr3 and Thbs1, as well as Gys2, a non-QTL gene that otherwise passed our enrichment criteria, were characterised, revealing novel functional effects consistent with a contribution to obesity. A focussed candidate gene enrichment strategy in the unique F and L model has identified novel adipose tissue-enriched genes contributing to obesity.
Mining pathway associations for disease-related pathway activity analysis based on gene expression and methylation data.

Science.gov (United States)

Lee, Hyeonjeong; Shin, Miyoung

2017-01-01

The problem of discovering genetic markers as disease signatures is of great significance for the successful diagnosis, treatment, and prognosis of complex diseases. Even if many earlier studies worked on identifying disease markers from a variety of biological resources, they mostly focused on the markers of genes or gene-sets (i.e., pathways). However, these markers may not be enough to explain biological interactions between genetic variables that are related to diseases. Thus, in this study, our aim is to investigate distinctive associations among active pathways (i.e., pathway-sets) shown each in case and control samples which can be observed from gene expression and/or methylation data. The pathway-sets are obtained by identifying a set of associated pathways that are often active together over a significant number of class samples. For this purpose, gene expression or methylation profiles are first analyzed to identify significant (active) pathways via gene-set enrichment analysis. Then, regarding these active pathways, an association rule mining approach is applied to examine interesting pathway-sets in each class of samples (case or control). By doing so, the sets of associated pathways often working together in activity profiles are finally chosen as our distinctive signature of each class. The identified pathway-sets are aggregated into a pathway activity network (PAN), which facilitates the visualization of differential pathway associations between case and control samples. From our experiments with two publicly available datasets, we could find interesting PAN structures as the distinctive signatures of breast cancer and uterine leiomyoma cancer, respectively. Our pathway-set markers were shown to be superior or very comparable to other genetic markers (such as genes or gene-sets) in disease classification. Furthermore, the PAN structure, which can be constructed from the identified markers of pathway-sets, could provide deeper insights into
Identifying biological concepts from a protein-related corpus with a probabilistic topic model

Directory of Open Access Journals (Sweden)

Lu Xinghua

2006-02-01

Full Text Available Abstract Background Biomedical literature, e.g., MEDLINE, contains a wealth of knowledge regarding functions of proteins. Major recurring biological concepts within such text corpora represent the domains of this body of knowledge. The goal of this research is to identify the major biological topics/concepts from a corpus of protein-related MEDLINE© titles and abstracts by applying a probabilistic topic model. Results The latent Dirichlet allocation (LDA) model was applied to the corpus. Based on the Bayesian model selection, 300 major topics were extracted from the corpus. The majority of identified topics/concepts was found to be semantically coherent and most represented biological objects or concepts. The identified topics/concepts were further mapped to the controlled vocabulary of the Gene Ontology (GO) terms based on mutual information. Conclusion The major and recurring biological concepts within a collection of MEDLINE documents can be extracted by the LDA model. The identified topics/concepts provide parsimonious and semantically-enriched representation of the texts in a semantic space with reduced dimensionality and can be used to index text.
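The mapping of topics to Gene Ontology terms by mutual information can be illustrated with a minimal sketch. It assumes each document is reduced to two binary labels (assigned to a given topic or not, annotated with a given GO term or not); the abstract does not describe the exact formulation, so the function name `mutual_information` and the count-based setup are hypothetical.

```python
import math


def mutual_information(n11, n10, n01, n00):
    """Mutual information (in bits) between two binary document labels.

    n11: documents with both the topic and the GO term
    n10: documents with the topic only
    n01: documents with the GO term only
    n00: documents with neither
    """
    n = n11 + n10 + n01 + n00
    # Each cell: (joint count, topic marginal, GO-term marginal).
    cells = [
        (n11, n11 + n10, n11 + n01),
        (n10, n11 + n10, n10 + n00),
        (n01, n01 + n00, n11 + n01),
        (n00, n01 + n00, n10 + n00),
    ]
    mi = 0.0
    for joint, row, col in cells:
        if joint > 0:  # empty cells contribute nothing to the sum
            mi += (joint / n) * math.log2(joint * n / (row * col))
    return mi


# Illustrative counts only (hypothetical corpus, not data from the study).
independent = mutual_information(25, 25, 25, 25)
perfectly_linked = mutual_information(50, 0, 0, 50)
```

Under this formulation, a topic would be mapped to the GO term with the highest score; independent labels score zero, and labels that always co-occur in a balanced corpus score one bit.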
Effects of high temperature on photosynthesis and related gene expression in poplar

Science.gov (United States)

2014-01-01

Background High temperature, whether transitory or constant, causes physiological, biochemical and molecular changes that adversely affect tree growth and productivity by reducing photosynthesis. To elucidate the photosynthetic adaption response and examine the recovery capacity of trees under heat stress, we measured gas exchange, chlorophyll fluorescence, electron transport, water use efficiency, and reactive oxygen-producing enzyme activities in heat-stressed plants. Results We found that photosynthesis could completely recover after less than six hours of high temperature treatment, which might be a turning point in the photosynthetic response to heat stress. Genome-wide gene expression analysis at six hours of heat stress identified 29,896 differentially expressed genes (15,670 up-regulated and 14,226 down-regulated), including multiple classes of transcription factors. These interact with each other and regulate the expression of photosynthesis-related genes in response to heat stress, controlling carbon fixation and changes in stomatal conductance. Heat stress of more than twelve hours caused reduced electron transport, damaged photosystems, activated the glycolate pathway and caused H2O2 production; as a result, photosynthetic capacity did not recover completely. Conclusions This study provides a systematic physiological and global gene expression profile of the poplar photosynthetic response to heat stress and identifies the main limitations and threshold of photosynthesis under heat stress. It will expand our understanding of plant thermostability and provides a robust dataset for future studies. PMID:24774695
Cross-study analysis of gene expression data for intermediate neuroblastoma identifies two biological subtypes

International Nuclear Information System (INIS)

Warnat, Patrick; Oberthuer, André; Fischer, Matthias; Westermann, Frank; Eils, Roland; Brors, Benedikt

2007-01-01

Neuroblastoma patients show heterogeneous clinical courses ranging from life-threatening progression to spontaneous regression. Recently, gene expression profiles of neuroblastoma tumours were associated with clinically different phenotypes. However, such data is still rare for important patient subgroups, such as patients with MYCN non-amplified advanced stage disease. Prediction of the individual course of disease and optimal therapy selection in this cohort is challenging. Additional research effort is needed to describe the patterns of gene expression in this cohort and to identify reliable prognostic markers for this subset of patients. We combined gene expression data from two studies in a meta-analysis in order to investigate differences in gene expression of advanced stage (3 or 4) tumours without MYCN amplification that show contrasting outcomes (alive or dead) at five years after initial diagnosis. In addition, a predictive model for outcome was generated. Gene expression profiles from 66 patients were included from two studies using different microarray platforms. In the combined data set, 72 genes were identified as differentially expressed by meta-analysis at a false discovery rate (FDR) of 8.33%. Meta-analysis detected 34 differentially expressed genes that were not found as significant in either single study. Outcome prediction based on data of both studies resulted in a predictive accuracy of 77%. Moreover, the genes that were differentially expressed in subgroups of advanced stage patients without MYCN amplification accurately separated MYCN amplified tumours from low stage tumours without MYCN amplification. Our findings support the hypothesis that neuroblastoma consists of two biologically distinct subgroups that differ by characteristic gene expression patterns, which are associated with divergent clinical outcome
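Selecting differentially expressed genes at a given false discovery rate can be illustrated with a minimal sketch of the Benjamini-Hochberg step-up procedure. This is an assumption for illustration: the abstract does not say which FDR method was used, and the function name `benjamini_hochberg` and the example p-values are hypothetical.

```python
def benjamini_hochberg(p_values, fdr):
    """Return the sorted indices of hypotheses rejected at the given FDR level."""
    m = len(p_values)
    # Indices of the p-values in ascending order of p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        # Step-up rule: keep the largest rank whose p-value passes its threshold.
        if p_values[idx] <= rank / m * fdr:
            cutoff_rank = rank
    return sorted(order[:cutoff_rank])


# Illustrative p-values only (hypothetical genes, not data from the study).
selected = benjamini_hochberg([0.001, 0.02, 0.04, 0.5], fdr=0.05)
```

Because the rule is step-up, a gene whose own p-value misses its threshold is still selected when a gene ranked after it passes.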
Mechanism-based biomarker gene sets for glutathione depletion-related hepatotoxicity in rats

International Nuclear Information System (INIS)

Gao Weihua; Mizukawa, Yumiko; Nakatsu, Noriyuki; Minowa, Yosuke; Yamada, Hiroshi; Ohno, Yasuo; Urushidani, Tetsuro

2010-01-01

Chemical-induced glutathione depletion is thought to be caused by two types of toxicological mechanisms: PHO-type glutathione depletion [glutathione conjugated with chemicals such as phorone (PHO) or diethyl maleate (DEM)], and BSO-type glutathione depletion [i.e., glutathione synthesis inhibited by chemicals such as L-buthionine-sulfoximine (BSO)]. In order to identify mechanism-based biomarker gene sets for glutathione depletion in rat liver, male SD rats were treated with various chemicals including PHO (40, 120 and 400 mg/kg), DEM (80, 240 and 800 mg/kg), BSO (150, 450 and 1500 mg/kg), and bromobenzene (BBZ, 10, 100 and 300 mg/kg). Liver samples were taken 3, 6, 9 and 24 h after administration and examined for hepatic glutathione content, physiological and pathological changes, and gene expression changes using Affymetrix GeneChip Arrays. To identify differentially expressed probe sets in response to glutathione depletion, we focused on the following two courses of events for the two types of mechanisms of glutathione depletion: a) gene expression changes occurring simultaneously in response to glutathione depletion, and b) gene expression changes after glutathione was depleted. The gene expression profiles of the identified probe sets for the two types of glutathione depletion differed markedly at times during and after glutathione depletion, whereas Srxn1 was markedly increased for both types as glutathione was depleted, suggesting that Srxn1 is a key molecule in oxidative stress related to glutathione. The extracted probe sets were refined and verified using various compounds including 13 additional positive or negative compounds, and they established two useful marker sets. One contained three probe sets (Akr7a3, Trib3 and Gstp1) that could detect conjugation-type glutathione depletors any time within 24 h after dosing, and the other contained 14 probe sets that could detect glutathione depletors by any mechanism. These two sets, with appropriate scoring
Mutations in the collagen XII gene define a new form of extracellular matrix-related myopathy.

Science.gov (United States)

Hicks, Debbie; Farsani, Golara Torabi; Laval, Steven; Collins, James; Sarkozy, Anna; Martoni, Elena; Shah, Ashoke; Zou, Yaqun; Koch, Manuel; Bönnemann, Carsten G; Roberts, Mark; Lochmüller, Hanns; Bushby, Kate; Straub, Volker

2014-05-01

Bethlem myopathy (BM) [MIM 158810] is a slowly progressive muscle disease characterized by contractures and proximal weakness, which can be caused by mutations in one of the collagen VI genes (COL6A1, COL6A2 and COL6A3). However, there may be additional causal genes to identify as in ∼50% of BM cases no mutations in the COL6 genes are identified. In a cohort of ∼24 patients with a BM-like phenotype, we first sequenced 12 candidate genes based on their function, including genes for known binding partners of collagen VI, and those enzymes involved in its correct post-translational modification, assembly and secretion. Proceeding to whole-exome sequencing (WES), we identified mutations in the COL12A1 gene, a member of the FACIT collagens (fibril-associated collagens with interrupted triple helices) in five individuals from two families. Both families showed dominant inheritance with a clinical phenotype resembling classical BM. Family 1 had a single-base substitution that led to the replacement of one glycine residue in the triple-helical domain, breaking the Gly-X-Y repeating pattern, and Family 2 had a missense mutation, which created a mutant protein with an unpaired cysteine residue. Abnormality at the protein level was confirmed in both families by the intracellular retention of collagen XII in patient dermal fibroblasts. The mutation in Family 2 leads to the up-regulation of genes associated with the unfolded protein response (UPR) pathway and swollen, dysmorphic rough-ER. We conclude that the spectrum of causative genes in extracellular matrix (ECM)-related myopathies be extended to include COL12A1.
Understanding Autoimmune Mechanisms in Multiple Sclerosis Using Gene Expression Microarrays: Treatment Effect and Cytokine-related Pathways

Directory of Open Access Journals (Sweden)

A. Achiron

2004-01-01

Full Text Available Multiple sclerosis (MS) is a central nervous system disease in which activated autoreactive T-cells invade the blood brain barrier and initiate an inflammatory response that leads to myelin destruction and axonal loss. The etiology of MS, as well as the mechanisms associated with its unexpected onset, the unpredictable clinical course spanning decades, and the different rates of progression leading to disability over time, remains an enigma. We have applied gene expression microarray technology in peripheral blood mononuclear cells (PBMC) to better understand MS pathogenesis and better target treatment approaches. A signature of 535 genes was found to distinguish immunomodulatory treatment effects between 13 treated and 13 untreated MS patients. In addition, the expression pattern of 1109 gene transcripts that were previously reported to significantly differentiate between MS patients and healthy subjects was further analyzed to study the effect of cytokine-related pathways on disease pathogenesis. When relative gene expression for 26 MS patients was compared to 18 healthy controls, 30 genes related to various cytokine-associated pathways were identified. These genes belong to a variety of families such as interleukins, small inducible cytokine subfamily and tumor necrosis factor ligand and receptor. Further analysis disclosed seven cytokine-associated genes within the immunomodulatory treatment signature, and two cytokine-associated genes, SCYA4 (small inducible cytokine A4) and FCAR (Fc fragment of IgA, CD89), that were common to both the MS gene expression signature and the immunomodulatory treatment gene expression signature. Our results indicate that cytokine-associated genes are involved in various pathogenic pathways in MS and are also related to immunomodulatory treatment effects.
Metastatic canine mammary carcinomas can be identified by a gene expression profile that partly overlaps with human breast cancer profiles

International Nuclear Information System (INIS)

Klopfleisch, Robert; Lenze, Dido; Hummel, Michael; Gruber, Achim D

2010-01-01

Similar to human breast cancer, mammary tumors of the female dog are commonly associated with a fatal outcome due to the development of distant metastases. However, the molecular defects leading to metastasis are largely unknown and the value of canine mammary carcinoma as a model for human breast cancer is unclear. In this study, we analyzed the gene expression signatures associated with mammary tumor metastasis and asked for parallels with the human equivalent. Messenger RNA expression profiles of twenty-seven lymph node metastasis positive or negative canine mammary carcinomas were established by microarray analysis. Differentially expressed genes were functionally characterized and associated with molecular pathways. The findings were also correlated with published data on human breast cancer. Metastatic canine mammary carcinomas had 1,011 significantly differentially expressed genes when compared to non-metastatic carcinomas. Metastatic carcinomas had a significant up-regulation of genes associated with cell cycle regulation, matrix modulation, protein folding and proteasomal degradation, whereas cell differentiation genes, growth factor pathway genes and regulators of actin organization were significantly down-regulated. Interestingly, 265 of the 1,011 differentially expressed canine genes are also related to human breast cancer and, vice versa, parts of a human prognostic gene signature were identified in the expression profiles of the metastatic canine tumors. Metastatic canine mammary carcinomas can be discriminated from non-metastatic carcinomas by their gene expression profiles. More than one third of the differentially expressed genes are also described as relevant to human breast cancer. Many of the differentially expressed genes are linked to functions and pathways which appear to be relevant for the induction and maintenance of metastatic progression and may represent new therapeutic targets. Furthermore, dogs are in some aspects suitable as a
MiR-210 disturbs mitotic progression through regulating a group of mitosis-related genes.

Science.gov (United States)

He, Jie; Wu, Jiangbin; Xu, Naihan; Xie, Weidong; Li, Mengnan; Li, Jianna; Jiang, Yuyang; Yang, Burton B; Zhang, Yaou

2013-01-07

MiR-210 is up-regulated in multiple cancer types, but its function is disputed and further investigation is necessary. Using a bioinformatics approach, we identified the putative target genes of miR-210 in hypoxia-induced CNE cells on a genome-wide scale. Two functional gene groups related to cell cycle and RNA processing were recognized as the major targets of miR-210. Here, we investigated the molecular mechanism and biological consequence of miR-210 in cell cycle regulation, particularly mitosis. Hypoxia-induced up-regulation of miR-210 was highly correlated with the down-regulation of a group of mitosis-related genes, including Plk1, Cdc25B, Cyclin F, Bub1B and Fam83D. MiR-210 suppressed the expression of these genes by directly targeting their 3'-UTRs. Over-expression of exogenous miR-210 disturbed mitotic progression and caused aberrant mitosis. Furthermore, a miR-210 mimic at pharmacological doses reduced tumor formation in a mouse metastatic tumor model. Taken together, these results indicate that miR-210 disturbs mitosis by targeting multiple genes involved in mitotic progression, which may contribute to its inhibitory role in tumor formation.
A genome scale RNAi screen identifies GLI1 as a novel gene regulating vorinostat sensitivity.

Science.gov (United States)

Falkenberg, K J; Newbold, A; Gould, C M; Luu, J; Trapani, J A; Matthews, G M; Simpson, K J; Johnstone, R W

2016-07-01

Vorinostat is an FDA-approved histone deacetylase inhibitor (HDACi) that has proven clinical success in some patients; however, it remains unclear why certain patients remain unresponsive to this agent and other HDACis. Constitutive STAT (signal transducer and activator of transcription) activation, overexpression of prosurvival Bcl-2 proteins and loss of HR23B have been identified as potential biomarkers of HDACi resistance; however, none have yet been used to aid the clinical utility of HDACi. Herein, we aimed to further elucidate vorinostat-resistance mechanisms through a functional genomics screen to identify novel genes that when knocked down by RNA interference (RNAi) sensitized cells to vorinostat-induced apoptosis. A synthetic lethal functional screen using a whole-genome protein-coding RNAi library was used to identify genes that when knocked down cooperated with vorinostat to induce tumor cell apoptosis in otherwise resistant cells. Through iterative screening, we identified 10 vorinostat-resistance candidate genes that sensitized specifically to vorinostat. One of these vorinostat-resistance genes was GLI1, an oncogene not previously known to regulate the activity of HDACi. Treatment of vorinostat-resistant cells with the GLI1 small-molecule inhibitor, GANT61, phenocopied the effect of GLI1 knockdown. The mechanism by which GLI1 loss of function sensitized tumor cells to vorinostat-induced apoptosis is at least in part through interactions with vorinostat to alter gene expression in a manner that favored apoptosis. Upon GLI1 knockdown and vorinostat treatment, BCL2L1 expression was repressed and overexpression of BCL2L1 inhibited GLI1-knockdown-mediated vorinostat sensitization. Taken together, we present the identification and characterization of GLI1 as a new HDACi resistance gene, providing a strong rationale for development of GLI1 inhibitors for clinical use in combination with HDACi therapy.
Identifying optimal reference genes for the normalization of microRNA expression in cucumber under viral stress

Science.gov (United States)

Liang, Chaoqiong; Hao, Jianjun; Meng, Yan; Luo, Laixin; Li, Jianqiang

2018-01-01

Cucumber green mottle mosaic virus (CGMMV) is an economically important pathogen and causes significant reduction of both yield and quality of cucumber (Cucumis sativus). Currently, there are no satisfactory strategies for controlling the disease. A better understanding of microRNA (miRNA) expression related to the regulation of plant-virus interactions and virus resistance would be of great assistance when developing control strategies for CGMMV. However, accurate expression analysis is highly dependent on robust and reliable reference genes used as internal controls for normalization of miRNA expression. The most commonly used reference genes in CGMMV-infected cucumber are not universally expressed; their expression depends on tissue type and stage of plant development. It is therefore crucial to identify suitable reference genes when investigating the role of miRNA expression. In this study, seven reference genes, including Actin, Tubulin, EF-1α, 18S rRNA, Ubiquitin, GAPDH and Cyclophilin, were evaluated for the most accurate results in analyses using reverse transcription-quantitative polymerase chain reaction (RT-qPCR). Gene expression was assayed on cucumber leaves, stems and roots that were collected at different days post inoculation with CGMMV. The expression data were analyzed using algorithms including delta-Ct, geNorm, NormFinder, and BestKeeper as well as the comparative tool RefFinder. The reference genes were subsequently validated using miR159. The results showed that EF-1α and GAPDH were the most reliable reference genes for normalizing miRNA expression in leaf, root and stem samples, while Ubiquitin and EF-1α were the most suitable combination overall. PMID:29543906
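The comparative delta-Ct approach named in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the Ct values are invented, the gene labels are plain-ASCII stand-ins, and the function name delta_ct_stability is hypothetical. The idea is that for each pair of candidate genes the Ct difference is computed per sample, and a gene whose pairwise differences vary least (low mean standard deviation) is treated as more stable.

```python
from itertools import combinations
from statistics import mean, stdev

# Invented Ct values for three candidate reference genes across four samples.
ct = {
    "EF-1alpha": [20.1, 20.3, 20.2, 20.4],
    "GAPDH": [22.0, 22.3, 22.1, 22.4],
    "Actin": [18.5, 19.9, 18.0, 20.6],
}

def delta_ct_stability(ct_values):
    """Return gene -> mean SD of pairwise delta-Ct (lower means more stable)."""
    sds = {gene: [] for gene in ct_values}
    for a, b in combinations(ct_values, 2):
        # Per-sample Ct difference between the two genes.
        diffs = [x - y for x, y in zip(ct_values[a], ct_values[b])]
        sd = stdev(diffs)
        sds[a].append(sd)
        sds[b].append(sd)
    return {gene: mean(values) for gene, values in sds.items()}

stability = delta_ct_stability(ct)
least_stable = max(stability, key=stability.get)
```

With these made-up numbers, Actin varies independently of the other two genes and therefore receives the highest mean SD; tools such as geNorm or NormFinder use different stability measures and are not reproduced here.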
Systematic Analysis of the 4-Coumarate:Coenzyme A Ligase (4CL) Related Genes and Expression Profiling during Fruit Development in the Chinese Pear

Directory of Open Access Journals (Sweden)

Yunpeng Cao

2016-10-01

In plants, 4-coumarate:coenzyme A ligases (4CLs), comprising some of the adenylate-forming enzymes, are key enzymes involved in regulating lignin metabolism and the biosynthesis of flavonoids and other secondary metabolites. Although several 4CL-related proteins were shown to play roles in secondary metabolism, no comprehensive study on 4CL-related genes in the pear and other Rosaceae species has been reported. In this study, we identified 4CL-related genes in the apple, peach, yangmei, and pear genomes using DNATOOLS software and inferred their evolutionary relationships using phylogenetic analysis, collinearity analysis, conserved motif analysis, and structure analysis. A total of 149 4CL-related genes in four Rosaceous species (pear, apple, peach, and yangmei) were identified, with 30 members in the pear. We explored the functions of several 4CL and acyl-coenzyme A synthetase (ACS) genes during the development of pear fruit by quantitative real-time PCR (qRT-PCR). We found that duplication events had occurred in the 30 4CL-related genes in the pear. These duplicated 4CL-related genes are distributed unevenly across all pear chromosomes except chromosomes 4, 8, 11, and 12. The results of this study provide a basis for further investigation of both the functions and evolutionary history of 4CL-related genes.
Genetic interactions of MAF1 identify a role for Med20 in transcriptional repression of ribosomal protein genes.

Directory of Open Access Journals (Sweden)

Ian M Willis

2008-07-01

Transcriptional repression of ribosomal components and tRNAs is coordinately regulated in response to a wide variety of environmental stresses. Part of this response involves the convergence of different nutritional and stress signaling pathways on Maf1, a protein that is essential for repressing transcription by RNA polymerase (pol) III in Saccharomyces cerevisiae. Here we identify the functions buffering yeast cells that are unable to down-regulate transcription by RNA pol III. MAF1 genetic interactions identified in screens of non-essential gene-deletions and conditionally expressed essential genes reveal a highly interconnected network of 64 genes involved in ribosome biogenesis, RNA pol II transcription, tRNA modification, ubiquitin-dependent proteolysis and other processes. A survey of non-essential MAF1 synthetic sick/lethal (SSL) genes identified six gene-deletions that are defective in transcriptional repression of ribosomal protein (RP) genes following rapamycin treatment. This subset of MAF1 SSL genes included MED20, which encodes a head module subunit of the RNA pol II Mediator complex. Genetic interactions between MAF1 and subunits in each structural module of Mediator were investigated to examine the functional relationship between these transcriptional regulators. Gene expression profiling identified a prominent and highly selective role for Med20 in the repression of RP gene transcription under multiple conditions. In addition, attenuated repression of RP genes by rapamycin was observed in a strain deleted for the Mediator tail module subunit Med16. The data suggest that Mediator and Maf1 function in parallel pathways to negatively regulate RP mRNA and tRNA synthesis.
Isolation and characterization of Agouti: a diabetes/obesity related gene

Energy Technology Data Exchange (ETDEWEB)

Woychik, Richard P. (Knoxville, TN)

2000-06-27

The present invention relates to the cloning and expression of the Agouti gene and analogous genes in transformed, transfected and transgenic mice. The present invention provides an animal model for the study of diabetes, obesity and tumors for the testing of potential therapeutic agents. The present invention provides oligonucleotide probes for the detection of the Agouti gene and mutations in the gene. The present invention also relates to the isolation and recombinant production of the Agouti gene product, production of antibodies to the Agouti gene product and their use as diagnostic and therapeutic agents.
Isolation and characterization of Agouti: a diabetes/obesity related gene

Energy Technology Data Exchange (ETDEWEB)

Woychik, Richard P. (Knoxville, TN)

1998-01-01

The present invention relates to the cloning and expression of the Agouti gene and analogous genes in transformed, transfected and transgenic mice. The present invention provides an animal model for the study of diabetes, obesity and tumors for the testing of potential therapeutic agents. The present invention provides oligonucleotide probes for the detection of the Agouti gene and mutations in the gene. The present invention also relates to the isolation and recombinant production of the Agouti gene product, production of antibodies to the Agouti gene product and their use as diagnostic and therapeutic agents.
Genetic Susceptibility to Vitiligo: GWAS Approaches for Identifying Vitiligo Susceptibility Genes and Loci

Science.gov (United States)

Shen, Changbing; Gao, Jing; Sheng, Yujun; Dou, Jinfa; Zhou, Fusheng; Zheng, Xiaodong; Ko, Randy; Tang, Xianfa; Zhu, Caihong; Yin, Xianyong; Sun, Liangdan; Cui, Yong; Zhang, Xuejun

2016-01-01

Vitiligo is an autoimmune disease with a strong genetic component, characterized by areas of depigmented skin resulting from loss of epidermal melanocytes. Genetic factors are known to play key roles in vitiligo through discoveries in association studies and family studies. Previously, vitiligo susceptibility genes were mainly revealed through linkage analysis and candidate gene studies. Recently, our understanding of the genetic basis of vitiligo has been rapidly advancing through genome-wide association studies (GWAS). More than 40 robust susceptibility loci have been identified and confirmed to be associated with vitiligo by using GWAS. Most of these associated genes participate in important pathways involved in the pathogenesis of vitiligo. Many susceptibility loci with unknown functions in the pathogenesis of vitiligo have also been identified, indicating that additional molecular mechanisms may contribute to the risk of developing vitiligo. In this review, we summarize the key loci of genome-wide significance that have been shown to influence vitiligo risk. These genetic loci may help build the foundation for genetic diagnosis and personalized treatment for patients with vitiligo in the future. However, substantial additional studies, including gene-targeted and functional studies, are required to confirm the causality of the genetic variants and their biological relevance in the development of vitiligo. PMID:26870082
A framework to identify gene expression profiles in a model of inflammation induced by lipopolysaccharide after treatment with thalidomide

Directory of Open Access Journals (Sweden)

Paiva Renata T

2012-06-01

Background: Thalidomide is an anti-inflammatory and anti-angiogenic drug currently used for the treatment of several diseases, including erythema nodosum leprosum, which occurs in patients with lepromatous leprosy. In this research, we use DNA microarray analysis to identify the impact of thalidomide on gene expression responses in human cells after lipopolysaccharide (LPS) stimulation. We employed a two-stage framework. Initially, we identified 1584 altered genes in response to LPS. Modulation of this set of genes was then analyzed in the LPS-stimulated cells treated with thalidomide. Results: We identified 64 genes with altered expression induced by thalidomide using the rank product method. In addition, the lists of up-regulated and down-regulated genes were investigated by means of bioinformatics functional analysis, which allowed for the identification of biological processes affected by thalidomide. Confirmatory analysis was done in five of the identified genes using real-time PCR. Conclusions: The results showed some genes that can further our understanding of the biological mechanisms in the action of thalidomide. Of the five genes evaluated with real-time PCR, three were down-regulated and two were up-regulated, confirming the initial results of the microarray analysis.
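The rank product method mentioned in the abstract above can be illustrated with a small sketch. The abstract gives no implementation details, so everything here is an assumption for illustration: the gene labels and log fold changes are invented, the function name rank_products is hypothetical, and the permutation step normally used to assess significance is omitted. For each replicate, genes are ranked by fold change (rank 1 is the most up-regulated), and the rank product is the geometric mean of a gene's ranks across replicates.

```python
import math

# Invented log fold changes (treated vs control), one value per replicate.
fold_changes = {
    "geneA": [2.1, 1.8, 2.5],
    "geneB": [0.3, 0.9, 0.1],
    "geneC": [-1.2, -0.8, -1.5],
    "geneD": [1.0, 0.2, 0.4],
}

def rank_products(fc):
    """Return gene -> geometric mean of its up-regulation ranks across replicates."""
    genes = list(fc)
    n_reps = len(next(iter(fc.values())))
    log_sum = {g: 0.0 for g in genes}
    for i in range(n_reps):
        # Rank 1 goes to the largest fold change in replicate i.
        ordered = sorted(genes, key=lambda g: fc[g][i], reverse=True)
        for rank, g in enumerate(ordered, start=1):
            log_sum[g] += math.log(rank)
    # Averaging log-ranks and exponentiating gives the geometric mean.
    return {g: math.exp(log_sum[g] / n_reps) for g in genes}

rp = rank_products(fold_changes)
most_consistently_up = min(rp, key=rp.get)
```

A small rank product means a gene sits near the top of the list in every replicate; ranking in the opposite direction gives the corresponding score for down-regulation.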
De novo assembly and characterization of the transcriptome of the parasitic weed dodder identifies genes associated with plant parasitism.

Science.gov (United States)

Ranjan, Aashish; Ichihashi, Yasunori; Farhi, Moran; Zumstein, Kristina; Townsley, Brad; David-Schwartz, Rakefet; Sinha, Neelima R

2014-11-01

Parasitic flowering plants are one of the most destructive agricultural pests and have major impact on crop yields throughout the world. Being dependent on finding a host plant for growth, parasitic plants penetrate their host using specialized organs called haustoria. Haustoria establish vascular connections with the host, which enable the parasite to steal nutrients and water. The underlying molecular and developmental basis of parasitism by plants is largely unknown. In order to investigate the process of parasitism, RNAs from different stages (i.e. seed, seedling, vegetative strand, prehaustoria, haustoria, and flower) were used to de novo assemble and annotate the transcriptome of the obligate plant stem parasite dodder (Cuscuta pentagona). The assembled transcriptome was used to dissect transcriptional dynamics during dodder development and parasitism and identified key gene categories involved in the process of plant parasitism. Host plant infection is accompanied by increased expression of parasite genes underlying transport and transporter categories, response to stress and stimuli, as well as genes encoding enzymes involved in cell wall modifications. By contrast, expression of photosynthetic genes is decreased in the dodder infective stages compared with normal stem. In addition, genes relating to biosynthesis, transport, and response of phytohormones, such as auxin, gibberellins, and strigolactone, were differentially expressed in the dodder infective stages compared with stems and seedlings. This analysis sheds light on the transcriptional changes that accompany plant parasitism and will aid in identifying potential gene targets for use in controlling the infestation of crops by parasitic weeds. © 2014 American Society of Plant Biologists. All Rights Reserved.
A comparative gene analysis with rice identified orthologous group II HKT genes and their association with Na(+) concentration in bread wheat.

Science.gov (United States)

Ariyarathna, H A Chandima K; Oldach, Klaus H; Francki, Michael G

2016-01-19

Although the HKT transporter genes ascertain some of the key determinants of crop salt tolerance mechanisms, the diversity and functional role of group II HKT genes are not clearly understood in bread wheat. The advanced knowledge on rice HKT and whole genome sequence was, therefore, used in comparative gene analysis to identify orthologous wheat group II HKT genes and their role in trait variation under different saline environments. The four group II HKTs in rice identified two orthologous gene families from bread wheat, including the known TaHKT2;1 gene family and a new distinctly different gene family designated as TaHKT2;2. A single copy of TaHKT2;2 was found on each homeologous chromosome arm 7AL, 7BL and 7DL and each gene was expressed in leaf blade, sheath and root tissues under non-stressed and at 200 mM salt stressed conditions. The proteins encoded by genes of the TaHKT2;2 family revealed more than 93% amino acid sequence identity but ≤52% amino acid identity compared to the proteins encoded by TaHKT2;1 family. Specifically, variations in known critical domains predicted functional differences between the two protein families. Similar to orthologous rice genes on chromosome 6L, TaHKT2;1 and TaHKT2;2 genes were located approximately 3 kb apart on wheat chromosomes 7AL, 7BL and 7DL, forming a static syntenic block in the two species. The chromosomal region on 7AL containing TaHKT2;1 7AL-1 co-located with QTL for shoot Na(+) concentration and yield in some saline environments. The differences in copy number, genes sequences and encoded proteins between TaHKT2;2 homeologous genes and other group II HKT gene families within and across species likely reflect functional diversity for ion selectivity and transport in plants. Evidence indicated that neither TaHKT2;2 nor TaHKT2;1 were associated with primary root Na(+) uptake but TaHKT2;1 may be associated with trait variation for Na(+) exclusion and yield in some but not all saline environments.

Pulmonary phenotypes associated with genetic variation in telomere-related genes.

Science.gov (United States)

Hoffman, Thijs W; van Moorsel, Coline H M; Borie, Raphael; Crestani, Bruno

2018-05-01

Genomic mutations in telomere-related genes have been recognized as a cause of familial forms of idiopathic pulmonary fibrosis (IPF). However, it has become increasingly clear that telomere syndromes and telomere shortening are associated with various types of pulmonary disease. Additionally, single nucleotide polymorphisms (SNPs) in telomere-related genes were found to be risk factors for the development of pulmonary disease. This review focuses on recent updates on pulmonary phenotypes associated with genetic variation in telomere-related genes. Genomic mutations in seven telomere-related genes cause pulmonary disease. Pulmonary phenotypes associated with these mutations range from many forms of pulmonary fibrosis to emphysema and pulmonary vascular disease. Telomere-related mutations account for up to 10% of sporadic IPF, 25% of familial IPF, 10% of connective-tissue disease-associated interstitial lung disease, and 1% of COPD. Mixed disease forms have also been found. Furthermore, SNPs in TERT, TERC, OBFC1, and RTEL1, as well as short telomere length, have been associated with several pulmonary diseases. Treatment of pulmonary disease caused by telomere-related gene variation is currently based on disease diagnosis and not on the underlying cause. Pulmonary phenotypes found in carriers of telomere-related gene mutations and SNPs are primarily pulmonary fibrosis, sometimes emphysema and rarely pulmonary vascular disease. Genotype-phenotype relations are weak, suggesting that environmental factors and genetic background of patients determine disease phenotypes to a large degree. A disease model is presented in which genomic variation in telomere-related genes causes specific pulmonary disease phenotypes when triggered by environmental exposure, comorbidity, or unknown factors.
Comprehensive investigation of cytokine- and immune-related gene variants in HBV-associated hepatocellular carcinoma patients.

Science.gov (United States)

Yu, Fengxue; Zhang, Xiaolin; Tian, Suzhai; Geng, Lianxia; Xu, Weili; Ma, Ning; Wang, Mingbang; Jia, Yuan; Liu, Xuechen; Ma, Junji; Quan, Yuan; Zhang, Chaojun; Guo, Lina; An, Wenting; Liu, Dianwu

2017-12-22

Host genotype may be closely related to the different outcomes of Hepatitis B virus (HBV) infection. To identify the association of variants and HBV infection, we comprehensively investigated the cytokine- and immune-related gene mutations in patients with HBV-associated hepatocellular carcinoma (HBV-HCC). Fifty-three HBV-HCC patients, 53 self-healing cases (SH) with HBV infection history and 53 healthy controls (HCs) were recruited, and the whole exon regions of 404 genes were sequenced at >900× depth. Comprehensive variants and gene levels were compared between HCC and HC, and HCC and SH. Thirty-nine variants (adjusted P HBV-HCC. Thirty-four variants were from eight human leukocyte antigen (HLA) genes that were previously reported to be associated with HBV-HCC. The novelties of our study are: five variants (rs579876, rs579877, rs368692979, NM_145007:c.*131_*130delTG, NM_139165:exon5:c.623-2->TT) from three genes (REAT1E, NOD-like receptor (NLR) protein 11 (NLRP11), hydroxy-carboxylic acid receptor 2 (HCAR2)) were found strongly associated with HBV-HCC. We found 39 different variants in 11 genes that were significantly related to HBV-HCC. Five of them were new findings. Our data implied that chronic hepatitis B patients who carry these variants are at a high risk of developing HCC. © 2017 The Author(s).
A Morpholino-based screen to identify novel genes involved in craniofacial morphogenesis

Science.gov (United States)

Melvin, Vida Senkus; Feng, Weiguo; Hernandez-Lagunas, Laura; Artinger, Kristin Bruk; Williams, Trevor

2014-01-01

BACKGROUND The regulatory mechanisms underpinning facial development are conserved between diverse species. Therefore, results from model systems provide insight into the genetic causes of human craniofacial defects. Previously, we generated a comprehensive dataset examining gene expression during development and fusion of the mouse facial prominences. Here, we used this resource to identify genes that have dynamic expression patterns in the facial prominences, but for which only limited information exists concerning developmental function. RESULTS This set of ~80 genes was used for a high throughput functional analysis in the zebrafish system using Morpholino gene knockdown technology. This screen revealed three classes of cranial cartilage phenotypes depending upon whether knockdown of the gene affected the neurocranium, viscerocranium, or both. The targeted genes that produced consistent phenotypes encoded proteins linked to transcription (meis1, meis2a, tshz2, vgll4l), signaling (pkdcc, vlk, macc1, wu:fb16h09), and extracellular matrix function (smoc2). The majority of these phenotypes were not altered by reduction of p53 levels, demonstrating that both p53 dependent and independent mechanisms were involved in the craniofacial abnormalities. CONCLUSIONS This Morpholino-based screen highlights new genes involved in development of the zebrafish craniofacial skeleton with wider relevance to formation of the face in other species, particularly mouse and human. PMID:23559552
A family-based association study identified CYP17 as a candidate gene for obesity susceptibility in Caucasians.

Science.gov (United States)

Yan, H; Guo, Y; Yang, T-L; Zhao, L-J; Deng, H-W

2012-08-06

The cytochrome P450c17α gene (CYP17) encodes a key biosynthesis enzyme of estrogen, which is critical in regulating adipogenesis and adipocyte development in humans. We therefore hypothesized that CYP17 is a candidate gene for predicting obesity. In order to test this hypothesis, we performed a family-based association test to investigate the relationship between the CYP17 gene and obesity phenotypes in a large sample comprising 1873 subjects from 405 Caucasian nuclear families of European origin recruited by the Osteoporosis Research Center of Creighton University, USA. Both single SNPs and haplotypes were tested for associations with obesity-related phenotypes, including body mass index (BMI) and fat mass. We identified three SNPs to be significantly associated with BMI, including rs3740397, rs6163, and rs619824. We further characterized the linkage disequilibrium structure for CYP17 and found that the whole CYP17 gene was located in a single-linkage disequilibrium block. This block was observed to be significantly associated with BMI. A major haplotype in this block was significantly associated with both BMI and fat mass. In conclusion, we suggest that the CYP17 gene has an effect on obesity in the Caucasian population. Further independent studies will be needed to confirm our findings.
Use of deep whole-genome sequencing data to identify structure risk variants in breast cancer susceptibility genes.

Science.gov (United States)

Guo, Xingyi; Shi, Jiajun; Cai, Qiuyin; Shu, Xiao-Ou; He, Jing; Wen, Wanqing; Allen, Jamie; Pharoah, Paul; Dunning, Alison; Hunter, David J; Kraft, Peter; Easton, Douglas F; Zheng, Wei; Long, Jirong

2018-03-01

Functional disruptions of susceptibility genes by large genomic structure variant (SV) deletions in germlines are known to be associated with cancer risk. However, few studies have been conducted to systematically search for SV deletions in breast cancer susceptibility genes. We analysed deep (> 30x) whole-genome sequencing (WGS) data generated in blood samples from 128 breast cancer patients of Asian and European descent with either a strong family history of breast cancer or early cancer onset disease. To identify SV deletions in known or suspected breast cancer susceptibility genes, we used multiple SV calling tools including Genome STRiP, Delly, Manta, BreakDancer and Pindel. SV deletions were detected by at least three of these bioinformatics tools in five genes. Specifically, we identified heterozygous deletions covering a fraction of the coding regions of BRCA1 (with approximately 80kb in two patients), and TP53 genes (with ∼1.6 kb in two patients), and of intronic regions (∼1 kb) of the PALB2 (one patient), PTEN (three patients) and RAD51C genes (one patient). We confirmed the presence of these deletions using real-time quantitative PCR (qPCR). Our study identified novel SV deletions in breast cancer susceptibility genes and the identification of such SV deletions may improve clinical testing.
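The "detected by at least three of these tools" criterion in the abstract above amounts to a simple voting filter, which can be sketched as follows. The tool names are taken from the abstract, but the calls, patient labels, and the function name consensus_calls are invented for illustration. Matching calls by exact (gene, patient) pairs is a simplifying assumption; a real pipeline would compare breakpoint coordinates and allow some tolerance.

```python
from collections import defaultdict

# Invented deletion calls per tool: each call is a (gene, patient) pair.
calls = {
    "Genome STRiP": {("BRCA1", "P1"), ("TP53", "P2")},
    "Delly": {("BRCA1", "P1"), ("TP53", "P2"), ("PTEN", "P3")},
    "Manta": {("BRCA1", "P1"), ("PTEN", "P3")},
    "BreakDancer": {("TP53", "P2")},
    "Pindel": {("BRCA1", "P1"), ("PALB2", "P4")},
}

def consensus_calls(calls_by_tool, min_tools=3):
    """Keep calls reported by at least `min_tools` distinct tools."""
    support = defaultdict(set)
    for tool, tool_calls in calls_by_tool.items():
        for call in tool_calls:
            support[call].add(tool)
    return {
        call: sorted(tools)
        for call, tools in support.items()
        if len(tools) >= min_tools
    }

consensus = consensus_calls(calls)
```

In this toy input only the BRCA1 and TP53 calls reach the three-tool threshold; the PTEN and PALB2 calls have too little support and are dropped.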
Identification of pathogenic genes related to rheumatoid arthritis through integrated analysis of DNA methylation and gene expression profiling.

Science.gov (United States)

Zhang, Lei; Ma, Shiyun; Wang, Huailiang; Su, Hang; Su, Ke; Li, Longjie

2017-11-15

The purpose of our study was to identify new pathogenic genes used for exploring the pathogenesis of rheumatoid arthritis (RA). To screen pathogenic genes of RA, an integrated analysis was performed by using the microarray datasets in RA derived from the Gene Expression Omnibus (GEO) database. The functional annotation and potential pathways of differentially expressed genes (DEGs) were further discovered by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis. Afterwards, the integrated analysis of DNA methylation and gene expression profiling was used to screen crucial genes. In addition, we used RT-PCR and MSP to verify the expression levels and methylation status of these crucial genes in 20 synovial biopsy samples obtained from 10 RA model mice and 10 normal mice. BCL11B, CCDC88C, FCRLA and APOL6 were both up-regulated and hypomethylated in RA according to integrated analysis, RT-PCR and MSP verification. Four crucial genes (BCL11B, CCDC88C, FCRLA and APOL6) identified and analyzed in this study might be closely connected with the pathogenesis of RA. Copyright © 2017. Published by Elsevier B.V.
Integrative Analysis of DCE-MRI and Gene Expression Profiles in Construction of a Gene Classifier for Assessment of Hypoxia-Related Risk of Chemoradiotherapy Failure in Cervical Cancer

DEFF Research Database (Denmark)

Fjeldbo, Christina S; Julin, Cathinka H; Lando, Malin

2016-01-01

PURPOSE: A 31-gene expression signature reflected in dynamic contrast enhanced (DCE)-MR images and correlated with hypoxia-related aggressiveness in cervical cancer was identified in previous work. We here aimed to construct a dichotomous classifier with key signature genes and a predefined... as an indicator of hypoxia. RESULTS: Classifier candidates were constructed by integrative analysis of ABrix and gene expression profiles in the training cohort and evaluated by a leave-one-out cross-validation approach. On the basis of their ability to separate patients correctly according to hypoxia status, a 6... platforms. The prognostic value was independent of existing clinical markers, regardless of clinical endpoints. CONCLUSIONS: A robust DCE-MRI-associated gene classifier has been constructed that may be used to achieve an early indication of patients' risk of hypoxia-related chemoradiotherapy failure.
Novel mutations in the homogentisate 1,2 dioxygenase gene identified in Jordanian patients with alkaptonuria.

Science.gov (United States)

Al-sbou, Mohammed

2012-06-01

This study was conducted to identify mutations in the homogentisate 1,2 dioxygenase gene (HGD) in alkaptonuria patients in the Jordanian population. Blood samples were collected from four alkaptonuria patients, four carriers, and two healthy volunteers. DNA was isolated from peripheral blood. All 14 exons of the HGD gene were amplified using the polymerase chain reaction (PCR) technique. The PCR products were then purified and analyzed by sequencing. Five mutations were identified in our samples. Four of them (C1273A, T1046G, 551-552insG, T533G) were novel and had not been previously reported, and one mutation (T847C) has been described before. The types of mutations identified were two missense mutations, one splice site mutation, one frameshift mutation, and one polymorphism. We present the first molecular study of the HGD gene in Jordanian alkaptonuria patients. This study provides valuable information about the molecular basis of alkaptonuria in the Jordanian population.
Population effect model identifies gene expression predictors of survival outcomes in lung adenocarcinoma for both Caucasian and Asian patients.

Directory of Open Access Journals (Sweden)

Guoshuai Cai

We analyzed and integrated transcriptome data from two large studies of lung adenocarcinomas on distinct populations. Our goal was to investigate the variable gene expression alterations between paired tumor-normal tissues and prospectively identify those alterations that can reliably predict lung disease related outcomes across populations. We developed a mixed model that combined the paired tumor-normal RNA-seq from two populations. Alterations in gene expression common to both populations were detected and validated in two independent DNA microarray datasets. A 10-gene prognosis signature was developed through an l1 penalized regression approach and its prognostic value was evaluated in a third independent microarray cohort. Deregulation of apoptosis pathways and increased expression of cell cycle pathways were identified in tumors of both Caucasian and Asian lung adenocarcinoma patients. We demonstrate that a 10-gene biomarker panel can predict prognosis of lung adenocarcinoma in both Caucasians and Asians. Compared to low risk groups, high risk groups showed significantly shorter overall survival time (Caucasian patients data: HR = 3.63, p-value = 0.007; Asian patients data: HR = 3.25, p-value = 0.001). This study uses a statistical framework to detect DEGs between paired tumor and normal tissues that considers variances among patients and ethnicities, which will aid in understanding the common genes and signalling pathways with the largest effect sizes in ethnically diverse cohorts. We propose multifunctional markers for distinguishing tumor from normal tissue and prognosis for both populations studied.
Gene Expression Profiling Identifies Important Genes Affected by R2 Compound Disrupting FAK and P53 Complex

International Nuclear Information System (INIS)

Golubovskaya, Vita M.; Ho, Baotran; Conroy, Jeffrey; Liu, Song; Wang, Dan; Cance, William G.

2014-01-01

Focal Adhesion Kinase (FAK) is a non-receptor kinase that plays an important role in many cellular processes: adhesion, proliferation, invasion, angiogenesis, metastasis and survival. Recently, we have shown that Roslin 2 or R2 (1-benzyl-15,3,5,7-tetraazatricyclo[3.3.1.1~3,7~]decane) compound disrupts FAK and p53 proteins, activates p53 transcriptional activity, and blocks tumor growth. In this report we performed a microarray gene expression analysis of R2-treated HCT116 p53 +/+ and p53 −/− cells and detected 1484 genes that were significantly up- or down-regulated (p < 0.05) in HCT116 p53 +/+ cells but not in p53 −/− cells. Among up-regulated genes in HCT116 p53 +/+ cells we detected critical p53 targets: Mdm-2, Noxa-1, and RIP1. Among down-regulated genes, Met, PLK2, KIF14, BIRC2 and other genes were identified. In addition, a combination of R2 compound with M13 compound that disrupts FAK and Mdm-2 complex, or R2 and Nutlin-1 that disrupts Mdm-2 and p53, decreased clonogenicity of HCT116 p53 +/+ colon cancer cells more significantly than each agent alone in a p53-dependent manner. Thus, the report describes the gene expression profile in response to R2 treatment and demonstrates that the combination of drugs targeting FAK, Mdm-2, and p53 can be a novel therapy approach.
Prioritizing chronic obstructive pulmonary disease (COPD) candidate genes in COPD-related networks.

Science.gov (United States)

Zhang, Yihua; Li, Wan; Feng, Yuyan; Guo, Shanshan; Zhao, Xilei; Wang, Yahui; He, Yuehan; He, Weiming; Chen, Lina

2017-11-28

Chronic obstructive pulmonary disease (COPD) is a multi-factor disease, which could be caused by many factors, including disturbances of metabolism and protein-protein interactions (PPIs). In this paper, a weighted COPD-related metabolic network and a weighted COPD-related PPI network were constructed based on COPD disease genes and functional information. Candidate genes in each of these weighted COPD-related networks were prioritized using a gene prioritization method. Literature review and functional enrichment analysis of the top 100 genes in these two networks suggested the correlation of COPD and these genes. The performance of our gene prioritization method was superior to that of ToppGene and ToppNet for genes from the COPD-related metabolic network or the COPD-related PPI network, as assessed by leave-one-out cross-validation, literature validation and functional enrichment analysis. The top-ranked genes prioritized from COPD-related metabolic and PPI networks could promote a better understanding of the molecular mechanism of this disease from different perspectives. The top 100 genes in the COPD-related metabolic network or the COPD-related PPI network might be potential markers for the diagnosis and treatment of COPD.
Computational modeling identifies key gene regulatory interactions underlying phenobarbital-mediated tumor promotion

Science.gov (United States)

Luisier, Raphaëlle; Unterberger, Elif B.; Goodman, Jay I.; Schwarz, Michael; Moggs, Jonathan; Terranova, Rémi; van Nimwegen, Erik

2014-01-01

Gene regulatory interactions underlying the early stages of non-genotoxic carcinogenesis are poorly understood. Here, we have identified key candidate regulators of phenobarbital (PB)-mediated mouse liver tumorigenesis, a well-characterized model of non-genotoxic carcinogenesis, by applying a new computational modeling approach to a comprehensive collection of in vivo gene expression studies. We have combined our previously developed motif activity response analysis (MARA), which models gene expression patterns in terms of computationally predicted transcription factor binding sites with singular value decomposition (SVD) of the inferred motif activities, to disentangle the roles that different transcriptional regulators play in specific biological pathways of tumor promotion. Furthermore, transgenic mouse models enabled us to identify which of these regulatory activities was downstream of constitutive androstane receptor and β-catenin signaling, both crucial components of PB-mediated liver tumorigenesis. We propose novel roles for E2F and ZFP161 in PB-mediated hepatocyte proliferation and suggest that PB-mediated suppression of ESR1 activity contributes to the development of a tumor-prone environment. Our study shows that combining MARA with SVD allows for automated identification of independent transcription regulatory programs within a complex in vivo tissue environment and provides novel mechanistic insights into PB-mediated hepatocarcinogenesis. PMID:24464994
A genetic screen identifies interferon-α effector genes required to suppress hepatitis C virus replication.

Science.gov (United States)

Fusco, Dahlene N; Brisac, Cynthia; John, Sinu P; Huang, Yi-Wen; Chin, Christopher R; Xie, Tiao; Zhao, Hong; Jilg, Nikolaus; Zhang, Leiliang; Chevaliez, Stephane; Wambua, Daniel; Lin, Wenyu; Peng, Lee; Chung, Raymond T; Brass, Abraham L

2013-06-01

Hepatitis C virus (HCV) infection is a leading cause of end-stage liver disease. Interferon-α (IFNα) is an important component of anti-HCV therapy; it up-regulates transcription of IFN-stimulated genes, many of which have been investigated for their antiviral effects. However, all of the genes required for the antiviral function of IFNα (IFN effector genes [IEGs]) are not known. IEGs include not only IFN-stimulated genes, but other nontranscriptionally induced genes that are required for the antiviral effect of IFNα. In contrast to candidate approaches based on analyses of messenger RNA (mRNA) expression, identification of IEGs requires a broad functional approach. We performed an unbiased genome-wide small interfering RNA screen to identify IEGs that inhibit HCV. Huh7.5.1 hepatoma cells were transfected with small interfering RNAs incubated with IFNα and then infected with JFH1 HCV. Cells were stained using HCV core antibody, imaged, and analyzed to determine the percent infection. Candidate IEGs detected in the screen were validated and analyzed further. The screen identified 120 previously unreported IEGs. From these, we more fully evaluated the following: asparagine-linked glycosylation 10 homolog (yeast, α-1,2-glucosyltransferase); butyrylcholinesterase; dipeptidyl-peptidase 4 (CD26, adenosine deaminase complexing protein 2); glucokinase (hexokinase 4) regulator; guanylate cyclase 1, soluble, β 3; MYST histone acetyltransferase 1; protein phosphatase 3 (formerly 2B), catalytic subunit, β isoform; peroxisomal proliferator-activated receptor-γ-DBD-interacting protein 1; and solute carrier family 27 (fatty acid transporter), member 2; and demonstrated that they enabled IFNα-mediated suppression of HCV at multiple steps of its life cycle. Expression of these genes had more potent effects against flaviviridae because a subset was required for IFNα to suppress dengue virus but not influenza A virus. In addition, many of the host genes detected in this
Genome-wide association study identifies TF as a significant modifier gene of iron metabolism in HFE hemochromatosis.

Science.gov (United States)

de Tayrac, Marie; Roth, Marie-Paule; Jouanolle, Anne-Marie; Coppin, Hélène; le Gac, Gérald; Piperno, Alberto; Férec, Claude; Pelucchi, Sara; Scotet, Virginie; Bardou-Jacquet, Edouard; Ropert, Martine; Bouvet, Régis; Génin, Emmanuelle; Mosser, Jean; Deugnier, Yves

2015-03-01

Hereditary hemochromatosis (HH) is the most common form of genetic iron loading disease. It is mainly related to the homozygous C282Y/C282Y mutation in the HFE gene that is, however, a necessary but not a sufficient condition to develop clinical and even biochemical HH. This suggests that modifier genes are likely involved in the expressivity of the disease. Our aim was to identify such modifier genes. We performed a genome-wide association study (GWAS) using DNA collected from 474 unrelated C282Y homozygotes. Associations were examined for both quantitative iron burden indices and clinical outcomes with 534,213 single nucleotide polymorphisms (SNP) genotypes, with replication analyses in an independent sample of 748 C282Y homozygotes from four different European centres. One SNP met genome-wide statistical significance for association with transferrin concentration (rs3811647, GWAS p value of 7×10(-9) and replication p value of 5×10(-13)). This SNP, located within intron 11 of the TF gene, had a pleiotropic effect on serum iron (GWAS p value of 4.9×10(-6) and replication p value of 3.2×10(-6)). Both serum transferrin and iron levels were associated with serum ferritin levels, amount of iron removed and global clinical stage (pHFE-associated HH (HFE-HH) patients, identified the rs3811647 polymorphism in the TF gene as the only SNP significantly associated with iron metabolism through serum transferrin and iron levels. Because these two outcomes were clearly associated with the biochemical and clinical expression of the disease, an indirect link between the rs3811647 polymorphism and the phenotypic presentation of HFE-HH is likely. Copyright © 2014 European Association for the Study of the Liver. Published by Elsevier B.V. All rights reserved.
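For orientation, the genome-wide significance statement in the record above can be compared with a simple Bonferroni threshold. This is a minimal sketch under a stated assumption: a family-wise error rate of 0.05 divided by the 534,213 SNPs tested. The abstract does not say which threshold the authors applied, so the function name `bonferroni_threshold` and the value of `alpha` are illustrative only; the two p values are the GWAS-stage values quoted in the abstract.

```python
def bonferroni_threshold(n_tests, alpha=0.05):
    """Per-test significance level under a Bonferroni correction (assumed here)."""
    return alpha / n_tests

N_SNPS = 534_213  # number of SNP genotypes quoted in the abstract
threshold = bonferroni_threshold(N_SNPS)

# GWAS-stage p values quoted in the abstract for rs3811647
reported = {
    "transferrin concentration": 7e-9,
    "serum iron": 4.9e-6,
}

# True where the quoted p value falls below the assumed threshold
significant = {trait: p < threshold for trait, p in reported.items()}

for trait, flag in significant.items():
    print(f"{trait}: p = {reported[trait]:.1e}, below threshold: {flag}")
```

Under this assumed threshold only the transferrin association passes, which is in line with the abstract describing a single genome-wide significant SNP.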
Identification and detection of a novel human endogenous retrovirus-related gene, and structural characterization of its related elements

Directory of Open Access Journals (Sweden)

Qiaoyi Liang

2009-01-01

Up-regulation of human endogenous retroviruses (HERVs) is associated with many diseases, including cancer. In this study, an H family HERV (HERV-H)-related gene was identified and characterized. Its spliced transcript lacks protein-coding capacity and may belong to the emerging class of noncoding RNAs (ncRNAs). The 1.3-kb RNA consisting of four exons is transcribed from an Alu element upstream of a 5.0-kb structurally incomplete HERV-H element. RT-PCR and quantitative RT-PCR results indicated that expression of this HERV-related transcript was negatively associated with colon, stomach, and kidney cancers. Its expression was induced upon treatment with DNA methylation and histone deacetylation inhibitors. A BLAT search using long terminal repeats (LTRs) identified 50 other LTR homogenous HERV-H elements. Further analysis of these elements revealed that all are structurally incomplete and only five exert transcriptional activity. The results presented here recommend further investigation into a potentially functional HERV-H-related ncRNA.
Mutation screening of the HGD gene identifies a novel alkaptonuria mutation with significant founder effect and high prevalence.

Science.gov (United States)

Sakthivel, Srinivasan; Zatkova, Andrea; Nemethova, Martina; Surovy, Milan; Kadasi, Ludevit; Saravanan, Madurai P

2014-05-01

Alkaptonuria (AKU) is an autosomal recessive disorder caused by mutations in the homogentisate 1,2-dioxygenase (HGD) gene located on chromosome 3q13.33. AKU is a rare disorder with an incidence of 1:250,000 to 1:1,000,000, but Slovakia and the Dominican Republic have a relatively higher incidence of 1:19,000. Our study focused on studying the frequency of AKU and identification of HGD gene mutations in nomads. HGD gene sequencing was used to identify the mutations in alkaptonurics. For the past four years, from subjects suspected to be clinically affected, we found 16 positive cases among a randomly selected cohort of 41 Indian nomads (Narikuravar) settled in the specific area of Tamil Nadu, India. HGD gene mutation analysis showed that 11 of these patients carry the same homozygous splicing mutation c.87+1G>A; in five cases, this mutation was found to be heterozygous, while the second AKU-causing mutation was not identified in these patients. This result indicates that the founder effect and high degree of consanguineous marriages have contributed to AKU among nomads. Eleven positive samples were homozygous for a novel mutation, c.87+1G>A, that abolishes an intron 2 donor splice site and most likely causes skipping of exon 2. The prevalence of AKU observed earlier seems to be highly increased in people of nomadic origin. © 2014 John Wiley & Sons Ltd/University College London.
Genome-wide significant localization for working and spatial memory: Identifying genes for psychosis using models of cognition.

Science.gov (United States)

Knowles, Emma E M; Carless, Melanie A; de Almeida, Marcio A A; Curran, Joanne E; McKay, D Reese; Sprooten, Emma; Dyer, Thomas D; Göring, Harald H; Olvera, Rene; Fox, Peter; Almasy, Laura; Duggirala, Ravi; Kent, Jack W; Blangero, John; Glahn, David C

2014-01-01

It is well established that risk for developing psychosis is largely mediated by the influence of genes, but identifying precisely which genes underlie that risk has been problematic. Focusing on endophenotypes, rather than illness risk, is one solution to this problem. Impaired cognition is a well-established endophenotype of psychosis. Here we aimed to characterize the genetic architecture of cognition using phenotypically detailed models as opposed to relying on general IQ or individual neuropsychological measures. In so doing we hoped to identify genes that mediate cognitive ability, which might also contribute to psychosis risk. Hierarchical factor models of genetically clustered cognitive traits were subjected to linkage analysis followed by QTL region-specific association analyses in a sample of 1,269 Mexican American individuals from extended pedigrees. We identified four genome wide significant QTLs, two for working and two for spatial memory, and a number of plausible and interesting candidate genes. The creation of detailed models of cognition seemingly enhanced the power to detect genetic effects on cognition and provided a number of possible candidate genes for psychosis. © 2013 Wiley Periodicals, Inc.
Two-stage case-control association study of dopamine-related genes and migraine

Directory of Open Access Journals (Sweden)

Pardo Julio

2009-09-01

Background: We previously reported risk haplotypes for two genes related to serotonin and dopamine metabolism: MAOA in migraine without aura and DDC in migraine with aura. Herein we investigate the contribution to migraine susceptibility of eight additional genes involved in dopamine neurotransmission. Methods: We performed a two-stage case-control association study of 50 tag single nucleotide polymorphisms (SNPs), selected according to genetic coverage parameters. The first analysis consisted of 263 patients and 274 controls and the replication study was composed of 259 cases and 287 controls. All cases were diagnosed according to ICHD-II criteria, were Spanish Caucasian, and were sex-matched with control subjects. Results: Single-marker analysis of the first population identified nominal associations of five genes with migraine. After applying a false discovery rate correction of 10%, the differences remained significant only for DRD2 (rs2283265) and TH (rs2070762). Multiple-marker analysis identified a five-marker T-C-G-C-G (rs12363125-rs2283265-rs2242592-rs1554929-rs2234689) risk haplotype in DRD2 and a two-marker A-C (rs6356-rs2070762) risk haplotype in TH that remained significant after correction by permutations. These results, however, were not replicated in the second independent cohort. Conclusion: The present study does not support the involvement of the DRD1, DRD2, DRD3, DRD5, DBH, COMT, SLC6A3 and TH genes in the genetic predisposition to migraine in the Spanish population.
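The "false discovery rate correction of 10%" mentioned in the record above can be illustrated with the Benjamini-Hochberg step-up procedure. This is a minimal sketch, assuming that Benjamini-Hochberg is the procedure meant; the abstract does not name one. The function name `benjamini_hochberg` and the example p values are hypothetical and are not data from the study.

```python
def benjamini_hochberg(pvalues, q=0.10):
    """Return one boolean per p value: True if rejected at FDR level q.

    Step-up rule: find the largest rank k with p_(k) <= k * q / m,
    then reject the k smallest p values.
    """
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])  # indices by ascending p
    max_rank = 0
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= rank * q / m:
            max_rank = rank
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= max_rank:
            rejected[idx] = True
    return rejected

# Made-up single-marker p values, for illustration only
example_p = [0.6, 0.001, 0.2, 0.03, 0.004]
flags = benjamini_hochberg(example_p, q=0.10)
print(flags)
```

The step-up rule matters: a p value that misses its own rank threshold is still rejected when a larger p value further down the sorted list meets its threshold.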
Transcriptome analysis identifies genes involved in ethanol response of Saccharomyces cerevisiae in Agave tequilana juice.

Science.gov (United States)

Ramirez-Córdova, Jesús; Drnevich, Jenny; Madrigal-Pulido, Jaime Alberto; Arrizon, Javier; Allen, Kirk; Martínez-Velázquez, Moisés; Alvarez-Maya, Ikuri

2012-08-01

During ethanol fermentation, yeast cells are exposed to stress due to the accumulation of ethanol, cell growth is altered and the output of the target product is reduced. For Agave beverages, like tequila, no reports have been published on the global gene expression under ethanol stress. In this work, we used microarray analysis to identify Saccharomyces cerevisiae genes involved in the ethanol response. Gene expression of a tequila yeast strain of S. cerevisiae (AR5) was explored by comparing global gene expression with that of laboratory strain S288C, both after ethanol exposure. Additionally, we used two different culture conditions, cells grown in Agave tequilana juice as a natural fermentation media or grown in yeast-extract peptone dextrose as artificial media. Of the 6368 S. cerevisiae genes in the microarray, 657 genes were identified that had different expression responses to ethanol stress due to strain and/or media. A cluster of 28 genes was found over-expressed specifically in the AR5 tequila strain that could be involved in the adaptation to tequila yeast fermentation, 14 of which are unknown such as yor343c, ylr162w, ygr182c, ymr265c, yer053c-a or ydr415c. These could be the most suitable genes for transforming tequila yeast to increase ethanol tolerance in the tequila fermentation process. Other genes involved in response to stress (RFC4, TSA1, MLH1, PAU3, RAD53) or transport (CYB2, TIP20, QCR9) were expressed in the same cluster. Unknown genes could be good candidates for the development of recombinant yeasts with ethanol tolerance for use in industrial tequila fermentation.
Parallel analysis of tagged deletion mutants efficiently identifies genes involved in endoplasmic reticulum biogenesis.

Science.gov (United States)

Wright, Robin; Parrish, Mark L; Cadera, Emily; Larson, Lynnelle; Matson, Clinton K; Garrett-Engele, Philip; Armour, Chris; Lum, Pek Yee; Shoemaker, Daniel D

2003-07-30

Increased levels of HMG-CoA reductase induce cell type- and isozyme-specific proliferation of the endoplasmic reticulum. In yeast, the ER proliferations induced by Hmg1p consist of nuclear-associated stacks of smooth ER membranes known as karmellae. To identify genes required for karmellae assembly, we compared the composition of populations of homozygous diploid S. cerevisiae deletion mutants following 20 generations of growth with and without karmellae. Using an initial population of 1,557 deletion mutants, 120 potential mutants were identified as a result of three independent experiments. Each experiment produced a largely non-overlapping set of potential mutants, suggesting that differences in specific growth conditions could be used to maximize the comprehensiveness of similar parallel analysis screens. Only two genes, UBC7 and YAL011W, were identified in all three experiments. Subsequent analysis of individual mutant strains confirmed that each experiment was identifying valid mutations, based on the mutant's sensitivity to elevated HMG-CoA reductase and inability to assemble normal karmellae. The largest class of HMG-CoA reductase-sensitive mutations was a subset of genes that are involved in chromatin structure and transcriptional regulation, suggesting that karmellae assembly requires changes in transcription or that the presence of karmellae may interfere with normal transcriptional regulation. Copyright 2003 John Wiley & Sons, Ltd.

Functional modules by relating protein interaction networks and gene expression.

Science.gov (United States)

Tornow, Sabine; Mewes, H W

2003-11-01

Genes and proteins are organized on the basis of their particular mutual relations or according to their interactions in cellular and genetic networks. These include metabolic or signaling pathways and protein interaction, regulatory or co-expression networks. Integrating the information from the different types of networks may lead to the notion of a functional network and functional modules. To find these modules, we propose a new technique which is based on collective, multi-body correlations in a genetic network. We calculated the correlation strength of a group of genes (e.g. in the co-expression network) which were identified as members of a module in a different network (e.g. in the protein interaction network) and estimated the probability that this correlation strength was found by chance. Groups of genes with a significant correlation strength in different networks have a high probability that they perform the same function. Here, we propose evaluating the multi-body correlations by applying the superparamagnetic approach. We compare our method to the presently applied mean Pearson correlations and show that our method is more sensitive in revealing functional relationships.
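The baseline mentioned in the record above, a mean Pearson correlation for a group of genes together with the probability of reaching that strength by chance, can be sketched as follows. This illustrates only the mean-Pearson comparison method, not the authors' superparamagnetic approach. The random-group resampling scheme, the function names, and the toy expression data are all assumptions made for the example.

```python
import random
from itertools import combinations
from math import sqrt

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def mean_pairwise_correlation(expr, genes):
    """Average Pearson correlation over all gene pairs in the group."""
    pairs = list(combinations(genes, 2))
    return sum(pearson(expr[g], expr[h]) for g, h in pairs) / len(pairs)

def chance_probability(expr, module, n_random=1000, seed=0):
    """Empirical probability that a random gene group of the same size
    is at least as strongly correlated as the given module."""
    rng = random.Random(seed)
    observed = mean_pairwise_correlation(expr, module)
    all_genes = sorted(expr)
    hits = 0
    for _ in range(n_random):
        sample = rng.sample(all_genes, len(module))
        if mean_pairwise_correlation(expr, sample) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero
    return observed, (hits + 1) / (n_random + 1)

# Toy data: four co-expressed "module" genes plus 30 unrelated genes
toy = random.Random(1)
base = [toy.gauss(0, 1) for _ in range(20)]
expr = {f"m{i}": [b + toy.gauss(0, 0.1) for b in base] for i in range(4)}
expr.update({f"g{i}": [toy.gauss(0, 1) for _ in range(20)] for i in range(30)})

observed, p_chance = chance_probability(expr, ["m0", "m1", "m2", "m3"], n_random=200)
print(f"mean correlation = {observed:.3f}, chance probability = {p_chance:.4f}")
```

In practice the module would come from one network (for example, protein interaction clusters) and `expr` from another data type (for example, co-expression profiles), as the abstract describes.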
Genome-wide association study identifies the SERPINB gene cluster as a susceptibility locus for food allergy.

Science.gov (United States)

Marenholz, Ingo; Grosche, Sarah; Kalb, Birgit; Rüschendorf, Franz; Blümchen, Katharina; Schlags, Rupert; Harandi, Neda; Price, Mareike; Hansen, Gesine; Seidenberg, Jürgen; Röblitz, Holger; Yürek, Songül; Tschirner, Sebastian; Hong, Xiumei; Wang, Xiaobin; Homuth, Georg; Schmidt, Carsten O; Nöthen, Markus M; Hübner, Norbert; Niggemann, Bodo; Beyer, Kirsten; Lee, Young-Ae

2017-10-20

Genetic factors and mechanisms underlying food allergy are largely unknown. Due to heterogeneity of symptoms a reliable diagnosis is often difficult to make. Here, we report a genome-wide association study on food allergy diagnosed by oral food challenge in 497 cases and 2387 controls. We identify five loci at genome-wide significance, the clade B serpin (SERPINB) gene cluster at 18q21.3, the cytokine gene cluster at 5q31.1, the filaggrin gene, the C11orf30/LRRC32 locus, and the human leukocyte antigen (HLA) region. Stratifying the results for the causative food demonstrates that association of the HLA locus is peanut allergy-specific whereas the other four loci increase the risk for any food allergy. Variants in the SERPINB gene cluster are associated with SERPINB10 expression in leukocytes. Moreover, SERPINB genes are highly expressed in the esophagus. All identified loci are involved in immunological regulation or epithelial barrier function, emphasizing the role of both mechanisms in food allergy.
Whole-exome sequencing and high throughput genotyping identified KCNJ11 as the thirteenth MODY gene.

Science.gov (United States)

Bonnefond, Amélie; Philippe, Julien; Durand, Emmanuelle; Dechaume, Aurélie; Huyvaert, Marlène; Montagne, Louise; Marre, Michel; Balkau, Beverley; Fajardy, Isabelle; Vambergue, Anne; Vatin, Vincent; Delplanque, Jérôme; Le Guilcher, David; De Graeve, Franck; Lecoeur, Cécile; Sand, Olivier; Vaxillaire, Martine; Froguel, Philippe

2012-01-01

Maturity-onset diabetes of the young (MODY) is a clinically heterogeneous form of diabetes characterized by an autosomal-dominant mode of inheritance, an onset before the age of 25 years, and a primary defect in the pancreatic beta-cell function. Approximately 30% of MODY families remain genetically unexplained (MODY-X). Here, we aimed to use whole-exome sequencing (WES) in a four-generation MODY-X family to identify a new susceptibility gene for MODY. WES (Agilent-SureSelect capture/Illumina-GAIIx sequencing) was performed in three affected and one non-affected relatives in the MODY-X family. We then performed a high-throughput multiplex genotyping (Illumina-GoldenGate assay) of the putative causal mutations in the whole family and in 406 controls. A linkage analysis was also carried out. By focusing on variants of interest (i.e. gains of stop codon, frameshift, non-synonymous and splice-site variants not reported in dbSNP130) present in the three affected relatives and not present in the control, we found 69 mutations. However, as WES was not uniform between samples, a total of 324 mutations had to be assessed in the whole family and in controls. Only one mutation (p.Glu227Lys in KCNJ11) co-segregated with diabetes in the family (with a LOD-score of 3.68). No KCNJ11 mutation was found in 25 other MODY-X unrelated subjects. Beyond neonatal diabetes mellitus (NDM), KCNJ11 is also a MODY gene ('MODY13'), confirming the wide spectrum of diabetes related phenotypes due to mutations in NDM genes (i.e. KCNJ11, ABCC8 and INS). Therefore, the molecular diagnosis of MODY should include KCNJ11 as affected carriers can be ideally treated with oral sulfonylureas.
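As a reading aid for the LOD score of 3.68 quoted in the record above, the standard definition and the implied likelihood ratio can be written out. This is a sketch using the textbook two-point LOD definition, where θ is the recombination fraction and L is the pedigree likelihood; the abstract does not report the value of θ or the linkage model used, so only the odds conversion is computed.

```latex
% Standard two-point LOD score (theta = recombination fraction)
Z(\theta) = \log_{10} \frac{L(\theta)}{L(\theta = 1/2)}

% A reported score of 3.68 corresponds to a likelihood ratio of
Z = 3.68 \;\Longrightarrow\; \frac{L(\theta)}{L(1/2)} = 10^{3.68} \approx 4.8 \times 10^{3}
```

For comparison, the conventional threshold for declaring linkage is Z ≥ 3, that is, odds of 1000:1 in favour of linkage.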
Whole-exome sequencing and high throughput genotyping identified KCNJ11 as the thirteenth MODY gene.

Directory of Open Access Journals (Sweden)

Amélie Bonnefond

BACKGROUND: Maturity-onset diabetes of the young (MODY) is a clinically heterogeneous form of diabetes characterized by an autosomal-dominant mode of inheritance, an onset before the age of 25 years, and a primary defect in the pancreatic beta-cell function. Approximately 30% of MODY families remain genetically unexplained (MODY-X). Here, we aimed to use whole-exome sequencing (WES) in a four-generation MODY-X family to identify a new susceptibility gene for MODY. METHODOLOGY: WES (Agilent-SureSelect capture/Illumina-GAIIx sequencing) was performed in three affected and one non-affected relatives in the MODY-X family. We then performed a high-throughput multiplex genotyping (Illumina-GoldenGate assay) of the putative causal mutations in the whole family and in 406 controls. A linkage analysis was also carried out. PRINCIPAL FINDINGS: By focusing on variants of interest (i.e. gains of stop codon, frameshift, non-synonymous and splice-site variants not reported in dbSNP130) present in the three affected relatives and not present in the control, we found 69 mutations. However, as WES was not uniform between samples, a total of 324 mutations had to be assessed in the whole family and in controls. Only one mutation (p.Glu227Lys in KCNJ11) co-segregated with diabetes in the family (with a LOD-score of 3.68). No KCNJ11 mutation was found in 25 other MODY-X unrelated subjects. CONCLUSIONS/SIGNIFICANCE: Beyond neonatal diabetes mellitus (NDM), KCNJ11 is also a MODY gene ('MODY13'), confirming the wide spectrum of diabetes related phenotypes due to mutations in NDM genes (i.e. KCNJ11, ABCC8 and INS). Therefore, the molecular diagnosis of MODY should include KCNJ11 as affected carriers can be ideally treated with oral sulfonylureas.
Effectively identifying regulatory hotspots while capturing expression heterogeneity in gene expression studies

Science.gov (United States)

2014-01-01

Expression quantitative trait loci (eQTL) mapping is a tool that can systematically identify genetic variation affecting gene expression. eQTL mapping studies have shown that certain genomic locations, referred to as regulatory hotspots, may affect the expression levels of many genes. Recently, studies have shown that various confounding factors may induce spurious regulatory hotspots. Here, we introduce a novel statistical method that effectively eliminates spurious hotspots while retaining genuine hotspots. Applied to simulated and real datasets, we validate that our method achieves greater sensitivity while retaining low false discovery rates compared to previous methods. PMID:24708878
Network-based differential gene expression analysis suggests cell cycle related genes regulated by E2F1 underlie the molecular difference between smoker and non-smoker lung adenocarcinoma

Science.gov (United States)

2013-01-01

Background: Differential gene expression (DGE) analysis is commonly used to reveal the deregulated molecular mechanisms of complex diseases. However, traditional DGE analysis (e.g., the t test or the rank sum test) tests each gene independently without considering interactions between them. Top-ranked differentially regulated genes prioritized by the analysis may not directly relate to the coherent molecular changes underlying complex diseases. Joint analyses of co-expression and DGE have been applied to reveal the deregulated molecular modules underlying complex diseases. Most of these methods consist of separate steps: first to identify gene-gene relationships under the studied phenotype then to integrate them with gene expression changes for prioritizing signature genes, or vice versa. A method is warranted that can simultaneously consider gene-gene co-expression strength and corresponding expression level changes so that both types of information can be leveraged optimally. Results: In this paper, we develop a gene module based method for differential gene expression analysis, named network-based differential gene expression (nDGE) analysis, a one-step integrative process for prioritizing deregulated genes and grouping them into gene modules. We demonstrate that nDGE outperforms existing methods in prioritizing deregulated genes and discovering deregulated gene modules using simulated data sets. When tested on a series of smoker and non-smoker lung adenocarcinoma data sets, we show that top differentially regulated genes identified by the rank sum test in different sets are not consistent while top ranked genes defined by nDGE in different data sets significantly overlap. nDGE results suggest that a differentially regulated gene module, which is enriched for cell cycle related genes and E2F1 targeted genes, plays a role in the molecular differences between smoker and non-smoker lung adenocarcinoma. Conclusions: In this paper, we develop nDGE to prioritize
Identifying the Gene Signatures from Gene-Pathway Bipartite Network Guarantees the Robust Model Performance on Predicting the Cancer Prognosis

Directory of Open Access Journals (Sweden)

Li He

2014-01-01

For the purpose of improving the prediction of cancer prognosis in the clinical researches, various algorithms have been developed to construct the predictive models with the gene signatures detected by DNA microarrays. Due to the heterogeneity of the clinical samples, the list of differentially expressed genes (DEGs) generated by the statistical methods or the machine learning algorithms often involves a number of false positive genes, which are not associated with the phenotypic differences between the compared clinical conditions, and subsequently impacts the reliability of the predictive models. In this study, we proposed a strategy, which combined the statistical algorithm with the gene-pathway bipartite networks, to generate the reliable lists of cancer-related DEGs and constructed the models by using support vector machine for predicting the prognosis of three types of cancers, namely, breast cancer, acute myeloma leukemia, and glioblastoma. Our results demonstrated that, combined with the gene-pathway bipartite networks, our proposed strategy can efficiently generate the reliable cancer-related DEG lists for constructing the predictive models. In addition, the model performance in the swap analysis was similar to that in the original analysis, indicating the robustness of the models in predicting the cancer outcomes.
Transcriptional profiling identifies differentially expressed genes in developing turkey skeletal muscle

Directory of Open Access Journals (Sweden)

Velleman Sandra G

2011-03-01

Background: Skeletal muscle growth and development from embryo to adult consists of a series of carefully regulated changes in gene expression. Understanding these developmental changes in agriculturally important species is essential to the production of high quality meat products. For example, consumer demand for lean, inexpensive meat products has driven the turkey industry to unprecedented production through intensive genetic selection. However, achievements of increased body weight and muscle mass have been countered by an increased incidence of myopathies and meat quality defects. In a previous study, we developed and validated a turkey skeletal muscle-specific microarray as a tool for functional genomics studies. The goals of the current study were to utilize this microarray to elucidate functional pathways of genes responsible for key events in turkey skeletal muscle development and to compare differences in gene expression between two genetic lines of turkeys. To achieve these goals, skeletal muscle samples were collected at three critical stages in muscle development: 18d embryo (hyperplasia), 1d post-hatch (shift from myoblast-mediated growth to satellite cell-modulated growth by hypertrophy), and 16wk (market age) from two genetic lines: a randombred control line (RBC2) maintained without selection pressure, and a line (F) selected from the RBC2 line for increased 16wk body weight. Array hybridizations were performed in two experiments: Experiment 1 directly compared the developmental stages within genetic line, while Experiment 2 directly compared the two lines within each developmental stage. Results: A total of 3474 genes were differentially expressed (false discovery rate; FDR Conclusions: The current study identified gene pathways and uncovered novel genes important in turkey muscle growth and development. Future experiments will focus further on several of these candidate genes and the expression and mechanism of action of
iTAR: a web server for identifying target genes of transcription factors using ChIP-seq or ChIP-chip data.

Science.gov (United States)

Yang, Chia-Chun; Andrews, Erik H; Chen, Min-Hsuan; Wang, Wan-Yu; Chen, Jeremy J W; Gerstein, Mark; Liu, Chun-Chi; Cheng, Chao

2016-08-12

Chromatin immunoprecipitation followed by massively parallel DNA sequencing (ChIP-seq) or microarray hybridization (ChIP-chip) has been widely used to determine the genomic occupation of transcription factors (TFs). We have previously developed a probabilistic method, called TIP (Target Identification from Profiles), to identify TF target genes using ChIP-seq/ChIP-chip data. To achieve high specificity, TIP applies a conservative method to estimate significance of target genes, with the trade-off being a relatively low sensitivity of target gene identification compared to other methods. Additionally, TIP's output does not render binding-peak locations or intensity, information highly useful for visualization and general experimental biological use, while the variability of ChIP-seq/ChIP-chip file formats has made input into TIP more difficult than desired. To improve upon these facets, here we present a refined TIP with key extensions. First, it implements a Gaussian mixture model for p-value estimation, increasing target gene identification sensitivity and more accurately capturing the shape of TF binding profile distributions. Second, it enables the incorporation of TF binding-peak data by identifying their locations in significant target gene promoter regions and quantifies their strengths. Finally, for full ease of implementation we have incorporated it into a web server ( http://syslab3.nchu.edu.tw/iTAR/ ) that enables flexibility of input file format, can be used across multiple species and genome assembly versions, and is freely available for public use. The web server additionally performs GO enrichment analysis for the identified target genes to reveal the potential function of the corresponding TF. The iTAR web server provides a user-friendly interface and supports target gene identification in seven species, ranging from yeast to human. To facilitate investigating the quality of ChIP-seq/ChIP-chip data, the web server generates the chart of the
A zebrafish screen for craniofacial mutants identifies wdr68 as a highly conserved gene required for endothelin-1 expression

Directory of Open Access Journals (Sweden)

Amsterdam Adam

2006-06-01

Background: Craniofacial birth defects result from defects in cranial neural crest (NC) patterning and morphogenesis. The vertebrate craniofacial skeleton is derived from cranial NC cells and the patterning of these cells occurs within the pharyngeal arches. Substantial efforts have led to the identification of several genes required for craniofacial skeletal development such as the endothelin-1 (edn1) signaling pathway that is required for lower jaw formation. However, many essential genes required for craniofacial development remain to be identified. Results: Through screening a collection of insertional zebrafish mutants containing approximately 25% of the genes essential for embryonic development, we present the identification of 15 essential genes that are required for craniofacial development. We identified 3 genes required for hyomandibular development. We also identified zebrafish models for Campomelic Dysplasia and Ehlers-Danlos syndrome. To further demonstrate the utility of this method, we include a characterization of the wdr68 gene. We show that wdr68 acts upstream of the edn1 pathway and is also required for formation of the upper jaw equivalent, the palatoquadrate. We also present evidence that the level of wdr68 activity required for edn1 pathway function differs between the 1st and 2nd arches. Wdr68 interacts with two minibrain-related kinases, Dyrk1a and Dyrk1b, required for embryonic growth and myotube differentiation, respectively. We show that a GFP-Wdr68 fusion protein localizes to the nucleus with Dyrk1a in contrast to an engineered loss of function mutation Wdr68-T284F that no longer accumulated in the cell nucleus and failed to rescue wdr68 mutant animals. Wdr68 homologs appear to exist in all eukaryotic genomes. Notably, we found that the Drosophila wdr68 homolog CG14614 could substitute for the vertebrate wdr68 gene even though insects lack the NC cell lineage. Conclusion: This work represents a systematic
Multiple gene analyses identify distinct “bois noir” phytoplasma genotypes in the Republic of Macedonia

Directory of Open Access Journals (Sweden)

Emilija KOSTADINOVSKA

2015-01-01

“Bois noir” (BN) is a grapevine yellows disease, associated with phytoplasma strains related to ‘Candidatus Phytoplasma solani’, that causes severe losses to viticulture in the Euro-Mediterranean basin. Due to the complex ecological cycle of its etiological agent, BN epidemiology is only partially known, and no effective control strategies have been developed. Numerous studies have focused on molecular characterization of BN phytoplasma strains, to identify molecular markers useful to accurately describe their genetic diversity, geographic distribution and host range. In the present study, multiple gene analyses were carried out on 16S rRNA, tuf, vmp1, and stamp genes to study the genetic variability among 18 BN phytoplasma strains detected in diverse regions of the Republic of Macedonia. Restriction fragment length polymorphism (RFLP) assays showed the presence of one 16S rRNA (16SrXII-A), two tuf (tuf-type a, tuf-type b), five vmp1 (V2-TA, V3, V4, V14, V18), and three stamp (S1, S2, S3) gene patterns among the examined strains. Based on the collective RFLP patterns, seven genotypes (Mac1 to Mac7) were described, providing evidence for genetic heterogeneity and highlighting their prevalence and distribution in the investigated regions. Phylogenetic analyses on vmp1 and stamp genes underlined the affiliation of Macedonian BN phytoplasma strains to clusters associated with distinct ecologies.
Sex-related differences in gene expression following Coxiella burnetii infection in mice: potential role of circadian rhythm.

Directory of Open Access Journals (Sweden)

Julien Textoris

BACKGROUND: Q fever, a zoonosis due to Coxiella burnetii infection, exhibits sexual dimorphism; men are affected more frequently and severely than women for a given exposure. Here we explore whether the severity of C. burnetii infection in mice is related to differences in male and female gene expression profiles. METHODOLOGY/PRINCIPAL FINDINGS: Mice were infected with C. burnetii for 24 hours, and gene expression was measured in liver cells using microarrays. Multiclass analysis identified 2,777 probes for which expression was specifically modulated by C. burnetii infection. Only 14% of the modulated genes were sex-independent, and the remaining 86% were differentially expressed in males and females. Castration of males and females showed that sex hormones were responsible for more than 60% of the observed gene modulation, and this reduction was most pronounced in males. Using functional annotation of modulated genes, we identified four clusters enriched in males that were related to cell-cell adhesion, signal transduction, defensins and cytokine/Jak-Stat pathways. Up-regulation of the IL-10 and Stat-3 genes may account for the high susceptibility of men with Q fever to C. burnetii infection and autoantibody production. Two clusters were identified in females, including the circadian rhythm pathway, which consists of positive (Clock, Arntl) and negative (Per) limbs of a feedback loop. We found that Clock and Arntl were down-modulated whereas Per was up-regulated; these changes may be associated with efficient bacterial elimination in females but not in males, in which an exacerbated host response would be prominent. CONCLUSION: This large-scale study revealed for the first time that circadian rhythm plays a major role in the anti-infectious response of mice, and it provides a new basis for elucidating the role of sexual dimorphism in human infections.
Genome-wide Analyses Identify KIF5A as a Novel ALS Gene

NARCIS (Netherlands)

Nicolas, Aude; Kenna, Kevin P.; Renton, Alan E.; Ticozzi, Nicola; Faghri, Faraz; Chia, Ruth; Dominov, Janice A.; Kenna, Brendan J.; Nalls, Mike A.; Keagle, Pamela; Rivera, Alberto M.; van Rheenen, Wouter; Murphy, Natalie A.; van Vugt, Joke J.F.A.; Geiger, Joshua T.; van der Spek, Rick; Pliner, Hannah A.; Smith, Bradley N.; Marangi, Giuseppe; Topp, Simon D.; Abramzon, Yevgeniya; Gkazi, Athina Soragia; Eicher, John D.; Kenna, Aoife; Logullo, Francesco O.; Simone, Isabella L.; Logroscino, Giancarlo; Salvi, Fabrizio; Bartolomei, Ilaria; Borghero, Giuseppe; Murru, Maria Rita; Costantino, Emanuela; Pani, Carla; Puddu, Roberta; Caredda, Carla; Piras, Valeria; Tranquilli, Stefania; Cuccu, Stefania; Corongiu, Daniela; Melis, Maurizio; Milia, Antonio; Marrosu, Francesco; Marrosu, Maria Giovanna; Floris, Gianluca; Cannas, Antonino; Capasso, Margherita; Caponnetto, Claudia; Mancardi, Gianluigi; Origone, Paola; Mandich, Paola; Conforti, Francesca L.; Cavallaro, Sebastiano; Mora, Gabriele; Marinou, Kalliopi; Sideri, Riccardo; Penco, Silvana; Mosca, Lorena; Lunetta, Christian; Pinter, Giuseppe Lauria; Corbo, Massimo; Riva, Nilo; Carrera, Paola; Volanti, Paolo; Mandrioli, Jessica; Fini, Nicola; Fasano, Antonio; Tremolizzo, Lucio; Arosio, Alessandro; Ferrarese, Carlo; Trojsi, Francesca; Tedeschi, Gioacchino; Monsurrò, Maria Rosaria; Piccirillo, Giovanni; Femiano, Cinzia; Ticca, Anna; Ortu, Enzo; La Bella, Vincenzo; Spataro, Rossella; Colletti, Tiziana; Sabatelli, Mario; Zollino, Marcella; Conte, Amelia; Luigetti, Marco; Lattante, Serena; Marangi, Giuseppe; Santarelli, Marialuisa; Petrucci, Antonio; Pugliatti, Maura; Pirisi, Angelo; Parish, Leslie D.; Occhineri, Patrizia; Giannini, Fabio; Battistini, Stefania; Ricci, Claudia; Benigni, Michele; Cau, Tea B.; Loi, Daniela; Calvo, Andrea; Moglia, Cristina; Brunetti, Maura; Barberis, Marco; Restagno, Gabriella; Casale, Federico; Marrali, Giuseppe; Fuda, Giuseppe; Ossola, Irene; Cammarosano, Stefania; Canosa, Antonio; Ilardi, Antonio; 
Manera, Umberto; Grassano, Maurizio; Tanel, Raffaella; Pisano, Fabrizio; Mora, Gabriele; Calvo, Andrea; Mazzini, Letizia; Riva, Nilo; Mandrioli, Jessica; Caponnetto, Claudia; Battistini, Stefania; Volanti, Paolo; La Bella, Vincenzo; Conforti, Francesca L.; Borghero, Giuseppe; Messina, Sonia; Simone, Isabella L.; Trojsi, Francesca; Salvi, Fabrizio; Logullo, Francesco O.; D'Alfonso, Sandra; Corrado, Lucia; Capasso, Margherita; Ferrucci, Luigi; Harms, Matthew B.; Goldstein, David B.; Shneider, Neil A.; Goutman, Stephen A.; Simmons, Zachary; Miller, Timothy M.; Chandran, Siddharthan; Pal, Suvankar; Manousakis, George; Appel, Stanley H.; Simpson, Ericka; Wang, Leo; Baloh, Robert H.; Gibson, Summer B.; Bedlack, Richard; Lacomis, David; Sareen, Dhruv; Sherman, Alexander; Bruijn, Lucie; Penny, Michelle; Moreno, Cristiane de Araujo Martins; Kamalakaran, Sitharthan; Goldstein, David B.; Allen, Andrew S.; Appel, Stanley; Baloh, Robert H.; Bedlack, Richard S.; Boone, Braden E.; Brown, Robert; Carulli, John P.; Chesi, Alessandra; Chung, Wendy K.; Cirulli, Elizabeth T.; Cooper, Gregory M.; Couthouis, Julien; Day-Williams, Aaron G.; Dion, Patrick A.; Gibson, Summer B.; Gitler, Aaron D.; Glass, Jonathan D.; Goldstein, David B.; Han, Yujun; Harms, Matthew B.; Harris, Tim; Hayes, Sebastian D.; Jones, Angela L.; Keebler, Jonathan; Krueger, Brian J.; Lasseigne, Brittany N.; Levy, Shawn E.; Lu, Yi Fan; Maniatis, Tom; McKenna-Yasek, Diane; Miller, Timothy M.; Myers, Richard M.; Petrovski, Slavé; Pulst, Stefan M.; Raphael, Alya R.; Ravits, John M.; Ren, Zhong; Rouleau, Guy A.; Sapp, Peter C.; Shneider, Neil A.; Simpson, Ericka; Sims, Katherine B.; Staropoli, John F.; Waite, Lindsay L.; Wang, Quanli; Wimbish, Jack R.; Xin, Winnie W.; Gitler, Aaron D.; Harris, Tim; Myers, Richard M.; Phatnani, Hemali; Kwan, Justin; Sareen, Dhruv; Broach, James R.; Simmons, Zachary; Arcila-Londono, Ximena; Lee, Edward B.; Van Deerlin, Vivianna M.; Shneider, Neil A.; Fraenkel, Ernest; Ostrow, Lyle W.; Baas, 
Frank; Zaitlen, Noah; Berry, James D.; Malaspina, Andrea; Fratta, Pietro; Cox, Gregory A.; Thompson, Leslie M.; Finkbeiner, Steve; Dardiotis, Efthimios; Miller, Timothy M.; Chandran, Siddharthan; Pal, Suvankar; Hornstein, Eran; MacGowan, Daniel J.L.; Heiman-Patterson, Terry D.; Hammell, Molly G.; Patsopoulos, Nikolaos A.; Dubnau, Joshua; Nath, Avindra; Phatnani, Hemali; Musunuri, Rajeeva Lochan; Evani, Uday Shankar; Abhyankar, Avinash; Zody, Michael C.; Kaye, Julia; Finkbeiner, Steven; Wyman, Stacia K.; LeNail, Alexander; Lima, Leandro; Fraenkel, Ernest; Rothstein, Jeffrey D.; Svendsen, Clive N.; Thompson, Leslie M.; Van Eyk, Jenny; Maragakis, Nicholas J.; Berry, James D.; Glass, Jonathan D.; Miller, Timothy M.; Kolb, Stephen J.; Baloh, Robert H.; Cudkowicz, Merit; Baxi, Emily; Kaye, Julia; Finkbeiner, Steven; Wyman, Stacia K.; Finkbeiner, Steven; LeNail, Alex; Lima, Leandro; Fraenkel, Ernest; Fraenkel, Ernest; Svendsen, Clive N.; Svendsen, Clive N.; Thompson, Leslie M.; Thompson, Leslie M.; Van Eyk, Jennifer E.; Berry, James D.; Berry, James D.; Miller, Timothy M.; Kolb, Stephen J.; Cudkowicz, Merit; Cudkowicz, Merit; Baxi, Emily; Benatar, Michael; Taylor, J. Paul; Wu, Gang; Rampersaud, Evadnie; Wuu, Joanne; Rademakers, Rosa; Züchner, Stephan; Schule, Rebecca; McCauley, Jacob; Hussain, Sumaira; Cooley, Anne; Wallace, Marielle; Clayman, Christine; Barohn, Richard; Statland, Jeffrey; Ravits, John M.; Swenson, Andrea; Jackson, Carlayne; Trivedi, Jaya; Khan, Shaida; Katz, Jonathan; Jenkins, Liberty; Burns, Ted; Gwathmey, Kelly; Caress, James; McMillan, Corey; Elman, Lauren; Pioro, Erik P.; Heckmann, Jeannine; So, Yuen; Walk, David; Maiser, Samuel; Zhang, Jinghui; Benatar, Michael; Taylor, J. Paul; Taylor, J. 
Paul; Rampersaud, Evadnie; Wu, Gang; Wuu, Joanne; Silani, Vincenzo; Ticozzi, Nicola; Gellera, Cinzia; Ratti, Antonia; Taroni, Franco; Lauria, Giuseppe; Verde, Federico; Fogh, Isabella; Tiloca, Cinzia; Comi, Giacomo P.; Sorarù, Gianni; Cereda, Cristina; D'Alfonso, Sandra; Corrado, Lucia; De Marchi, Fabiola; Corti, Stefania; Ceroni, Mauro; Mazzini, Letizia; Siciliano, Gabriele; Filosto, Massimiliano; Inghilleri, Maurizio; Peverelli, Silvia; Colombrita, Claudia; Poletti, Barbara; Maderna, Luca; Del Bo, Roberto; Gagliardi, Stella; Querin, Giorgia; Bertolin, Cinzia; Pensato, Viviana; Castellotti, Barbara; Lauria, Giuseppe; Verde, Federico; Fogh, Isabella; Tiloca, Cinzia; Fogh, Isabella; Comi, Giacomo P.; Sorarù, Gianni; Cereda, Cristina; Camu, William; Mouzat, Kevin; Lumbroso, Serge; Corcia, Philippe; Meininger, Vincent; Besson, Gérard; Lagrange, Emmeline; Clavelou, Pierre; Guy, Nathalie; Couratier, Philippe; Vourch, Patrick; Danel, Véronique; Bernard, Emilien; Lemasson, Gwendal; Corcia, Philippe; Laaksovirta, Hannu; Myllykangas, Liisa; Jansson, Lilja; Valori, Miko; Ealing, John; Hamdalla, Hisham; Rollinson, Sara; Pickering-Brown, Stuart; Orrell, Richard W.; Sidle, Katie C.; Malaspina, Andrea; Hardy, John; Singleton, Andrew B.; Johnson, Janel O.; Arepalli, Sampath; Sapp, Peter C.; McKenna-Yasek, Diane; Polak, Meraida; Asress, Seneshaw; Al-Sarraj, Safa; King, Andrew; Troakes, Claire; Vance, Caroline; de Belleroche, Jacqueline; Baas, Frank; ten Asbroek, Anneloor L.M.A.; Muñoz-Blanco, José Luis; Hernandez, Dena G.; Ding, Jinhui; Gibbs, J. 
Raphael; Scholz, Sonja W.; Scholz, Sonja W.; Floeter, Mary Kay; Campbell, Roy H.; Landi, Francesco; Bowser, Robert; Pulst, Stefan M.; Ravits, John M.; MacGowan, Daniel J.L.; Kirby, Janine; Pioro, Erik P.; Pamphlett, Roger; Broach, James; Gerhard, Glenn; Dunckley, Travis L.; Brady, Christopher B.; Brady, Christopher B.; Kowall, Neil W.; Troncoso, Juan C.; Le Ber, Isabelle; Mouzat, Kevin; Lumbroso, Serge; Mouzat, Kevin; Lumbroso, Serge; Heiman-Patterson, Terry D.; Heiman-Patterson, Terry D.; Kamel, Freya; Van Den Bosch, Ludo; Van Den Bosch, Ludo; Baloh, Robert H.; Strom, Tim M.; Meitinger, Thomas; Strom, Tim M.; Shatunov, Aleksey; Van Eijk, Kristel R.; de Carvalho, Mamede; de Carvalho, Mamede; Kooyman, Maarten; Middelkoop, Bas; Moisse, Matthieu; McLaughlin, Russell; Van Es, Michael A.; Weber, Markus; Boylan, Kevin B.; Van Blitterswijk, Marka; Rademakers, Rosa; Morrison, Karen; Basak, A. Nazli; Mora, Jesús S.; Drory, Vivian; Shaw, Pamela; Turner, Martin R.; Talbot, Kevin; Hardiman, Orla; Williams, Kelly L.; Fifita, Jennifer A.; Nicholson, Garth A.; Blair, Ian P.; Nicholson, Garth A.; Rouleau, Guy A.; Esteban-Pérez, Jesús; García-Redondo, Alberto; Al-Chalabi, Ammar; Al Kheifat, Ahmad; Al-Chalabi, Ammar; Andersen, Peter M.; Basak, A. 
Nazli; Blair, Ian P.; Chio, Adriano; Cooper-Knock, Jonathan; Corcia, Philippe; Couratier, Philippe; de Carvalho, Mamede; Dekker, Annelot; Drory, Vivian; Redondo, Alberto Garcia; Gotkine, Marc; Hardiman, Orla; Hide, Winston; Iacoangeli, Alfredo; Glass, Jonathan D.; Kenna, Kevin P.; Kiernan, Matthew; Kooyman, Maarten; Landers, John E.; McLaughlin, Russell; Middelkoop, Bas; Mill, Jonathan; Neto, Miguel Mitne; Moisse, Matthieu; Pardina, Jesus Mora; Morrison, Karen; Newhouse, Stephen; Pinto, Susana; Pulit, Sara; Robberecht, Wim; Shatunov, Aleksey; Shaw, Pamela; Shaw, Chris; Silani, Vincenzo; Sproviero, William; Tazelaar, Gijs; Ticozzi, Nicola; Van Damme, Philip; van den Berg, Leonard; van der Spek, Rick; Van Eijk, Kristel R.; Van Es, Michael A.; van Rheenen, Wouter; van Vugt, Joke J.F.A.; Veldink, Jan H.; Weber, Markus; Williams, Kelly L.; Van Damme, Philip; Robberecht, Wim; Zatz, Mayana; Robberecht, Wim; Bauer, Denis C.; Twine, Natalie A.; Rogaeva, Ekaterina; Zinman, Lorne; Ostrow, Lyle W.; Maragakis, Nicholas J.; Rothstein, Jeffrey D.; Simmons, Zachary; Cooper-Knock, Johnathan; Brice, Alexis; Goutman, Stephen A.; Feldman, Eva L.; Gibson, Summer B.; Taroni, Franco; Ratti, Antonia; Ratti, Antonia; Gellera, Cinzia; Van Damme, Philip; Robberecht, Wim; Fratta, Pietro; Sabatelli, Mario; Lunetta, Christian; Ludolph, Albert C.; Andersen, Peter M.; Weishaupt, Jochen H.; Camu, William; Trojanowski, John Q.; Van Deerlin, Vivianna M.; Brown, Robert H.; van den Berg, Leonard; Veldink, Jan H.; Harms, Matthew B.; Glass, Jonathan D.; Stone, David J.; Tienari, Pentti; Silani, Vincenzo; Silani, Vincenzo; Chiò, Adriano; Shaw, Christopher E.; Chiò, Adriano; Traynor, Bryan J.; Landers, John E.; Traynor, Bryan J.

2018-01-01

To identify novel genes associated with ALS, we undertook two lines of investigation. We carried out a genome-wide association study comparing 20,806 ALS cases and 59,804 controls. Independently, we performed a rare variant burden analysis comparing 1,138 index familial ALS cases and 19,494
Genes associated with thermosensitive genic male sterility in rice identified by comparative expression profiling.

Science.gov (United States)

Pan, Yufang; Li, Qiaofeng; Wang, Zhizheng; Wang, Yang; Ma, Rui; Zhu, Lili; He, Guangcun; Chen, Rongzhi

2014-12-16

Thermosensitive genic male sterile (TGMS) lines and photoperiod-sensitive genic male sterile (PGMS) lines have been successfully used in hybridization to improve rice yields. However, the molecular mechanisms underlying male sterility transitions in most PGMS/TGMS rice lines are unclear. In the recently developed TGMS-Co27 line, the male sterility is based on co-suppression of a UDP-glucose pyrophosphorylase gene (Ugp1), but further study is needed to fully elucidate the molecular mechanisms involved. Microarray-based transcriptome profiling of TGMS-Co27 and wild-type Hejiang 19 (H1493) plants grown at high and low temperatures revealed that 15462 probe sets representing 8303 genes were differentially expressed in the two lines, under the two conditions, or both. Environmental factors strongly affected global gene expression. Some genes important for pollen development were strongly repressed in TGMS-Co27 at high temperature. More significantly, series-cluster analysis of differentially expressed genes (DEGs) between TGMS-Co27 plants grown under the two conditions showed that low temperature induced the expression of a gene cluster. This cluster was found to be essential for sterility transition. It includes many meiosis stage-related genes that are probably important for thermosensitive male sterility in TGMS-Co27, inter alia: Arg/Ser-rich domain (RS)-containing zinc finger proteins, polypyrimidine tract-binding proteins (PTBs), DEAD/DEAH box RNA helicases, ZOS (C2H2 zinc finger proteins of Oryza sativa), at least one polyadenylate-binding protein and some other RNA recognition motif (RRM) domain-containing proteins involved in post-transcriptional processes, eukaryotic initiation factor 5B (eIF5B), ribosomal proteins (L37, L1p/L10e, L27 and L24), aminoacyl-tRNA synthetases (ARSs), eukaryotic elongation factor Tu (eEF-Tu) and a peptide chain release factor protein involved in translation. The differential expression of 12 DEGs that are important for pollen
Genome-wide RNAi screening identifies genes inhibiting the migration of glioblastoma cells.

Directory of Open Access Journals (Sweden)

Jian Yang

Glioblastoma Multiforme (GBM) cells are highly invasive, infiltrating into the surrounding normal brain tissue, making it impossible to completely eradicate GBM tumors by surgery or radiation. Increasing evidence also shows that these migratory cells are highly resistant to cytotoxic reagents, but decreasing their migratory capability can re-sensitize them to chemotherapy. This evidence suggests that the migratory cell population may serve as a better therapeutic target for more effective treatment of GBM. In order to understand the regulatory mechanism underlying the motile phenotype, we carried out a genome-wide RNAi screen for genes inhibiting the migration of GBM cells. The screening identified a total of twenty-five primary hits; seven of them were confirmed by secondary screening. Further study showed that three of the genes, FLNA, KHSRP and HCFC1, also functioned in vivo, and knocking them down caused multifocal tumors in a mouse model. Interestingly, two genes, KHSRP and HCFC1, were also found to be correlated with the clinical outcome of GBM patients. These two genes have not been previously associated with cell migration.
Gene and MicroRNA transcriptome analysis of Parkinson's related LRRK2 mouse models.

Directory of Open Access Journals (Sweden)

Véronique Dorval

Mutations in leucine-rich repeat kinase 2 (LRRK2) are the most frequent cause of genetic Parkinson's disease (PD). The biological function of LRRK2 and how mutations lead to disease remain poorly defined. It has been proposed that LRRK2 could function in gene transcription regulation; however, this issue remains controversial. Here, we investigated in parallel gene and microRNA (miRNA) transcriptome profiles of three different LRRK2 mouse models. Striatal tissue was isolated from adult LRRK2 knockout (KO) mice, as well as mice expressing human LRRK2 wildtype (hLRRK2-WT) or the PD-associated R1441G mutation (hLRRK2-R1441G). We identified a total of 761 genes and 24 miRNAs that were misregulated in the absence of LRRK2 when a false discovery rate of 0.2 was applied. Notably, most changes in gene expression were modest (i.e., <2 fold). By real-time quantitative RT-PCR, we confirmed the variations of selected genes (e.g., adra2, syt2, opalin) and miRNAs (e.g., miR-16, miR-25). Surprisingly, little or no change in gene expression was observed in mice expressing hLRRK2-WT or hLRRK2-R1441G when compared to non-transgenic controls. Nevertheless, a number of miRNAs were misexpressed in these models. Bioinformatics analysis identified several miRNA-dependent and independent networks dysregulated in LRRK2-deficient mice, including PD-related pathways. These results suggest that brain LRRK2 plays an overall modest role in gene transcription regulation in mammals; however, these effects seem context and RNA type-dependent. Our data thus set the stage for future investigations regarding LRRK2 function in PD development.
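The record above reports genes and miRNAs selected at a false discovery rate of 0.2 but does not say which procedure was used. As an illustration only, the following minimal sketch assumes the common Benjamini-Hochberg step-up procedure; the function name and the toy p-values are hypothetical and are not taken from the study.

```python
def benjamini_hochberg(p_values, fdr=0.2):
    """Return one boolean per p-value: True if it is called significant.

    Assumption: Benjamini-Hochberg step-up control of the false
    discovery rate; the abstract does not name its FDR method.
    """
    m = len(p_values)
    # Indices of the p-values, ordered from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])

    # Find the largest rank k such that p_(k) <= (k / m) * fdr.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * fdr:
            cutoff_rank = rank

    # Everything ranked at or below the cutoff is rejected (step-up rule).
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= cutoff_rank:
            rejected[idx] = True
    return rejected


if __name__ == "__main__":
    toy_p_values = [0.01, 0.04, 0.03, 0.5, 0.9]  # hypothetical values
    print(benjamini_hochberg(toy_p_values, fdr=0.2))
```

A threshold of 0.2 is permissive: among the features called significant, up to about one in five is expected to be a false positive, which is consistent with the exploratory framing of the abstract.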
Patterns of expression of cell wall related genes in sugarcane

Directory of Open Access Journals (Sweden)

Lima D.U.

2001-01-01

Our search for genes related to cell wall metabolism in the sugarcane expressed sequence tag (SUCEST) database (http://sucest.lbi.dcc.unicamp.br) resulted in 3,283 reads (1% of the total reads), which were grouped into 459 clusters (potential genes) with an average of 7.1 reads per cluster. To more clearly display our correlation coefficients, we constructed surface maps which we used to investigate the relationship between cell wall genes and the sugarcane tissue libraries from which they came. The only significant correlations that we found between cell wall genes and/or their expression within particular libraries were neutral or synergetic. Genes related to cellulose biosynthesis were from the CesA family, and were found to be the most abundant cell wall related genes in the SUCEST database. We found that the highest number of CesA reads came from the root and stem libraries. The genes with the greatest number of reads were those encoding cell wall hydrolases (e.g. beta-1,3-glucanases, xyloglucan endo-beta-transglycosylase, beta-glucosidase and endo-beta-mannanase). Correlation analyses by surface mapping revealed that the expression of genes related to biosynthesis seems to be associated with the hydrolysis of hemicelluloses, pectin hydrolases being mainly associated with xyloglucan hydrolases. The patterns of cell wall related gene expression in sugarcane based on the number of reads per cluster reflected quite well the expected physiological characteristics of the tissues. This is the first work to provide a general view on plant cell wall metabolism through the expression of related genes in almost all the tissues of a plant at the same time. For example, developing flowers behaved similarly to both meristematic tissues and leaf-root transition zone tissues. Besides providing a basis for future research on the mechanisms of plant development which involve the cell wall, our findings will provide valuable tools for plant engineering in the
Gene expression analysis identifies new candidate genes associated with the development of black skin spots in Corriedale sheep.

Science.gov (United States)

Peñagaricano, Francisco; Zorrilla, Pilar; Naya, Hugo; Robello, Carlos; Urioste, Jorge I

2012-02-01

The white coat colour of sheep is an important economic trait. For unknown reasons, some animals are born with, and others develop with time, black skin spots that can also produce pigmented fibres. The presence of pigmented fibres in the white wool significantly decreases the fibre quality. The aim of this work was to study gene expression in black spots (with and without pigmented fibres) and white skin by microarray techniques, in order to identify the possible genes involved in the development of this trait. Five unrelated Corriedale sheep were used and, for each animal, the three possible comparisons (three different hybridisations) between the three samples of interest were performed. Differential gene expression patterns were analysed using different t-test approaches. Most of the major genes with well-known roles in skin pigmentation, e.g. ASIP, MC1R and C-KIT, showed no significant difference in the gene expression between white skin and black spots. On the other hand, many of the differentially expressed genes (raw P-value spots. The gene expression of C-FOS and KLF4, transcription factors involved in the cellular response to external factors such as ultraviolet light, was validated by quantitative polymerase chain reaction (PCR). This exploratory study provides a list of candidate genes that could be associated with the development of black skin spots that should be studied in more detail. Characterisation of these genes will enable us to discern the molecular mechanisms involved in the development of this feature and, hence, increase our understanding of melanocyte biology and skin pigmentation. In sheep, understanding this phenomenon is a first step towards developing molecular tools to assist in the selection against the presence of pigmented fibres in white wool.
Cross-species comparison of the gut: Differential gene expression sheds light on biological differences in closely related tenebrionids.

Science.gov (United States)

Oppert, Brenda; Perkin, Lindsey; Martynov, Alexander G; Elpidina, Elena N

2018-04-01

The gut is one of the primary interfaces between an insect and its environment. Understanding gene expression profiles in the insect gut can provide insight into interactions with the environment as well as identify potential control methods for pests. We compared the expression profiles of transcripts from the gut of larval stages of two coleopteran insects, Tenebrio molitor and Tribolium castaneum. These tenebrionids have different life cycles, varying in the duration and number of larval instars. T. castaneum has a sequenced genome and has been a model for coleopterans, and we recently obtained a draft genome for T. molitor. We assembled gut transcriptome reads from each insect to their respective genomes and filtered mapped reads to RPKM>1, yielding 11,521 and 17,871 genes in the T. castaneum and T. molitor datasets, respectively. There were identical GO terms in each dataset, and enrichment analyses also identified shared GO terms. From these datasets, we compiled an ortholog list of 6907 genes; 45% of the total assembled reads from T. castaneum were found in the top 25 orthologs, but only 27% of assembled reads were found in the top 25 T. molitor orthologs. There were 2281 genes unique to T. castaneum, and 2088 predicted genes unique to T. molitor, although improvements to the T. molitor genome will likely reduce these numbers as more orthologs are identified. We highlight a few unique genes in T. castaneum or T. molitor that may relate to distinct biological functions. A large number of putative genes expressed in the larval gut with uncharacterized functions (36 and 68% from T. castaneum and T. molitor, respectively) support the need for further research. These data are the first step in building a comprehensive understanding of the physiology of the gut in tenebrionid insects, illustrating commonalities and differences that may be related to speciation and environmental adaptation. Published by Elsevier Ltd.
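The record above filters mapped reads to RPKM>1. As a reminder of what that threshold means, here is a minimal sketch of the standard RPKM formula (reads per kilobase of transcript per million mapped reads) followed by the same cutoff. The gene names, counts and lengths are hypothetical, and taking the library size as the sum of the listed counts is an assumption made for the example.

```python
def rpkm(read_count, gene_length_bp, total_mapped_reads):
    """Reads per kilobase of transcript per million mapped reads."""
    return read_count * 1e9 / (gene_length_bp * total_mapped_reads)


def filter_expressed(counts, lengths, threshold=1.0):
    """Keep genes whose RPKM exceeds the threshold.

    counts:  dict of gene -> number of mapped reads
    lengths: dict of gene -> gene (or transcript) length in base pairs
    Assumption: the library size is the sum of all counts given here.
    """
    total = sum(counts.values())
    expressed = {}
    for gene, count in counts.items():
        value = rpkm(count, lengths[gene], total)
        if value > threshold:
            expressed[gene] = value
    return expressed


if __name__ == "__main__":
    # Hypothetical toy data, not taken from the study.
    counts = {"geneA": 500, "geneB": 1, "geneC": 999_499}
    lengths = {"geneA": 2000, "geneB": 5000, "geneC": 1000}
    for gene, value in filter_expressed(counts, lengths).items():
        print(gene, round(value, 2))
```

Normalising by both gene length and library size is what makes a single cutoff such as RPKM>1 comparable between two datasets of different sequencing depth.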
Histological analysis and identification of spermatogenesis-related genes in 2-, 6-, and 12-month-old sheep testes

Science.gov (United States)

Bai, Man; Sun, Limin; Zhao, Jia; Xiang, Lujie; Cheng, Xiaoyin; Li, Jiarong; Jia, Chao; Jiang, Huaizhi

2017-10-01

Testis development and spermatogenesis are vital factors that influence male animal fertility. In order to identify spermatogenesis-related genes and further provide a theoretical basis for finding biomarkers related to male sheep fertility, 2-, 6-, and 12-month-old Small Tail Han Sheep testes were selected to investigate the dynamic changes of sheep testis development. Hematoxylin-eosin routine staining and RNA-Seq were used to perform histological and transcriptome analysis of these testes. The results showed that 630, 102, and 322 differentially expressed genes (DEGs) were identified in 2- vs 6-month-old, 6- vs 12-month-old, and 2- vs 12-month-old testes, respectively. GO and KEGG analysis showed the following: DEGs in 2- vs 6-month-old testes were mainly related to the GO terms of sexual maturation and the pathways of multiple metabolism and biosynthesis; in 6- vs 12-month-old testes, most of the GO terms in which DEGs were involved were related to metabolism and translation processes; the most significantly enriched pathway was the ribosome pathway. The union of DEGs in 2- vs 6-month-old, 6- vs 12-month-old, and 2- vs 12-month-old testes was categorized into eight profiles by series cluster. Subsequently, the eight profiles were classified into four model profiles and four co-expression networks were constructed based on the DEGs in these model profiles. Finally, 29 key regulatory genes related to spermatogenesis were identified in the four co-expression networks. The expression of 13 DEGs (CA3, APOH, MYOC, CATSPER4, SYT6, SERPINA10, DAZL, ADIPOR2, RAB13, CEP41, SPAG4, ODF1, and FRG1) was validated by RT-PCR.

Identifying pathogenicity genes in the rubber tree anthracnose fungus Colletotrichum gloeosporioides through random insertional mutagenesis.

Science.gov (United States)

Cai, Zhiying; Li, Guohua; Lin, Chunhua; Shi, Tao; Zhai, Ligang; Chen, Yipeng; Huang, Guixiu

2013-07-19

To gain more insight into the molecular mechanisms of Colletotrichum gloeosporioides pathogenesis, Agrobacterium tumefaciens-mediated transformation (ATMT) was used to identify mutants of C. gloeosporioides impaired in pathogenicity. An ATMT library of 4128 C. gloeosporioides transformants was generated. Transformants were screened for defects in pathogenicity with a detached copper brown leaf assay. Thirty-two mutants showing reproducible pathogenicity defects were obtained. Southern blot analysis showed that 60.4% of the transformants had single-site T-DNA integrations. Sixteen genomic sequences flanking T-DNA were recovered from mutants by thermal asymmetric interlaced PCR, and were used to isolate the tagged genes from the genome sequence of wild-type C. gloeosporioides by Basic Local Alignment Search Tool searches against the local genome database of the wild-type C. gloeosporioides. One potential pathogenicity gene encoded a calcium-translocating P-type ATPase. Six potential pathogenicity genes had no known homologs in filamentous fungi and were likely to be novel fungal virulence factors. Two putative genes encoded a Glycosyltransferase family 28 domain-containing protein and a Mov34/MPN/PAD-1 family protein, respectively. Five potential pathogenicity genes had putative functions matching putative proteins of other Colletotrichum species. Two known C. gloeosporioides pathogenicity genes were also identified: one encoding the Glomerella cingulata hard-surface induced protein and one encoding the C. gloeosporioides regulatory subunit of protein kinase A, which is involved in the cAMP-dependent PKA signal transduction pathway. Copyright © 2013 Elsevier GmbH. All rights reserved.
MVisAGe Identifies Concordant and Discordant Genomic Alterations of Driver Genes in Squamous Tumors.

Science.gov (United States)

Walter, Vonn; Du, Ying; Danilova, Ludmila; Hayward, Michele C; Hayes, D Neil

2018-06-15

Integrated analyses of multiple genomic datatypes are now common in cancer profiling studies. Such data present opportunities for numerous computational experiments, yet analytic pipelines are limited. Tools such as the cBioPortal and Regulome Explorer, although useful, are not easy to access programmatically or to implement locally. Here, we introduce the MVisAGe R package, which allows users to quantify gene-level associations between two genomic datatypes to investigate the effect of genomic alterations (e.g., DNA copy number changes on gene expression). Visualizing Pearson/Spearman correlation coefficients according to the genomic positions of the underlying genes provides a powerful, novel tool for conducting exploratory analyses. We demonstrate its utility by analyzing three publicly available cancer datasets. Our approach highlights canonical oncogenes in chr11q13 that displayed the strongest associations between expression and copy number, including CCND1 and CTTN, genes not identified by copy number analysis in the primary reports. We demonstrate highly concordant usage of shared oncogenes on chr3q, yet strikingly diverse oncogene usage on chr11q as a function of HPV infection status. Regions of chr19 that display remarkable associations between methylation and gene expression were identified, as were previously unreported miRNA-gene expression associations that may contribute to the epithelial-to-mesenchymal transition. Significance: This study presents an important bioinformatics tool that will enable integrated analyses of multiple genomic datatypes. Cancer Res; 78(12); 3375-85. ©2018 AACR. ©2018 American Association for Cancer Research.
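The gene-level association described in this abstract can be illustrated with a short Python sketch. This is not the MVisAGe R package or its API; it only shows the general idea of computing a Pearson correlation between copy number and expression for each gene and ordering the results by genomic position. The function names, gene names, positions, and values are all hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return float("nan")  # correlation is undefined for a constant vector
    return cov / sqrt(var_x * var_y)


def gene_level_correlations(copy_number, expression, positions):
    """Return (gene, position, r) tuples sorted by genomic position.

    copy_number and expression map gene -> per-sample values (same sample order);
    positions maps gene -> genomic coordinate. All inputs are hypothetical.
    """
    rows = []
    for gene in copy_number:
        if gene in expression and gene in positions:
            r = pearson(copy_number[gene], expression[gene])
            rows.append((gene, positions[gene], r))
    return sorted(rows, key=lambda row: row[1])


# Toy data (made up for illustration): four samples, two genes.
copy_number = {"GENE_A": [0.0, 1.0, 2.0, 3.0], "GENE_B": [1.0, 1.0, 2.0, 0.0]}
expression = {"GENE_A": [1.0, 3.0, 5.0, 7.0], "GENE_B": [2.0, 2.0, 0.0, 4.0]}
positions = {"GENE_A": 200, "GENE_B": 100}

result = gene_level_correlations(copy_number, expression, positions)
for gene, pos, r in result:
    print(gene, pos, round(r, 3))
```

Plotting the third column against the second would give a position-ordered view of the correlations, which is the kind of display the abstract describes; a Spearman variant would rank the values first.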
Evaluation of potential regulatory elements identified as DNase I hypersensitive sites in the CFTR gene

DEFF Research Database (Denmark)

Phylactides, M.; Rowntree, R.; Nuthall, H.

2002-01-01

hypersensitive sites (DHS) within the locus. We previously identified at least 12 clusters of DHS across the CFTR gene and here further evaluate DHS in introns 2,3,10,16,17a, 18, 20 and 21 to assess their functional importance in regulation of CFTR gene expression. Transient transfections of enhancer/reporter...
Genome-Wide Gene-Environment Study Identifies Glutamate Receptor Gene GRIN2A as a Parkinson's Disease Modifier Gene via Interaction with Coffee

Science.gov (United States)

Hamza, Taye H.; Chen, Honglei; Hill-Burns, Erin M.; Rhodes, Shannon L.; Montimurro, Jennifer; Kay, Denise M.; Tenesa, Albert; Kusel, Victoria I.; Sheehan, Patricia; Eaaswarkhanth, Muthukrishnan; Yearout, Dora; Samii, Ali; Roberts, John W.; Agarwal, Pinky; Bordelon, Yvette; Park, Yikyung; Wang, Liyong; Gao, Jianjun; Vance, Jeffery M.; Kendler, Kenneth S.; Bacanu, Silviu-Alin; Scott, William K.; Ritz, Beate; Nutt, John; Factor, Stewart A.; Zabetian, Cyrus P.; Payami, Haydeh

2011-01-01

Our aim was to identify genes that influence the inverse association of coffee with the risk of developing Parkinson's disease (PD). We used genome-wide genotype data and lifetime caffeinated-coffee-consumption data on 1,458 persons with PD and 931 without PD from the NeuroGenetics Research Consortium (NGRC), and we performed a genome-wide association and interaction study (GWAIS), testing each SNP's main-effect plus its interaction with coffee, adjusting for sex, age, and two principal components. We then stratified subjects as heavy or light coffee-drinkers and performed genome-wide association study (GWAS) in each group. We replicated the most significant SNP. Finally, we imputed the NGRC dataset, increasing genomic coverage to examine the region of interest in detail. The primary analyses (GWAIS, GWAS, Replication) were performed using genotyped data. In GWAIS, the most significant signal came from rs4998386 and the neighboring SNPs in GRIN2A. GRIN2A encodes an NMDA-glutamate-receptor subunit and regulates excitatory neurotransmission in the brain. Achieving P(2df) = 10^−6, GRIN2A surpassed all known PD susceptibility genes in significance in the GWAIS. In stratified GWAS, the GRIN2A signal was present in heavy coffee-drinkers (OR = 0.43; P = 6×10^−7) but not in light coffee-drinkers. The a priori Replication hypothesis that “Among heavy coffee-drinkers, rs4998386_T carriers have lower PD risk than rs4998386_CC carriers” was confirmed: OR(Replication) = 0.59, P(Replication) = 10^−3; OR(Pooled) = 0.51, P(Pooled) = 7×10^−8. Compared to light coffee-drinkers with rs4998386_CC genotype, heavy coffee-drinkers with rs4998386_CC genotype had 18% lower risk (P = 3×10^−3), whereas heavy coffee-drinkers with rs4998386_TC genotype had 59% lower risk (P = 6×10^−13). Imputation revealed a block of SNPs that achieved P2dfcoffee-drinkers. This study is proof of concept that inclusion of environmental factors can help identify genes that
Genome-wide gene-environment study identifies glutamate receptor gene GRIN2A as a Parkinson's disease modifier gene via interaction with coffee.

Science.gov (United States)

Hamza, Taye H; Chen, Honglei; Hill-Burns, Erin M; Rhodes, Shannon L; Montimurro, Jennifer; Kay, Denise M; Tenesa, Albert; Kusel, Victoria I; Sheehan, Patricia; Eaaswarkhanth, Muthukrishnan; Yearout, Dora; Samii, Ali; Roberts, John W; Agarwal, Pinky; Bordelon, Yvette; Park, Yikyung; Wang, Liyong; Gao, Jianjun; Vance, Jeffery M; Kendler, Kenneth S; Bacanu, Silviu-Alin; Scott, William K; Ritz, Beate; Nutt, John; Factor, Stewart A; Zabetian, Cyrus P; Payami, Haydeh

2011-08-01

Our aim was to identify genes that influence the inverse association of coffee with the risk of developing Parkinson's disease (PD). We used genome-wide genotype data and lifetime caffeinated-coffee-consumption data on 1,458 persons with PD and 931 without PD from the NeuroGenetics Research Consortium (NGRC), and we performed a genome-wide association and interaction study (GWAIS), testing each SNP's main-effect plus its interaction with coffee, adjusting for sex, age, and two principal components. We then stratified subjects as heavy or light coffee-drinkers and performed genome-wide association study (GWAS) in each group. We replicated the most significant SNP. Finally, we imputed the NGRC dataset, increasing genomic coverage to examine the region of interest in detail. The primary analyses (GWAIS, GWAS, Replication) were performed using genotyped data. In GWAIS, the most significant signal came from rs4998386 and the neighboring SNPs in GRIN2A. GRIN2A encodes an NMDA-glutamate-receptor subunit and regulates excitatory neurotransmission in the brain. Achieving P(2df) = 10^-6, GRIN2A surpassed all known PD susceptibility genes in significance in the GWAIS. In stratified GWAS, the GRIN2A signal was present in heavy coffee-drinkers (OR = 0.43; P = 6×10^-7) but not in light coffee-drinkers. The a priori Replication hypothesis that "Among heavy coffee-drinkers, rs4998386_T carriers have lower PD risk than rs4998386_CC carriers" was confirmed: OR(Replication) = 0.59, P(Replication) = 10^-3; OR(Pooled) = 0.51, P(Pooled) = 7×10^-8. Compared to light coffee-drinkers with rs4998386_CC genotype, heavy coffee-drinkers with rs4998386_CC genotype had 18% lower risk (P = 3×10^-3), whereas heavy coffee-drinkers with rs4998386_TC genotype had 59% lower risk (P = 6×10^-13). Imputation revealed a block of SNPs that achieved P(2df)coffee-drinkers. This study is proof of concept that inclusion of environmental factors can help identify
RNA-Seq analysis and annotation of a draft blueberry genome assembly identifies candidate genes involved in fruit ripening, biosynthesis of bioactive compounds, and stage-specific alternative splicing.

Science.gov (United States)

Gupta, Vikas; Estrada, April D; Blakley, Ivory; Reid, Rob; Patel, Ketan; Meyer, Mason D; Andersen, Stig Uggerhøj; Brown, Allan F; Lila, Mary Ann; Loraine, Ann E

2015-01-01

Blueberries are a rich source of antioxidants and other beneficial compounds that can protect against disease. Identifying genes involved in synthesis of bioactive compounds could enable the breeding of berry varieties with enhanced health benefits. Toward this end, we annotated a previously sequenced draft blueberry genome assembly using RNA-Seq data from five stages of berry fruit development and ripening. Genome-guided assembly of RNA-Seq read alignments combined with output from ab initio gene finders produced around 60,000 gene models, of which more than half were similar to proteins from other species, typically the grape Vitis vinifera. Comparison of gene models to the PlantCyc database of metabolic pathway enzymes identified candidate genes involved in synthesis of bioactive compounds, including bixin, an apocarotenoid with potential disease-fighting properties, and defense-related cyanogenic glycosides, which are toxic. Cyanogenic glycoside (CG) biosynthetic enzymes were highly expressed in green fruit, and a candidate CG detoxification enzyme was up-regulated during fruit ripening. Candidate genes for ethylene, anthocyanin, and 400 other biosynthetic pathways were also identified. Homology-based annotation using Blast2GO and InterPro assigned Gene Ontology terms to around 15,000 genes. RNA-Seq expression profiling showed that blueberry growth, maturation, and ripening involve dynamic gene expression changes, including coordinated up- and down-regulation of metabolic pathway enzymes and transcriptional regulators. Analysis of RNA-seq alignments identified developmentally regulated alternative splicing, promoter use, and 3' end formation. We report genome sequence, gene models, functional annotations, and RNA-Seq expression data that provide an important new resource enabling high throughput studies in blueberry.
Candidate genes and pathogenesis investigation for sepsis-related acute respiratory distress syndrome based on gene expression profile.

Science.gov (United States)

Wang, Min; Yan, Jingjun; He, Xingxing; Zhong, Qiang; Zhan, Chengye; Li, Shusheng

2016-04-18

Acute respiratory distress syndrome (ARDS) is a potentially devastating form of acute inflammatory lung injury as well as a major cause of acute respiratory failure. Although researchers have made significant progress in elucidating the pathophysiology of this complex syndrome over the years, the absence of a universal, detailed disease mechanism has led to a series of practical problems for definitive treatment. This study aimed to predict genes or pathways associated with sepsis-related ARDS based on a public microarray dataset and to further explore the molecular mechanism of ARDS. A total of 122 up-regulated and 91 down-regulated differentially expressed genes (DEGs) were obtained. The up- and down-regulated DEGs were mainly involved in functions such as mitotic cell cycle and pathways such as cell cycle. Analysis of the ARDS protein-protein interaction network revealed 20 hub genes including cyclin B1 (CCNB1), cyclin B2 (CCNB2) and topoisomerase II alpha (TOP2A). A total of seven transcription factors, including forkhead box protein M1 (FOXM1), and 30 target genes were revealed in the transcription factor-target gene regulation network. Furthermore, co-cited gene pairs including CCNB2-CCNB1 were revealed by literature mining for relations among ARDS-related genes. Pathways such as mitotic cell cycle were closely related to the development of ARDS. Genes including CCNB1, CCNB2 and TOP2A, as well as transcription factors such as FOXM1, might be used as novel gene therapy targets for sepsis-related ARDS.
Transcriptome analysis of the exocarp of apple fruit identifies light-induced genes involved in red color pigmentation.

Science.gov (United States)

Vimolmangkang, Sornkanok; Zheng, Danman; Han, Yuepeng; Khan, M Awais; Soria-Guerra, Ruth Elena; Korban, Schuyler S

2014-01-15

Although the mechanism of light regulation of color pigmentation of apple fruit is not fully understood, it has been shown that light can regulate expression of genes in the anthocyanin biosynthesis pathway by inducing transcription factors (TFs). Moreover, expression of genes encoding enzymes involved in this pathway may be coordinately regulated by multiple TFs. In this study, fruits on trees of apple cv. Red Delicious were covered with paper bags during early stages of fruit development and then removed prior to maturation to analyze the transcriptome in the exocarp of apple fruit. Comparisons of gene expression profiles of fruit covered with paper bags (dark-grown treatment) and those subjected to 14 h light treatment, following removal of paper bags, were investigated using an apple microarray of 40,000 sequences. Expression profiles were investigated over three time points, at one week intervals, during fruit development. Overall, 736 genes with expression values greater than two-fold were found to be modulated by light treatment. Light-induced products were classified into 19 categories with highest scores in primary metabolism (17%) and transcription (12%). Based on the Arabidopsis gene ontology annotation, 18 genes were identified as TFs. To further confirm expression patterns of flavonoid-related genes, these were subjected to quantitative RT-PCR (qRT-PCR) using fruit of red-skinned apple cv. Red Delicious and yellow-skinned apple cv. Golden Delicious. Of these, two genes showed higher levels of expression in 'Red Delicious' than in 'Golden Delicious', and were likely involved in the regulation of fruit red color pigmentation. © 2013 Elsevier B.V. All rights reserved.
APRIL is a novel clinical chemo-resistance biomarker in colorectal adenocarcinoma identified by gene expression profiling

International Nuclear Information System (INIS)

Petty, Russell D; Wang, Weiguang; Gilbert, Fiona; Semple, Scot; Collie-Duguid, Elaina SR; Samuel, Leslie M; Murray, Graeme I; MacDonald, Graham; O'Kelly, Terrence; Loudon, Malcolm; Binnie, Norman; Aly, Emad; McKinlay, Aileen

2009-01-01

5-Fluorouracil (5FU) and oral analogues, such as capecitabine, remain among the most useful agents for the treatment of colorectal adenocarcinoma. Low toxicity and convenience of administration facilitate use; however, clinical resistance is a major limitation. Investigation has failed to fully explain the molecular mechanisms of resistance and no clinically useful predictive biomarkers for 5FU resistance have been identified. We investigated the molecular mechanisms of clinical 5FU resistance in colorectal adenocarcinoma patients in a prospective biomarker discovery project utilising gene expression profiling. The aim was to identify novel 5FU resistance mechanisms and qualify these as candidate biomarkers and therapeutic targets. Putative treatment specific gene expression changes were identified in a transcriptomics study of rectal adenocarcinomas, biopsied and profiled before and after pre-operative short-course radiotherapy or 5FU based chemo-radiotherapy, using microarrays. Tumour from untreated controls at diagnosis and resection identified treatment-independent gene expression changes. Candidate 5FU chemo-resistant genes were identified by comparison of gene expression data sets from these clinical specimens with gene expression signatures from our previous studies of colorectal cancer cell lines, where parental and daughter lines resistant to 5FU were compared. A colorectal adenocarcinoma tissue microarray (n = 234, resected tumours) was used as an independent set to qualify candidates thus identified. APRIL/TNFSF13 mRNA was significantly upregulated following 5FU based concurrent chemo-radiotherapy and in 5FU resistant colorectal adenocarcinoma cell lines but not in radiotherapy alone treated colorectal adenocarcinomas. Consistent with APRIL's known function as an autocrine or paracrine secreted molecule, stromal but not tumour cell protein expression by immunohistochemistry was correlated with poor prognosis (p = 0.019) in the independent set
Integrative microRNA and proteomic approaches identify novel osteoarthritis genes and their collaborative metabolic and inflammatory networks.

Directory of Open Access Journals (Sweden)

Dimitrios Iliopoulos

BACKGROUND: Osteoarthritis is a multifactorial disease characterized by destruction of the articular cartilage due to genetic, mechanical and environmental components, affecting more than 100 million individuals all over the world. Despite the high prevalence of the disease, the absence of large-scale molecular studies limits our ability to understand the molecular pathobiology of osteoarthritis and identify targets for drug development. METHODOLOGY/PRINCIPAL FINDINGS: In this study we integrated genetic, bioinformatic and proteomic approaches in order to identify new genes and their collaborative networks involved in osteoarthritis pathogenesis. MicroRNA profiling of patient-derived osteoarthritic cartilage in comparison to normal cartilage revealed a 16 microRNA osteoarthritis gene signature. Using reverse-phase protein arrays in the same tissues we detected 76 differentially expressed proteins between osteoarthritic and normal chondrocytes. Proteins such as SOX11, FGF23, KLF6, WWOX and GDF15 not implicated previously in the genesis of osteoarthritis were identified. Integration of microRNA and proteomic data with microRNA gene-target prediction algorithms generated a potential "interactome" network consisting of 11 microRNAs and 58 proteins linked by 414 potential functional associations. Comparison of the molecular and clinical data revealed specific microRNAs (miR-22, miR-103) and proteins (PPARA, BMP7, IL1B) to be highly correlated with Body Mass Index (BMI). Experimental validation revealed that miR-22 regulated PPARA and BMP7 expression and its inhibition blocked inflammatory and catabolic changes in osteoarthritic chondrocytes. CONCLUSIONS/SIGNIFICANCE: Our findings indicate that obesity and inflammation are related to osteoarthritis, a metabolic disease affected by microRNA deregulation. Gene network approaches provide new insights for elucidating the complexity of diseases such as osteoarthritis. The integration of microRNA, proteomic
Transport of Magnesium by a Bacterial Nramp-Related Gene

Science.gov (United States)

Rodionov, Dmitry A.; Freedman, Benjamin G.; Senger, Ryan S.; Winkler, Wade C.

2014-01-01

Magnesium is an essential divalent metal that serves many cellular functions. While most divalent cations are maintained at relatively low intracellular concentrations, magnesium is maintained at a higher level (∼0.5–2.0 mM). Three families of transport proteins were previously identified for magnesium import: CorA, MgtE, and MgtA/MgtB P-type ATPases. In the current study, we find that expression of a bacterial protein unrelated to these transporters can fully restore growth to a bacterial mutant that lacks known magnesium transporters, suggesting it is a new importer for magnesium. We demonstrate that this transport activity is likely to be specific rather than resulting from substrate promiscuity because the proteins are incapable of manganese import. This magnesium transport protein is distantly related to the Nramp family of proteins, which have been shown to transport divalent cations but have never been shown to recognize magnesium. We also find gene expression of the new magnesium transporter to be controlled by a magnesium-sensing riboswitch. Importantly, we find additional examples of riboswitch-regulated homologues, suggesting that they are a frequent occurrence in bacteria. Therefore, our aggregate data discover a new and perhaps broadly important path for magnesium import and highlight how identification of riboswitch RNAs can help shed light on new, and sometimes unexpected, functions of their downstream genes. PMID:24968120
Exome sequencing identifies rare deleterious mutations in DNA repair genes FANCC and BLM as potential breast cancer susceptibility alleles.

Directory of Open Access Journals (Sweden)

Ella R Thompson

2012-09-01

Despite intensive efforts using linkage and candidate gene approaches, the genetic etiology for the majority of families with a multi-generational breast cancer predisposition is unknown. In this study, we used whole-exome sequencing of thirty-three individuals from 15 breast cancer families to identify potential predisposing genes. Our analysis identified families with heterozygous, deleterious mutations in the DNA repair genes FANCC and BLM, which are responsible for the autosomal recessive disorders Fanconi Anemia and Bloom syndrome. In total, screening of all exons in these genes in 438 breast cancer families identified three with truncating mutations in FANCC and two with truncating mutations in BLM. Additional screening of FANCC mutation hotspot exons identified one pathogenic mutation among an additional 957 breast cancer families. Importantly, none of the deleterious mutations were identified among 464 healthy controls and are not reported in the 1,000 Genomes data. Given the rarity of Fanconi Anemia and Bloom syndrome disorders among Caucasian populations, the finding of multiple deleterious mutations in these critical DNA repair genes among high-risk breast cancer families is intriguing and suggestive of a predisposing role. Our data demonstrate the utility of intra-family exome-sequencing approaches to uncover cancer predisposition genes, but highlight the major challenge of definitively validating candidates where the incidence of sporadic disease is high, germline mutations are not fully penetrant, and individual predisposition genes may only account for a tiny proportion of breast cancer families.
Gene Unprediction with Spurio: A tool to identify spurious protein sequences.

Science.gov (United States)

Höps, Wolfram; Jeffryes, Matt; Bateman, Alex

2018-01-01

We now have access to the sequences of tens of millions of proteins. These protein sequences are essential for modern molecular biology and computational biology. The vast majority of protein sequences are derived from gene prediction tools and have no experimental supporting evidence for their translation. Despite the increasing accuracy of gene prediction tools, there likely exist a large number of spurious protein predictions in the sequence databases. We have developed the Spurio tool to help identify spurious protein predictions in prokaryotes. Spurio searches the query protein sequence against a prokaryotic nucleotide database using tblastn and identifies homologous sequences. The tblastn matches are used to score the query sequence's likelihood of being a spurious protein prediction using a Gaussian process model. The most informative feature is the appearance of stop codons within the presumed translation of homologous DNA sequences. Benchmarking shows that the Spurio tool is able to distinguish spurious from true proteins. However, transposon proteins are prone to be predicted as spurious because of the frequency of degraded homologs found in the DNA sequence databases. Our initial experiments suggest that less than 1% of the proteins in the UniProtKB sequence database are likely to be spurious and that Spurio is able to identify over 60 times more spurious proteins than the AntiFam resource. The Spurio software and source code are available under an MIT license at the following URL: https://bitbucket.org/bateman-group/spurio.
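The stop-codon feature mentioned in this abstract can be illustrated with a minimal Python sketch. It is not Spurio's code and does not reproduce its tblastn search or Gaussian process scoring; it only counts in-frame stop codons in a DNA string, assuming the standard genetic code. The function name `count_internal_stops` and the example sequences are hypothetical.

```python
# Stop codons of the standard genetic code (an assumption; some prokaryotes
# use alternative codes in which TGA is not a stop).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def count_internal_stops(dna, frame=0):
    """Count in-frame stop codons in dna, ignoring a single terminal stop.

    frame is the 0-based offset (0, 1 or 2) at which reading starts.
    Trailing bases that do not fill a complete codon are ignored.
    """
    seq = dna.upper()
    codons = [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]
    # A stop at the very end is the normal end of a coding region.
    if codons and codons[-1] in STOP_CODONS:
        codons = codons[:-1]
    return sum(1 for codon in codons if codon in STOP_CODONS)


# Made-up sequences for illustration.
clean = count_internal_stops("ATGAAATAA")            # ATG AAA TAA
broken = count_internal_stops("ATGTAGAAATGACCC")     # ATG TAG AAA TGA CCC
shifted = count_internal_stops("AATGTAGAAA", frame=1)  # ATG TAG AAA
print(clean, broken, shifted)
```

A region homologous to a genuine protein would be expected to show few or no internal stops, whereas many internal stops across homologs would point toward a spurious prediction, which is the intuition the abstract gives for this feature.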
Butyrate induces profound changes in gene expression related to multiple signal pathways in bovine kidney epithelial cells

Directory of Open Access Journals (Sweden)

Li CongJun

2006-09-01

Background: Global gene expression profiles of bovine kidney epithelial cells regulated by sodium butyrate were investigated with high-density oligonucleotide microarrays. The bovine microarray with 86,191 distinct 60mer oligonucleotides, each with 4 replicates, was designed and produced with Maskless Array Synthesizer technology. These oligonucleotides represent approximately 45,383 unique cattle sequences. Results: A total of 450 genes significantly regulated by butyrate with a median False Discovery Rate (FDR) = 0% were identified. The majority of these genes were repressed by butyrate and associated with cell cycle control. The expression levels of 30 selected genes identified by the microarray were confirmed using real-time PCR. The results from real-time PCR positively correlated (R = 0.867) with the results from the microarray. Conclusion: This study presented the genes related to multiple signal pathways such as cell cycle control and apoptosis. The profound changes in gene expression elucidate the molecular basis for the pleiotropic effects of butyrate on biological processes. These findings enable better recognition of the full range of beneficial roles butyrate may play during cattle energy metabolism, cell growth and proliferation, and possibly in fighting gastrointestinal pathogens.
A systems biology pipeline identifies new immune and disease related molecular signatures and networks in human cells during microgravity exposure.

Science.gov (United States)

Mukhopadhyay, Sayak; Saha, Rohini; Palanisamy, Anbarasi; Ghosh, Madhurima; Biswas, Anupriya; Roy, Saheli; Pal, Arijit; Sarkar, Kathakali; Bagh, Sangram

2016-05-17

Microgravity is a prominent health hazard for astronauts, yet we understand little about its effect at the molecular systems level. In this study, we have integrated a set of systems-biology tools and databases and have analysed more than 8000 molecular pathways on published global gene expression datasets of human cells in microgravity. Hundreds of new pathways have been identified with statistical confidence for each dataset and, despite the difference in cell types and experiments, around 100 of the new pathways appear common across the datasets. They are related to reduced inflammation, autoimmunity, diabetes and asthma. We have identified downregulation of the NfκB pathway via Notch1 signalling as a new pathway for reduced immunity in microgravity. Induction of a few cancer types, including liver cancer and leukaemia, and increased drug response to cancer in microgravity are also found. An increase in olfactory signal transduction is also identified. Genes are clustered based on their expression pattern, and mathematically stable clusters are identified. The network mapping of genes within a cluster indicates the plausible functional connections in microgravity. This pipeline gives a new systems-level picture of human cells under microgravity, generates testable hypotheses and may help in estimating risk and developing medicine for space missions.
Identifying Adverse Drug Events by Relational Learning.

Science.gov (United States)

Page, David; Costa, Vítor Santos; Natarajan, Sriraam; Barnard, Aubrey; Peissig, Peggy; Caldwell, Michael

2012-07-01

The pharmaceutical industry, consumer protection groups, users of medications and government oversight agencies are all strongly interested in identifying adverse reactions to drugs. While a clinical trial of a drug may use only a thousand patients, once a drug is released on the market it may be taken by millions of patients. As a result, in many cases adverse drug events (ADEs) are observed in the broader population that were not identified during clinical trials. Therefore, there is a need for continued, post-marketing surveillance of drugs to identify previously unanticipated ADEs. This paper casts this problem as a reverse machine learning task, related to relational subgroup discovery, and provides an initial evaluation of this approach based on experiments with an actual EMR/EHR and known adverse drug events.
Liver regeneration signature in hepatitis B virus (HBV)-associated acute liver failure identified by gene expression profiling.

Directory of Open Access Journals (Sweden)

Oriel Nissim

The liver has inherent regenerative capacity via mitotic division of mature hepatocytes or, when the hepatic loss is massive or hepatocyte proliferation is impaired, through activation of hepatic stem/progenitor cells (HSPC). The dramatic clinical course of acute liver failure (ALF) has posed major limitations to investigating the molecular mechanisms of liver regeneration and the role of HSPC in this setting. We investigated the molecular mechanisms of liver regeneration in 4 patients who underwent liver transplantation for hepatitis B virus (HBV)-associated ALF. Gene expression profiling of 17 liver specimens from the 4 ALF cases and individual specimens from 10 liver donors documented a distinct gene signature for ALF. However, unsupervised multidimensional scaling and hierarchical clustering identified two clusters of ALF that segregated according to histopathological severity: massive hepatic necrosis (MHN; 2 patients) and submassive hepatic necrosis (SHN; 2 patients). We found that ALF is characterized by a strong HSPC gene signature, along with ductular reaction, both of which are more prominent in MHN. Interestingly, no evidence of further lineage differentiation was seen in MHN, whereas in SHN we detected cells with hepatocyte-like morphology. Strikingly, ALF was associated with a strong tumorigenesis gene signature. MHN had the greatest upregulation of stem cell genes (EpCAM, CK19, CK7), whereas the most up-regulated genes in SHN were related to cellular growth and proliferation. The extent of liver necrosis correlated with an overriding fibrogenesis gene signature, reflecting the wound-healing process. Our data provide evidence for a distinct gene signature in HBV-associated ALF whose intensity is directly correlated with the histopathological severity. HSPC activation and fibrogenesis positively correlated with the extent of liver necrosis. Moreover, we detected a tumorigenesis gene signature in ALF, emphasizing the close relationship between
Drug-related problems identified in medication reviews by Australian pharmacists

DEFF Research Database (Denmark)

Stafford, Andrew C; Tenni, Peter C; Peterson, Gregory M

2009-01-01

OBJECTIVE: In Australia, accredited pharmacists perform medication reviews for patients to identify and resolve drug-related problems. We analysed the drug-related problems identified in reviews for both home-dwelling and residential care-facility patients. The objective of this study was to exam... These reviews had been self-selected by pharmacists and submitted as part of the reaccreditation process to the primary body responsible for accrediting Australian pharmacists to perform medication reviews. The drug-related problems identified in each review were classified by type and drugs involved. MAIN OUTCOME MEASURE: The number and nature of drug-related problems identified in pharmacist-conducted medication reviews. RESULTS: There were 1,038 drug-related problems identified in 234 medication reviews (mean 4.6 (+/-2.2) problems per review). The number of problems was higher (4.9 +/- 2.0 vs. 3.9 +/- 2...
Association Analysis Suggests SOD2 as a Newly Identified Candidate Gene Associated With Leprosy Susceptibility.

Science.gov (United States)

Ramos, Geovana Brotto; Salomão, Heloisa; Francio, Angela Schneider; Fava, Vinícius Medeiros; Werneck, Renata Iani; Mira, Marcelo Távora

2016-08-01

Genetic studies have identified several genes and genomic regions contributing to the control of host susceptibility to leprosy. Here, we test variants of the positional and functional candidate gene SOD2 for association with leprosy in 2 independent population samples. Family-based analysis revealed an association between leprosy and allele G of marker rs295340 (P = .042) and borderline evidence of an association between leprosy and alleles C and A of markers rs4880 (P = .077) and rs5746136 (P = .071), respectively. Findings were validated in an independent case-control sample for markers rs295340 (P = .049) and rs4880 (P = .038). These results suggest SOD2 as a newly identified gene conferring susceptibility to leprosy. © The Author 2016. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail journals.permissions@oup.com.

Analysis of pools of targeted Salmonella deletion mutants identifies novel genes affecting fitness during competitive infection in mice.

Directory of Open Access Journals (Sweden)

Carlos A Santiviago

2009-07-01

Pools of mutants of minimal complexity but maximal coverage of genes of interest facilitate screening for genes under selection in a particular environment. We constructed individual deletion mutants in 1,023 Salmonella enterica serovar Typhimurium genes, including almost all genes found in Salmonella but not in related genera. All mutations were confirmed simultaneously using a novel amplification strategy to produce labeled RNA from a T7 RNA polymerase promoter, introduced during the construction of each mutant, followed by hybridization of this labeled RNA to a Typhimurium genome tiling array. To demonstrate the ability to identify fitness phenotypes using our pool of mutants, the pool was subjected to selection by intraperitoneal injection into BALB/c mice and subsequent recovery from spleens. Changes in the representation of each mutant were monitored using T7 transcripts hybridized to a novel inexpensive minimal microarray. Among the top 120 statistically significant spleen colonization phenotypes, more than 40 were mutations in genes with no previously known role in this model. Fifteen phenotypes were tested using individual mutants in competitive assays of intraperitoneal infection in mice and eleven were confirmed, including the first two examples of attenuation for sRNA mutants in Salmonella. We refer to the method as Array-based analysis of cistrons under selection (ABACUS).
Cloning of the relative genes of endocrine exophthalmos

International Nuclear Information System (INIS)

Zheng, JG

2004-01-01

Aim: To clarify the pathogenesis of endocrine exophthalmos and lay foundations for finding new functions of its related genes, these genes were cloned. Methods: Thyroid tissues were obtained from 10 hyperthyroidism patients, 5 with endocrine exophthalmos and 5 without. mRNA was collected from each sample using the Quick Prep Micro mRNA purification kit. Equal amounts of mRNA from the 5 patients with endocrine exophthalmos were combined in an Eppendorf tube to form one mRNA pool, and mRNA from the 5 patients without endocrine exophthalmos was pooled in the same way. Each pool was used as a template to synthesize single- and double-stranded cDNA with the SMART PCR cDNA Synthesis Kit. The double-stranded cDNA from the endocrine exophthalmos patients, used as tester, and that from the patients without endocrine exophthalmos, used as driver, were digested with the restriction endonuclease Hae III to obtain fragments shorter than 500 bases. The tester cDNA was ligated with adaptor 1 or adaptor 2. Suppression subtractive hybridization was then performed between tester and driver cDNA, and the efficiency of subtraction was measured. The genes differing between thyroid tissues with and without endocrine exophthalmos were obtained through two cycles of subtractive hybridization and two cycles of PCR. These genes were cloned into the pT-Adv vector and then transformed into E. coli DH5α. Forty-eight white colonies were selected to build the suppression subtractive library of genes related to endocrine exophthalmos. Primer 2 was used for colony PCR of these genes. The amplified genes were purified using the QIAquick Spin PCR Purification Kit. Following the random-primer principle, the double-stranded cDNA from the thyroid tissues with or without endocrine exophthalmos was digested with Hae III
Identification of flowering-related genes responsible for differences in bolting time between two radish inbred lines

Directory of Open Access Journals (Sweden)

Hye Sun Cho

2016-12-01

Late bolting after cold exposure is an economically important characteristic of radish (Raphanus sativus L.), an important Brassicaceae root vegetable crop. However, little information is available regarding the genes and pathways that govern flowering time in this species. We performed high-throughput RNA sequencing analysis to elucidate the molecular mechanisms that determine the differences in flowering times between two radish lines, NH-JS1 (late bolting) and NH-JS2 (early bolting). In total, 71,188 unigenes were identified by reference-guided assembly, of which 309, 788, and 980 genes were differentially expressed between the two inbred lines after 0, 15, and 35 days of vernalization, respectively. Among these genes, 218 homologs of Arabidopsis flowering-time (Ft) genes were identified in the radish, and 49 of these genes were differentially expressed between the two radish lines in the presence or absence of vernalization treatment. Most of the Ft genes up-regulated in NH-JS1 vs NH-JS2 were repressors of flowering, such as RsFLC, consistent with the late-bolting phenotype of NH-JS1. Although the functions of genes down-regulated in NH-JS1 were less consistent with late-bolting characteristics than the up-regulated Ft genes, several Ft enhancer genes, including RsSOC1, a key floral integrator, showed expression consistent with the late-bolting phenotype. In addition, the patterns of gene expression related to the vernalization pathway closely corresponded with the different bolting times of the two inbred lines. These results suggest that the vernalization pathway is conserved between radish and Arabidopsis.
Expression profiling of Crambe abyssinica under arsenate stress identifies genes and gene networks involved in arsenic metabolism and detoxification

Directory of Open Access Journals (Sweden)

Kandasamy Suganthi

2010-06-01

Background: Arsenic contamination is widespread throughout the world and this toxic metalloid is known to cause cancers of organs such as liver, kidney, skin, and lung in humans. In spite of a recent surge in arsenic-related studies, we are still far from a comprehensive understanding of arsenic uptake, detoxification, and sequestration in plants. Crambe abyssinica, commonly known as 'abyssinian mustard', is a non-food, high biomass oil seed crop that is naturally tolerant to heavy metals. Moreover, it accumulates significantly higher levels of arsenic as compared to other species of the Brassicaceae family. Thus, C. abyssinica has great potential to be utilized as an ideal inedible crop for phytoremediation of heavy metals and metalloids. However, the mechanism of arsenic metabolism in higher plants, including C. abyssinica, remains elusive. Results: To identify the differentially expressed transcripts and the pathways involved in arsenic metabolism and detoxification, C. abyssinica plants were subjected to arsenate stress and a PCR-Select Suppression Subtraction Hybridization (SSH) approach was employed. A total of 105 differentially expressed subtracted cDNAs were sequenced which were found to represent 38 genes. Those genes encode proteins functioning as antioxidants, metal transporters, reductases, enzymes involved in the protein degradation pathway, and several novel uncharacterized proteins. The transcripts corresponding to the subtracted cDNAs showed strong upregulation by arsenate stress as confirmed by the semi-quantitative RT-PCR. Conclusions: Our study revealed novel insights into the plant defense mechanisms and the regulation of genes and gene networks in response to arsenate toxicity. The differential expression of transcripts encoding glutathione-S-transferases, antioxidants, sulfur metabolism, heat-shock proteins, metal transporters, and enzymes in the ubiquitination pathway of protein degradation as well as several unknown
Expression profiling of Crambe abyssinica under arsenate stress identifies genes and gene networks involved in arsenic metabolism and detoxification

Science.gov (United States)

2010-01-01

Background: Arsenic contamination is widespread throughout the world and this toxic metalloid is known to cause cancers of organs such as liver, kidney, skin, and lung in humans. In spite of a recent surge in arsenic-related studies, we are still far from a comprehensive understanding of arsenic uptake, detoxification, and sequestration in plants. Crambe abyssinica, commonly known as 'abyssinian mustard', is a non-food, high biomass oil seed crop that is naturally tolerant to heavy metals. Moreover, it accumulates significantly higher levels of arsenic as compared to other species of the Brassicaceae family. Thus, C. abyssinica has great potential to be utilized as an ideal inedible crop for phytoremediation of heavy metals and metalloids. However, the mechanism of arsenic metabolism in higher plants, including C. abyssinica, remains elusive. Results: To identify the differentially expressed transcripts and the pathways involved in arsenic metabolism and detoxification, C. abyssinica plants were subjected to arsenate stress and a PCR-Select Suppression Subtraction Hybridization (SSH) approach was employed. A total of 105 differentially expressed subtracted cDNAs were sequenced which were found to represent 38 genes. Those genes encode proteins functioning as antioxidants, metal transporters, reductases, enzymes involved in the protein degradation pathway, and several novel uncharacterized proteins. The transcripts corresponding to the subtracted cDNAs showed strong upregulation by arsenate stress as confirmed by the semi-quantitative RT-PCR. Conclusions: Our study revealed novel insights into the plant defense mechanisms and the regulation of genes and gene networks in response to arsenate toxicity. The differential expression of transcripts encoding glutathione-S-transferases, antioxidants, sulfur metabolism, heat-shock proteins, metal transporters, and enzymes in the ubiquitination pathway of protein degradation as well as several unknown novel proteins serve as
Transcriptomic meta-analysis identifies gene expression characteristics in various samples of HIV-infected patients with nonprogressive disease.

Science.gov (United States)

Zhang, Le-Le; Zhang, Zi-Ning; Wu, Xian; Jiang, Yong-Jun; Fu, Ya-Jing; Shang, Hong

2017-09-12

A small proportion of HIV-infected patients remain clinically and/or immunologically stable for years, including elite controllers (ECs) who have undetectable viremia (10 years). However, the mechanism of nonprogression needs to be further resolved. In this study, a transcriptome meta-analysis was performed on nonprogressor and progressor microarray data to identify differential transcriptome pathways and potential biomarkers. Using the INMEX (integrative meta-analysis of expression data) program, we performed the meta-analysis to identify consistently differentially expressed genes (DEGs) in nonprogressors and further performed functional interpretation (gene ontology analysis and pathway analysis) of the DEGs identified in the meta-analysis. Five microarray datasets (81 cases and 98 controls in total), including whole blood, CD4+ and CD8+ T cells, were collected for meta-analysis. We determined that nonprogressors have reduced expression of important interferon-stimulated genes (ISGs), CD38, and lymphocyte activation gene 3 (LAG-3) in whole blood, CD4+ and CD8+ T cells. Gene ontology (GO) analysis showed a significant enrichment in DEGs that function in the type I interferon signaling pathway. Upregulated pathways, including the PI3K-Akt signaling pathway in whole blood, cytokine-cytokine receptor interaction in CD4+ T cells and the MAPK signaling pathway in CD8+ T cells, were identified in nonprogressors compared with progressors. In each metabolic functional category, the number of downregulated DEGs exceeded the number of upregulated DEGs, and almost all genes were downregulated DEGs in the oxidative phosphorylation (OXPHOS) and tricarboxylic acid (TCA) cycle in the three types of samples. Our transcriptomic meta-analysis provides a comprehensive evaluation of the gene expression profiles in major blood types of nonprogressors, providing new insights in the understanding of HIV pathogenesis and developing strategies to delay HIV disease progression.
De Novo Transcriptome Sequencing of Olea europaea L. to Identify Genes Involved in the Development of the Pollen Tube.

Science.gov (United States)

Iaria, Domenico; Chiappetta, Adriana; Muzzalupo, Innocenzo

2016-01-01

In olive (Olea europaea L.), the processes controlling self-incompatibility are still unclear and the molecular basis underlying these processes is still not fully characterized. In order to determine compatibility relationships, using next-generation sequencing techniques and a de novo transcriptome assembly strategy, we show that pollen tubes from different olive plants, grown in vitro in a medium containing its own pistil and in combination pollen/pistil from self-sterile and self-fertile cultivars, have a distinct gene expression profile and many of the differentially expressed sequences between the samples fall within gene families involved in the development of the pollen tube, such as lipase, carboxylesterase, pectinesterase, pectin methylesterase, and callose synthase. Moreover, different genes involved in signal transduction, transcription, and growth are overrepresented. The analysis also allowed us to identify members of the actin, actin depolymerization factor, and fibrin gene families and a member of the Ca(2+)-binding gene family related to the development and polarization of the pollen apical tip. The whole transcriptomic analysis, through the identification of the differentially expressed transcripts set and an extended functional annotation analysis, will lead to a better understanding of the mechanisms of pollen germination and pollen tube growth in the olive.
Global Gene-Expression Analysis to Identify Differentially Expressed Genes Critical for the Heat Stress Response in Brassica rapa.

Directory of Open Access Journals (Sweden)

Xiangshu Dong

Genome-wide dissection of the heat stress response (HSR) is necessary to overcome problems in crop production caused by global warming. To identify HSR genes, we profiled gene expression in two Chinese cabbage inbred lines with different thermotolerances, Chiifu and Kenshin. Many genes exhibited >2-fold changes in expression upon exposure to 0.5-4 h at 45°C (high temperature, HT): 5.2% (2,142 genes) in Chiifu and 3.7% (1,535 genes) in Kenshin. The most enriched GO (Gene Ontology) items included 'response to heat', 'response to reactive oxygen species (ROS)', 'response to temperature stimulus', 'response to abiotic stimulus', and 'MAPKKK cascade'. In both lines, the genes most highly induced by HT encoded small heat shock proteins (Hsps) and heat shock factor (Hsf)-like proteins such as HsfB2A (Bra029292), whereas high-molecular weight Hsps were constitutively expressed. Other upstream HSR components were also up-regulated: ROS-scavenging genes like glutathione peroxidase 2 (BrGPX2, Bra022853), protein kinases, and phosphatases. Among heat stress (HS) marker genes in Arabidopsis, only exportin 1A (XPO1A) (Bra008580, Bra006382) can be applied to B. rapa as a basal thermotolerance (BT) and short-term acquired thermotolerance (SAT) gene. CYP707A3 (Bra025083, Bra021965), which is involved in the dehydration response in Arabidopsis, was associated with membrane leakage in both lines following HS. Although many transcription factor (TF) genes, including DREB2A (Bra005852), were involved in HS tolerance in both lines, Bra024224 (MYB41) and Bra021735 (a bZIP/AIR1 [Anthocyanin-Impaired-Response-1]) were specific to Kenshin. Several candidate TFs involved in thermotolerance were confirmed as HSR genes by real-time PCR, and these assignments were further supported by promoter analysis. Although some of our findings are similar to those obtained using other plant species, clear differences in Brassica rapa reveal a distinct HSR in this species. Our data could also provide a
A Genome-Wide Screen for Dendritically Localized RNAs Identifies Genes Required for Dendrite Morphogenesis

Directory of Open Access Journals (Sweden)

Mala Misra

2016-08-01

Localizing messenger RNAs at specific subcellular sites is a conserved mechanism for targeting the synthesis of cytoplasmic proteins to distinct subcellular domains, thereby generating the asymmetric protein distributions necessary for cellular and developmental polarity. However, the full range of transcripts that are asymmetrically distributed in specialized cell types, and the significance of their localization, especially in the nervous system, are not known. We used the EP-MS2 method, which combines EP transposon insertion with the MS2/MCP in vivo fluorescent labeling system, to screen for novel localized transcripts in polarized cells, focusing on the highly branched Drosophila class IV dendritic arborization neurons. Of a total of 541 lines screened, we identified 55 EP-MS2 insertions producing transcripts that were enriched in neuronal processes, particularly in dendrites. The 47 genes identified by these insertions encode molecularly diverse proteins, and are enriched for genes that function in neuronal development and physiology. RNAi-mediated knockdown confirmed roles for many of the candidate genes in dendrite morphogenesis. We propose that the transport of mRNAs encoded by these genes into the dendrites allows their expression to be regulated on a local scale during the dynamic developmental processes of dendrite outgrowth, branching, and/or remodeling.
Expression Analysis of Genes Related to Rice Resistance Against Brown Planthopper, Nilaparvata lugens

Directory of Open Access Journals (Sweden)

Panatda Jannoey

2017-05-01

Brown planthopper (BPH) is an insect species that feeds on the vascular system of rice plants. To examine the defence mechanism of rice plants against BPH, the pathogenesis-related genes (PR1a, PR2, PR3, PR4, PR6, PR9, PR10a, PR13, PR15 and PRpha), signaling molecule synthesis genes (AOS, AXR, ACO and LOX), antioxidant-related genes (CAT, TRX, GST and SOD) and lignin biosynthesis-related genes (CHS, CHI and C4H) were investigated in a resistant rice variety. The AOS, PR6, PR9 and PR15 genes showed significantly increased relative expression levels of 24.38-, 19.17-, 14.71-, and 12.74-fold compared to the control. Moderately increased relative expression levels of the lignin biosynthesis-related gene (C4H), pathogenesis-related genes (PR4, PR10a and PRpha), and antioxidant-related gene (GST) were found, while CHI, LOX, SOD, TRX1 and AXR showed decreased relative expression levels. It was thus clearly shown that wound-induced response genes were activated in rice plants after BPH attacks through AOS activation. The jasmonic acid signaling molecule may activate PR6, PR15, GST and CAT, subsequently increasing their expression for H2O2 detoxification. PR6 was expressed at the highest relative level among the PR genes. These genes therefore also have a considerable synergistic role with the other genes against BPH by interfering with its digestive tract.
Associations between Single-Nucleotide Polymorphisms in Corticotropin-Releasing Hormone-Related Genes and Irritable Bowel Syndrome.

Directory of Open Access Journals (Sweden)

Ayaka Sasaki

Irritable bowel syndrome (IBS) is a common functional disorder with distinct features of stress-related pathophysiology. A key mediator of the stress response is corticotropin-releasing hormone (CRH). Although some candidate genes have been identified in stress-related disorders, few studies have examined CRH-related gene polymorphisms. Therefore, we tested our hypothesis that single-nucleotide polymorphisms (SNPs) in CRH-related genes influence the features of IBS. In total, 253 individuals (123 men and 130 women) participated in this study. They comprised 111 IBS individuals and 142 healthy controls. The SNP genotypes in CRH (rs28364015 and rs6472258) and CRH-binding protein (CRH-BP) (rs10474485) were determined by direct sequencing and real-time polymerase chain reaction. The emotional states of the subjects were evaluated using the State-Trait Anxiety Inventory, Perceived Stress Scale, and the Self-rating Depression Scale. Direct sequencing of the rs28364015 SNP of CRH revealed no genetic variation among the study subjects. There was no difference in the genotype distributions and allele frequencies of rs6472258 and rs10474485 between IBS individuals and controls. However, IBS subjects with diarrhea symptoms without the rs10474485 A allele showed a significantly higher emotional state score than carriers. These results suggest that the CRH and CRH-BP genes have no direct effect on IBS status. However, the CRH-BP SNP rs10474485 has some effect on IBS-related emotional abnormalities and resistance to psychosocial stress.
Expressed sequences tags of the anther smut fungus, Microbotryum violaceum, identify mating and pathogenicity genes

Directory of Open Access Journals (Sweden)

Devier Benjamin

2007-08-01

Background: The basidiomycete fungus Microbotryum violaceum is responsible for the anther-smut disease in many plants of the Caryophyllaceae family and is a model in genetics and evolutionary biology. Infection is initiated by dikaryotic hyphae produced after the conjugation of two haploid sporidia of opposite mating type. This study describes M. violaceum ESTs corresponding to nuclear genes expressed during conjugation and early hyphal production. Results: A normalized cDNA library generated 24,128 sequences, which were assembled into 7,765 unique genes; 25.2% of them displayed significant similarity to annotated proteins from other organisms, 74.3% a weak similarity to the same set of known proteins, and 0.5% were orphans. We identified putative pheromone receptors and genes that in other fungi are involved in the mating process. We also identified many sequences similar to genes known to be involved in pathogenicity in other fungi. The M. violaceum EST database, MICROBASE, is available on the Web and provides access to the sequences, assembled contigs, annotations and programs to compare similarities against MICROBASE. Conclusion: This study provides a basis for cloning the mating type locus, for further investigation of pathogenicity genes in the anther smut fungi, and for comparative genomics.
The animal sialyltransferases and sialyltransferase-related genes: a phylogenetic approach.

Science.gov (United States)

Harduin-Lepers, Anne; Mollicone, Rosella; Delannoy, Philippe; Oriol, Rafael

2005-08-01

The animal sialyltransferases are Golgi type II transmembrane glycosyltransferases. Twenty distinct sialyltransferases have been identified in both human and murine genomes. These enzymes catalyze transfer of sialic acid from CMP-Neu5Ac to the glycan moiety of glycoconjugates. Despite low overall identities, they share four conserved peptide motifs [L (large), S (small), motif III, and motif VS (very small)] that are hallmarks for sialyltransferase identification. We have identified 155 new putative genes in 25 animal species, and we have exploited two lines of evidence: (1) sequence comparisons and (2) exon-intron organization of the genes. An ortholog to the ancestor present before the split of ST6Gal I and II subfamilies was detected in arthropods. An ortholog to the ancestor present before the split of ST6GalNAc III, IV, V, and VI subfamilies was detected in sea urchin. An ortholog to the ancestor present before the split of ST3Gal I and II subfamilies was detected in ciona, and an ortholog to the ancestor of all the ST8Sia was detected in amphioxus. Therefore, single examples of the four families (ST3Gal, ST6Gal, ST6GalNAc, and ST8Sia) have appeared in invertebrates, earlier than previously thought, whereas the four families were all detected in bony fishes, amphibians, birds, and mammals. As previously hypothesized, sequence similarities among sialyltransferases suggest a common genetic origin, by successive duplications of an ancestral gene, followed by divergent evolution. Finally, we propose predictions on these invertebrates sialyltransferase-related activities that have not previously been demonstrated and that will ultimately need to be substantiated by protein expression and enzymatic activity assays.
Meta-analysis of Drosophila circadian microarray studies identifies a novel set of rhythmically expressed genes.

Directory of Open Access Journals (Sweden)

Kevin P Keegan

2007-11-01

Five independent groups have reported microarray studies that identify dozens of rhythmically expressed genes in the fruit fly Drosophila melanogaster. Limited overlap among the lists of discovered genes makes it difficult to determine which, if any, exhibit truly rhythmic patterns of expression. We reanalyzed data from all five reports and found two sources for the observed discrepancies, the use of different expression pattern detection algorithms and underlying variation among the datasets. To improve upon the methods originally employed, we developed a new analysis that involves compilation of all existing data, application of identical transformation and standardization procedures followed by ANOVA-based statistical prescreening, and three separate classes of post hoc analysis: cross-correlation to various cycling waveforms, autocorrelation, and a previously described fast Fourier transform-based technique. Permutation-based statistical tests were used to derive significance measures for all post hoc tests. We find application of our method, most significantly the ANOVA prescreening procedure, significantly reduces the false discovery rate relative to that observed among the results of the original five reports while maintaining desirable statistical power. We identify a set of 81 cycling transcripts previously found in one or more of the original reports as well as a novel set of 133 transcripts not found in any of the original studies. We introduce a novel analysis method that compensates for variability observed among the original five Drosophila circadian array reports. Based on the statistical fidelity of our meta-analysis results, and the results of our initial validation experiments (quantitative RT-PCR), we predict many of our newly found genes to be bona fide cyclers, and suggest that they may lead to new insights into the pathways through which clock mechanisms regulate behavioral rhythms.
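The combination of cross-correlation to a cycling waveform with a permutation-based significance test, as named in the abstract above, can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names, the 24 h period, the one-hour phase grid, and the use of a single cosine waveform are all assumptions made for illustration, and the ANOVA prescreening, autocorrelation, and Fourier steps are omitted.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def best_cosine_correlation(values, times, period=24.0):
    """Highest correlation between the series and a cosine wave over a grid of phases."""
    best = -1.0
    for step in range(24):  # hypothetical grid: 24 evenly spaced phases
        phase = step * period / 24
        wave = [math.cos(2 * math.pi * (t - phase) / period) for t in times]
        best = max(best, pearson(values, wave))
    return best


def permutation_p_value(values, times, n_perm=1000, seed=0):
    """Share of time-shuffled series that correlate at least as well as the observed one."""
    rng = random.Random(seed)
    observed = best_cosine_correlation(values, times)
    shuffled = list(values)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if best_cosine_correlation(shuffled, times) >= observed:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_perm + 1)
```

Shuffling the time labels destroys any rhythm while keeping the distribution of expression values, so the resulting p-value depends only on the ordering of the measurements.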
A comprehensive approach to identify reliable reference gene candidates to investigate the link between alcoholism and endocrinology in Sprague-Dawley rats.

Directory of Open Access Journals (Sweden)

Faten A Taki

Gender and hormonal differences are often correlated with alcohol dependence and related complications like addiction and breast cancer. Estrogen (E2) is an important sex hormone because it serves as a key protein involved in organism level signaling pathways. Alcoholism has been reported to affect estrogen receptor signaling; however, identifying the players involved in such a multi-faceted syndrome is complex and requires an interdisciplinary approach. In many situations, preliminary investigations include straightforward yet informative biotechniques such as gene expression analyses using quantitative real time PCR (qRT-PCR). The validity of qRT-PCR-based conclusions is affected by the choice of reliable internal controls. With this in mind, we compiled a list of 15 commonly used housekeeping genes (HKGs) as potential reference gene candidates in rat biological models. A comprehensive comparison among 5 statistical approaches (geNorm, dCt method, NormFinder, BestKeeper, and RefFinder) was performed to identify the minimal number as well as the most stable reference genes required for reliable normalization in experimental rat groups that comprised sham operated (SO) rats and ovariectomized rats in the absence (OVX) or presence of E2 (OVXE2). These rat groups were subdivided into subgroups that received alcohol in liquid diet or isocaloric control liquid diet for 12 weeks. Our results showed that U87, 5S rRNA, GAPDH, and U5a were the most reliable gene candidates for reference genes in heart and brain tissue. However, different gene stability ranking was specific for each tissue input combination. The present preliminary findings highlight the variability in reference gene rankings across different experimental conditions and analytic methods and constitute a fundamental step for gene expression assays.
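To make the idea of ranking reference genes by stability concrete, here is a minimal sketch of a geNorm-style stability measure M as it is commonly described: for each candidate gene, the average over all other candidates of the standard deviation of their log2 expression ratios across samples. The function name and input layout are assumptions for illustration, and the other four approaches named in the abstract are not covered.

```python
import math
from statistics import stdev


def genorm_m(expression):
    """Return a geNorm-style M value per gene; lower M means more stable.

    expression: dict mapping gene name to a list of positive relative
    quantities, one per sample, in the same sample order for every gene
    (hypothetical input layout).
    """
    genes = list(expression)
    m_values = {}
    for gene in genes:
        pairwise_sd = []
        for other in genes:
            if other == gene:
                continue
            # Variation of the expression ratio between the two genes across samples.
            log_ratios = [
                math.log2(a / b)
                for a, b in zip(expression[gene], expression[other])
            ]
            pairwise_sd.append(stdev(log_ratios))
        m_values[gene] = sum(pairwise_sd) / len(pairwise_sd)
    return m_values
```

Two genes whose ratio stays constant across samples contribute zero to each other's M, which is why co-regulated candidates can look deceptively stable under this measure.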
Evolutionary inference across eukaryotes identifies specific pressures favoring mitochondrial gene retention

OpenAIRE

Williams, Ben; Johnston, Iain

2016-01-01

Since their endosymbiotic origin, mitochondria have lost most of their genes. Although many selective mechanisms underlying the evolution of mitochondrial genomes have been proposed, a data-driven exploration of these hypotheses is lacking, and a quantitatively supported consensus remains absent. We developed HyperTraPS, a methodology coupling stochastic modelling with Bayesian inference, to identify the ordering of evolutionary events and suggest their causes. Using 2015 complete mitochondri...
Identification of functionally related genes using data mining and data integration: a breast cancer case study

Directory of Open Access Journals (Sweden)

Zucchi Ileana

2009-10-01

Background: The identification of the organisation and dynamics of molecular pathways is crucial for the understanding of cell function. In order to reconstruct the molecular pathways in which a gene of interest is involved in regulating a cell, it is important to identify the set of genes with which it interacts to determine cell function. In this context, the mining and the integration of a large amount of publicly available data, regarding the transcriptome and the proteome states of a cell, are a useful resource to complement biological research. Results: We describe an approach for the identification of genes that interact with each other to regulate cell function. The strategy relies on the analysis of gene expression profile similarity, considering large datasets of expression data. During the similarity evaluation, the methodology determines the most significant subset of samples in which the evaluated genes are highly correlated. Hence, the strategy enables the exclusion of samples that are not relevant for each gene pair analysed. This feature is important when considering a large set of samples characterised by heterogeneous experimental conditions where different pools of biological processes can be active across the samples. The putative partners of the studied gene are then further characterised, analysing the distribution of the Gene Ontology terms and integrating the protein-protein interaction (PPI) data. The strategy was applied for the analysis of the functional relationships of a gene of known function, Pyruvate Kinase, and for the prediction of functional partners of the human transcription factor TBX3. In both cases the analysis was done on a dataset composed of breast primary tumour expression data derived from the literature. Integration and analysis of PPI data confirmed the prediction of the methodology, since the genes identified to be functionally related were associated with proteins close in the PPI network
Novel statistical framework to identify differentially expressed genes allowing transcriptomic background differences.

Science.gov (United States)

Ling, Zhi-Qiang; Wang, Yi; Mukaisho, Kenichi; Hattori, Takanori; Tatsuta, Takeshi; Ge, Ming-Hua; Jin, Li; Mao, Wei-Min; Sugihara, Hiroyuki

2010-06-01

Tests of differentially expressed genes (DEGs) from microarray experiments are based on the null hypothesis that genes that are irrelevant to the phenotype/stimulus are expressed equally in the target and control samples. However, this strict hypothesis is not always true, as there can be several transcriptomic background differences between target and control samples, including different cell/tissue types, different cell cycle stages and different biological donors. These differences lead to increased false positives, which have little biological/medical significance. In this article, we propose a statistical framework to identify DEGs between target and control samples from expression microarray data allowing transcriptomic background differences between these samples by introducing a modified null hypothesis that the gene expression background difference is normally distributed. We use an iterative procedure to perform robust estimation of the null hypothesis and identify DEGs as outliers. We evaluated our method using our own triplicate microarray experiment, followed by validations with reverse transcription-polymerase chain reaction (RT-PCR) and on the MicroArray Quality Control dataset. The evaluations suggest that our technique (i) results in fewer false positive and false negative results, as measured by the degree of agreement with RT-PCR of the same samples, (ii) can be applied to different microarray platforms and results in better reproducibility as measured by the degree of DEG identification concordance both intra- and inter-platforms and (iii) can be applied efficiently with only a few microarray replicates. Based on these evaluations, we propose that this method not only identifies more reliable and biologically/medically significant DEGs, but also reduces the power-cost tradeoff problem in the microarray field. Source code and binaries are freely available for download at http://comonca.org.cn/fdca/resources/softwares/deg.zip.
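
The idea of a normally distributed background difference with DEGs detected as outliers can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `find_deg_outliers`, the z-score cutoff of 3, and the use of plain mean and standard deviation as the estimator are all assumptions made for illustration. The sketch repeatedly fits a normal null to the genes not yet flagged, then re-flags every gene whose log-ratio lies too far from that null, until the flagged set stops changing.

```python
import statistics


def find_deg_outliers(log_ratios, z_cutoff=3.0, max_iter=50):
    """Iteratively fit a normal null to log-ratios and flag outliers.

    log_ratios: dict mapping gene name -> log2(target / control).
    z_cutoff and max_iter are illustrative defaults, not published values.
    Returns (set of flagged genes, null mean, null standard deviation).
    """
    flagged = set()
    mu = sigma = 0.0
    for _ in range(max_iter):
        # Estimate the null only from genes currently treated as background.
        background = [v for g, v in log_ratios.items() if g not in flagged]
        mu = statistics.mean(background)
        sigma = statistics.stdev(background)
        # Re-test every gene against the updated null.
        new_flagged = {
            g for g, v in log_ratios.items() if abs(v - mu) > z_cutoff * sigma
        }
        if new_flagged == flagged:
            break  # the flagged set is stable, so the estimate has converged
        flagged = new_flagged
    return flagged, mu, sigma
```

Because the null mean is estimated rather than fixed at zero, a uniform shift between target and control samples is absorbed into the background instead of being reported as differential expression.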
Transcriptome Analysis of Syringa oblata Lindl. Inflorescence Identifies Genes Associated with Pigment Biosynthesis and Scent Metabolism.

Directory of Open Access Journals (Sweden)

Jian Zheng

Full Text Available Syringa oblata Lindl. is a woody ornamental plant with high economic value and characteristics that include early flowering, multiple flower colors, and strong fragrance. Despite a long history of cultivation, the genetics and molecular biology of S. oblata are poorly understood. Transcriptome and expression profiling data are needed to identify genes and to better understand the biological mechanisms of floral pigments and scents in this species. Nine cDNA libraries were obtained from three replicates of three developmental stages: inflorescence with enlarged flower buds not protruded, inflorescence with corolla lobes not displayed, and inflorescence with flowers fully opened and emitting strong fragrance. Using the Illumina RNA-Seq technique, 319,425,972 clean reads were obtained and were assembled into 104,691 final unigenes (average length of 853 bp), 41.75% of which were annotated in the NCBI non-redundant protein database. Among the annotated unigenes, 36,967 were assigned to gene ontology categories and 19,956 were assigned to eukaryotic orthologous groups. Using the Kyoto Encyclopedia of Genes and Genomes pathway database, 12,388 unigenes were sorted into 286 pathways. Based on these transcriptomic data, we obtained a large number of candidate genes that were differentially expressed at different flower stages and that were related to floral pigment biosynthesis and fragrance metabolism. This comprehensive transcriptomic analysis provides fundamental information on the genes and pathways involved in flower secondary metabolism and development in S. oblata, providing a useful database for further research on S. oblata and other plants of genus Syringa.
Gene expression profiling reveals candidate genes related to residual feed intake in duodenum of laying ducks.

Science.gov (United States)

Zeng, T; Huang, L; Ren, J; Chen, L; Tian, Y; Huang, Y; Zhang, H; Du, J; Lu, L

2017-12-01

Feed represents two-thirds of the total costs of poultry production, especially in developing countries. Improvement in feed efficiency would reduce the amount of feed required for production (growth or laying), the production cost, and the amount of nitrogenous waste. The most commonly used measures for feed efficiency are feed conversion ratio (FCR) and residual feed intake (RFI). As a more suitable indicator assessing feed efficiency, RFI is defined as the difference between observed and expected feed intake based on maintenance and growth or laying. However, the genetic and biological mechanisms regulating RFI are largely unknown. Identifying molecular mechanisms explaining divergence in RFI in laying ducks would lead to the development of early detection methods for the selection of more efficient breeding poultry. The objective of this study was to identify duodenum genes and pathways through transcriptional profiling in 2 extreme RFI phenotypes (HRFI and LRFI) of the duck population. Phenotypic aspects of feed efficiency showed that RFI was strongly positively correlated with FCR and feed intake (FI). Transcriptomic analysis identified 35 differentially expressed genes between LRFI and HRFI ducks. These genes play an important role in metabolism, digestibility, secretion, and innate immunity including (), (), (), β (), and (). These results improve our knowledge of the biological basis underlying RFI, which would be useful for further investigations of key candidate genes for RFI and for the development of biomarkers.
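
The definition of RFI as observed minus expected feed intake can be made concrete with a small sketch. The abstract does not give the regression model used, so this example assumes a single hypothetical predictor (for instance egg mass) and an ordinary least-squares line; models used in practice typically include several terms for maintenance and production. The function name `residual_feed_intake` is likewise an illustrative choice.

```python
def residual_feed_intake(feed_intake, predictor):
    """Return RFI per animal: observed feed intake minus the intake
    predicted by a least-squares line fitted on one predictor.

    feed_intake and predictor are equal-length lists of numbers.
    The single-predictor model is an assumption made for illustration.
    """
    n = len(feed_intake)
    mean_x = sum(predictor) / n
    mean_y = sum(feed_intake) / n
    sxx = sum((x - mean_x) ** 2 for x in predictor)
    sxy = sum(
        (x - mean_x) * (y - mean_y) for x, y in zip(predictor, feed_intake)
    )
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # Positive RFI: the animal ate more than expected (less efficient).
    # Negative RFI: the animal ate less than expected (more efficient).
    return [y - (intercept + slope * x) for x, y in zip(predictor, feed_intake)]
```

Under this definition, animals with the highest residuals would fall in the HRFI group and those with the lowest in the LRFI group.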

Yeast functional screen to identify genes conferring salt stress tolerance in Salicornia europaea.

Science.gov (United States)

Nakahara, Yoshiki; Sawabe, Shogo; Kainuma, Kenta; Katsuhara, Maki; Shibasaka, Mineo; Suzuki, Masanori; Yamamoto, Kosuke; Oguri, Suguru; Sakamoto, Hikaru

2015-01-01

Salinity is a critical environmental factor that adversely affects crop productivity. Halophytes have evolved various mechanisms to adapt to saline environments. Salicornia europaea L. is one of the most salt-tolerant plant species. It does not have special salt-secreting structures like a salt gland or salt bladder, and is therefore a good model for studying the common mechanisms underlying plant salt tolerance. To identify candidate genes encoding key proteins in the mediation of salt tolerance in S. europaea, we performed a functional screen of a cDNA library in yeast. The library was screened for genes that allowed the yeast to grow in the presence of 1.3 M NaCl. We obtained three full-length S. europaea genes that confer salt tolerance. The genes are predicted to encode (1) a novel protein highly homologous to thaumatin-like proteins, (2) a novel coiled-coil protein of unknown function, and (3) a novel short peptide of 32 residues. Exogenous application of a synthetic peptide corresponding to the 32 residues improved salt tolerance of Arabidopsis. The approach described in this report provides a rapid assay system for large-scale screening of S. europaea genes involved in salt stress tolerance and supports the identification of genes responsible for such mechanisms. These genes may be useful candidates for improving crop salt tolerance by genetic transformation.
Yeast functional screen to identify genes conferring salt stress tolerance in Salicornia europaea

Directory of Open Access Journals (Sweden)

Yoshiki Nakahara

2015-10-01

Full Text Available Salinity is a critical environmental factor that adversely affects crop productivity. Halophytes have evolved various mechanisms to adapt to saline environments. Salicornia europaea L. is one of the most salt-tolerant plant species. It does not have special salt-secreting structures like a salt gland or salt bladder, and is therefore a good model for studying the common mechanisms underlying plant salt tolerance. To identify candidate genes encoding key proteins in the mediation of salt tolerance in S. europaea, we performed a functional screen of a cDNA library in yeast. The library was screened for genes that allowed the yeast to grow in the presence of 1.3 M NaCl. We obtained three full-length S. europaea genes that confer salt tolerance. The genes are predicted to encode (1) a novel protein highly homologous to thaumatin-like proteins, (2) a novel coiled-coil protein of unknown function, and (3) a novel short peptide of 32 residues. Exogenous application of a synthetic peptide corresponding to the 32 residues improved salt tolerance of Arabidopsis. The approach described in this report provides a rapid assay system for large-scale screening of S. europaea genes involved in salt stress tolerance and supports the identification of genes responsible for such mechanisms. These genes may be useful candidates for improving crop salt tolerance by genetic transformation.
Identifying knowledge gaps for gene drive research to control invasive animal species: The next CRISPR step

Directory of Open Access Journals (Sweden)

Dorian Moro

2018-01-01

Full Text Available Invasive animals have been linked to the extinctions of native wildlife, and to significant agricultural financial losses or impacts. Current approaches to control invasive species require ongoing resources and management over large geographic scales, and often result in the short-term suppression of populations. New and innovative approaches are warranted. Recently, the RNA-guided gene drive system based on CRISPR/Cas9 has been proposed as a potential gene editing tool that could be used by wildlife managers as a non-lethal addition or alternative to help reduce pest animal populations. While regulatory control and social acceptance are crucial issues that must be addressed, there is an opportunity now to identify the knowledge and research gaps that exist for some important invasive species. Here we systematically determine the knowledge gaps for pest species for which gene drives could potentially be applied. We apply a conceptual ecological risk framework within the gene drive context within an Australian environment to identify key requirements for undertaking work on seven exemplar invasive species in Australia. This framework allows an evaluation of the potential research on an invasive species of interest and within a gene drive and risk context. We consider the currently available biological, genetic and ecological information for the house mouse, European red fox, feral cat, European rabbit, cane toad, black rat and European starling to evaluate knowledge gaps and identify candidate species for future research. We discuss these findings in the context of future thematic areas of research worth pursuing in preparation for a more formal assessment of the use of gene drives as a novel strategy for the control of these and other invasive species. Keywords: Invasive species, Gene drive, CRISPR, Pest management, Islands
Codon 201Gly Polymorphic Type of the DCC Gene is Related to Disseminated Neuroblastoma

Directory of Open Access Journals (Sweden)

Xiao-Tang Kong

2001-01-01

Full Text Available The deleted in colorectal carcinoma (DCC) gene is a potential tumor-suppressor gene on chromosome 18q21.3. The relatively high frequency of loss of heterozygosity (LOH) and loss of expression of this gene in neuroblastoma, especially in the advanced stages, imply the possibility of involvement of the DCC gene in progression of neuroblastoma. However, only few typical mutations have been identified in this gene, indicating that other possible mechanisms for the inactivation of this gene may exist. A polymorphic change (Arg to Gly) at DCC codon 201 is related to advanced colorectal carcinoma and increases in the tumors with absent DCC protein expression. In order to understand whether this change is associated with the development or progression of neuroblastoma, we investigated codon 201 polymorphism of the DCC gene in 102 primary neuroblastomas by polymerase chain reaction single-strand conformation polymorphism. We found no missense or nonsense mutations, but a polymorphic change from CGA (Arg) to GGA (Gly) at codon 201 resulting in three types of polymorphism: codon 201Gly type, codon 201Arg/Gly type, and codon 201Arg type. The codon 201Gly type occurred more frequently in disseminated (stages IV and IVs) neuroblastomas (72%) than in localized (stages I, II, and III) tumors (48%) (P=.035), and normal controls (38%) (P=.024). In addition, the codon 201Gly type was significantly more common in tumors found clinically (65%) than in those found by mass screening (35%) (P=.002). The results suggested that the codon 201Gly type of the DCC gene might be associated with a higher risk of disseminating neuroblastoma.
ZCURVE 3.0: identify prokaryotic genes with higher accuracy as well as automatically and accurately select essential genes

Science.gov (United States)

Hua, Zhi-Gang; Lin, Yan; Yuan, Ya-Zhou; Yang, De-Chang; Wei, Wen; Guo, Feng-Biao

2015-01-01

In 2003, we developed an ab initio program, ZCURVE 1.0, to find genes in bacterial and archaeal genomes. In this work, we present the updated version (i.e. ZCURVE 3.0). Using 422 prokaryotic genomes, the average accuracy was 93.7% with the updated version, compared with 88.7% with the original version. Such results also demonstrate that ZCURVE 3.0 is comparable with Glimmer 3.02 and may provide complementary predictions to it. In fact, the joint application of the two programs generated better results by correctly finding more annotated genes while also containing fewer false-positive predictions. As the exclusive function, ZCURVE 3.0 contains one post-processing program that can identify essential genes with high accuracy (generally >90%). We hope ZCURVE 3.0 will receive wide use with the web-based running mode. The updated ZCURVE can be freely accessed from http://cefg.uestc.edu.cn/zcurve/ or http://tubic.tju.edu.cn/zcurveb/ without any restrictions. PMID:25977299
Gene interactions in the DNA damage-response pathway identified by genome-wide RNA-interference analysis of synthetic lethality

NARCIS (Netherlands)

van Haaften, Gijs; Vastenhouw, Nadine L; Nollen, Ellen A A; Plasterk, Ronald H A; Tijsterman, Marcel

2004-01-01

Here, we describe a systematic search for synthetic gene interactions in a multicellular organism, the nematode Caenorhabditis elegans. We established a high-throughput method to determine synthetic gene interactions by genome-wide RNA interference and identified genes that are required to protect
Methods for simultaneously identifying coherent local clusters with smooth global patterns in gene expression profiles

Directory of Open Access Journals (Sweden)

Lee Yun-Shien

2008-03-01

Full Text Available Abstract Background The hierarchical clustering tree (HCT) with a dendrogram [1] and the singular value decomposition (SVD) with a dimension-reduced representative map [2] are popular methods for two-way sorting the gene-by-array matrix map employed in gene expression profiling. While HCT dendrograms tend to optimize local coherent clustering patterns, SVD leading eigenvectors usually identify better global grouping and transitional structures. Results This study proposes a flipping mechanism for a conventional agglomerative HCT using a rank-two ellipse (R2E), an improved SVD algorithm for sorting purpose seriation by Chen [3], as an external reference. While HCTs always produce permutations with good local behaviour, the rank-two ellipse seriation gives the best global grouping patterns and smooth transitional trends. The resulting algorithm automatically integrates the desirable properties of each method so that users have access to a clustering and visualization environment for gene expression profiles that preserves coherent local clusters and identifies global grouping trends. Conclusion We demonstrate, through four examples, that the proposed method not only possesses better numerical and statistical properties, it also provides more meaningful biomedical insights than other sorting algorithms. We suggest that sorted proximity matrices for genes and arrays, in addition to the gene-by-array expression matrix, can greatly aid in the search for comprehensive understanding of gene expression structures. Software for the proposed methods can be obtained at http://gap.stat.sinica.edu.tw/Software/GAP.
Phylogenomic analysis of UDP glycosyltransferase 1 multigene family in Linum usitatissimum identified genes with varied expression patterns

Science.gov (United States)

2012-01-01

Background The glycosylation process, catalyzed by ubiquitous glycosyltransferase (GT) family enzymes, is a prevalent modification of plant secondary metabolites that regulates various functions such as hormone homeostasis, detoxification of xenobiotics and biosynthesis and storage of secondary metabolites. Flax (Linum usitatissimum L.) is a commercially grown oilseed crop, important because of its essential fatty acids and health promoting lignans. Identification and characterization of UDP glycosyltransferase (UGT) genes from flax could provide valuable basic information about this important gene family and help to explain the seed specific glycosylated metabolite accumulation and other processes in plants. Plant genome sequencing projects are useful to discover complexity within this gene family and also pave way for the development of functional genomics approaches. Results Taking advantage of the newly assembled draft genome sequence of flax, we identified 137 UDP glycosyltransferase (UGT) genes from flax using a conserved signature motif. Phylogenetic analysis of these protein sequences clustered them into 14 major groups (A-N). Expression patterns of these genes were investigated using publicly available expressed sequence tag (EST), microarray data and reverse transcription quantitative real time PCR (RT-qPCR). Seventy-three per cent of these genes (100 out of 137) showed expression evidence in 15 tissues examined and indicated varied expression profiles. The RT-qPCR results of 10 selected genes were also coherent with the digital expression analysis. Interestingly, five duplicated UGT genes were identified, which showed differential expression in various tissues. Of the seven intron loss/gain positions detected, two intron positions were conserved among most of the UGTs, although a clear relationship about the evolution of these genes could not be established. Comparison of the flax UGTs with orthologs from four other sequenced dicot genomes indicated that
Phylogenomic analysis of UDP glycosyltransferase 1 multigene family in Linum usitatissimum identified genes with varied expression patterns

Directory of Open Access Journals (Sweden)

Barvkar Vitthal T

2012-05-01

Full Text Available Abstract Background The glycosylation process, catalyzed by ubiquitous glycosyltransferase (GT) family enzymes, is a prevalent modification of plant secondary metabolites that regulates various functions such as hormone homeostasis, detoxification of xenobiotics and biosynthesis and storage of secondary metabolites. Flax (Linum usitatissimum L.) is a commercially grown oilseed crop, important because of its essential fatty acids and health promoting lignans. Identification and characterization of UDP glycosyltransferase (UGT) genes from flax could provide valuable basic information about this important gene family and help to explain the seed specific glycosylated metabolite accumulation and other processes in plants. Plant genome sequencing projects are useful to discover complexity within this gene family and also pave way for the development of functional genomics approaches. Results Taking advantage of the newly assembled draft genome sequence of flax, we identified 137 UDP glycosyltransferase (UGT) genes from flax using a conserved signature motif. Phylogenetic analysis of these protein sequences clustered them into 14 major groups (A-N). Expression patterns of these genes were investigated using publicly available expressed sequence tag (EST), microarray data and reverse transcription quantitative real time PCR (RT-qPCR). Seventy-three per cent of these genes (100 out of 137) showed expression evidence in 15 tissues examined and indicated varied expression profiles. The RT-qPCR results of 10 selected genes were also coherent with the digital expression analysis. Interestingly, five duplicated UGT genes were identified, which showed differential expression in various tissues. Of the seven intron loss/gain positions detected, two intron positions were conserved among most of the UGTs, although a clear relationship about the evolution of these genes could not be established. Comparison of the flax UGTs with orthologs from four other sequenced dicot
RNA-Seq analysis of Citrus reticulata in the early stages of Xylella fastidiosa infection reveals auxin-related genes as a defense response.

Science.gov (United States)

Rodrigues, Carolina M; de Souza, Alessandra A; Takita, Marco A; Kishi, Luciano T; Machado, Marcos A

2013-10-03

Citrus variegated chlorosis (CVC), caused by Xylella fastidiosa, is one of the most important citrus diseases, and affects all varieties of sweet orange (Citrus sinensis L. Osb). On the other hand, among the Citrus genus there are different sources of resistance against X. fastidiosa. For these species, identifying these defense genes could be an important step towards obtaining sweet orange resistant varieties through breeding or genetic engineering. To assess these genes we made use of mandarin (C. reticulata Blanco) that is known to be resistant to CVC and shares agronomical characteristics with sweet orange. Thus, we investigated the gene expression in Ponkan mandarin at one day after infection with X. fastidiosa, using RNA-seq. A set of genes considered key elements in the resistance was used to confirm its regulation in mandarin compared with the susceptible sweet orange. Gene expression analysis of mock inoculated and infected tissues of Ponkan mandarin identified 667 transcripts repressed and 724 significantly induced in the latter. Among the induced transcripts, we identified genes encoding proteins similar to Pattern Recognition Receptors. Furthermore, many genes involved in secondary metabolism, biosynthesis and cell wall modification were upregulated as well as in synthesis of abscisic acid, jasmonic acid and auxin. This work demonstrated that the defense response to the perception of bacteria involves cell wall modification and activation of hormone pathways, which probably lead to the induction of other defense-related genes. We also hypothesized the induction of auxin-related genes indicates that resistant plants initially recognize X. fastidiosa as a necrotrophic pathogen.
A CRISPR-Based Screen Identifies Genes Essential for West-Nile-Virus-Induced Cell Death.

Science.gov (United States)

Ma, Hongming; Dang, Ying; Wu, Yonggan; Jia, Gengxiang; Anaya, Edgar; Zhang, Junli; Abraham, Sojan; Choi, Jang-Gi; Shi, Guojun; Qi, Ling; Manjunath, N; Wu, Haoquan

2015-07-28

West Nile virus (WNV) causes an acute neurological infection attended by massive neuronal cell death. However, the mechanism(s) behind the virus-induced cell death is poorly understood. Using a library containing 77,406 sgRNAs targeting 20,121 genes, we performed a genome-wide screen followed by a second screen with a sub-library. Among the genes identified, seven genes, EMC2, EMC3, SEL1L, DERL2, UBE2G2, UBE2J1, and HRD1, stood out as having the strongest phenotype, whose knockout conferred strong protection against WNV-induced cell death with two different WNV strains and in three cell lines. Interestingly, knockout of these genes did not block WNV replication. Thus, these appear to be essential genes that link WNV replication to downstream cell death pathway(s). In addition, the fact that all of these genes belong to the ER-associated protein degradation (ERAD) pathway suggests that this might be the primary driver of WNV-induced cell death. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
A novel APOC2 gene mutation identified in a Chinese patient with severe hypertriglyceridemia and recurrent pancreatitis.

Science.gov (United States)

Jiang, Jingjing; Wang, Yuhui; Ling, Yan; Kayoumu, Abudurexiti; Liu, George; Gao, Xin

2016-01-16

The severe forms of hypertriglyceridemia are usually caused by genetic defects. In this study, we described a Chinese female with severe hypertriglyceridemia caused by a novel homozygous mutation in the APOC2 gene. Lipid profiles of the pedigree were studied in detail. LPL and HL activity were also measured. The coding regions of 5 candidate genes (namely LPL, APOC2, APOA5, LMF1, and GPIHBP1) were sequenced using genomic DNA from peripheral leucocytes. The ApoE gene was also genotyped. Serum triglyceride level was extremely high in the proband, compared with other family members. Plasma LPL activity was also significantly reduced in the proband. Serum ApoCII was very low in the proband as well as in the heterozygous mutation carriers. A novel mutation (c.86A > CC) was identified on exon 3 [corrected] of the APOC2 gene, which converted the Asp [corrected] codon at position 29 into Ala, followed by a termination codon (TGA). This study presented the first case of ApoCII deficiency in the Chinese population, with a novel mutation c.86A > CC in the APOC2 gene identified. Serum ApoCII protein might be a useful screening test for identifying mutation carriers.
Gene Transfers Between Distantly Related Organisms

Science.gov (United States)

Doolittle, Russell F.

2003-01-01

With the completion of numerous microbial genome sequences, reports of individual gene transfers between distantly related prokaryotes have become commonplace. On the other hand, transfers between prokaryotes and eukaryotes still excite the imagination. Many of these claims may be premature, but some are certainly valid. This chapter considers the kinds of supporting data needed to propose transfers between distantly related organisms and cites some interesting examples.
Gene expression changes for antioxidants pathways in the mouse cochlea: relations to age-related hearing deficits.

Directory of Open Access Journals (Sweden)

Sherif F Tadros

Full Text Available Age-related hearing loss - presbycusis - is the number one neurodegenerative disorder and top communication deficit of our aged population. Like many aging disorders of the nervous system, damage from free radicals linked to production of reactive oxygen and/or nitrogen species (ROS and RNS, respectively) may play key roles in disease progression. The efficacy of the antioxidant systems, e.g., glutathione and thioredoxin, is an important factor in pathophysiology of the aging nervous system. In this investigation, relations between the expression of antioxidant-related genes in the auditory portion of the inner ear - cochlea, and age-related hearing loss were explored for CBA/CaJ mice. Forty mice were classified into four groups according to age and degree of hearing loss. Cochlear mRNA samples were collected and cDNA generated. Using Affymetrix® GeneChip, the expressions of 56 antioxidant-related gene probes were analyzed to estimate the differences in gene expression between the four subject groups. The expression levels of Glutathione peroxidase 6, Gpx6; Thioredoxin reductase 1, Txnrd1; Isocitrate dehydrogenase 1, Idh1; and Heat shock protein 1, Hspb1, were significantly different, or showed large fold-change differences between subject groups. The Gpx6, Txnrd1 and Hspb1 gene expression changes were validated using qPCR. The Gpx6 gene was upregulated while the Txnrd1 gene was downregulated with age/hearing loss. The Hspb1 gene was found to be downregulated in middle-aged animals as well as those with mild presbycusis, whereas it was upregulated in those with severe presbycusis. These results facilitate development of future interventions to predict, prevent or slow down the progression of presbycusis.
A data mining approach for classifying DNA repair genes into ageing-related or non-ageing-related

Directory of Open Access Journals (Sweden)

Vasieva Olga

2011-01-01

Full Text Available Abstract Background The ageing of the worldwide population means there is a growing need for research on the biology of ageing. DNA damage is likely a key contributor to the ageing process and elucidating the role of different DNA repair systems in ageing is of great interest. In this paper we propose a data mining approach, based on classification methods (decision trees and Naive Bayes), for analysing data about human DNA repair genes. The goal is to build classification models that allow us to discriminate between ageing-related and non-ageing-related DNA repair genes, in order to better understand their different properties. Results The main patterns discovered by the classification methods are as follows: (a) the number of protein-protein interactions was a predictor of DNA repair proteins being ageing-related; (b) the use of predictor attributes based on protein-protein interactions considerably increased predictive accuracy of attributes based on Gene Ontology (GO) annotations; (c) GO terms related to "response to stimulus" seem reasonably good predictors of ageing-relatedness for DNA repair genes; (d) interaction with the XRCC5 (Ku80) protein is a strong predictor of ageing-relatedness for DNA repair genes; and (e) DNA repair genes with a high expression in T lymphocytes are more likely to be ageing-related. Conclusions The above patterns are broadly integrated in an analysis discussing relations between Ku, the non-homologous end joining DNA repair pathway, ageing and lymphocyte development. These patterns and their analysis support non-homologous end joining double strand break repair as central to the ageing-relatedness of DNA repair genes. Our work also showcases the use of protein interaction partners to improve accuracy in data mining methods and our approach could be applied to other ageing-related pathways.
Linking the Salt Transcriptome with Physiological Responses of a Salt-Resistant Populus Species as a Strategy to Identify Genes Important for Stress Acclimation

Science.gov (United States)

Brinker, Monika; Brosché, Mikael; Vinocur, Basia; Abo-Ogiala, Atef; Fayyaz, Payam; Janz, Dennis; Ottow, Eric A.; Cullmann, Andreas D.; Saborowski, Joachim; Kangasjärvi, Jaakko; Altman, Arie; Polle, Andrea

2010-01-01

To investigate early salt acclimation mechanisms in a salt-tolerant poplar species (Populus euphratica), the kinetics of molecular, metabolic, and physiological changes during a 24-h salt exposure were measured. Three distinct phases of salt stress were identified by analyses of the osmotic pressure and the shoot water potential: dehydration, salt accumulation, and osmotic restoration associated with ionic stress. The duration and intensity of these phases differed between leaves and roots. Transcriptome analysis using P. euphratica-specific microarrays revealed clusters of coexpressed genes in these phases, with only 3% overlapping salt-responsive genes in leaves and roots. Acclimation of cellular metabolism to high salt concentrations involved remodeling of amino acid and protein biosynthesis and increased expression of molecular chaperones (dehydrins, osmotin). Leaves suffered initially from dehydration, which resulted in changes in transcript levels of mitochondrial and photosynthetic genes, indicating adjustment of energy metabolism. Initially, decreases in stress-related genes were found, whereas increases occurred only when leaves had restored the osmotic balance by salt accumulation. Comparative in silico analysis of the poplar stress regulon with Arabidopsis (Arabidopsis thaliana) orthologs was used as a strategy to reduce the number of candidate genes for functional analysis. Analysis of Arabidopsis knockout lines identified a lipocalin-like gene (AtTIL) and a gene encoding a protein with previously unknown functions (AtSIS) to play roles in salt tolerance. In conclusion, by dissecting the stress transcriptome of tolerant species, novel genes important for salt endurance can be identified. PMID:20959419
Expression of ISGylation-related genes in regenerating rat liver

Directory of Open Access Journals (Sweden)

Kuklin A. V.

2015-10-01

Our recent studies have revealed early up-regulated expression of interferon alpha (IFNα) in the liver, induced by partial hepatectomy. The role of this cytokine of the innate immune response in liver regeneration is still controversial. Aim. To analyze expression of the canonical interferon-stimulated genes Ube1l, Ube2l6, Trim25, Usp18 and Isg15 during the liver transition from quiescence to proliferation induced by partial hepatectomy, and during the acute phase response induced by laparotomy. These genes are responsible for posttranslational modification of proteins by ISGylation. The expression of genes encoding TATA binding protein (TBP) and 18S rRNA served as indirect general markers of transcriptional and translational activities. Methods. The abundance of the investigated RNAs was assessed in total liver RNA by real-time RT-qPCR. Results. Partial hepatectomy induced steady upregulation of Tbp and 18S rRNA gene expression during the 12 hours post-surgery, and downregulation or no change in expression of ISGylation-related genes during the first 3 hours followed by slight upregulation at 12 hours. The level of Isg15 transcripts was permanently below that of the control during the prereplicative period. Laparotomy induced a continuous downregulation of Tbp and 18S rRNA expression and early (1–3 h) upregulation of ISGylation-related transcripts followed by a sharp drop at 6 hours and slight increase/decrease at 12 hours. The changes in the abundance of Ifnα and ISGylation-related mRNAs were oppositely directed at each stage of the response to partial hepatectomy and laparotomy. Conclusion. We suggest that the expression of ISGylation-related genes does not depend on the expression of the Ifnα gene after either surgery. The indirect indices of transcription and translation as well as the expression of ISGylation-related genes differ fundamentally between the responses to partial hepatectomy and laparotomy and argue for the high specificity of the innate immune response.
Research progress on related genes for primary open angle glaucoma

Directory of Open Access Journals (Sweden)

Ailijiang·Aierken

2014-04-01

Primary open angle glaucoma (POAG) is the main cause of blindness with visual field damage and optic nerve degeneration. In recent years, much research has shown that genetic factors and gene mutations play an important role in POAG. More than 20 genes have been related to POAG. Here we review the POAG-related genes, especially the well-known causative genes MYOC, OPTN, WDR36, and CAV1/CAV2, in terms of their locations, structures, and research progress, to provide a reference for genetic research in primary open-angle glaucoma.
Comprehensive analysis of genic male sterility-related genes in Brassica rapa using a newly developed Br300K oligomeric chip.

Directory of Open Access Journals (Sweden)

Xiangshu Dong

To identify genes associated with genic male sterility (GMS) that could be useful for hybrid breeding in Chinese cabbage (Brassica rapa ssp. pekinensis), floral bud transcriptome analysis was carried out using a B. rapa microarray with 300,000 probes (Br300K). Among 47,548 clones deposited on a Br300K microarray with seven probes of 60 nt length within the 3' 150 bp region, a total of 10,622 genes were differentially expressed between fertile and sterile floral buds; 4,774 and 5,848 genes were up-regulated over 2-fold in fertile and sterile buds, respectively. However, the expression of 1,413 and 199 genes showed fertile and sterile bud-specific features, respectively. Genes expressed specifically in fertile buds, possibly GMS-related genes, included homologs of several Arabidopsis male sterility-related genes, genes associated with the cell wall and synthesis of its surface proteins, pollen wall and coat components, signaling components, and nutrient supplies. However, most early genes for pollen development, genes for primexine and callose formation, and genes for pollen maturation and anther dehiscence showed no difference in expression between fertile and sterile buds. Some of the known genes associated with Arabidopsis pollen development showed similar expression patterns to those seen in this study, while others did not. BrbHLH89 and BrMYP99 are putative GMS genes. Additionally, 17 novel genes identified only in B. rapa were specifically and highly expressed only in fertile buds, implying possible involvement in male fertility. All data suggest that Chinese cabbage GMS might be controlled by genes acting in post-meiotic tapetal development that are different from those known to be associated with Arabidopsis male sterility.
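The 2-fold criterion used in the abstract above can be illustrated with a minimal Python sketch. The function name, the signal values and the gene labels are hypothetical, and the study's actual normalization and statistics are not described in the abstract; only the three reported counts are taken from it.

```python
def classify_by_fold_change(fertile, sterile, threshold=2.0):
    """Label a gene by the ratio of fertile-bud to sterile-bud signal.

    Assumes both signals are positive, already normalized intensities.
    """
    ratio = fertile / sterile
    if ratio > threshold:
        return "up_in_fertile"
    if ratio < 1.0 / threshold:
        return "up_in_sterile"
    return "unchanged"


# Hypothetical signal intensities (fertile, sterile), for illustration only.
signals = {"geneA": (820.0, 100.0), "geneB": (90.0, 400.0), "geneC": (150.0, 140.0)}
labels = {gene: classify_by_fold_change(f, s) for gene, (f, s) in signals.items()}

# Counts reported in the abstract: the two up-regulated sets sum to the
# total number of differentially expressed genes.
up_fertile, up_sterile, total_deg = 4774, 5848, 10622
counts_consistent = (up_fertile + up_sterile == total_deg)
```

A ratio of exactly 2 is treated as unchanged here because the abstract says "over 2-fold".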
Comparative Transcriptome Analysis Identifies Putative Genes Involved in Steroid Biosynthesis in Euphorbia tirucalli

Directory of Open Access Journals (Sweden)

Weibo Qiao

2018-01-01

Phytochemical analysis of different Euphorbia tirucalli tissues revealed a contrasting tissue-specificity for the biosynthesis of euphol and β-sitosterol, which represent the two pharmaceutically active steroids in E. tirucalli. To uncover the molecular mechanism underlying this tissue-specificity for phytochemicals, a comprehensive E. tirucalli transcriptome derived from its root, stem, leaf and latex was constructed, and a total of 91,619 unigenes were generated with 51.08% being successfully annotated against the non-redundant (Nr) protein database. A comparison of the transcriptome from different tissues discovered members of unigenes in the upstream steps of sterol backbone biosynthesis leading to this tissue-specific sterol biosynthesis. Among them, the putative oxidosqualene cyclase (OSC)-encoding genes involved in euphol synthesis were notably identified, and their expression was significantly up-regulated in the latex. In addition, genome-wide differentially expressed genes (DEGs) in the different E. tirucalli tissues were identified. The cluster analysis of those DEGs showed a unique expression pattern in the latex compared with other tissues. The DEGs identified in this study should enrich insight into sterol biosynthesis and the regulatory mechanism of this latex specificity.

Gene Expression Profiling and Association with Prion-Related Lesions in the Medulla Oblongata of Symptomatic Natural Scrapie Animals

Science.gov (United States)

Filali, Hicham; Martin-Burriel, Inmaculada; Harders, Frank; Varona, Luis; Lyahyai, Jaber; Zaragoza, Pilar; Pumarola, Martí; Badiola, Juan J.; Bossers, Alex; Bolea, Rosa

2011-01-01

The pathogenesis of natural scrapie and other prion diseases remains unclear. Examining transcriptome variations in infected versus control animals may highlight new genes potentially involved in some of the molecular mechanisms of prion-induced pathology. The aim of this work was to identify disease-associated alterations in the gene expression profiles of the caudal medulla oblongata (MO) in sheep presenting the symptomatic phase of natural scrapie. The gene expression patterns in the MO from 7 sheep that had been naturally infected with scrapie were compared with 6 controls using a Central Veterinary Institute (CVI) custom designed 4×44K microarray. The microarray consisted of a probe set on the previously sequenced ovine tissue library by CVI and was supplemented with all of the Ovis aries transcripts that are currently publicly available. Over 350 probe sets displayed greater than 2-fold changes in expression. We identified 148 genes from these probes, many of which encode proteins that are involved in the immune response, ion transport, cell adhesion, and transcription. Our results confirm previously published gene expression changes that were observed in murine models with induced scrapie. Moreover, we have identified new genes that exhibit differential expression in scrapie and could be involved in prion neuropathology. Finally, we have investigated the relationship between gene expression profiles and the appearance of the main scrapie-related lesions, including prion protein deposition, gliosis and spongiosis. In this context, the potential impacts of these gene expression changes in the MO on scrapie development are discussed. PMID:21629698
Network-Based Integration of GWAS and Gene Expression Identifies a HOX-Centric Network Associated with Serous Ovarian Cancer Risk.

Science.gov (United States)

Kar, Siddhartha P; Tyrer, Jonathan P; Li, Qiyuan; Lawrenson, Kate; Aben, Katja K H; Anton-Culver, Hoda; Antonenkova, Natalia; Chenevix-Trench, Georgia; Baker, Helen; Bandera, Elisa V; Bean, Yukie T; Beckmann, Matthias W; Berchuck, Andrew; Bisogna, Maria; Bjørge, Line; Bogdanova, Natalia; Brinton, Louise; Brooks-Wilson, Angela; Butzow, Ralf; Campbell, Ian; Carty, Karen; Chang-Claude, Jenny; Chen, Yian Ann; Chen, Zhihua; Cook, Linda S; Cramer, Daniel; Cunningham, Julie M; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas F; Edwards, Robert P; Ekici, Arif B; Fasching, Peter A; Fridley, Brooke L; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G; Glasspool, Rosalind; Goode, Ellen L; Goodman, Marc T; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A T; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus K; Hosono, Satoyo; Iversen, Edwin S; Jakubowska, Anna; Paul, James; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kjaer, Susanne K; Kelemen, Linda E; Kellar, Melissa; Kelley, Joseph; Kiemeney, Lambertus A; Krakstad, Camilla; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D; Lee, Alice W; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R; McNeish, Iain A; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B; Narod, Steven A; Nedergaard, Lotte; Ness, Roberta B; Nevanlinna, Heli; Odunsi, Kunle; Olson, Sara H; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Pearce, Celeste Leigh; Pejovic, Tanja; Pelttari, Liisa M; Permuth-Wey, Jennifer; Phelan, Catherine M; Pike, Malcolm C; Poole, Elizabeth M; Ramus, Susan J; Risch, Harvey A; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H; Rudolph, Anja; 
Runnebaum, Ingo B; Rzepecka, Iwona K; Salvesen, Helga B; Schildkraut, Joellen M; Schwaab, Ira; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C; Sucheston-Campbell, Lara E; Tangen, Ingvild L; Teo, Soo-Hwang; Terry, Kathryn L; Thompson, Pamela J; Timorek, Agnieszka; Tsai, Ya-Yu; Tworoger, Shelley S; van Altena, Anne M; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S; Wicklund, Kristine G; Wilkens, Lynne R; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Sellers, Thomas A; Monteiro, Alvaro N A; Freedman, Matthew L; Gayther, Simon A; Pharoah, Paul D P

2015-10-01

Genome-wide association studies (GWAS) have so far reported 12 loci associated with serous epithelial ovarian cancer (EOC) risk. We hypothesized that some of these loci function through nearby transcription factor (TF) genes and that putative target genes of these TFs as identified by coexpression may also be enriched for additional EOC risk associations. We selected TF genes within 1 Mb of the top signal at the 12 genome-wide significant risk loci. Mutual information, a form of correlation, was used to build networks of genes strongly coexpressed with each selected TF gene in the unified microarray dataset of 489 serous EOC tumors from The Cancer Genome Atlas. Genes represented in this dataset were subsequently ranked using a gene-level test based on results for germline SNPs from a serous EOC GWAS meta-analysis (2,196 cases/4,396 controls). Gene set enrichment analysis identified six networks centered on TF genes (HOXB2, HOXB5, HOXB6, HOXB7 at 17q21.32 and HOXD1, HOXD3 at 2q31) that were significantly enriched for genes from the risk-associated end of the ranked list (P < 0.05 and FDR < 0.05). These results were replicated (P < 0.05) using an independent association study (7,035 cases/21,693 controls). Genes underlying enrichment in the six networks were pooled into a combined network. We identified a HOX-centric network associated with serous EOC risk containing several genes with known or emerging roles in serous EOC development. Network analysis integrating large, context-specific datasets has the potential to offer mechanistic insights into cancer susceptibility and prioritize genes for experimental characterization. ©2015 American Association for Cancer Research.
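The abstract above uses mutual information to score coexpression between a transcription factor gene and other genes. As a minimal sketch only, the following Python code computes a plug-in estimate from equal-width bins. The binning scheme, the function names and the simulated profiles are assumptions made for illustration; the abstract does not state which estimator the study used. Only the sample size of 489 comes from the abstract.

```python
import math
import random
from collections import Counter


def discretize(values, bins=8):
    """Map each value to an equal-width bin index in [0, bins - 1]."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins or 1.0  # guard against a constant profile
    return [min(int((v - lo) / width), bins - 1) for v in values]


def mutual_information(x, y, bins=8):
    """Plug-in mutual information (in nats) between two expression profiles."""
    bx, by = discretize(x, bins), discretize(y, bins)
    n = len(bx)
    joint = Counter(zip(bx, by))
    cx, cy = Counter(bx), Counter(by)
    # sum over occupied cells of p(i,j) * log(p(i,j) / (p(i) * p(j)))
    return sum(
        (c / n) * math.log(c * n / (cx[i] * cy[j]))
        for (i, j), c in joint.items()
    )


# Simulated profiles across 489 samples (the tumor count in the abstract).
random.seed(0)
tf_gene = [random.gauss(0, 1) for _ in range(489)]
coexpressed = [v + 0.3 * random.gauss(0, 1) for v in tf_gene]
unrelated = [random.gauss(0, 1) for _ in range(489)]

mi_related = mutual_information(tf_gene, coexpressed)
mi_unrelated = mutual_information(tf_gene, unrelated)
```

In a network-building setting, genes whose score against a given TF gene exceeds some cutoff would be kept as that TF's neighbors; the cutoff is a further choice not given in the abstract.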
Candidate gene resequencing to identify rare, pedigree-specific variants influencing healthy aging phenotypes in the long life family study

DEFF Research Database (Denmark)

Druley, Todd E; Wang, Lihua; Lin, Shiow J

2016-01-01

BACKGROUND: The Long Life Family Study (LLFS) is an international study to identify the genetic components of various healthy aging phenotypes. We hypothesized that pedigree-specific rare variants at longevity-associated genes could have a similar functional impact on healthy phenotypes. METHODS: We performed custom hybridization capture sequencing to identify the functional variants in 464 candidate genes for longevity or the major diseases of aging in 615 pedigrees (4,953 individuals) from the LLFS, using a multiplexed, custom hybridization capture. Variants were analyzed individually... from six pedigrees. OBFC1 (chromosome 10) is involved in telomere maintenance, and falls within a linkage peak recently reported from an analysis of telomere length in LLFS families. Two different algorithms for single gene associations identified three genes with an enrichment of variation...
Microevolution of Virulence-Related Genes in Helicobacter pylori Familial Infection.

Directory of Open Access Journals (Sweden)

Yoshikazu Furuta

Helicobacter pylori, a bacterial pathogen that can infect the human stomach, causing gastritis, ulcers and cancer, is known to have a high degree of genome/epigenome diversity as the result of mutation and recombination. The bacteria often infect in childhood and persist for the life of the host. One of the reasons for the rapid evolution of H. pylori is that it changes its genome drastically for adaptation to a new host. To investigate microevolution and adaptation of the H. pylori genome, we undertook whole genome sequencing of strains of the same or very similar sequence type, as determined by multi-locus sequence typing (MLST) with seven genes, from members of the same family consisting of parents and children in Japan. Detection of nucleotide substitutions revealed likely transmission pathways involving children. Nonsynonymous (amino acid changing) mutations were found in virulence-related genes (cag genes, vacA, hcpDX, tnfα, ggt, htrA and the collagenase gene), outer membrane protein (OMP) genes and other cell surface-related protein genes, signal transduction genes and restriction-modification genes. We reconstructed various pathways by which H. pylori can adapt to a new human host, and our results raised the possibility that the mutational changes in virulence-related genes have a role in adaptation to a child host. Changes in restriction-modification genes might remodel the methylome and transcriptome to help adaptation. This study has provided insights into H. pylori transmission and virulence and has implications for basic research as well as clinical practice.
Gene profile analysis of osteoblast genes differentially regulated by histone deacetylase inhibitors

Directory of Open Access Journals (Sweden)

Lamblin Anne-Francoise

2007-10-01

Background: Osteoblast differentiation requires the coordinated stepwise expression of multiple genes. Histone deacetylase inhibitors (HDIs) accelerate the osteoblast differentiation process by blocking the activity of histone deacetylases (HDACs), which alter gene expression by modifying chromatin structure. We previously demonstrated that HDIs and HDAC3 shRNAs accelerate matrix mineralization and the expression of osteoblast maturation genes (e.g. alkaline phosphatase, osteocalcin). Identifying other genes that are differentially regulated by HDIs might identify new pathways that contribute to osteoblast differentiation. Results: To identify other osteoblast genes that are altered early by HDIs, we incubated MC3T3-E1 preosteoblasts with HDIs (trichostatin A, MS-275, or valproic acid) for 18 hours in osteogenic conditions. The promotion of osteoblast differentiation by HDIs in this experiment was confirmed by osteogenic assays. Gene expression profiles relative to vehicle-treated cells were assessed by microarray analysis with Affymetrix GeneChip 430 2.0 arrays. The regulation of several genes by HDIs in MC3T3-E1 cells and primary osteoblasts was verified by quantitative real-time PCR. Nine genes were differentially regulated by at least two-fold after exposure to each of the three HDIs and six were verified by PCR in osteoblasts. Four of the verified genes (solute carrier family 9 isoform 3 regulator 1 (Slc9a3r1), sorbitol dehydrogenase 1, a kinase anchor protein, and glutathione S-transferase alpha 4) were induced. Two genes (proteasome subunit, beta type 10 and adaptor-related protein complex AP-4 sigma 1) were suppressed. We also identified eight growth factors and growth factor receptor genes that are significantly altered by each of the HDIs, including Frizzled related proteins 1 and 4, which modulate the Wnt signaling pathway. Conclusion: This study identifies osteoblast genes that are regulated early by HDIs and indicates pathways that
Transcriptomic network analysis of micronuclei-related genes: a case study

DEFF Research Database (Denmark)

van Leeuwen, D. M.; Pedersen, Marie; Knudsen, Lisbeth E.

2011-01-01

Mechanistically relevant information on responses of humans to xenobiotic exposure in relation to chemically induced biological effects, such as micronuclei (MN) formation, can be obtained through large-scale transcriptomics studies. Network analysis may enhance the analysis and visualisation of such data. Therefore, this study aimed to develop a 'MN formation' network based on a priori knowledge, by using the pathway tool MetaCore. The gene network contained 27 genes and three gene complexes that are related to processes involved in MN formation, e.g. spindle assembly checkpoint, cell cycle checkpoint and aneuploidy. The MN-related gene network was tested against a transcriptomics case study associated with MN measurements. In this case study, transcriptomic data from children and adults differentially exposed to ambient air pollution in the Czech Republic were analysed and visualised...
Gene trapping identifies a putative tumor suppressor and a new inducer of cell migration

International Nuclear Information System (INIS)

Guardiola-Serrano, Francisca; Haendeler, Judith; Lukosz, Margarete; Sturm, Karsten; Melchner, Harald von; Altschmied, Joachim

2008-01-01

Tumor necrosis factor alpha (TNFα) is a pleiotropic cytokine involved in apoptotic cell death, cellular proliferation, differentiation, inflammation, and tumorigenesis. In tumors it is secreted by tumor-associated macrophages and can have both pro- and anti-tumorigenic effects. To identify genes regulated by TNFα, we performed a gene trap screen in the mammary carcinoma cell line MCF-7 and recovered 64 unique, TNFα-induced gene trap integration sites. Among these were the genes coding for the zinc finger protein ZC3H10 and for the transcription factor grainyhead-like 3 (GRHL3). In line with the dual effects of TNFα on tumorigenesis, we found that ZC3H10 inhibits anchorage-independent growth in soft agar, suggesting a tumor suppressor function, whereas GRHL3 strongly stimulated the migration of endothelial cells, which is consistent with an angiogenic, pro-tumorigenic function.
A genetic screen for modifiers of UFO meristem activity identifies three novel FUSED FLORAL ORGANS genes required for early flower development in Arabidopsis.

Science.gov (United States)

Levin, J Z; Fletcher, J C; Chen, X; Meyerowitz, E M

1998-06-01

In a screen to identify novel genes required for early Arabidopsis flower development, we isolated four independent mutations that enhance the Ufo phenotype toward the production of filamentous structures in place of flowers. The mutants fall into three complementation groups, which we have termed FUSED FLORAL ORGANS (FFO) loci. ffo mutants have specific defects in floral organ separation and/or positioning; thus, the FFO genes identify components of a boundary formation mechanism(s) acting between developing floral organ primordia. FFO1 and FFO3 have specific functions in cauline leaf/stem separation and in first- and third-whorl floral organ separation, with FFO3 likely acting to establish and FFO1 to maintain floral organ boundaries. FFO2 acts at early floral stages to regulate floral organ number and positioning and to control organ separation within and between whorls. Plants doubly mutant for two ffo alleles display additive phenotypes, indicating that the FFO genes may act in separate pathways. Plants doubly mutant for an ffo gene and for ufo, lfy, or clv3 reveal that the FFO genes play roles related to those of UFO and LFY in floral meristem initiation and that FFO2 and FFO3 may act to control cell proliferation late in inflorescence development.
Tissue distribution of the dystrophin-related gene product and expression in the mdx and dy mouse

Energy Technology Data Exchange (ETDEWEB)

Love, D.R.; Marsden, R.F.; Bloomfield, J.F.; Davies, K.E. (John Radcliffe Hospital, Oxford (England)); Morris, G.E.; Ellis, J.M. (North East Wales Inst., Deeside, Wales (England)); Fairbrother, U.; Edwards, Y.H. (Univ. College London (England)); Slater, C.P. (Newcastle General Hospital, Newcastle-upon-Tyne (England)); Parry, D.J. (Univ. of Ottawa, Ontario (Canada))

1991-04-15

The authors have previously reported a dystrophin-related locus (DMDL for Duchenne muscular dystrophy-like) on human chromosome 6 that maps close to the dy mutation on mouse chromosome 10. Here they show that this gene is expressed in a wide range of tissues at varying levels. The transcript is particularly abundant in several human fetal tissues, including heart, placenta, and intestine. Studies with antisera raised against a DMDL fusion protein identify a 400,000 Mr protein in all mouse tissues tested, including those of mdx and dy mice. Unlike the dystrophin gene, the DMDL gene transcript is not differentially spliced at the 3′ end in either fetal muscle or brain.
Genome-wide gene-environment study identifies glutamate receptor gene GRIN2A as a Parkinson's disease modifier gene via interaction with coffee.

Directory of Open Access Journals (Sweden)

Taye H Hamza

2011-08-01

concept that inclusion of environmental factors can help identify genes that are missed in GWAS. Both adenosine antagonists (caffeine-like) and glutamate antagonists (GRIN2A-related) are being tested in clinical trials for treatment of PD. GRIN2A may be a useful pharmacogenetic marker for subdividing individuals in clinical trials to determine which medications might work best for which patients.
Resistance-related gene transcription and antioxidant enzyme ...

African Journals Online (AJOL)

The two tobacco relatives Nicotiana alata and Nicotiana longiflora display a high level of resistance against Colletotrichum nicotianae, and transcription of the two pathogen defense-related genes NTF6 and NtPAL was higher in N. alata and N. longiflora than in the commercial cv. K326. Inoculation with C. nicotianae ...
A gene-trap strategy identifies quiescence-induced genes in ...

Indian Academy of Sciences (India)

PRAKASH KUMAR G

and Walsh 1996). The balance between proliferation and ... In three lines, insertion occurred in genes previously implicated in the control of quiescence, i.e. ...... arrest-specific traps fall into different functional classes, such as cytoskeletal ...
Pathways-driven sparse regression identifies pathways and genes associated with high-density lipoprotein cholesterol in two Asian cohorts.

Directory of Open Access Journals (Sweden)

Matt Silver

2013-11-01

Standard approaches to data analysis in genome-wide association studies (GWAS) ignore any potential functional relationships between gene variants. In contrast gene pathways analysis uses prior information on functional structure within the genome to identify pathways associated with a trait of interest. In a second step, important single nucleotide polymorphisms (SNPs) or genes may be identified within associated pathways. The pathways approach is motivated by the fact that genes do not act alone, but instead have effects that are likely to be mediated through their interaction in gene pathways. Where this is the case, pathways approaches may reveal aspects of a trait's genetic architecture that would otherwise be missed when considering SNPs in isolation. Most pathways methods begin by testing SNPs one at a time, and so fail to capitalise on the potential advantages inherent in a multi-SNP, joint modelling approach. Here, we describe a dual-level, sparse regression model for the simultaneous identification of pathways and genes associated with a quantitative trait. Our method takes account of various factors specific to the joint modelling of pathways with genome-wide data, including widespread correlation between genetic predictors, and the fact that variants may overlap multiple pathways. We use a resampling strategy that exploits finite sample variability to provide robust rankings for pathways and genes. We test our method through simulation, and use it to perform pathways-driven gene selection in a search for pathways and genes associated with variation in serum high-density lipoprotein cholesterol levels in two separate GWAS cohorts of Asian adults. By comparing results from both cohorts we identify a number of candidate pathways including those associated with cardiomyopathy, and T cell receptor and PPAR signalling. Highlighted genes include those associated with the L-type calcium channel, adenylate cyclase, integrin, laminin, MAPK
Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated with High-Density Lipoprotein Cholesterol in Two Asian Cohorts

Science.gov (United States)

Silver, Matt; Chen, Peng; Li, Ruoying; Cheng, Ching-Yu; Wong, Tien-Yin; Tai, E-Shyong; Teo, Yik-Ying; Montana, Giovanni

2013-01-01

Standard approaches to data analysis in genome-wide association studies (GWAS) ignore any potential functional relationships between gene variants. In contrast gene pathways analysis uses prior information on functional structure within the genome to identify pathways associated with a trait of interest. In a second step, important single nucleotide polymorphisms (SNPs) or genes may be identified within associated pathways. The pathways approach is motivated by the fact that genes do not act alone, but instead have effects that are likely to be mediated through their interaction in gene pathways. Where this is the case, pathways approaches may reveal aspects of a trait's genetic architecture that would otherwise be missed when considering SNPs in isolation. Most pathways methods begin by testing SNPs one at a time, and so fail to capitalise on the potential advantages inherent in a multi-SNP, joint modelling approach. Here, we describe a dual-level, sparse regression model for the simultaneous identification of pathways and genes associated with a quantitative trait. Our method takes account of various factors specific to the joint modelling of pathways with genome-wide data, including widespread correlation between genetic predictors, and the fact that variants may overlap multiple pathways. We use a resampling strategy that exploits finite sample variability to provide robust rankings for pathways and genes. We test our method through simulation, and use it to perform pathways-driven gene selection in a search for pathways and genes associated with variation in serum high-density lipoprotein cholesterol levels in two separate GWAS cohorts of Asian adults. By comparing results from both cohorts we identify a number of candidate pathways including those associated with cardiomyopathy, and T cell receptor and PPAR signalling. Highlighted genes include those associated with the L-type calcium channel, adenylate cyclase, integrin, laminin, MAPK signalling and immune
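The abstract above describes a "dual-level" sparse model that selects pathways and, within them, individual predictors. One common way to obtain sparsity at both levels is a sparse-group-lasso style penalty. The Python sketch below shows the shrinkage step for such a penalty under the simplifying assumption of non-overlapping pathways, whereas the abstract notes that real variants may overlap multiple pathways. The penalty form, function names and numbers are assumptions made for illustration, not the authors' stated algorithm.

```python
import math


def soft_threshold(value, amount):
    """Shrink a scalar toward zero by `amount` (lasso-style)."""
    return math.copysign(max(abs(value) - amount, 0.0), value)


def sparse_group_shrink(coefs, groups, lam_group, lam_single):
    """Two-level shrinkage: per coefficient, then per pathway.

    `groups` maps a pathway name to the indices of its coefficients;
    pathways are assumed not to overlap in this sketch.
    """
    out = [0.0] * len(coefs)
    for indices in groups.values():
        shrunk = [soft_threshold(coefs[i], lam_single) for i in indices]
        norm = math.sqrt(sum(v * v for v in shrunk))
        # A pathway whose shrunken norm is below lam_group is dropped whole.
        scale = max(1.0 - lam_group / norm, 0.0) if norm > 0 else 0.0
        for i, v in zip(indices, shrunk):
            out[i] = scale * v
    return out


# Hypothetical coefficients for two hypothetical pathways.
coefs = [0.9, -0.05, 0.02, 0.03]
groups = {"pathway_A": [0, 1], "pathway_B": [2, 3]}
shrunk_coefs = sparse_group_shrink(coefs, groups, lam_group=0.2, lam_single=0.1)
```

Here pathway_B is removed entirely, while pathway_A survives with only its strongest coefficient non-zero, which is the kind of joint pathway-and-gene selection the abstract describes.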
Sequencing of sporadic Attention-Deficit Hyperactivity Disorder (ADHD) identifies novel and potentially pathogenic de novo variants and excludes overlap with genes associated with autism spectrum disorder.

Science.gov (United States)

Kim, Daniel Seung; Burt, Amber A; Ranchalis, Jane E; Wilmot, Beth; Smith, Joshua D; Patterson, Karynne E; Coe, Bradley P; Li, Yatong K; Bamshad, Michael J; Nikolas, Molly; Eichler, Evan E; Swanson, James M; Nigg, Joel T; Nickerson, Deborah A; Jarvik, Gail P

2017-06-01

Attention-Deficit Hyperactivity Disorder (ADHD) has high heritability; however, studies of common variation account for only a small share of ADHD variance. Using data from affected participants without a family history of ADHD, we sought to identify de novo variants that could account for sporadic ADHD. Considering a total of 128 families, two analyses were conducted in parallel: first, in 11 unaffected parent/affected proband trios (or quads with the addition of an unaffected sibling) we completed exome sequencing. Six de novo missense variants at highly conserved bases were identified and validated from four of the 11 families: the brain-expressed genes TBC1D9, DAGLA, QARS, CSMD2, TRPM2, and WDR83. Separately, in 117 unrelated probands with sporadic ADHD, we sequenced a panel of 26 genes implicated in intellectual disability (ID) and autism spectrum disorder (ASD) to evaluate whether variation in ASD/ID-associated genes was also present in participants with ADHD. Only one putative deleterious variant (Gln600STOP) in CHD1L was identified; this was found in a single proband. Notably, no other nonsense, splice, frameshift, or highly conserved missense variants in the 26 gene panel were identified and validated. These data suggest that de novo variant analysis in families with independently adjudicated sporadic ADHD diagnosis can identify novel genes implicated in ADHD pathogenesis. Moreover, that only one of the 128 cases (0.8%, 11 exome, and 117 MIP sequenced participants) had putative deleterious variants within our data in 26 genes related to ID and ASD suggests significant independence in the genetic pathogenesis of ADHD as compared to ASD and ID phenotypes. © 2017 Wiley Periodicals, Inc.
Differentially expressed genes of Tetrahymena thermophila in response to tributyltin (TBT) identified by suppression subtractive hybridization and real time quantitative PCR.

Science.gov (United States)

Feng, Lifang; Miao, Wei; Wu, Yuxuan

2007-02-15

Tributyltin (TBT) is widely used in antifouling paints, agricultural biocides, and plastic stabilizers around the world, resulting in a serious pollution problem in aquatic environments. However, biomonitors for detecting TBT in freshwater have been lacking. We constructed a suppression subtractive hybridization library of Tetrahymena thermophila exposed to TBT, and screened out 101 Expressed Sequence Tags whose expression was significantly up- or down-regulated by TBT treatment. From these, a series of genes related to TBT toxicity were discovered, such as the glutathione-S-transferase gene (down-regulated), the plasma membrane Ca2+ ATPase isoform 3 gene (up-regulated) and NgoA (up-regulated). Furthermore, their expression under different concentrations of TBT (0.5-40 ppb) was measured by real-time fluorescent quantitative PCR. The differentially expressed genes of T. thermophila identified in response to TBT provide a basis for developing Tetrahymena as a sensitive, rapid and convenient TBT biomonitor in freshwater based on an rDNA inducible expression system.
Detection of Gene Interactions Based on Syntactic Relations

Directory of Open Access Journals (Sweden)

Mi-Young Kim

2008-01-01

Interactions between proteins and genes are considered essential in the description of biomolecular phenomena, and networks of interactions are applied in a systems biology approach. Recently, many studies have sought to extract information from biomolecular text using natural language processing technology. Previous studies have asserted that linguistic information is useful for improving the detection of gene interactions. In particular, syntactic relations among linguistic information are good for detecting gene interactions. However, previous systems give reasonably good precision but poor recall. To improve recall without sacrificing precision, this paper proposes a three-phase method for detecting gene interactions based on syntactic relations. In the first phase, we retrieve syntactic encapsulation categories for each candidate agent and target. In the second phase, we construct a verb list that indicates the nature of the interaction between pairs of genes. In the last phase, we determine direction rules to detect which of two genes is the agent or target. Even without biomolecular knowledge, our method performs reasonably well using a small training dataset. While the first phase contributes to improving recall, the second and third phases contribute to improving precision. In the experimental results using the ICML 05 Workshop on Learning Language in Logic (LLL05) data, our proposed method gave an F-measure of 67.2% for the test data, significantly outperforming previous methods. We also describe the contribution of each phase to the performance.
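The precision/recall trade-off discussed in this abstract is summarized by the F-measure. The following is a minimal sketch, assuming the reported score is the balanced F1 (harmonic mean of precision and recall); the abstract does not state which variant was used, and the function name and input values below are hypothetical illustrations, not figures from the paper.

```python
def f_measure(precision, recall, beta=1.0):
    """Weighted harmonic mean of precision and recall.

    beta=1.0 gives the balanced F1 score (an assumption here; the
    abstract only says "F-measure").
    """
    if precision < 0 or recall < 0:
        raise ValueError("precision and recall must be non-negative")
    denominator = beta ** 2 * precision + recall
    if denominator == 0:
        # No true positives at all: define the score as 0.
        return 0.0
    return (1 + beta ** 2) * precision * recall / denominator


# Hypothetical values: good precision but poor recall drags the score down.
example_score = f_measure(0.8, 0.5)
```

Because the harmonic mean is dominated by the smaller of the two values, a system with good precision but poor recall still scores low, which is why raising recall without sacrificing precision improves the overall figure.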
Flavonoid Biosynthesis Genes Putatively Identified in the Aromatic Plant Polygonum minus via Expressed Sequences Tag (EST) Analysis

Directory of Open Access Journals (Sweden)

Zamri Zainal

2012-02-01

P. minus is an aromatic plant, the leaf of which is widely used as a food additive and in the perfume industry. The leaf also accumulates secondary metabolites that act as active ingredients, such as flavonoids. Due to limited genomic and transcriptomic data, the biosynthetic pathway of flavonoids is currently unclear. Identification of candidate genes involved in the flavonoid biosynthetic pathway will significantly contribute to understanding the biosynthesis of active compounds. We have constructed a standard cDNA library from P. minus leaves, and two normalized full-length enriched cDNA libraries were constructed from stem and root organs in order to create a gene resource for the biosynthesis of secondary metabolites, especially flavonoid biosynthesis. Thus, large‑scale sequencing of P. minus cDNA libraries identified 4196 expressed sequences tags (ESTs), which were deposited in dbEST in the National Center of Biotechnology Information (NCBI). From the three constructed cDNA libraries, 11 ESTs encoding seven genes were mapped to the flavonoid biosynthetic pathway. Finally, three flavonoid biosynthetic pathway-related ESTs, chalcone synthase, CHS (JG745304), flavonol synthase, FLS (JG705819), and leucoanthocyanidin dioxygenase, LDOX (JG745247), were selected for further examination by quantitative RT-PCR (qRT-PCR) in different P. minus organs. Expression was detected in leaf, stem and root. Gene expression studies have been initiated in order to better understand the underlying physiological processes.
Association of aryl hydrocarbon receptor-related gene variants with the severity of autism spectrum disorders

Directory of Open Access Journals (Sweden)

Takashi X. Fujisawa

2016-11-01

Exposure to environmental chemicals, such as dioxin, is known to have adverse effects on the homeostasis of gonadal steroids, thereby potentially altering the sexual differentiation of the brain to express autistic traits. Because dioxin-like chemicals act on the aryl hydrocarbon receptor (AhR), polymorphisms and mutations of AhR-related genes may exert pathological influences on sexual differentiation of the brain, causing autistic traits. To ascertain the relationship between AhR-related gene polymorphisms and autism susceptibility, we genotyped these polymorphisms in patients and controls and determined whether gene and genotype distributions differed between the two groups. In addition, to clarify the relationships between the polymorphisms and the severity of autism, we compared the genotypes of two AhR-related gene polymorphisms (rs2066853, rs2228099) with the severity of autistic symptoms. Although no statistically significant difference was found between autism spectrum disorder (ASD) patients and control individuals in the genotypic distribution of any of the polymorphisms studied herein, a significant difference in the total severity score was observed for the rs2228099 polymorphism, suggesting that the polymorphism modifies the severity of ASD symptoms but not ASD susceptibility. Moreover, a significant difference in the social communication severity score was observed. These results suggest that the rs2228099 polymorphism is possibly associated with the severity of social communication impairment among the diverse ASD symptoms.
Genes2WordCloud: a quick way to identify biological themes from gene lists and free text.

Science.gov (United States)

Baroukh, Caroline; Jenkins, Sherry L; Dannenfelser, Ruth; Ma'ayan, Avi

2011-10-13

Word-clouds recently emerged on the web as a solution for quickly summarizing text by maximizing the display of the most relevant terms about a specific topic in the minimum amount of space. As biologists are faced with a daunting amount of new research data commonly presented in textual formats, word-clouds can be used to summarize and represent biological and/or biomedical content for various applications. Genes2WordCloud is a web application that enables users to quickly identify biological themes from gene lists and research-relevant text by constructing and displaying word-clouds. It provides users with several different options and ideas for the sources that can be used to generate a word-cloud. Different options for rendering and coloring the word-clouds give users the flexibility to quickly generate customized word-clouds of their choice. Genes2WordCloud is a word-cloud generator and a word-cloud viewer that is based on WordCram and implemented using Java, Processing, AJAX, MySQL, and PHP. Text is fetched from several sources and then processed to extract the most relevant terms with their computed weights based on word frequencies. Genes2WordCloud is freely available for use online; it is open source software and is available for installation on any web-site along with supporting documentation at http://www.maayanlab.net/G2W. Genes2WordCloud provides a useful way to summarize and visualize large amounts of textual biological data or to find biological themes from several different sources. The open source availability of the software enables users to implement customized word-clouds on their own web-sites and desktop applications.
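The frequency-based term weighting mentioned in this abstract can be illustrated with a minimal sketch. This is not the Genes2WordCloud implementation (which the abstract says is built on WordCram, Java, and PHP); the function name, the stop-word list, and the scaling of weights by the highest count are all assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical stop-word list, for illustration only.
STOP_WORDS = {"the", "of", "and", "in", "to", "a", "is", "by", "for"}


def term_weights(text, top_n=5):
    """Return the top_n most frequent terms with weights scaled to (0, 1]."""
    # Lowercase the text and split it into alphanumeric tokens.
    words = re.findall(r"[a-z0-9]+", text.lower())
    counts = Counter(w for w in words if w not in STOP_WORDS)
    if not counts:
        return []
    max_count = max(counts.values())
    # Weight each term by its count relative to the most frequent term.
    return [(term, count / max_count) for term, count in counts.most_common(top_n)]


sample = "TP53 regulates apoptosis. Apoptosis and TP53 mutations drive cancer; TP53 is key."
weights = term_weights(sample, top_n=3)
```

In a word-cloud, such weights would typically drive font size, so the most frequent term is drawn largest.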
