WorldWideScience

Sample records for candidate gene sets

  1. No Evidence That Schizophrenia Candidate Genes Are More Associated With Schizophrenia Than Noncandidate Genes.

    Science.gov (United States)

    Johnson, Emma C; Border, Richard; Melroy-Greif, Whitney E; de Leeuw, Christiaan A; Ehringer, Marissa A; Keller, Matthew C

    2017-11-15

    A recent analysis of 25 historical candidate gene polymorphisms for schizophrenia in the largest genome-wide association study conducted to date suggested that these commonly studied variants were no more associated with the disorder than would be expected by chance. However, the same study identified other variants within those candidate genes that demonstrated genome-wide significant associations with schizophrenia. As such, it is possible that variants within historic schizophrenia candidate genes are associated with schizophrenia at levels above those expected by chance, even if the most-studied specific polymorphisms are not. The present study used association statistics from the largest schizophrenia genome-wide association study conducted to date as input to a gene set analysis to investigate whether variants within schizophrenia candidate genes are enriched for association with schizophrenia. As a group, variants in the most-studied candidate genes were no more associated with schizophrenia than were variants in control sets of noncandidate genes. While a small subset of candidate genes did appear to be significantly associated with schizophrenia, these genes were not particularly noteworthy given the large number of more strongly associated noncandidate genes. The history of schizophrenia research should serve as a cautionary tale to candidate gene investigators examining other phenotypes: our findings indicate that the most investigated candidate gene hypotheses of schizophrenia are not well supported by genome-wide association studies, and it is likely that this will be the case for other complex traits as well. Copyright © 2017 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.

  2. Disease candidate gene identification and prioritization using protein interaction networks

    Directory of Open Access Journals (Sweden)

    Aronow Bruce J

    2009-02-01

    Full Text Available Abstract Background Although most of the current disease candidate gene identification and prioritization methods depend on functional annotations, the coverage of the gene functional annotations is a limiting factor. In the current study, we describe a candidate gene prioritization method that is entirely based on protein-protein interaction network (PPIN analyses. Results For the first time, extended versions of the PageRank and HITS algorithms, and the K-Step Markov method are applied to prioritize disease candidate genes in a training-test schema. Using a list of known disease-related genes from our earlier study as a training set ("seeds", and the rest of the known genes as a test list, we perform large-scale cross validation to rank the candidate genes and also evaluate and compare the performance of our approach. Under appropriate settings – for example, a back probability of 0.3 for PageRank with Priors and HITS with Priors, and step size 6 for K-Step Markov method – the three methods achieved a comparable AUC value, suggesting a similar performance. Conclusion Even though network-based methods are generally not as effective as integrated functional annotation-based methods for disease candidate gene prioritization, in a one-to-one comparison, PPIN-based candidate gene prioritization performs better than all other gene features or annotations. Additionally, we demonstrate that methods used for studying both social and Web networks can be successfully used for disease candidate gene prioritization.

  3. Dissecting the organ specificity of insecticide resistance candidate genes in Anopheles gambiae: known and novel candidate genes.

    Science.gov (United States)

    Ingham, Victoria A; Jones, Christopher M; Pignatelli, Patricia; Balabanidou, Vasileia; Vontas, John; Wagstaff, Simon C; Moore, Jonathan D; Ranson, Hilary

    2014-11-25

    be recommended as a preferred means to identify new candidate insecticide resistant genes. Instead the rich data set on in vivo sites of transcription should be consulted when designing follow up qPCR validation steps, or for screening known candidates in field populations.

  4. The prediction of candidate genes for cervix related cancer through gene ontology and graph theoretical approach.

    Science.gov (United States)

    Hindumathi, V; Kranthi, T; Rao, S B; Manimaran, P

    2014-06-01

    With rapidly changing technology, prediction of candidate genes has become an indispensable task in recent years mainly in the field of biological research. The empirical methods for candidate gene prioritization that succors to explore the potential pathway between genetic determinants and complex diseases are highly cumbersome and labor intensive. In such a scenario predicting potential targets for a disease state through in silico approaches are of researcher's interest. The prodigious availability of protein interaction data coupled with gene annotation renders an ease in the accurate determination of disease specific candidate genes. In our work we have prioritized the cervix related cancer candidate genes by employing Csaba Ortutay and his co-workers approach of identifying the candidate genes through graph theoretical centrality measures and gene ontology. With the advantage of the human protein interaction data, cervical cancer gene sets and the ontological terms, we were able to predict 15 novel candidates for cervical carcinogenesis. The disease relevance of the anticipated candidate genes was corroborated through a literature survey. Also the presence of the drugs for these candidates was detected through Therapeutic Target Database (TTD) and DrugMap Central (DMC) which affirms that they may be endowed as potential drug targets for cervical cancer.

  5. Reranking candidate gene models with cross-species comparison for improved gene prediction

    Directory of Open Access Journals (Sweden)

    Pereira Fernando CN

    2008-10-01

    Full Text Available Abstract Background Most gene finders score candidate gene models with state-based methods, typically HMMs, by combining local properties (coding potential, splice donor and acceptor patterns, etc. Competing models with similar state-based scores may be distinguishable with additional information. In particular, functional and comparative genomics datasets may help to select among competing models of comparable probability by exploiting features likely to be associated with the correct gene models, such as conserved exon/intron structure or protein sequence features. Results We have investigated the utility of a simple post-processing step for selecting among a set of alternative gene models, using global scoring rules to rerank competing models for more accurate prediction. For each gene locus, we first generate the K best candidate gene models using the gene finder Evigan, and then rerank these models using comparisons with putative orthologous genes from closely-related species. Candidate gene models with lower scores in the original gene finder may be selected if they exhibit strong similarity to probable orthologs in coding sequence, splice site location, or signal peptide occurrence. Experiments on Drosophila melanogaster demonstrate that reranking based on cross-species comparison outperforms the best gene models identified by Evigan alone, and also outperforms the comparative gene finders GeneWise and Augustus+. Conclusion Reranking gene models with cross-species comparison improves gene prediction accuracy. This straightforward method can be readily adapted to incorporate additional lines of evidence, as it requires only a ranked source of candidate gene models.

  6. Ranking candidate disease genes from gene expression and protein interaction: a Katz-centrality based approach.

    Directory of Open Access Journals (Sweden)

    Jing Zhao

    Full Text Available Many diseases have complex genetic causes, where a set of alleles can affect the propensity of getting the disease. The identification of such disease genes is important to understand the mechanistic and evolutionary aspects of pathogenesis, improve diagnosis and treatment of the disease, and aid in drug discovery. Current genetic studies typically identify chromosomal regions associated specific diseases. But picking out an unknown disease gene from hundreds of candidates located on the same genomic interval is still challenging. In this study, we propose an approach to prioritize candidate genes by integrating data of gene expression level, protein-protein interaction strength and known disease genes. Our method is based only on two, simple, biologically motivated assumptions--that a gene is a good disease-gene candidate if it is differentially expressed in cases and controls, or that it is close to other disease-gene candidates in its protein interaction network. We tested our method on 40 diseases in 58 gene expression datasets of the NCBI Gene Expression Omnibus database. On these datasets our method is able to predict unknown disease genes as well as identifying pleiotropic genes involved in the physiological cellular processes of many diseases. Our study not only provides an effective algorithm for prioritizing candidate disease genes but is also a way to discover phenotypic interdependency, cooccurrence and shared pathophysiology between different disorders.

  7. Finding gene regulatory network candidates using the gene expression knowledge base.

    Science.gov (United States)

    Venkatesan, Aravind; Tripathi, Sushil; Sanz de Galdeano, Alejandro; Blondé, Ward; Lægreid, Astrid; Mironov, Vladimir; Kuiper, Martin

    2014-12-10

    Network-based approaches for the analysis of large-scale genomics data have become well established. Biological networks provide a knowledge scaffold against which the patterns and dynamics of 'omics' data can be interpreted. The background information required for the construction of such networks is often dispersed across a multitude of knowledge bases in a variety of formats. The seamless integration of this information is one of the main challenges in bioinformatics. The Semantic Web offers powerful technologies for the assembly of integrated knowledge bases that are computationally comprehensible, thereby providing a potentially powerful resource for constructing biological networks and network-based analysis. We have developed the Gene eXpression Knowledge Base (GeXKB), a semantic web technology based resource that contains integrated knowledge about gene expression regulation. To affirm the utility of GeXKB we demonstrate how this resource can be exploited for the identification of candidate regulatory network proteins. We present four use cases that were designed from a biological perspective in order to find candidate members relevant for the gastrin hormone signaling network model. We show how a combination of specific query definitions and additional selection criteria derived from gene expression data and prior knowledge concerning candidate proteins can be used to retrieve a set of proteins that constitute valid candidates for regulatory network extensions. Semantic web technologies provide the means for processing and integrating various heterogeneous information sources. The GeXKB offers biologists such an integrated knowledge resource, allowing them to address complex biological questions pertaining to gene expression. This work illustrates how GeXKB can be used in combination with gene expression results and literature information to identify new potential candidates that may be considered for extending a gene regulatory network.

  8. Identification of candidate genes for dyslexia susceptibility on chromosome 18.

    Directory of Open Access Journals (Sweden)

    Thomas S Scerri

    2010-10-01

    Full Text Available Six independent studies have identified linkage to chromosome 18 for developmental dyslexia or general reading ability. Until now, no candidate genes have been identified to explain this linkage. Here, we set out to identify the gene(s conferring susceptibility by a two stage strategy of linkage and association analysis.Linkage analysis: 264 UK families and 155 US families each containing at least one child diagnosed with dyslexia were genotyped with a dense set of microsatellite markers on chromosome 18. Association analysis: Using a discovery sample of 187 UK families, nearly 3000 SNPs were genotyped across the chromosome 18 dyslexia susceptibility candidate region. Following association analysis, the top ranking SNPs were then genotyped in the remaining samples. The linkage analysis revealed a broad signal that spans approximately 40 Mb from 18p11.2 to 18q12.2. Following the association analysis and subsequent replication attempts, we observed consistent association with the same SNPs in three genes; melanocortin 5 receptor (MC5R, dymeclin (DYM and neural precursor cell expressed, developmentally down-regulated 4-like (NEDD4L.Along with already published biological evidence, MC5R, DYM and NEDD4L make attractive candidates for dyslexia susceptibility genes. However, further replication and functional studies are still required.

  9. Candidate Gene Identification of Flowering Time Genes in Cotton

    Directory of Open Access Journals (Sweden)

    Corrinne E. Grover

    2015-07-01

    Full Text Available Flowering time control is critically important to all sexually reproducing angiosperms in both natural ecological and agronomic settings. Accordingly, there is much interest in defining the genes involved in the complex flowering-time network and how these respond to natural and artificial selection, the latter often entailing transitions in day-length responses. Here we describe a candidate gene analysis in the cotton genus , which uses homologs from the well-described flowering network to bioinformatically and phylogenetically identify orthologs in the published genome sequence from Ulbr., one of the two model diploid progenitors of the commercially important allopolyploid cottons, L. and L. Presence and patterns of expression were evaluated from 13 aboveground tissues related to flowering for each of the candidate genes using allopolyploid as a model. Furthermore, we use a comparative context to determine copy number variability of each key gene family across 10 published angiosperm genomes. Data suggest a pattern of repeated loss of duplicates following ancient whole-genome doubling events in diverse lineages. The data presented here provide a foundation for understanding both the parallel evolution of day-length neutrality in domesticated cottons and the flowering-time network, in general, in this important crop plant.

  10. Comparative genomic analysis of Brucella abortus vaccine strain 104M reveals a set of candidate genes associated with its virulence attenuation.

    Science.gov (United States)

    Yu, Dong; Hui, Yiming; Zai, Xiaodong; Xu, Junjie; Liang, Long; Wang, Bingxiang; Yue, Junjie; Li, Shanhu

    2015-01-01

    The Brucella abortus strain 104M, a spontaneously attenuated strain, has been used as a vaccine strain in humans against brucellosis for 6 decades in China. Despite many studies, the molecular mechanisms that cause the attenuation are still unclear. Here, we determined the whole-genome sequence of 104M and conducted a comprehensive comparative analysis against the whole genome sequences of the virulent strain, A13334, and other reference strains. This analysis revealed a highly similar genome structure between 104M and A13334. The further comparative genomic analysis between 104M and A13334 revealed a set of genes missing in 104M. Some of these genes were identified to be directly or indirectly associated with virulence. Similarly, a set of mutations in the virulence-related genes was also identified, which may be related to virulence alteration. This study provides a set of candidate genes associated with virulence attenuation in B.abortus vaccine strain 104M.

  11. Candidate gene analysis using imputed genotypes: cell cycle single-nucleotide polymorphisms and ovarian cancer risk

    DEFF Research Database (Denmark)

    Goode, Ellen L; Fridley, Brooke L; Vierkant, Robert A

    2009-01-01

    Polymorphisms in genes critical to cell cycle control are outstanding candidates for association with ovarian cancer risk; numerous genes have been interrogated by multiple research groups using differing tagging single-nucleotide polymorphism (SNP) sets. To maximize information gleaned from......, and rs3212891; CDK2 rs2069391, rs2069414, and rs17528736; and CCNE1 rs3218036. These results exemplify the utility of imputation in candidate gene studies and lend evidence to a role of cell cycle genes in ovarian cancer etiology, suggest a reduced set of SNPs to target in additional cases and controls....

  12. Analysis of breast cancer metastasis candidate genes from next generation-sequencing via systematic functional genomics

    DEFF Research Database (Denmark)

    Blomstrøm, Monica Marie

    2016-01-01

    several growth modulators and invasion modulators were identified and independently validated. These candidates revealed a group of genes with metastasis-related functions in vitro that are involved in RNA-related processes, such as RNA-processing. Moreover, a general feature was that proliferation......) and non-CSCs. The main goal of this project was to functionally characterize a set of candidate genes recovered from next-generation sequencing analysis for their role in breast cancer metastasis formation. The starting gene set comprised 104 gene variants; i.e. 57 wildtype and 47 mutated variants. During...

  13. Evaluating historical candidate genes for schizophrenia

    DEFF Research Database (Denmark)

    Farrell, M S; Werge, T; Sklar, P

    2015-01-01

    Prior to the genome-wide association era, candidate gene studies were a major approach in schizophrenia genetics. In this invited review, we consider the current status of 25 historical candidate genes for schizophrenia (for example, COMT, DISC1, DTNBP1 and NRG1). The initial study for 24 of thes...

  14. A random set scoring model for prioritization of disease candidate genes using protein complexes and data-mining of GeneRIF, OMIM and PubMed records.

    Science.gov (United States)

    Jiang, Li; Edwards, Stefan M; Thomsen, Bo; Workman, Christopher T; Guldbrandtsen, Bernt; Sørensen, Peter

    2014-09-24

    Prioritizing genetic variants is a challenge because disease susceptibility loci are often located in genes of unknown function or the relationship with the corresponding phenotype is unclear. A global data-mining exercise on the biomedical literature can establish the phenotypic profile of genes with respect to their connection to disease phenotypes. The importance of protein-protein interaction networks in the genetic heterogeneity of common diseases or complex traits is becoming increasingly recognized. Thus, the development of a network-based approach combined with phenotypic profiling would be useful for disease gene prioritization. We developed a random-set scoring model and implemented it to quantify phenotype relevance in a network-based disease gene-prioritization approach. We validated our approach based on different gene phenotypic profiles, which were generated from PubMed abstracts, OMIM, and GeneRIF records. We also investigated the validity of several vocabulary filters and different likelihood thresholds for predicted protein-protein interactions in terms of their effect on the network-based gene-prioritization approach, which relies on text-mining of the phenotype data. Our method demonstrated good precision and sensitivity compared with those of two alternative complex-based prioritization approaches. We then conducted a global ranking of all human genes according to their relevance to a range of human diseases. The resulting accurate ranking of known causal genes supported the reliability of our approach. Moreover, these data suggest many promising novel candidate genes for human disorders that have a complex mode of inheritance. We have implemented and validated a network-based approach to prioritize genes for human diseases based on their phenotypic profile. We have devised a powerful and transparent tool to identify and rank candidate genes. Our global gene prioritization provides a unique resource for the biological interpretation of data

  15. Beyond main effects of gene-sets: harsh parenting moderates the association between a dopamine gene-set and child externalizing behavior.

    Science.gov (United States)

    Windhorst, Dafna A; Mileva-Seitz, Viara R; Rippe, Ralph C A; Tiemeier, Henning; Jaddoe, Vincent W V; Verhulst, Frank C; van IJzendoorn, Marinus H; Bakermans-Kranenburg, Marian J

    2016-08-01

    In a longitudinal cohort study, we investigated the interplay of harsh parenting and genetic variation across a set of functionally related dopamine genes, in association with children's externalizing behavior. This is one of the first studies to employ gene-based and gene-set approaches in tests of Gene by Environment (G × E) effects on complex behavior. This approach can offer an important alternative or complement to candidate gene and genome-wide environmental interaction (GWEI) studies in the search for genetic variation underlying individual differences in behavior. Genetic variants in 12 autosomal dopaminergic genes were available in an ethnically homogenous part of a population-based cohort. Harsh parenting was assessed with maternal (n = 1881) and paternal (n = 1710) reports at age 3. Externalizing behavior was assessed with the Child Behavior Checklist (CBCL) at age 5 (71 ± 3.7 months). We conducted gene-set analyses of the association between variation in dopaminergic genes and externalizing behavior, stratified for harsh parenting. The association was statistically significant or approached significance for children without harsh parenting experiences, but was absent in the group with harsh parenting. Similarly, significant associations between single genes and externalizing behavior were only found in the group without harsh parenting. Effect sizes in the groups with and without harsh parenting did not differ significantly. Gene-environment interaction tests were conducted for individual genetic variants, resulting in two significant interaction effects (rs1497023 and rs4922132) after correction for multiple testing. Our findings are suggestive of G × E interplay, with associations between dopamine genes and externalizing behavior present in children without harsh parenting, but not in children with harsh parenting experiences. Harsh parenting may overrule the role of genetic factors in externalizing behavior. Gene-based and gene-set

  16. Candidate genes in panic disorder

    DEFF Research Database (Denmark)

    Howe, A. S.; Buttenschön, Henriette N; Bani-Fatemi, A.

    2016-01-01

    The utilization of molecular genetics approaches in examination of panic disorder (PD) has implicated several variants as potential susceptibility factors for panicogenesis. However, the identification of robust PD susceptibility genes has been complicated by phenotypic diversity, underpowered...... association studies and ancestry-specific effects. In the present study, we performed a succinct review of case-control association studies published prior to April 2015. Meta-analyses were performed for candidate gene variants examined in at least three studies using the Cochrane Mantel-Haenszel fixed......-effect model. Secondary analyses were also performed to assess the influences of sex, agoraphobia co-morbidity and ancestry-specific effects on panicogenesis. Meta-analyses were performed on 23 variants in 20 PD candidate genes. Significant associations after correction for multiple testing were observed...

  17. Speeding disease gene discovery by sequence based candidate prioritization

    Directory of Open Access Journals (Sweden)

    Porteous David J

    2005-03-01

    Full Text Available Abstract Background Regions of interest identified through genetic linkage studies regularly exceed 30 centimorgans in size and can contain hundreds of genes. Traditionally this number is reduced by matching functional annotation to knowledge of the disease or phenotype in question. However, here we show that disease genes share patterns of sequence-based features that can provide a good basis for automatic prioritization of candidates by machine learning. Results We examined a variety of sequence-based features and found that for many of them there are significant differences between the sets of genes known to be involved in human hereditary disease and those not known to be involved in disease. We have created an automatic classifier called PROSPECTR based on those features using the alternating decision tree algorithm which ranks genes in the order of likelihood of involvement in disease. On average, PROSPECTR enriches lists for disease genes two-fold 77% of the time, five-fold 37% of the time and twenty-fold 11% of the time. Conclusion PROSPECTR is a simple and effective way to identify genes involved in Mendelian and oligogenic disorders. It performs markedly better than the single existing sequence-based classifier on novel data. PROSPECTR could save investigators looking at large regions of interest time and effort by prioritizing positional candidate genes for mutation detection and case-control association studies.

  18. VennPainter: A Tool for the Comparison and Identification of Candidate Genes Based on Venn Diagrams.

    Directory of Open Access Journals (Sweden)

    Guoliang Lin

    Full Text Available VennPainter is a program for depicting unique and shared sets of genes lists and generating Venn diagrams, by using the Qt C++ framework. The software produces Classic Venn, Edwards' Venn and Nested Venn diagrams and allows for eight sets in a graph mode and 31 sets in data processing mode only. In comparison, previous programs produce Classic Venn and Edwards' Venn diagrams and allow for a maximum of six sets. The software incorporates user-friendly features and works in Windows, Linux and Mac OS. Its graphical interface does not require a user to have programing skills. Users can modify diagram content for up to eight datasets because of the Scalable Vector Graphics output. VennPainter can provide output results in vertical, horizontal and matrix formats, which facilitates sharing datasets as required for further identification of candidate genes. Users can obtain gene lists from shared sets by clicking the numbers on the diagram. Thus, VennPainter is an easy-to-use, highly efficient, cross-platform and powerful program that provides a more comprehensive tool for identifying candidate genes and visualizing the relationships among genes or gene families in comparative analysis.

  19. The Candidate Cancer Gene Database: a database of cancer driver genes from forward genetic screens in mice.

    Science.gov (United States)

    Abbott, Kenneth L; Nyre, Erik T; Abrahante, Juan; Ho, Yen-Yi; Isaksson Vogel, Rachel; Starr, Timothy K

    2015-01-01

    Identification of cancer driver gene mutations is crucial for advancing cancer therapeutics. Due to the overwhelming number of passenger mutations in the human tumor genome, it is difficult to pinpoint causative driver genes. Using transposon mutagenesis in mice many laboratories have conducted forward genetic screens and identified thousands of candidate driver genes that are highly relevant to human cancer. Unfortunately, this information is difficult to access and utilize because it is scattered across multiple publications using different mouse genome builds and strength metrics. To improve access to these findings and facilitate meta-analyses, we developed the Candidate Cancer Gene Database (CCGD, http://ccgd-starrlab.oit.umn.edu/). The CCGD is a manually curated database containing a unified description of all identified candidate driver genes and the genomic location of transposon common insertion sites (CISs) from all currently published transposon-based screens. To demonstrate relevance to human cancer, we performed a modified gene set enrichment analysis using KEGG pathways and show that human cancer pathways are highly enriched in the database. We also used hierarchical clustering to identify pathways enriched in blood cancers compared to solid cancers. The CCGD is a novel resource available to scientists interested in the identification of genetic drivers of cancer. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  20. Prediction potential of candidate biomarker sets identified and validated on gene expression data from multiple datasets

    Directory of Open Access Journals (Sweden)

    Karacali Bilge

    2007-10-01

    Full Text Available Abstract Background Independently derived expression profiles of the same biological condition often have few genes in common. In this study, we created populations of expression profiles from publicly available microarray datasets of cancer (breast, lymphoma and renal samples linked to clinical information with an iterative machine learning algorithm. ROC curves were used to assess the prediction error of each profile for classification. We compared the prediction error of profiles correlated with molecular phenotype against profiles correlated with relapse-free status. Prediction error of profiles identified with supervised univariate feature selection algorithms were compared to profiles selected randomly from a all genes on the microarray platform and b a list of known disease-related genes (a priori selection. We also determined the relevance of expression profiles on test arrays from independent datasets, measured on either the same or different microarray platforms. Results Highly discriminative expression profiles were produced on both simulated gene expression data and expression data from breast cancer and lymphoma datasets on the basis of ER and BCL-6 expression, respectively. Use of relapse-free status to identify profiles for prognosis prediction resulted in poorly discriminative decision rules. Supervised feature selection resulted in more accurate classifications than random or a priori selection, however, the difference in prediction error decreased as the number of features increased. These results held when decision rules were applied across-datasets to samples profiled on the same microarray platform. Conclusion Our results show that many gene sets predict molecular phenotypes accurately. Given this, expression profiles identified using different training datasets should be expected to show little agreement. In addition, we demonstrate the difficulty in predicting relapse directly from microarray data using supervised machine

  1. Candidate gene studies and the quest for the entrepreneurial gene

    NARCIS (Netherlands)

    M.J.H.M. van der Loos (Matthijs); Ph.D. Koellinger (Philipp); P.J.F. Groenen (Patrick); C.A. Rietveld (Niels); F. Rivadeneira Ramirez (Fernando); F.J.A. van Rooij (Frank); A.G. Uitterlinden (André); A. Hofman (Albert); A.R. Thurik (Roy)

    2011-01-01

    textabstractCandidate gene studies of human behavior are gaining interest in economics and entrepreneurship research. Performing and interpreting these studies is not straightforward because the selection of candidates influences the interpretation of the results. As an example, Nicolaou et al.

  2. Candidate genes in ocular dominance plasticity

    NARCIS (Netherlands)

    Rietman, M.L.; Sommeijer, J.-P.; Levelt, C.N.; Heimel, J.A.; Brussaard, A.B.; Borst, J.G.G.; Elgersma, Y.; Galjart, N.; van der Horst, G.T.; Pennartz, C.M.; Smit, A.B.; Spruijt, B.M.; Verhage, M.; de Zeeuw, C.I.

    2012-01-01

    Many studies have been devoted to the identification of genes involved in experience-dependent plasticity in the visual cortex. To discover new candidate genes, we have reexamined data from one such study on ocular dominance (OD) plasticity in recombinant inbred BXD mouse strains. We have correlated

  3. Discovery of cancer common and specific driver gene sets

    Science.gov (United States)

    2017-01-01

    Abstract Cancer is known as a disease mainly caused by gene alterations. Discovery of mutated driver pathways or gene sets is becoming an important step to understand molecular mechanisms of carcinogenesis. However, systematically investigating commonalities and specificities of driver gene sets among multiple cancer types is still a great challenge, but this investigation will undoubtedly benefit deciphering cancers and will be helpful for personalized therapy and precision medicine in cancer treatment. In this study, we propose two optimization models to de novo discover common driver gene sets among multiple cancer types (ComMDP) and specific driver gene sets of one certain or multiple cancer types to other cancers (SpeMDP), respectively. We first apply ComMDP and SpeMDP to simulated data to validate their efficiency. Then, we further apply these methods to 12 cancer types from The Cancer Genome Atlas (TCGA) and obtain several biologically meaningful driver pathways. As examples, we construct a common cancer pathway model for BRCA and OV, infer a complex driver pathway model for BRCA carcinogenesis based on common driver gene sets of BRCA with eight cancer types, and investigate specific driver pathways of the liquid cancer lymphoblastic acute myeloid leukemia (LAML) versus other solid cancer types. In these processes more candidate cancer genes are also found. PMID:28168295

  4. Utilization of gene mapping and candidate gene mutation screening for diagnosing clinically equivocal conditions: a Norrie disease case study.

    Science.gov (United States)

    Chini, Vasiliki; Stambouli, Danai; Nedelea, Florina Mihaela; Filipescu, George Alexandru; Mina, Diana; Kambouris, Marios; El-Shantil, Hatem

    2014-06-01

    Prenatal diagnosis was requested for an undiagnosed eye disease showing X-linked inheritance in a family. No medical records existed for the affected family members. Mapping of the X chromosome and candidate gene mutation screening identified a c.C267A[p.F89L] mutation in NPD previously described as possibly causing Norrie disease. The detection of the c.C267A[p.F89L] variant in another unrelated family confirms the pathogenic nature of the mutation for the Norrie disease phenotype. Gene mapping, haplotype analysis, and candidate gene screening have been previously utilized in research applications but were applied here in a diagnostic setting due to the scarcity of available clinical information. The clinical diagnosis and mutation identification were critical for providing proper genetic counseling and prenatal diagnosis for this family.

  5. ENU Mutagenesis in Mice Identifies Candidate Genes For Hypogonadism

    Science.gov (United States)

    Weiss, Jeffrey; Hurley, Lisa A.; Harris, Rebecca M.; Finlayson, Courtney; Tong, Minghan; Fisher, Lisa A.; Moran, Jennifer L.; Beier, David R.; Mason, Christopher; Jameson, J. Larry

    2012-01-01

    Genome-wide mutagenesis was performed in mice to identify candidate genes for male infertility, for which the predominant causes remain idiopathic. Mice were mutagenized using N-ethyl-N-nitrosourea (ENU), bred, and screened for phenotypes associated with the male urogenital system. Fifteen heritable lines were isolated and chromosomal loci were assigned using low density genome-wide SNP arrays. Ten of the fifteen lines were pursued further using higher resolution SNP analysis to narrow the candidate gene regions. Exon sequencing of candidate genes identified mutations in mice with cystic kidneys (Bicc1), cryptorchidism (Rxfp2), restricted germ cell deficiency (Plk4), and severe germ cell deficiency (Prdm9). In two other lines with severe hypogonadism candidate sequencing failed to identify mutations, suggesting defects in genes with previously undocumented roles in gonadal function. These genomic intervals were sequenced in their entirety and a candidate mutation was identified in SnrpE in one of the two lines. The line harboring the SnrpE variant retains substantial spermatogenesis despite small testis size, an unusual phenotype. In addition to the reproductive defects, heritable phenotypes were observed in mice with ataxia (Myo5a), tremors (Pmp22), growth retardation (unknown gene), and hydrocephalus (unknown gene). These results demonstrate that the ENU screen is an effective tool for identifying potential causes of male infertility. PMID:22258617

  6. Deep Sequencing of 71 Candidate Genes to Characterize Variation Associated with Alcohol Dependence.

    Science.gov (United States)

    Clark, Shaunna L; McClay, Joseph L; Adkins, Daniel E; Kumar, Gaurav; Aberg, Karolina A; Nerella, Srilaxmi; Xie, Linying; Collins, Ann L; Crowley, James J; Quackenbush, Corey R; Hilliard, Christopher E; Shabalin, Andrey A; Vrieze, Scott I; Peterson, Roseann E; Copeland, William E; Silberg, Judy L; McGue, Matt; Maes, Hermine; Iacono, William G; Sullivan, Patrick F; Costello, Elizabeth J; van den Oord, Edwin J

    2017-04-01

    Previous genomewide association studies (GWASs) have identified a number of putative risk loci for alcohol dependence (AD). However, only a few loci have replicated and these replicated variants only explain a small proportion of AD risk. Using an innovative approach, the goal of this study was to generate hypotheses about potentially causal variants for AD that can be explored further through functional studies. We employed targeted capture of 71 candidate loci and flanking regions followed by next-generation deep sequencing (mean coverage 78X) in 806 European Americans. Regions included in our targeted capture library were genes identified through published GWAS of alcohol, all human alcohol and aldehyde dehydrogenases, reward system genes including dopaminergic and opioid receptors, prioritized candidate genes based on previous associations, and genes involved in the absorption, distribution, metabolism, and excretion of drugs. We performed single-locus tests to determine if any single variant was associated with AD symptom count. Sets of variants that overlapped with biologically meaningful annotations were tested for association in aggregate. No single, common variant was significantly associated with AD in our study. We did, however, find evidence for association with several variant sets. Two variant sets were significant at the q-value <0.10 level: a genic enhancer for ADHFE1 (p = 1.47 × 10 -5 ; q = 0.019), an alcohol dehydrogenase, and ADORA1 (p = 5.29 × 10 -5 ; q = 0.035), an adenosine receptor that belongs to a G-protein-coupled receptor gene family. To our knowledge, this is the first sequencing study of AD to examine variants in entire genes, including flanking and regulatory regions. We found that in addition to protein coding variant sets, regulatory variant sets may play a role in AD. From these findings, we have generated initial functional hypotheses about how these sets may influence AD. Copyright © 2017 by the Research Society on

  7. [Obesity studies in candidate genes].

    Science.gov (United States)

    Ochoa, María del Carmen; Martí, Amelia; Martínez, J Alfredo

    2004-04-17

    There are more than 430 chromosomic regions with gene variants involved in body weight regulation and obesity development. Polymorphisms in genes related to energy expenditure--uncoupling proteins (UCPs), related to adipogenesis and insulin resistance--hormone-sensitive lipase (HLS), peroxisome proliferator-activated receptor gamma (PPAR gamma), beta adrenergic receptors (ADRB2,3), and alfa tumor necrosis factor (TNF-alpha), and related to food intake--ghrelin (GHRL)--appear to be associated with obesity phenotypes. Obesity risk depends on two factors: a) genetic variants in candidate genes, and b) biographical exposure to environmental risk factors. It is necessary to perform new studies, with appropriate control groups and designs, in order to reach relevant conclusions with regard to gene/environmental (diet, lifestyle) interactions.

  8. Degrees of separation as a statistical tool for evaluating candidate genes.

    Science.gov (United States)

    Nelson, Ronald M; Pettersson, Mats E

    2014-12-01

    Selection of candidate genes is an important step in the exploration of complex genetic architecture. The number of gene networks available is increasing and these can provide information to help with candidate gene selection. It is currently common to use the degree of connectedness in gene networks as validation in Genome Wide Association (GWA) and Quantitative Trait Locus (QTL) mapping studies. However, it can cause misleading results if not validated properly. Here we present a method and tool for validating the gene pairs from GWA studies given the context of the network they co-occur in. It ensures that proposed interactions and gene associations are not statistical artefacts inherent to the specific gene network architecture. The CandidateBacon package provides an easy and efficient method to calculate the average degree of separation (DoS) between pairs of genes to currently available gene networks. We show how these empirical estimates of average connectedness are used to validate candidate gene pairs. Validation of interacting genes by comparing their connectedness with the average connectedness in the gene network will provide support for said interactions by utilising the growing amount of gene network information available. Copyright © 2014 Elsevier Ltd. All rights reserved.

  9. Candidate genes detected in transcriptome studies are strongly dependent on genetic background.

    Directory of Open Access Journals (Sweden)

    Pernille Sarup

    2011-01-01

    Full Text Available Whole genome transcriptomic studies can point to potential candidate genes for organismal traits. However, the importance of potential candidates is rarely followed up through functional studies and/or by comparing results across independent studies. We have analysed the overlap of candidate genes identified from studies of gene expression in Drosophila melanogaster using similar technical platforms. We found little overlap across studies between putative candidate genes for the same traits in the same sex. Instead there was a high degree of overlap between different traits and sexes within the same genetic backgrounds. Putative candidates found using transcriptomics therefore appear very sensitive to genetic background and this can mask or override effects of treatments. The functional importance of putative candidate genes emerging from transcriptome studies needs to be validated through additional experiments and in future studies we suggest a focus on the genes, networks and pathways affecting traits in a consistent manner across backgrounds.

  10. Novel gene sets improve set-level classification of prokaryotic gene expression data.

    Science.gov (United States)

    Holec, Matěj; Kuželka, Ondřej; Železný, Filip

    2015-10-28

    Set-level classification of gene expression data has received significant attention recently. In this setting, high-dimensional vectors of features corresponding to genes are converted into lower-dimensional vectors of features corresponding to biologically interpretable gene sets. The dimensionality reduction brings the promise of a decreased risk of overfitting, potentially resulting in improved accuracy of the learned classifiers. However, recent empirical research has not confirmed this expectation. Here we hypothesize that the reported unfavorable classification results in the set-level framework were due to the adoption of unsuitable gene sets defined typically on the basis of the Gene ontology and the KEGG database of metabolic networks. We explore an alternative approach to defining gene sets, based on regulatory interactions, which we expect to collect genes with more correlated expression. We hypothesize that such more correlated gene sets will enable to learn more accurate classifiers. We define two families of gene sets using information on regulatory interactions, and evaluate them on phenotype-classification tasks using public prokaryotic gene expression data sets. From each of the two gene-set families, we first select the best-performing subtype. The two selected subtypes are then evaluated on independent (testing) data sets against state-of-the-art gene sets and against the conventional gene-level approach. The novel gene sets are indeed more correlated than the conventional ones, and lead to significantly more accurate classifiers. The novel gene sets are indeed more correlated than the conventional ones, and lead to significantly more accurate classifiers. Novel gene sets defined on the basis of regulatory interactions improve set-level classification of gene expression data. The experimental scripts and other material needed to reproduce the experiments are available at http://ida.felk.cvut.cz/novelgenesets.tar.gz.

  11. Functional validation of candidate genes detected by genomic feature models

    DEFF Research Database (Denmark)

    Rohde, Palle Duun; Østergaard, Solveig; Kristensen, Torsten Nygaard

    2018-01-01

    Understanding the genetic underpinnings of complex traits requires knowledge of the genetic variants that contribute to phenotypic variability. Reliable statistical approaches are needed to obtain such knowledge. In genome-wide association studies, variants are tested for association with trait...... then functionally assessed whether the identified candidate genes affected locomotor activity by reducing gene expression using RNA interference. In five of the seven candidate genes tested, reduced gene expression altered the phenotype. The ranking of genes within the predictive GO term was highly correlated...

  12. Genome-wide association links candidate genes to resistance to Plum Pox Virus in apricot (Prunus armeniaca).

    Science.gov (United States)

    Mariette, Stéphanie; Wong Jun Tai, Fabienne; Roch, Guillaume; Barre, Aurélien; Chague, Aurélie; Decroocq, Stéphane; Groppi, Alexis; Laizet, Yec'han; Lambert, Patrick; Tricon, David; Nikolski, Macha; Audergon, Jean-Marc; Abbott, Albert G; Decroocq, Véronique

    2016-01-01

    In fruit tree species, many important traits have been characterized genetically by using single-family descent mapping in progenies segregating for the traits. However, most mapped loci have not been sufficiently resolved to the individual genes due to insufficient progeny sizes for high resolution mapping and the previous lack of whole-genome sequence resources of the study species. To address this problem for Plum Pox Virus (PPV) candidate resistance gene identification in Prunus species, we implemented a genome-wide association (GWA) approach in apricot. This study exploited the broad genetic diversity of the apricot (Prunus armeniaca) germplasm containing resistance to PPV, next-generation sequence-based genotyping, and the high-quality peach (Prunus persica) genome reference sequence for single nucleotide polymorphism (SNP) identification. The results of this GWA study validated previously reported PPV resistance quantitative trait loci (QTL) intervals, highlighted other potential resistance loci, and resolved each to a limited set of candidate genes for further study. This work substantiates the association genetics approach for resolution of QTL to candidate genes in apricot and suggests that this approach could simplify identification of other candidate genes for other marked trait intervals in this germplasm. © 2015 INRA, UMR 1332 BFP New Phytologist © 2015 New Phytologist Trust.

  13. Prioritization of candidate disease genes by combining topological similarity and semantic similarity.

    Science.gov (United States)

    Liu, Bin; Jin, Min; Zeng, Pan

    2015-10-01

    The identification of gene-phenotype relationships is very important for the treatment of human diseases. Studies have shown that genes causing the same or similar phenotypes tend to interact with each other in a protein-protein interaction (PPI) network. Thus, many identification methods based on the PPI network model have achieved good results. However, in the PPI network, some interactions between the proteins encoded by candidate gene and the proteins encoded by known disease genes are very weak. Therefore, some studies have combined the PPI network with other genomic information and reported good predictive performances. However, we believe that the results could be further improved. In this paper, we propose a new method that uses the semantic similarity between the candidate gene and known disease genes to set the initial probability vector of a random walk with a restart algorithm in a human PPI network. The effectiveness of our method was demonstrated by leave-one-out cross-validation, and the experimental results indicated that our method outperformed other methods. Additionally, our method can predict new causative genes of multifactor diseases, including Parkinson's disease, breast cancer and obesity. The top predictions were good and consistent with the findings in the literature, which further illustrates the effectiveness of our method. Copyright © 2015 Elsevier Inc. All rights reserved.

  14. Candidate genes and molecular markers associated with heat tolerance in colonial Bentgrass.

    Science.gov (United States)

    Jespersen, David; Belanger, Faith C; Huang, Bingru

    2017-01-01

    Elevated temperature is a major abiotic stress limiting the growth of cool-season grasses during the summer months. The objectives of this study were to determine the genetic variation in the expression patterns of selected genes involved in several major metabolic pathways regulating heat tolerance for two genotypes contrasting in heat tolerance to confirm their status as potential candidate genes, and to identify PCR-based markers associated with candidate genes related to heat tolerance in a colonial (Agrostis capillaris L.) x creeping bentgrass (Agrostis stolonifera L.) hybrid backcross population. Plants were subjected to heat stress in controlled-environmental growth chambers for phenotypic evaluation and determination of genetic variation in candidate gene expression. Molecular markers were developed for genes involved in protein degradation (cysteine protease), antioxidant defense (catalase and glutathione-S-transferase), energy metabolism (glyceraldehyde-3-phosphate dehydrogenase), cell expansion (expansin), and stress protection (heat shock proteins HSP26, HSP70, and HSP101). Kruskal-Wallis analysis, a commonly used non-parametric test used to compare population individuals with or without the gene marker, found the physiological traits of chlorophyll content, electrolyte leakage, normalized difference vegetative index, and turf quality were associated with all candidate gene markers with the exception of HSP101. Differential gene expression was frequently found for the tested candidate genes. The development of candidate gene markers for important heat tolerance genes may allow for the development of new cultivars with increased abiotic stress tolerance using marker-assisted selection.

  15. Candidate genes and molecular markers associated with heat tolerance in colonial Bentgrass.

    Directory of Open Access Journals (Sweden)

    David Jespersen

    Full Text Available Elevated temperature is a major abiotic stress limiting the growth of cool-season grasses during the summer months. The objectives of this study were to determine the genetic variation in the expression patterns of selected genes involved in several major metabolic pathways regulating heat tolerance for two genotypes contrasting in heat tolerance to confirm their status as potential candidate genes, and to identify PCR-based markers associated with candidate genes related to heat tolerance in a colonial (Agrostis capillaris L. x creeping bentgrass (Agrostis stolonifera L. hybrid backcross population. Plants were subjected to heat stress in controlled-environmental growth chambers for phenotypic evaluation and determination of genetic variation in candidate gene expression. Molecular markers were developed for genes involved in protein degradation (cysteine protease, antioxidant defense (catalase and glutathione-S-transferase, energy metabolism (glyceraldehyde-3-phosphate dehydrogenase, cell expansion (expansin, and stress protection (heat shock proteins HSP26, HSP70, and HSP101. Kruskal-Wallis analysis, a commonly used non-parametric test used to compare population individuals with or without the gene marker, found the physiological traits of chlorophyll content, electrolyte leakage, normalized difference vegetative index, and turf quality were associated with all candidate gene markers with the exception of HSP101. Differential gene expression was frequently found for the tested candidate genes. The development of candidate gene markers for important heat tolerance genes may allow for the development of new cultivars with increased abiotic stress tolerance using marker-assisted selection.

  16. Candidate genes for cross-resistance against DNA-damaging drugs

    DEFF Research Database (Denmark)

    Wittig, Rainer; Nessling, Michelle; Will, Rainer D

    2002-01-01

    Drug resistance of tumor cells leads to major drawbacks in the treatment of cancer. To identify candidate genes for drug resistance, we compared the expression patterns of the drug-sensitive human malignant melanoma cell line MeWo and three derived sublines with acquired resistance to the DNA...... as several apoptosis-related genes, in particular STK17A and CRYAB. As MPP1 and CRYAB are also among the 14 genes differentially expressed in all three of the drug-resistant sublines, they represent the strongest candidates for resistance against DNA-damaging drugs....

  17. Candidate genes for performance in horses, including monocarboxylate transporters

    Directory of Open Access Journals (Sweden)

    Inaê Cristina Regatieri

    Full Text Available ABSTRACT: Some horse breeds are highly selected for athletic activities. The athletic potential of each animal can be measured by its performance in sports. High athletic performance depends on the animal capacity to produce energy through aerobic and anaerobic metabolic pathways, among other factors. Transmembrane proteins called monocarboxylate transporters, mainly the isoform 1 (MCT1 and its ancillary protein CD147, can help the organism to adapt to physiological stress caused by physical exercise, transporting lactate and H+ ions. Horse breeds are selected for different purposes so we might expect differences in the amount of those proteins and in the genotypic frequencies for genes that play a significant role in the performance of the animals. The study of MCT1 and CD147 gene polymorphisms, which can affect the formation of the proteins and transport of lactate and H+, can provide enough information to be used for selection of athletic horses increasingly resistant to intense exercise. Two other candidate genes, the PDK4 and DMRT3, have been associated with athletic potential and indicated as possible markers for performance in horses. The oxidation of fatty acids is highly effective in generating ATP and is controlled by the expression of PDK4 (pyruvate dehydrogenase kinase, isozyme 4 in skeletal muscle during and after exercise. The doublesex and mab-3 related transcription factor 3 (DMRT3 gene encodes an important transcription factor in the setting of spinal cord circuits controlling movement in vertebrates and may be associated with gait performance in horses. This review describes how the monocarboxylate transporters work during physical exercise in athletic horses and the influence of polymorphisms in candidate genes for athletic performance in horses.

  18. Candidate genes for COPD: current evidence and research

    Directory of Open Access Journals (Sweden)

    Kim WJ

    2015-10-01

    Full Text Available Woo Jin Kim,1 Sang Do Lee2 1Department of Internal Medicine and Environmental Health Center, Kangwon National University, Chuncheon, 2Department of Pulmonary and Critical Care Medicine, Clinical Research Center for Chronic Obstructive Airway Diseases, Asan Medical Center, University of Ulsan College of Medicine, Seoul, South Korea Abstract: COPD is a common complex disease characterized by progressive airflow limitation. Several genome-wide association studies (GWASs have discovered genes that are associated with COPD. Recently, candidate genes for COPD identified by GWASs include CHRNA3/5 (cholinergic nicotine receptor alpha 3/5, IREB2 (iron regulatory binding protein 2, HHIP (hedgehog-interacting protein, FAM13A (family with sequence similarity 13, member A, and AGER (advanced glycosylation end product–specific receptor. Their association with COPD susceptibility has been replicated in multiple populations. Since these candidate genes have not been considered in COPD, their pathological roles are still largely unknown. Herein, we review some evidences that they can be effective drug targets or serve as biomarkers for diagnosis or subtyping. However, more study is required to understand the functional roles of these candidate genes. Future research is needed to characterize the effect of genetic variants, validate gene function in humans and model systems, and elucidate the genes’ transcriptional and posttranscriptional regulatory mechanisms. Keywords: chronic obstructive pulmonary disease, genetics, genome-wide association study

  19. Characterization of Gene Candidates for Vacuolar Sodium Transport from Hordeum Vulgare

    KAUST Repository

    Scheu, Arne Hagen August

    2017-05-01

    Soil salinity is a major abiotic stress for land plants, and multiple mechanisms of salt tolerance have evolved. Tissue tolerance is one of these mechanisms, which involves the sequestration of sodium into the vacuole to retain low cytosolic sodium concentrations. This enables the plant to maintain cellular functions, and ultimately maintain growth and yield. However, the molecular components involved in tissue tolerance remain elusive. Several candidate genes for vacuolar sodium sequestration have recently been identified by proteome analysis of vacuolar membranes purified from the salt-tolerant cereal Hordeum vulgare (barley). In this study, I aimed to characterize these candidates in more detail. I successfully cloned coding sequences for the majority of candidate genes with primers designed based on the barley reference genome sequence. During the course of this study a newer genome sequence with improved annotations was published, to which I also compared my observations. To study the candidate genes, I used the heterologous expression system Saccharomyces cerevisiae (yeast). I used several salt sensitive yeast strains (deficient in intrinsic sodium transporters) to test whether the candidate genes would affect their salt tolerance by mediating the sequestration of sodium into the yeast vacuole. I observed a reduction in growth upon expression for several of the gene candidate under salt-stress conditions. However, confocal microscopy suggests that most gene products are subject to degradation, and did not localize to the vacuolar membrane (tonoplast). Therefore, growth effects cannot be linked to protein function without further evidence. Various potential causes are discussed, including inaccuracies in the genome resource used as reference for primer design and issues inherent to the model system. Finally, I make suggestions on how to proceed to further characterize the candidate genes and hopefully identify novel sodium transporters from barley.

  20. Inferring Gene Regulatory Networks Using Conditional Regulation Pattern to Guide Candidate Genes.

    Directory of Open Access Journals (Sweden)

    Fei Xiao

    Full Text Available Combining path consistency (PC algorithms with conditional mutual information (CMI are widely used in reconstruction of gene regulatory networks. CMI has many advantages over Pearson correlation coefficient in measuring non-linear dependence to infer gene regulatory networks. It can also discriminate the direct regulations from indirect ones. However, it is still a challenge to select the conditional genes in an optimal way, which affects the performance and computation complexity of the PC algorithm. In this study, we develop a novel conditional mutual information-based algorithm, namely RPNI (Regulation Pattern based Network Inference, to infer gene regulatory networks. For conditional gene selection, we define the co-regulation pattern, indirect-regulation pattern and mixture-regulation pattern as three candidate patterns to guide the selection of candidate genes. To demonstrate the potential of our algorithm, we apply it to gene expression data from DREAM challenge. Experimental results show that RPNI outperforms existing conditional mutual information-based methods in both accuracy and time complexity for different sizes of gene samples. Furthermore, the robustness of our algorithm is demonstrated by noisy interference analysis using different types of noise.

  1. Generating Genome-Scale Candidate Gene Lists for Pharmacogenomics

    DEFF Research Database (Denmark)

    Hansen, Niclas Tue; Brunak, Søren; Altman, R. B.

    2009-01-01

    A critical task in pharmacogenomics is identifying genes that may be important modulators of drug response. High-throughput experimental methods are often plagued by false positives and do not take advantage of existing knowledge. Candidate gene lists can usefully summarize existing knowledge...

  2. Characterisation of five candidate genes within the ETEC F4ab/ac candidate region in pigs

    DEFF Research Database (Denmark)

    Jacobsen, Mette Juul; Cirera Salicio, Susanna; Joller, David

    2011-01-01

    by haplotype sharing to a 2.5 Mb region on pig chromosome 13, a region containing 18 annotated genes. FINDINGS: The coding regions of five candidate genes for susceptibility to ETEC F4ab/ac infection (TFRC, ACK1, MUC20, MUC4 and KIAA0226), all located in the 2.5 Mb region, were investigated for the presence...... polymorphism in exon 22 of KIAA0226. Transcriptional profiles of the five genes were investigated in a porcine tissue panel including various intestinal tissues. All five genes were expressed in intestinal tissues at different levels but none of the genes were found differentially expressed between ETEC F4ab/ac...... of the amino acids composition. However, we cannot exclude that the five tested genes are bona fide candidate genes for susceptibility to ETEC F4ab/ac infection since the identified polymorphism might affect the translational apparatus, alternative splice forms may exist and post translational mechanisms might...

  3. Web tools for the prioritization of candidate disease genes.

    NARCIS (Netherlands)

    Oti, M.O.; Ballouz, S.; Wouters, M.A.

    2011-01-01

    Despite increasing sequencing capacity, genetic disease investigation still frequently results in the identification of loci containing multiple candidate disease genes that need to be tested for involvement in the disease. This process can be expedited by prioritizing the candidates prior to

  4. Network Based Integrated Analysis of Phenotype-Genotype Data for Prioritization of Candidate Symptom Genes

    Directory of Open Access Journals (Sweden)

    Xing Li

    2014-01-01

    Full Text Available Background. Symptoms and signs (symptoms in brief are the essential clinical manifestations for individualized diagnosis and treatment in traditional Chinese medicine (TCM. To gain insights into the molecular mechanism of symptoms, we develop a computational approach to identify the candidate genes of symptoms. Methods. This paper presents a network-based approach for the integrated analysis of multiple phenotype-genotype data sources and the prediction of the prioritizing genes for the associated symptoms. The method first calculates the similarities between symptoms and diseases based on the symptom-disease relationships retrieved from the PubMed bibliographic database. Then the disease-gene associations and protein-protein interactions are utilized to construct a phenotype-genotype network. The PRINCE algorithm is finally used to rank the potential genes for the associated symptoms. Results. The proposed method gets reliable gene rank list with AUC (area under curve 0.616 in classification. Some novel genes like CALCA, ESR1, and MTHFR were predicted to be associated with headache symptoms, which are not recorded in the benchmark data set, but have been reported in recent published literatures. Conclusions. Our study demonstrated that by integrating phenotype-genotype relationships into a complex network framework it provides an effective approach to identify candidate genes of symptoms.

  5. Functional validation of candidate genes detected by genomic feature models

    DEFF Research Database (Denmark)

    Rohde, Palle Duun; Østergaard, Solveig; Kristensen, Torsten Nygaard

    2018-01-01

    to investigate locomotor activity, and applied genomic feature prediction models to identify gene ontology (GO) cate- gories predictive of this phenotype. Next, we applied the covariance association test to partition the genomic variance of the predictive GO terms to the genes within these terms. We...... then functionally assessed whether the identified candidate genes affected locomotor activity by reducing gene expression using RNA interference. In five of the seven candidate genes tested, reduced gene expression altered the phenotype. The ranking of genes within the predictive GO term was highly correlated......Understanding the genetic underpinnings of complex traits requires knowledge of the genetic variants that contribute to phenotypic variability. Reliable statistical approaches are needed to obtain such knowledge. In genome-wide association studies, variants are tested for association with trait...

  6. Candidate gene identification of ovulation-inducing genes by RNA sequencing with an in vivo assay in zebrafish.

    Directory of Open Access Journals (Sweden)

    Wanlada Klangnurak

    Full Text Available We previously reported the microarray-based selection of three ovulation-related genes in zebrafish. We used a different selection method in this study, RNA sequencing analysis. An additional eight up-regulated candidates were found as specifically up-regulated genes in ovulation-induced samples. Changes in gene expression were confirmed by qPCR analysis. Furthermore, up-regulation prior to ovulation during natural spawning was verified in samples from natural pairing. Gene knock-out zebrafish strains of one of the candidates, the starmaker gene (stm, were established by CRISPR genome editing techniques. Unexpectedly, homozygous mutants were fertile and could spawn eggs. However, a high percentage of unfertilized eggs and abnormal embryos were produced from these homozygous females. The results suggest that the stm gene is necessary for fertilization. In this study, we selected additional ovulation-inducing candidate genes, and a novel function of the stm gene was investigated.

  7. Evaluation of candidate reference genes for gene expression normalization in Brassica juncea using real time quantitative RT-PCR.

    Directory of Open Access Journals (Sweden)

    Ruby Chandna

    Full Text Available The real time quantitative reverse transcription PCR (qRT-PCR is becoming increasingly important to gain insight into function of genes. Given the increased sensitivity, ease and reproducibility of qRT-PCR, the requirement of suitable reference genes for normalization has become important and stringent. It is now known that the expression of internal control genes in living organism vary considerably during developmental stages and under different experimental conditions. For economically important Brassica crops, only a couple of reference genes are reported till date. In this study, expression stability of 12 candidate reference genes including ACT2, ELFA, GAPDH, TUA, UBQ9 (traditional housekeeping genes, ACP, CAC, SNF, TIPS-41, TMD, TSB and ZNF (new candidate reference genes, in a diverse set of 49 tissue samples representing different developmental stages, stress and hormone treated conditions and cultivars of Brassica juncea has been validated. For the normalization of vegetative stages the ELFA, ACT2, CAC and TIPS-41 combination would be appropriate whereas TIPS-41 along with CAC would be suitable for normalization of reproductive stages. A combination of GAPDH, TUA, TIPS-41 and CAC were identified as the most suitable reference genes for total developmental stages. In various stress and hormone treated samples, UBQ9 and TIPS-41 had the most stable expression. Across five cultivars of B. juncea, the expression of CAC and TIPS-41 did not vary significantly and were identified as the most stably expressed reference genes. This study provides comprehensive information that the new reference genes selected herein performed better than the traditional housekeeping genes. The selection of most suitable reference genes depends on the experimental conditions, and is tissue and cultivar-specific. Further, to attain accuracy in the results more than one reference genes are necessary for normalization.

  8. LOD score exclusion analyses for candidate genes using random population samples.

    Science.gov (United States)

    Deng, H W; Li, J; Recker, R R

    2001-05-01

    While extensive analyses have been conducted to test for, no formal analyses have been conducted to test against, the importance of candidate genes with random population samples. We develop a LOD score approach for exclusion analyses of candidate genes with random population samples. Under this approach, specific genetic effects and inheritance models at candidate genes can be analysed and if a LOD score is < or = - 2.0, the locus can be excluded from having an effect larger than that specified. Computer simulations show that, with sample sizes often employed in association studies, this approach has high power to exclude a gene from having moderate genetic effects. In contrast to regular association analyses, population admixture will not affect the robustness of our analyses; in fact, it renders our analyses more conservative and thus any significant exclusion result is robust. Our exclusion analysis complements association analysis for candidate genes in random population samples and is parallel to the exclusion mapping analyses that may be conducted in linkage analyses with pedigrees or relative pairs. The usefulness of the approach is demonstrated by an application to test the importance of vitamin D receptor and estrogen receptor genes underlying the differential risk to osteoporotic fractures.

  9. The cld mutation: narrowing the critical chromosomal region and selecting candidate genes.

    Science.gov (United States)

    Péterfy, Miklós; Mao, Hui Z; Doolittle, Mark H

    2006-10-01

    Combined lipase deficiency (cld) is a recessive, lethal mutation specific to the tw73 haplotype on mouse Chromosome 17. While the cld mutation results in lipase proteins that are inactive, aggregated, and retained in the endoplasmic reticulum (ER), it maps separately from the lipase structural genes. We have narrowed the gene critical region by about 50% using the tw18 haplotype for deletion mapping and a recombinant chromosome used originally to map cld with respect to the phenotypic marker tf. The region now extends from 22 to 25.6 Mbp on the wild-type chromosome, currently containing 149 genes and 50 expressed sequence tags (ESTs). To identify the affected gene, we have selected candidates based on their known role in associated biological processes, cellular components, and molecular functions that best fit with the predicted function of the cld gene. A secondary approach was based on differences in mRNA levels between mutant (cld/cld) and unaffected (+/cld) cells. Using both approaches, we have identified seven functional candidates with an ER localization and/or an involvement in protein maturation and folding that could explain the lipase deficiency, and six expression candidates that exhibit large differences in mRNA levels between mutant and unaffected cells. Significantly, two genes were found to be candidates with regard to both function and expression, thus emerging as the strongest candidates for cld. We discuss the implications of our mapping results and our selection of candidates with respect to other genes, deletions, and mutations occurring in the cld critical region.

  10. Identification of Inherited Retinal Disease-Associated Genetic Variants in 11 Candidate Genes.

    Science.gov (United States)

    Astuti, Galuh D N; van den Born, L Ingeborgh; Khan, M Imran; Hamel, Christian P; Bocquet, Béatrice; Manes, Gaël; Quinodoz, Mathieu; Ali, Manir; Toomes, Carmel; McKibbin, Martin; El-Asrag, Mohammed E; Haer-Wigman, Lonneke; Inglehearn, Chris F; Black, Graeme C M; Hoyng, Carel B; Cremers, Frans P M; Roosing, Susanne

    2018-01-10

    Inherited retinal diseases (IRDs) display an enormous genetic heterogeneity. Whole exome sequencing (WES) recently identified genes that were mutated in a small proportion of IRD cases. Consequently, finding a second case or family carrying pathogenic variants in the same candidate gene often is challenging. In this study, we searched for novel candidate IRD gene-associated variants in isolated IRD families, assessed their causality, and searched for novel genotype-phenotype correlations. Whole exome sequencing was performed in 11 probands affected with IRDs. Homozygosity mapping data was available for five cases. Variants with minor allele frequencies ≤ 0.5% in public databases were selected as candidate disease-causing variants. These variants were ranked based on their: (a) presence in a gene that was previously implicated in IRD; (b) minor allele frequency in the Exome Aggregation Consortium database (ExAC); (c) in silico pathogenicity assessment using the combined annotation dependent depletion (CADD) score; and (d) interaction of the corresponding protein with known IRD-associated proteins. Twelve unique variants were found in 11 different genes in 11 IRD probands. Novel autosomal recessive and dominant inheritance patterns were found for variants in Small Nuclear Ribonucleoprotein U5 Subunit 200 ( SNRNP200 ) and Zinc Finger Protein 513 ( ZNF513 ), respectively. Using our pathogenicity assessment, a variant in DEAH-Box Helicase 32 ( DHX32 ) was the top ranked novel candidate gene to be associated with IRDs, followed by eight medium and lower ranked candidate genes. The identification of candidate disease-associated sequence variants in 11 single families underscores the notion that the previously identified IRD-associated genes collectively carry > 90% of the defects implicated in IRDs. To identify multiple patients or families with variants in the same gene and thereby provide extra proof for pathogenicity, worldwide data sharing is needed.

  11. Mining biological databases for candidate disease genes

    Science.gov (United States)

    Braun, Terry A.; Scheetz, Todd; Webster, Gregg L.; Casavant, Thomas L.

    2001-07-01

    The publicly-funded effort to sequence the complete nucleotide sequence of the human genome, the Human Genome Project (HGP), has currently produced more than 93% of the 3 billion nucleotides of the human genome into a preliminary `draft' format. In addition, several valuable sources of information have been developed as direct and indirect results of the HGP. These include the sequencing of model organisms (rat, mouse, fly, and others), gene discovery projects (ESTs and full-length), and new technologies such as expression analysis and resources (micro-arrays or gene chips). These resources are invaluable for the researchers identifying the functional genes of the genome that transcribe and translate into the transcriptome and proteome, both of which potentially contain orders of magnitude more complexity than the genome itself. Preliminary analyses of this data identified approximately 30,000 - 40,000 human `genes.' However, the bulk of the effort still remains -- to identify the functional and structural elements contained within the transcriptome and proteome, and to associate function in the transcriptome and proteome to genes. A fortuitous consequence of the HGP is the existence of hundreds of databases containing biological information that may contain relevant data pertaining to the identification of disease-causing genes. The task of mining these databases for information on candidate genes is a commercial application of enormous potential. We are developing a system to acquire and mine data from specific databases to aid our efforts to identify disease genes. A high speed cluster of Linux of workstations is used to analyze sequence and perform distributed sequence alignments as part of our data mining and processing. This system has been used to mine GeneMap99 sequences within specific genomic intervals to identify potential candidate disease genes associated with Bardet-Biedle Syndrome (BBS).

  12. Multiplex reverse transcription-polymerase chain reaction combined with on-chip electrophoresis as a rapid screening tool for candidate gene sets

    DEFF Research Database (Denmark)

    Wittig, Rainer; Salowsky, Rüdiger; Blaich, Stephanie

    2005-01-01

    Combining multiplex reverse transcription-polymerase chain reaction (mRT-PCR) with microfluidic amplicon analysis, we developed an assay for the rapid and reliable semiquantitative expression screening of 11 candidate genes for drug resistance in human malignant melanoma. The functionality of thi...

  13. Defining the Human Macula Transcriptome and Candidate Retinal Disease Genes UsingEyeSAGE

    Science.gov (United States)

    Rickman, Catherine Bowes; Ebright, Jessica N.; Zavodni, Zachary J.; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P.; Wistow, Graeme; Boon, Kathy; Hauser, Michael A.

    2009-01-01

    Purpose To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Methods Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Results Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. Conclusions The EyeSAGE database, combining three different gene-profiling platforms including the authors’ multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions. PMID:16723438

  14. Genome-wide association study identifies candidate genes for starch content regulation in maize kernels

    Directory of Open Access Journals (Sweden)

    Na Liu

    2016-07-01

    Full Text Available Kernel starch content is an important trait in maize (Zea mays L. as it accounts for 65% to 75% of the dry kernel weight and positively correlates with seed yield. A number of starch synthesis-related genes have been identified in maize in recent years. However, many loci underlying variation in starch content among maize inbred lines still remain to be identified. The current study is a genome-wide association study that used a set of 263 maize inbred lines. In this panel, the average kernel starch content was 66.99%, ranging from 60.60% to 71.58% over the three study years. These inbred lines were genotyped with the SNP50 BeadChip maize array, which is comprised of 56,110 evenly spaced, random SNPs. Population structure was controlled by a mixed linear model (MLM as implemented in the software package TASSEL. After the statistical analyses, four SNPs were identified as significantly associated with starch content (P ≤ 0.0001, among which one each are located on chromosomes 1 and 5 and two are on chromosome 2. Furthermore, 77 candidate genes associated with starch synthesis were found within the 100-kb intervals containing these four QTLs, and four highly associated genes were within 20-kb intervals of the associated SNPs. Among the four genes, Glucose-1-phosphate adenylyltransferase (APS1; Gene ID GRMZM2G163437 is known as an important regulator of kernel starch content. The identified SNPs, QTLs, and candidate genes may not only be readily used for germplasm improvement by marker-assisted selection in breeding, but can also elucidate the genetic basis of starch content. Further studies on these identified candidate genes may help determine the molecular mechanisms regulating kernel starch content in maize and other important cereal crops.

  15. High-throughput analysis of candidate imprinted genes and allele-specific gene expression in the human term placenta

    Directory of Open Access Journals (Sweden)

    Clark Taane G

    2010-04-01

    Full Text Available Abstract Background Imprinted genes show expression from one parental allele only and are important for development and behaviour. This extreme mode of allelic imbalance has been described for approximately 56 human genes. Imprinting status is often disrupted in cancer and dysmorphic syndromes. More subtle variation of gene expression, that is not parent-of-origin specific, termed 'allele-specific gene expression' (ASE is more common and may give rise to milder phenotypic differences. Using two allele-specific high-throughput technologies alongside bioinformatics predictions, normal term human placenta was screened to find new imprinted genes and to ascertain the extent of ASE in this tissue. Results Twenty-three family trios of placental cDNA, placental genomic DNA (gDNA and gDNA from both parents were tested for 130 candidate genes with the Sequenom MassArray system. Six genes were found differentially expressed but none imprinted. The Illumina ASE BeadArray platform was then used to test 1536 SNPs in 932 genes. The array was enriched for the human orthologues of 124 mouse candidate genes from bioinformatics predictions and 10 human candidate imprinted genes from EST database mining. After quality control pruning, a total of 261 informative SNPs (214 genes remained for analysis. Imprinting with maternal expression was demonstrated for the lymphocyte imprinted gene ZNF331 in human placenta. Two potential differentially methylated regions (DMRs were found in the vicinity of ZNF331. None of the bioinformatically predicted candidates tested showed imprinting except for a skewed allelic expression in a parent-specific manner observed for PHACTR2, a neighbour of the imprinted PLAGL1 gene. ASE was detected for two or more individuals in 39 candidate genes (18%. Conclusions Both Sequenom and Illumina assays were sensitive enough to study imprinting and strong allelic bias. Previous bioinformatics approaches were not predictive of new imprinted genes

  16. CANDIDATE GENE ANALYSIS IN ISRAELI SOLDIERS WITH STRESS FRACTURES

    Directory of Open Access Journals (Sweden)

    Ran Yanovich

    2012-03-01

    Full Text Available To investigate the association of polymorphisms within candidate genes which we hypothesized may contribute to stress fracture predisposition, a case-control, cross- sectional study design was employed. Genotyping 268 Single Nucleotide Polymorphisms- SNPs within 17 genes in 385 Israeli young male and female recruits (182 with and 203 without stress fractures. Twenty-five polymorphisms within 9 genes (NR3C1, ANKH, VDR, ROR2, CALCR, IL6, COL1A2, CBG, and LRP4 showed statistically significant differences (p < 0.05 in the distribution between stress fracture cases and non stress fracture controls. Seventeen genetic variants were associated with an increased stress fracture risk, and eight variants with a decreased stress fracture risk. None of the SNP associations remained significant after correcting for multiple comparisons (false discovery rate- FDR. Our findings suggest that genes may be involved in stress fracture pathogenesis. Specifically, the CALCR and the VDR genes are intriguing candidates. The putative involvement of these genes in stress fracture predisposition requires analysis of more cases and controls and sequencing the relevant genomic regions, in order to define the specific gene mutations

  17. Whole genome homology-based identification of candidate genes ...

    African Journals Online (AJOL)

    Josephine Erhiakporeh

    2016-07-06

    Jul 6, 2016 ... candidate genes for drought tolerance in sesame. (Sesamum ... Our results provided genomic resources for further functional analysis and genetic engineering .... reverse transcribed using the Reverse Transcription System.

  18. Test for positional candidate genes for body composition on pig chromosome 6

    Directory of Open Access Journals (Sweden)

    Pérez-Enciso Miguel

    2002-07-01

    Full Text Available Abstract One QTL affecting backfat thickness (BF, intramuscular fat content (IMF and eye muscle area (MA was previously localized on porcine chromosome 6 in an F2 cross between Iberian and Landrace pigs. This work was done to study the effect of two positional candidate genes on these traits: H-FABP and LEPR genes. The QTL mapping analysis was repeated with a regression method using genotypes for seven microsatellites and two PCR-RFLPs in the H-FABP and LEPR genes. H-FABP and LEPR genes were located at 85.4 and 107 cM respectively, by linkage analysis. The effects of the candidate gene polymorphisms were analyzed in two ways. When an animal model was fitted, both genes showed significant effects on fatness traits, the H-FABP polymorphism showed significant effects on IMF and MA, and the LEPR polymorphism on BF and IMF. But when the candidate gene effect was included in a QTL regression analysis these associations were not observed, suggesting that they must not be the causal mutations responsible for the effects found. Differences in the results of both analyses showed the inadequacy of the animal model approach for the evaluation of positional candidate genes in populations with linkage disequilibrium, when the probabilities of the parental origin of the QTL alleles are not included in the model.

  19. Association of candidate genes with drought tolerance traits in diverse perennial ryegrass accessions

    Science.gov (United States)

    Xiaoqing Yu; Guihua Bai; Shuwei Liu; Na Luo; Ying Wang; Douglas S. Richmond; Paula M. Pijut; Scott A. Jackson; Jianming Yu; Yiwei. Jiang

    2013-01-01

    Drought is a major environmental stress limiting growth of perennial grasses in temperate regions. Plant drought tolerance is a complex trait that is controlled by multiple genes. Candidate gene association mapping provides a powerful tool for dissection of complex traits. Candidate gene association mapping of drought tolerance traits was conducted in 192 diverse...

  20. Transcriptome and proteome data reveal candidate genes for pollinator attraction in sexually deceptive orchids.

    Science.gov (United States)

    Sedeek, Khalid E M; Qi, Weihong; Schauer, Monica A; Gupta, Alok K; Poveda, Lucy; Xu, Shuqing; Liu, Zhong-Jian; Grossniklaus, Ueli; Schiestl, Florian P; Schlüter, Philipp M

    2013-01-01

    Sexually deceptive orchids of the genus Ophrys mimic the mating signals of their pollinator females to attract males as pollinators. This mode of pollination is highly specific and leads to strong reproductive isolation between species. This study aims to identify candidate genes responsible for pollinator attraction and reproductive isolation between three closely related species, O. exaltata, O. sphegodes and O. garganica. Floral traits such as odour, colour and morphology are necessary for successful pollinator attraction. In particular, different odour hydrocarbon profiles have been linked to differences in specific pollinator attraction among these species. Therefore, the identification of genes involved in these traits is important for understanding the molecular basis of pollinator attraction by sexually deceptive orchids. We have created floral reference transcriptomes and proteomes for these three Ophrys species using a combination of next-generation sequencing (454 and Solexa), Sanger sequencing, and shotgun proteomics (tandem mass spectrometry). In total, 121 917 unique transcripts and 3531 proteins were identified. This represents the first orchid proteome and transcriptome from the orchid subfamily Orchidoideae. Proteome data revealed proteins corresponding to 2644 transcripts and 887 proteins not observed in the transcriptome. Candidate genes for hydrocarbon and anthocyanin biosynthesis were represented by 156 and 61 unique transcripts in 20 and 7 genes classes, respectively. Moreover, transcription factors putatively involved in the regulation of flower odour, colour and morphology were annotated, including Myb, MADS and TCP factors. Our comprehensive data set generated by combining transcriptome and proteome technologies allowed identification of candidate genes for pollinator attraction and reproductive isolation among sexually deceptive orchids. This includes genes for hydrocarbon and anthocyanin biosynthesis and regulation, and the development of

  1. BEEF CATTLE MUSCULARITY CANDIDATE GENES

    Directory of Open Access Journals (Sweden)

    Irida Novianti

    2010-04-01

    Full Text Available Muscularity is a potential indicator for the selection of more productive cattle. Mapping quantitative trait loci (QTL for traits related to muscularity is useful to identify the genomic regions where the genes affecting muscularity reside. QTL analysis from a Limousin-Jersey double backcross herd was conducted using QTL Express software with cohort and breed as the fixed effects. Nine QTL suggested to have an association with muscularity were identified on cattle chromosomes BTA 1, 2, 3, 4, 5, 8, 12, 14 and 17. The myostatin gene is located at the centromeric end of chromosome 2 and not surprisingly, the Limousin myostatin F94L variant accounted for the QTL on BTA2. However, when the myostatin F94L genotype was included as an additional fixed effect, the QTL on BTA17 was also no longer significant. This result suggests that there may be gene(s that have epistatic effects with myostatin located on cattle chromosome 17. Based on the position of the QTL in base pairs, all the genes that reside in the region were determined using the Ensembl data base (www.ensembl.org. There were two potential candidate genes residing within these QTL regions were selected. They were Smad nuclear interacting protein 1 (SNIP1 and similar to follistatin-like 5 (FSTL5. (JIIPB 2010 Vol 20 No 1: 1-10

  2. Functional validation of GWAS gene candidates for abnormal liver function during zebrafish liver development

    Directory of Open Access Journals (Sweden)

    Leah Y. Liu

    2013-09-01

    Genome-wide association studies (GWAS have revealed numerous associations between many phenotypes and gene candidates. Frequently, however, further elucidation of gene function has not been achieved. A recent GWAS identified 69 candidate genes associated with elevated liver enzyme concentrations, which are clinical markers of liver disease. To investigate the role of these genes in liver homeostasis, we narrowed down this list to 12 genes based on zebrafish orthology, zebrafish liver expression and disease correlation. To assess the function of gene candidates during liver development, we assayed hepatic progenitors at 48 hours post fertilization (hpf and hepatocytes at 72 hpf using in situ hybridization following morpholino knockdown in zebrafish embryos. Knockdown of three genes (pnpla3, pklr and mapk10 decreased expression of hepatic progenitor cells, whereas knockdown of eight genes (pnpla3, cpn1, trib1, fads2, slc2a2, pklr, mapk10 and samm50 decreased cell-specific hepatocyte expression. We then induced liver injury in zebrafish embryos using acetaminophen exposure and observed changes in liver toxicity incidence in morphants. Prioritization of GWAS candidates and morpholino knockdown expedites the study of newly identified genes impacting liver development and represents a feasible method for initial assessment of candidate genes to instruct further mechanistic analyses. Our analysis can be extended to GWAS for additional disease-associated phenotypes.

  3. Candidate genes for COPD in two large data sets.

    Science.gov (United States)

    Bakke, P S; Zhu, G; Gulsvik, A; Kong, X; Agusti, A G N; Calverley, P M A; Donner, C F; Levy, R D; Make, B J; Paré, P D; Rennard, S I; Vestbo, J; Wouters, E F M; Anderson, W; Lomas, D A; Silverman, E K; Pillai, S G

    2011-02-01

    Lack of reproducibility of findings has been a criticism of genetic association studies on complex diseases, such as chronic obstructive pulmonary disease (COPD). We selected 257 polymorphisms of 16 genes with reported or potential relationships to COPD and genotyped these variants in a case-control study that included 953 COPD cases and 956 control subjects. We explored the association of these polymorphisms to three COPD phenotypes: a COPD binary phenotype and two quantitative traits (post-bronchodilator forced expiratory volume in 1 s (FEV₁) % predicted and FEV₁/forced vital capacity (FVC)). The polymorphisms significantly associated to these phenotypes in this first study were tested in a second, family-based study that included 635 pedigrees with 1,910 individuals. Significant associations to the binary COPD phenotype in both populations were seen for STAT1 (rs13010343) and NFKBIB/SIRT2 (rs2241704) (p<0.05). Single-nucleotide polymorphisms rs17467825 and rs1155563 of the GC gene were significantly associated with FEV₁ % predicted and FEV₁/FVC, respectively, in both populations (p<0.05). This study has replicated associations to COPD phenotypes in the STAT1, NFKBIB/SIRT2 and GC genes in two independent populations, the associations of the former two genes representing novel findings.

  4. Candidate luminal B breast cancer genes identified by genome, gene expression and DNA methylation profiling.

    Directory of Open Access Journals (Sweden)

    Stéphanie Cornen

    Full Text Available Breast cancers (BCs of the luminal B subtype are estrogen receptor-positive (ER+, highly proliferative, resistant to standard therapies and have a poor prognosis. To better understand this subtype we compared DNA copy number aberrations (CNAs, DNA promoter methylation, gene expression profiles, and somatic mutations in nine selected genes, in 32 luminal B tumors with those observed in 156 BCs of the other molecular subtypes. Frequent CNAs included 8p11-p12 and 11q13.1-q13.2 amplifications, 7q11.22-q34, 8q21.12-q24.23, 12p12.3-p13.1, 12q13.11-q24.11, 14q21.1-q23.1, 17q11.1-q25.1, 20q11.23-q13.33 gains and 6q14.1-q24.2, 9p21.3-p24,3, 9q21.2, 18p11.31-p11.32 losses. A total of 237 and 101 luminal B-specific candidate oncogenes and tumor suppressor genes (TSGs presented a deregulated expression in relation with their CNAs, including 11 genes previously reported associated with endocrine resistance. Interestingly, 88% of the potential TSGs are located within chromosome arm 6q, and seven candidate oncogenes are potential therapeutic targets. A total of 100 candidate oncogenes were validated in a public series of 5,765 BCs and the overexpression of 67 of these was associated with poor survival in luminal tumors. Twenty-four genes presented a deregulated expression in relation with a high DNA methylation level. FOXO3, PIK3CA and TP53 were the most frequent mutated genes among the nine tested. In a meta-analysis of next-generation sequencing data in 875 BCs, KCNB2 mutations were associated with luminal B cases while candidate TSGs MDN1 (6q15 and UTRN (6q24, were mutated in this subtype. In conclusion, we have reported luminal B candidate genes that may play a role in the development and/or hormone resistance of this aggressive subtype.

  5. Epidermal growth factor gene is a newly identified candidate gene for gout

    OpenAIRE

    Lin Han; Chunwei Cao; Zhaotong Jia; Shiguo Liu; Zhen Liu; Ruosai Xin; Can Wang; Xinde Li; Wei Ren; Xuefeng Wang; Changgui Li

    2016-01-01

    Chromosome 4q25 has been identified as a genomic region associated with gout. However, the associations of gout with the genes in this region have not yet been confirmed. Here, we performed two-stage analysis to determine whether variations in candidate genes in the 4q25 region are associated with gout in a male Chinese Han population. We first evaluated 96 tag single nucleotide polymorphisms (SNPs) in eight inflammatory/immune pathway- or glucose/lipid metabolism-related genes in the 4q25 re...

  6. Defining a new candidate gene for amelogenesis imperfecta: from molecular genetics to biochemistry.

    Science.gov (United States)

    Urzúa, Blanca; Ortega-Pinto, Ana; Morales-Bozo, Irene; Rojas-Alcayaga, Gonzalo; Cifuentes, Víctor

    2011-02-01

    Amelogenesis imperfecta is a group of genetic conditions that affect the structure and clinical appearance of tooth enamel. The types (hypoplastic, hypocalcified, and hypomature) are correlated with defects in different stages of the process of enamel synthesis. Autosomal dominant, recessive, and X-linked types have been previously described. These disorders are considered clinically and genetically heterogeneous in etiology, involving a variety of genes, such as AMELX, ENAM, DLX3, FAM83H, MMP-20, KLK4, and WDR72. The mutations identified within these causal genes explain less than half of all cases of amelogenesis imperfecta. Most of the candidate and causal genes currently identified encode proteins involved in enamel synthesis. We think it is necessary to refocus the search for candidate genes using biochemical processes. This review provides theoretical evidence that the human SLC4A4 gene (sodium bicarbonate cotransporter) may be a new candidate gene.

  7. Genome-wide transcriptome study in wheat identified candidate genes related to processing quality, majority of them showing interaction (quality x development) and having temporal and spatial distributions

    Science.gov (United States)

    2014-01-01

    Background The cultivated bread wheat (Triticum aestivum L.) possesses unique flour quality, which can be processed into many end-use food products such as bread, pasta, chapatti (unleavened flat bread), biscuit, etc. The present wheat varieties require improvement in processing quality to meet the increasing demand of better quality food products. However, processing quality is very complex and controlled by many genes, which have not been completely explored. To identify the candidate genes whose expressions changed due to variation in processing quality and interaction (quality x development), genome-wide transcriptome studies were performed in two sets of diverse Indian wheat varieties differing for chapatti quality. It is also important to understand the temporal and spatial distributions of their expressions for designing tissue and growth specific functional genomics experiments. Results Gene-specific two-way ANOVA analysis of expression of about 55 K transcripts in two diverse sets of Indian wheat varieties for chapatti quality at three seed developmental stages identified 236 differentially expressed probe sets (10-fold). Out of 236, 110 probe sets were identified for chapatti quality. Many processing quality related key genes such as glutenin and gliadins, puroindolines, grain softness protein, alpha and beta amylases, proteases, were identified, and many other candidate genes related to cellular and molecular functions were also identified. The ANOVA analysis revealed that the expression of 56 of 110 probe sets was involved in interaction (quality x development). Majority of the probe sets showed differential expression at early stage of seed development i.e. temporal expression. Meta-analysis revealed that the majority of the genes expressed in one or a few growth stages indicating spatial distribution of their expressions. The differential expressions of a few candidate genes such as pre-alpha/beta-gliadin and gamma gliadin were validated by RT

  8. Genome-wide transcriptome study in wheat identified candidate genes related to processing quality, majority of them showing interaction (quality x development) and having temporal and spatial distributions.

    Science.gov (United States)

    Singh, Anuradha; Mantri, Shrikant; Sharma, Monica; Chaudhury, Ashok; Tuli, Rakesh; Roy, Joy

    2014-01-16

    The cultivated bread wheat (Triticum aestivum L.) possesses unique flour quality, which can be processed into many end-use food products such as bread, pasta, chapatti (unleavened flat bread), biscuit, etc. The present wheat varieties require improvement in processing quality to meet the increasing demand of better quality food products. However, processing quality is very complex and controlled by many genes, which have not been completely explored. To identify the candidate genes whose expressions changed due to variation in processing quality and interaction (quality x development), genome-wide transcriptome studies were performed in two sets of diverse Indian wheat varieties differing for chapatti quality. It is also important to understand the temporal and spatial distributions of their expressions for designing tissue and growth specific functional genomics experiments. Gene-specific two-way ANOVA analysis of expression of about 55 K transcripts in two diverse sets of Indian wheat varieties for chapatti quality at three seed developmental stages identified 236 differentially expressed probe sets (10-fold). Out of 236, 110 probe sets were identified for chapatti quality. Many processing quality related key genes such as glutenin and gliadins, puroindolines, grain softness protein, alpha and beta amylases, proteases, were identified, and many other candidate genes related to cellular and molecular functions were also identified. The ANOVA analysis revealed that the expression of 56 of 110 probe sets was involved in interaction (quality x development). Majority of the probe sets showed differential expression at early stage of seed development i.e. temporal expression. Meta-analysis revealed that the majority of the genes expressed in one or a few growth stages indicating spatial distribution of their expressions. The differential expressions of a few candidate genes such as pre-alpha/beta-gliadin and gamma gliadin were validated by RT-PCR. Therefore, this study

  9. Computational analysis of candidate disease genes and variants for Salt-sensitive hypertension in indigenous Southern Africans

    KAUST Repository

    Tiffin, Nicki

    2010-09-27

    Multiple factors underlie susceptibility to essential hypertension, including a significant genetic and ethnic component, and environmental effects. Blood pressure response of hypertensive individuals to salt is heterogeneous, but salt sensitivity appears more prevalent in people of indigenous African origin. The underlying genetics of salt-sensitive hypertension, however, are poorly understood. In this study, computational methods including text- and data-mining have been used to select and prioritize candidate aetiological genes for salt-sensitive hypertension. Additionally, we have compared allele frequencies and copy number variation for single nucleotide polymorphisms in candidate genes between indigenous Southern African and Caucasian populations, with the aim of identifying candidate genes with significant variability between the population groups: identifying genetic variability between population groups can exploit ethnic differences in disease prevalence to aid with prioritisation of good candidate genes. Our top-ranking candidate genes include parathyroid hormone precursor (PTH) and type-1angiotensin II receptor (AGTR1). We propose that the candidate genes identified in this study warrant further investigation as potential aetiological genes for salt-sensitive hypertension. © 2010 Tiffin et al.

  10. Molecular genetic gene-environment studies using candidate genes in schizophrenia: a systematic review.

    Science.gov (United States)

    Modinos, Gemma; Iyegbe, Conrad; Prata, Diana; Rivera, Margarita; Kempton, Matthew J; Valmaggia, Lucia R; Sham, Pak C; van Os, Jim; McGuire, Philip

    2013-11-01

    The relatively high heritability of schizophrenia suggests that genetic factors play an important role in the etiology of the disorder. On the other hand, a number of environmental factors significantly influence its incidence. As few direct genetic effects have been demonstrated, and there is considerable inter-individual heterogeneity in the response to the known environmental factors, interactions between genetic and environmental factors may be important in determining whether an individual develops the disorder. To date, a considerable number of studies of gene-environment interactions (G×E) in schizophrenia have employed a hypothesis-based molecular genetic approach using candidate genes, which have led to a range of different findings. This systematic review aims to summarize the results from molecular genetic candidate studies and to review challenges and opportunities of this approach in psychosis research. Finally, we discuss the potential of future prospects, such as new studies that combine hypothesis-based molecular genetic candidate approaches with agnostic genome-wide association studies in determining schizophrenia risk. © 2013 Elsevier B.V. All rights reserved.

  11. Candidate innate immune system gene expression in the ecological model Daphnia.

    Science.gov (United States)

    Decaestecker, Ellen; Labbé, Pierrick; Ellegaard, Kirsten; Allen, Judith E; Little, Tom J

    2011-10-01

    The last ten years have witnessed increasing interest in host-pathogen interactions involving invertebrate hosts. The invertebrate innate immune system is now relatively well characterised, but in a limited range of genetic model organisms and under a limited number of conditions. Immune systems have been little studied under real-world scenarios of environmental variation and parasitism. Thus, we have investigated expression of candidate innate immune system genes in the water flea Daphnia, a model organism for ecological genetics, and whose capacity for clonal reproduction facilitates an exceptionally rigorous control of exposure dose or the study of responses at many time points. A unique characteristic of the particular Daphnia clones and pathogen strain combinations used presently is that they have been shown to be involved in specific host-pathogen coevolutionary interactions in the wild. We choose five genes, which are strong candidates to be involved in Daphnia-pathogen interactions, given that they have been shown to code for immune effectors in related organisms. Differential expression of these genes was quantified by qRT-PCR following exposure to the bacterial pathogen Pasteuria ramosa. Constitutive expression levels differed between host genotypes, and some genes appeared to show correlated expression. However, none of the genes appeared to show a major modification of expression level in response to Pasteuria exposure. By applying knowledge from related genetic model organisms (e.g. Drosophila) to models for the study of evolutionary ecology and coevolution (i.e. Daphnia), the candidate gene approach is temptingly efficient. However, our results show that detection of only weak patterns is likely if one chooses target genes for study based on previously identified genome sequences by comparison to homologues from other related organisms. Future work on the Daphnia-Pasteuria system will need to balance a candidate gene approach with more comprehensive

  12. Survey of Candidate Genes for Maize Resistance to Infection by Aspergillus flavus and/or Aflatoxin Contamination

    Science.gov (United States)

    Hawkins, Leigh K.; Tang, Juliet D.; Tomashek, John; Alves Oliveira, Dafne; Ogunola, Oluwaseun F.; Smith, J. Spencer; Williams, W. Paul

    2018-01-01

    Many projects have identified candidate genes for resistance to aflatoxin accumulation or Aspergillus flavus infection and growth in maize using genetic mapping, genomics, transcriptomics and/or proteomics studies. However, only a small percentage of these candidates have been validated in field conditions, and their relative contribution to resistance, if any, is unknown. This study presents a consolidated list of candidate genes identified in past studies or in-house studies, with descriptive data including genetic location, gene annotation, known protein identifiers, and associated pathway information, if known. A candidate gene pipeline to test the phenotypic effect of any maize DNA sequence on aflatoxin accumulation resistance was used in this study to determine any measurable effect on polymorphisms within or linked to the candidate gene sequences, and the results are published here. PMID:29385107

  13. Survey of Candidate Genes for Maize Resistance to Infection by Aspergillus flavus and/or Aflatoxin Contamination

    Directory of Open Access Journals (Sweden)

    Leigh K. Hawkins

    2018-01-01

    Full Text Available Many projects have identified candidate genes for resistance to aflatoxin accumulation or Aspergillus flavus infection and growth in maize using genetic mapping, genomics, transcriptomics and/or proteomics studies. However, only a small percentage of these candidates have been validated in field conditions, and their relative contribution to resistance, if any, is unknown. This study presents a consolidated list of candidate genes identified in past studies or in-house studies, with descriptive data including genetic location, gene annotation, known protein identifiers, and associated pathway information, if known. A candidate gene pipeline to test the phenotypic effect of any maize DNA sequence on aflatoxin accumulation resistance was used in this study to determine any measurable effect on polymorphisms within or linked to the candidate gene sequences, and the results are published here.

  14. A Generally Applicable Translational Strategy Identifies S100A4 as a Candidate Gene in Allergy

    DEFF Research Database (Denmark)

    Bruhn, Sören; Fang, Yu; Barrenäs, Fredrik

    2014-01-01

    The identification of diagnostic markers and therapeutic candidate genes in common diseases is complicated by the involvement of thousands of genes. We hypothesized that genes co-regulated with a key gene in allergy, IL13, would form a module that could help to identify candidate genes. We identi...

  15. from microarrays and quantitative trait loci to candidate genes

    Indian Academy of Sciences (India)

    Unknown

    2004-10-15

    Oct 15, 2004 ... to candidate genes – A research plan and preliminary results using Drosophila as a model organism and climatic ... Recent developments in molecular genetics ..... scientists in agriculture, medicine and psychology for test-.

  16. Identifying Candidate Reprogramming Genes in Mouse Induced Pluripotent Stem Cells.

    Science.gov (United States)

    Gao, Fang; Li, Jingyu; Zhang, Heng; Yang, Xu; An, Tiezhu

    2017-08-01

    Factor-based induced reprogramming approaches have tremendous potential for human regenerative medicine, but the efficiencies of these approaches are still low. In this study, we analyzed the global transcriptional profiles of mouse induced pluripotent stem cells (miPSCs) and mouse embryonic stem cells (mESCs) from seven different labs and present here the first successful clustering according to cell type, not by lab of origin. We identified 2131 different expression genes (DEs) as candidate pluripotency-associated genes by comparing mESCs/miPSCs with somatic cells and 720 DEs between miPSCs and mESCs. Interestingly, there was a significant overlap between the two DE sets. Therefore, we defined the overlap DEs as "consensus DEs" including 313 miPSC-specific genes expressed at a higher level in miPSCs versus mESCs and 184 mESC-specific genes in total and reasoned that these may contribute to the differences in pluripotency between mESCs and miPSCs. A classification of "consensus DEs" according to their different expression levels between somatic cells and mESCs/miPSCs shows that 86% of the miPSC-specific genes are more highly expressed in somatic cells, while 73% of mESC-specific genes are highly expressed in mESCs/miPSCs, indicating that the miPSCs have not efficiently silenced the expression pattern of the somatic cells from which they are derived and failed to completely induce the genes with high expression levels in mESCs. We further revealed a strong correlation between oocyte-enriched factors and insufficiently induced mESC-specific genes and identified 11 hub genes via network analysis. In light of these findings, we postulated that these key hub genes might not only drive somatic cell nuclear transfer (SCNT) reprogramming but also augment the efficiency and quality of miPSC reprogramming.

  17. Resolving candidate genes of mouse skeletal muscle QTL via RNA-Seq and expression network analyses

    Directory of Open Access Journals (Sweden)

    Lionikas Arimantas

    2012-11-01

    Full Text Available Abstract Background We have recently identified a number of Quantitative Trait Loci (QTL contributing to the 2-fold muscle weight difference between the LG/J and SM/J mouse strains and refined their confidence intervals. To facilitate nomination of the candidate genes responsible for these differences we examined the transcriptome of the tibialis anterior (TA muscle of each strain by RNA-Seq. Results 13,726 genes were expressed in mouse skeletal muscle. Intersection of a set of 1061 differentially expressed transcripts with a mouse muscle Bayesian Network identified a coherent set of differentially expressed genes that we term the LG/J and SM/J Regulatory Network (LSRN. The integration of the QTL, transcriptome and the network analyses identified eight key drivers of the LSRN (Kdr, Plbd1, Mgp, Fah, Prss23, 2310014F06Rik, Grtp1, Stk10 residing within five QTL regions, which were either polymorphic or differentially expressed between the two strains and are strong candidates for quantitative trait genes (QTGs underlying muscle mass. The insight gained from network analysis including the ability to make testable predictions is illustrated by annotating the LSRN with knowledge-based signatures and showing that the SM/J state of the network corresponds to a more oxidative state. We validated this prediction by NADH tetrazolium reductase staining in the TA muscle revealing higher oxidative potential of the SM/J compared to the LG/J strain (p Conclusion Thus, integration of fine resolution QTL mapping, RNA-Seq transcriptome information and mouse muscle Bayesian Network analysis provides a novel and unbiased strategy for nomination of muscle QTGs.

  18. Candidate genes for drought tolerance and improved productivity in ...

    Indian Academy of Sciences (India)

    Madhu

    Improving drought tolerance and productivity is one of the most difficult tasks for ... Keywords. Candidate gene; mapping population; polymerase chain reaction; single marker analysis. .... ple and the mean value computed. 2.4 Isolation of DNA.

  19. Prioritization of candidate disease genes by topological similarity between disease and protein diffusion profiles.

    Science.gov (United States)

    Zhu, Jie; Qin, Yufang; Liu, Taigang; Wang, Jun; Zheng, Xiaoqi

    2013-01-01

    Identification of gene-phenotype relationships is a fundamental challenge in human health clinic. Based on the observation that genes causing the same or similar phenotypes tend to correlate with each other in the protein-protein interaction network, a lot of network-based approaches were proposed based on different underlying models. A recent comparative study showed that diffusion-based methods achieve the state-of-the-art predictive performance. In this paper, a new diffusion-based method was proposed to prioritize candidate disease genes. Diffusion profile of a disease was defined as the stationary distribution of candidate genes given a random walk with restart where similarities between phenotypes are incorporated. Then, candidate disease genes are prioritized by comparing their diffusion profiles with that of the disease. Finally, the effectiveness of our method was demonstrated through the leave-one-out cross-validation against control genes from artificial linkage intervals and randomly chosen genes. Comparative study showed that our method achieves improved performance compared to some classical diffusion-based methods. To further illustrate our method, we used our algorithm to predict new causing genes of 16 multifactorial diseases including Prostate cancer and Alzheimer's disease, and the top predictions were in good consistent with literature reports. Our study indicates that integration of multiple information sources, especially the phenotype similarity profile data, and introduction of global similarity measure between disease and gene diffusion profiles are helpful for prioritizing candidate disease genes. Programs and data are available upon request.

  20. Gene set analysis of the EADGENE chicken data-set

    DEFF Research Database (Denmark)

    Skarman, Axel; Jiang, Li; Hornshøj, Henrik

    2009-01-01

     Abstract Background: Gene set analysis is considered to be a way of improving our biological interpretation of the observed expression patterns. This paper describes different methods applied to analyse expression data from a chicken DNA microarray dataset. Results: Applying different gene set...... analyses to the chicken expression data led to different ranking of the Gene Ontology terms tested. A method for prediction of possible annotations was applied. Conclusion: Biological interpretation based on gene set analyses dependent on the statistical method used. Methods for predicting the possible...

  1. Time-Course Gene Set Analysis for Longitudinal Gene Expression Data.

    Directory of Open Access Journals (Sweden)

    Boris P Hejblum

    2015-06-01

    Full Text Available Gene set analysis methods, which consider predefined groups of genes in the analysis of genomic data, have been successfully applied for analyzing gene expression data in cross-sectional studies. The time-course gene set analysis (TcGSA introduced here is an extension of gene set analysis to longitudinal data. The proposed method relies on random effects modeling with maximum likelihood estimates. It allows to use all available repeated measurements while dealing with unbalanced data due to missing at random (MAR measurements. TcGSA is a hypothesis driven method that identifies a priori defined gene sets with significant expression variations over time, taking into account the potential heterogeneity of expression within gene sets. When biological conditions are compared, the method indicates if the time patterns of gene sets significantly differ according to these conditions. The interest of the method is illustrated by its application to two real life datasets: an HIV therapeutic vaccine trial (DALIA-1 trial, and data from a recent study on influenza and pneumococcal vaccines. In the DALIA-1 trial TcGSA revealed a significant change in gene expression over time within 69 gene sets during vaccination, while a standard univariate individual gene analysis corrected for multiple testing as well as a standard a Gene Set Enrichment Analysis (GSEA for time series both failed to detect any significant pattern change over time. When applied to the second illustrative data set, TcGSA allowed the identification of 4 gene sets finally found to be linked with the influenza vaccine too although they were found to be associated to the pneumococcal vaccine only in previous analyses. In our simulation study TcGSA exhibits good statistical properties, and an increased power compared to other approaches for analyzing time-course expression patterns of gene sets. The method is made available for the community through an R package.

  2. Next-generation sequencing to identify candidate genes and develop diagnostic markers for a novel Phytophthora resistance gene, RpsHC18, in soybean.

    Science.gov (United States)

    Zhong, Chao; Sun, Suli; Li, Yinping; Duan, Canxing; Zhu, Zhendong

    2018-03-01

    A novel Phytophthora sojae resistance gene RpsHC18 was identified and finely mapped on soybean chromosome 3. Two NBS-LRR candidate genes were identified and two diagnostic markers of RpsHC18 were developed. Phytophthora root rot caused by Phytophthora sojae is a destructive disease of soybean. The most effective disease-control strategy is to deploy resistant cultivars carrying Phytophthora-resistant Rps genes. The soybean cultivar Huachun 18 has a broad and distinct resistance spectrum to 12 P. sojae isolates. Quantitative trait loci sequencing (QTL-seq), based on the whole-genome resequencing (WGRS) of two extreme resistant and susceptible phenotype bulks from an F 2:3 population, was performed, and one 767-kb genomic region with ΔSNP-index ≥ 0.9 on chromosome 3 was identified as the RpsHC18 candidate region in Huachun 18. The candidate region was reduced to a 146-kb region by fine mapping. Nonsynonymous SNP and haplotype analyses were carried out in the 146-kb region among ten soybean genotypes using WGRS. Four specific nonsynonymous SNPs were identified in two nucleotide-binding sites-leucine-rich repeat (NBS-LRR) genes, RpsHC18-NBL1 and RpsHC18-NBL2, which were considered to be the candidate genes. Finally, one specific SNP marker in each candidate gene was successfully developed using a tetra-primer ARMS-PCR assay, and the two markers were verified to be specific for RpsHC18 and to effectively distinguish other known Rps genes. In this study, we applied an integrated genomic-based strategy combining WGRS with traditional genetic mapping to identify RpsHC18 candidate genes and develop diagnostic markers. These results suggest that next-generation sequencing is a precise, rapid and cost-effective way to identify candidate genes and develop diagnostic markers, and it can accelerate Rps gene cloning and marker-assisted selection for breeding of P. sojae-resistant soybean cultivars.

  3. Transcriptomic Analysis Reveals Candidate Genes for Female Sterility in Pomegranate Flowers

    Directory of Open Access Journals (Sweden)

    Lina Chen

    2017-08-01

    Full Text Available Pomegranate has two types of flowers on the same plant: functional male flowers (FMF and bisexual flowers (BF. BF are female-fertile flowers that can set fruits. FMF are female-sterile flowers that fail to set fruit and that eventually drop. The putative cause of pomegranate FMF female sterility is abnormal ovule development. However, the key stage at which the FMF pomegranate ovules become abnormal and the mechanism of regulation of pomegranate female sterility remain unknown. Here, we studied ovule development in FMF and BF, using scanning electron microscopy to explore the key stage at which ovule development was terminated and then analyzed genes differentially expressed (differentially expressed genes – DEGs between FMF and BF to investigate the mechanism responsible for pomegranate female sterility. Ovule development in FMF ceased following the formation of the inner integument primordium. The key stage for the termination of FMF ovule development was when the bud vertical diameter was 5.0–13.0 mm. Candidate genes influencing ovule development may be crucial factors in pomegranate female sterility. INNER OUTER (INO/YABBY4 (Gglean016270 and AINTEGUMENTA (ANT homolog genes (Gglean003340 and Gglean011480, which regulate the development of the integument, showed down-regulation in FMF at the key stage of ovule development cessation (ATNSII. Their upstream regulator genes, such as AGAMOUS-like (AG-like (Gglean028014, Gglean026618, and Gglean028632 and SPOROCYTELESS (SPL homolog genes (Gglean005812, also showed differential expression pattern between BF and FMF at this key stage. The differential expression of the ethylene response signal genes, ETR (ethylene-resistant (Gglean022853 and ERF1/2 (ethylene-responsive factor (Gglean022880, between FMF and BF indicated that ethylene signaling may also be an important factor in the development of pomegranate female sterility. The increase in BF observed after spraying with ethephon supported this

  4. Knowledge Discovery in Biological Databases for Revealing Candidate Genes Linked to Complex Phenotypes.

    Science.gov (United States)

    Hassani-Pak, Keywan; Rawlings, Christopher

    2017-06-13

    Genetics and "omics" studies designed to uncover genotype to phenotype relationships often identify large numbers of potential candidate genes, among which the causal genes are hidden. Scientists generally lack the time and technical expertise to review all relevant information available from the literature, from key model species and from a potentially wide range of related biological databases in a variety of data formats with variable quality and coverage. Computational tools are needed for the integration and evaluation of heterogeneous information in order to prioritise candidate genes and components of interaction networks that, if perturbed through potential interventions, have a positive impact on the biological outcome in the whole organism without producing negative side effects. Here we review several bioinformatics tools and databases that play an important role in biological knowledge discovery and candidate gene prioritization. We conclude with several key challenges that need to be addressed in order to facilitate biological knowledge discovery in the future.

  5. Grass cell wall feruloylation: distribution of bound ferulate and candidate gene expression in Brachypodium distachyon

    Directory of Open Access Journals (Sweden)

    Hugo Bruno Correa Molinari

    2013-03-01

    Full Text Available The cell walls of grasses such as wheat, maize, rice and sugar cane, contain large amounts of ferulate that is ester-linked to the cell wall polysaccharide glucuronoarabinoxylan (GAX. This ferulate is considered to limit the digestibility of polysaccharide in grass biomass as it forms covalent linkages between polysaccharide and lignin components. Candidate genes within a grass-specific clade of the BAHD acyl-coA transferase superfamily have been identified as being responsible for the ester linkage of ferulate to GAX. Manipulation of these BAHD genes may therefore be a biotechnological target for increasing efficiency of conversion of grass biomass into biofuel. Here, we describe the expression of these candidate genes and amounts of bound ferulate from various tissues and developmental stages of the model grass Brachypodium distachyon. BAHD candidate transcripts and significant amounts of bound ferulate were present in every tissue and developmental stage. We hypothesise that BAHD candidate genes similar to the recently described rice OsPMT gene (PMT sub-clade are principally responsible for the bound coumaric acid (pCA, and that other BAHD candidates (non-PMT sub-clade are responsible for bound ferulic acid (FA. There were some similarities with between the ratio of expression non-PMT / PMT genes and the ratio of bound FA / pCA between tissue types, compatible with this hypothesis. However, much further work to modify BAHD genes in grasses and to characterise the heterologously expressed proteins is required to demonstrate their function.

  6. Candidate genes have sex-specific effects on timing of spring migration and moult speed in a long-distance migratory bird.

    Science.gov (United States)

    Bazzi, Gaia; Podofillini, Stefano; Gatti, Emanuele; Gianfranceschi, Luca; Cecere, Jacopo G; Spina, Fernando; Saino, Nicola; Rubolini, Diego

    2017-10-01

    The timing of major life-history events, such as migration and moult, is set by endogenous circadian and circannual clocks, that have been well characterized at the molecular level. Conversely, the genetic sources of variation in phenology and in other behavioral traits have been sparsely addressed. It has been proposed that inter-individual variability in the timing of seasonal events may arise from allelic polymorphism at phenological candidate genes involved in the signaling cascade of the endogenous clocks. In this study of a long-distance migratory passerine bird, the willow warbler Phylloscopus trochilus , we investigated whether allelic variation at 5 polymorphic loci of 4 candidate genes ( Adcyap1 , Clock , Creb1 , and Npas2 ), predicted 2 major components of the annual schedule, namely timing of spring migration across the central Mediterranean sea and moult speed, the latter gauged from ptilochronological analyses of tail feathers moulted in the African winter quarters. We identified a novel Clock gene locus ( Clock region 3) showing polyQ polymorphism, which was however not significantly associated with any phenotypic trait. Npas2 allele size predicted male (but not female) spring migration date, with males bearing longer alleles migrating significantly earlier than those bearing shorter alleles. Creb1 allele size significantly predicted male (but not female) moult speed, longer alleles being associated with faster moult. All other genotype-phenotype associations were statistically non-significant. These findings provide new evidence for a role of candidate genes in modulating the phenology of different circannual activities in long-distance migratory birds, and for the occurrence of sex-specific candidate gene effects.

  7. Replication of type 2 diabetes candidate genes variations in three geographically unrelated Indian population groups.

    Science.gov (United States)

    Ali, Shafat; Chopra, Rupali; Manvati, Siddharth; Singh, Yoginder Pal; Kaul, Nabodita; Behura, Anita; Mahajan, Ankit; Sehajpal, Prabodh; Gupta, Subash; Dhar, Manoj K; Chainy, Gagan B N; Bhanwer, Amarjit S; Sharma, Swarkar; Bamezai, Rameshwar N K

    2013-01-01

    Type 2 diabetes (T2D) is a syndrome of multiple metabolic disorders and is genetically heterogeneous. India comprises one of the largest global populations with highest number of reported type 2 diabetes cases. However, limited information about T2D associated loci is available for Indian populations. It is, therefore, pertinent to evaluate the previously associated candidates as well as identify novel genetic variations in Indian populations to understand the extent of genetic heterogeneity. We chose to do a cost effective high-throughput mass-array genotyping and studied the candidate gene variations associated with T2D in literature. In this case-control candidate genes association study, 91 SNPs from 55 candidate genes have been analyzed in three geographically independent population groups from India. We report the genetic variants in five candidate genes: TCF7L2, HHEX, ENPP1, IDE and FTO, are significantly associated (after Bonferroni correction, ppopulation. Interestingly, SNP rs7903146 of the TCF7L2 gene passed the genome wide significance threshold (combined P value = 2.05E-08) in the studied populations. We also observed the association of rs7903146 with blood glucose (fasting and postprandial) levels, supporting the role of TCF7L2 gene in blood glucose homeostasis. Further, we noted that the moderate risk provided by the independently associated loci in combined population with Odds Ratio (OR)<1.38 increased to OR = 2.44, (95%CI = 1.67-3.59) when the risk providing genotypes of TCF7L2, HHEX, ENPP1 and FTO genes were combined, suggesting the importance of gene-gene interactions evaluation in complex disorders like T2D.

  8. Looking into flowering time in almond (Prunus dulcis (Mill) D. A. Webb): the candidate gene approach.

    Science.gov (United States)

    Silva, C; Garcia-Mas, J; Sánchez, A M; Arús, P; Oliveira, M M

    2005-03-01

    Blooming time is one of the most important agronomic traits in almond. Biochemical and molecular events underlying flowering regulation must be understood before methods to stimulate late flowering can be developed. Attempts to elucidate the genetic control of this process have led to the identification of a major gene (Lb) and quantitative trait loci (QTLs) linked to observed phenotypic differences, but although this gene and these QTLs have been placed on the Prunus reference genetic map, their sequences and specific functions remain unknown. The aim of our investigation was to associate these loci with known genes using a candidate gene approach. Two almond cDNAs and eight Prunus expressed sequence tags were selected as candidate genes (CGs) since their sequences were highly identical to those of flowering regulatory genes characterized in other species. The CGs were amplified from both parental lines of the mapping population using specific primers. Sequence comparison revealed DNA polymorphisms between the parental lines, mainly of the single nucleotide type. Polymorphisms were used to develop co-dominant cleaved amplified polymorphic sequence markers or length polymorphisms based on insertion/deletion events for mapping the candidate genes on the Prunus reference map. Ten candidate genes were assigned to six linkage groups in the Prunus genome. The positions of two of these were compatible with the regions where two QTLs for blooming time were detected. One additional candidate was localized close to the position of the Evergrowing gene, which determines a non-deciduous behaviour in peach.

  9. Identification of a set of genes showing regionally enriched expression in the mouse brain

    Directory of Open Access Journals (Sweden)

    Marra Marco A

    2008-07-01

    Full Text Available Abstract Background The Pleiades Promoter Project aims to improve gene therapy by designing human mini-promoters ( Results We have utilized LongSAGE to identify regionally enriched transcripts in the adult mouse brain. As supplemental strategies, we also performed a meta-analysis of published literature and inspected the Allen Brain Atlas in situ hybridization data. From a set of approximately 30,000 mouse genes, 237 were identified as showing specific or enriched expression in 30 target regions of the mouse brain. GO term over-representation among these genes revealed co-involvement in various aspects of central nervous system development and physiology. Conclusion Using a multi-faceted expression validation approach, we have identified mouse genes whose human orthologs are good candidates for design of mini-promoters. These mouse genes represent molecular markers in several discrete brain regions/cell-types, which could potentially provide a mechanistic explanation of unique functions performed by each region. This set of markers may also serve as a resource for further studies of gene regulatory elements influencing brain expression.

  10. Defining the Sequence Elements and Candidate Genes for the Coloboma Mutation.

    Directory of Open Access Journals (Sweden)

    Elizabeth A. Robb

    Full Text Available The chicken coloboma mutation exhibits features similar to human congenital developmental malformations such as ocular coloboma, cleft-palate, dwarfism, and polydactyly. The coloboma-associated region and encoded genes were investigated using advanced genomic, genetic, and gene expression technologies. Initially, the mutation was linked to a 990 kb region encoding 11 genes; the application of the genetic and genomic tools led to a reduction of the linked region to 176 kb and the elimination of 7 genes. Furthermore, bioinformatics analyses of capture array-next generation sequence data identified genetic elements including SNPs, insertions, deletions, gaps, chromosomal rearrangements, and miRNA binding sites within the introgressed causative region relative to the reference genome sequence. Coloboma-specific variants within exons, UTRs, and splice sites were studied for their contribution to the mutant phenotype. Our compiled results suggest three genes for future studies. The three candidate genes, SLC30A5 (a zinc transporter, CENPH (a centromere protein, and CDK7 (a cyclin-dependent kinase, are differentially expressed (compared to normal embryos at stages and in tissues affected by the coloboma mutation. Of these genes, two (SLC30A5 and CENPH are considered high-priority candidate based upon studies in other vertebrate model systems.

  11. Gene set analysis using variance component tests.

    Science.gov (United States)

    Huang, Yen-Tsung; Lin, Xihong

    2013-06-28

    Gene set analyses have become increasingly important in genomic research, as many complex diseases are contributed jointly by alterations of numerous genes. Genes often coordinate together as a functional repertoire, e.g., a biological pathway/network and are highly correlated. However, most of the existing gene set analysis methods do not fully account for the correlation among the genes. Here we propose to tackle this important feature of a gene set to improve statistical power in gene set analyses. We propose to model the effects of an independent variable, e.g., exposure/biological status (yes/no), on multiple gene expression values in a gene set using a multivariate linear regression model, where the correlation among the genes is explicitly modeled using a working covariance matrix. We develop TEGS (Test for the Effect of a Gene Set), a variance component test for the gene set effects by assuming a common distribution for regression coefficients in multivariate linear regression models, and calculate the p-values using permutation and a scaled chi-square approximation. We show using simulations that type I error is protected under different choices of working covariance matrices and power is improved as the working covariance approaches the true covariance. The global test is a special case of TEGS when correlation among genes in a gene set is ignored. Using both simulation data and a published diabetes dataset, we show that our test outperforms the commonly used approaches, the global test and gene set enrichment analysis (GSEA). We develop a gene set analyses method (TEGS) under the multivariate regression framework, which directly models the interdependence of the expression values in a gene set using a working covariance. TEGS outperforms two widely used methods, GSEA and global test in both simulation and a diabetes microarray data.

  12. Polymorphisms of candidate genes associated with meat quality and ...

    African Journals Online (AJOL)

    Hung Nguyen

    Abstract. The objectives of this study were to analyse genotype distribution and sequence variations of candidate genes putatively associated with meat quality and disease resistance in exotic and indigenous. Vietnamese pig breeds. For this purpose, 340 pigs from four indigenous and two exotic breeds were included.

  13. Replication of type 2 diabetes candidate genes variations in three geographically unrelated Indian population groups.

    Directory of Open Access Journals (Sweden)

    Shafat Ali

    Full Text Available Type 2 diabetes (T2D is a syndrome of multiple metabolic disorders and is genetically heterogeneous. India comprises one of the largest global populations with highest number of reported type 2 diabetes cases. However, limited information about T2D associated loci is available for Indian populations. It is, therefore, pertinent to evaluate the previously associated candidates as well as identify novel genetic variations in Indian populations to understand the extent of genetic heterogeneity. We chose to do a cost effective high-throughput mass-array genotyping and studied the candidate gene variations associated with T2D in literature. In this case-control candidate genes association study, 91 SNPs from 55 candidate genes have been analyzed in three geographically independent population groups from India. We report the genetic variants in five candidate genes: TCF7L2, HHEX, ENPP1, IDE and FTO, are significantly associated (after Bonferroni correction, p<5.5E-04 with T2D susceptibility in combined population. Interestingly, SNP rs7903146 of the TCF7L2 gene passed the genome wide significance threshold (combined P value = 2.05E-08 in the studied populations. We also observed the association of rs7903146 with blood glucose (fasting and postprandial levels, supporting the role of TCF7L2 gene in blood glucose homeostasis. Further, we noted that the moderate risk provided by the independently associated loci in combined population with Odds Ratio (OR<1.38 increased to OR = 2.44, (95%CI = 1.67-3.59 when the risk providing genotypes of TCF7L2, HHEX, ENPP1 and FTO genes were combined, suggesting the importance of gene-gene interactions evaluation in complex disorders like T2D.

  14. CAsubtype: An R Package to Identify Gene Sets Predictive of Cancer Subtypes and Clinical Outcomes.

    Science.gov (United States)

    Kong, Hualei; Tong, Pan; Zhao, Xiaodong; Sun, Jielin; Li, Hua

    2018-03-01

    In the past decade, molecular classification of cancer has gained high popularity owing to its high predictive power on clinical outcomes as compared with traditional methods commonly used in clinical practice. In particular, using gene expression profiles, recent studies have successfully identified a number of gene sets for the delineation of cancer subtypes that are associated with distinct prognosis. However, identification of such gene sets remains a laborious task due to the lack of tools with flexibility, integration and ease of use. To reduce the burden, we have developed an R package, CAsubtype, to efficiently identify gene sets predictive of cancer subtypes and clinical outcomes. By integrating more than 13,000 annotated gene sets, CAsubtype provides a comprehensive repertoire of candidates for new cancer subtype identification. For easy data access, CAsubtype further includes the gene expression and clinical data of more than 2000 cancer patients from TCGA. CAsubtype first employs principal component analysis to identify gene sets (from user-provided or package-integrated ones) with robust principal components representing significantly large variation between cancer samples. Based on these principal components, CAsubtype visualizes the sample distribution in low-dimensional space for better understanding of the distinction between samples and classifies samples into subgroups with prevalent clustering algorithms. Finally, CAsubtype performs survival analysis to compare the clinical outcomes between the identified subgroups, assessing their clinical value as potentially novel cancer subtypes. In conclusion, CAsubtype is a flexible and well-integrated tool in the R environment to identify gene sets for cancer subtype identification and clinical outcome prediction. Its simple R commands and comprehensive data sets enable efficient examination of the clinical value of any given gene set, thus facilitating hypothesis generating and testing in biological and

  15. Identification of candidate genes associated with leaf senescence in cultivated sunflower (Helianthus annuus L..

    Directory of Open Access Journals (Sweden)

    Sebastian Moschen

    Full Text Available Cultivated sunflower (Helianthus annuus L., an important source of edible vegetable oil, shows rapid onset of senescence, which limits production by reducing photosynthetic capacity under specific growing conditions. Carbon for grain filling depends strongly on light interception by green leaf area, which diminishes during grain filling due to leaf senescence. Transcription factors (TFs regulate the progression of leaf senescence in plants and have been well explored in model systems, but information for many agronomic crops remains limited. Here, we characterize the expression profiles of a set of putative senescence associated genes (SAGs identified by a candidate gene approach and sunflower microarray expression studies. We examined a time course of sunflower leaves undergoing natural senescence and used quantitative PCR (qPCR to measure the expression of 11 candidate genes representing the NAC, WRKY, MYB and NF-Y TF families. In addition, we measured physiological parameters such as chlorophyll, total soluble sugars and nitrogen content. The expression of Ha-NAC01, Ha-NAC03, Ha-NAC04, Ha-NAC05 and Ha-MYB01 TFs increased before the remobilization rate increased and therefore, before the appearance of the first physiological symptoms of senescence, whereas Ha-NAC02 expression decreased. In addition, we also examined the trifurcate feed-forward pathway (involving ORE1, miR164, and ethylene insensitive 2 previously reported for Arabidopsis. We measured transcription of Ha-NAC01 (the sunflower homolog of ORE1 and Ha-EIN2, along with the levels of miR164, in two leaves from different stem positions, and identified differences in transcription between basal and upper leaves. Interestingly, Ha-NAC01 and Ha-EIN2 transcription profiles showed an earlier up-regulation in upper leaves of plants close to maturity, compared with basal leaves of plants at pre-anthesis stages. These results suggest that the H. annuus TFs characterized in this work could

  16. Identification of candidate genes associated with leaf senescence in cultivated sunflower (Helianthus annuus L.).

    Science.gov (United States)

    Moschen, Sebastian; Bengoa Luoni, Sofia; Paniego, Norma B; Hopp, H Esteban; Dosio, Guillermo A A; Fernandez, Paula; Heinz, Ruth A

    2014-01-01

    Cultivated sunflower (Helianthus annuus L.), an important source of edible vegetable oil, shows rapid onset of senescence, which limits production by reducing photosynthetic capacity under specific growing conditions. Carbon for grain filling depends strongly on light interception by green leaf area, which diminishes during grain filling due to leaf senescence. Transcription factors (TFs) regulate the progression of leaf senescence in plants and have been well explored in model systems, but information for many agronomic crops remains limited. Here, we characterize the expression profiles of a set of putative senescence associated genes (SAGs) identified by a candidate gene approach and sunflower microarray expression studies. We examined a time course of sunflower leaves undergoing natural senescence and used quantitative PCR (qPCR) to measure the expression of 11 candidate genes representing the NAC, WRKY, MYB and NF-Y TF families. In addition, we measured physiological parameters such as chlorophyll, total soluble sugars and nitrogen content. The expression of Ha-NAC01, Ha-NAC03, Ha-NAC04, Ha-NAC05 and Ha-MYB01 TFs increased before the remobilization rate increased and therefore, before the appearance of the first physiological symptoms of senescence, whereas Ha-NAC02 expression decreased. In addition, we also examined the trifurcate feed-forward pathway (involving ORE1, miR164, and ethylene insensitive 2) previously reported for Arabidopsis. We measured transcription of Ha-NAC01 (the sunflower homolog of ORE1) and Ha-EIN2, along with the levels of miR164, in two leaves from different stem positions, and identified differences in transcription between basal and upper leaves. Interestingly, Ha-NAC01 and Ha-EIN2 transcription profiles showed an earlier up-regulation in upper leaves of plants close to maturity, compared with basal leaves of plants at pre-anthesis stages. These results suggest that the H. annuus TFs characterized in this work could play important

  17. Positional RNA-Seq identifies candidate genes for phenotypic engineering of sexual traits

    NARCIS (Netherlands)

    Arbore, Roberto; Sekii, Kiyono; Beisel, Christian; Ladurner, Peter; Berezikov, Eugene; Schaerer, Lukas

    2015-01-01

    Introduction: RNA interference (RNAi) of trait-specific genes permits the manipulation of specific phenotypic traits ("phenotypic engineering") and thus represents a powerful tool to test trait function in evolutionary studies. The identification of suitable candidate genes, however, often relies on

  18. Candidate genes for drought tolerance and improved productivity in ...

    Indian Academy of Sciences (India)

    Madhu

    tropics. Improving drought tolerance and productivity is one of the most difficult tasks for cereal breeders. The diffi- culty arises from the diverse strategies adopted by plants themselves to combat drought stress depending on the timing,. Candidate genes for drought tolerance and improved productivity in rice (Oryza sativa L.).

  19. Polymorphisms of candidate genes associated with meat quality and ...

    African Journals Online (AJOL)

    The objectives of this study were to analyse genotype distribution and sequence variations of candidate genes putatively associated with meat quality and disease resistance in exotic and indigenous Vietnamese pig breeds. For this purpose, 340 pigs from four indigenous and two exotic breeds were included in the analysis ...

  20. Novel candidate genes important for asthma and hypertension comorbidity revealed from associative gene networks.

    Science.gov (United States)

    Saik, Olga V; Demenkov, Pavel S; Ivanisenko, Timofey V; Bragina, Elena Yu; Freidin, Maxim B; Goncharova, Irina A; Dosenko, Victor E; Zolotareva, Olga I; Hofestaedt, Ralf; Lavrik, Inna N; Rogaev, Evgeny I; Ivanisenko, Vladimir A

    2018-02-13

    Hypertension and bronchial asthma are a major issue for people's health. As of 2014, approximately one billion adults, or ~ 22% of the world population, have had hypertension. As of 2011, 235-330 million people globally have been affected by asthma and approximately 250,000-345,000 people have died each year from the disease. The development of the effective treatment therapies against these diseases is complicated by their comorbidity features. This is often a major problem in diagnosis and their treatment. Hence, in this study the bioinformatical methodology for the analysis of the comorbidity of these two diseases have been developed. As such, the search for candidate genes related to the comorbid conditions of asthma and hypertension can help in elucidating the molecular mechanisms underlying the comorbid condition of these two diseases, and can also be useful for genotyping and identifying new drug targets. Using ANDSystem, the reconstruction and analysis of gene networks associated with asthma and hypertension was carried out. The gene network of asthma included 755 genes/proteins and 62,603 interactions, while the gene network of hypertension - 713 genes/proteins and 45,479 interactions. Two hundred and five genes/proteins and 9638 interactions were shared between asthma and hypertension. An approach for ranking genes implicated in the comorbid condition of two diseases was proposed. The approach is based on nine criteria for ranking genes by their importance, including standard methods of gene prioritization (Endeavor, ToppGene) as well as original criteria that take into account the characteristics of an associative gene network and the presence of known polymorphisms in the analysed genes. According to the proposed approach, the genes IL10, TLR4, and CAT had the highest priority in the development of comorbidity of these two diseases. Additionally, it was revealed that the list of top genes is enriched with apoptotic genes and genes involved in

  1. No Evidence That Schizophrenia Candidate Genes Are More Associated With Schizophrenia Than Noncandidate Genes

    NARCIS (Netherlands)

    Johnson, Emma C; Border, Richard; Melroy-Greif, Whitney E; de Leeuw, Christiaan A; Ehringer, Marissa A; Keller, Matthew C

    2017-01-01

    BACKGROUND: A recent analysis of 25 historical candidate gene polymorphisms for schizophrenia in the largest genome-wide association study conducted to date suggested that these commonly studied variants were no more associated with the disorder than would be expected by chance. However, the same

  2. Genome-Wide Association Study Identifying Candidate Genes Influencing Important Agronomic Traits of Flax (Linum usitatissimum L.) Using SLAF-seq.

    Science.gov (United States)

    Xie, Dongwei; Dai, Zhigang; Yang, Zemao; Sun, Jian; Zhao, Debao; Yang, Xue; Zhang, Liguo; Tang, Qing; Su, Jianguang

    2017-01-01

    Flax ( Linum usitatissimum L.) is an important cash crop, and its agronomic traits directly affect yield and quality. Molecular studies on flax remain inadequate because relatively few flax genes have been associated with agronomic traits or have been identified as having potential applications. To identify markers and candidate genes that can potentially be used for genetic improvement of crucial agronomic traits, we examined 224 specimens of core flax germplasm; specifically, phenotypic data for key traits, including plant height, technical length, number of branches, number of fruits, and 1000-grain weight were investigated under three environmental conditions before specific-locus amplified fragment sequencing (SLAF-seq) was employed to perform a genome-wide association study (GWAS) for these five agronomic traits. Subsequently, the results were used to screen single nucleotide polymorphism (SNP) loci and candidate genes that exhibited a significant correlation with the important agronomic traits. Our analyses identified a total of 42 SNP loci that showed significant correlations with the five important agronomic flax traits. Next, candidate genes were screened in the 10 kb zone of each of the 42 SNP loci. These SNP loci were then analyzed by a more stringent screening via co-identification using both a general linear model (GLM) and a mixed linear model (MLM) as well as co-occurrences in at least two of the three environments, whereby 15 final candidate genes were obtained. Based on these results, we determined that UGT and PL are candidate genes for plant height, GRAS and XTH are candidate genes for the number of branches, Contig1437 and LU0019C12 are candidate genes for the number of fruits, and PHO1 is a candidate gene for the 1000-seed weight. We propose that the identified SNP loci and corresponding candidate genes might serve as a biological basis for improving crucial agronomic flax traits.

  3. Genome-Wide Association Study Identifying Candidate Genes Influencing Important Agronomic Traits of Flax (Linum usitatissimum L. Using SLAF-seq

    Directory of Open Access Journals (Sweden)

    Dongwei Xie

    2018-01-01

    Full Text Available Flax (Linum usitatissimum L. is an important cash crop, and its agronomic traits directly affect yield and quality. Molecular studies on flax remain inadequate because relatively few flax genes have been associated with agronomic traits or have been identified as having potential applications. To identify markers and candidate genes that can potentially be used for genetic improvement of crucial agronomic traits, we examined 224 specimens of core flax germplasm; specifically, phenotypic data for key traits, including plant height, technical length, number of branches, number of fruits, and 1000-grain weight were investigated under three environmental conditions before specific-locus amplified fragment sequencing (SLAF-seq was employed to perform a genome-wide association study (GWAS for these five agronomic traits. Subsequently, the results were used to screen single nucleotide polymorphism (SNP loci and candidate genes that exhibited a significant correlation with the important agronomic traits. Our analyses identified a total of 42 SNP loci that showed significant correlations with the five important agronomic flax traits. Next, candidate genes were screened in the 10 kb zone of each of the 42 SNP loci. These SNP loci were then analyzed by a more stringent screening via co-identification using both a general linear model (GLM and a mixed linear model (MLM as well as co-occurrences in at least two of the three environments, whereby 15 final candidate genes were obtained. Based on these results, we determined that UGT and PL are candidate genes for plant height, GRAS and XTH are candidate genes for the number of branches, Contig1437 and LU0019C12 are candidate genes for the number of fruits, and PHO1 is a candidate gene for the 1000-seed weight. We propose that the identified SNP loci and corresponding candidate genes might serve as a biological basis for improving crucial agronomic flax traits.

  4. Selection in the dopamine receptor 2 gene: a candidate SNP study

    Directory of Open Access Journals (Sweden)

    Tobias Göllner

    2015-08-01

    Full Text Available Dopamine is a major neurotransmitter in the human brain and is associated with various diseases. Schizophrenia, for example, is treated by blocking the dopamine receptors type 2. Shaner, Miller & Mintz (2004 stated that schizophrenia was the low fitness variant of a highly variable mental trait. We therefore explore whether the dopamine receptor 2 gene (DRD2 underwent any selection processes. We acquired genotype data of the 1,000 Genomes project (phase I, which contains 1,093 individuals from 14 populations. We included single nucleotide polymorphisms (SNPs with two minor allele frequencies (MAFs in the analysis: MAF over 0.05 and over 0.01. This is equivalent to 151 SNPs (MAF > 0.05 and 246 SNPs (MAF > 0.01 for DRD2. We used two different approaches (an outlier approach and a Bayesian approach to detect loci under selection. The combined results of both approaches yielded nine (MAF > 0.05 and two candidate SNPs (MAF > 0.01, under balancing selection. We also found weak signs for directional selection on DRD2, but in our opinion these were too weak to draw any final conclusions on directional selection in DRD2. All candidates for balancing selection are in the intronic region of the gene and only one (rs12574471 has been mentioned in the literature. Two of our candidate SNPs are located in specific regions of the gene: rs80215768 lies within a promoter flanking region and rs74751335 lies within a transcription factor binding site. We strongly encourage research on our candidate SNPs and their possible effects.

  5. Mapping a candidate gene (MdMYB10 for red flesh and foliage colour in apple

    Directory of Open Access Journals (Sweden)

    Allan Andrew C

    2007-07-01

    Full Text Available Abstract Background Integrating plant genomics and classical breeding is a challenge for both plant breeders and molecular biologists. Marker-assisted selection (MAS is a tool that can be used to accelerate the development of novel apple varieties such as cultivars that have fruit with anthocyanin through to the core. In addition, determining the inheritance of novel alleles, such as the one responsible for red flesh, adds to our understanding of allelic variation. Our goal was to map candidate anthocyanin biosynthetic and regulatory genes in a population segregating for the red flesh phenotypes. Results We have identified the Rni locus, a major genetic determinant of the red foliage and red colour in the core of apple fruit. In a population segregating for the red flesh and foliage phenotype we have determined the inheritance of the Rni locus and DNA polymorphisms of candidate anthocyanin biosynthetic and regulatory genes. Simple Sequence Repeats (SSRs and Single Nucleotide Polymorphisms (SNPs in the candidate genes were also located on an apple genetic map. We have shown that the MdMYB10 gene co-segregates with the Rni locus and is on Linkage Group (LG 09 of the apple genome. Conclusion We have performed candidate gene mapping in a fruit tree crop and have provided genetic evidence that red colouration in the fruit core as well as red foliage are both controlled by a single locus named Rni. We have shown that the transcription factor MdMYB10 may be the gene underlying Rni as there were no recombinants between the marker for this gene and the red phenotype in a population of 516 individuals. Associating markers derived from candidate genes with a desirable phenotypic trait has demonstrated the application of genomic tools in a breeding programme of a horticultural crop species.

  6. TargetMine, an integrated data warehouse for candidate gene prioritisation and target discovery.

    Directory of Open Access Journals (Sweden)

    Yi-An Chen

    Full Text Available Prioritising candidate genes for further experimental characterisation is a non-trivial challenge in drug discovery and biomedical research in general. An integrated approach that combines results from multiple data types is best suited for optimal target selection. We developed TargetMine, a data warehouse for efficient target prioritisation. TargetMine utilises the InterMine framework, with new data models such as protein-DNA interactions integrated in a novel way. It enables complicated searches that are difficult to perform with existing tools and it also offers integration of custom annotations and in-house experimental data. We proposed an objective protocol for target prioritisation using TargetMine and set up a benchmarking procedure to evaluate its performance. The results show that the protocol can identify known disease-associated genes with high precision and coverage. A demonstration version of TargetMine is available at http://targetmine.nibio.go.jp/.

  7. Principles for the organization of gene-sets.

    Science.gov (United States)

    Li, Wentian; Freudenberg, Jan; Oswald, Michaela

    2015-12-01

    A gene-set, an important concept in microarray expression analysis and systems biology, is a collection of genes and/or their products (i.e. proteins) that have some features in common. There are many different ways to construct gene-sets, but a systematic organization of these ways is lacking. Gene-sets are mainly organized ad hoc in current public-domain databases, with group header names often determined by practical reasons (such as the types of technology in obtaining the gene-sets or a balanced number of gene-sets under a header). Here we aim at providing a gene-set organization principle according to the level at which genes are connected: homology, physical map proximity, chemical interaction, biological, and phenotypic-medical levels. We also distinguish two types of connections between genes: actual connection versus sharing of a label. Actual connections denote direct biological interactions, whereas shared label connection denotes shared membership in a group. Some extensions of the framework are also addressed such as overlapping of gene-sets, modules, and the incorporation of other non-protein-coding entities such as microRNAs. Copyright © 2015 Elsevier Ltd. All rights reserved.

  8. Identifying Novel Candidate Genes Related to Apoptosis from a Protein-Protein Interaction Network

    Directory of Open Access Journals (Sweden)

    Baoman Wang

    2015-01-01

    Full Text Available Apoptosis is the process of programmed cell death (PCD that occurs in multicellular organisms. This process of normal cell death is required to maintain the balance of homeostasis. In addition, some diseases, such as obesity, cancer, and neurodegenerative diseases, can be cured through apoptosis, which produces few side effects. An effective comprehension of the mechanisms underlying apoptosis will be helpful to prevent and treat some diseases. The identification of genes related to apoptosis is essential to uncover its underlying mechanisms. In this study, a computational method was proposed to identify novel candidate genes related to apoptosis. First, protein-protein interaction information was used to construct a weighted graph. Second, a shortest path algorithm was applied to the graph to search for new candidate genes. Finally, the obtained genes were filtered by a permutation test. As a result, 26 genes were obtained, and we discuss their likelihood of being novel apoptosis-related genes by collecting evidence from published literature.

  9. 'Omics' approaches in tomato aimed at identifying candidate genes ...

    African Journals Online (AJOL)

    adriana

    2013-12-04

    Dec 4, 2013 ... approaches could be combined in order to identify candidate genes for the genetic control of ascorbic ..... applied to other traits under the complex control of many ... Engineering increased vitamin C levels in ... Chem. Biol. 13:532–538. Giovannucci E, Rimm EB, Liu Y, Stampfer MJ, Willett WC (2002). A.

  10. Gene Network Construction from Microarray Data Identifies a Key Network Module and Several Candidate Hub Genes in Age-Associated Spatial Learning Impairment.

    Science.gov (United States)

    Uddin, Raihan; Singh, Shiva M

    2017-01-01

    As humans age many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI related genes and their networks based on co-expression relationship of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in "learning and memory" related functions and pathways. Subsequent differential network analysis of this "learning and memory" module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known function of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. Taken together, they

  11. Identification of Candidate B-Lymphoma Genes by Cross-Species Gene Expression Profiling

    Science.gov (United States)

    Tompkins, Van S.; Han, Seong-Su; Olivier, Alicia; Syrbu, Sergei; Bair, Thomas; Button, Anna; Jacobus, Laura; Wang, Zebin; Lifton, Samuel; Raychaudhuri, Pradip; Morse, Herbert C.; Weiner, George; Link, Brian; Smith, Brian J.; Janz, Siegfried

    2013-01-01

    Comparative genome-wide expression profiling of malignant tumor counterparts across the human-mouse species barrier has a successful track record as a gene discovery tool in liver, breast, lung, prostate and other cancers, but has been largely neglected in studies on neoplasms of mature B-lymphocytes such as diffuse large B cell lymphoma (DLBCL) and Burkitt lymphoma (BL). We used global gene expression profiles of DLBCL-like tumors that arose spontaneously in Myc-transgenic C57BL/6 mice as a phylogenetically conserved filter for analyzing the human DLBCL transcriptome. The human and mouse lymphomas were found to have 60 concordantly deregulated genes in common, including 8 genes that Cox hazard regression analysis associated with overall survival in a published landmark dataset of DLBCL. Genetic network analysis of the 60 genes followed by biological validation studies indicate FOXM1 as a candidate DLBCL and BL gene, supporting a number of studies contending that FOXM1 is a therapeutic target in mature B cell tumors. Our findings demonstrate the value of the “mouse filter” for genomic studies of human B-lineage neoplasms for which a vast knowledge base already exists. PMID:24130802

  12. Genetic determinants of facial clefting: analysis of 357 candidate genes using two national cleft studies from Scandinavia.

    Directory of Open Access Journals (Sweden)

    Astanand Jugessur

    Full Text Available Facial clefts are common birth defects with a strong genetic component. To identify fetal genetic risk factors for clefting, 1536 SNPs in 357 candidate genes were genotyped in two population-based samples from Scandinavia (Norway: 562 case-parent and 592 control-parent triads; Denmark: 235 case-parent triads.We used two complementary statistical methods, TRIMM and HAPLIN, to look for associations across these two national samples. TRIMM tests for association in each gene by using multi-SNP genotypes from case-parent triads directly without the need to infer haplotypes. HAPLIN on the other hand estimates the full haplotype distribution over a set of SNPs and estimates relative risks associated with each haplotype. For isolated cleft lip with or without cleft palate (I-CL/P, TRIMM and HAPLIN both identified significant associations with IRF6 and ADH1C in both populations, but only HAPLIN found an association with FGF12. For isolated cleft palate (I-CP, TRIMM found associations with ALX3, MKX, and PDGFC in both populations, but only the association with PDGFC was identified by HAPLIN. In addition, HAPLIN identified an association with ETV5 that was not detected by TRIMM.Strong associations with seven genes were replicated in the Scandinavian samples and our approach effectively replicated the strongest previously known association in clefting--with IRF6. Based on two national cleft cohorts of similar ancestry, two robust statistical methods and a large panel of SNPs in the most promising cleft candidate genes to date, this study identified a previously unknown association with clefting for ADH1C and provides additional candidates and analytic approaches to advance the field.

  13. Association analysis of nine candidate gene polymorphisms in Indian patients with type 2 diabetic retinopathy

    Directory of Open Access Journals (Sweden)

    Govindarajan Gowthaman

    2010-11-01

    Full Text Available Abstract Background Diabetic retinopathy (DR is classically defined as a microvasculopathy that primarily affects the small blood vessels of the inner retina as a complication of diabetes mellitus (DM.It is a multifactorial disease with a strong genetic component. The aim of this study is to investigate the association of a set of nine candidate genes with the development of diabetic retinopathy in a South Indian cohort who have type 2 diabetes mellitus (T2DM. Methods Seven candidate genes (RAGE, PEDF, AKR1B1, EPO, HTRA1, ICAM and HFE were chosen based on reported association with DR in the literature. Two more, CFH and ARMS2, were chosen based on their roles in biological pathways previously implicated in DR. Fourteen single nucleotide polymorphisms (SNPs and one dinucleotide repeat polymorphism, previously reported to show association with DR or other related diseases, were genotyped in 345 DR and 356 diabetic patients without retinopathy (DNR. The genes which showed positive association in this screening set were tested further in additional sets of 100 DR and 90 DNR additional patients from the Aravind Eye Hospital. Those which showed association in the secondary screen were subjected to a combined analysis with the 100 DR and 100 DNR subjects previously recruited and genotyped through the Sankara Nethralaya Hospital, India. Genotypes were evaluated using a combination of direct sequencing, TaqMan SNP genotyping, RFLP analysis, and SNaPshot PCR assays. Chi-square and Fisher exact tests were used to analyze the genotype and allele frequencies. Results Among the nine loci (15 polymorphisms screened, SNP rs2070600 (G82S in the RAGE gene, showed significant association with DR (allelic P = 0.016, dominant model P = 0.012, compared to DNR. SNP rs2070600 further showed significant association with DR in the confirmation cohort (P = 0.035, dominant model P = 0.032. Combining the two cohorts gave an allelic P HTRA1, rs11200638 (G>A, showed marginal

  14. Candidate Genes Detected in Transcriptome Studies are Strongly Dependent on Genetic Background

    DEFF Research Database (Denmark)

    Sarup, Pernille Merete; Sørensen, Jesper Givskov; Kristensen, Torsten Nygård

    2011-01-01

    identified from studies of gene expression in Drosophila melanogaster using similar technical platforms. We found little overlap across studies between putative candidate genes for the same traits in the same sex. Instead there was a high degree of overlap between different traits and sexes within the same...

  15. Case-control approach application for finding a relationship between candidate genes and clinical mastitis in Holstein dairy cattle.

    Science.gov (United States)

    Bagheri, Masoumeh; Moradi-Sharhrbabak, M; Miraie-Ashtiani, R; Safdari-Shahroudi, M; Abdollahi-Arpanahi, R

    2016-02-01

    Mastitis is a major source of economic loss in dairy herds. The objective of this research was to evaluate the association between genotypes within SLC11A1 and CXCR1 candidate genes and clinical mastitis in Holstein dairy cattle using the selective genotyping method. The data set contained clinical mastitis records of 3,823 Holstein cows from two Holstein dairy herds located in two different regions in Iran. Data included the number of cases of clinical mastitis per lactation. Selective genotyping was based on extreme values for clinical mastitis residuals (CMR) from mixed model analyses. Two extreme groups consisting of 135 cows were formed (as cases and controls), and genotyped for the two candidate genes, namely, SLC11A1 and CXCR1, using polymerase chain reaction-single strand conformation polymorphism (PCR-SSCP) and polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP), respectively. Associations between single nucleotide polymorphism (SNP) genotypes with CMR and breeding values for milk and protein yield were carried out by applying logistic regression analyses, i.e. estimating the probability of the heterogeneous genotype in the dependency of values for CMR and breeding values (BVs). The sequencing results revealed a novel mutation in 1139 bp of exon 11 of the SLC11A1 gene and this SNP had a significant association with CMR (P G and these genotypes had significant relationships with CMR. Overall, the results showed that SLC11A1 and CXCR1 are valuable candidate genes for the improvement of mastitis resistance as well as production traits in dairy cattle populations.

  16. Effect of the absolute statistic on gene-sampling gene-set analysis methods.

    Science.gov (United States)

    Nam, Dougu

    2017-06-01

    Gene-set enrichment analysis and its modified versions have commonly been used for identifying altered functions or pathways in disease from microarray data. In particular, the simple gene-sampling gene-set analysis methods have been heavily used for datasets with only a few sample replicates. The biggest problem with this approach is the highly inflated false-positive rate. In this paper, the effect of absolute gene statistic on gene-sampling gene-set analysis methods is systematically investigated. Thus far, the absolute gene statistic has merely been regarded as a supplementary method for capturing the bidirectional changes in each gene set. Here, it is shown that incorporating the absolute gene statistic in gene-sampling gene-set analysis substantially reduces the false-positive rate and improves the overall discriminatory ability. Its effect was investigated by power, false-positive rate, and receiver operating curve for a number of simulated and real datasets. The performances of gene-set analysis methods in one-tailed (genome-wide association study) and two-tailed (gene expression data) tests were also compared and discussed.

  17. Identification of candidate genes for dissecting complex branch number trait in chickpea.

    Science.gov (United States)

    Bajaj, Deepak; Upadhyaya, Hari D; Das, Shouvik; Kumar, Vinod; Gowda, C L L; Sharma, Shivali; Tyagi, Akhilesh K; Parida, Swarup K

    2016-04-01

    The present study exploited integrated genomics-assisted breeding strategy for genetic dissection of complex branch number quantitative trait in chickpea. Candidate gene-based association analysis in a branch number association panel was performed by utilizing the genotyping data of 401 SNP allelic variants mined from 27 known cloned branch number gene orthologs of chickpea. The genome-wide association study (GWAS) integrating both genome-wide GBS- (4556 SNPs) and candidate gene-based genotyping information of 4957 SNPs in a structured population of 60 sequenced desi and kabuli accessions (with 350-400 kb LD decay), detected 11 significant genomic loci (genes) associated (41% combined PVE) with branch number in chickpea. Of these, seven branch number-associated genes were further validated successfully in two inter (ICC 4958 × ICC 17160)- and intra (ICC 12299 × ICC 8261)-specific mapping populations. The axillary meristem and shoot apical meristem-specific expression, including differential up- and down-regulation (4-5 fold) of the validated seven branch number-associated genes especially in high branch number as compared to the low branch number-containing parental accessions and homozygous individuals of two aforesaid mapping populations was apparent. Collectively, this combinatorial genomic approach delineated diverse naturally occurring novel functional SNP allelic variants in seven potential known/candidate genes [PIN1 (PIN-FORMED protein 1), TB1 (teosinte branched 1), BA1/LAX1 (BARREN STALK1/LIKE AUXIN1), GRAS8 (gibberellic acid insensitive/GAI, Repressor of ga13/RGA and Scarecrow8/SCR8), ERF (ethylene-responsive element-binding factor), MAX2 (more axillary growth 2) and lipase] governing chickpea branch number. The useful information generated from this study have potential to expedite marker-assisted genetic enhancement by developing high-yielding cultivars with more number of productive (pods and seeds) branches in chickpea. Copyright © 2016 Elsevier

  18. Integrated database for identifying candidate genes for Aspergillus flavus resistance in maize.

    Science.gov (United States)

    Kelley, Rowena Y; Gresham, Cathy; Harper, Jonathan; Bridges, Susan M; Warburton, Marilyn L; Hawkins, Leigh K; Pechanova, Olga; Peethambaran, Bela; Pechan, Tibor; Luthe, Dawn S; Mylroie, J E; Ankala, Arunkanth; Ozkan, Seval; Henry, W B; Williams, W P

    2010-10-07

    Aspergillus flavus Link:Fr, an opportunistic fungus that produces aflatoxin, is pathogenic to maize and other oilseed crops. Aflatoxin is a potent carcinogen, and its presence markedly reduces the value of grain. Understanding and enhancing host resistance to A. flavus infection and/or subsequent aflatoxin accumulation is generally considered an efficient means of reducing grain losses to aflatoxin. Different proteomic, genomic and genetic studies of maize (Zea mays L.) have generated large data sets with the goal of identifying genes responsible for conferring resistance to A. flavus, or aflatoxin. In order to maximize the usage of different data sets in new studies, including association mapping, we have constructed a relational database with web interface integrating the results of gene expression, proteomic (both gel-based and shotgun), Quantitative Trait Loci (QTL) genetic mapping studies, and sequence data from the literature to facilitate selection of candidate genes for continued investigation. The Corn Fungal Resistance Associated Sequences Database (CFRAS-DB) (http://agbase.msstate.edu/) was created with the main goal of identifying genes important to aflatoxin resistance. CFRAS-DB is implemented using MySQL as the relational database management system running on a Linux server, using an Apache web server, and Perl CGI scripts as the web interface. The database and the associated web-based interface allow researchers to examine many lines of evidence (e.g. microarray, proteomics, QTL studies, SNP data) to assess the potential role of a gene or group of genes in the response of different maize lines to A. flavus infection and subsequent production of aflatoxin by the fungus. CFRAS-DB provides the first opportunity to integrate data pertaining to the problem of A. flavus and aflatoxin resistance in maize in one resource and to support queries across different datasets. The web-based interface gives researchers different query options for mining the database

  19. Using microarrays to identify positional candidate genes for QTL: the case study of ACTH response in pigs.

    Science.gov (United States)

    Jouffe, Vincent; Rowe, Suzanne; Liaubet, Laurence; Buitenhuis, Bart; Hornshøj, Henrik; SanCristobal, Magali; Mormède, Pierre; de Koning, D J

    2009-07-16

    Microarray studies can supplement QTL studies by suggesting potential candidate genes in the QTL regions, which by themselves are too large to provide a limited selection of candidate genes. Here we provide a case study where we explore ways to integrate QTL data and microarray data for the pig, which has only a partial genome sequence. We outline various procedures to localize differentially expressed genes on the pig genome and link this with information on published QTL. The starting point is a set of 237 differentially expressed cDNA clones in adrenal tissue from two pig breeds, before and after treatment with adrenocorticotropic hormone (ACTH). Different approaches to localize the differentially expressed (DE) genes to the pig genome showed different levels of success and a clear lack of concordance for some genes between the various approaches. For a focused analysis on 12 genes, overlapping QTL from the public domain were presented. Also, differentially expressed genes underlying QTL for ACTH response were described. Using the latest version of the draft sequence, the differentially expressed genes were mapped to the pig genome. This enabled co-location of DE genes and previously studied QTL regions, but the draft genome sequence is still incomplete and will contain many errors. A further step to explore links between DE genes and QTL at the pathway level was largely unsuccessful due to the lack of annotation of the pig genome. This could be improved by further comparative mapping analyses but this would be time consuming. This paper provides a case study for the integration of QTL data and microarray data for a species with limited genome sequence information and annotation. The results illustrate the challenges that must be addressed but also provide a roadmap for future work that is applicable to other non-model species.

  20. GeneTopics - interpretation of gene sets via literature-driven topic models

    Science.gov (United States)

    2013-01-01

    Background Annotation of a set of genes is often accomplished through comparison to a library of labelled gene sets such as biological processes or canonical pathways. However, this approach might fail if the employed libraries are not up to date with the latest research, don't capture relevant biological themes or are curated at a different level of granularity than is required to appropriately analyze the input gene set. At the same time, the vast biomedical literature offers an unstructured repository of the latest research findings that can be tapped to provide thematic sub-groupings for any input gene set. Methods Our proposed method relies on a gene-specific text corpus and extracts commonalities between documents in an unsupervised manner using a topic model approach. We automatically determine the number of topics summarizing the corpus and calculate a gene relevancy score for each topic allowing us to eliminate non-specific topics. As a result we obtain a set of literature topics in which each topic is associated with a subset of the input genes providing directly interpretable keywords and corresponding documents for literature research. Results We validate our method based on labelled gene sets from the KEGG metabolic pathway collection and the genetic association database (GAD) and show that the approach is able to detect topics consistent with the labelled annotation. Furthermore, we discuss the results on three different types of experimentally derived gene sets, (1) differentially expressed genes from a cardiac hypertrophy experiment in mice, (2) altered transcript abundance in human pancreatic beta cells, and (3) genes implicated by GWA studies to be associated with metabolite levels in a healthy population. In all three cases, we are able to replicate findings from the original papers in a quick and semi-automated manner. Conclusions Our approach provides a novel way of automatically generating meaningful annotations for gene sets that are directly

  1. Analysis of a positional candidate gene for inflammatory bowel disease: NRAMP2

    NARCIS (Netherlands)

    Stokkers, P. C.; Huibregtse, K.; Leegwater, A. C.; Reitsma, P. H.; Tytgat, G. N.; van Deventer, S. J.

    2000-01-01

    Genome scans have identified a region spanning 40 cM on the long arm of chromosome 12 as a susceptibility locus for inflammatory bowel disease (IBD). This locus contains several candidate genes for IBD, one of which is the gene for the natural resistance associated macrophage protein 2 (NRAMP2).

  2. MAGMA: generalized gene-set analysis of GWAS data.

    Science.gov (United States)

    de Leeuw, Christiaan A; Mooij, Joris M; Heskes, Tom; Posthuma, Danielle

    2015-04-01

    By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn's Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn's Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn's Disease data was found to be considerably faster as well.

  3. Identification of Candidate Genes Responsible for Stem Pith Production Using Expression Analysis in Solid-Stemmed Wheat.

    Science.gov (United States)

    Oiestad, A J; Martin, J M; Cook, J; Varella, A C; Giroux, M J

    2017-07-01

    The wheat stem sawfly (WSS) is an economically important pest of wheat in the Northern Great Plains. The primary means of WSS control is resistance associated with the single quantitative trait locus (QTL) , which controls most stem solidness variation. The goal of this study was to identify stem solidness candidate genes via RNA-seq. This study made use of 28 single nucleotide polymorphism (SNP) makers derived from expressed sequence tags (ESTs) linked to contained within a 5.13 cM region. Allele specific expression of EST markers was examined in stem tissue for solid and hollow-stemmed pairs of two spring wheat near isogenic lines (NILs) differing for the QTL. Of the 28 ESTs, 13 were located within annotated genes and 10 had detectable stem expression. Annotated genes corresponding to four of the ESTs were differentially expressed between solid and hollow-stemmed NILs and represent possible stem solidness gene candidates. Further examination of the 5.13 cM region containing the 28 EST markers identified 260 annotated genes. Twenty of the 260 linked genes were up-regulated in hollow NIL stems, while only seven genes were up-regulated in solid NIL stems. An -methyltransferase within the region of interest was identified as a candidate based on differential expression between solid and hollow-stemmed NILs and putative function. Further study of these candidate genes may lead to the identification of the gene(s) controlling stem solidness and an increased ability to select for wheat stem solidness and manage WSS. Copyright © 2017 Crop Science Society of America.

  4. Associations of candidate genes to age-related macular degeneration among racial/ethnic groups in the multi-ethnic study of atherosclerosis.

    Science.gov (United States)

    Klein, Ronald; Li, Xiaohui; Kuo, Jane Z; Klein, Barbara E K; Cotch, Mary Frances; Wong, Tien Y; Taylor, Kent D; Rotter, Jerome I

    2013-11-01

    To describe the relationships of selected candidate genes to the prevalence of early age-related macular degeneration (AMD) in a cohort of whites, blacks, Hispanics, and Chinese Americans. Cross-sectional study. setting: Multicenter study. study population: A total of 2456 persons aged 45-84 years with genotype information and fundus photographs. procedures: Twelve of 2862 single nucleotide polymorphisms (SNPs) from 11 of 233 candidate genes for cardiovascular disease were selected for analysis based on screening with marginal unadjusted P value ethnic groups. Logistic regression models tested for association in case-control samples. main outcome measure: Prevalence of early AMD. Early AMD was present in 4.0% of the cohort and varied from 2.4% in blacks to 6.0% in whites. The odds ratio increased from 2.3 for 1 to 10.0 for 4 risk alleles in a joint effect analysis of Age-Related Maculopathy Susceptibility 2 rs10490924 and Complement Factor H Y402H (P for trend = 4.2×10(-7)). Frequencies of each SNP varied among the racial/ethnic groups. Adjusting for age and other factors, few statistically significant associations of the 12 SNPs with AMD were consistent across all groups. In a multivariate model, most candidate genes did not attenuate the comparatively higher odds of AMD in whites. The higher frequency of risk alleles for several SNPs in Chinese Americans may partially explain their AMD frequency's approaching that of whites. The relationships of 11 candidate genes to early AMD varied among 4 racial/ethnic groups, and partially explained the observed variations in early AMD prevalence among them. Copyright © 2013 Elsevier Inc. All rights reserved.

  5. A cross-study gene set enrichment analysis identifies critical pathways in endometriosis

    Directory of Open Access Journals (Sweden)

    Bai Chunyan

    2009-09-01

    Full Text Available Abstract Background Endometriosis is an enigmatic disease. Gene expression profiling of endometriosis has been used in several studies, but few studies went further to classify subtypes of endometriosis based on expression patterns and to identify possible pathways involved in endometriosis. Some of the observed pathways are more inconsistent between the studies, and these candidate pathways presumably only represent a fraction of the pathways involved in endometriosis. Methods We applied a standardised microarray preprocessing and gene set enrichment analysis to six independent studies, and demonstrated increased concordance between these gene datasets. Results We find 16 up-regulated and 19 down-regulated pathways common in ovarian endometriosis data sets, 22 up-regulated and one down-regulated pathway common in peritoneal endometriosis data sets. Among them, 12 up-regulated and 1 down-regulated were found consistent between ovarian and peritoneal endometriosis. The main canonical pathways identified are related to immunological and inflammatory disease. Early secretory phase has the most over-represented pathways in the three uterine cycle phases. There are no overlapping significant pathways between the dataset from human endometrial endothelial cells and the datasets from ovarian endometriosis which used whole tissues. Conclusion The study of complex diseases through pathway analysis is able to highlight genes weakly connected to the phenotype which may be difficult to detect by using classical univariate statistics. By standardised microarray preprocessing and GSEA, we have increased the concordance in identifying many biological mechanisms involved in endometriosis. The identified gene pathways will shed light on the understanding of endometriosis and promote the development of novel therapies.

  6. Integrated bioinformatics analysis reveals key candidate genes and pathways in breast cancer.

    Science.gov (United States)

    Wang, Yuzhi; Zhang, Yi; Huang, Qian; Li, Chengwen

    2018-04-19

    Breast cancer (BC) is the leading malignancy in women worldwide, yet relatively little is known about the genes and signaling pathways involved in BC tumorigenesis and progression. The present study aimed to elucidate potential key candidate genes and pathways in BC. Five gene expression profile data sets (GSE22035, GSE3744, GSE5764, GSE21422 and GSE26910) were downloaded from the Gene Expression Omnibus (GEO) database, which included data from 113 tumorous and 38 adjacent non‑tumorous tissue samples. Differentially expressed genes (DEGs) were identified using t‑tests in the limma R package. These DEGs were subsequently investigated by pathway enrichment analysis and a protein‑protein interaction (PPI) network was constructed. The most significant module from the PPI network was selected for pathway enrichment analysis. In total, 227 DEGs were identified, of which 82 were upregulated and 145 were downregulated. Pathway enrichment analysis results revealed that the upregulated DEGs were mainly enriched in 'cell division', the 'proteinaceous extracellular matrix (ECM)', 'ECM structural constituents' and 'ECM‑receptor interaction', whereas downregulated genes were mainly enriched in 'response to drugs', 'extracellular space', 'transcriptional activator activity' and the 'peroxisome proliferator‑activated receptor signaling pathway'. The PPI network contained 174 nodes and 1,257 edges. DNA topoisomerase 2‑a, baculoviral inhibitor of apoptosis repeat‑containing protein 5, cyclin‑dependent kinase 1, G2/mitotic‑specific cyclin‑B1 and kinetochore protein NDC80 homolog were identified as the top 5 hub genes. Furthermore, the genes in the most significant module were predominantly involved in 'mitotic nuclear division', 'mid‑body', 'protein binding' and 'cell cycle'. In conclusion, the DEGs, relative pathways and hub genes identified in the present study may aid in understanding of the molecular mechanisms underlying BC progression and provide

  7. No Association between Personality and Candidate Gene Polymorphisms in a Wild Bird Population.

    Directory of Open Access Journals (Sweden)

    Hannah A Edwards

    Full Text Available Consistency of between-individual differences in behaviour or personality is a phenomenon in populations that can have ecological consequences and evolutionary potential. One way that behaviour can evolve is to have a genetic basis. Identifying the molecular genetic basis of personality could therefore provide insight into how and why such variation is maintained, particularly in natural populations. Previously identified candidate genes for personality in birds include the dopamine receptor D4 (DRD4, and serotonin transporter (SERT. Studies of wild bird populations have shown that exploratory and bold behaviours are associated with polymorphisms in both DRD4 and SERT. Here we tested for polymorphisms in DRD4 and SERT in the Seychelles warbler (Acrocephalus sechellensis population on Cousin Island, Seychelles, and then investigated correlations between personality and polymorphisms in these genes. We found no genetic variation in DRD4, but identified four polymorphisms in SERT that clustered into five haplotypes. There was no correlation between bold or exploratory behaviours and SERT polymorphisms/haplotypes. The null result was not due to lack of power, and indicates that there was no association between these behaviours and variation in the candidate genes tested in this population. These null findings provide important data to facilitate representative future meta-analyses on candidate personality genes.

  8. Association analysis of 94 candidate genes and schizophrenia-related endophenotypes.

    Directory of Open Access Journals (Sweden)

    Tiffany A Greenwood

    Full Text Available While it is clear that schizophrenia is highly heritable, the genetic basis of this heritability is complex. Human genetic, brain imaging, and model organism studies have met with only modest gains. A complementary research tactic is to evaluate the genetic substrates of quantitative endophenotypes with demonstrated deficits in schizophrenia patients. We used an Illumina custom 1,536-SNP array to interrogate 94 functionally relevant candidate genes for schizophrenia and evaluate association with both the qualitative diagnosis of schizophrenia and quantitative endophenotypes for schizophrenia. Subjects included 219 schizophrenia patients and normal comparison subjects of European ancestry and 76 schizophrenia patients and normal comparison subjects of African ancestry, all ascertained by the UCSD Schizophrenia Research Program. Six neurophysiological and neurocognitive endophenotype test paradigms were assessed: prepulse inhibition (PPI, P50 suppression, the antisaccade oculomotor task, the Letter-Number Span Test, the California Verbal Learning Test-II, and the Wisconsin Card Sorting Test-64 Card Version. These endophenotype test paradigms yielded six primary endophenotypes with prior evidence of heritability and demonstrated schizophrenia-related impairments, as well as eight secondary measures investigated as candidate endophenotypes. Schizophrenia patients showed significant deficits on ten of the endophenotypic measures, replicating prior studies and facilitating genetic analyses of these phenotypes. A total of 38 genes were found to be associated with at least one endophenotypic measure or schizophrenia with an empirical p-value<0.01. Many of these genes have been shown to interact on a molecular level, and eleven genes displayed evidence for pleiotropy, revealing associations with three or more endophenotypic measures. Among these genes were ERBB4 and NRG1, providing further support for a role of these genes in schizophrenia susceptibility

  9. Discovery of new candidate genes for rheumatoid arthritis through integration of genetic association data with expression pathway analysis.

    Science.gov (United States)

    Shchetynsky, Klementy; Diaz-Gallo, Lina-Marcella; Folkersen, Lasse; Hensvold, Aase Haj; Catrina, Anca Irinel; Berg, Louise; Klareskog, Lars; Padyukov, Leonid

    2017-02-02

    Here we integrate verified signals from previous genetic association studies with gene expression and pathway analysis for discovery of new candidate genes and signaling networks, relevant for rheumatoid arthritis (RA). RNA-sequencing-(RNA-seq)-based expression analysis of 377 genes from previously verified RA-associated loci was performed in blood cells from 5 newly diagnosed, non-treated patients with RA, 7 patients with treated RA and 12 healthy controls. Differentially expressed genes sharing a similar expression pattern in treated and untreated RA sub-groups were selected for pathway analysis. A set of "connector" genes derived from pathway analysis was tested for differential expression in the initial discovery cohort and validated in blood cells from 73 patients with RA and in 35 healthy controls. There were 11 qualifying genes selected for pathway analysis and these were grouped into two evidence-based functional networks, containing 29 and 27 additional connector molecules. The expression of genes, corresponding to connector molecules was then tested in the initial RNA-seq data. Differences in the expression of ERBB2, TP53 and THOP1 were similar in both treated and non-treated patients with RA and an additional nine genes were differentially expressed in at least one group of patients compared to healthy controls. The ERBB2, TP53. THOP1 expression profile was successfully replicated in RNA-seq data from peripheral blood mononuclear cells from healthy controls and non-treated patients with RA, in an independent collection of samples. Integration of RNA-seq data with findings from association studies, and consequent pathway analysis implicate new candidate genes, ERBB2, TP53 and THOP1 in the pathogenesis of RA.

  10. Evaluation of common genetic variants in 82 candidate genes as risk factors for neural tube defects

    LENUS (Irish Health Repository)

    Pangilinan, Faith

    2012-08-02

    AbstractBackgroundNeural tube defects (NTDs) are common birth defects (~1 in 1000 pregnancies in the US and Europe) that have complex origins, including environmental and genetic factors. A low level of maternal folate is one well-established risk factor, with maternal periconceptional folic acid supplementation reducing the occurrence of NTD pregnancies by 50-70%. Gene variants in the folate metabolic pathway (e.g., MTHFR rs1801133 (677 C > T) and MTHFD1 rs2236225 (R653Q)) have been found to increase NTD risk. We hypothesized that variants in additional folate\\/B12 pathway genes contribute to NTD risk.MethodsA tagSNP approach was used to screen common variation in 82 candidate genes selected from the folate\\/B12 pathway and NTD mouse models. We initially genotyped polymorphisms in 320 Irish triads (NTD cases and their parents), including 301 cases and 341 Irish controls to perform case–control and family based association tests. Significantly associated polymorphisms were genotyped in a secondary set of 250 families that included 229 cases and 658 controls. The combined results for 1441 SNPs were used in a joint analysis to test for case and maternal effects.ResultsNearly 70 SNPs in 30 genes were found to be associated with NTDs at the p < 0.01 level. The ten strongest association signals (p-value range: 0.0003–0.0023) were found in nine genes (MFTC, CDKN2A, ADA, PEMT, CUBN, GART, DNMT3A, MTHFD1 and T (Brachyury)) and included the known NTD risk factor MTHFD1 R653Q (rs2236225). The single strongest signal was observed in a new candidate, MFTC rs17803441 (OR = 1.61 [1.23-2.08], p = 0.0003 for the minor allele). Though nominally significant, these associations did not remain significant after correction for multiple hypothesis testing.ConclusionsTo our knowledge, with respect to sample size and scope of evaluation of candidate polymorphisms, this is the largest NTD genetic association study reported to date. The scale of the study and the

  11. Screening key candidate genes and pathways involved in insulinoma by microarray analysis.

    Science.gov (United States)

    Zhou, Wuhua; Gong, Li; Li, Xuefeng; Wan, Yunyan; Wang, Xiangfei; Li, Huili; Jiang, Bin

    2018-06-01

    Insulinoma is a rare type tumor and its genetic features remain largely unknown. This study aimed to search for potential key genes and relevant enriched pathways of insulinoma.The gene expression data from GSE73338 were downloaded from Gene Expression Omnibus database. Differentially expressed genes (DEGs) were identified between insulinoma tissues and normal pancreas tissues, followed by pathway enrichment analysis, protein-protein interaction (PPI) network construction, and module analysis. The expressions of candidate key genes were validated by quantitative real-time polymerase chain reaction (RT-PCR) in insulinoma tissues.A total of 1632 DEGs were obtained, including 1117 upregulated genes and 514 downregulated genes. Pathway enrichment results showed that upregulated DEGs were significantly implicated in insulin secretion, and downregulated DEGs were mainly enriched in pancreatic secretion. PPI network analysis revealed 7 hub genes with degrees more than 10, including GCG (glucagon), GCGR (glucagon receptor), PLCB1 (phospholipase C, beta 1), CASR (calcium sensing receptor), F2R (coagulation factor II thrombin receptor), GRM1 (glutamate metabotropic receptor 1), and GRM5 (glutamate metabotropic receptor 5). DEGs involved in the significant modules were enriched in calcium signaling pathway, protein ubiquitination, and platelet degranulation. Quantitative RT-PCR data confirmed that the expression trends of these hub genes were similar to the results of bioinformatic analysis.The present study demonstrated that candidate DEGs and enriched pathways were the potential critical molecule events involved in the development of insulinoma, and these findings were useful for better understanding of insulinoma genesis.

  12. Computerized detection of multiple sclerosis candidate regions based on a level set method using an artificial neural network

    International Nuclear Information System (INIS)

    Kuwazuru, Junpei; Magome, Taiki; Arimura, Hidetaka; Yamashita, Yasuo; Oki, Masafumi; Toyofuku, Fukai; Kakeda, Shingo; Yamamoto, Daisuke

    2010-01-01

    Yamamoto et al. developed the system for computer-aided detection of multiple sclerosis (MS) candidate regions. In a level set method in their proposed method, they employed the constant threshold value for the edge indicator function related to a speed function of the level set method. However, it would be appropriate to adjust the threshold value to each MS candidate region, because the edge magnitudes in MS candidates differ from each other. Our purpose of this study was to develop a computerized detection of MS candidate regions in MR images based on a level set method using an artificial neural network (ANN). To adjust the threshold value for the edge indicator function in the level set method to each true positive (TP) and false positive (FP) region, we constructed the ANN. The ANN could provide the suitable threshold value for each candidate region in the proposed level set method so that TP regions can be segmented and FP regions can be removed. Our proposed method detected MS regions at a sensitivity of 82.1% with 0.204 FPs per slice and similarity index of MS candidate regions was 0.717 on average. (author)

  13. Integration of gene-based markers in a pearl millet genetic map for identification of candidate genes underlying drought tolerance quantitative trait loci

    Directory of Open Access Journals (Sweden)

    Sehgal Deepmala

    2012-01-01

    Full Text Available Abstract Background Identification of genes underlying drought tolerance (DT quantitative trait loci (QTLs will facilitate understanding of molecular mechanisms of drought tolerance, and also will accelerate genetic improvement of pearl millet through marker-assisted selection. We report a map based on genes with assigned functional roles in plant adaptation to drought and other abiotic stresses and demonstrate its use in identifying candidate genes underlying a major DT-QTL. Results Seventy five single nucleotide polymorphism (SNP and conserved intron spanning primer (CISP markers were developed from available expressed sequence tags (ESTs using four genotypes, H 77/833-2, PRLT 2/89-33, ICMR 01029 and ICMR 01004, representing parents of two mapping populations. A total of 228 SNPs were obtained from 30.5 kb sequenced region resulting in a SNP frequency of 1/134 bp. The positions of major pearl millet linkage group (LG 2 DT-QTLs (reported from crosses H 77/833-2 × PRLT 2/89-33 and 841B × 863B were added to the present consensus function map which identified 18 genes, coding for PSI reaction center subunit III, PHYC, actin, alanine glyoxylate aminotransferase, uridylate kinase, acyl-CoA oxidase, dipeptidyl peptidase IV, MADS-box, serine/threonine protein kinase, ubiquitin conjugating enzyme, zinc finger C- × 8-C × 5-C × 3-H type, Hd3, acetyl CoA carboxylase, chlorophyll a/b binding protein, photolyase, protein phosphatase1 regulatory subunit SDS22 and two hypothetical proteins, co-mapping in this DT-QTL interval. Many of these candidate genes were found to have significant association with QTLs of grain yield, flowering time and leaf rolling under drought stress conditions. Conclusions We have exploited available pearl millet EST sequences to generate a mapped resource of seventy five new gene-based markers for pearl millet and demonstrated its use in identifying candidate genes underlying a major DT-QTL in this species. The reported gene

  14. Candidate genes revealed by a genome scan for mosquito resistance to a bacterial insecticide: sequence and gene expression variations

    Directory of Open Access Journals (Sweden)

    David Jean-Philippe

    2009-11-01

    Full Text Available Abstract Background Genome scans are becoming an increasingly popular approach to study the genetic basis of adaptation and speciation, but on their own, they are often helpless at identifying the specific gene(s or mutation(s targeted by selection. This shortcoming is hopefully bound to disappear in the near future, thanks to the wealth of new genomic resources that are currently being developed for many species. In this article, we provide a foretaste of this exciting new era by conducting a genome scan in the mosquito Aedes aegypti with the aim to look for candidate genes involved in resistance to Bacillus thuringiensis subsp. israelensis (Bti insecticidal toxins. Results The genome of a Bti-resistant and a Bti-susceptible strains was surveyed using about 500 MITE-based molecular markers, and the loci showing the highest inter-strain genetic differentiation were sequenced and mapped on the Aedes aegypti genome sequence. Several good candidate genes for Bti-resistance were identified in the vicinity of these highly differentiated markers. Two of them, coding for a cadherin and a leucine aminopeptidase, were further examined at the sequence and gene expression levels. In the resistant strain, the cadherin gene displayed patterns of nucleotide polymorphisms consistent with the action of positive selection (e.g. an excess of high compared to intermediate frequency mutations, as well as a significant under-expression compared to the susceptible strain. Conclusion Both sequence and gene expression analyses agree to suggest a role for positive selection in the evolution of this cadherin gene in the resistant strain. However, it is unlikely that resistance to Bti is conferred by this gene alone, and further investigation will be needed to characterize other genes significantly associated with Bti resistance in Ae. aegypti. Beyond these results, this article illustrates how genome scans can build on the body of new genomic information (here, full

  15. Genomic dissection and prioritizing of candidate genes of QTL for ...

    Indian Academy of Sciences (India)

    Genomic dissection and prioritizing of candidate genes of QTL for regulating spontaneous arthritis on chromosome 1 in mice deficient for interleukin-1 receptor antagonist. Yanhong Cao, Jifei Zhang, Yan Jiao, Jian Yan, Feng Jiao, XiaoYun Liu, Robert W. Williams, Karen A. Hasty,. John M. Stuart and Weikuan Gu. J. Genet.

  16. Genome-Wide Association Study with Sequence Variants Identifies Candidate Genes for Mastitis Resistance in Dairy Cattle

    DEFF Research Database (Denmark)

    Sahana, Goutam; Guldbrandtsen, Bernt; Bendixen, Christian

    Six genomic regions affecting clinical mastitis were identified through a GWAS study with imputed BovineHD chip genotype data in the Nordic Holstein cattle population. The association analyses were carried out using a SNP-by-SNP analysis by fitting the regression of allele dosage and a polygenic...... Effect Predictor (VEP) vers. 2.6 using ENSEMBL vers. 67 databases. Candidate polymorphisms affecting clinical mastitis were selected based on their association with the traits and functional annotations. A strong positional candidate gene for mastitis resistance on chromosome-6 is the NPFFR2 which...... Factor Receptor Alpha (LIFR) emerged as a strong candidate gene for mastitis resistance. The LIFR gene is involved in acute phase response and is expressed in saliva and mammary gland....

  17. Validation of candidate genes associated with cardiovascular risk factors in psychiatric patients

    Science.gov (United States)

    Windemuth, Andreas; de Leon, Jose; Goethe, John W.; Schwartz, Harold I.; Woolley, Stephen; Susce, Margaret; Kocherla, Mohan; Bogaard, Kali; Holford, Theodore R.; Seip, Richard L.; Ruaño, Gualberto

    2016-01-01

    The purpose of this study was to identify genetic variants predictive of cardiovascular risk factors in a psychiatric population treated with second generation antipsychotics (SGA). 924 patients undergoing treatment for severe mental illness at four US hospitals were genotyped at 1.2 million single nucleotide polymorphisms. Patients were assessed for fasting serum lipid (low density lipoprotein cholesterol [LDLc], high density lipoprotein cholesterol [HDLc], and triglycerides) and obesity phenotypes (body mass index, BMI). Thirteen candidate genes from previous studies of the same phenotypes in non-psychiatric populations were tested for association. We confirmed 8 of the 13 candidate genes at the 95% confidence level. An increased genetic effect size was observed for triglycerides in the psychiatric population compared to that in the cardiovascular population. PMID:21851846

  18. Selection and Validation of Reference Genes for qRT-PCR Expression Analysis of Candidate Genes Involved in Olfactory Communication in the Butterfly Bicyclus anynana

    OpenAIRE

    Arun, Alok; Bauml?, V?ronique; Amelot, Ga?l; Nieberding, Caroline M.

    2015-01-01

    Real-time quantitative reverse transcription PCR (qRT-PCR) is a technique widely used to quantify the transcriptional expression level of candidate genes. qRT-PCR requires the selection of one or several suitable reference genes, whose expression profiles remain stable across conditions, to normalize the qRT-PCR expression profiles of candidate genes. Although several butterfly species (Lepidoptera) have become important models in molecular evolutionary ecology, so far no study aimed at ident...

  19. Gene-set analysis based on the pharmacological profiles of drugs to identify repurposing opportunities in schizophrenia.

    Science.gov (United States)

    de Jong, Simone; Vidler, Lewis R; Mokrab, Younes; Collier, David A; Breen, Gerome

    2016-08-01

    Genome-wide association studies (GWAS) have identified thousands of novel genetic associations for complex genetic disorders, leading to the identification of potential pharmacological targets for novel drug development. In schizophrenia, 108 conservatively defined loci that meet genome-wide significance have been identified and hundreds of additional sub-threshold associations harbour information on the genetic aetiology of the disorder. In the present study, we used gene-set analysis based on the known binding targets of chemical compounds to identify the 'drug pathways' most strongly associated with schizophrenia-associated genes, with the aim of identifying potential drug repositioning opportunities and clues for novel treatment paradigms, especially in multi-target drug development. We compiled 9389 gene sets (2496 with unique gene content) and interrogated gene-based p-values from the PGC2-SCZ analysis. Although no single drug exceeded experiment wide significance (corrected pneratinib. This is a proof of principle analysis showing the potential utility of GWAS data of schizophrenia for the direct identification of candidate drugs and molecules that show polypharmacy. © The Author(s) 2016.

  20. A cross-species genetic analysis identifies candidate genes for mouse anxiety and human bipolar disorder

    Directory of Open Access Journals (Sweden)

    David G Ashbrook

    2015-07-01

    Full Text Available Bipolar disorder (BD is a significant neuropsychiatric disorder with a lifetime prevalence of ~1%. To identify genetic variants underlying BD genome-wide association studies (GWAS have been carried out. While many variants of small effect associated with BD have been identified few have yet been confirmed, partly because of the low power of GWAS due to multiple comparisons being made. Complementary mapping studies using murine models have identified genetic variants for behavioral traits linked to BD, often with high power, but these identified regions often contain too many genes for clear identification of candidate genes. In the current study we have aligned human BD GWAS results and mouse linkage studies to help define and evaluate candidate genes linked to BD, seeking to use the power of the mouse mapping with the precision of GWAS. We use quantitative trait mapping for open field test and elevated zero maze data in the largest mammalian model system, the BXD recombinant inbred mouse population, to identify genomic regions associated with these BD-like phenotypes. We then investigate these regions in whole genome data from the Psychiatric Genomics Consortium’s bipolar disorder GWAS to identify candidate genes associated with BD. Finally we establish the biological relevance and pathways of these genes in a comprehensive systems genetics analysis.We identify four genes associated with both mouse anxiety and human BD. While TNR is a novel candidate for BD, we can confirm previously suggested associations with CMYA5, MCTP1 and RXRG. A cross-species, systems genetics analysis shows that MCTP1, RXRG and TNR coexpress with genes linked to psychiatric disorders and identify the striatum as a potential site of action. CMYA5, MCTP1, RXRG and TNR are associated with mouse anxiety and human BD. We hypothesize that MCTP1, RXRG and TNR influence intercellular signaling in the striatum.

  1. Convergent functional genomics in addiction research - a translational approach to study candidate genes and gene networks.

    Science.gov (United States)

    Spanagel, Rainer

    2013-01-01

    Convergent functional genomics (CFG) is a translational methodology that integrates in a Bayesian fashion multiple lines of evidence from studies in human and animal models to get a better understanding of the genetics of a disease or pathological behavior. Here the integration of data sets that derive from forward genetics in animals and genetic association studies including genome wide association studies (GWAS) in humans is described for addictive behavior. The aim of forward genetics in animals and association studies in humans is to identify mutations (e.g. SNPs) that produce a certain phenotype; i.e. "from phenotype to genotype". Most powerful in terms of forward genetics is combined quantitative trait loci (QTL) analysis and gene expression profiling in recombinant inbreed rodent lines or genetically selected animals for a specific phenotype, e.g. high vs. low drug consumption. By Bayesian scoring genomic information from forward genetics in animals is then combined with human GWAS data on a similar addiction-relevant phenotype. This integrative approach generates a robust candidate gene list that has to be functionally validated by means of reverse genetics in animals; i.e. "from genotype to phenotype". It is proposed that studying addiction relevant phenotypes and endophenotypes by this CFG approach will allow a better determination of the genetics of addictive behavior.

  2. Glutamatergic and GABAergic gene sets in attention-deficit/hyperactivity disorder: association to overlapping traits in ADHD and autism.

    Science.gov (United States)

    Naaijen, J; Bralten, J; Poelmans, G; Glennon, J C; Franke, B; Buitelaar, J K

    2017-01-10

    Attention-deficit/hyperactivity disorder (ADHD) and autism spectrum disorders (ASD) often co-occur. Both are highly heritable; however, it has been difficult to discover genetic risk variants. Glutamate and GABA are main excitatory and inhibitory neurotransmitters in the brain; their balance is essential for proper brain development and functioning. In this study we investigated the role of glutamate and GABA genetics in ADHD severity, autism symptom severity and inhibitory performance, based on gene set analysis, an approach to investigate multiple genetic variants simultaneously. Common variants within glutamatergic and GABAergic genes were investigated using the MAGMA software in an ADHD case-only sample (n=931), in which we assessed ASD symptoms and response inhibition on a Stop task. Gene set analysis for ADHD symptom severity, divided into inattention and hyperactivity/impulsivity symptoms, autism symptom severity and inhibition were performed using principal component regression analyses. Subsequently, gene-wide association analyses were performed. The glutamate gene set showed an association with severity of hyperactivity/impulsivity (P=0.009), which was robust to correcting for genome-wide association levels. The GABA gene set showed nominally significant association with inhibition (P=0.04), but this did not survive correction for multiple comparisons. None of single gene or single variant associations was significant on their own. By analyzing multiple genetic variants within candidate gene sets together, we were able to find genetic associations supporting the involvement of excitatory and inhibitory neurotransmitter systems in ADHD and ASD symptom severity in ADHD.

  3. Natural Genetic Variation and Candidate Genes for Morphological Traits in Drosophila melanogaster

    Science.gov (United States)

    Carreira, Valeria Paula; Mensch, Julián; Hasson, Esteban; Fanara, Juan José

    2016-01-01

    Body size is a complex character associated to several fitness related traits that vary within and between species as a consequence of environmental and genetic factors. Latitudinal and altitudinal clines for different morphological traits have been described in several species of Drosophila and previous work identified genomic regions associated with such variation in D. melanogaster. However, the genetic factors that orchestrate morphological variation have been barely studied. Here, our main objective was to investigate genetic variation for different morphological traits associated to the second chromosome in natural populations of D. melanogaster along latitudinal and altitudinal gradients in Argentina. Our results revealed weak clinal signals and a strong population effect on morphological variation. Moreover, most pairwise comparisons between populations were significant. Our study also showed important within-population genetic variation, which must be associated to the second chromosome, as the lines are otherwise genetically identical. Next, we examined the contribution of different candidate genes to natural variation for these traits. We performed quantitative complementation tests using a battery of lines bearing mutated alleles at candidate genes located in the second chromosome and six second chromosome substitution lines derived from natural populations which exhibited divergent phenotypes. Results of complementation tests revealed that natural variation at all candidate genes studied, invected, Fasciclin 3, toucan, Reticulon-like1, jing and CG14478, affects the studied characters, suggesting that they are Quantitative Trait Genes for morphological traits. Finally, the phenotypic patterns observed suggest that different alleles of each gene might contribute to natural variation for morphological traits. However, non-additive effects cannot be ruled out, as wild-derived strains differ at myriads of second chromosome loci that may interact

  4. Genetics of human longevity with emphasis on the relevance of HSP70 as candidate genes

    DEFF Research Database (Denmark)

    Singh, Ripudaman; Kølvrå, Steen; Rattan, Suresh I S

    2007-01-01

    Human longevity is determined to a certain extent by genetic factors. Several candidate genes have been studied for their association with human longevity, but the data collected so far are inconclusive. One of the reasons is the choice of the candidate genes in addition to the choice...... of an appropriate study design and methodology. Since aging is characterized by a progressive accumulation of molecular damage and an attenuation of the cellular defense mechanisms, the focus of studies on human longevity association with genes has now shifted to the pathways of cellular maintenance and repair...... mechanisms. One such pathway includes the battery of stress response genes, especially the heat shock protein HSP70 genes. Three such genes, HSPA1A, HSPA1B and HSPA1L, are present within the MHC-III region on the short arm of chromosome 6. We and others have found alleles, genotypes and haplotypes which have...

  5. Integrative analysis of survival-associated gene sets in breast cancer.

    Science.gov (United States)

    Varn, Frederick S; Ung, Matthew H; Lou, Shao Ke; Cheng, Chao

    2015-03-12

    Patient gene expression information has recently become a clinical feature used to evaluate breast cancer prognosis. The emergence of prognostic gene sets that take advantage of these data has led to a rich library of information that can be used to characterize the molecular nature of a patient's cancer. Identifying robust gene sets that are consistently predictive of a patient's clinical outcome has become one of the main challenges in the field. We inputted our previously established BASE algorithm with patient gene expression data and gene sets from MSigDB to develop the gene set activity score (GSAS), a metric that quantitatively assesses a gene set's activity level in a given patient. We utilized this metric, along with patient time-to-event data, to perform survival analyses to identify the gene sets that were significantly correlated with patient survival. We then performed cross-dataset analyses to identify robust prognostic gene sets and to classify patients by metastasis status. Additionally, we created a gene set network based on component gene overlap to explore the relationship between gene sets derived from MSigDB. We developed a novel gene set based on this network's topology and applied the GSAS metric to characterize its role in patient survival. Using the GSAS metric, we identified 120 gene sets that were significantly associated with patient survival in all datasets tested. The gene overlap network analysis yielded a novel gene set enriched in genes shared by the robustly predictive gene sets. This gene set was highly correlated to patient survival when used alone. Most interestingly, removal of the genes in this gene set from the gene pool on MSigDB resulted in a large reduction in the number of predictive gene sets, suggesting a prominent role for these genes in breast cancer progression. The GSAS metric provided a useful medium by which we systematically investigated how gene sets from MSigDB relate to breast cancer patient survival. We used

  6. Candidate genes expressed in human islets and their role in the pathogenesis of type 1 diabetes

    DEFF Research Database (Denmark)

    Storling, Joachim; Brorsson, Caroline Anna

    2013-01-01

    In type 1 diabetes (T1D), the insulin-producing β cells are destroyed by an immune-mediated process leading to complete insulin deficiency. There is a strong genetic component in T1D. Genes located in the human leukocyte antigen (HLA) region are the most important genetic determinants of disease......, but more than 40 additional loci are known to significantly affect T1D risk. Since most of the currently known genetic candidates have annotated immune cell functions, it is generally considered that most of the genetic susceptibility in T1D is caused by variation in genes affecting immune cell function....... Recent studies, however, indicate that most T1D candidate genes are expressed in human islets suggesting that the functions of the genes are not restricted to immune cells, but also play roles in the islets and possibly the β cells. Several candidates change expression levels within the islets following...

  7. Photoreceptor dysplasia (pd) in miniature schnauzer dogs: evaluation of candidate genes by molecular genetic analysis.

    Science.gov (United States)

    Zhang, Q; Baldwin, V J; Acland, G M; Parshall, C J; Haskel, J; Aguirre, G D; Ray, K

    1999-01-01

    Photoreceptor dysplasia (pd) is one of a group of at least six distinct autosomal and one X-linked retinal disorders identified in dogs which are collectively known as progressive retinal atrophy (PRA). It is an early onset retinal disease identified in miniature schnauzer dogs, and pedigree analysis and breeding studies have established autosomal recessive inheritance of the disease. Using a gene-based approach, a number of retina-expressed genes, including some members of the phototransduction pathway, have been causally implicated in retinal diseases of humans and other animals. Here we examined seven such potential candidate genes (opsin, RDS/peripherin, ROM1, rod cGMP-gated cation channel alpha-subunit, and three subunits of transducin) for their causal association with the pd locus by testing segregation of intragenic markers with the disease locus, or, in the absence of informative polymorphisms, sequencing of the coding regions of the genes. Based on these results, we have conclusively excluded four photoreceptor-specific genes as candidates for pd by linkage analysis. For three other photoreceptor-specific genes, we did not find any mutation in the coding sequences of the genes and have excluded them provisionally. Formal exclusion would require investigation of the levels of expression of the candidate genes in pd-affected dogs relative to age-matched controls. At present we are building suitable informative pedigrees for the disease locus with a sufficient number of meiosis to be useful for genomewide screening. This should identify markers linked to the disease locus and eventually permit progress toward the identification of the photoreceptor dysplasia gene and the disease-causing mutation.

  8. Upper-Lower Bounds Candidate Sets Searching Algorithm for Bayesian Network Structure Learning

    Directory of Open Access Journals (Sweden)

    Guangyi Liu

    2014-01-01

    Full Text Available Bayesian network is an important theoretical model in artificial intelligence field and also a powerful tool for processing uncertainty issues. Considering the slow convergence speed of current Bayesian network structure learning algorithms, a fast hybrid learning method is proposed in this paper. We start with further analysis of information provided by low-order conditional independence testing, and then two methods are given for constructing graph model of network, which is theoretically proved to be upper and lower bounds of the structure space of target network, so that candidate sets are given as a result; after that a search and scoring algorithm is operated based on the candidate sets to find the final structure of the network. Simulation results show that the algorithm proposed in this paper is more efficient than similar algorithms with the same learning precision.

  9. Expression studies of the obesity candidate gene FTO in pig

    DEFF Research Database (Denmark)

    Madsen, Majbritt Busk; Birck, Malene Muusfeldt; Fredholm, Merete

    2010-01-01

    Obesity is an increasing problem worldwide and research on candidate genes in good animal models is highly needed. The pig is an excellent model as its metabolism, organ size, and eating habits resemble that of humans. The present study is focused on the characterization of the fat mass and obesity...... associated gene (FTO) in pig. This gene has recently been associated with increased body mass index in several human populations. To establish information on the expression profile of FTO in the pig we performed quantitative PCR in a panel of adult pig tissues and in tissues sampled at different...... and cerebellum). Additionally, in order to see the involvement of the FTO gene in obesity, the changes in expression level were investigated in a nutritional study in brain of Gottingen minipigs under a high cholesterol diet. Significantly higher (P

  10. Candidate gene association analyses for ketosis resistance in Holsteins.

    Science.gov (United States)

    Kroezen, V; Schenkel, F S; Miglior, F; Baes, C F; Squires, E J

    2018-06-01

    High-yielding dairy cattle are susceptible to ketosis, a metabolic disease that negatively affects the health, fertility, and milk production of the cow. Interest in breeding for more robust dairy cattle with improved resistance to disease is global; however, genetic evaluations for ketosis would benefit from the additional information provided by genetic markers. Candidate genes that are proposed to have a biological role in the pathogenesis of ketosis were investigated in silico and a custom panel of 998 putative single nucleotide polymorphism (SNP) markers was developed. The objective of this study was to test the associations of these new markers with deregressed estimated breeding values (EBV) for ketosis. A sample of 653 Canadian Holstein cows that had been previously genotyped with a medium-density SNP chip were regenotyped with the custom panel. The EBV for ketosis in first and later lactations were obtained for each animal and deregressed for use as pseudo-phenotypes for association analyses. Results of the mixed inheritance model for single SNP association analyses suggested 15 markers in 6 unique candidate genes were associated with the studied trait. Genes encoding proteins involved in metabolic processes, including the synthesis and degradation of fatty acids and ketone bodies, gluconeogenesis, lipid mobilization, and the citric acid cycle, were identified to contain SNP associated with ketosis resistance. This work confirmed the presence of previously described quantitative trait loci for dairy cattle, suggested novel markers for ketosis-resistance, and provided insight into the underlying biology of this disease. Copyright © 2018 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  11. Evaluation of common genetic variants in 82 candidate genes as risk factors for neural tube defects

    Directory of Open Access Journals (Sweden)

    Pangilinan Faith

    2012-08-01

    Full Text Available Abstract Background Neural tube defects (NTDs are common birth defects (~1 in 1000 pregnancies in the US and Europe that have complex origins, including environmental and genetic factors. A low level of maternal folate is one well-established risk factor, with maternal periconceptional folic acid supplementation reducing the occurrence of NTD pregnancies by 50-70%. Gene variants in the folate metabolic pathway (e.g., MTHFR rs1801133 (677 C > T and MTHFD1 rs2236225 (R653Q have been found to increase NTD risk. We hypothesized that variants in additional folate/B12 pathway genes contribute to NTD risk. Methods A tagSNP approach was used to screen common variation in 82 candidate genes selected from the folate/B12 pathway and NTD mouse models. We initially genotyped polymorphisms in 320 Irish triads (NTD cases and their parents, including 301 cases and 341 Irish controls to perform case–control and family based association tests. Significantly associated polymorphisms were genotyped in a secondary set of 250 families that included 229 cases and 658 controls. The combined results for 1441 SNPs were used in a joint analysis to test for case and maternal effects. Results Nearly 70 SNPs in 30 genes were found to be associated with NTDs at the p MFTC, CDKN2A, ADA, PEMT, CUBN, GART, DNMT3A, MTHFD1 and T (Brachyury and included the known NTD risk factor MTHFD1 R653Q (rs2236225. The single strongest signal was observed in a new candidate, MFTC rs17803441 (OR = 1.61 [1.23-2.08], p = 0.0003 for the minor allele. Though nominally significant, these associations did not remain significant after correction for multiple hypothesis testing. Conclusions To our knowledge, with respect to sample size and scope of evaluation of candidate polymorphisms, this is the largest NTD genetic association study reported to date. The scale of the study and the stringency of correction are likely to have contributed to real associations failing to survive

  12. Characterization of Gene Candidates for Vacuolar Sodium Transport from Hordeum Vulgare

    KAUST Repository

    Scheu, Arne Hagen August

    2017-01-01

    Various potential causes are discussed, including inaccuracies in the genome resource used as reference for primer design and issues inherent to the model system. Finally, I make suggestions on how to proceed to further characterize the candidate genes and hopefully identify novel sodium transporters from barley.

  13. A Genome-Wide Association Study on the Seedless Phenotype in Banana (Musa spp. Reveals the Potential of a Selected Panel to Detect Candidate Genes in a Vegetatively Propagated Crop.

    Directory of Open Access Journals (Sweden)

    Julie Sardos

    Full Text Available Banana (Musa sp. is a vegetatively propagated, low fertility, potentially hybrid and polyploid crop. These qualities make the breeding and targeted genetic improvement of this crop a difficult and long process. The Genome-Wide Association Study (GWAS approach is becoming widely used in crop plants and has proven efficient to detecting candidate genes for traits of interest, especially in cereals. GWAS has not been applied yet to a vegetatively propagated crop. However, successful GWAS in banana would considerably help unravel the genomic basis of traits of interest and therefore speed up this crop improvement. We present here a dedicated panel of 105 accessions of banana, freely available upon request, and their corresponding GBS data. A set of 5,544 highly reliable markers revealed high levels of admixture in most accessions, except for a subset of 33 individuals from Papua. A GWAS on the seedless phenotype was then successfully applied to the panel. By applying the Mixed Linear Model corrected for both kinship and structure as implemented in TASSEL, we detected 13 candidate genomic regions in which we found a number of genes potentially linked with the seedless phenotype (i.e. parthenocarpy combined with female sterility. An additional GWAS performed on the unstructured Papuan subset composed of 33 accessions confirmed six of these regions as candidate. Out of both sets of analyses, one strong candidate gene for female sterility, a putative orthologous gene to Histidine Kinase CKI1, was identified. The results presented here confirmed the feasibility and potential of GWAS when applied to small sets of banana accessions, at least for traits underpinned by a few loci. As phenotyping in banana is extremely space and time-consuming, this latest finding is of particular importance in the context of banana improvement.

  14. A Genome-Wide Association Study on the Seedless Phenotype in Banana (Musa spp.) Reveals the Potential of a Selected Panel to Detect Candidate Genes in a Vegetatively Propagated Crop.

    Science.gov (United States)

    Sardos, Julie; Rouard, Mathieu; Hueber, Yann; Cenci, Alberto; Hyma, Katie E; van den Houwe, Ines; Hribova, Eva; Courtois, Brigitte; Roux, Nicolas

    2016-01-01

    Banana (Musa sp.) is a vegetatively propagated, low fertility, potentially hybrid and polyploid crop. These qualities make the breeding and targeted genetic improvement of this crop a difficult and long process. The Genome-Wide Association Study (GWAS) approach is becoming widely used in crop plants and has proven efficient to detecting candidate genes for traits of interest, especially in cereals. GWAS has not been applied yet to a vegetatively propagated crop. However, successful GWAS in banana would considerably help unravel the genomic basis of traits of interest and therefore speed up this crop improvement. We present here a dedicated panel of 105 accessions of banana, freely available upon request, and their corresponding GBS data. A set of 5,544 highly reliable markers revealed high levels of admixture in most accessions, except for a subset of 33 individuals from Papua. A GWAS on the seedless phenotype was then successfully applied to the panel. By applying the Mixed Linear Model corrected for both kinship and structure as implemented in TASSEL, we detected 13 candidate genomic regions in which we found a number of genes potentially linked with the seedless phenotype (i.e. parthenocarpy combined with female sterility). An additional GWAS performed on the unstructured Papuan subset composed of 33 accessions confirmed six of these regions as candidate. Out of both sets of analyses, one strong candidate gene for female sterility, a putative orthologous gene to Histidine Kinase CKI1, was identified. The results presented here confirmed the feasibility and potential of GWAS when applied to small sets of banana accessions, at least for traits underpinned by a few loci. As phenotyping in banana is extremely space and time-consuming, this latest finding is of particular importance in the context of banana improvement.

  15. Epidermal growth factor gene is a newly identified candidate gene for gout.

    Science.gov (United States)

    Han, Lin; Cao, Chunwei; Jia, Zhaotong; Liu, Shiguo; Liu, Zhen; Xin, Ruosai; Wang, Can; Li, Xinde; Ren, Wei; Wang, Xuefeng; Li, Changgui

    2016-08-10

    Chromosome 4q25 has been identified as a genomic region associated with gout. However, the associations of gout with the genes in this region have not yet been confirmed. Here, we performed two-stage analysis to determine whether variations in candidate genes in the 4q25 region are associated with gout in a male Chinese Han population. We first evaluated 96 tag single nucleotide polymorphisms (SNPs) in eight inflammatory/immune pathway- or glucose/lipid metabolism-related genes in the 4q25 region in 480 male gout patients and 480 controls. The SNP rs12504538, located in the elongation of very-long-chain-fatty-acid-like family member 6 gene (Elovl6), was found to be associated with gout susceptibility (Padjusted = 0.00595). In the second stage of analysis, we performed fine mapping analysis of 93 tag SNPs in Elovl6 and in the epidermal growth factor gene (EGF) and its flanking regions in 1017 male patients gout and 1897 healthy male controls. We observed a significant association between the T allele of EGF rs2298999 and gout (odds ratio = 0.77, 95% confidence interval = 0.67-0.88, Padjusted = 6.42 × 10(-3)). These results provide the first evidence for an association between the EGF rs2298999 C/T polymorphism and gout. Our findings should be validated in additional populations.

  16. Investigation of the molecular relationship between breast cancer and obesity by candidate gene prioritization methods

    Directory of Open Access Journals (Sweden)

    Saba Garshasbi

    2015-10-01

    Full Text Available Background: Cancer and obesity are two major public health concerns. More than 12 million cases of cancer are reported annually. Many reports confirmed obesity as a risk factor for cancer. The molecular relationship between obesity and breast cancer has not been clear yet. The purpose of this study was to investigate priorities of effective genes in the molecular relationship between obesity and breast cancer. Methods: In this study, computer simulation method was used for prioritizing the genes that involved in the molecular links between obesity and breast cancer in laboratory of systems biology and bioinformatics (LBB, Tehran University, Tehran, Iran, from March to July 2014. In this study, ENDEAVOUR software was used for prioritizing the genes and integrating multiple data sources was used for data analysis. Training genes were selected from effective genes in obesity and/or breast cancer. Two groups of candidate genes were selected. The first group was included the existential genes in 5 common region chromosomes (between obesity and breast cancer and the second group was included the results of genes microarray data analysis of research Creighton, et al (In 2012 on patients with breast cancer. The microarray data were analyzed with GER2 software (R online software on GEO website. Finally, both training and candidate genes were entered in ENDEAVOUR software package. Results: The candidate genes were prioritized to four style and five genes in ten of the first priorities were repeated twice. In other word, the outcome of prioritizing of 72 genes (Product of microarray data analysis and genes of 5 common chromosome regions (Between obesity and breast cancer showed, 5 genes (TNFRSF10B, F2, IGFALS, NTRK3 and HSP90B1 were the priorities in the molecular connection between obesity and breast cancer. Conclusion: There are some common genes between breast cancer and obesity. So, molecular relationship is confirmed. In this study the possible effect

  17. Identification of novel candidate target genes in amplicons of Glioblastoma multiforme tumors detected by expression and CGH microarray profiling

    Directory of Open Access Journals (Sweden)

    Hernández-Moneo Jose-Luis

    2006-09-01

    Full Text Available Abstract Background Conventional cytogenetic and comparative genomic hybridization (CGH studies in brain malignancies have shown that glioblastoma multiforme (GBM is characterized by complex structural and numerical alterations. However, the limited resolution of these techniques has precluded the precise identification of detailed specific gene copy number alterations. Results We performed a genome-wide survey of gene copy number changes in 20 primary GBMs by CGH on cDNA microarrays. A novel amplicon at 4p15, and previously uncharacterized amplicons at 13q32-34 and 1q32 were detected and are analyzed here. These amplicons contained amplified genes not previously reported. Other amplified regions containg well-known oncogenes in GBMs were also detected at 7p12 (EGFR, 7q21 (CDK6, 4q12 (PDGFRA, and 12q13-15 (MDM2 and CDK4. In order to identify the putative target genes of the amplifications, and to determine the changes in gene expression levels associated with copy number change events, we carried out parallel gene expression profiling analyses using the same cDNA microarrays. We detected overexpression of the novel amplified genes SLA/LP and STIM2 (4p15, and TNFSF13B and COL4A2 (13q32-34. Some of the candidate target genes of amplification (EGFR, CDK6, MDM2, CDK4, and TNFSF13B were tested in an independent set of 111 primary GBMs by using FISH and immunohistological assays. The novel candidate 13q-amplification target TNFSF13B was amplified in 8% of the tumors, and showed protein expression in 20% of the GBMs. Conclusion This high-resolution analysis allowed us to propose novel candidate target genes such as STIM2 at 4p15, and TNFSF13B or COL4A2 at 13q32-34 that could potentially contribute to the pathogenesis of these tumors and which would require futher investigations. We showed that overexpression of the amplified genes could be attributable to gene dosage and speculate that deregulation of those genes could be important in the development

  18. Candidate gene linkage approach to identify DNA variants that predispose to preterm birth

    DEFF Research Database (Denmark)

    Bream, Elise N A; Leppellere, Cara R; Cooper, Margaret E

    2013-01-01

    Background:The aim of this study was to identify genetic variants contributing to preterm birth (PTB) using a linkage candidate gene approach.Methods:We studied 99 single-nucleotide polymorphisms (SNPs) for 33 genes in 257 families with PTBs segregating. Nonparametric and parametric analyses were...... through the infant and/or the mother in the etiology of PTB....

  19. Gene Expression Analysis in Tubule Interstitial Compartments Reveals Candidate Agents for IgA Nephropathy

    Directory of Open Access Journals (Sweden)

    Jinling Wang

    2014-09-01

    Full Text Available Background/Aims: Our aim was to explore the molecular mechanism underlying development of IgA nephropathy and discover candidate agents for IgA nephropathy. Methods: The differentially expressed genes (DEGs between patients with IgA nephropathy and normal controls were identified by the data of GSE35488 downloaded from GEO (Gene Expression Omnibus database. The co-expressed gene pairs among DEGs were screened to construct the gene-gene interaction network. Gene Ontology (GO enrichment analysis was performed to analyze the functions of DEGs. The biologically active small molecules capable of targeting IgA nephropathy were identified using the Connectivity Map (cMap database. Results: A total of 55 genes involved in response to organic substance, transcription factor activity and response to steroid hormone stimulus were identified to be differentially expressed in IgA nephropathy patients compared to healthy individuals. A network with 45 co-expressed gene pairs was constructed. DEGs in the network were significantly enriched in response to organic substance. Additionally, a group of small molecules were identified, such as doxorubicin and thapsigargin. Conclusion: Our work provided a systematic insight in understanding the mechanism of IgA nephropathy. Small molecules such as thapsigargin might be potential candidate agents for the treatment of IgA nephropathy.

  20. Case-control study of candidate gene methylation and adenomatous polyp formation.

    Science.gov (United States)

    Alexander, M; Burch, J B; Steck, S E; Chen, C-F; Hurley, T G; Cavicchia, P; Shivappa, N; Guess, J; Zhang, H; Youngstedt, S D; Creek, K E; Lloyd, S; Jones, K; Hébert, J R

    2017-02-01

    Colorectal cancer (CRC) is one of the most common and preventable forms of cancer but remains the second leading cause of cancer-related death. Colorectal adenomas are precursor lesions that develop in 70-90 % of CRC cases. Identification of peripheral biomarkers for adenomas would help to enhance screening efforts. This exploratory study examined the methylation status of 20 candidate markers in peripheral blood leukocytes and their association with adenoma formation. Patients recruited from a local endoscopy clinic provided informed consent and completed an interview to ascertain demographic, lifestyle, and adenoma risk factors. Cases were individuals with a histopathologically confirmed adenoma, and controls included patients with a normal colonoscopy or those with histopathological findings not requiring heightened surveillance (normal biopsy, hyperplastic polyp). Methylation-specific polymerase chain reaction was used to characterize candidate gene promoter methylation. Odds ratios (ORs) and 95 % confidence intervals (95% CIs) were calculated using unconditional multivariable logistic regression to test the hypothesis that candidate gene methylation differed between cases and controls, after adjustment for confounders. Complete data were available for 107 participants; 36 % had adenomas (men 40 %, women 31 %). Hypomethylation of the MINT1 locus (OR 5.3, 95% CI 1.0-28.2) and the PER1 (OR 2.9, 95% CI 1.1-7.7) and PER3 (OR 11.6, 95% CI 1.6-78.5) clock gene promoters was more common among adenoma cases. While specificity was moderate to high for the three markers (71-97 %), sensitivity was relatively low (18-45 %). Follow-up of these epigenetic markers is suggested to further evaluate their utility for adenoma screening or surveillance.

  1. Selection and validation of reference genes for qRT-PCR expression analysis of candidate genes involved in olfactory communication in the butterfly Bicyclus anynana.

    Directory of Open Access Journals (Sweden)

    Alok Arun

    Full Text Available Real-time quantitative reverse transcription PCR (qRT-PCR is a technique widely used to quantify the transcriptional expression level of candidate genes. qRT-PCR requires the selection of one or several suitable reference genes, whose expression profiles remain stable across conditions, to normalize the qRT-PCR expression profiles of candidate genes. Although several butterfly species (Lepidoptera have become important models in molecular evolutionary ecology, so far no study aimed at identifying reference genes for accurate data normalization for any butterfly is available. The African bush brown butterfly Bicyclus anynana has drawn considerable attention owing to its suitability as a model for evolutionary ecology, and we here provide a maiden extensive study to identify suitable reference gene in this species. We monitored the expression profile of twelve reference genes: eEF-1α, FK506, UBQL40, RpS8, RpS18, HSP, GAPDH, VATPase, ACT3, TBP, eIF2 and G6PD. We tested the stability of their expression profiles in three different tissues (wings, brains, antennae, two developmental stages (pupal and adult and two sexes (male and female, all of which were subjected to two food treatments (food stress and control feeding ad libitum. The expression stability and ranking of twelve reference genes was assessed using two algorithm-based methods, NormFinder and geNorm. Both methods identified RpS8 as the best suitable reference gene for expression data normalization. We also showed that the use of two reference genes is sufficient to effectively normalize the qRT-PCR data under varying tissues and experimental conditions that we used in B. anynana. Finally, we tested the effect of choosing reference genes with different stability on the normalization of the transcript abundance of a candidate gene involved in olfactory communication in B. anynana, the Fatty Acyl Reductase 2, and we confirmed that using an unstable reference gene can drastically alter the

  2. Selection and validation of reference genes for qRT-PCR expression analysis of candidate genes involved in olfactory communication in the butterfly Bicyclus anynana.

    Science.gov (United States)

    Arun, Alok; Baumlé, Véronique; Amelot, Gaël; Nieberding, Caroline M

    2015-01-01

    Real-time quantitative reverse transcription PCR (qRT-PCR) is a technique widely used to quantify the transcriptional expression level of candidate genes. qRT-PCR requires the selection of one or several suitable reference genes, whose expression profiles remain stable across conditions, to normalize the qRT-PCR expression profiles of candidate genes. Although several butterfly species (Lepidoptera) have become important models in molecular evolutionary ecology, so far no study aimed at identifying reference genes for accurate data normalization for any butterfly is available. The African bush brown butterfly Bicyclus anynana has drawn considerable attention owing to its suitability as a model for evolutionary ecology, and we here provide a maiden extensive study to identify suitable reference gene in this species. We monitored the expression profile of twelve reference genes: eEF-1α, FK506, UBQL40, RpS8, RpS18, HSP, GAPDH, VATPase, ACT3, TBP, eIF2 and G6PD. We tested the stability of their expression profiles in three different tissues (wings, brains, antennae), two developmental stages (pupal and adult) and two sexes (male and female), all of which were subjected to two food treatments (food stress and control feeding ad libitum). The expression stability and ranking of twelve reference genes was assessed using two algorithm-based methods, NormFinder and geNorm. Both methods identified RpS8 as the best suitable reference gene for expression data normalization. We also showed that the use of two reference genes is sufficient to effectively normalize the qRT-PCR data under varying tissues and experimental conditions that we used in B. anynana. Finally, we tested the effect of choosing reference genes with different stability on the normalization of the transcript abundance of a candidate gene involved in olfactory communication in B. anynana, the Fatty Acyl Reductase 2, and we confirmed that using an unstable reference gene can drastically alter the expression

  3. A stratified transcriptomics analysis of polygenic fat and lean mouse adipose tissues identifies novel candidate obesity genes.

    Directory of Open Access Journals (Sweden)

    Nicholas M Morton

    Full Text Available Obesity and metabolic syndrome results from a complex interaction between genetic and environmental factors. In addition to brain-regulated processes, recent genome wide association studies have indicated that genes highly expressed in adipose tissue affect the distribution and function of fat and thus contribute to obesity. Using a stratified transcriptome gene enrichment approach we attempted to identify adipose tissue-specific obesity genes in the unique polygenic Fat (F mouse strain generated by selective breeding over 60 generations for divergent adiposity from a comparator Lean (L strain.To enrich for adipose tissue obesity genes a 'snap-shot' pooled-sample transcriptome comparison of key fat depots and non adipose tissues (muscle, liver, kidney was performed. Known obesity quantitative trait loci (QTL information for the model allowed us to further filter genes for increased likelihood of being causal or secondary for obesity. This successfully identified several genes previously linked to obesity (C1qr1, and Np3r as positional QTL candidate genes elevated specifically in F line adipose tissue. A number of novel obesity candidate genes were also identified (Thbs1, Ppp1r3d, Tmepai, Trp53inp2, Ttc7b, Tuba1a, Fgf13, Fmr that have inferred roles in fat cell function. Quantitative microarray analysis was then applied to the most phenotypically divergent adipose depot after exaggerating F and L strain differences with chronic high fat feeding which revealed a distinct gene expression profile of line, fat depot and diet-responsive inflammatory, angiogenic and metabolic pathways. Selected candidate genes Npr3 and Thbs1, as well as Gys2, a non-QTL gene that otherwise passed our enrichment criteria were characterised, revealing novel functional effects consistent with a contribution to obesity.A focussed candidate gene enrichment strategy in the unique F and L model has identified novel adipose tissue-enriched genes contributing to obesity.

  4. Identifying candidate driver genes by integrative ovarian cancer genomics data

    Science.gov (United States)

    Lu, Xinguo; Lu, Jibo

    2017-08-01

    Integrative analysis of molecular mechanics underlying cancer can distinguish interactions that cannot be revealed based on one kind of data for the appropriate diagnosis and treatment of cancer patients. Tumor samples exhibit heterogeneity in omics data, such as somatic mutations, Copy Number Variations CNVs), gene expression profiles and so on. In this paper we combined gene co-expression modules and mutation modulators separately in tumor patients to obtain the candidate driver genes for resistant and sensitive tumor from the heterogeneous data. The final list of modulators identified are well known in biological processes associated with ovarian cancer, such as CCL17, CACTIN, CCL16, CCL22, APOB, KDF1, CCL11, HNF1B, LRG1, MED1 and so on, which can help to facilitate the discovery of biomarkers, molecular diagnostics, and drug discovery.

  5. Identification of a core set of rhizobial infection genes using data from single cell-types

    Directory of Open Access Journals (Sweden)

    Da-Song eChen

    2015-07-01

    Full Text Available Genome-wide expression studies on nodulation have varied in their scale from entire root systems to dissected nodules or root sections containing nodule primordia. More recently efforts have focused on developing methods for isolation of root hairs from infected plants and the application of laser-capture microdissection technology to nodules. Here we analyze two published data sets to identify a core set of infection genes that are expressed in the nodule and in root hairs during infection. Among the genes identified were those encoding phenylpropanoid biosynthesis enzymes including Chalcone-O-Methyltransferase which is required for the production of the potent Nod gene inducer 4’,4-dihydroxy-2-methoxychalcone. A promoter-GUS analysis in transgenic hairy roots for two genes encoding Chalcone-O-Methyltransferase isoforms revealed their expression in rhizobially infected root hairs and the nodule infection zone but not in the nitrogen fixation zone. We also describe a group of Rhizobially Induced Peroxidases whose expression overlaps with the production of superoxide in rhizobially infected root hairs and in nodules and roots. Finally, we identify a cohort of co-regulated transcription factors as candidate regulators of these processes.

  6. Transcriptomic Analysis Using Olive Varieties and Breeding Progenies Identifies Candidate Genes Involved in Plant Architecture.

    Science.gov (United States)

    González-Plaza, Juan J; Ortiz-Martín, Inmaculada; Muñoz-Mérida, Antonio; García-López, Carmen; Sánchez-Sevilla, José F; Luque, Francisco; Trelles, Oswaldo; Bejarano, Eduardo R; De La Rosa, Raúl; Valpuesta, Victoriano; Beuzón, Carmen R

    2016-01-01

    Plant architecture is a critical trait in fruit crops that can significantly influence yield, pruning, planting density and harvesting. Little is known about how plant architecture is genetically determined in olive, were most of the existing varieties are traditional with an architecture poorly suited for modern growing and harvesting systems. In the present study, we have carried out microarray analysis of meristematic tissue to compare expression profiles of olive varieties displaying differences in architecture, as well as seedlings from their cross pooled on the basis of their sharing architecture-related phenotypes. The microarray used, previously developed by our group has already been applied to identify candidates genes involved in regulating juvenile to adult transition in the shoot apex of seedlings. Varieties with distinct architecture phenotypes and individuals from segregating progenies displaying opposite architecture features were used to link phenotype to expression. Here, we identify 2252 differentially expressed genes (DEGs) associated to differences in plant architecture. Microarray results were validated by quantitative RT-PCR carried out on genes with functional annotation likely related to plant architecture. Twelve of these genes were further analyzed in individual seedlings of the corresponding pool. We also examined Arabidopsis mutants in putative orthologs of these targeted candidate genes, finding altered architecture for most of them. This supports a functional conservation between species and potential biological relevance of the candidate genes identified. This study is the first to identify genes associated to plant architecture in olive, and the results obtained could be of great help in future programs aimed at selecting phenotypes adapted to modern cultivation practices in this species.

  7. Exome sequencing of Pakistani consanguineous families identifies 30 novel candidate genes for recessive intellectual disability.

    Science.gov (United States)

    Riazuddin, S; Hussain, M; Razzaq, A; Iqbal, Z; Shahzad, M; Polla, D L; Song, Y; van Beusekom, E; Khan, A A; Tomas-Roca, L; Rashid, M; Zahoor, M Y; Wissink-Lindhout, W M; Basra, M A R; Ansar, M; Agha, Z; van Heeswijk, K; Rasheed, F; Van de Vorst, M; Veltman, J A; Gilissen, C; Akram, J; Kleefstra, T; Assir, M Z; Grozeva, D; Carss, K; Raymond, F L; O'Connor, T D; Riazuddin, S A; Khan, S N; Ahmed, Z M; de Brouwer, A P M; van Bokhoven, H; Riazuddin, S

    2017-11-01

    Intellectual disability (ID) is a clinically and genetically heterogeneous disorder, affecting 1-3% of the general population. Although research into the genetic causes of ID has recently gained momentum, identification of pathogenic mutations that cause autosomal recessive ID (ARID) has lagged behind, predominantly due to non-availability of sizeable families. Here we present the results of exome sequencing in 121 large consanguineous Pakistani ID families. In 60 families, we identified homozygous or compound heterozygous DNA variants in a single gene, 30 affecting reported ID genes and 30 affecting novel candidate ID genes. Potential pathogenicity of these alleles was supported by co-segregation with the phenotype, low frequency in control populations and the application of stringent bioinformatics analyses. In another eight families segregation of multiple pathogenic variants was observed, affecting 19 genes that were either known or are novel candidates for ID. Transcriptome profiles of normal human brain tissues showed that the novel candidate ID genes formed a network significantly enriched for transcriptional co-expression (P<0.0001) in the frontal cortex during fetal development and in the temporal-parietal and sub-cortex during infancy through adulthood. In addition, proteins encoded by 12 novel ID genes directly interact with previously reported ID proteins in six known pathways essential for cognitive function (P<0.0001). These results suggest that disruptions of temporal parietal and sub-cortical neurogenesis during infancy are critical to the pathophysiology of ID. These findings further expand the existing repertoire of genes involved in ARID, and provide new insights into the molecular mechanisms and the transcriptome map of ID.

  8. Molecular evolution of candidate genes for crop-related traits in sunflower (Helianthus annuus L.).

    Science.gov (United States)

    Mandel, Jennifer R; McAssey, Edward V; Nambeesan, Savithri; Garcia-Navarro, Elena; Burke, John M

    2014-01-01

    Evolutionary analyses aimed at detecting the molecular signature of selection during crop domestication and/or improvement can be used to identify genes or genomic regions of likely agronomic importance. Here, we describe the DNA sequence-based characterization of a pool of candidate genes for crop-related traits in sunflower. These genes, which were identified based on homology to genes of known effect in other study systems, were initially sequenced from a panel of improved lines. All genes that exhibited a paucity of sequence diversity, consistent with the possible effects of selection during the evolution of cultivated sunflower, were then sequenced from a panel of wild sunflower accessions an outgroup. These data enabled formal tests for the effects of selection in shaping sequence diversity at these loci. When selection was detected, we further sequenced these genes from a panel of primitive landraces, thereby allowing us to investigate the likely timing of selection (i.e., domestication vs. improvement). We ultimately identified seven genes that exhibited the signature of positive selection during either domestication or improvement. Genetic mapping of a subset of these genes revealed co-localization between candidates for genes involved in the determination of flowering time, seed germination, plant growth/development, and branching and QTL that were previously identified for these traits in cultivated × wild sunflower mapping populations.

  9. Zebrafish Expression Ontology of Gene Sets (ZEOGS): A Tool to Analyze Enrichment of Zebrafish Anatomical Terms in Large Gene Sets

    Science.gov (United States)

    Marsico, Annalisa

    2013-01-01

    Abstract The zebrafish (Danio rerio) is an established model organism for developmental and biomedical research. It is frequently used for high-throughput functional genomics experiments, such as genome-wide gene expression measurements, to systematically analyze molecular mechanisms. However, the use of whole embryos or larvae in such experiments leads to a loss of the spatial information. To address this problem, we have developed a tool called Zebrafish Expression Ontology of Gene Sets (ZEOGS) to assess the enrichment of anatomical terms in large gene sets. ZEOGS uses gene expression pattern data from several sources: first, in situ hybridization experiments from the Zebrafish Model Organism Database (ZFIN); second, it uses the Zebrafish Anatomical Ontology, a controlled vocabulary that describes connected anatomical structures; and third, the available connections between expression patterns and anatomical terms contained in ZFIN. Upon input of a gene set, ZEOGS determines which anatomical structures are overrepresented in the input gene set. ZEOGS allows one for the first time to look at groups of genes and to describe them in terms of shared anatomical structures. To establish ZEOGS, we first tested it on random gene selections and on two public microarray datasets with known tissue-specific gene expression changes. These tests showed that ZEOGS could reliably identify the tissues affected, whereas only very few enriched terms to none were found in the random gene sets. Next we applied ZEOGS to microarray datasets of 24 and 72 h postfertilization zebrafish embryos treated with beclomethasone, a potent glucocorticoid. This analysis resulted in the identification of several anatomical terms related to glucocorticoid-responsive tissues, some of which were stage-specific. Our studies highlight the ability of ZEOGS to extract spatial information from datasets derived from whole embryos, indicating that ZEOGS could be a useful tool to automatically analyze gene

  10. Zebrafish Expression Ontology of Gene Sets (ZEOGS): a tool to analyze enrichment of zebrafish anatomical terms in large gene sets.

    Science.gov (United States)

    Prykhozhij, Sergey V; Marsico, Annalisa; Meijsing, Sebastiaan H

    2013-09-01

    The zebrafish (Danio rerio) is an established model organism for developmental and biomedical research. It is frequently used for high-throughput functional genomics experiments, such as genome-wide gene expression measurements, to systematically analyze molecular mechanisms. However, the use of whole embryos or larvae in such experiments leads to a loss of the spatial information. To address this problem, we have developed a tool called Zebrafish Expression Ontology of Gene Sets (ZEOGS) to assess the enrichment of anatomical terms in large gene sets. ZEOGS uses gene expression pattern data from several sources: first, in situ hybridization experiments from the Zebrafish Model Organism Database (ZFIN); second, it uses the Zebrafish Anatomical Ontology, a controlled vocabulary that describes connected anatomical structures; and third, the available connections between expression patterns and anatomical terms contained in ZFIN. Upon input of a gene set, ZEOGS determines which anatomical structures are overrepresented in the input gene set. ZEOGS allows one for the first time to look at groups of genes and to describe them in terms of shared anatomical structures. To establish ZEOGS, we first tested it on random gene selections and on two public microarray datasets with known tissue-specific gene expression changes. These tests showed that ZEOGS could reliably identify the tissues affected, whereas only very few enriched terms to none were found in the random gene sets. Next we applied ZEOGS to microarray datasets of 24 and 72 h postfertilization zebrafish embryos treated with beclomethasone, a potent glucocorticoid. This analysis resulted in the identification of several anatomical terms related to glucocorticoid-responsive tissues, some of which were stage-specific. Our studies highlight the ability of ZEOGS to extract spatial information from datasets derived from whole embryos, indicating that ZEOGS could be a useful tool to automatically analyze gene expression

  11. MAGMA: Generalized Gene-Set Analysis of GWAS Data

    NARCIS (Netherlands)

    de Leeuw, C.A.; Mooij, J.M.; Heskes, T.; Posthuma, D.

    2015-01-01

    By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical

  12. MAGMA: generalized gene-set analysis of GWAS data.

    NARCIS (Netherlands)

    de Leeuw, C.A.; Mooij, J.M.; Heskes, T.; Posthuma, D.

    2015-01-01

    By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical

  13. Gene set analysis of purine and pyrimidine antimetabolites cancer therapies.

    Science.gov (United States)

    Fridley, Brooke L; Batzler, Anthony; Li, Liang; Li, Fang; Matimba, Alice; Jenkins, Gregory D; Ji, Yuan; Wang, Liewei; Weinshilboum, Richard M

    2011-11-01

    Responses to therapies, either with regard to toxicities or efficacy, are expected to involve complex relationships of gene products within the same molecular pathway or functional gene set. Therefore, pathways or gene sets, as opposed to single genes, may better reflect the true underlying biology and may be more appropriate units for analysis of pharmacogenomic studies. Application of such methods to pharmacogenomic studies may enable the detection of more subtle effects of multiple genes in the same pathway that may be missed by assessing each gene individually. A gene set analysis of 3821 gene sets is presented assessing the association between basal messenger RNA expression and drug cytotoxicity using ethnically defined human lymphoblastoid cell lines for two classes of drugs: pyrimidines [gemcitabine (dFdC) and arabinoside] and purines [6-thioguanine and 6-mercaptopurine]. The gene set nucleoside-diphosphatase activity was found to be significantly associated with both dFdC and arabinoside, whereas gene set γ-aminobutyric acid catabolic process was associated with dFdC and 6-thioguanine. These gene sets were significantly associated with the phenotype even after adjusting for multiple testing. In addition, five associated gene sets were found in common between the pyrimidines and two gene sets for the purines (3',5'-cyclic-AMP phosphodiesterase activity and γ-aminobutyric acid catabolic process) with a P value of less than 0.0001. Functional validation was attempted with four genes each in gene sets for thiopurine and pyrimidine antimetabolites. All four genes selected from the pyrimidine gene sets (PSME3, CANT1, ENTPD6, ADRM1) were validated, but only one (PDE4D) was validated for the thiopurine gene sets. In summary, results from the gene set analysis of pyrimidine and purine therapies, used often in the treatment of various cancers, provide novel insight into the relationship between genomic variation and drug response.

  14. Epidermal growth factor gene is a newly identified candidate gene for gout

    Science.gov (United States)

    Han, Lin; Cao, Chunwei; Jia, Zhaotong; Liu, Shiguo; Liu, Zhen; Xin, Ruosai; Wang, Can; Li, Xinde; Ren, Wei; Wang, Xuefeng; Li, Changgui

    2016-01-01

    Chromosome 4q25 has been identified as a genomic region associated with gout. However, the associations of gout with the genes in this region have not yet been confirmed. Here, we performed two-stage analysis to determine whether variations in candidate genes in the 4q25 region are associated with gout in a male Chinese Han population. We first evaluated 96 tag single nucleotide polymorphisms (SNPs) in eight inflammatory/immune pathway- or glucose/lipid metabolism-related genes in the 4q25 region in 480 male gout patients and 480 controls. The SNP rs12504538, located in the elongation of very-long-chain-fatty-acid-like family member 6 gene (Elovl6), was found to be associated with gout susceptibility (Padjusted = 0.00595). In the second stage of analysis, we performed fine mapping analysis of 93 tag SNPs in Elovl6 and in the epidermal growth factor gene (EGF) and its flanking regions in 1017 male patients gout and 1897 healthy male controls. We observed a significant association between the T allele of EGF rs2298999 and gout (odds ratio = 0.77, 95% confidence interval = 0.67–0.88, Padjusted = 6.42 × 10−3). These results provide the first evidence for an association between the EGF rs2298999 C/T polymorphism and gout. Our findings should be validated in additional populations. PMID:27506295

  15. Fine mapping and identification of a candidate gene for the barley Un8 true loose smut resistance gene.

    Science.gov (United States)

    Zang, Wen; Eckstein, Peter E; Colin, Mark; Voth, Doug; Himmelbach, Axel; Beier, Sebastian; Stein, Nils; Scoles, Graham J; Beattie, Aaron D

    2015-07-01

    The candidate gene for the barley Un8 true loose smut resistance gene encodes a deduced protein containing two tandem protein kinase domains. In North America, durable resistance against all known isolates of barley true loose smut, caused by the basidiomycete pathogen Ustilago nuda (Jens.) Rostr. (U. nuda), is under the control of the Un8 resistance gene. Previous genetic studies mapped Un8 to the long arm of chromosome 5 (1HL). Here, a population of 4625 lines segregating for Un8 was used to delimit the Un8 gene to a 0.108 cM interval on chromosome arm 1HL, and assign it to fingerprinted contig 546 of the barley physical map. The minimal tilling path was identified for the Un8 locus using two flanking markers and consisted of two overlapping bacterial artificial chromosomes. One gene located close to a marker co-segregating with Un8 showed high sequence identity to a disease resistance gene containing two kinase domains. Sequence of the candidate gene from the parents of the segregating population, and in an additional 19 barley lines representing a broader spectrum of diversity, showed there was no intron in alleles present in either resistant or susceptible lines, and fifteen amino acid variations unique to the deduced protein sequence in resistant lines differentiated it from the deduced protein sequences in susceptible lines. Some of these variations were present within putative functional domains which may cause a loss of function in the deduced protein sequences within susceptible lines.

  16. Candidate gene association studies in syndromic and non-syndromic cleft lip and palate

    Energy Technology Data Exchange (ETDEWEB)

    Daack-Hirsch, S.; Basart, A.; Frischmeyer, P. [Univ. of Iowa, IA (United States)] [and others

    1994-09-01

    Using ongoing case ascertainment through a birth defects registry, we have collected 219 nuclear families with non-syndromic cleft lip and/or palate and 111 families with a collection of syndromic forms. Syndromic cases include 24 with recognized forms and 72 with unrecognized syndromes. Candidate gene studies as well as genome-wide searches for evidence of microdeletions and isodisomy are currently being carried out. Candidate gene association studies, to date, have made use of PCR-based polymorphisms for TGFA, MSX1, CLPG13 (a CA repeat associated with a human homologue of a locus that results in craniofacial dysmorphogenesis in the mouse) and an STRP found in a Van der Woude syndrome microdeletion. Control tetranucleotide repeats, which insure that population-based differences are not responsible for any observed associations, are also tested. Studies of the syndromic cases have included the same list of candidate genes searching for evidence of microdeletions and a genome-wide search using tri- and tetranucleotide polymorphic markers to search for isodisomy or structural rearrangements. Significant associations have previously been identified for TGFA, and, in this report, identified for MSX1 and nonsyndromic cleft palate only (p = 0.04, uncorrected). Preliminary results of the genome-wide scan for isodisomy has returned no true positives and there has been no evidence for microdeletion cases.

  17. Gene set analysis for GWAS

    DEFF Research Database (Denmark)

    Debrabant, Birgit; Soerensen, Mette

    2014-01-01

    Abstract We discuss the use of modified Kolmogorov-Smirnov (KS) statistics in the context of gene set analysis and review corresponding null and alternative hypotheses. Especially, we show that, when enhancing the impact of highly significant genes in the calculation of the test statistic, the co...

  18. A hybrid approach of gene sets and single genes for the prediction of survival risks with gene expression data.

    Science.gov (United States)

    Seok, Junhee; Davis, Ronald W; Xiao, Wenzhong

    2015-01-01

    Accumulated biological knowledge is often encoded as gene sets, collections of genes associated with similar biological functions or pathways. The use of gene sets in the analyses of high-throughput gene expression data has been intensively studied and applied in clinical research. However, the main interest remains in finding modules of biological knowledge, or corresponding gene sets, significantly associated with disease conditions. Risk prediction from censored survival times using gene sets hasn't been well studied. In this work, we propose a hybrid method that uses both single gene and gene set information together to predict patient survival risks from gene expression profiles. In the proposed method, gene sets provide context-level information that is poorly reflected by single genes. Complementarily, single genes help to supplement incomplete information of gene sets due to our imperfect biomedical knowledge. Through the tests over multiple data sets of cancer and trauma injury, the proposed method showed robust and improved performance compared with the conventional approaches with only single genes or gene sets solely. Additionally, we examined the prediction result in the trauma injury data, and showed that the modules of biological knowledge used in the prediction by the proposed method were highly interpretable in biology. A wide range of survival prediction problems in clinical genomics is expected to benefit from the use of biological knowledge.

  19. Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data

    Directory of Open Access Journals (Sweden)

    Tintle Nathan L

    2012-08-01

    Full Text Available Abstract Background Statistical analyses of whole genome expression data require functional information about genes in order to yield meaningful biological conclusions. The Gene Ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG are common sources of functionally grouped gene sets. For bacteria, the SEED and MicrobesOnline provide alternative, complementary sources of gene sets. To date, no comprehensive evaluation of the data obtained from these resources has been performed. Results We define a series of gene set consistency metrics directly related to the most common classes of statistical analyses for gene expression data, and then perform a comprehensive analysis of 3581 Affymetrix® gene expression arrays across 17 diverse bacteria. We find that gene sets obtained from GO and KEGG demonstrate lower consistency than those obtained from the SEED and MicrobesOnline, regardless of gene set size. Conclusions Despite the widespread use of GO and KEGG gene sets in bacterial gene expression data analysis, the SEED and MicrobesOnline provide more consistent sets for a wide variety of statistical analyses. Increased use of the SEED and MicrobesOnline gene sets in the analysis of bacterial gene expression data may improve statistical power and utility of expression data.

  20. Fast-Solving Quasi-Optimal LS-S3VM Based on an Extended Candidate Set.

    Science.gov (United States)

    Ma, Yuefeng; Liang, Xun; Kwok, James T; Li, Jianping; Zhou, Xiaoping; Zhang, Haiyan

    2018-04-01

    The semisupervised least squares support vector machine (LS-S 3 VM) is an important enhancement of least squares support vector machines in semisupervised learning. Given that most data collected from the real world are without labels, semisupervised approaches are more applicable than standard supervised approaches. Although a few training methods for LS-S 3 VM exist, the problem of deriving the optimal decision hyperplane efficiently and effectually has not been solved. In this paper, a fully weighted model of LS-S 3 VM is proposed, and a simple integer programming (IP) model is introduced through an equivalent transformation to solve the model. Based on the distances between the unlabeled data and the decision hyperplane, a new indicator is designed to represent the possibility that the label of an unlabeled datum should be reversed in each iteration during training. Using the indicator, we construct an extended candidate set consisting of the indices of unlabeled data with high possibilities, which integrates more information from unlabeled data. Our algorithm is degenerated into a special scenario of the previous algorithm when the extended candidate set is reduced into a set with only one element. Two strategies are utilized to determine the descent directions based on the extended candidate set. Furthermore, we developed a novel method for locating a good starting point based on the properties of the equivalent IP model. Combined with the extended candidate set and the carefully computed starting point, a fast algorithm to solve LS-S 3 VM quasi-optimally is proposed. The choice of quasi-optimal solutions results in low computational cost and avoidance of overfitting. Experiments show that our algorithm equipped with the two designed strategies is more effective than other algorithms in at least one of the following three aspects: 1) computational complexity; 2) generalization ability; and 3) flexibility. However, our algorithm and other algorithms have

  1. Studying the Complex Expression Dependences between Sets of Coexpressed Genes

    Directory of Open Access Journals (Sweden)

    Mario Huerta

    2014-01-01

    Full Text Available Organisms simplify the orchestration of gene expression by coregulating genes whose products function together in the cell. The use of clustering methods to obtain sets of coexpressed genes from expression arrays is very common; nevertheless there are no appropriate tools to study the expression networks among these sets of coexpressed genes. The aim of the developed tools is to allow studying the complex expression dependences that exist between sets of coexpressed genes. For this purpose, we start detecting the nonlinear expression relationships between pairs of genes, plus the coexpressed genes. Next, we form networks among sets of coexpressed genes that maintain nonlinear expression dependences between all of them. The expression relationship between the sets of coexpressed genes is defined by the expression relationship between the skeletons of these sets, where this skeleton represents the coexpressed genes with a well-defined nonlinear expression relationship with the skeleton of the other sets. As a result, we can study the nonlinear expression relationships between a target gene and other sets of coexpressed genes, or start the study from the skeleton of the sets, to study the complex relationships of activation and deactivation between the sets of coexpressed genes that carry out the different cellular processes present in the expression experiments.

  2. Integrative analysis of gene expression and DNA methylation using unsupervised feature extraction for detecting candidate cancer biomarkers.

    Science.gov (United States)

    Moon, Myungjin; Nakai, Kenta

    2018-04-01

    Currently, cancer biomarker discovery is one of the important research topics worldwide. In particular, detecting significant genes related to cancer is an important task for early diagnosis and treatment of cancer. Conventional studies mostly focus on genes that are differentially expressed in different states of cancer; however, noise in gene expression datasets and insufficient information in limited datasets impede precise analysis of novel candidate biomarkers. In this study, we propose an integrative analysis of gene expression and DNA methylation using normalization and unsupervised feature extractions to identify candidate biomarkers of cancer using renal cell carcinoma RNA-seq datasets. Gene expression and DNA methylation datasets are normalized by Box-Cox transformation and integrated into a one-dimensional dataset that retains the major characteristics of the original datasets by unsupervised feature extraction methods, and differentially expressed genes are selected from the integrated dataset. Use of the integrated dataset demonstrated improved performance as compared with conventional approaches that utilize gene expression or DNA methylation datasets alone. Validation based on the literature showed that a considerable number of top-ranked genes from the integrated dataset have known relationships with cancer, implying that novel candidate biomarkers can also be acquired from the proposed analysis method. Furthermore, we expect that the proposed method can be expanded for applications involving various types of multi-omics datasets.

  3. Tracking difference in gene expression in a time-course experiment using gene set enrichment analysis.

    Directory of Open Access Journals (Sweden)

    Pui Shan Wong

    Full Text Available Fistulifera sp. strain JPCC DA0580 is a newly sequenced pennate diatom that is capable of simultaneously growing and accumulating lipids. This is a unique trait, not found in other related microalgae so far. It is able to accumulate between 40 to 60% of its cell weight in lipids, making it a strong candidate for the production of biofuel. To investigate this characteristic, we used RNA-Seq data gathered at four different times while Fistulifera sp. strain JPCC DA0580 was grown in oil accumulating and non-oil accumulating conditions. We then adapted gene set enrichment analysis (GSEA to investigate the relationship between the difference in gene expression of 7,822 genes and metabolic functions in our data. We utilized information in the KEGG pathway database to create the gene sets and changed GSEA to use re-sampling so that data from the different time points could be included in the analysis. Our GSEA method identified photosynthesis, lipid synthesis and amino acid synthesis related pathways as processes that play a significant role in oil production and growth in Fistulifera sp. strain JPCC DA0580. In addition to GSEA, we visualized the results by creating a network of compounds and reactions, and plotted the expression data on top of the network. This made existing graph algorithms available to us which we then used to calculate a path that metabolizes glucose into triacylglycerol (TAG in the smallest number of steps. By visualizing the data this way, we observed a separate up-regulation of genes at different times instead of a concerted response. We also identified two metabolic paths that used less reactions than the one shown in KEGG and showed that the reactions were up-regulated during the experiment. The combination of analysis and visualization methods successfully analyzed time-course data, identified important metabolic pathways and provided new hypotheses for further research.

  4. Candidate genes that may be responsible for the unusual resistances exhibited by Bacillus pumilus SAFR-032 spores.

    Directory of Open Access Journals (Sweden)

    Madhan R Tirumalai

    Full Text Available The spores of several Bacillus species, including Bacillus pumilus SAFR-032 and B. safensis FO-36b, which were isolated from the spacecraft assembly facility at NASA's Jet Propulsion Laboratory, are unusually resistant to UV radiation and hydrogen peroxide. In order to identify candidate genes that might be associated with these resistances, the whole genome of B. pumilus SAFR-032, and the draft genome of B. safensis FO-36b were compared in detail with the very closely related type strain B. pumilus ATCC7061(T. 170 genes are considered characteristic of SAFR-032, because they are absent from both FO-36b and ATCC7061(T. Forty of these SAFR-032 characteristic genes are entirely unique open reading frames. In addition, four genes are unique to the genomes of the resistant SAFR-032 and FO-36b. Fifty three genes involved in spore coat formation, regulation and germination, DNA repair, and peroxide resistance, are missing from all three genomes. The vast majority of these are cleanly deleted from their usual genomic context without any obvious replacement. Several DNA repair and peroxide resistance genes earlier reported to be unique to SAFR-032 are in fact shared with ATCC7061(T and no longer considered to be promising candidates for association with the elevated resistances. Instead, several SAFR-032 characteristic genes were identified, which along with one or more of the unique SAFR-032 genes may be responsible for the elevated resistances. These new candidates include five genes associated with DNA repair, namely, BPUM_0608 a helicase, BPUM_0652 an ATP binding protein, BPUM_0653 an endonuclease, BPUM_0656 a DNA cytosine-5- methyltransferase, and BPUM_3674 a DNA helicase. Three of these candidate genes are in immediate proximity of two conserved hypothetical proteins, BPUM_0654 and BPUM_0655 that are also absent from both FO-36b and ATCC7061(T. This cluster of five genes is considered to be an especially promising target for future experimental

  5. Longevity Candidate Genes and Their Association With Personality Traits in the Elderly

    NARCIS (Netherlands)

    Luciano, M.; Lopez, L.M.; de Moor, M.H.M.; Harris, S.E.; Davies, G.; Nutile, T.; Krueger, R.F.; Esko, T.; Schlessinger, D.; Toshiko, T.; Derringer, J.; Realo, A.; Hansell, N.K.; Pergadia, M.L.; Pesonen, A.-K.; Sanna, S.; Terracciano, A.; Madden, P.A.F.; Penninx, B.W.J.H.; Spinhoven, Ph.D.; Hartman, C.A.; Oostra, B.A.; Janssens, A.C.J.W.; Eriksson, J.G.; Starr, J.M.; Cannas, A.; Ferrucci, L.; Metspalu, A.; Wright, M.J.; Heath, A.C.; van Duijn, C.M.; Bierut, L.J.; Raikkonen, K.; Martin, N.G.; Ciullo, M.; Rujescu, D.; Boomsma, D.I.; Deary, I.J.

    2012-01-01

    Human longevity and personality traits are both heritable and are consistently linked at the phenotypic level. We test the hypothesis that candidate genes influencing longevity in lower organisms are associated with variance in the five major dimensions of human personality (measured by the NEO-FFI

  6. Longevity candidate genes and their association with personality traits in the elderly

    NARCIS (Netherlands)

    Luciano, Michelle; Lopez, Lorna M.; de Moor, Marleen H. M.; Harris, Sarah E.; Davies, Gail; Nutile, Teresa; Krueger, Robert F.; Esko, Tonu; Schlessinger, David; Toshiko, Tanaka; Derringer, Jaime L.; Realo, Anu; Hansell, Narelle K.; Pergadia, Michele L.; Pesonen, Anu-Katriina; Sanna, Serena; Terracciano, Antonio; Madden, Pamela A. F.; Penninx, Brenda; Spinhoven, Philip; Hartman, Catherina A.; Oostra, Ben A.; Janssens, A. Cecile J. W.; Eriksson, Johan G.; Starr, John M.; Cannas, Alessandra; Ferrucci, Luigi; Metspalu, Andres; Wright, Margeret J.; Heath, Andrew C.; van Duijn, Cornelia M.; Bierut, Laura J.; Raikkonen, Katri; Martin, Nicholas G.; Ciullo, Marina; Rujescu, Dan; Boomsma, Dorret I.; Deary, Ian J.

    Human longevity and personality traits are both heritable and are consistently linked at the phenotypic level. We test the hypothesis that candidate genes influencing longevity in lower organisms are associated with variance in the five major dimensions of human personality (measured by the NEO-FFI

  7. Using the gene ontology to scan multilevel gene sets for associations in genome wide association studies.

    Science.gov (United States)

    Schaid, Daniel J; Sinnwell, Jason P; Jenkins, Gregory D; McDonnell, Shannon K; Ingle, James N; Kubo, Michiaki; Goss, Paul E; Costantino, Joseph P; Wickerham, D Lawrence; Weinshilboum, Richard M

    2012-01-01

    Gene-set analyses have been widely used in gene expression studies, and some of the developed methods have been extended to genome wide association studies (GWAS). Yet, complications due to linkage disequilibrium (LD) among single nucleotide polymorphisms (SNPs), and variable numbers of SNPs per gene and genes per gene-set, have plagued current approaches, often leading to ad hoc "fixes." To overcome some of the current limitations, we developed a general approach to scan GWAS SNP data for both gene-level and gene-set analyses, building on score statistics for generalized linear models, and taking advantage of the directed acyclic graph structure of the gene ontology when creating gene-sets. However, other types of gene-set structures can be used, such as the popular Kyoto Encyclopedia of Genes and Genomes (KEGG). Our approach combines SNPs into genes, and genes into gene-sets, but assures that positive and negative effects of genes on a trait do not cancel. To control for multiple testing of many gene-sets, we use an efficient computational strategy that accounts for LD and provides accurate step-down adjusted P-values for each gene-set. Application of our methods to two different GWAS provide guidance on the potential strengths and weaknesses of our proposed gene-set analyses. © 2011 Wiley Periodicals, Inc.

  8. Alienness: Rapid Detection of Candidate Horizontal Gene Transfers across the Tree of Life

    Directory of Open Access Journals (Sweden)

    Corinne Rancurel

    2017-09-01

    Full Text Available Horizontal gene transfer (HGT is the transmission of genes between organisms by other means than parental to offspring inheritance. While it is prevalent in prokaryotes, HGT is less frequent in eukaryotes and particularly in Metazoa. Here, we propose Alienness, a taxonomy-aware web application available at http://alienness.sophia.inra.fr. Alienness parses BLAST results against public libraries to rapidly identify candidate HGT in any genome of interest. Alienness takes as input the result of a BLAST of a whole proteome of interest against any National Center for Biotechnology Information (NCBI protein library. The user defines recipient (e.g., Metazoa and donor (e.g., bacteria, fungi branches of interest in the NCBI taxonomy. Based on the best BLAST E-values of candidate donor and recipient taxa, Alienness calculates an Alien Index (AI for each query protein. An AI > 0 indicates a better hit to candidate donor than recipient taxa and a possible HGT. Higher AI represent higher gap of E-values between candidate donor and recipient and a more likely HGT. We confirmed the accuracy of Alienness on phylogenetically confirmed HGT of non-metazoan origin in plant-parasitic nematodes. Alienness scans whole proteomes to rapidly identify possible HGT in any species of interest and thus fosters exploration of HGT more easily and largely across the tree of life.

  9. Chronic obstructive pulmonary disease candidate gene prioritization based on metabolic networks and functional information.

    Directory of Open Access Journals (Sweden)

    Xinyan Wang

    Full Text Available Chronic obstructive pulmonary disease (COPD is a multi-factor disease, in which metabolic disturbances played important roles. In this paper, functional information was integrated into a COPD-related metabolic network to assess similarity between genes. Then a gene prioritization method was applied to the COPD-related metabolic network to prioritize COPD candidate genes. The gene prioritization method was superior to ToppGene and ToppNet in both literature validation and functional enrichment analysis. Top-ranked genes prioritized from the metabolic perspective with functional information could promote the better understanding about the molecular mechanism of this disease. Top 100 genes might be potential markers for diagnostic and effective therapies.

  10. Linkage mapping of candidate genes for induce resistance and growth promotion by trichoderma koningiopsis (th003) in tomato solanum lycopersicum

    International Nuclear Information System (INIS)

    Simbaqueba, Jaime; Cotes, Alba Marina; Barrero, Luz Stella

    2011-01-01

    Induced systemic resistance (ISR) is a mechanism by which plants enhance defenses against any stress condition. ISR and growth promotion are enhanced when tomato (Solanum lycopersicum) is inoculated with several strains of Trichoderma ssp. this study aims to genetically map tomato candidate genes involved in ISR and growth promotion induced by the Colombian native isolate Trichoderma koningiopsis th003. Forty-nine candidate genes previously identified on tomato plants treated with th003 and T. hamatum T382 strains were evaluated for polymorphisms and 16 of them were integrated on the highly saturated genetic linkage map named TOMATO EXPEN 2000. The location of six unigenes was similar to the location of resistance gene analogs (RGAS), defense related ests and resistance QTLs previously reported, suggesting new possible candidates for these quantitative trait loci (QTL) regions. The candidate gene-markers may be used for future ISR or growth promotion assisted selection in tomato.

  11. Systematic identification and validation of candidate genes for detection of circulating tumor cells in peripheral blood specimens of colorectal cancer patients.

    Science.gov (United States)

    Findeisen, Peter; Röckel, Matthias; Nees, Matthias; Röder, Christian; Kienle, Peter; Von Knebel Doeberitz, Magnus; Kalthoff, Holger; Neumaier, Michael

    2008-11-01

    The presence of tumor cells in peripheral blood is being regarded increasingly as a clinically relevant prognostic factor for colorectal cancer patients. Current molecular methods are very sensitive but due to low specificity their diagnostic value is limited. This study was undertaken in order to systematically identify and validate new colorectal cancer (CRC) marker genes for improved detection of minimal residual disease in peripheral blood mononuclear cells of colorectal cancer patients. Marker genes with upregulated gene expression in colorectal cancer tissue and cell lines were identified using microarray experiments and publicly available gene expression data. A systematic iterative approach was used to reduce a set of 346 candidate genes, reportedly associated with CRC to a selection of candidate genes that were then further validated by relative quantitative real-time RT-PCR. Analytical sensitivity of RT-PCR assays was determined by spiking experiments with CRC cells. Diagnostic sensitivity as well as specificity was tested on a control group consisting of 18 CRC patients compared to 12 individuals without malignant disease. From a total of 346-screened genes only serine (or cysteine) proteinase inhibitor, clade B (ovalbumin), member 5 (SERPINB5) showed significantly elevated transcript levels in peripheral venous blood specimens of tumor patients when compared to the nonmalignant control group. These results were confirmed by analysis of an enlarged collective consisting of 63 CRC patients and 36 control individuals without malignant disease. In conclusion SERPINB5 seems to be a promising marker for detection of circulating tumor cells in peripheral blood of colorectal cancer patients.

  12. Bioinformatics-driven identification and examination of candidate genes for non-alcoholic fatty liver disease.

    Directory of Open Access Journals (Sweden)

    Karina Banasik

    2011-01-01

    Full Text Available Candidate genes for non-alcoholic fatty liver disease (NAFLD identified by a bioinformatics approach were examined for variant associations to quantitative traits of NAFLD-related phenotypes.By integrating public database text mining, trans-organism protein-protein interaction transferal, and information on liver protein expression a protein-protein interaction network was constructed and from this a smaller isolated interactome was identified. Five genes from this interactome were selected for genetic analysis. Twenty-one tag single-nucleotide polymorphisms (SNPs which captured all common variation in these genes were genotyped in 10,196 Danes, and analyzed for association with NAFLD-related quantitative traits, type 2 diabetes (T2D, central obesity, and WHO-defined metabolic syndrome (MetS.273 genes were included in the protein-protein interaction analysis and EHHADH, ECHS1, HADHA, HADHB, and ACADL were selected for further examination. A total of 10 nominal statistical significant associations (P<0.05 to quantitative metabolic traits were identified. Also, the case-control study showed associations between variation in the five genes and T2D, central obesity, and MetS, respectively. Bonferroni adjustments for multiple testing negated all associations.Using a bioinformatics approach we identified five candidate genes for NAFLD. However, we failed to provide evidence of associations with major effects between SNPs in these five genes and NAFLD-related quantitative traits, T2D, central obesity, and MetS.

  13. Association study of candidate genes for susceptibility to schizophrenia and bipolar disorder on chromosome 22Q13

    DEFF Research Database (Denmark)

    Severinsen, Jacob; Binderup, Helle; Mors, Ole

    Chromosome 22q is suspected to harbor risk genes for schizophrenia as well as bipolar affective disorder. This is evidenced through genetic mapping studies, investigations of cytogenetic abnormalities, and direct examination of candidate genes. In a recent study of distantly related patients from...... the Faroe Islands we have obtained evidence suggesting two regions on chromosome 22q13 to potentially harbor susceptibility genes for both schizophrenia and bipolar affective disorder. We have selected a number of candidate genes from these two regions for further analysis, including the neuro-gene WKL1...... and unrelated controls, and in a Scottish case-control sample comprising 200 schizophrenics, 200 bipolar patients and 200 controls. None of the investigated SNPs have so far showed strong evidence of association to either bipolar disorder or schizophrenia....

  14. Sequence-Based Introgression Mapping Identifies Candidate White Mold Tolerance Genes in Common Bean

    Directory of Open Access Journals (Sweden)

    Sujan Mamidi

    2016-07-01

    Full Text Available White mold, caused by the necrotrophic fungus (Lib. de Bary, is a major disease of common bean ( L.. WM7.1 and WM8.3 are two quantitative trait loci (QTL with major effects on tolerance to the pathogen. Advanced backcross populations segregating individually for either of the two QTL, and a recombinant inbred (RI population segregating for both QTL were used to fine map and confirm the genetic location of the QTL. The QTL intervals were physically mapped using the reference common bean genome sequence, and the physical intervals for each QTL were further confirmed by sequence-based introgression mapping. Using whole-genome sequence data from susceptible and tolerant DNA pools, introgressed regions were identified as those with significantly higher numbers of single-nucleotide polymorphisms (SNPs relative to the whole genome. By combining the QTL and SNP data, WM7.1 was located to a 660-kb region that contained 41 gene models on the proximal end of chromosome Pv07, while the WM8.3 introgression was narrowed to a 1.36-Mb region containing 70 gene models. The most polymorphic candidate gene in the WM7.1 region encodes a BEACH-domain protein associated with apoptosis. Within the WM8.3 interval, a receptor-like protein with the potential to recognize pathogen effectors was the most polymorphic gene. The use of gene and sequence-based mapping identified two candidate genes whose putative functions are consistent with the current model of pathogenicity.

  15. Integrative Functional Genomics for Systems Genetics in GeneWeaver.org.

    Science.gov (United States)

    Bubier, Jason A; Langston, Michael A; Baker, Erich J; Chesler, Elissa J

    2017-01-01

    The abundance of existing functional genomics studies permits an integrative approach to interpreting and resolving the results of diverse systems genetics studies. However, a major challenge lies in assembling and harmonizing heterogeneous data sets across species for facile comparison to the positional candidate genes and coexpression networks that come from systems genetic studies. GeneWeaver is an online database and suite of tools at www.geneweaver.org that allows for fast aggregation and analysis of gene set-centric data. GeneWeaver contains curated experimental data together with resource-level data such as GO annotations, MP annotations, and KEGG pathways, along with persistent stores of user entered data sets. These can be entered directly into GeneWeaver or transferred from widely used resources such as GeneNetwork.org. Data are analyzed using statistical tools and advanced graph algorithms to discover new relations, prioritize candidate genes, and generate function hypotheses. Here we use GeneWeaver to find genes common to multiple gene sets, prioritize candidate genes from a quantitative trait locus, and characterize a set of differentially expressed genes. Coupling a large multispecies repository curated and empirical functional genomics data to fast computational tools allows for the rapid integrative analysis of heterogeneous data for interpreting and extrapolating systems genetics results.

  16. Exome sequencing of a large family identifies potential candidate genes contributing risk to bipolar disorder.

    Science.gov (United States)

    Zhang, Tianxiao; Hou, Liping; Chen, David T; McMahon, Francis J; Wang, Jen-Chyong; Rice, John P

    2018-03-01

    Bipolar disorder is a mental illness with lifetime prevalence of about 1%. Previous genetic studies have identified multiple chromosomal linkage regions and candidate genes that might be associated with bipolar disorder. The present study aimed to identify potential susceptibility variants for bipolar disorder using 6 related case samples from a four-generation family. A combination of exome sequencing and linkage analysis was performed to identify potential susceptibility variants for bipolar disorder. Our study identified a list of five potential candidate genes for bipolar disorder. Among these five genes, GRID1(Glutamate Receptor Delta-1 Subunit), which was previously reported to be associated with several psychiatric disorders and brain related traits, is particularly interesting. Variants with functional significance in this gene were identified from two cousins in our bipolar disorder pedigree. Our findings suggest a potential role for these genes and the related rare variants in the onset and development of bipolar disorder in this one family. Additional research is needed to replicate these findings and evaluate their patho-biological significance. Copyright © 2017 Elsevier B.V. All rights reserved.

  17. The CanOE strategy: integrating genomic and metabolic contexts across multiple prokaryote genomes to find candidate genes for orphan enzymes.

    Directory of Open Access Journals (Sweden)

    Adam Alexander Thil Smith

    2012-05-01

    Full Text Available Of all biochemically characterized metabolic reactions formalized by the IUBMB, over one out of four have yet to be associated with a nucleic or protein sequence, i.e. are sequence-orphan enzymatic activities. Few bioinformatics annotation tools are able to propose candidate genes for such activities by exploiting context-dependent rather than sequence-dependent data, and none are readily accessible and propose result integration across multiple genomes. Here, we present CanOE (Candidate genes for Orphan Enzymes, a four-step bioinformatics strategy that proposes ranked candidate genes for sequence-orphan enzymatic activities (or orphan enzymes for short. The first step locates "genomic metabolons", i.e. groups of co-localized genes coding proteins catalyzing reactions linked by shared metabolites, in one genome at a time. These metabolons can be particularly helpful for aiding bioanalysts to visualize relevant metabolic data. In the second step, they are used to generate candidate associations between un-annotated genes and gene-less reactions. The third step integrates these gene-reaction associations over several genomes using gene families, and summarizes the strength of family-reaction associations by several scores. In the final step, these scores are used to rank members of gene families which are proposed for metabolic reactions. These associations are of particular interest when the metabolic reaction is a sequence-orphan enzymatic activity. Our strategy found over 60,000 genomic metabolons in more than 1,000 prokaryote organisms from the MicroScope platform, generating candidate genes for many metabolic reactions, of which more than 70 distinct orphan reactions. A computational validation of the approach is discussed. Finally, we present a case study on the anaerobic allantoin degradation pathway in Escherichia coli K-12.

  18. CADM1 is a strong neuroblastoma candidate gene that maps within a 3.72 Mb critical region of loss on 11q23

    International Nuclear Information System (INIS)

    Michels, Evi; Speleman, Frank; Hoebeeck, Jasmien; De Preter, Katleen; Schramm, Alexander; Brichard, Bénédicte; De Paepe, Anne; Eggert, Angelika; Laureys, Geneviève; Vandesompele, Jo

    2008-01-01

    Recurrent loss of part of the long arm of chromosome 11 is a well established hallmark of a subtype of aggressive neuroblastomas. Despite intensive mapping efforts to localize the culprit 11q tumour suppressor gene, this search has been unsuccessful thus far as no sufficiently small critical region could be delineated for selection of candidate genes. To refine the critical region of 11q loss, the chromosome 11 status of 100 primary neuroblastoma tumours and 29 cell lines was analyzed using a BAC array containing a chromosome 11 tiling path. For the genes mapping within our refined region of loss, meta-analysis on published neuroblastoma mRNA gene expression datasets was performed for candidate gene selection. The DNA methylation status of the resulting candidate gene was determined using re-expression experiments by treatment of neuroblastoma cells with the demethylating agent 5-aza-2'-deoxycytidine and bisulphite sequencing. Two small critical regions of loss within 11q23 at chromosomal band 11q23.1-q23.2 (1.79 Mb) and 11q23.2-q23.3 (3.72 Mb) were identified. In a first step towards further selection of candidate neuroblastoma tumour suppressor genes, we performed a meta-analysis on published expression profiles of 692 neuroblastoma tumours. Integration of the resulting candidate gene list with expression data of neuroblastoma progenitor cells pinpointed CADM1 as a compelling candidate gene. Meta-analysis indicated that CADM1 expression has prognostic significance and differential expression for the gene was noted in unfavourable neuroblastoma versus normal neuroblasts. Methylation analysis provided no evidence for a two-hit mechanism in 11q deleted cell lines. Our study puts CADM1 forward as a strong candidate neuroblastoma suppressor gene. Further functional studies are warranted to elucidate the role of CADM1 in neuroblastoma development and to investigate the possibility of CADM1 haploinsufficiency in neuroblastoma

  19. Physical mapping of the major early-onset familial Alzheimer`s disease locus on chromosome 14 and analysis of candidate gene sequences

    Energy Technology Data Exchange (ETDEWEB)

    Tanzi, R.E.; Romano, D.M.; Crowley, A.C. [Harvard Medical School, Charlestown, MA (United States)] [and others

    1994-09-01

    Genetic studies of kindreds displaying evidence for familial AD (FAD) have led to the localization of gene defects responsible for this disorder on chromosomes 14, 19, and 21. A minor early-onset FAD gene on chromosome 21 has been identified to enode the amyloid precursor protein (APP), and the late-onset FAD susceptibility locus on chromosome 19 has been shown to be in linkage disequilibrium with the E4 allele of the APOE gene. Meanwhile, the locus responsible for the major form of early-onset FAD on chromosome 14q24 has not yet been identified. By recombinational analysis, we have refined the minimal candidate region containing the gene defect to approximately 3 megabases in 14q24. We will describe our laboratory`s progress on attempts to finely localize this locus, as well as test known candidate genes from this region for either inclusion in the minimal candidate region or the presence of pathogenic mutations. Candidate genes that have been tested so far include cFOS, heat shock protein 70 member (HSF2A), transforming growth factor beta (TGFB3), the trifunctional protein C1-THF synthase (MTHFD), bradykinin receptor (BR), and the E2k component of a-ketoglutarate dehydrogenase. HSP2A, E2k, MTHFD, and BR do not map to the current defined minimal candidate region; however, sequence analysis must be performed to confirm exclusion of these genes as true candidates. Meanwhile, no pathogenic mutations have yet been found in cFOS or TGFB3. We have also isolated a large number of novel transcribed sequences from the minimal candidate region in the form of {open_quotes}trapped exons{close_quotes} from cosmids identified by hybridization to select YAC clones; we are currently in the process of searching for pathogenic mutations in these exons in affected individuals from FAD families.

  20. IGSA: Individual Gene Sets Analysis, including Enrichment and Clustering.

    Science.gov (United States)

    Wu, Lingxiang; Chen, Xiujie; Zhang, Denan; Zhang, Wubing; Liu, Lei; Ma, Hongzhe; Yang, Jingbo; Xie, Hongbo; Liu, Bo; Jin, Qing

    2016-01-01

    Analysis of gene sets has been widely applied in various high-throughput biological studies. One weakness in the traditional methods is that they neglect the heterogeneity of genes expressions in samples which may lead to the omission of some specific and important gene sets. It is also difficult for them to reflect the severities of disease and provide expression profiles of gene sets for individuals. We developed an application software called IGSA that leverages a powerful analytical capacity in gene sets enrichment and samples clustering. IGSA calculates gene sets expression scores for each sample and takes an accumulating clustering strategy to let the samples gather into the set according to the progress of disease from mild to severe. We focus on gastric, pancreatic and ovarian cancer data sets for the performance of IGSA. We also compared the results of IGSA in KEGG pathways enrichment with David, GSEA, SPIA, ssGSEA and analyzed the results of IGSA clustering and different similarity measurement methods. Notably, IGSA is proved to be more sensitive and specific in finding significant pathways, and can indicate related changes in pathways with the severity of disease. In addition, IGSA provides with significant gene sets profile for each sample.

  1. The Molecular Signatures Database (MSigDB) hallmark gene set collection.

    Science.gov (United States)

    Liberzon, Arthur; Birger, Chet; Thorvaldsdóttir, Helga; Ghandi, Mahmoud; Mesirov, Jill P; Tamayo, Pablo

    2015-12-23

    The Molecular Signatures Database (MSigDB) is one of the most widely used and comprehensive databases of gene sets for performing gene set enrichment analysis. Since its creation, MSigDB has grown beyond its roots in metabolic disease and cancer to include >10,000 gene sets. These better represent a wider range of biological processes and diseases, but the utility of the database is reduced by increased redundancy across, and heterogeneity within, gene sets. To address this challenge, here we use a combination of automated approaches and expert curation to develop a collection of "hallmark" gene sets as part of MSigDB. Each hallmark in this collection consists of a "refined" gene set, derived from multiple "founder" sets, that conveys a specific biological state or process and displays coherent expression. The hallmarks effectively summarize most of the relevant information of the original founder sets and, by reducing both variation and redundancy, provide more refined and concise inputs for gene set enrichment analysis.

  2. SNP Variation in MicroRNA Biogenesis Pathway Genes as a New Innovation Strategy for Alzheimer Disease Diagnostics: A Study of 10 Candidate Genes in an Understudied Population From the Eastern Mediterranean.

    Science.gov (United States)

    Görücü Yilmaz, Şenay; Erdal, Mehmet E; Avci Özge, Aynur; Sungur, Mehmet A

    2016-01-01

    Alzheimer disease (AD) is a common complex neurodegenerative disorder accounting for nearly 50% to 70% of dementias worldwide. Yet the current diagnostic options for AD are limited. New diagnostic innovation strategies focusing on novel molecules and pathways are sorely needed. In this connection, microRNAs (miRNAs) are conserved small noncoding RNAs that regulate posttranscriptional gene expression and are vital for neuronal development and its functional sustainability. Conceivably, biological pathways responsible for the biogenesis of miRNAs represent a veritable set of upstream candidate genes that can be potentially associated with the AD pathophysiology. Notably, whereas functional single-nucleotide polymorphisms (SNPs) in miRNA biogenesis pathway genes have been studied in other complex diseases, surprisingly, virtually no such study has been conducted on their relevance in AD. Moreover, novel diagnostics identified in easily accessible peripheral tissues such as the whole blood samples represent the initial entry or gateway points on the biomarker discovery critical path for AD. To the best of our knowledge, we report here the first association study of functional SNPs, as measured by real-time PCR in 10 "upstream" candidate genes critically situated on the miRNA biogenesis pathway, in a large sample of AD patients (N=172) and healthy controls (N=109) in a hitherto understudied world population from the Mersin region of the Eastern Mediterranean. We observed a significant association between 2 candidate genes and AD, TARBP2 rs784567 genotype and AD (χ=6.292, P=0.043), and a trend for RNASEN rs10719 genotype (χ=4.528, P=0.104) and allele (P=0.035). Functional SNP variations in the other 8 candidate genes (DGCR8, XPO5, RAN, DICER1, AGO1, AGO2, GEMIN3, and GEMIN4) did not associate with AD in our sample. Given the putative biological importance of miRNA biogenesis pathways, these emerging data can provide a new foundation to stimulate future debate and

  3. Exome Sequencing and Linkage Analysis Identified Novel Candidate Genes in Recessive Intellectual Disability Associated with Ataxia.

    Science.gov (United States)

    Jazayeri, Roshanak; Hu, Hao; Fattahi, Zohreh; Musante, Luciana; Abedini, Seyedeh Sedigheh; Hosseini, Masoumeh; Wienker, Thomas F; Ropers, Hans Hilger; Najmabadi, Hossein; Kahrizi, Kimia

    2015-10-01

    Intellectual disability (ID) is a neuro-developmental disorder which causes considerable socio-economic problems. Some ID individuals are also affected by ataxia, and the condition includes different mutations affecting several genes. We used whole exome sequencing (WES) in combination with homozygosity mapping (HM) to identify the genetic defects in five consanguineous families among our cohort study, with two affected children with ID and ataxia as major clinical symptoms. We identified three novel candidate genes, RIPPLY1, MRPL10, SNX14, and a new mutation in known gene SURF1. All are autosomal genes, except RIPPLY1, which is located on the X chromosome. Two are housekeeping genes, implicated in transcription and translation regulation and intracellular trafficking, and two encode mitochondrial proteins. The pathogenesis of these variants was evaluated by mutation classification, bioinformatic methods, review of medical and biological relevance, co-segregation studies in the particular family, and a normal population study. Linkage analysis and exome sequencing of a small number of affected family members is a powerful new technique which can be used to decrease the number of candidate genes in heterogenic disorders such as ID, and may even identify the responsible gene(s).

  4. SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate

    Science.gov (United States)

    Roffler, Gretchen H.; Amish, Stephen J.; Smith, Seth; Cosart, Ted F.; Kardos, Marty; Schwartz, Michael K.; Luikart, Gordon

    2016-01-01

    Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding and nearby 5′ and 3′ untranslated regions of chosen candidate genes. Targeted sequences were taken from bighorn sheep (Ovis canadensis) exon capture data and directly from the domestic sheep genome (Ovis aries v. 3; oviAri3). The bighorn sheep sequences used in the Dall's sheep (Ovis dalli dalli) exon capture aligned to 2350 genes on the oviAri3 genome with an average of 2 exons each. We developed a microfluidic qPCR-based SNP chip to genotype 476 Dall's sheep from locations across their range and test for patterns of selection. Using multiple corroborating approaches (lositan and bayescan), we detected 28 SNP loci potentially under selection. We additionally identified candidate loci significantly associated with latitude, longitude, precipitation and temperature, suggesting local environmental adaptation. The three methods demonstrated consistent support for natural selection on nine genes with immune and disease-regulating functions (e.g. Ovar-DRA, APC, BATF2, MAGEB18), cell regulation signalling pathways (e.g. KRIT1, PI3K, ORRC3), and respiratory health (CYSLTR1). Characterizing adaptive allele distributions from novel genetic techniques will facilitate investigation of the influence of environmental variation on local adaptation of a northern alpine ungulate throughout its range. This research demonstrated the utility of exon capture for gene-targeted SNP discovery and subsequent SNP chip genotyping using low-quality samples in a nonmodel species.

  5. RNA-Seq analysis reveals candidate genes for ontogenic resistance in Malus-Venturia pathosystem.

    Directory of Open Access Journals (Sweden)

    Michele Gusberti

    Full Text Available Ontogenic scab resistance in apple leaves and fruits is a horizontal resistance against the plant pathogen Venturia inaequalis and is expressed as a decrease in disease symptoms and incidence with the ageing of the leaves. Several studies at the biochemical level tried to unveil the nature of this resistance; however, no conclusive results were reported. We decided therefore to investigate the genetic origin of this phenomenon by performing a full quantitative transcriptome sequencing and comparison of young (susceptible and old (ontogenic resistant leaves, infected or not with the pathogen. Two time points at 72 and 96 hours post-inoculation were chosen for RNA sampling and sequencing. Comparison between the different conditions (young and old leaves, inoculated or not should allow the identification of differentially expressed genes which may represent different induced plant defence reactions leading to ontogenic resistance or may be the cause of a constitutive (uninoculated with the pathogen shift toward resistance in old leaves. Differentially expressed genes were then characterised for their function by homology to A. thaliana and other plant genes, particularly looking for genes involved in pathways already suspected of appertaining to ontogenic resistance in apple or other hosts, or to plant defence mechanisms in general. IN THIS WORK, FIVE CANDIDATE GENES PUTATIVELY INVOLVED IN THE ONTOGENIC RESISTANCE OF APPLE WERE IDENTIFIED: a gene encoding an "enhanced disease susceptibility 1 protein" was found to be down-regulated in both uninoculated and inoculated old leaves at 96 hpi, while the other four genes encoding proteins (metallothionein3-like protein, lipoxygenase, lipid transfer protein, and a peroxidase 3 were found to be constitutively up-regulated in inoculated and uninoculated old leaves. The modulation of the five candidate genes has been validated using the real-time quantitative PCR. Thus, ontogenic resistance may be the result

  6. Exome sequencing of oral squamous cell carcinoma in users of Arabian snuff reveals novel candidates for driver genes.

    Science.gov (United States)

    Al-Hebshi, Nezar Noor; Li, Shiyong; Nasher, Akram Thabet; El-Setouhy, Maged; Alsanosi, Rashad; Blancato, Jan; Loffredo, Christopher

    2016-07-15

    The study sought to identify genetic aberrations driving oral squamous cell carcinoma (OSCC) development among users of shammah, an Arabian preparation of smokeless tobacco. Twenty archival OSCC samples, 15 of which with a history of shammah exposure, were whole-exome sequenced at an average depth of 127×. Somatic mutations were identified using a novel, matched controls-independent filtration algorithm. CODEX and Exomedepth coupled with a novel, Database of Genomic Variant-based filter were employed to call somatic gene-copy number variations. Significantly mutated genes were identified with Oncodrive FM and the Youn and Simon's method. Candidate driver genes were nominated based on Gene Set Enrichment Analysis. The observed mutational spectrum was similar to that reported by the TCGA project. In addition to confirming known genes of OSCC (TP53, CDKNA2, CASP8, PIK3CA, HRAS, FAT1, TP63, CCND1 and FADD) the analysis identified several candidate novel driver events including mutations of NOTCH3, CSMD3, CRB1, CLTCL1, OSMR and TRPM2, amplification of the proto-oncogenes FOSL1, RELA, TRAF6, MDM2, FRS2 and BAG1, and deletion of the recently described tumor suppressor SMARCC1. Analysis also revealed significantly altered pathways not previously implicated in OSCC including Oncostatin-M signalling pathway, AP-1 and C-MYB transcription networks and endocytosis. There was a trend for higher number of mutations, amplifications and driver events in samples with history of shammah exposure particularly those that tested EBV positive, suggesting an interaction between tobacco exposure and EBV. The work provides further evidence for the genetic heterogeneity of oral cancer and suggests shammah-associated OSCC is characterized by extensive amplification of oncogenes. © 2016 UICC.

  7. SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate

    Science.gov (United States)

    Gretchen H. Roffler; Stephen J. Amish; Seth Smith; Ted Cosart; Marty Kardos; Michael K. Schwartz; Gordon Luikart

    2016-01-01

    Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding...

  8. The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.

    Science.gov (United States)

    Motamayor, Juan C; Mockaitis, Keithanne; Schmutz, Jeremy; Haiminen, Niina; Livingstone, Donald; Cornejo, Omar; Findley, Seth D; Zheng, Ping; Utro, Filippo; Royaert, Stefan; Saski, Christopher; Jenkins, Jerry; Podicheti, Ram; Zhao, Meixia; Scheffler, Brian E; Stack, Joseph C; Feltus, Frank A; Mustiga, Guiliana M; Amores, Freddy; Phillips, Wilbert; Marelli, Jean Philippe; May, Gregory D; Shapiro, Howard; Ma, Jianxin; Bustamante, Carlos D; Schnell, Raymond J; Main, Dorrie; Gilbert, Don; Parida, Laxmi; Kuhn, David N

    2013-06-03

    Theobroma cacao L. cultivar Matina 1-6 belongs to the most cultivated cacao type. The availability of its genome sequence and methods for identifying genes responsible for important cacao traits will aid cacao researchers and breeders. We describe the sequencing and assembly of the genome of Theobroma cacao L. cultivar Matina 1-6. The genome of the Matina 1-6 cultivar is 445 Mbp, which is significantly larger than a sequenced Criollo cultivar, and more typical of other cultivars. The chromosome-scale assembly, version 1.1, contains 711 scaffolds covering 346.0 Mbp, with a contig N50 of 84.4 kbp, a scaffold N50 of 34.4 Mbp, and an evidence-based gene set of 29,408 loci. Version 1.1 has 10x the scaffold N50 and 4x the contig N50 as Criollo, and includes 111 Mb more anchored sequence. The version 1.1 assembly has 4.4% gap sequence, while Criollo has 10.9%. Through a combination of haplotype, association mapping and gene expression analyses, we leverage this robust reference genome to identify a promising candidate gene responsible for pod color variation. We demonstrate that green/red pod color in cacao is likely regulated by the R2R3 MYB transcription factor TcMYB113, homologs of which determine pigmentation in Rosaceae, Solanaceae, and Brassicaceae. One SNP within the target site for a highly conserved trans-acting siRNA in dicots, found within TcMYB113, seems to affect transcript levels of this gene and therefore pod color variation. We report a high-quality sequence and annotation of Theobroma cacao L. and demonstrate its utility in identifying candidate genes regulating traits.

  9. Transferability of microsatellite markers located in candidate genes for wood properties between Eucalyptus species

    Directory of Open Access Journals (Sweden)

    Cintia V. Acuña

    2014-12-01

    Full Text Available Aim of study:  To analyze the feasibility of extrapolating conclusions on wood quality genetic control between different Eucalyptus species, particularly from species with better genomic information, to those less characterized. For this purpose, the first step is to analyze the conservation and cross-transferability of microsatellites markers (SSRs located in candidate genes.Area of study: Eucalyptus species implanted in Argentina coming from different Australian origins.Materials and methods: Twelve validated and polymorphic SSRs in candidate genes (SSR-CGs for wood quality in E. globulus were selected for cross species amplification in six species: E. grandis, E. saligna, E. dunnii, E. viminalis, E. camaldulensis and E. tereticornis.Main results: High cross-species transferability (92% to 100% was found for the 12 polymorphic SSRs detected in E. globulus. These markers revealed allelic diversity in nine important candidate genes: cinnamoyl CoA reductase (CCR, cellulose synthase 3 (CesA3, the transcription factor LIM1, homocysteine S-methyltransferase (HMT, shikimate kinase (SK, xyloglucan endotransglycosylase 2 (XTH2, glutathione S-transferase (GST, glutamate decarboxylase (GAD and peroxidase (PER.Research highlights: The markers described are potentially suitable for comparative QTL mapping, molecular marker assisted breeding (MAB and for population genetic studies across different species within the subgenus Symphyomyrtus.Keywords: validation; cross-transferability; SSR; functional markers; eucalypts; Symphyomyrtus.

  10. Identification of microdeletions in candidate genes for cleft lip and/or palate

    DEFF Research Database (Denmark)

    Shi, Min; Mostowska, Adrianna; Jugessur, Astanand

    2009-01-01

    for deletion detection. Apparent Mendelian inconsistencies between parents and children suggested deletion events in 15 individuals in 11 genomic regions. We confirmed deletions involving CYP1B1, FGF10, SP8, SUMO1, TBX1, TFAP2A, and UGT7A1, including both de novo and familial cases. Deletions of SUMO1, TBX1......, and TFAP2A are likely to be etiologic. CONCLUSIONS: These deletions suggest the potential roles of genes or regulatory elements contained within deleted regions in the etiology of clefting. Our analysis took advantage of genotypes from a candidate-gene-based SNP survey and proved to be an efficient...... analytical approach to interrogate genes potentially involved in clefting. This can serve as a model to find genes playing a role in complex traits in general....

  11. Targeting 160 candidate genes for blood pressure regulation with a genome-wide genotyping array.

    Directory of Open Access Journals (Sweden)

    Siim Sõber

    2009-06-01

    Full Text Available The outcome of Genome-Wide Association Studies (GWAS has challenged the field of blood pressure (BP genetics as previous candidate genes have not been among the top loci in these scans. We used Affymetrix 500K genotyping data of KORA S3 cohort (n = 1,644; Southern-Germany to address (i SNP coverage in 160 BP candidate genes; (ii the evidence for associations with BP traits in genome-wide and replication data, and haplotype analysis. In total, 160 gene regions (genic region+/-10 kb covered 2,411 SNPs across 11.4 Mb. Marker densities in genes varied from 0 (n = 11 to 0.6 SNPs/kb. On average 52.5% of the HAPMAP SNPs per gene were captured. No evidence for association with BP was obtained for 1,449 tested SNPs. Considerable associations (P50% of HAPMAP SNPs were tagged. In general, genes with higher marker density (>0.2 SNPs/kb revealed a better chance to reach close to significance associations. Although, none of the detected P-values remained significant after Bonferroni correction (P<0.05/2319, P<2.15 x 10(-5, the strength of some detected associations was close to this level: rs10889553 (LEPR and systolic BP (SBP (P = 4.5 x 10(-5 as well as rs10954174 (LEP and diastolic BP (DBP (P = 5.20 x 10(-5. In total, 12 markers in 7 genes (ADRA2A, LEP, LEPR, PTGER3, SLC2A1, SLC4A2, SLC8A1 revealed considerable association (P<10(-3 either with SBP, DBP, and/or hypertension (HYP. None of these were confirmed in replication samples (KORA S4, HYPEST, BRIGHT. However, supportive evidence for the association of rs10889553 (LEPR and rs11195419 (ADRA2A with BP was obtained in meta-analysis across samples stratified either by body mass index, smoking or alcohol consumption. Haplotype analysis highlighted LEPR and PTGER3. In conclusion, the lack of associations in BP candidate genes may be attributed to inadequate marker coverage on the genome-wide arrays, small phenotypic effects of the loci and/or complex interaction with life-style and metabolic parameters.

  12. Candidate Smoke Region Segmentation of Fire Video Based on Rough Set Theory

    Directory of Open Access Journals (Sweden)

    Yaqin Zhao

    2015-01-01

    Full Text Available Candidate smoke region segmentation is the key link of smoke video detection; an effective and prompt method of candidate smoke region segmentation plays a significant role in a smoke recognition system. However, the interference of heavy fog and smoke-color moving objects greatly degrades the recognition accuracy. In this paper, a novel method of candidate smoke region segmentation based on rough set theory is presented. First, Kalman filtering is used to update video background in order to exclude the interference of static smoke-color objects, such as blue sky. Second, in RGB color space smoke regions are segmented by defining the upper approximation, lower approximation, and roughness of smoke-color distribution. Finally, in HSV color space small smoke regions are merged by the definition of equivalence relation so as to distinguish smoke images from heavy fog images in terms of V component value variety from center to edge of smoke region. The experimental results on smoke region segmentation demonstrated the effectiveness and usefulness of the proposed scheme.

  13. Prioritizing chronic obstructive pulmonary disease (COPD) candidate genes in COPD-related networks.

    Science.gov (United States)

    Zhang, Yihua; Li, Wan; Feng, Yuyan; Guo, Shanshan; Zhao, Xilei; Wang, Yahui; He, Yuehan; He, Weiming; Chen, Lina

    2017-11-28

    Chronic obstructive pulmonary disease (COPD) is a multi-factor disease, which could be caused by many factors, including disturbances of metabolism and protein-protein interactions (PPIs). In this paper, a weighted COPD-related metabolic network and a weighted COPD-related PPI network were constructed base on COPD disease genes and functional information. Candidate genes in these weighted COPD-related networks were prioritized by making use of a gene prioritization method, respectively. Literature review and functional enrichment analysis of the top 100 genes in these two networks suggested the correlation of COPD and these genes. The performance of our gene prioritization method was superior to that of ToppGene and ToppNet for genes from the COPD-related metabolic network or the COPD-related PPI network after assessing using leave-one-out cross-validation, literature validation and functional enrichment analysis. The top-ranked genes prioritized from COPD-related metabolic and PPI networks could promote the better understanding about the molecular mechanism of this disease from different perspectives. The top 100 genes in COPD-related metabolic network or COPD-related PPI network might be potential markers for the diagnosis and treatment of COPD.

  14. Candidate gene association study in type 2 diabetes indicates a role for genes involved in beta-cell function as well as insulin action.

    Directory of Open Access Journals (Sweden)

    Inês Barroso

    2003-10-01

    Full Text Available Type 2 diabetes is an increasingly common, serious metabolic disorder with a substantial inherited component. It is characterised by defects in both insulin secretion and action. Progress in identification of specific genetic variants predisposing to the disease has been limited. To complement ongoing positional cloning efforts, we have undertaken a large-scale candidate gene association study. We examined 152 SNPs in 71 candidate genes for association with diabetes status and related phenotypes in 2,134 Caucasians in a case-control study and an independent quantitative trait (QT cohort in the United Kingdom. Polymorphisms in five of 15 genes (33% encoding molecules known to primarily influence pancreatic beta-cell function-ABCC8 (sulphonylurea receptor, KCNJ11 (KIR6.2, SLC2A2 (GLUT2, HNF4A (HNF4alpha, and INS (insulin-significantly altered disease risk, and in three genes, the risk allele, haplotype, or both had a biologically consistent effect on a relevant physiological trait in the QT study. We examined 35 genes predicted to have their major influence on insulin action, and three (9%-INSR, PIK3R1, and SOS1-showed significant associations with diabetes. These results confirm the genetic complexity of Type 2 diabetes and provide evidence that common variants in genes influencing pancreatic beta-cell function may make a significant contribution to the inherited component of this disease. This study additionally demonstrates that the systematic examination of panels of biological candidate genes in large, well-characterised populations can be an effective complement to positional cloning approaches. The absence of large single-gene effects and the detection of multiple small effects accentuate the need for the study of larger populations in order to reliably identify the size of effect we now expect for complex diseases.

  15. Selection and validation of a set of reliable reference genes for quantitative RT-PCR studies in the brain of the Cephalopod Mollusc Octopus vulgaris

    Directory of Open Access Journals (Sweden)

    Biffali Elio

    2009-07-01

    Full Text Available Abstract Background Quantitative real-time polymerase chain reaction (RT-qPCR is valuable for studying the molecular events underlying physiological and behavioral phenomena. Normalization of real-time PCR data is critical for a reliable mRNA quantification. Here we identify reference genes to be utilized in RT-qPCR experiments to normalize and monitor the expression of target genes in the brain of the cephalopod mollusc Octopus vulgaris, an invertebrate. Such an approach is novel for this taxon and of advantage in future experiments given the complexity of the behavioral repertoire of this species when compared with its relatively simple neural organization. Results We chose 16S, and 18S rRNA, actB, EEF1A, tubA and ubi as candidate reference genes (housekeeping genes, HKG. The expression of 16S and 18S was highly variable and did not meet the requirements of candidate HKG. The expression of the other genes was almost stable and uniform among samples. We analyzed the expression of HKG into two different set of animals using tissues taken from the central nervous system (brain parts and mantle (here considered as control tissue by BestKeeper, geNorm and NormFinder. We found that HKG expressions differed considerably with respect to brain area and octopus samples in an HKG-specific manner. However, when the mantle is treated as control tissue and the entire central nervous system is considered, NormFinder revealed tubA and ubi as the most suitable HKG pair. These two genes were utilized to evaluate the relative expression of the genes FoxP, creb, dat and TH in O. vulgaris. Conclusion We analyzed the expression profiles of some genes here identified for O. vulgaris by applying RT-qPCR analysis for the first time in cephalopods. We validated candidate reference genes and found the expression of ubi and tubA to be the most appropriate to evaluate the expression of target genes in the brain of different octopuses. Our results also underline the

  16. Association of Candidate Genes With Submergence Response in Perennial Ryegrass

    Directory of Open Access Journals (Sweden)

    Xicheng Wang

    2017-05-01

    Full Text Available Perennial ryegrass is a popular cool-season grass species due to its high quality for forage and turf. The objective of this study was to identify associations of candidate genes with growth and physiological traits to submergence stress and recovery after de-submergence in a global collection of 94 perennial ryegrass accessions. Accessions varied largely in leaf color, plant height (HT, leaf fresh weight (LFW, leaf dry weight (LDW, and chlorophyll fluorescence (Fv/Fm at 7 days of submergence and in HT, LFW and LDW at 7 days of recovery in two experiments. Among 26 candidate genes tested by various models, single nucleotide polymorphisms (SNPs in 10 genes showed significant associations with traits including 16 associations for control, 10 for submergence, and 8 for recovery. Under submergence, Lp1-SST encoding sucrose:sucrose 1-fructosyltransferase and LpGA20ox encoding gibberellin 20-oxidase were associated with LFW and LDW, and LpACO1 encoding 1-aminocyclopropane-1-carboxylic acid oxidase was associated with LFW. Associations between Lp1-SST and HT, Lp6G-FFT encoding fructan:fructan 6G-fructosyltransferase and Fv/Fm, LpCAT encoding catalase and HT were also detected under submergence stress. Upon de-submergence, Lp1-SST, Lp6G-FFT, and LpPIP1 encoding plasma membrane intrinsic protein type 1 were associated with LFW or LDW, while LpCBF1b encoding C-repeat binding factor were associated with HT. Nine significant SNPs in Lp1-SST, Lp6G-FFT, LpCAT, and LpACO1 resulted in amino acid changes with five substitutions found in Lp1-SST under submergence or recovery. The results indicated that allelic diversity in genes involved in carbohydrate and antioxidant metabolism, ethylene and gibberellin biosynthesis, and transcript factor could contribute to growth variations in perennial ryegrass under submergence stress and recovery after de-submergence.

  17. Polymorphism’s assessment of children’s candidate genes associated with low-level long-term exposure to strontium in drinking water

    OpenAIRE

    N.V. Zaitseva; O.V. Dolgilh; A.V. Krivtsov; K.G. Starkova; V.A. Luchnikova; O.A. Bubnov; E.A. Otavina; N.V. Bezruchenko; N.A. Vdovina

    2015-01-01

    A sequencing of the candidate genes of the pupils, exposed to strontium by the method of targeted resequencing has been performed. It is shown, that under conditions of increased revenues of strontium in drinking water the number of polymorphonuclear altered portions of candidate genes increases. As a result of the targeted resequencing in conditions of strontium exposure, the maximum polymorph modifications of the following genes are defined: sulfotransferase 1A1 (SULT1A1) and methylenetetra...

  18. Identification of KIF3A as a Novel Candidate Gene for Childhood Asthma Using RNA Expression and Population Allelic Frequencies Differences

    Science.gov (United States)

    Butsch Kovacic, Melinda; Biagini Myers, Jocelyn M.; Wang, Ning; Martin, Lisa J.; Lindsey, Mark; Ericksen, Mark B.; He, Hua; Patterson, Tia L.; Baye, Tesfaye M.; Torgerson, Dara; Roth, Lindsey A.; Gupta, Jayanta; Sivaprasad, Umasundari; Gibson, Aaron M.; Tsoras, Anna M.; Hu, Donglei; Eng, Celeste; Chapela, Rocío; Rodríguez-Santana, José R.; Rodríguez-Cintrón, William; Avila, Pedro C.; Beckman, Kenneth; Seibold, Max A.; Gignoux, Chris; Musaad, Salma M.; Chen, Weiguo; Burchard, Esteban González; Khurana Hershey, Gurjit K.

    2011-01-01

    Background Asthma is a chronic inflammatory disease with a strong genetic predisposition. A major challenge for candidate gene association studies in asthma is the selection of biologically relevant genes. Methodology/Principal Findings Using epithelial RNA expression arrays, HapMap allele frequency variation, and the literature, we identified six possible candidate susceptibility genes for childhood asthma including ADCY2, DNAH5, KIF3A, PDE4B, PLAU, SPRR2B. To evaluate these genes, we compared the genotypes of 194 predominantly tagging SNPs in 790 asthmatic, allergic and non-allergic children. We found that SNPs in all six genes were nominally associated with asthma (pasthma (OR = 2.3, pasthma population attributable risk of 18.5%. The association between KIF3A rs7737031 and asthma was validated in 3 independent populations, further substantiating the validity of our gene selection approach. Conclusions/Significance Our study demonstrates that KIF3A, a member of the kinesin superfamily of microtubule associated motors that are important in the transport of protein complexes within cilia, is a novel candidate gene for childhood asthma. Polymorphisms in KIF3A may in part be responsible for poor mucus and/or allergen clearance from the airways. Furthermore, our study provides a promising framework for the identification and evaluation of novel candidate susceptibility genes. PMID:21912604

  19. No Association between Variation in Longevity Candidate Genes and Aging-related Phenotypes in Oldest-old Danes

    DEFF Research Database (Denmark)

    Sørensen, Mette; Nygaard, Marianne; Debrabant, Birgit

    2016-01-01

    additional genes repeatedly considered as candidates for human longevity: APOE, APOA4, APOC3, ACE, CETP, HFE, IL6, IL6R, MTHFR, TGFB1, SIRTs 1, 3, 6; and HSPAs 1A, 1L, 14. Altogether, 1,049 single nucleotide polymorphisms (SNPs) were genotyped in 1,088 oldest-old (age 92-93 years) Danes and analysed......In this study we explored the association between aging-related phenotypes previously reported to predict survival in old age and variation in 77 genes from the DNA repair pathway, 32 genes from the growth hormone 1/ insulin-like growth factor 1/insulin (GH/IGF-1/INS) signalling pathway and 16...... in the relevant phenotype over time (7 years of follow-up) and none of the SNPs could be confirmed in a replication sample of 1,281 oldest-old Danes (age 94-100). Hence, our study does not support association between common variation in the investigated longevity candidate genes and aging-related phenotypes...

  20. Exomic sequencing of immune-related genes reveals novel candidate variants associated with alopecia universalis.

    Directory of Open Access Journals (Sweden)

    Seungbok Lee

    Full Text Available Alopecia areata (AA is a common autoimmune disorder mostly presented as round patches of hair loss and subclassified into alopecia totalis/alopecia universalis (AT/AU based on the area of alopecia. Although AA is relatively common, only 5% of AA patients progress to AT/AU, which affect the whole scalp and whole body respectively. To determine genetic determinants of this orphan disease, we undertook whole-exome sequencing of 6 samples from AU patients, and 26 variants in immune-related genes were selected as candidates. When an additional 14 AU samples were genotyped for these candidates, 6 of them remained at the level of significance in comparison with 155 Asian controls (p<1.92×10(-3. Linkage disequilibrium was observed between some of the most significant SNPs, including rs41559420 of HLA-DRB5 (p<0.001, OR 44.57 and rs28362679 of BTNL2 (p<0.001, OR 30.21. While BTNL2 was reported as a general susceptibility gene of AA previously, HLA-DRB5 has not been implicated in AA. In addition, we found several genetic variants in novel genes (HLA-DMB, TLR1, and PMS2 and discovered an additional locus on HLA-A, a known susceptibility gene of AA. This study provides further evidence for the association of previously reported genes with AA and novel findings such as HLA-DRB5, which might represent a hidden culprit gene for AU.

  1. Early embryonic failure: Expression and imprinted status of candidate genes on human chromosome 21

    Energy Technology Data Exchange (ETDEWEB)

    Sherman, L.S.; Bennett, P.R.; Moore, G.E. [Queen Charlotte`s and Chelsea Hospital, London (United Kingdom)

    1994-09-01

    Two cases of maternal uniparental (hetero)disomy for human chromosome 21 (mUPD21) have been identified in a systematic search for UPD in 23 cases of early embryonic failure (EEF). Bi-parental origin of the other chromosome pairs was confirmed using specific VNTR probes or dinucleotide repeat analysis. Both maternally and paternally derived isochromosomes 21q have previously been identified in two individuals with normal phenotypes. Full UPD21 has a different mechanism of origin than uniparental isochromosome 21q and its effect on imprinted genes and phenotypic outcome will therefore not necessarily be the same. EEF associated with mUPD21 suggests that developmentally important genes on HSA 21 may be imprinted such that they are only expressed from either the maternally or paternally derived alleles. We have searched for monoallelic expression of candidate genes on HSA 21 in human pregnancy (CBS, IFNAR, COL6A1) using intragenic DNA polymorphisms. These genes were chosen either because their murine homologues lie in imprinted regions or because they are potentially important in embryogenesis. Once imprinted candidate genes have been identified, their methylation status and expression in normal, early embryonic failure and uniparental disomy 21 pregnancies will be studied. At the same time, a larger number of cases of EEF are being examined to further investigate the incidence of UPD21 in this group.

  2. Polymorphism’s assessment of children’s candidate genes associated with low-level long-term exposure to strontium in drinking water

    Directory of Open Access Journals (Sweden)

    N.V. Zaitseva

    2015-12-01

    Full Text Available A sequencing of the candidate genes of the pupils, exposed to strontium by the method of targeted resequencing has been performed. It is shown, that under conditions of increased revenues of strontium in drinking water the number of polymorphonuclear altered portions of candidate genes increases. As a result of the targeted resequencing in conditions of strontium exposure, the maximum polymorph modifications of the following genes are defined: sulfotransferase 1A1 (SULT1A1 and methylenetetrahydrofolate. It was shown that the structure of the mutations in conditions of the strontium exposure was characterized by the formation of defects in the gene mapping detoxification (38.5 % of all mutations and immunoregulation (22.5 %. Analysis of the cause-effect relationships in the system "factor - the number of mutations" revealed that candidate genes reflecting strontium exposure conditions (content of strontium in drinking water is 1.3 MAC, are genes: cytochrome P450, glutathione - transaminase (detoxification; dopamine (CNS, interleukin 17 and the major histocompatibility complex (immune system, methylene-tetra-hydro-folate-reductase (reproduction.

  3. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    Science.gov (United States)

    2013-01-01

    Background Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set analysis (GSA) methods for chemical treatment identification, for pharmacological mechanism elucidation, and for comparing compound toxicity profiles. Methods We created 30,211 chemical response-specific gene sets for human and mouse by next-gen TM, and derived 1,189 (human) and 588 (mouse) gene sets from the Comparative Toxicogenomics Database (CTD). We tested for significant differential expression (SDE) (false discovery rate -corrected p-values sets and the CTD-derived gene sets in gene expression (GE) data sets of five chemicals (from experimental models). We tested for SDE of gene sets for six fibrates in a peroxisome proliferator-activated receptor alpha (PPARA) knock-out GE dataset and compared to results from the Connectivity Map. We tested for SDE of 319 next-gen TM-derived gene sets for environmental toxicants in three GE data sets of triazoles, and tested for SDE of 442 gene sets associated with embryonic structures. We compared the gene sets to triazole effects seen in the Whole Embryo Culture (WEC), and used principal component analysis (PCA) to discriminate triazoles from other chemicals. Results Next-gen TM-derived gene sets matching the chemical treatment were significantly altered in three GE data sets, and the corresponding CTD-derived gene sets were significantly altered in five GE data sets. Six next-gen TM-derived and four CTD-derived fibrate gene sets were significantly altered in the PPARA knock-out GE dataset. None of the fibrate signatures in cMap scored significant against the PPARA GE signature. 33 environmental toxicant gene sets were significantly altered in the triazole GE data sets. 21 of these toxicants had a similar toxicity pattern as the

  4. Linkage study of nonsyndromic cleft lip with or without cleft palate using candidate genes and mapped polymorphic markers

    Energy Technology Data Exchange (ETDEWEB)

    Stein, J.D.; Nelson, L.D.; Conner, B.J. [Univ. of Texas, Houston (United States)] [and others

    1994-09-01

    Nonsyndromic cleft lip with or without cleft palate (CL(P)) involves fusion or growth failure of facial primordia during development. Complex segregation analysis of clefting populations suggest that an autosomal dominant gene may play a role in this common craniofacial disorder. We have ascertained 16 multigenerational families with CL(P) and tested linkage to 29 candidate genes and 139 mapped short tandem repeat markers. The candidate genes were selected based on their expression in craniofacial development or were identified through murine models. These include: TGF{alpha}, TGF{beta}1, TGF{beta}2, TGF{beta}3, EGF, EGFR, GRAS, cMyc, FGFR, Jun, JunB, PDFG{alpha}, PDGF{beta}, IGF2R, GCR Hox7, Hox8, Hox2B, twirler, 5 collagen and 3 extracellular matrix genes. Linkage was tested assuming an autosomal dominant model with sex-specific decreased penetrance. Linkage to all of the candidate loci was excluded in 11 families. RARA was tested and was not informative. However, haplotype analysis of markers flanking RARA on 17q allowed exclusion of this candidate locus. We have previously excluded linkage to 61 STR markers in 11 families. Seventy-eight mapped short tandem repeat markers have recently been tested in 16 families and 30 have been excluded. The remaining are being analyzed and an exclusion map is being developed based on the entire study results.

  5. A data science approach to candidate gene selection of pain regarded as a process of learning and neural plasticity.

    Science.gov (United States)

    Ultsch, Alfred; Kringel, Dario; Kalso, Eija; Mogil, Jeffrey S; Lötsch, Jörn

    2016-12-01

    The increasing availability of "big data" enables novel research approaches to chronic pain while also requiring novel techniques for data mining and knowledge discovery. We used machine learning to combine the knowledge about n = 535 genes identified empirically as relevant to pain with the knowledge about the functions of thousands of genes. Starting from an accepted description of chronic pain as displaying systemic features described by the terms "learning" and "neuronal plasticity," a functional genomics analysis proposed that among the functions of the 535 "pain genes," the biological processes "learning or memory" (P = 8.6 × 10) and "nervous system development" (P = 2.4 × 10) are statistically significantly overrepresented as compared with the annotations to these processes expected by chance. After establishing that the hypothesized biological processes were among important functional genomics features of pain, a subset of n = 34 pain genes were found to be annotated with both Gene Ontology terms. Published empirical evidence supporting their involvement in chronic pain was identified for almost all these genes, including 1 gene identified in March 2016 as being involved in pain. By contrast, such evidence was virtually absent in a randomly selected set of 34 other human genes. Hence, the present computational functional genomics-based method can be used for candidate gene selection, providing an alternative to established methods.

  6. Type 1 Diabetes Candidate Genes Linked to Pancreatic Islet Cell Inflammation and Beta-Cell Apoptosis

    DEFF Research Database (Denmark)

    Størling, Joachim; Pociot, Flemming

    2017-01-01

    (GWAS) have identified more than 50 genetic regions that affect the risk of developing T1D. Most of these susceptibility loci, however, harbor several genes, and the causal variant(s) and gene(s) for most of the loci remain to be established. A significant part of the genes located in the T1D...... susceptibility loci are expressed in human islets and β cells and mounting evidence suggests that some of these genes modulate the β-cell response to the immune system and viral infection and regulate apoptotic β-cell death. Here, we discuss the current status of T1D susceptibility loci and candidate genes...

  7. The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color

    Science.gov (United States)

    2013-01-01

    Background Theobroma cacao L. cultivar Matina 1-6 belongs to the most cultivated cacao type. The availability of its genome sequence and methods for identifying genes responsible for important cacao traits will aid cacao researchers and breeders. Results We describe the sequencing and assembly of the genome of Theobroma cacao L. cultivar Matina 1-6. The genome of the Matina 1-6 cultivar is 445 Mbp, which is significantly larger than a sequenced Criollo cultivar, and more typical of other cultivars. The chromosome-scale assembly, version 1.1, contains 711 scaffolds covering 346.0 Mbp, with a contig N50 of 84.4 kbp, a scaffold N50 of 34.4 Mbp, and an evidence-based gene set of 29,408 loci. Version 1.1 has 10x the scaffold N50 and 4x the contig N50 as Criollo, and includes 111 Mb more anchored sequence. The version 1.1 assembly has 4.4% gap sequence, while Criollo has 10.9%. Through a combination of haplotype, association mapping and gene expression analyses, we leverage this robust reference genome to identify a promising candidate gene responsible for pod color variation. We demonstrate that green/red pod color in cacao is likely regulated by the R2R3 MYB transcription factor TcMYB113, homologs of which determine pigmentation in Rosaceae, Solanaceae, and Brassicaceae. One SNP within the target site for a highly conserved trans-acting siRNA in dicots, found within TcMYB113, seems to affect transcript levels of this gene and therefore pod color variation. Conclusions We report a high-quality sequence and annotation of Theobroma cacao L. and demonstrate its utility in identifying candidate genes regulating traits. PMID:23731509

  8. Comparative Genomic Analysis of Neutrophilic Iron(II Oxidizer Genomes for Candidate Genes in Extracellular Electron Transfer

    Directory of Open Access Journals (Sweden)

    Shaomei He

    2017-08-01

    Full Text Available Extracellular electron transfer (EET is recognized as a key biochemical process in circumneutral pH Fe(II-oxidizing bacteria (FeOB. In this study, we searched for candidate EET genes in 73 neutrophilic FeOB genomes, among which 43 genomes are complete or close-to-complete and the rest have estimated genome completeness ranging from 5 to 91%. These neutrophilic FeOB span members of the microaerophilic, anaerobic phototrophic, and anaerobic nitrate-reducing FeOB groups. We found that many microaerophilic and several anaerobic FeOB possess homologs of Cyc2, an outer membrane cytochrome c originally identified in Acidithiobacillus ferrooxidans. The “porin-cytochrome c complex” (PCC gene clusters homologous to MtoAB/PioAB are present in eight FeOB, accounting for 19% of complete and close-to-complete genomes examined, whereas PCC genes homologous to OmbB-OmaB-OmcB in Geobacter sulfurreducens are absent. Further, we discovered gene clusters that may potentially encode two novel PCC types. First, a cluster (tentatively named “PCC3” encodes a porin, an extracellular and a periplasmic cytochrome c with remarkably large numbers of heme-binding motifs. Second, a cluster (tentatively named “PCC4” encodes a porin and three periplasmic multiheme cytochromes c. A conserved inner membrane protein (IMP encoded in PCC3 and PCC4 gene clusters might be responsible for translocating electrons across the inner membrane. Other bacteria possessing PCC3 and PCC4 are mostly Proteobacteria isolated from environments with a potential niche for Fe(II oxidation. In addition to cytochrome c, multicopper oxidase (MCO genes potentially involved in Fe(II oxidation were also identified. Notably, candidate EET genes were not found in some FeOB, especially the anaerobic ones, probably suggesting EET genes or Fe(II oxidation mechanisms are different from the searched models. Overall, based on current EET models, the search extends our understanding of bacterial EET and

  9. Expression and functional assessment of candidate type 2 diabetes susceptibility genes identify four new genes contributing to human insulin secretion

    Directory of Open Access Journals (Sweden)

    Fatou K. Ndiaye

    2017-06-01

    Full Text Available Objectives: Genome-wide association studies (GWAS have identified >100 loci independently contributing to type 2 diabetes (T2D risk. However, translational implications for precision medicine and for the development of novel treatments have been disappointing, due to poor knowledge of how these loci impact T2D pathophysiology. Here, we aimed to measure the expression of genes located nearby T2D associated signals and to assess their effect on insulin secretion from pancreatic beta cells. Methods: The expression of 104 candidate T2D susceptibility genes was measured in a human multi-tissue panel, through PCR-free expression assay. The effects of the knockdown of beta-cell enriched genes were next investigated on insulin secretion from the human EndoC-βH1 beta-cell line. Finally, we performed RNA-sequencing (RNA-seq so as to assess the pathways affected by the knockdown of the new genes impacting insulin secretion from EndoC-βH1, and we analyzed the expression of the new genes in mouse models with altered pancreatic beta-cell function. Results: We found that the candidate T2D susceptibility genes' expression is significantly enriched in pancreatic beta cells obtained by laser capture microdissection or sorted by flow cytometry and in EndoC-βH1 cells, but not in insulin sensitive tissues. Furthermore, the knockdown of seven T2D-susceptibility genes (CDKN2A, GCK, HNF4A, KCNK16, SLC30A8, TBC1D4, and TCF19 with already known expression and/or function in beta cells changed insulin secretion, supporting our functional approach. We showed first evidence for a role in insulin secretion of four candidate T2D-susceptibility genes (PRC1, SRR, ZFAND3, and ZFAND6 with no previous knowledge of presence and function in beta cells. RNA-seq in EndoC-βH1 cells with decreased expression of PRC1, SRR, ZFAND6, or ZFAND3 identified specific gene networks related to T2D pathophysiology. Finally, a positive correlation between the expression of Ins2 and the

  10. A comprehensive candidate gene approach identifies genetic variation associated with osteosarcoma

    International Nuclear Information System (INIS)

    Mirabello, Lisa; Grotmol, Tom; Douglass, Chester; Hayes, Richard B; Hoover, Robert N; Savage, Sharon A; Yu, Kai; Berndt, Sonja I; Burdett, Laurie; Wang, Zhaoming; Chowdhury, Salma; Teshome, Kedest; Uzoka, Arinze; Hutchinson, Amy

    2011-01-01

    Osteosarcoma (OS) is a bone malignancy which occurs primarily in adolescents. Since it occurs during a period of rapid growth, genes important in bone formation and growth are plausible modifiers of risk. Genes involved in DNA repair and ribosomal function may contribute to OS pathogenesis, because they maintain the integrity of critical cellular processes. We evaluated these hypotheses in an OS association study of genes from growth/hormone, bone formation, DNA repair, and ribosomal pathways. We evaluated 4836 tag-SNPs across 255 candidate genes in 96 OS cases and 1426 controls. Logistic regression models were used to estimate the odds ratios (OR) and 95% confidence intervals (CI). Twelve SNPs in growth or DNA repair genes were significantly associated with OS after Bonferroni correction. Four SNPs in the DNA repair gene FANCM (ORs 1.9-2.0, P = 0.003-0.004) and 2 SNPs downstream of the growth hormone gene GH1 (OR 1.6, P = 0.002; OR 0.5, P = 0.0009) were significantly associated with OS. One SNP in the region of each of the following genes was significant: MDM2, MPG, FGF2, FGFR3, GNRH2, and IGF1. Our results suggest that several SNPs in biologically plausible pathways are associated with OS. Larger studies are required to confirm our findings

  11. Bioinformatics-Driven Identification and Examination of Candidate Genes for Non-Alcoholic Fatty Liver Disease

    DEFF Research Database (Denmark)

    Banasik, Karina; Justesen, Johanne M.; Hornbak, Malene

    2011-01-01

    Objective: Candidate genes for non-alcoholic fatty liver disease (NAFLD) identified by a bioinformatics approach were examined for variant associations to quantitative traits of NAFLD-related phenotypes. Research Design and Methods: By integrating public database text mining, trans-organism protein...

  12. A meta-analysis based method for prioritizing candidate genes involved in a pre-specific function

    Directory of Open Access Journals (Sweden)

    Jingjing Zhai

    2016-12-01

    Full Text Available The identification of genes associated with a given biological function in plants remains a challenge, although network-based gene prioritization algorithms have been developed for Arabidopsis thaliana and many non-model plant species. Nevertheless, these network-based gene prioritization algorithms have encountered several problems; one in particular is that of unsatisfactory prediction accuracy due to limited network coverage, varying link quality, and/or uncertain network connectivity. Thus a model that integrates complementary biological data may be expected to increase the prediction accuracy of gene prioritization. Towards this goal, we developed a novel gene prioritization method named RafSee, to rank candidate genes using a random forest algorithm that integrates sequence, evolutionary, and epigenetic features of plants. Subsequently, we proposed an integrative approach named RAP (Rank Aggregation-based data fusion for gene Prioritization, in which an order statistics-based meta-analysis was used to aggregate the rank of the network-based gene prioritization method and RafSee, for accurately prioritizing candidate genes involved in a pre-specific biological function. Finally, we showcased the utility of RAP by prioritizing 380 flowering-time genes in Arabidopsis. The ‘leave-one-out’ cross-validation experiment showed that RafSee could work as a complement to a current state-of-art network-based gene prioritization system (AraNet v2. Moreover, RAP ranked 53.68% (204/380 flowering-time genes higher than AraNet v2, resulting in an 39.46% improvement in term of the first quartile rank. Further evaluations also showed that RAP was effective in prioritizing genes-related to different abiotic stresses. To enhance the usability of RAP for Arabidopsis and non-model plant species, an R package implementing the method is freely available at http://bioinfo.nwafu.edu.cn/software.

  13. EXONSAMPLER: a computer program for genome-wide and candidate gene exon sampling for targeted next-generation sequencing.

    Science.gov (United States)

    Cosart, Ted; Beja-Pereira, Albano; Luikart, Gordon

    2014-11-01

    The computer program EXONSAMPLER automates the sampling of thousands of exon sequences from publicly available reference genome sequences and gene annotation databases. It was designed to provide exon sequences for the efficient, next-generation gene sequencing method called exon capture. The exon sequences can be sampled by a list of gene name abbreviations (e.g. IFNG, TLR1), or by sampling exons from genes spaced evenly across chromosomes. It provides a list of genomic coordinates (a bed file), as well as a set of sequences in fasta format. User-adjustable parameters for collecting exon sequences include a minimum and maximum acceptable exon length, maximum number of exonic base pairs (bp) to sample per gene, and maximum total bp for the entire collection. It allows for partial sampling of very large exons. It can preferentially sample upstream (5 prime) exons, downstream (3 prime) exons, both external exons, or all internal exons. It is written in the Python programming language using its free libraries. We describe the use of EXONSAMPLER to collect exon sequences from the domestic cow (Bos taurus) genome for the design of an exon-capture microarray to sequence exons from related species, including the zebu cow and wild bison. We collected ~10% of the exome (~3 million bp), including 155 candidate genes, and ~16,000 exons evenly spaced genomewide. We prioritized the collection of 5 prime exons to facilitate discovery and genotyping of SNPs near upstream gene regulatory DNA sequences, which control gene expression and are often under natural selection. © 2014 John Wiley & Sons Ltd.

  14. Targeted sequencing of established and candidate colorectal cancer genes in the Colon Cancer Family Registry Cohort.

    Science.gov (United States)

    Raskin, Leon; Guo, Yan; Du, Liping; Clendenning, Mark; Rosty, Christophe; Lindor, Noralane M; Gruber, Stephen B; Buchanan, Daniel D

    2017-11-07

    The underlying genetic cause of colorectal cancer (CRC) can be identified for 5-10% of all cases, while at least 20% of CRC cases are thought to be due to inherited genetic factors. Screening for highly penetrant mutations in genes associated with Mendelian cancer syndromes using next-generation sequencing (NGS) can be prohibitively expensive for studies requiring large samples sizes. The aim of the study was to identify rare single nucleotide variants and small indels in 40 established or candidate CRC susceptibility genes in 1,046 familial CRC cases (including both MSS and MSI-H tumor subtypes) and 1,006 unrelated controls from the Colon Cancer Family Registry Cohort using a robust and cost-effective DNA pooling NGS strategy. We identified 264 variants in 38 genes that were observed only in cases, comprising either very rare (minor allele frequency cancer susceptibility genes BAP1, CDH1, CHEK2, ENG, and MSH3 . For the candidate CRC genes, we identified likely pathogenic variants in the helicase domain of POLQ and in the LRIG1 , SH2B3 , and NOS1 genes and present their clinicopathological characteristics. Using a DNA pooling NGS strategy, we identified novel germline mutations in established CRC susceptibility genes in familial CRC cases. Further studies are required to support the role of POLQ , LRIG1 , SH2B3 and NOS1 as CRC susceptibility genes.

  15. APPRIS 2017: principal isoforms for multiple gene sets

    Science.gov (United States)

    Rodriguez-Rivas, Juan; Di Domenico, Tomás; Vázquez, Jesús; Valencia, Alfonso

    2018-01-01

    Abstract The APPRIS database (http://appris-tools.org) uses protein structural and functional features and information from cross-species conservation to annotate splice isoforms in protein-coding genes. APPRIS selects a single protein isoform, the ‘principal’ isoform, as the reference for each gene based on these annotations. A single main splice isoform reflects the biological reality for most protein coding genes and APPRIS principal isoforms are the best predictors of these main proteins isoforms. Here, we present the updates to the database, new developments that include the addition of three new species (chimpanzee, Drosophila melangaster and Caenorhabditis elegans), the expansion of APPRIS to cover the RefSeq gene set and the UniProtKB proteome for six species and refinements in the core methods that make up the annotation pipeline. In addition APPRIS now provides a measure of reliability for individual principal isoforms and updates with each release of the GENCODE/Ensembl and RefSeq reference sets. The individual GENCODE/Ensembl, RefSeq and UniProtKB reference gene sets for six organisms have been merged to produce common sets of splice variants. PMID:29069475

  16. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    Directory of Open Access Journals (Sweden)

    Hettne Kristina M

    2013-01-01

    Full Text Available Abstract Background Availability of chemical response-specific lists of genes (gene sets for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM, and that these can be used with gene set analysis (GSA methods for chemical treatment identification, for pharmacological mechanism elucidation, and for comparing compound toxicity profiles. Methods We created 30,211 chemical response-specific gene sets for human and mouse by next-gen TM, and derived 1,189 (human and 588 (mouse gene sets from the Comparative Toxicogenomics Database (CTD. We tested for significant differential expression (SDE (false discovery rate -corrected p-values Results Next-gen TM-derived gene sets matching the chemical treatment were significantly altered in three GE data sets, and the corresponding CTD-derived gene sets were significantly altered in five GE data sets. Six next-gen TM-derived and four CTD-derived fibrate gene sets were significantly altered in the PPARA knock-out GE dataset. None of the fibrate signatures in cMap scored significant against the PPARA GE signature. 33 environmental toxicant gene sets were significantly altered in the triazole GE data sets. 21 of these toxicants had a similar toxicity pattern as the triazoles. We confirmed embryotoxic effects, and discriminated triazoles from other chemicals. Conclusions Gene set analysis with next-gen TM-derived chemical response-specific gene sets is a scalable method for identifying similarities in gene responses to other chemicals, from which one may infer potential mode of action and/or toxic effect.

  17. Selection and validation of potato candidate genes for maturity corrected resistance to Phytophthora infestans based on differential expression combined with SNP association and linkage mapping

    Directory of Open Access Journals (Sweden)

    Meki Shehabu Muktar

    2015-09-01

    Full Text Available Late blight of potato (Solanum tuberosum L. caused by the oomycete Phytophthora infestans (Mont. de Bary, is one of the most important bottlenecks of potato production worldwide. Cultivars with high levels of durable, race unspecific, quantitative resistance are part of a solution to this problem. However, breeding for quantitative resistance is hampered by the correlation between resistance and late plant maturity, which is an undesirable agricultural attribute. The objectives of our research are (i the identification of genes that condition quantitative resistance to P. infestans not compromised by late plant maturity and (ii the discovery of diagnostic single nucleotide polymorphism (SNP markers to be used as molecular tools to increase efficiency and precision of resistance breeding. Twenty two novel candidate genes were selected based on comparative transcript profiling by SuperSAGE (serial analysis of gene expression in groups of plants with contrasting levels of maturity corrected resistance (MCR. Reproducibility of differential expression was tested by quantitative real time PCR and allele specific pyrosequencing in four new sets of genotype pools with contrasting late blight resistance levels, at three infection time points and in three independent infection experiments. Reproducibility of expression patterns ranged from 28% to 97%. Association mapping in a panel of 184 tetraploid cultivars identified SNPs in five candidate genes that were associated with MCR. These SNPs can be used in marker-assisted resistance breeding. Linkage mapping in two half-sib families (n = 111 identified SNPs in three candidate genes that were linked with MCR. The differentially expressed genes that showed association and/or linkage with MCR putatively function in phytosterol synthesis, fatty acid synthesis, asparagine synthesis, chlorophyll synthesis, cell wall modification and in the response to pathogen elicitors.

  18. Selection and validation of a set of reliable reference genes for quantitative sod gene expression analysis in C. elegans

    Directory of Open Access Journals (Sweden)

    Vandesompele Jo

    2008-01-01

    Full Text Available Abstract Background In the nematode Caenorhabditis elegans the conserved Ins/IGF-1 signaling pathway regulates many biological processes including life span, stress response, dauer diapause and metabolism. Detection of differentially expressed genes may contribute to a better understanding of the mechanism by which the Ins/IGF-1 signaling pathway regulates these processes. Appropriate normalization is an essential prerequisite for obtaining accurate and reproducible quantification of gene expression levels. The aim of this study was to establish a reliable set of reference genes for gene expression analysis in C. elegans. Results Real-time quantitative PCR was used to evaluate the expression stability of 12 candidate reference genes (act-1, ama-1, cdc-42, csq-1, eif-3.C, mdh-1, gpd-2, pmp-3, tba-1, Y45F10D.4, rgs-6 and unc-16 in wild-type, three Ins/IGF-1 pathway mutants, dauers and L3 stage larvae. After geNorm analysis, cdc-42, pmp-3 and Y45F10D.4 showed the most stable expression pattern and were used to normalize 5 sod expression levels. Significant differences in mRNA levels were observed for sod-1 and sod-3 in daf-2 relative to wild-type animals, whereas in dauers sod-1, sod-3, sod-4 and sod-5 are differentially expressed relative to third stage larvae. Conclusion Our findings emphasize the importance of accurate normalization using stably expressed reference genes. The methodology used in this study is generally applicable to reliably quantify gene expression levels in the nematode C. elegans using quantitative PCR.

  19. Candidates in Astroviruses, Seadornaviruses, Cytorhabdoviruses and Coronaviruses for +1 frame overlapping genes accessed by leaky scanning

    Directory of Open Access Journals (Sweden)

    Atkins John F

    2010-01-01

    Full Text Available Abstract Background Overlapping genes are common in RNA viruses where they serve as a mechanism to optimize the coding potential of compact genomes. However, annotation of overlapping genes can be difficult using conventional gene-finding software. Recently we have been using a number of complementary approaches to systematically identify previously undetected overlapping genes in RNA virus genomes. In this article we gather together a number of promising candidate new overlapping genes that may be of interest to the community. Results Overlapping gene predictions are presented for the astroviruses, seadornaviruses, cytorhabdoviruses and coronaviruses (families Astroviridae, Reoviridae, Rhabdoviridae and Coronaviridae, respectively.

  20. A Multiple Interaction Analysis Reveals ADRB3 as a Potential Candidate for Gallbladder Cancer Predisposition via a Complex Interaction with Other Candidate Gene Variations

    Directory of Open Access Journals (Sweden)

    Rajani Rai

    2015-11-01

    Full Text Available Gallbladder cancer is the most common and a highly aggressive biliary tract malignancy with a dismal outcome. The pathogenesis of the disease is multifactorial, comprising the combined effect of multiple genetic variations of mild consequence along with numerous dietary and environmental risk factors. Previously, we demonstrated the association of several candidate gene variations with GBC risk. In this study, we aimed to identify the combination of gene variants and their possible interactions contributing towards genetic susceptibility of GBC. Here, we performed Multifactor-Dimensionality Reduction (MDR and Classification and Regression Tree Analysis (CRT to investigate the gene–gene interactions and the combined effect of 14 SNPs in nine genes (DR4 (rs20576, rs6557634; FAS (rs2234767; FASL (rs763110; DCC (rs2229080, rs4078288, rs7504990, rs714; PSCA (rs2294008, rs2978974; ADRA2A (rs1801253; ADRB1 (rs1800544; ADRB3 (rs4994; CYP17 (rs2486758 involved in various signaling pathways. Genotyping was accomplished by PCR-RFLP or Taqman allelic discrimination assays. SPSS software version 16.0 and MDR software version 2.0 were used for all the statistical analysis. Single locus investigation demonstrated significant association of DR4 (rs20576, rs6557634, DCC (rs714, rs2229080, rs4078288 and ADRB3 (rs4994 polymorphisms with GBC risk. MDR analysis revealed ADRB3 (rs4994 to be crucial candidate in GBC susceptibility that may act either alone (p < 0.0001, CVC = 10/10 or in combination with DCC (rs714 and rs2229080, p < 0.0001, CVC = 9/10. Our CRT results are in agreement with the above findings. Further, in-silico results of studied SNPs advocated their role in splicing, transcriptional and/or protein coding regulation. Overall, our result suggested complex interactions amongst the studied SNPs and ADRB3 rs4994 as candidate influencing GBC susceptibility.

  1. Identification of single nucleotide polymorphisms (SNPs) at candidate genes involved in abiotic stress in two Prosopis species of hybrids

    OpenAIRE

    Maria F. Pomponio; Susana Marcucci Poltri; Diego Lopez Lauenstein; Susana Torales

    2014-01-01

    Aim of the study: Identify and compare SNPs on candidate genes related to abiotic stress in Prosopis chilensis, Prosopis flexuosa and interspecific hybridsArea of the study: Chaco árido, Argentina. Material and Methods: Fragments from 6 candidate genes were sequenced in 60 genotypes. DNA polymorphisms were analyzed.Main Results: The analysis revealed that the hybrids had the highest rate of polymorphism, followed by P. flexuosa and P. chilensis, the values found are comparable to other forest...

  2. Diversifying Selection in the Wheat Stem Rust Fungus Acts Predominantly on Pathogen-Associated Gene Families and Reveals Candidate Effectors

    Directory of Open Access Journals (Sweden)

    Jana eSperschneider

    2014-09-01

    Full Text Available Plant pathogens cause severe losses to crop plants and threaten global food production. One striking example is the wheat stem rust fungus, Puccinia graminis f. sp. tritici, which can rapidly evolve new virulent pathotypes in response to resistant host lines. Like several other filamentous fungal and oomycete plant pathogens, its genome features expanded gene families that have been implicated in host-pathogen interactions, possibly encoding effector proteins that interact directly with target host defence proteins. Previous efforts to understand virulence largely relied on the prediction of secreted, small and cysteine-rich proteins as candidate effectors and thus delivered an overwhelming number of candidates. Here, we implement an alternative analysis strategy that uses the signal of adaptive evolution as a line of evidence for effector function, combined with comparative information and expression data. We demonstrate that in planta up-regulated genes that are rapidly evolving are found almost exclusively in pathogen-associated gene families, affirming the impact of host-pathogen co-evolution on genome structure and the adaptive diversification of specialised gene families. In particular, we predict 42 effector candidates that are conserved only across pathogens, induced during infection and rapidly evolving. One of our top candidates has recently been shown to induce genotype-specific hypersensitive cell death in wheat. This shows that comparative genomics incorporating the evolutionary signal of adaptation is powerful for predicting effector candidates for laboratory verification. Our system can be applied to a wide range of pathogens and will give insight into host-pathogen dynamics, ultimately leading to progress in strategies for disease control.

  3. Candidate gene approach for parasite resistance in sheep--variation in immune pathway genes and association with fecal egg count.

    Directory of Open Access Journals (Sweden)

    Kathiravan Periasamy

    Full Text Available Sheep chromosome 3 (Oar3 has the largest number of QTLs reported to be significantly associated with resistance to gastro-intestinal nematodes. This study aimed to identify single nucleotide polymorphisms (SNPs within candidate genes located in sheep chromosome 3 as well as genes involved in major immune pathways. A total of 41 SNPs were identified across 38 candidate genes in a panel of unrelated sheep and genotyped in 713 animals belonging to 22 breeds across Asia, Europe and South America. The variations and evolution of immune pathway genes were assessed in sheep populations across these macro-environmental regions that significantly differ in the diversity and load of pathogens. The mean minor allele frequency (MAF did not vary between Asian and European sheep reflecting the absence of ascertainment bias. Phylogenetic analysis revealed two major clusters with most of South Asian, South East Asian and South West Asian breeds clustering together while European and South American sheep breeds clustered together distinctly. Analysis of molecular variance revealed strong phylogeographic structure at loci located in immune pathway genes, unlike microsatellite and genome wide SNP markers. To understand the influence of natural selection processes, SNP loci located in chromosome 3 were utilized to reconstruct haplotypes, the diversity of which showed significant deviations from selective neutrality. Reduced Median network of reconstructed haplotypes showed balancing selection in force at these loci. Preliminary association of SNP genotypes with phenotypes recorded 42 days post challenge revealed significant differences (P<0.05 in fecal egg count, body weight change and packed cell volume at two, four and six SNP loci respectively. In conclusion, the present study reports strong phylogeographic structure and balancing selection operating at SNP loci located within immune pathway genes. Further, SNP loci identified in the study were found to have

  4. Characterization of the Gray Whale Eschrichtius robustus Genome and a Genotyping Array Based on Single-Nucleotide Polymorphisms in Candidate Genes.

    Science.gov (United States)

    DeWoody, J Andrew; Fernandez, Nadia B; Brüniche-Olsen, Anna; Antonides, Jennifer D; Doyle, Jacqueline M; San Miguel, Phillip; Westerman, Rick; Vertyankin, Vladimir V; Godard-Codding, Céline A J; Bickham, John W

    2017-06-01

    Genetic and genomic approaches have much to offer in terms of ecology, evolution, and conservation. To better understand the biology of the gray whale Eschrichtius robustus (Lilljeborg, 1861), we sequenced the genome and produced an assembly that contains ∼95% of the genes known to be highly conserved among eukaryotes. From this assembly, we annotated 22,711 genes and identified 2,057,254 single-nucleotide polymorphisms (SNPs). Using this assembly, we generated a curated list of candidate genes potentially subject to strong natural selection, including genes associated with osmoregulation, oxygen binding and delivery, and other aspects of marine life. From these candidate genes, we queried 92 autosomal protein-coding markers with a panel of 96 SNPs that also included 2 sexing and 2 mitochondrial markers. Genotyping error rates, calculated across loci and across 69 intentional replicate samples, were low (0.021%), and observed heterozygosity was 0.33 averaged over all autosomal markers. This level of variability provides substantial discriminatory power across loci (mean probability of identity of 1.6 × 10 -25 and mean probability of exclusion >0.999 with neither parent known), indicating that these markers provide a powerful means to assess parentage and relatedness in gray whales. We found 29 unique multilocus genotypes represented among our 36 biopsies (indicating that we inadvertently sampled 7 whales twice). In total, we compiled an individual data set of 28 western gray whales (WGSs) and 1 presumptive eastern gray whale (EGW). The lone EGW we sampled was no more or less related to the WGWs than expected by chance alone. The gray whale genomes reported here will enable comparative studies of natural selection in cetaceans, and the SNP markers should be highly informative for future studies of gray whale evolution, population structure, demography, and relatedness.

  5. Exploring candidate genes for pericarp russet pigmentation of sand pear (Pyrus pyrifolia via RNA-Seq data in two genotypes contrasting for pericarp color.

    Directory of Open Access Journals (Sweden)

    Yue-zhi Wang

    Full Text Available Sand pear (Pyrus pyrifolia russet pericarp is an important trait affecting both the quality and stress tolerance of fruits. This trait is controlled by a relative complex genetic process, with some fundamental biological questions such as how many and which genes are involved in the process remaining elusive. In this study, we explored differentially expressed genes between the russet- and green-pericarp offspring from the sand pear (Pyrus pyrifolia cv. 'Qingxiang' × 'Cuiguan' F1 group by RNA-seq-based bulked segregant analysis (BSA. A total of 29,100 unigenes were identified and 206 of which showed significant differences in expression level (log2fold values>1 between the two types of pericarp pools. Gene Ontology (GO analyses detected 123 unigenes in GO terms related to 'cellular_component' and 'biological_process', suggesting developmental and growth differentiations between the two types. GO categories associated with various aspects of 'lipid metabolic processes', 'transport', 'response to stress', 'oxidation-reduction process' and more were enriched with genes with divergent expressions between the two libraries. Detailed examination of a selected set of these categories revealed repressed expressions of candidate genes for suberin, cutin and wax biosynthesis in the russet pericarps.Genes encoding putative cinnamoyl-CoA reductase (CCR, cinnamyl alcohol dehydrogenase (CAD and peroxidase (POD that are involved in the lignin biosynthesis were suggested to be candidates for pigmentation of sand pear russet pericarps. Nine differentially expressed genes were analyzed for their expressions using qRT-PCR and the results were consistent with those obtained from Illumina RNA-sequencing. This study provides a comprehensive molecular biology insight into the sand pear pericarp pigmentation and appearance quality formation.

  6. Gene expression differences between Noccaea caerulescens ecotypes help to identify candidate genes for metal phytoremediation.

    Science.gov (United States)

    Halimaa, Pauliina; Lin, Ya-Fen; Ahonen, Viivi H; Blande, Daniel; Clemens, Stephan; Gyenesei, Attila; Häikiö, Elina; Kärenlampi, Sirpa O; Laiho, Asta; Aarts, Mark G M; Pursiheimo, Juha-Pekka; Schat, Henk; Schmidt, Holger; Tuomainen, Marjo H; Tervahauta, Arja I

    2014-03-18

    Populations of Noccaea caerulescens show tremendous differences in their capacity to hyperaccumulate and hypertolerate metals. To explore the differences that could contribute to these traits, we undertook SOLiD high-throughput sequencing of the root transcriptomes of three phenotypically well-characterized N. caerulescens accessions, i.e., Ganges, La Calamine, and Monte Prinzera. Genes with possible contribution to zinc, cadmium, and nickel hyperaccumulation and hypertolerance were predicted. The most significant differences between the accessions were related to metal ion (di-, trivalent inorganic cation) transmembrane transporter activity, iron and calcium ion binding, (inorganic) anion transmembrane transporter activity, and antioxidant activity. Analysis of correlation between the expression profile of each gene and the metal-related characteristics of the accessions disclosed both previously characterized (HMA4, HMA3) and new candidate genes (e.g., for nickel IRT1, ZIP10, and PDF2.3) as possible contributors to the hyperaccumulation/tolerance phenotype. A number of unknown Noccaea-specific transcripts also showed correlation with Zn(2+), Cd(2+), or Ni(2+) hyperaccumulation/tolerance. This study shows that N. caerulescens populations have evolved great diversity in the expression of metal-related genes, facilitating adaptation to various metalliferous soils. The information will be helpful in the development of improved plants for metal phytoremediation.

  7. Meta-analysis and candidate gene mining of low-phosphorus tolerance in maize.

    Science.gov (United States)

    Zhang, Hongwei; Uddin, Mohammed Shalim; Zou, Cheng; Xie, Chuanxiao; Xu, Yunbi; Li, Wen-Xue

    2014-03-01

    Plants with tolerance to low-phosphorus (P) can grow better under low-P conditions, and understanding of genetic mechanisms of low-P tolerance can not only facilitate identifying relevant genes but also help to develop low-P tolerant cultivars. QTL meta-analysis was conducted after a comprehensive review of the reports on QTL mapping for low-P tolerance-related traits in maize. Meta-analysis produced 23 consensus QTL (cQTL), 17 of which located in similar chromosome regions to those previously reported to influence root traits. Meanwhile, candidate gene mining yielded 215 genes, 22 of which located in the cQTL regions. These 22 genes are homologous to 14 functionally characterized genes that were found to participate in plant low-P tolerance, including genes encoding miR399s, Pi transporters and purple acid phosphatases. Four cQTL loci (cQTL2-1, cQTL5-3, cQTL6-2, and cQTL10-2) may play important roles for low-P tolerance because each contains more original QTL and has better consistency across previous reports. © 2014 Institute of Botany, Chinese Academy of Sciences.

  8. Single-Cell RNA-Seq of Mouse Dopaminergic Neurons Informs Candidate Gene Selection for Sporadic Parkinson Disease.

    Science.gov (United States)

    Hook, Paul W; McClymont, Sarah A; Cannon, Gabrielle H; Law, William D; Morton, A Jennifer; Goff, Loyal A; McCallion, Andrew S

    2018-03-01

    Genetic variation modulating risk of sporadic Parkinson disease (PD) has been primarily explored through genome-wide association studies (GWASs). However, like many other common genetic diseases, the impacted genes remain largely unknown. Here, we used single-cell RNA-seq to characterize dopaminergic (DA) neuron populations in the mouse brain at embryonic and early postnatal time points. These data facilitated unbiased identification of DA neuron subpopulations through their unique transcriptional profiles, including a postnatal neuroblast population and substantia nigra (SN) DA neurons. We use these population-specific data to develop a scoring system to prioritize candidate genes in all 49 GWAS intervals implicated in PD risk, including genes with known PD associations and many with extensive supporting literature. As proof of principle, we confirm that the nigrostriatal pathway is compromised in Cplx1-null mice. Ultimately, this systematic approach establishes biologically pertinent candidates and testable hypotheses for sporadic PD, informing a new era of PD genetic research. Copyright © 2018 American Society of Human Genetics. All rights reserved.

  9. Analysis of 60 reported glioma risk SNPs replicates published GWAS findings but fails to replicate associations from published candidate-gene studies.

    Science.gov (United States)

    Walsh, Kyle M; Anderson, Erik; Hansen, Helen M; Decker, Paul A; Kosel, Matt L; Kollmeyer, Thomas; Rice, Terri; Zheng, Shichun; Xiao, Yuanyuan; Chang, Jeffrey S; McCoy, Lucie S; Bracci, Paige M; Wiemels, Joe L; Pico, Alexander R; Smirnov, Ivan; Lachance, Daniel H; Sicotte, Hugues; Eckel-Passow, Jeanette E; Wiencke, John K; Jenkins, Robert B; Wrensch, Margaret R

    2013-02-01

    Genomewide association studies (GWAS) and candidate-gene studies have implicated single-nucleotide polymorphisms (SNPs) in at least 45 different genes as putative glioma risk factors. Attempts to validate these associations have yielded variable results and few genetic risk factors have been consistently replicated. We conducted a case-control study of Caucasian glioma cases and controls from the University of California San Francisco (810 cases, 512 controls) and the Mayo Clinic (852 cases, 789 controls) in an attempt to replicate previously reported genetic risk factors for glioma. Sixty SNPs selected from the literature (eight from GWAS and 52 from candidate-gene studies) were successfully genotyped on an Illumina custom genotyping panel. Eight SNPs in/near seven different genes (TERT, EGFR, CCDC26, CDKN2A, PHLDB1, RTEL1, TP53) were significantly associated with glioma risk in the combined dataset (P 0.05). Although several confirmed associations are located near genes long known to be involved in gliomagenesis (e.g., EGFR, CDKN2A, TP53), these associations were first discovered by the GWAS approach and are in noncoding regions. These results highlight that the deficiencies of the candidate-gene approach lay in selecting both appropriate genes and relevant SNPs within these genes. © 2012 WILEY PERIODICALS, INC.

  10. Patterns of linkage disequilibrium and haplotype distribution in disease candidate genes.

    Science.gov (United States)

    Long, Ji-Rong; Zhao, Lan-Juan; Liu, Peng-Yuan; Lu, Yan; Dvornyk, Volodymyr; Shen, Hui; Liu, Yong-Jun; Zhang, Yuan-Yuan; Xiong, Dong-Hai; Xiao, Peng; Deng, Hong-Wen

    2004-05-24

    The adequacy of association studies for complex diseases depends critically on the existence of linkage disequilibrium (LD) between functional alleles and surrounding SNP markers. We examined the patterns of LD and haplotype distribution in eight candidate genes for osteoporosis and/or obesity using 31 SNPs in 1,873 subjects. These eight genes are apolipoprotein E (APOE), type I collagen alpha1 (COL1A1), estrogen receptor-alpha (ER-alpha), leptin receptor (LEPR), parathyroid hormone (PTH)/PTH-related peptide receptor type 1 (PTHR1), transforming growth factor-beta1 (TGF-beta1), uncoupling protein 3 (UCP3), and vitamin D (1,25-dihydroxyvitamin D3) receptor (VDR). Yin yang haplotypes, two high-frequency haplotypes composed of completely mismatching SNP alleles, were examined. To quantify LD patterns, two common measures of LD, D' and r2, were calculated for the SNPs within the genes. The haplotype distribution varied in the different genes. Yin yang haplotypes were observed only in PTHR1 and UCP3. D' ranged from 0.020 to 1.000 with the average of 0.475, whereas the average r2 was 0.158 (ranging from 0.000 to 0.883). A decay of LD was observed as the intermarker distance increased, however, there was a great difference in LD characteristics of different genes or even in different regions within gene. The differences in haplotype distributions and LD patterns among the genes underscore the importance of characterizing genomic regions of interest prior to association studies.

  11. Analysis of positional candidate genes in the AAA1 susceptibility locus for abdominal aortic aneurysms on chromosome 19

    Directory of Open Access Journals (Sweden)

    Ferrell Robert E

    2011-01-01

    Full Text Available Abstract Background Abdominal aortic aneurysm (AAA is a complex disorder with multiple genetic risk factors. Using affected relative pair linkage analysis, we previously identified an AAA susceptibility locus on chromosome 19q13. This locus has been designated as the AAA1 susceptibility locus in the Online Mendelian Inheritance in Man (OMIM database. Methods Nine candidate genes were selected from the AAA1 locus based on their function, as well as mRNA expression levels in the aorta. A sample of 394 cases and 419 controls was genotyped for 41 SNPs located in or around the selected nine candidate genes using the Illumina GoldenGate platform. Single marker and haplotype analyses were performed. Three genes (CEBPG, PEPD and CD22 were selected for DNA sequencing based on the association study results, and exonic regions were analyzed. Immunohistochemical staining of aortic tissue sections from AAA and control individuals was carried out for the CD22 and PEPD proteins with specific antibodies. Results Several SNPs were nominally associated with AAA (p CEBPG, peptidase D (PEPD, and CD22. Haplotype analysis found a nominally associated 5-SNP haplotype in the CEBPG/PEPD locus, as well as a nominally associated 2-SNP haplotype in the CD22 locus. DNA sequencing of the coding regions revealed no variation in CEBPG. Seven sequence variants were identified in PEPD, including three not present in the NCBI SNP (dbSNP database. Sequencing of all 14 exons of CD22 identified 20 sequence variants, five of which were in the coding region and six were in the 3'-untranslated region. Five variants were not present in dbSNP. Immunohistochemical staining for CD22 revealed protein expression in lymphocytes present in the aneurysmal aortic wall only and no detectable expression in control aorta. PEPD protein was expressed in fibroblasts and myofibroblasts in the media-adventitia border in both aneurysmal and non-aneurysmal tissue samples. Conclusions Association testing

  12. Comparative study on gene set and pathway topology-based enrichment methods.

    Science.gov (United States)

    Bayerlová, Michaela; Jung, Klaus; Kramer, Frank; Klemm, Florian; Bleckmann, Annalen; Beißbarth, Tim

    2015-10-22

    Enrichment analysis is a popular approach to identify pathways or sets of genes which are significantly enriched in the context of differentially expressed genes. The traditional gene set enrichment approach considers a pathway as a simple gene list disregarding any knowledge of gene or protein interactions. In contrast, the new group of so called pathway topology-based methods integrates the topological structure of a pathway into the analysis. We comparatively investigated gene set and pathway topology-based enrichment approaches, considering three gene set and four topological methods. These methods were compared in two extensive simulation studies and on a benchmark of 36 real datasets, providing the same pathway input data for all methods. In the benchmark data analysis both types of methods showed a comparable ability to detect enriched pathways. The first simulation study was conducted with KEGG pathways, which showed considerable gene overlaps between each other. In this study with original KEGG pathways, none of the topology-based methods outperformed the gene set approach. Therefore, a second simulation study was performed on non-overlapping pathways created by unique gene IDs. Here, methods accounting for pathway topology reached higher accuracy than the gene set methods, however their sensitivity was lower. We conducted one of the first comprehensive comparative works on evaluating gene set against pathway topology-based enrichment methods. The topological methods showed better performance in the simulation scenarios with non-overlapping pathways, however, they were not conclusively better in the other scenarios. This suggests that simple gene set approach might be sufficient to detect an enriched pathway under realistic circumstances. Nevertheless, more extensive studies and further benchmark data are needed to systematically evaluate these methods and to assess what gain and cost pathway topology information introduces into enrichment analysis. Both

  13. Fine mapping and candidate gene analysis of the virescent gene v 1 in Upland cotton (Gossypium hirsutum).

    Science.gov (United States)

    Mao, Guangzhi; Ma, Qiang; Wei, Hengling; Su, Junji; Wang, Hantao; Ma, Qifeng; Fan, Shuli; Song, Meizhen; Zhang, Xianlong; Yu, Shuxun

    2018-02-01

    The young leaves of virescent mutants are yellowish and gradually turn green as the plants reach maturity. Understanding the genetic basis of virescent mutants can aid research of the regulatory mechanisms underlying chloroplast development and chlorophyll biosynthesis, as well as contribute to the application of virescent traits in crop breeding. In this study, fine mapping was employed, and a recessive gene (v 1 ) from a virescent mutant of Upland cotton was narrowed to an 84.1-Kb region containing ten candidate genes. The GhChlI gene encodes the cotton Mg-chelatase I subunit (CHLI) and was identified as the candidate gene for the virescent mutation using gene annotation. BLAST analysis showed that the GhChlI gene has two copies, Gh_A10G0282 and Gh_D10G0283. Sequence analysis indicated that the coding region (CDS) of GhChlI is 1269 bp in length, with three predicted exons and one non-synonymous nucleotide mutation (G1082A) in the third exon of Gh_D10G0283, with an amino acid (AA) substitution of arginine (R) to lysine (K). GhChlI-silenced TM-1 plants exhibited a lower GhChlI expression level, a lower chlorophyll content, and the virescent phenotype. Analysis of upstream regulatory elements and expression levels of GhChlI showed that the expression quantity of GhChlI may be normal, and with the development of the true leaf, the increase in the Gh_A10G0282 dosage may partially make up for the deficiency of Gh_D10G0283 in the v 1 mutant. Phylogenetic analysis and sequence alignment revealed that the protein sequence encoded by the third exon of GhChlI is highly conserved across diverse plant species, in which AA substitutions among the completely conserved residues frequently result in changes in leaf color in various species. These results suggest that the mutation (G1082A) within the GhChlI gene may cause a functional defect of the GhCHLI subunit and thus the virescent phenotype in the v 1 mutant. The GhChlI mutation not only provides a tool for understanding the

  14. LEGO: a novel method for gene set over-representation analysis by incorporating network-based gene weights.

    Science.gov (United States)

    Dong, Xinran; Hao, Yun; Wang, Xiao; Tian, Weidong

    2016-01-11

    Pathway or gene set over-representation analysis (ORA) has become a routine task in functional genomics studies. However, currently widely used ORA tools employ statistical methods such as Fisher's exact test that reduce a pathway into a list of genes, ignoring the constitutive functional non-equivalent roles of genes and the complex gene-gene interactions. Here, we develop a novel method named LEGO (functional Link Enrichment of Gene Ontology or gene sets) that takes into consideration these two types of information by incorporating network-based gene weights in ORA analysis. In three benchmarks, LEGO achieves better performance than Fisher and three other network-based methods. To further evaluate LEGO's usefulness, we compare LEGO with five gene expression-based and three pathway topology-based methods using a benchmark of 34 disease gene expression datasets compiled by a recent publication, and show that LEGO is among the top-ranked methods in terms of both sensitivity and prioritization for detecting target KEGG pathways. In addition, we develop a cluster-and-filter approach to reduce the redundancy among the enriched gene sets, making the results more interpretable to biologists. Finally, we apply LEGO to two lists of autism genes, and identify relevant gene sets to autism that could not be found by Fisher.

  15. Using RNA-Seq Data to Evaluate Reference Genes Suitable for Gene Expression Studies in Soybean.

    Directory of Open Access Journals (Sweden)

    Aldrin Kay-Yuen Yim

    Full Text Available Differential gene expression profiles often provide important clues for gene functions. While reverse transcription quantitative real-time polymerase chain reaction (RT-qPCR is an important tool, the validity of the results depends heavily on the choice of proper reference genes. In this study, we employed new and published RNA-sequencing (RNA-Seq datasets (26 sequencing libraries in total to evaluate reference genes reported in previous soybean studies. In silico PCR showed that 13 out of 37 previously reported primer sets have multiple targets, and 4 of them have amplicons with different sizes. Using a probabilistic approach, we identified new and improved candidate reference genes. We further performed 2 validation tests (with 26 RNA samples on 8 commonly used reference genes and 7 newly identified candidates, using RT-qPCR. In general, the new candidate reference genes exhibited more stable expression levels under the tested experimental conditions. The three newly identified candidate reference genes Bic-C2, F-box protein2, and VPS-like gave the best overall performance, together with the commonly used ELF1b. It is expected that the proposed probabilistic model could serve as an important tool to identify stable reference genes when more soybean RNA-Seq data from different growth stages and treatments are used.

  16. Identification of Single Nucleotide Polymorphisms and analysis of Linkage Disequilibrium in sunflower elite inbred lines using the candidate gene approach

    Directory of Open Access Journals (Sweden)

    Heinz Ruth A

    2008-01-01

    Full Text Available Abstract Background Association analysis is a powerful tool to identify gene loci that may contribute to phenotypic variation. This includes the estimation of nucleotide diversity, the assessment of linkage disequilibrium structure (LD and the evaluation of selection processes. Trait mapping by allele association requires a high-density map, which could be obtained by the addition of Single Nucleotide Polymorphisms (SNPs and short insertion and/or deletions (indels to SSR and AFLP genetic maps. Nucleotide diversity analysis of randomly selected candidate regions is a promising approach for the success of association analysis and fine mapping in the sunflower genome. Moreover, knowledge of the distance over which LD persists, in agronomically meaningful sunflower accessions, is important to establish the density of markers and the experimental design for association analysis. Results A set of 28 candidate genes related to biotic and abiotic stresses were studied in 19 sunflower inbred lines. A total of 14,348 bp of sequence alignment was analyzed per individual. In average, 1 SNP was found per 69 nucleotides and 38 indels were identified in the complete data set. The mean nucleotide polymorphism was moderate (θ = 0.0056, as expected for inbred materials. The number of haplotypes per region ranged from 1 to 9 (mean = 3.54 ± 1.88. Model-based population structure analysis allowed detection of admixed individuals within the set of accessions examined. Two putative gene pools were identified (G1 and G2, with a large proportion of the inbred lines being assigned to one of them (G1. Consistent with the absence of population sub-structuring, LD for G1 decayed more rapidly (r2 = 0.48 at 643 bp; trend line, pooled data than the LD trend line for the entire set of 19 individuals (r2 = 0.64 for the same distance. Conclusion Knowledge about the patterns of diversity and the genetic relationships between breeding materials could be an invaluable aid in crop

  17. Evaluation and validation of candidate endogenous control genes for real-time quantitative PCR studies of breast cancer

    Directory of Open Access Journals (Sweden)

    Miller Nicola

    2007-11-01

    Full Text Available Abstract Background Real-time quantitative PCR (RQ-PCR forms the basis of many breast cancer biomarker studies and novel prognostic assays, paving the way towards personalised cancer treatments. Normalisation of relative RQ-PCR data is required to control for non-biological variation introduced during sample preparation. Endogenous control (EC genes, used in this context, should ideally be expressed constitutively and uniformly across treatments in all test samples. Despite widespread recognition that the accuracy of the normalised data is largely dependent on the reliability of the EC, there are no reports of the systematic validation of genes commonly used for this purpose in the analysis of gene expression by RQ-PCR in primary breast cancer tissues. The aim of this study was to identify the most suitable endogenous control genes for RQ-PCR analysis of primary breast tissue from a panel of eleven candidates in current use. Oestrogen receptor alpha (ESR1 was used a target gene to compare the effect of choice of EC on the estimate of gene quantity. Results The expression and validity of candidate ECs (GAPDH, TFRC, ABL, PPIA, HPRT1, RPLP0, B2M, GUSB, MRPL19, PUM1 and PSMC4 was determined in 6 benign and 21 malignant primary breast cancer tissues. Gene expression data was analysed using two different statistical models. MRPL19 and PPIA were identified as the most stable and reliable EC genes, while GUSB, RPLP0 and ABL were least stable. There was a highly significant difference in variance between ECs. ESR1 expression was appreciably higher in malignant compared to benign tissues and there was a significant effect of EC on the magnitude of the error associated with the relative quantity of ESR1. Conclusion We have validated two endogenous control genes, MRPL19 and PPIA, for RQ-PCR analysis of gene expression in primary breast tissue. Of the genes in current use in this field, the above combination offers increased accuracy and resolution in the

  18. Linkage analysis of candidate genes in autoimmune thyroid disease. II. Selected gender-related genes and the X-chromosome. International Consortium for the Genetics of Autoimmune Thyroid Disease.

    Science.gov (United States)

    Barbesino, G; Tomer, Y; Concepcion, E S; Davies, T F; Greenberg, D A

    1998-09-01

    Hashimoto's thyroiditis (HT) and Graves' disease (GD) are autoimmune thyroid diseases (AITD) in which multiple genetic factors are suspected to play an important role. Until now, only a few minor risk factors for these diseases have been identified. Susceptibility seems to be stronger in women, pointing toward a possible role for genes related to sex steroid action or mechanisms related to genes on the X-chromosome. We have studied a total of 45 multiplex families, each containing at least 2 members affected with either GD (55 patients) or HT (72 patients), and used linkage analysis to target as candidate susceptibility loci genes involved in estrogen activity, such as the estrogen receptor alpha and beta and the aromatase genes. We then screened the entire X-chromosome using a set of polymorphic microsatellite markers spanning the whole chromosome. We found a region of the X-chromosome (Xq21.33-22) giving positive logarithm of odds (LOD) scores and then reanalyzed this area with dense markers in a multipoint analysis. Our results excluded linkage to the estrogen receptor alpha and aromatase genes when either the patients with GD only, those with HT only, or those with any AITD were considered as affected. Linkage to the estrogen receptor beta could not be totally ruled out, partly due to incomplete mapping information for the gene itself at this time. The X-chromosome data revealed consistently positive LOD scores (maximum of 1.88 for marker DXS8020 and GD patients) when either definition of affectedness was considered. Analysis of the family data using a multipoint analysis with eight closely linked markers generated LOD scores suggestive of linkage to GD in a chromosomal area (Xq21.33-22) extending for about 6 cM and encompassing four markers. The maximum LOD score (2.5) occurred at DXS8020. In conclusion, we ruled out a major role for estrogen receptor alpha and the aromatase genes in the genetic predisposition to AITD. Estrogen receptor beta remains a

  19. Targeted sequencing of 351 candidate genes for epileptic encephalopathy in a large cohort of patients

    DEFF Research Database (Denmark)

    de Kovel, Carolien G F; Brilstra, Eva H; van Kempen, Marjan J A

    2016-01-01

    BACKGROUND: Many genes are candidates for involvement in epileptic encephalopathy (EE) because one or a few possibly pathogenic variants have been found in patients, but insufficient genetic or functional evidence exists for a definite annotation. METHODS: To increase the number of validated EE...

  20. Elevated risks for amyotrophic lateral sclerosis and blood disorders in Ashkenazi schizophrenic pedigrees suggest new candidate genes in schizophrenia

    Energy Technology Data Exchange (ETDEWEB)

    Goodman, A.B. [Columbia Univ. School of Public Health, New York, NY (United States)

    1994-09-15

    Among relatives of Ashkenazi schizophrenic probands the rate of amyotrophic lateral sclerosis was 3/1,000, compared to expected population rates of approximately 2/100,000. Relative risk of bleeding disorders, including hematologic cancers, was increased more than three-fold compared to controls. Co-occurrence of motor neuron disease and blood dyscrasias, accompanied by psychosis, has long been recognized. A virally-mediated autoimmune pathogenesis has been proposed. However, the familial co-occurrence of these three disease entities raises the possibility that the disease constellation be considered as a manifestation of a common underlying genetic defect. Such expansion of the spectrum of affectation might enhance the power of both candidate gene and linkage studies. Based on these findings, the loci suggested as candidate regions in schizophrenia include a potential hot spot on chromosome 21q21-q22, involving the superoxide dismutase and amyloid precursor protein genes. Alternatively, genes on other chromosomes involved in the expression, transcription, or regulation of these genes, or associated with the illnesses of high frequency in these pedigrees are suggested. Candidates include the choroid plexus transport protein, transthyretin at 18q11.2-q12.1; the t(14;18)(q22;21) characterizing B-cell lymphoma-2, the most common form of hematologic cancer; and the 14q24 locus of early onset Alzheimer`s disease, c-Fos, transforming growth factor beta 3, and heat shock protein A2. Expression of hematologic cancers and the suggested candidate genes are known to involve retinoid pathways, and retinoid disregulation has been proposed as a cause of schizophrenia. 67 refs., 2 figs., 1 tab.

  1. Selection on plant male function genes identifies candidates for reproductive isolation of yellow monkeyflowers.

    Directory of Open Access Journals (Sweden)

    Jan E Aagaard

    Full Text Available Understanding the genetic basis of reproductive isolation promises insight into speciation and the origins of biological diversity. While progress has been made in identifying genes underlying barriers to reproduction that function after fertilization (post-zygotic isolation, we know much less about earlier acting pre-zygotic barriers. Of particular interest are barriers involved in mating and fertilization that can evolve extremely rapidly under sexual selection, suggesting they may play a prominent role in the initial stages of reproductive isolation. A significant challenge to the field of speciation genetics is developing new approaches for identification of candidate genes underlying these barriers, particularly among non-traditional model systems. We employ powerful proteomic and genomic strategies to study the genetic basis of conspecific pollen precedence, an important component of pre-zygotic reproductive isolation among yellow monkeyflowers (Mimulus spp. resulting from male pollen competition. We use isotopic labeling in combination with shotgun proteomics to identify more than 2,000 male function (pollen tube proteins within maternal reproductive structures (styles of M. guttatus flowers where pollen competition occurs. We then sequence array-captured pollen tube exomes from a large outcrossing population of M. guttatus, and identify those genes with evidence of selective sweeps or balancing selection consistent with their role in pollen competition. We also test for evidence of positive selection on these genes more broadly across yellow monkeyflowers, because a signal of adaptive divergence is a common feature of genes causing reproductive isolation. Together the molecular evolution studies identify 159 pollen tube proteins that are candidate genes for conspecific pollen precedence. Our work demonstrates how powerful proteomic and genomic tools can be readily adapted to non-traditional model systems, allowing for genome-wide screens

  2. The first set of EST resource for gene discovery and marker development in pigeonpea (Cajanus cajan L.

    Directory of Open Access Journals (Sweden)

    Byregowda Munishamappa

    2010-03-01

    .8% in molecular function. Further, 19 genes were identified differentially expressed between FW- responsive genotypes and 20 between SMD- responsive genotypes. Generated ESTs were compiled together with 908 ESTs available in public domain, at the time of analysis, and a set of 5,085 unigenes were defined that were used for identification of molecular markers in pigeonpea. For instance, 3,583 simple sequence repeat (SSR motifs were identified in 1,365 unigenes and 383 primer pairs were designed. Assessment of a set of 84 primer pairs on 40 elite pigeonpea lines showed polymorphism with 15 (28.8% markers with an average of four alleles per marker and an average polymorphic information content (PIC value of 0.40. Similarly, in silico mining of 133 contigs with ≥ 5 sequences detected 102 single nucleotide polymorphisms (SNPs in 37 contigs. As an example, a set of 10 contigs were used for confirming in silico predicted SNPs in a set of four genotypes using wet lab experiments. Occurrence of SNPs were confirmed for all the 6 contigs for which scorable and sequenceable amplicons were generated. PCR amplicons were not obtained in case of 4 contigs. Recognition sites for restriction enzymes were identified for 102 SNPs in 37 contigs that indicates possibility of assaying SNPs in 37 genes using cleaved amplified polymorphic sequences (CAPS assay. Conclusion The pigeonpea EST dataset generated here provides a transcriptomic resource for gene discovery and development of functional markers associated with biotic stress resistance. Sequence analyses of this dataset have showed conservation of a considerable number of pigeonpea transcripts across legume and model plant species analysed as well as some putative pigeonpea specific genes. Validation of identified biotic stress responsive genes should provide candidate genes for allele mining as well as candidate markers for molecular breeding.

  3. Genetic variation at hair length candidate genes in elephants and the extinct woolly mammoth

    Directory of Open Access Journals (Sweden)

    Tisdale Michele

    2009-09-01

    Full Text Available Abstract Background Like humans, the living elephants are unusual among mammals in being sparsely covered with hair. Relative to extant elephants, the extinct woolly mammoth, Mammuthus primigenius, had a dense hair cover and extremely long hair, which likely were adaptations to its subarctic habitat. The fibroblast growth factor 5 (FGF5 gene affects hair length in a diverse set of mammalian species. Mutations in FGF5 lead to recessive long hair phenotypes in mice, dogs, and cats; and the gene has been implicated in hair length variation in rabbits. Thus, FGF5 represents a leading candidate gene for the phenotypic differences in hair length notable between extant elephants and the woolly mammoth. We therefore sequenced the three exons (except for the 3' UTR and a portion of the promoter of FGF5 from the living elephantid species (Asian, African savanna and African forest elephants and, using protocols for ancient DNA, from a woolly mammoth. Results Between the extant elephants and the mammoth, two single base substitutions were observed in FGF5, neither of which alters the amino acid sequence. Modeling of the protein structure suggests that the elephantid proteins fold similarly to the human FGF5 protein. Bioinformatics analyses and DNA sequencing of another locus that has been implicated in hair cover in humans, type I hair keratin pseudogene (KRTHAP1, also yielded negative results. Interestingly, KRTHAP1 is a pseudogene in elephantids as in humans (although fully functional in non-human primates. Conclusion The data suggest that the coding sequence of the FGF5 gene is not the critical determinant of hair length differences among elephantids. The results are discussed in the context of hairlessness among mammals and in terms of the potential impact of large body size, subarctic conditions, and an aquatic ancestor on hair cover in the Proboscidea.

  4. PSPHL as a candidate gene influencing racial disparities in endometrial cancer incidence and survival

    Directory of Open Access Journals (Sweden)

    Jay eAllard

    2012-07-01

    Full Text Available Endometrial cancer is the most commonly diagnosed gynecologic malignancy in the United States and is characterized by a well recognized racial disparity in both incidence and survival. Specifically Caucasians are about two times more likely to develop endometrial cancer than are African Americans. However, African American women are more likely to die from this disease than are Caucasians. The basis for this disparity remains unknown. Previous studies have identified differences in the types and frequencies of gene mutations among endometrial cancers from Caucasians and African Americans suggesting. We performed a gene expression microarray study in an effort to further examine differences between African American and Caucasian women’s endometrial cancers. This expression screen identified a list of potential biomarkers differentially expressed between these two groups of cancers. Of these we identified a poorly characterized transcript with a region of homology to phospho serine phospatase (PSPH and designated phospho serine phospatase like (PSPHL as the most differentially over-expressed gene in cancers from African Americans. We clarified the nature of expressed transcripts. Northern blot analysis confirmed PSPHL messages under 1 KB. Sequence analysis of transcripts confirmed two alternate open reading frame (ORF isoforms due to alternative splicing events. Splice specific primer sets confirmed both isoforms were differentially expressed in tissues from Caucasians and African Americans. We further examined the expression in other tissues from women to include normal endometrium, normal and malignant ovary. In all cases PSPHL expression was more often present in tissues from African-Americans than Caucasians. Our data confirm the African-American based expression of the PSPHL transcript several tissue types. PSPHL represents a candidate gene that might influence the observed racial disparity in endometrial and other cancers.

  5. A family with X-linked anophthalmia: exclusion of SOX3 as a candidate gene.

    Science.gov (United States)

    Slavotinek, Anne; Lee, Stephen S; Hamilton, Steven P

    2005-10-01

    We report on a four-generation family with X-linked anophthalmia in four affected males and show that this family has LOD scores consistent with linkage to Xq27, the third family reported to be linked to the ANOP1 locus. We sequenced the SOX3 gene at Xq27 as a candidate gene for the X-linked anophthalmia based on the high homology of this gene to SOX2, a gene previously mutated in bilateral anophthlamia. However, no amino acid sequence alterations were identified in SOX3. We have improved the definition of the phenotype in males with anophthalmia linked to the ANOP1 locus, as microcephaly, ocular colobomas, and severe renal malformations have not been described in families linked to ANOP1. (c) 2005 Wiley-Liss, Inc.

  6. Gene Duplication and Gene Expression Changes Play a Role in the Evolution of Candidate Pollen Feeding Genes in Heliconius Butterflies.

    Science.gov (United States)

    Smith, Gilbert; Macias-Muñoz, Aide; Briscoe, Adriana D

    2016-09-02

    Heliconius possess a unique ability among butterflies to feed on pollen. Pollen feeding significantly extends their lifespan, and is thought to have been important to the diversification of the genus. We used RNA sequencing to examine feeding-related gene expression in the mouthparts of four species of Heliconius and one nonpollen feeding species, Eueides isabella We hypothesized that genes involved in morphology and protein metabolism might be upregulated in Heliconius because they have longer proboscides than Eueides, and because pollen contains more protein than nectar. Using de novo transcriptome assemblies, we tested these hypotheses by comparing gene expression in mouthparts against antennae and legs. We first looked for genes upregulated in mouthparts across all five species and discovered several hundred genes, many of which had functional annotations involving metabolism of proteins (cocoonase), lipids, and carbohydrates. We then looked specifically within Heliconius where we found eleven common upregulated genes with roles in morphology (CPR cuticle proteins), behavior (takeout-like), and metabolism (luciferase-like). Closer examination of these candidates revealed that cocoonase underwent several duplications along the lineage leading to heliconiine butterflies, including two Heliconius-specific duplications. Luciferase-like genes also underwent duplication within lepidopterans, and upregulation in Heliconius mouthparts. Reverse-transcription PCR confirmed that three cocoonases, a peptidase, and one luciferase-like gene are expressed in the proboscis with little to no expression in labial palps and salivary glands. Our results suggest pollen feeding, like other dietary specializations, was likely facilitated by adaptive expansions of preexisting genes-and that the butterfly proboscis is involved in digestive enzyme production. © The Author(s) 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  7. Mutation analysis of the candidate genes -, , and in patients with arrhythmogenic right ventricular cardiomyopathy

    DEFF Research Database (Denmark)

    Refsgaard, Lena; Olesen, Morten Salling; Møller, Daniel Vega

    2012-01-01

    INTRODUCTION: Arrhythmogenic right ventricular cardiomyopathy (ARVC) is a genetically determined heart disease characterized by fibrofatty infiltrations in the myocardium, right and/or left ventricular involvement, and ventricular tachyarrhythmias. Although ten genes have been associated with ARVC......, only about 40% of the patients have an identifiable disease-causing mutation. In the present study we aimed at investigating the involvement of the genes SCN1B-SCN4B, FHL1, and LMNA in the pathogenesis of ARVC. METHODS: Sixty-five unrelated patients (55 fulfilling ARVC criteria and 10 borderline cases...... of the variants was non-synonymous. No disease-causing mutations were identified. CONCLUSIONS: In our limited sized cohort the six studied candidate genes were not associated with ARVC....

  8. Basal host resistance of barley to powdery mildew: connecting quantitative trait loci and candidate genes

    NARCIS (Netherlands)

    Aghnoum, R.; Marcel, T.C.; Johrde, A.; Pecchioni, N.; Schweizer, P.; Niks, R.E.

    2010-01-01

    The basal resistance of barley to powdery mildew (Blumeria graminis f. sp. hordei) is a quantitatively inherited trait that is based on nonhypersensitive mechanisms of defense. A functional genomic approach indicates that many plant candidate genes are involved in the defense against formation of

  9. A Public Platform for the Verification of the Phenotypic Effect of Candidate Genes for Resistance to Aflatoxin Accumulation and Aspergillus flavus Infection in Maize

    Directory of Open Access Journals (Sweden)

    Xueyan Shan

    2011-06-01

    Full Text Available A public candidate gene testing pipeline for resistance to aflatoxin accumulation or Aspergillus flavus infection in maize is presented here. The pipeline consists of steps for identifying, testing, and verifying the association of selected maize gene sequences with resistance under field conditions. Resources include a database of genetic and protein sequences associated with the reduction in aflatoxin contamination from previous studies; eight diverse inbred maize lines for polymorphism identification within any maize gene sequence; four Quantitative Trait Loci (QTL mapping populations and one association mapping panel, all phenotyped for aflatoxin accumulation resistance and associated phenotypes; and capacity for Insertion/Deletion (InDel and SNP genotyping in the population(s for mapping. To date, ten genes have been identified as possible candidate genes and put through the candidate gene testing pipeline, and results are presented here to demonstrate the utility of the pipeline.

  10. Candidate gene association mapping of Sclerotinia stalk rot resistance in sunflower (Helianthus annuus L.) uncovers the importance of COI1 homologs.

    Science.gov (United States)

    Talukder, Zahirul I; Hulke, Brent S; Qi, Lili; Scheffler, Brian E; Pegadaraju, Venkatramana; McPhee, Kevin; Gulya, Thomas J

    2014-01-01

    Functional markers for Sclerotinia basal stalk rot resistance in sunflower were obtained using gene-level information from the model species Arabidopsis thaliana. Sclerotinia stalk rot, caused by Sclerotinia sclerotiorum, is one of the most destructive diseases of sunflower (Helianthus annuus L.) worldwide. Markers for genes controlling resistance to S. sclerotiorum will enable efficient marker-assisted selection (MAS). We sequenced eight candidate genes homologous to Arabidopsis thaliana defense genes known to be associated with Sclerotinia disease resistance in a sunflower association mapping population evaluated for Sclerotinia stalk rot resistance. The total candidate gene sequence regions covered a concatenated length of 3,791 bp per individual. A total of 187 polymorphic sites were detected for all candidate gene sequences, 149 of which were single nucleotide polymorphisms (SNPs) and 38 were insertions/deletions. Eight SNPs in the coding regions led to changes in amino acid codons. Linkage disequilibrium decay throughout the candidate gene regions declined on average to an r (2) = 0.2 for genetic intervals of 120 bp, but extended up to 350 bp with r (2) = 0.1. A general linear model with modification to account for population structure was found the best fitting model for this population and was used for association mapping. Both HaCOI1-1 and HaCOI1-2 were found to be strongly associated with Sclerotinia stalk rot resistance and explained 7.4 % of phenotypic variation in this population. These SNP markers associated with Sclerotinia stalk rot resistance can potentially be applied to the selection of favorable genotypes, which will significantly improve the efficiency of MAS during the development of stalk rot resistant cultivars.

  11. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    NARCIS (Netherlands)

    Hettne, K.M.; Boorsma, A.; Dartel, D.A. van; Goeman, J.J.; Jong, E. de; Piersma, A.H.; Stierum, R.H.; Kleinjans, J.C.; Kors, J.A.

    2013-01-01

    BACKGROUND: Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set

  12. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    NARCIS (Netherlands)

    Hettne, K.M.; Boorsma, A.; Dartel, van D.A.M.; Goeman, J.J.; Jong, de E.; Piersma, A.H.; Stierum, R.H.; Kleinjans, J.C.; Kors, J.A.

    2013-01-01

    Background: Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set

  13. Scuba: scalable kernel-based gene prioritization.

    Science.gov (United States)

    Zampieri, Guido; Tran, Dinh Van; Donini, Michele; Navarin, Nicolò; Aiolli, Fabio; Sperduti, Alessandro; Valle, Giorgio

    2018-01-25

    The uncovering of genes linked to human diseases is a pressing challenge in molecular biology and precision medicine. This task is often hindered by the large number of candidate genes and by the heterogeneity of the available information. Computational methods for the prioritization of candidate genes can help to cope with these problems. In particular, kernel-based methods are a powerful resource for the integration of heterogeneous biological knowledge, however, their practical implementation is often precluded by their limited scalability. We propose Scuba, a scalable kernel-based method for gene prioritization. It implements a novel multiple kernel learning approach, based on a semi-supervised perspective and on the optimization of the margin distribution. Scuba is optimized to cope with strongly unbalanced settings where known disease genes are few and large scale predictions are required. Importantly, it is able to efficiently deal both with a large amount of candidate genes and with an arbitrary number of data sources. As a direct consequence of scalability, Scuba integrates also a new efficient strategy to select optimal kernel parameters for each data source. We performed cross-validation experiments and simulated a realistic usage setting, showing that Scuba outperforms a wide range of state-of-the-art methods. Scuba achieves state-of-the-art performance and has enhanced scalability compared to existing kernel-based approaches for genomic data. This method can be useful to prioritize candidate genes, particularly when their number is large or when input data is highly heterogeneous. The code is freely available at https://github.com/gzampieri/Scuba .

  14. Candidate Genes for Aggressiveness in a Natural Fusarium culmorum Population Greatly Differ between Wheat and Rye Head Blight

    Directory of Open Access Journals (Sweden)

    Valheria Castiblanco

    2018-01-01

    Full Text Available Fusarium culmorum is one of the species causing Fusarium head blight (FHB in cereals in Europe. We aimed to investigate the association between the nucleotide diversity of ten F. culmorum candidate genes and field ratings of aggressiveness in winter rye. A total of 100 F. culmorum isolates collected from natural infections were phenotyped for FHB at two locations and two years. Variance components for aggressiveness showed significant isolate and isolate-by-environment variance, as expected for quantitative host-pathogen interactions. Further analysis of the isolate-by-environment interaction revealed the dominant role of the isolate-by-year over isolate-by-location interaction. One single-nucleotide polymorphism (SNP in the cutinase (CUT gene was found to be significantly (p < 0.001 associated with aggressiveness and explained 16.05% of the genotypic variance of this trait in rye. The SNP was located 60 base pairs before the start codon, which suggests a role in transcriptional regulation. Compared to a previous study in winter wheat with the same nucleotide sequences, a larger variation of pathogen aggressiveness on rye was found and a different candidate gene was associated with pathogen aggressiveness. This is the first report on the association of field aggressiveness and a host-specific candidate gene codifying for a protein that belongs to the secretome in F. culmorum.

  15. Whole Exome Sequencing in Females with Autism Implicates Novel and Candidate Genes

    Directory of Open Access Journals (Sweden)

    Merlin G. Butler

    2015-01-01

    Full Text Available Classical autism or autistic disorder belongs to a group of genetically heterogeneous conditions known as Autism Spectrum Disorders (ASD. Heritability is estimated as high as 90% for ASD with a recently reported compilation of 629 clinically relevant candidate and known genes. We chose to undertake a descriptive next generation whole exome sequencing case study of 30 well-characterized Caucasian females with autism (average age, 7.7 ± 2.6 years; age range, 5 to 16 years from multiplex families. Genomic DNA was used for whole exome sequencing via paired-end next generation sequencing approach and X chromosome inactivation status. The list of putative disease causing genes was developed from primary selection criteria using machine learning-derived classification score and other predictive parameters (GERP2, PolyPhen2, and SIFT. We narrowed the variant list to 10 to 20 genes and screened for biological significance including neural development, function and known neurological disorders. Seventy-eight genes identified met selection criteria ranging from 1 to 9 filtered variants per female. Five females presented with functional variants of X-linked genes (IL1RAPL1, PIR, GABRQ, GPRASP2, SYTL4 with cadherin, protocadherin and ankyrin repeat gene families most commonly altered (e.g., CDH6, FAT2, PCDH8, CTNNA3, ANKRD11. Other genes related to neurogenesis and neuronal migration (e.g., SEMA3F, MIDN, were also identified.

  16. Dynamic QTL analysis and candidate gene mapping for waterlogging tolerance at maize seedling stage.

    Directory of Open Access Journals (Sweden)

    Khalid A Osman

    Full Text Available Soil waterlogging is one of the major abiotic stresses adversely affecting maize growth and yield. To identify dynamic expression of genes or quantitative trait loci (QTL, QTL associated with plant height, root length, root dry weight, shoot dry weight and total dry weight were identified via conditional analysis in a mixed linear model and inclusive composite interval mapping method at three respective periods under waterlogging and control conditions. A total of 13, 19 and 23 QTL were detected at stages 3D|0D (the period during 0-3 d of waterlogging, 6D|3D and 9D|6D, respectively. The effects of each QTL were moderate and distributed over nine chromosomes, singly explaining 4.14-18.88% of the phenotypic variation. Six QTL (ph6-1, rl1-2, sdw4-1, sdw7-1, tdw4-1 and tdw7-1 were identified at two consistent stages of seedling development, which could reflect a continuous expression of genes; the remaining QTL were detected at only one stage. Thus, expression of most QTL was influenced by the developmental status. In order to provide additional evidence regarding the role of corresponding genes in waterlogging tolerance, mapping of Expressed Sequence Tags markers and microRNAs were conducted. Seven candidate genes were observed to co-localize with the identified QTL on chromosomes 1, 4, 6, 7 and 9, and may be important candidate genes for waterlogging tolerance. These results are a good starting point for understanding the genetic basis for selectively expressing of QTL in different stress periods and the common genetic control mechanism of the co-localized traits.

  17. Genetic and Proteomic Interrogation of Lower Confidence Candidate Genes Reveals Signaling Networks in beta-Catenin-Active Cancers | Office of Cancer Genomics

    Science.gov (United States)

    Genome-scale expression studies and comprehensive loss-of-function genetic screens have focused almost exclusively on the highest confidence candidate genes. Here, we describe a strategy for characterizing the lower confidence candidates identified by such approaches.

  18. Association Study of 60 Candidate Genes with Antipsychotic-induced Weight Gain in Schizophrenia Patients.

    Science.gov (United States)

    Ryu, S; Huh, I-S; Cho, E-Y; Cho, Y; Park, T; Yoon, S C; Joo, Y H; Hong, K S

    2016-03-01

    This study aimed to investigate the association of multiple candidate genes with weight gain and appetite change during antipsychotic treatment. A total of 233 single nucleotide polymorphisms (SNPs) within 60 candidate genes were genotyped. BMI changes for up to 8 weeks in 84 schizophrenia patients receiving antipsychotic medication were analyzed using a linear mixed model. In addition, we assessed appetite change during antipsychotic treatment in a different group of 46 schizophrenia patients using the Drug-Related Eating Behavior Questionnaire. No SNP showed a statistically significant association with BMI or appetite change after correction for multiple testing. We observed trends of association (PGHRL showed suggestive evidence of association with not only weight gain (P=0.001) but also appetite change (P=0.042). Patients carrying the GG genotype of rs696217 exhibited higher increase in both BMI and appetite compared to patients carrying the GT/TT genotype. Our findings suggested the involvement of a GHRL polymorphism in weight gain, which was specifically mediated by appetite change, during antipsychotic treatment in schizophrenia patients. © Georg Thieme Verlag KG Stuttgart · New York.

  19. Genetic mapping reveals a candidate gene (ClFS1) for fruit shape in watermelon (Citrullus lanatus L.).

    Science.gov (United States)

    Dou, Junling; Zhao, Shengjie; Lu, Xuqiang; He, Nan; Zhang, Lei; Ali, Aslam; Kuang, Hanhui; Liu, Wenge

    2018-04-01

    A 159 bp deletion in ClFS1 gene encoding IQD protein is responsible for fruit shape in watermelon. Watermelon [Citrullus lanatus (Thunb.) Matsum. & Nakai] is known for its rich diversity in fruit size and shape. Fruit shape has been one of the major objectives of watermelon breeding. However, the candidate genes and the underlying genetic mechanism for such an important trait in watermelon are unknown. In this study, we identified a locus on chromosome 3 of watermelon genome controlling fruit shape. Segregation analysis in F 2 and BC 1 populations derived from a cross between two inbred lines "Duan125" (elongate fruit) and "Zhengzhouzigua" (spherical fruit) suggests that fruit shape of watermelon is controlled by a single locus and elongate fruit (OO) is incompletely dominant to spherical fruit (oo) with the heterozygote (Oo) being oval fruit. GWAS profiles among 315 accessions identified a major locus designated on watermelon chromosome 3, which was confirmed by BSA-seq mapping in the F 2 population. The candidate gene was mapped to a region 46 kb on chromosome 3. There were only four genes present in the corresponding region in the reference genome. Four candidate genes were sequenced in this region, revealing that the CDS of Cla011257 had a 159 bp deletion which resulted in the omission of 53 amino acids in elongate watermelon. An indel marker was derived from the 159 bp deletion to test the F 2 population and 105 watermelon accessions. The results showed that Cla011257 cosegregated with watermelon fruit shape. In addition, the Cla011257 expression was the highest at ovary formation stage. The predicted protein of the Cla011257 gene fitted in IQD protein family which was reported to have association with cell arrays and Ca 2+ -CaM signaling modules. Clear understanding of the genes facilitating the fruit shape along with marker association selection will be an effective way to develop new cultivars.

  20. Gene set analysis: limitations in popular existing methods and proposed improvements.

    Science.gov (United States)

    Mishra, Pashupati; Törönen, Petri; Leino, Yrjö; Holm, Liisa

    2014-10-01

    Gene set analysis is the analysis of a set of genes that collectively contribute to a biological process. Most popular gene set analysis methods are based on empirical P-value that requires large number of permutations. Despite numerous gene set analysis methods developed in the past decade, the most popular methods still suffer from serious limitations. We present a gene set analysis method (mGSZ) based on Gene Set Z-scoring function (GSZ) and asymptotic P-values. Asymptotic P-value calculation requires fewer permutations, and thus speeds up the gene set analysis process. We compare the GSZ-scoring function with seven popular gene set scoring functions and show that GSZ stands out as the best scoring function. In addition, we show improved performance of the GSA method when the max-mean statistics is replaced by the GSZ scoring function. We demonstrate the importance of both gene and sample permutations by showing the consequences in the absence of one or the other. A comparison of asymptotic and empirical methods of P-value estimation demonstrates a clear advantage of asymptotic P-value over empirical P-value. We show that mGSZ outperforms the state-of-the-art methods based on two different evaluations. We compared mGSZ results with permutation and rotation tests and show that rotation does not improve our asymptotic P-values. We also propose well-known asymptotic distribution models for three of the compared methods. mGSZ is available as R package from cran.r-project.org. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  1. Beyond main effects of gene-sets: harsh parenting moderates the association between a dopamine gene-set and child externalizing behavior

    NARCIS (Netherlands)

    J. Windhorst (Judith); V. Mileva-Seitz (Viara); R.C.A. Rippe (Ralph C.A.); H.W. Tiemeier (Henning); V.W.V. Jaddoe (Vincent); F.C. Verhulst (Frank); M.H. van IJzendoorn (Rien); M.J. Bakermans-Kranenburg (Marian)

    2016-01-01

    textabstractBackground: In a longitudinal cohort study, we investigated the interplay of harsh parenting and genetic variation across a set of functionally related dopamine genes, in association with children's externalizing behavior. This is one of the first studies to employ gene-based and

  2. Genome-wide association studies and epistasis analyses of candidate genes related to age at menarche and age at natural menopause in a Korean population.

    Science.gov (United States)

    Pyun, Jung-A; Kim, Sunshin; Cho, Nam H; Koh, InSong; Lee, Jong-Young; Shin, Chol; Kwack, KyuBum

    2014-05-01

    The aim of this study was to identify polymorphisms and gene-gene interactions that are significantly associated with age at menarche and age at menopause in a Korean population. A total of 3,452 and 1,827 women participated in studies of age at menarche and age at natural menopause, respectively. Linear regression analyses adjusted for residence area were used to perform genome-wide association studies (GWAS), candidate gene association studies, and interactions between the candidate genes for age at menarche and age at natural menopause. In GWAS, four single nucleotide polymorphisms (SNPs; rs7528241, rs1324329, rs11597068, and rs6495785) were strongly associated with age at natural menopause (lowest P = 9.66 × 10). However, GWAS of age at menarche did not reveal any strong associations. In candidate gene association studies, SNPs with P menopause, there was a significant interaction between intronic SNPs on ADAM metallopeptidase with thrombospondin type I motif 9 (ADAMTS9) and SMAD family member 3 (SMAD3) genes (P = 9.52 × 10). For age at menarche, there were three significant interactions between three intronic SNPs on follicle-stimulating hormone receptor (FSHR) gene and one SNP located at the 3' flanking region of insulin-like growth factor 2 receptor (IGF2R) gene (lowest P = 1.95 × 10). Novel SNPs and synergistic interactions between candidate genes are significantly associated with age at menarche and age at natural menopause in a Korean population.

  3. Analysis of PSPHL as a Candidate Gene Influencing the Racial Disparity in Endometrial Cancer

    Energy Technology Data Exchange (ETDEWEB)

    Allard, Jay E. [Walter Reed Army Medical Center, Washington, DC (United States); Chandramouli, Gadisetti V. R. [Department of Obstetrics, Gynecology and Reproductive Biology, Michigan State University College of Human Medicine, Grand Rapids, MI (United States); Stagliano, Katherine [Curtis and Elizabeth Anderson Cancer Institute at Memorial Health University Medical Center, Savannah, GA (United States); Hood, Brian L. [Women’s Health Integrated Research Center at Inova Health System, Annandale, VA (United States); Litzi, Tracy [Walter Reed Army Medical Center, Washington, DC (United States); Women’s Health Integrated Research Center at Inova Health System, Annandale, VA (United States); Shoji, Yutaka [Department of Obstetrics, Gynecology and Reproductive Biology, Michigan State University College of Human Medicine, Grand Rapids, MI (United States); Curtis and Elizabeth Anderson Cancer Institute at Memorial Health University Medical Center, Savannah, GA (United States); Boyd, Jeff [Curtis and Elizabeth Anderson Cancer Institute at Memorial Health University Medical Center, Savannah, GA (United States); Fox Chase Cancer Center, Philadelphia, PA (United States); Berchuck, Andrew [Division of Gynecologic Oncology, Duke University, Durham, NC (United States); Conrads, Thomas P. [Curtis and Elizabeth Anderson Cancer Institute at Memorial Health University Medical Center, Savannah, GA (United States); Maxwell, G. Larry [Walter Reed Army Medical Center, Washington, DC (United States); Women’s Health Integrated Research Center at Inova Health System, Annandale, VA (United States); Risinger, John I., E-mail: john.risinger@hc.msu.edu [Department of Obstetrics, Gynecology and Reproductive Biology, Michigan State University College of Human Medicine, Grand Rapids, MI (United States); Curtis and Elizabeth Anderson Cancer Institute at Memorial Health University Medical Center, Savannah, GA (United States)

    2012-07-04

    Endometrial cancer is the most commonly diagnosed gynecologic malignancy in the United States. A well recognized disparity by race in both incidence and survival outcome exists for this cancer. Specifically Caucasians are about two times more likely to develop endometrial cancer than are African-Americans. However, African-American women are more likely to die from this disease than are Caucasians. The basis for this disparity remains unknown. Previous studies have identified differences in the types and frequencies of gene mutations among endometrial cancers from Caucasians and African-Americans suggesting that the tumors from these two groups might have differing underlying genetic defects. We performed a gene expression microarray study in an effort to identify differentially expressed transcripts between African-American and Caucasian women’s endometrial cancers. Our gene expression screen identified a list of potential biomarkers that are differentially expressed between these two groups of cancers. Of these we identified a poorly characterized transcript with a region of homology to phospho serine phosphatase (PSPH) and designated phospho serine phosphatase like (PSPHL) as the most differentially over-expressed gene in cancers from African-Americans. We further clarified the nature of expressed transcripts. Northern blot analysis confirmed the message was limited to a transcript of under 1 kB. Sequence analysis of transcripts confirmed two alternate open reading frame (ORF) isoforms due to alternative splicing events. Splice specific primer sets confirmed both isoforms were differentially expressed in tissues from Caucasians and African-Americans. We further examined the expression in other tissues from women to include normal endometrium, normal and malignant ovary. In all cases PSPHL expression was more often present in tissues from African-Americans than Caucasians. Our data confirm the African-American based expression of the PSPHL transcript in

  4. Analysis of PSPHL as a Candidate Gene Influencing the Racial Disparity in Endometrial Cancer

    International Nuclear Information System (INIS)

    Allard, Jay E.; Chandramouli, Gadisetti V. R.; Stagliano, Katherine; Hood, Brian L.; Litzi, Tracy; Shoji, Yutaka; Boyd, Jeff; Berchuck, Andrew; Conrads, Thomas P.; Maxwell, G. Larry; Risinger, John I.

    2012-01-01

    Endometrial cancer is the most commonly diagnosed gynecologic malignancy in the United States. A well recognized disparity by race in both incidence and survival outcome exists for this cancer. Specifically Caucasians are about two times more likely to develop endometrial cancer than are African-Americans. However, African-American women are more likely to die from this disease than are Caucasians. The basis for this disparity remains unknown. Previous studies have identified differences in the types and frequencies of gene mutations among endometrial cancers from Caucasians and African-Americans suggesting that the tumors from these two groups might have differing underlying genetic defects. We performed a gene expression microarray study in an effort to identify differentially expressed transcripts between African-American and Caucasian women’s endometrial cancers. Our gene expression screen identified a list of potential biomarkers that are differentially expressed between these two groups of cancers. Of these we identified a poorly characterized transcript with a region of homology to phospho serine phosphatase (PSPH) and designated phospho serine phosphatase like (PSPHL) as the most differentially over-expressed gene in cancers from African-Americans. We further clarified the nature of expressed transcripts. Northern blot analysis confirmed the message was limited to a transcript of under 1 kB. Sequence analysis of transcripts confirmed two alternate open reading frame (ORF) isoforms due to alternative splicing events. Splice specific primer sets confirmed both isoforms were differentially expressed in tissues from Caucasians and African-Americans. We further examined the expression in other tissues from women to include normal endometrium, normal and malignant ovary. In all cases PSPHL expression was more often present in tissues from African-Americans than Caucasians. Our data confirm the African-American based expression of the PSPHL transcript in

  5. Characterization of the canine desmin (DES) gene and evaluation as a candidate gene for dilated cardiomyopathy in the Dobermann.

    Science.gov (United States)

    Stabej, Polona; Imholz, Sandra; Versteeg, Serge A; Zijlstra, Carla; Stokhof, Arnold A; Domanjko-Petric, Aleksandra; Leegwater, Peter A J; van Oost, Bernard A

    2004-10-13

    Canine-dilated cardiomyopathy (DCM) in dogs is a disease of the myocardium associated with dilatation and impaired contraction of the ventricles and is suspected to have a genetic cause. A missense mutation in the desmin gene (DES) causes DCM in a human family. Human DCM closely resembles the canine disease. In the present study, we evaluated whether DES gene mutations are responsible for DCM in Dobermann dogs. We have isolated bacterial artificial chromosome clones (BACs) containing the canine DES gene and determined the chromosomal location by fluorescence in situ hybridization (FISH). Using data deposited in the NCBI trace archive and GenBank, the canine DES gene DNA sequence was assembled and seven single nucleotide polymorphisms (SNPs) were identified. From the canine DES gene BAC clones, a polymorphic microsatellite marker was isolated. The microsatellite marker and four informative desmin SNPs were typed in a Dobermann family with frequent DCM occurrence, but the disease phenotype did not associate with a desmin haplotype. We concluded that mutations in the DES gene do not play a role in Dobermann DCM. Availability of the microsatellite marker, SNPs and DNA sequence reported in this study enable fast evaluation of the DES gene as a DCM candidate gene in other dog breeds with DCM occurrence.

  6. Candidate gene analysis and exome sequencing confirm LBX1 as a susceptibility gene for idiopathic scoliosis

    DEFF Research Database (Denmark)

    Grauers, Anna; Wang, Jingwen; Einarsdottir, Elisabet

    2015-01-01

    samples from 100 surgically treated idiopathic scoliosis patients. Novel or rare missense, nonsense, or splice site variants were selected for individual genotyping in the 1,739 cases and 1,812 controls. In addition, the 5'UTR, noncoding exon and promoter regions of LBX1, not covered by exome sequencing...... by exome sequencing after filtration and an initial genotyping validation. However, we could not verify any association to idiopathic scoliosis in the large cohort of 1,739 cases and 1,812 controls. We did not find any variants in the 5'UTR, noncoding exon and promoter regions of LBX1. CONCLUSIONS: Here...... that are significantly associated with idiopathic scoliosis in Asian and Caucasian populations, rs11190870 close to the LBX1 gene being the most replicated finding. PURPOSE: The aim of the present study was to investigate the genetics of idiopathic scoliosis in a Scandinavian cohort by performing a candidate gene study...

  7. TGIF1 is a potential candidate gene for high myopia in ethnic Kashmiri population.

    Science.gov (United States)

    Ahmed, Ishfaq; Rasool, Shabhat; Jan, Tariq; Qureshi, Tariq; Naykoo, Niyaz A; Andrabi, Khurshid I

    2014-03-01

    High myopia is a complex disorder that imposes serious consequences on ocular health. Linkage analysis has identified several genetic loci with a series of potential candidate genes that reveal an ambiguous pattern of association with high myopia due to population heterogeneity. We have accordingly chosen to examine the prospect of association of one such gene [transforming growth β-induced factor 1 (TGIF1)] in population that is purely ethnic (Kashmiri) and represents a homogeneous cohort from Northern India. Cases with high myopia with a spherical equivalent of ≥-6 diopters (D) and emmetropic controls with spherical equivalent within ±0.5 D in one or both eyes represented by a sample size of 212 ethnic Kashmiri subjects and 239 matched controls. Genomic DNA was genotyped for sequence variations in TGIF1 gene and allele frequencies tested for Hardy-Weinberg disequilibrium. Potential association was evaluated using χ(2) or Fisher's exact test. Two previously reported missense variations C > T, rs4468717 (first base of codon 143) changing proline to serine and rs2229333 (second base of codon 143) changing proline to leucine were identified in exon 10 of TGIF1. Both variations exhibited possibly significant (p population. In silico predictions show that substitutions are likely to have an impact on the structure and functional properties of the protein, making it imperative to understand their functional consequences in relation to high myopia. TGIF1 is a relevant candidate gene with potential to contribute in the genesis of high myopia.

  8. RNA deep sequencing reveals novel candidate genes and polymorphisms in boar testis and liver tissues with divergent androstenone levels.

    Directory of Open Access Journals (Sweden)

    Asep Gunawan

    Full Text Available Boar taint is an unpleasant smell and taste of pork meat derived from some entire male pigs. The main causes of boar taint are the two compounds androstenone (5α-androst-16-en-3-one and skatole (3-methylindole. It is crucial to understand the genetic mechanism of boar taint to select pigs for lower androstenone levels and thus reduce boar taint. The aim of the present study was to investigate transcriptome differences in boar testis and liver tissues with divergent androstenone levels using RNA deep sequencing (RNA-Seq. The total number of reads produced for each testis and liver sample ranged from 13,221,550 to 33,206,723 and 12,755,487 to 46,050,468, respectively. In testis samples 46 genes were differentially regulated whereas 25 genes showed differential expression in the liver. The fold change values ranged from -4.68 to 2.90 in testis samples and -2.86 to 3.89 in liver samples. Differentially regulated genes in high androstenone testis and liver samples were enriched in metabolic processes such as lipid metabolism, small molecule biochemistry and molecular transport. This study provides evidence for transcriptome profile and gene polymorphisms of boars with divergent androstenone level using RNA-Seq technology. Digital gene expression analysis identified candidate genes in flavin monooxygenease family, cytochrome P450 family and hydroxysteroid dehydrogenase family. Moreover, polymorphism and association analysis revealed mutation in IRG6, MX1, IFIT2, CYP7A1, FMO5 and KRT18 genes could be potential candidate markers for androstenone levels in boars. Further studies are required for proving the role of candidate genes to be used in genomic selection against boar taint in pig breeding programs.

  9. Genome-wide association study identified genetic variations and candidate genes for plant architecture component traits in Chinese upland cotton.

    Science.gov (United States)

    Su, Junji; Li, Libei; Zhang, Chi; Wang, Caixiang; Gu, Lijiao; Wang, Hantao; Wei, Hengling; Liu, Qibao; Huang, Long; Yu, Shuxun

    2018-06-01

    Thirty significant associations between 22 SNPs and five plant architecture component traits in Chinese upland cotton were identified via GWAS. Four peak SNP loci located on chromosome D03 were simultaneously associated with more plant architecture component traits. A candidate gene, Gh_D03G0922, might be responsible for plant height in upland cotton. A compact plant architecture is increasingly required for mechanized harvesting processes in China. Therefore, cotton plant architecture is an important trait, and its components, such as plant height, fruit branch length and fruit branch angle, affect the suitability of a cultivar for mechanized harvesting. To determine the genetic basis of cotton plant architecture, a genome-wide association study (GWAS) was performed using a panel composed of 355 accessions and 93,250 single nucleotide polymorphisms (SNPs) identified using the specific-locus amplified fragment sequencing method. Thirty significant associations between 22 SNPs and five plant architecture component traits were identified via GWAS. Most importantly, four peak SNP loci located on chromosome D03 were simultaneously associated with more plant architecture component traits, and these SNPs were harbored in one linkage disequilibrium block. Furthermore, 21 candidate genes for plant architecture were predicted in a 0.95-Mb region including the four peak SNPs. One of these genes (Gh_D03G0922) was near the significant SNP D03_31584163 (8.40 kb), and its Arabidopsis homologs contain MADS-box domains that might be involved in plant growth and development. qRT-PCR showed that the expression of Gh_D03G0922 was upregulated in the apical buds and young leaves of the short and compact cotton varieties, and virus-induced gene silencing (VIGS) proved that the silenced plants exhibited increased PH. These results indicate that Gh_D03G0922 is likely the candidate gene for PH in cotton. The genetic variations and candidate genes identified in this study lay a foundation

  10. Testing candidate genes for attention-deficit/hyperactivity disorder in fruit flies using a high throughput assay for complex behavior

    DEFF Research Database (Denmark)

    Rohde, Palle Duun; Madsen, Lisbeth Strøm; Arvidson, Sandra Marie Neumann

    2016-01-01

    Fruit flies are important model organisms for functional testing of candidate genes in multiple disciplines, including the study of human diseases. Here we use a high-throughput locomotor activity assay to test the response on activity behavior of gene disruption in Drosophila melanogaster. The aim...

  11. Identification of Photosynthesis-Associated C4 Candidate Genes through Comparative Leaf Gradient Transcriptome in Multiple Lineages of C3 and C4 Species

    Science.gov (United States)

    Ding, Zehong; Weissmann, Sarit; Wang, Minghui; Du, Baijuan; Huang, Lei; Wang, Lin; Tu, Xiaoyu; Zhong, Silin; Myers, Christopher; Brutnell, Thomas P.; Sun, Qi; Li, Pinghua

    2015-01-01

    Leaves of C4 crops usually have higher radiation, water and nitrogen use efficiencies compared to the C3 species. Engineering C4 traits into C3 crops has been proposed as one of the most promising ways to repeal the biomass yield ceiling. To better understand the function of C4 photosynthesis, and to identify candidate genes that are associated with the C4 pathways, a comparative transcription network analysis was conducted on leaf developmental gradients of three C4 species including maize, green foxtail and sorghum and one C3 species, rice. By combining the methods of gene co-expression and differentially co-expression networks, we identified a total of 128 C4 specific genes. Besides the classic C4 shuttle genes, a new set of genes associated with light reaction, starch and sucrose metabolism, metabolites transportation, as well as transcription regulation, were identified as involved in C4 photosynthesis. These findings will provide important insights into the differential gene regulation between C3 and C4 species, and a good genetic resource for establishing C4 pathways in C3 crops. PMID:26465154

  12. Identification of Photosynthesis-Associated C4 Candidate Genes through Comparative Leaf Gradient Transcriptome in Multiple Lineages of C3 and C4 Species.

    Science.gov (United States)

    Ding, Zehong; Weissmann, Sarit; Wang, Minghui; Du, Baijuan; Huang, Lei; Wang, Lin; Tu, Xiaoyu; Zhong, Silin; Myers, Christopher; Brutnell, Thomas P; Sun, Qi; Li, Pinghua

    2015-01-01

    Leaves of C4 crops usually have higher radiation, water and nitrogen use efficiencies compared to the C3 species. Engineering C4 traits into C3 crops has been proposed as one of the most promising ways to repeal the biomass yield ceiling. To better understand the function of C4 photosynthesis, and to identify candidate genes that are associated with the C4 pathways, a comparative transcription network analysis was conducted on leaf developmental gradients of three C4 species including maize, green foxtail and sorghum and one C3 species, rice. By combining the methods of gene co-expression and differentially co-expression networks, we identified a total of 128 C4 specific genes. Besides the classic C4 shuttle genes, a new set of genes associated with light reaction, starch and sucrose metabolism, metabolites transportation, as well as transcription regulation, were identified as involved in C4 photosynthesis. These findings will provide important insights into the differential gene regulation between C3 and C4 species, and a good genetic resource for establishing C4 pathways in C3 crops.

  13. Identification of Photosynthesis-Associated C4 Candidate Genes through Comparative Leaf Gradient Transcriptome in Multiple Lineages of C3 and C4 Species.

    Directory of Open Access Journals (Sweden)

    Zehong Ding

    Full Text Available Leaves of C4 crops usually have higher radiation, water and nitrogen use efficiencies compared to the C3 species. Engineering C4 traits into C3 crops has been proposed as one of the most promising ways to repeal the biomass yield ceiling. To better understand the function of C4 photosynthesis, and to identify candidate genes that are associated with the C4 pathways, a comparative transcription network analysis was conducted on leaf developmental gradients of three C4 species including maize, green foxtail and sorghum and one C3 species, rice. By combining the methods of gene co-expression and differentially co-expression networks, we identified a total of 128 C4 specific genes. Besides the classic C4 shuttle genes, a new set of genes associated with light reaction, starch and sucrose metabolism, metabolites transportation, as well as transcription regulation, were identified as involved in C4 photosynthesis. These findings will provide important insights into the differential gene regulation between C3 and C4 species, and a good genetic resource for establishing C4 pathways in C3 crops.

  14. Cis-eQTL analysis and functional validation of candidate susceptibility genes for high-grade serous ovarian cancer.

    Science.gov (United States)

    Lawrenson, Kate; Li, Qiyuan; Kar, Siddhartha; Seo, Ji-Heui; Tyrer, Jonathan; Spindler, Tassja J; Lee, Janet; Chen, Yibu; Karst, Alison; Drapkin, Ronny; Aben, Katja K H; Anton-Culver, Hoda; Antonenkova, Natalia; Baker, Helen; Bandera, Elisa V; Bean, Yukie; Beckmann, Matthias W; Berchuck, Andrew; Bisogna, Maria; Bjorge, Line; Bogdanova, Natalia; Brinton, Louise A; Brooks-Wilson, Angela; Bruinsma, Fiona; Butzow, Ralf; Campbell, Ian G; Carty, Karen; Chang-Claude, Jenny; Chenevix-Trench, Georgia; Chen, Anne; Chen, Zhihua; Cook, Linda S; Cramer, Daniel W; Cunningham, Julie M; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas T; Edwards, Robert P; Eilber, Ursula; Ekici, Arif B; Fasching, Peter A; Fridley, Brooke L; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G; Glasspool, Rosalind; Goode, Ellen L; Goodman, Marc T; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hasmad, Hanis Nazihah; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A T; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus; Hosono, Satoyo; Iversen, Edwin S; Jakubowska, Anna; James, Paul; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kruger Kjaer, Susanne; Kelemen, Linda E; Kellar, Melissa; Kelley, Joseph L; Kiemeney, Lambertus A; Krakstad, Camilla; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D; Lee, Alice W; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon F A G; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R; Nevanlinna, Heli; McNeish, Ian; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B; Narod, Steven A; Nedergaard, Lotte; Ness, Roberta B; Azmi, Mat Adenan Noor; Odunsi, Kunle; Olson, Sara H; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Pearce, Celeste L; Pejovic, Tanja; Pelttari, Liisa M; Permuth-Wey, Jennifer; Phelan, Catherine M; Pike, Malcolm C; Poole, Elizabeth M; Ramus, Susan J; Risch, Harvey A; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H; Rudolph, Anja; Runnebaum, Ingo B; Rzepecka, Iwona K; Salvesen, Helga B; Schildkraut, Joellen M; Schwaab, Ira; Sellers, Thomas A; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C; Sucheston, Lara; Tangen, Ingvild L; Teo, Soo-Hwang; Terry, Kathryn L; Thompson, Pamela J; Timorek, Agnieszka; Tsai, Ya-Yu; Tworoger, Shelley S; van Altena, Anne M; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S; Wicklund, Kristine G; Wilkens, Lynne R; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna H; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Monteiro, Alvaro; Pharoah, Paul D; Gayther, Simon A; Freedman, Matthew L

    2015-09-22

    Genome-wide association studies have reported 11 regions conferring risk of high-grade serous epithelial ovarian cancer (HGSOC). Expression quantitative trait locus (eQTL) analyses can identify candidate susceptibility genes at risk loci. Here we evaluate cis-eQTL associations at 47 regions associated with HGSOC risk (P≤10(-5)). For three cis-eQTL associations (P<1.4 × 10(-3), FDR<0.05) at 1p36 (CDC42), 1p34 (CDCA8) and 2q31 (HOXD9), we evaluate the functional role of each candidate by perturbing expression of each gene in HGSOC precursor cells. Overexpression of HOXD9 increases anchorage-independent growth, shortens population-doubling time and reduces contact inhibition. Chromosome conformation capture identifies an interaction between rs2857532 and the HOXD9 promoter, suggesting this SNP is a leading causal variant. Transcriptomic profiling after HOXD9 overexpression reveals enrichment of HGSOC risk variants within HOXD9 target genes (P=6 × 10(-10) for risk variants (P<10(-4)) within 10 kb of a HOXD9 target gene in ovarian cells), suggesting a broader role for this network in genetic susceptibility to HGSOC.

  15. Isolation and characterization of the human CDX1 gene: A candidate gene for diastrophic dysplasia

    Energy Technology Data Exchange (ETDEWEB)

    Bonner, C.; Loftus, S.; Wasmuth, J.J. [Univ. of California, Irvine, CA (United States)

    1994-09-01

    Diastrophic dysplasia is an autosomal recessive disorder characterized by short stature, dislocation of the joints, spinal deformities and malformation of the hands and feet. Multipoint linkage analysis places the diastrophic dysplasia (DTD) locus in 5q31-5q34. Linkage disequilibrium mapping places the DTD locus near CSFIR in the direction of PDGFRB (which is tandem to CSFIR). This same study tentatively placed PDGFRB and DTD proximal to CSFIR. Our results, as well as recently reported work from other laboratories, suggest that PDGFRB (and possibly DTD) is distal rather than proximal to CSFIR. We have constructed a cosmid contig covering approximately 200 kb of the region containing CSFIR. Several exons have been {open_quotes}trapped{close_quotes} from these cosmids using exon amplification. One of these exons was trapped from a cosmid isolated from a walk from PDGFRB, approximately 80 kb from CSFIR. This exon was sequenced and was determined to be 89% identical to the nucleotide sequence of exon two of the murine CDX1 gene (100% amino acid identity). The exon was used to isolate the human CDX gene. Sequence analysis of the human CDX1 gene indicates a very high degree of homology to the murine gene. CDX1 is a caudal type homeobox gene expressed during gastrulation. In the mouse, expression during gastrulation begins in the primitive streak and subsequently localizes to the ectodermal and mesodermal cells of the primitive streak, neural tube, somites, and limb buds. Later in gastrulation, CDX1 expression becomes most prominent in the mesoderm of the forelimbs, and, to a lesser extent, the hindlimbs. CDX1 is an intriguing candidate gene for diastrophic dysplasia. We are currently screening DNA from affected individuals and hope to shortly determine whether CDX1 is involved in this disorder.

  16. GSMA: Gene Set Matrix Analysis, An Automated Method for Rapid Hypothesis Testing of Gene Expression Data

    Directory of Open Access Journals (Sweden)

    Chris Cheadle

    2007-01-01

    Full Text Available Background: Microarray technology has become highly valuable for identifying complex global changes in gene expression patterns. The assignment of functional information to these complex patterns remains a challenging task in effectively interpreting data and correlating results from across experiments, projects and laboratories. Methods which allow the rapid and robust evaluation of multiple functional hypotheses increase the power of individual researchers to data mine gene expression data more efficiently.Results: We have developed (gene set matrix analysis GSMA as a useful method for the rapid testing of group-wise up- or downregulation of gene expression simultaneously for multiple lists of genes (gene sets against entire distributions of gene expression changes (datasets for single or multiple experiments. The utility of GSMA lies in its flexibility to rapidly poll gene sets related by known biological function or as designated solely by the end-user against large numbers of datasets simultaneously.Conclusions: GSMA provides a simple and straightforward method for hypothesis testing in which genes are tested by groups across multiple datasets for patterns of expression enrichment.

  17. Evaluation of 6 candidate genes on chromosome 11q23 for coeliac disease susceptibility: a case control study

    Directory of Open Access Journals (Sweden)

    Close Eimear

    2010-05-01

    Full Text Available Abstract Background Recent whole genome analysis and follow-up studies have identified many new risk variants for coeliac disease (CD, gluten intolerance. The majority of newly associated regions encode candidate genes with a clear functional role in T-cell regulation. Furthermore, the newly discovered risk loci, together with the well established HLA locus, account for less than 50% of the heritability of CD, suggesting that numerous additional loci remain undiscovered. Linkage studies have identified some well-replicated risk regions, most notably chromosome 5q31 and 11q23. Methods We have evaluated six candidate genes in one of these regions (11q23, namely CD3E, CD3D, CD3G, IL10RA, THY1 and IL18, as risk factors for CD using a 2-phase candidate gene approach directed at chromosome 11q. 377 CD cases and 349 ethnically matched controls were used in the initial screening, followed by an extended sample of 171 additional coeliac cases and 536 additional controls. Results Promotor SNPs (-607, -137 in the IL18 gene, which has shown association with several autoimmune diseases, initially suggested association with CD (P IL18-137/-607 also supported this effect, primarily due to one relatively rare haplotype IL18-607C/-137C (P Conclusion Haplotypes of the IL18 promotor region may contribute to CD risk, consistent with this cytokine's role in maintaining inflammation in active CD.

  18. Annotating gene sets by mining large literature collections with protein networks.

    Science.gov (United States)

    Wang, Sheng; Ma, Jianzhu; Yu, Michael Ku; Zheng, Fan; Huang, Edward W; Han, Jiawei; Peng, Jian; Ideker, Trey

    2018-01-01

    Analysis of patient genomes and transcriptomes routinely recognizes new gene sets associated with human disease. Here we present an integrative natural language processing system which infers common functions for a gene set through automatic mining of the scientific literature with biological networks. This system links genes with associated literature phrases and combines these links with protein interactions in a single heterogeneous network. Multiscale functional annotations are inferred based on network distances between phrases and genes and then visualized as an ontology of biological concepts. To evaluate this system, we predict functions for gene sets representing known pathways and find that our approach achieves substantial improvement over the conventional text-mining baseline method. Moreover, our system discovers novel annotations for gene sets or pathways without previously known functions. Two case studies demonstrate how the system is used in discovery of new cancer-related pathways with ontological annotations.

  19. Phylogenetics and evolution of Trx SET genes in fully sequenced land plants.

    Science.gov (United States)

    Zhu, Xinyu; Chen, Caoyi; Wang, Baohua

    2012-04-01

    Plant Trx SET proteins are involved in H3K4 methylation and play a key role in plant floral development. Genes encoding Trx SET proteins constitute a multigene family in which the copy number varies among plant species and functional divergence appears to have occurred repeatedly. To investigate the evolutionary history of the Trx SET gene family, we made a comprehensive evolutionary analysis on this gene family from 13 major representatives of green plants. A novel clustering (here named as cpTrx clade), which included the III-1, III-2, and III-4 orthologous groups, previously resolved was identified. Our analysis showed that plant Trx proteins possessed a variety of domain organizations and gene structures among paralogs. Additional domains such as PHD, PWWP, and FYR were early integrated into primordial SET-PostSET domain organization of cpTrx clade. We suggested that the PostSET domain was lost in some members of III-4 orthologous group during the evolution of land plants. At least four classes of gene structures had been formed at the early evolutionary stage of land plants. Three intronless orphan Trx SET genes from the Physcomitrella patens (moss) were identified, and supposedly, their parental genes have been eliminated from the genome. The structural differences among evolutionary groups of plant Trx SET genes with different functions were described, contributing to the design of further experimental studies.

  20. Ranking metrics in gene set enrichment analysis: do they matter?

    Science.gov (United States)

    Zyla, Joanna; Marczyk, Michal; Weiner, January; Polanska, Joanna

    2017-05-12

    There exist many methods for describing the complex relation between changes of gene expression in molecular pathways or gene ontologies under different experimental conditions. Among them, Gene Set Enrichment Analysis seems to be one of the most commonly used (over 10,000 citations). An important parameter, which could affect the final result, is the choice of a metric for the ranking of genes. Applying a default ranking metric may lead to poor results. In this work 28 benchmark data sets were used to evaluate the sensitivity and false positive rate of gene set analysis for 16 different ranking metrics including new proposals. Furthermore, the robustness of the chosen methods to sample size was tested. Using k-means clustering algorithm a group of four metrics with the highest performance in terms of overall sensitivity, overall false positive rate and computational load was established i.e. absolute value of Moderated Welch Test statistic, Minimum Significant Difference, absolute value of Signal-To-Noise ratio and Baumgartner-Weiss-Schindler test statistic. In case of false positive rate estimation, all selected ranking metrics were robust with respect to sample size. In case of sensitivity, the absolute value of Moderated Welch Test statistic and absolute value of Signal-To-Noise ratio gave stable results, while Baumgartner-Weiss-Schindler and Minimum Significant Difference showed better results for larger sample size. Finally, the Gene Set Enrichment Analysis method with all tested ranking metrics was parallelised and implemented in MATLAB, and is available at https://github.com/ZAEDPolSl/MrGSEA . Choosing a ranking metric in Gene Set Enrichment Analysis has critical impact on results of pathway enrichment analysis. The absolute value of Moderated Welch Test has the best overall sensitivity and Minimum Significant Difference has the best overall specificity of gene set analysis. When the number of non-normally distributed genes is high, using Baumgartner

  1. Titin is a candidate gene for stroke volume response to endurance training: the HERITAGE Family Study.

    Science.gov (United States)

    Rankinen, Tuomo; Rice, Treva; Boudreau, Anik; Leon, Arthur S; Skinner, James S; Wilmore, Jack H; Rao, D C; Bouchard, Claude

    2003-09-29

    A genome-wide linkage scan for endurance training-induced changes in submaximal exercise stroke volume (DeltaSV50) in the HERITAGE Family Study revealed two chromosomal regions (2q31-q32 and 10p11.2) with at least suggestive evidence of linkage among white families. Here we report a further characterization of the quantitative trait locus (QTL) in chromosome 2q31 and provide evidence that titin (TTN) is likely a candidate gene involved. The original linkage was detected with two markers (D2S335 and D2S1391), and the QTL covered approximately 25 million base pairs (Mb). We added 12 microsatellite markers resulting in an average marker density of one marker per 2.3 Mb. The evidence of linkage increased from P = 0.006 to P = 0.0002 and 0.00002 in the multi- and single-point analyses, respectively. The strongest evidence of linkage was seen with two markers in and near the TTN gene. Transmission/disequilibrium test (TDT) with the same marker set provided evidence for association with one of the TTN markers (D2S385; P = 0.004). TTN is a major contributor to the elasticity of cardiomyocytes and a key regulator of the Frank-Starling mechanism. Since TTN is the largest gene in the human genome, the challenge is to identify the DNA sequence variants contributing to the interindividual differences in cardiac adaptation to endurance training.

  2. Comparative Analysis of Fruit Metabolites and Pungency Candidate Genes Expression between Bhut Jolokia and Other Capsicum Species.

    Directory of Open Access Journals (Sweden)

    Sarpras M

    Full Text Available Bhut jolokia, commonly known as Ghost chili, a native Capsicum species found in North East India was recorded as the naturally occurring hottest chili in the world by the Guinness Book of World Records in 2006. Although few studies have reported variation in pungency content of this particular species, no study till date has reported detailed expression analysis of candidate genes involved in capsaicinoids (pungency biosynthesis pathway and other fruit metabolites. Therefore, the present study was designed to evaluate the diversity of fruit morphology, fruiting habit, capsaicinoids and other metabolite contents in 136 different genotypes mainly collected from North East India. Significant intra and inter-specific variations for fruit morphological traits, fruiting habits and 65 fruit metabolites were observed in the collected Capsicum germplasm belonging to three Capsicum species i.e., Capsicum chinense (Bhut jolokia, 63 accessions, C. frutescens (17 accessions and C. annuum (56 accessions. The pungency level, measured in Scoville Heat Unit (SHU and antioxidant activity measured by 2, 2-diphenyl-1-picrylhydrazyl (DPPH free radical scavenging assay showed maximum levels in C. chinense accessions followed by C. frutescens accessions, while C. annuum accessions showed the lowest value for both the traits. The number of different fruit metabolites detected did not vary significantly among the different species but the metabolite such as benzoic acid hydroxyl esters identified in large percentage in majority of C. annuum genotypes was totally absent in the C. chinense genotypes and sparingly present in few genotypes of C. frutescens. Significant correlations were observed between fruit metabolites capsaicin, dihydrocapsaicin, hexadecanoic acid, cyclopentane, α-tocopherol and antioxidant activity. Furthermore, comparative expression analysis (through qRT-PCR of candidate genes involved in capsaicinoid biosynthesis pathway revealed many fold higher

  3. Comparative Analysis of Fruit Metabolites and Pungency Candidate Genes Expression between Bhut Jolokia and Other Capsicum Species.

    Science.gov (United States)

    M, Sarpras; Gaur, Rashmi; Sharma, Vineet; Chhapekar, Sushil Satish; Das, Jharna; Kumar, Ajay; Yadava, Satish Kumar; Nitin, Mukesh; Brahma, Vijaya; Abraham, Suresh K; Ramchiary, Nirala

    2016-01-01

    Bhut jolokia, commonly known as Ghost chili, a native Capsicum species found in North East India was recorded as the naturally occurring hottest chili in the world by the Guinness Book of World Records in 2006. Although few studies have reported variation in pungency content of this particular species, no study till date has reported detailed expression analysis of candidate genes involved in capsaicinoids (pungency) biosynthesis pathway and other fruit metabolites. Therefore, the present study was designed to evaluate the diversity of fruit morphology, fruiting habit, capsaicinoids and other metabolite contents in 136 different genotypes mainly collected from North East India. Significant intra and inter-specific variations for fruit morphological traits, fruiting habits and 65 fruit metabolites were observed in the collected Capsicum germplasm belonging to three Capsicum species i.e., Capsicum chinense (Bhut jolokia, 63 accessions), C. frutescens (17 accessions) and C. annuum (56 accessions). The pungency level, measured in Scoville Heat Unit (SHU) and antioxidant activity measured by 2, 2-diphenyl-1-picrylhydrazyl (DPPH) free radical scavenging assay showed maximum levels in C. chinense accessions followed by C. frutescens accessions, while C. annuum accessions showed the lowest value for both the traits. The number of different fruit metabolites detected did not vary significantly among the different species but the metabolite such as benzoic acid hydroxyl esters identified in large percentage in majority of C. annuum genotypes was totally absent in the C. chinense genotypes and sparingly present in few genotypes of C. frutescens. Significant correlations were observed between fruit metabolites capsaicin, dihydrocapsaicin, hexadecanoic acid, cyclopentane, α-tocopherol and antioxidant activity. Furthermore, comparative expression analysis (through qRT-PCR) of candidate genes involved in capsaicinoid biosynthesis pathway revealed many fold higher expression of

  4. Gene expression profiling reveals candidate genes related to residual feed intake in duodenum of laying ducks.

    Science.gov (United States)

    Zeng, T; Huang, L; Ren, J; Chen, L; Tian, Y; Huang, Y; Zhang, H; Du, J; Lu, L

    2017-12-01

    Feed represents two-thirds of the total costs of poultry production, especially in developing countries. Improvement in feed efficiency would reduce the amount of feed required for production (growth or laying), the production cost, and the amount of nitrogenous waste. The most commonly used measures for feed efficiency are feed conversion ratio (FCR) and residual feed intake (RFI). As a more suitable indicator assessing feed efficiency, RFI is defined as the difference between observed and expected feed intake based on maintenance and growth or laying. However, the genetic and biological mechanisms regulating RFI are largely unknown. Identifying molecular mechanisms explaining divergence in RFI in laying ducks would lead to the development of early detection methods for the selection of more efficient breeding poultry. The objective of this study was to identify duodenum genes and pathways through transcriptional profiling in 2 extreme RFI phenotypes (HRFI and LRFI) of the duck population. Phenotypic aspects of feed efficiency showed that RFI was strongly positive with FCR and feed intake (FI). Transcriptomic analysis identified 35 differentially expressed genes between LRFI and HRFI ducks. These genes play an important role in metabolism, digestibility, secretion, and innate immunity including (), (), (), β (), and (). These results improve our knowledge of the biological basis underlying RFI, which would be useful for further investigations of key candidate genes for RFI and for the development of biomarkers.

  5. Clinically relevant known and candidate genes for obesity and their overlap with human infertility and reproduction.

    Science.gov (United States)

    Butler, Merlin G; McGuire, Austen; Manzardo, Ann M

    2015-04-01

    Obesity is a growing public health concern now reaching epidemic status worldwide for children and adults due to multiple problems impacting on energy intake and expenditure with influences on human reproduction and infertility. A positive family history and genetic factors are known to play a role in obesity by influencing eating behavior, weight and level of physical activity and also contributing to human reproduction and infertility. Recent advances in genetic technology have led to discoveries of new susceptibility genes for obesity and causation of infertility. The goal of our study was to provide an update of clinically relevant candidate and known genes for obesity and infertility using high resolution chromosome ideograms with gene symbols and tabular form. We used computer-based internet websites including PubMed to search for combinations of key words such as obesity, body mass index, infertility, reproduction, azoospermia, endometriosis, diminished ovarian reserve, estrogen along with genetics, gene mutations or variants to identify evidence for development of a master list of recognized obesity genes in humans and those involved with infertility and reproduction. Gene symbols for known and candidate genes for obesity were plotted on high resolution chromosome ideograms at the 850 band level. Both infertility and obesity genes were listed separately in alphabetical order in tabular form and those highlighted when involved with both conditions. By searching the medical literature and computer generated websites for key words, we found documented evidence for 370 genes playing a role in obesity and 153 genes for human reproduction or infertility. The obesity genes primarily affected common pathways in lipid metabolism, deposition or transport, eating behavior and food selection, physical activity or energy expenditure. Twenty-one of the obesity genes were also associated with human infertility and reproduction. Gene symbols were plotted on high resolution

  6. Candidate genes involved in the biosynthesis of triterpenoid saponins in Platycodon grandiflorum identified by transcriptome analysis

    Directory of Open Access Journals (Sweden)

    Chunhua eMa

    2016-05-01

    Full Text Available Background: Platycodon grandiflorum is the only species in the genus Platycodon of the family Campanulaceae, which has been traditionally used as a medicinal plant for its lung-heat-clearing, antitussive, and expectorant properties in China, Japanese and Korean. Oleanane-type triterpenoid saponins were the main chemical components of P. grandiflorum and platycodin D was the abundant and main bioactive component, but little is known about their biosynthesis in plants. Hence, P. grandiflorum is an ideal medicinal plant for studying the biosynthesis of Oleanane-type saponins. In addition, the genomic information of this important herbal plant is unavailable.Principal Findings:A total of 58,580,566 clean reads were obtained, which were assembled into 34,053 unigenes, with an average length of 936 bp and N50 of 1,661 bp by analyzing the transcriptome data of P. grandiflorum. Among these 34,053 unigenes, 22,409 unigenes (65.80% were annotated based on the information available from public databases, including Nr, NCBI, Swiss-Prot, KOG and KEGG. Furthermore, 21 candidate cytochrome P450 genes and 17 candidate UDP-glycosyltransferase genes most likely involved in triterpenoid saponins biosynthesis pathway were discovered from the transcriptome sequencing of P. grandiflorum. In addition, 10,626 SSRs were identified based on the transcriptome data, which would provide abundant candidates of molecular markers for genetic diversity and genetic map for this medicinal plant.Conclusion:The genomic data obtained from P. grandiflorum, especially the identification of putative genes involved in triterpenoid saponins biosynthesis pathway, will facilitate our understanding of the biosynthesis of triterpenoid saponins at molecular level.

  7. [Identification of candidate genes and expression profiles, as doping biomarkers].

    Science.gov (United States)

    Paparini, A; Impagnatiello, F; Pistilli, A; Rinaldi, M; Gianfranceschi, G; Signori, E; Stabile, A M; Fazio, V; Rende, M; Romano Spica, V

    2007-01-01

    Administration of prohibited substances to enhance athletic performance represents an emerging medical, social, ethical and legal issue. Traditional controls are based on direct detection of substances or their catabolites. However out-of-competition doping may not be easily revealed by standard analytical methods. Alternative indirect control strategies are based on the evaluation of mid- and long-term effects of doping in tissues. Drug-induced long-lasting changes of gene expression may be taken as effective indicators of doping exposure. To validate this approach, we used real-time PCR to monitor the expression pattern of selected genes in human haematopoietic cells exposed to nandrolone, insulin-like growth factor I (IGF-I) or growth hormone (GH). Some candidate genes were found significantly and consistently modulated by treatments. Nandrolone up-regulated AR, ESR2 and PGR in K562 cells, and SRD5A1, PPARA and JAK2 in Jurkat cells; IGF-I up-regulated EPOR and PGR in HL60 cells, and SRD5A1 in Jurkat; GH up-regulated SRD5A1 and GHR in K562. GATA1 expression was down-regulated in IGF-1-treated HL60, ESR2 was down-regulated in nandrolone-treated Jurkat, and AR and PGR were down-regulated in GH-treated Jurkat. This pilot study shows the potential of molecular biology-based strategies in anti-doping controls.

  8. Investigating the effect of paralogs on microarray gene-set analysis

    LENUS (Irish Health Repository)

    Faure, Andre J

    2011-01-24

    Abstract Background In order to interpret the results obtained from a microarray experiment, researchers often shift focus from analysis of individual differentially expressed genes to analyses of sets of genes. These gene-set analysis (GSA) methods use previously accumulated biological knowledge to group genes into sets and then aim to rank these gene sets in a way that reflects their relative importance in the experimental situation in question. We suspect that the presence of paralogs affects the ability of GSA methods to accurately identify the most important sets of genes for subsequent research. Results We show that paralogs, which typically have high sequence identity and similar molecular functions, also exhibit high correlation in their expression patterns. We investigate this correlation as a potential confounding factor common to current GSA methods using Indygene http:\\/\\/www.cbio.uct.ac.za\\/indygene, a web tool that reduces a supplied list of genes so that it includes no pairwise paralogy relationships above a specified sequence similarity threshold. We use the tool to reanalyse previously published microarray datasets and determine the potential utility of accounting for the presence of paralogs. Conclusions The Indygene tool efficiently removes paralogy relationships from a given dataset and we found that such a reduction, performed prior to GSA, has the ability to generate significantly different results that often represent novel and plausible biological hypotheses. This was demonstrated for three different GSA approaches when applied to the reanalysis of previously published microarray datasets and suggests that the redundancy and non-independence of paralogs is an important consideration when dealing with GSA methodologies.

  9. Rrp1b, a new candidate susceptibility gene for breast cancer progression and metastasis.

    Directory of Open Access Journals (Sweden)

    Nigel P S Crawford

    2007-11-01

    Full Text Available A novel candidate metastasis modifier, ribosomal RNA processing 1 homolog B (Rrp1b, was identified through two independent approaches. First, yeast two-hybrid, immunoprecipitation, and functional assays demonstrated a physical and functional interaction between Rrp1b and the previous identified metastasis modifier Sipa1. In parallel, using mouse and human metastasis gene expression data it was observed that extracellular matrix (ECM genes are common components of metastasis predictive signatures, suggesting that ECM genes are either important markers or causal factors in metastasis. To investigate the relationship between ECM genes and poor prognosis in breast cancer, expression quantitative trait locus analysis of polyoma middle-T transgene-induced mammary tumor was performed. ECM gene expression was found to be consistently associated with Rrp1b expression. In vitro expression of Rrp1b significantly altered ECM gene expression, tumor growth, and dissemination in metastasis assays. Furthermore, a gene signature induced by ectopic expression of Rrp1b in tumor cells predicted survival in a human breast cancer gene expression dataset. Finally, constitutional polymorphism within RRP1B was found to be significantly associated with tumor progression in two independent breast cancer cohorts. These data suggest that RRP1B may be a novel susceptibility gene for breast cancer progression and metastasis.

  10. Tensor decomposition-based unsupervised feature extraction identifies candidate genes that induce post-traumatic stress disorder-mediated heart diseases.

    Science.gov (United States)

    Taguchi, Y-H

    2017-12-21

    Although post-traumatic stress disorder (PTSD) is primarily a mental disorder, it can cause additional symptoms that do not seem to be directly related to the central nervous system, which PTSD is assumed to directly affect. PTSD-mediated heart diseases are some of such secondary disorders. In spite of the significant correlations between PTSD and heart diseases, spatial separation between the heart and brain (where PTSD is primarily active) prevents researchers from elucidating the mechanisms that bridge the two disorders. Our purpose was to identify genes linking PTSD and heart diseases. In this study, gene expression profiles of various murine tissues observed under various types of stress or without stress were analyzed in an integrated manner using tensor decomposition (TD). Based upon the obtained features, ∼ 400 genes were identified as candidate genes that may mediate heart diseases associated with PTSD. Various gene enrichment analyses supported biological reliability of the identified genes. Ten genes encoding protein-, DNA-, or mRNA-interacting proteins-ILF2, ILF3, ESR1, ESR2, RAD21, HTT, ATF2, NR3C1, TP53, and TP63-were found to be likely to regulate expression of most of these ∼ 400 genes and therefore are candidate primary genes that cause PTSD-mediated heart diseases. Approximately 400 genes in the heart were also found to be strongly affected by various drugs whose known adverse effects are related to heart diseases and/or fear memory conditioning; these data support the reliability of our findings. TD-based unsupervised feature extraction turned out to be a useful method for gene selection and successfully identified possible genes causing PTSD-mediated heart diseases.

  11. FunGeneNet: a web tool to estimate enrichment of functional interactions in experimental gene sets.

    Science.gov (United States)

    Tiys, Evgeny S; Ivanisenko, Timofey V; Demenkov, Pavel S; Ivanisenko, Vladimir A

    2018-02-09

    Estimation of functional connectivity in gene sets derived from genome-wide or other biological experiments is one of the essential tasks of bioinformatics. A promising approach for solving this problem is to compare gene networks built using experimental gene sets with random networks. One of the resources that make such an analysis possible is CrossTalkZ, which uses the FunCoup database. However, existing methods, including CrossTalkZ, do not take into account individual types of interactions, such as protein/protein interactions, expression regulation, transport regulation, catalytic reactions, etc., but rather work with generalized types characterizing the existence of any connection between network members. We developed the online tool FunGeneNet, which utilizes the ANDSystem and STRING to reconstruct gene networks using experimental gene sets and to estimate their difference from random networks. To compare the reconstructed networks with random ones, the node permutation algorithm implemented in CrossTalkZ was taken as a basis. To study the FunGeneNet applicability, the functional connectivity analysis of networks constructed for gene sets involved in the Gene Ontology biological processes was conducted. We showed that the method sensitivity exceeds 0.8 at a specificity of 0.95. We found that the significance level of the difference between gene networks of biological processes and random networks is determined by the type of connections considered between objects. At the same time, the highest reliability is achieved for the generalized form of connections that takes into account all the individual types of connections. By taking examples of the thyroid cancer networks and the apoptosis network, it is demonstrated that key participants in these processes are involved in the interactions of those types by which these networks differ from random ones. FunGeneNet is a web tool aimed at proving the functionality of networks in a wide range of sizes of

  12. Quantitative trait loci affecting the 3D skull shape and size in mouse and prioritization of candidate genes in-silico

    Science.gov (United States)

    Maga, A. Murat; Navarro, Nicolas; Cunningham, Michael L.; Cox, Timothy C.

    2015-01-01

    We describe the first application of high-resolution 3D micro-computed tomography, together with 3D landmarks and geometric morphometrics, to map QTL responsible for variation in skull shape and size using a backcross between C57BL/6J and A/J inbred strains. Using 433 animals, 53 3D landmarks, and 882 SNPs from autosomes, we identified seven QTL responsible for the skull size (SCS.qtl) and 30 QTL responsible for the skull shape (SSH.qtl). Size, sex, and direction-of-cross were all significant factors and included in the analysis as covariates. All autosomes harbored at least one SSH.qtl, sometimes up to three. Effect sizes of SSH.qtl appeared to be small, rarely exceeding 1% of the overall shape variation. However, they account for significant amount of variation in some specific directions of the shape space. Many QTL have stronger effect on the neurocranium than expected from a random vector that will parcellate uniformly across the four cranial regions. On the contrary, most of QTL have an effect on the palate weaker than expected. Combined interval length of 30 SSH.qtl was about 315 MB and contained 2476 known protein coding genes. We used a bioinformatics approach to filter these candidate genes and identified 16 high-priority candidates that are likely to play a role in the craniofacial development and disorders. Thus, coupling the QTL mapping approach in model organisms with candidate gene enrichment approaches appears to be a feasible way to identify high-priority candidates genes related to the structure or tissue of interest. PMID:25859222

  13. Resequencing three candidate genes discovers seven potentially deleterious variants susceptibility to major depressive disorder and suicide attempts in Chinese.

    Science.gov (United States)

    Rao, Shitao; Leung, Cherry She Ting; Lam, Macro Hb; Wing, Yun Kwok; Waye, Mary Miu Yee; Tsui, Stephen Kwok Wing

    2017-03-01

    To date almost 200 genes were found to be associated with major depressive disorder (MDD) or suicide attempts (SA), but very few genes were reported for their molecular mechanisms. This study aimed to find out whether there were common or rare variants in three candidate genes altering the risk for MDD and SA in Chinese. Three candidate genes (HOMER1, SLC6A4 and TEF) were chosen for resequencing analysis and association studies as they were reported to be involved in the etiology of MDD and SA. Following that, bioinformatics analyses were applied on those variants of interest. After resequencing analysis and alignment for the amplicons, a total of 34 common or rare variants were found in the randomly selected 36 Hong Kong Chinese patients with both MDD and SA. Among those, seven variants show potentially deleterious features. Rs60029191 and a rare variant located in regulatory region of the HOMER1 gene may affect the promoter activities through interacting with predicted transcription factors. Two missense mutations existed in the SLC6A4 coding regions were firstly reported in Hong Kong Chinese MDD and SA patients, and both of them could affect the transport efficiency of SLC6A4 to serotonin. Moreover, a common variant rs6354 located in the untranslated region of this gene may affect the expression level or exonic splicing of serotonin transporter. In addition, both of a most studied polymorphism rs738499 and a low-frequency variant in the promoter region of the TEF gene were found to be located in potential transcription factor binding sites, which may let the two variants be able to influence the promoter activities of the gene. This study elucidated the potentially molecular mechanisms of the three candidate genes altering the risk for MDD and SA. These findings implied that not only common variants but rare variants could make contributions to the genetic susceptibility to MDD and SA in Chinese. Copyright © 2016 Elsevier B.V. All rights reserved.

  14. Quantitative transcription dynamic analysis reveals candidate genes and key regulators for ethanol tolerance in Saccharomyces cerevisiae

    Directory of Open Access Journals (Sweden)

    Ma Menggen

    2010-06-01

    Full Text Available Abstract Background Derived from our lignocellulosic conversion inhibitor-tolerant yeast, we generated an ethanol-tolerant strain Saccharomyces cerevisiae NRRL Y-50316 by enforced evolutionary adaptation. Using a newly developed robust mRNA reference and a master equation unifying gene expression data analyses, we investigated comparative quantitative transcription dynamics of 175 genes selected from previous studies for an ethanol-tolerant yeast and its closely related parental strain. Results A highly fitted master equation was established and applied for quantitative gene expression analyses using pathway-based qRT-PCR array assays. The ethanol-tolerant Y-50316 displayed significantly enriched background of mRNA abundance for at least 35 genes without ethanol challenge compared with its parental strain Y-50049. Under the ethanol challenge, the tolerant Y-50316 responded in consistent expressions over time for numerous genes belonging to groups of heat shock proteins, trehalose metabolism, glycolysis, pentose phosphate pathway, fatty acid metabolism, amino acid biosynthesis, pleiotropic drug resistance gene family and transcription factors. The parental strain showed repressed expressions for many genes and was unable to withstand the ethanol stress and establish a viable culture and fermentation. The distinct expression dynamics between the two strains and their close association with cell growth, viability and ethanol fermentation profiles distinguished the tolerance-response from the stress-response in yeast under the ethanol challenge. At least 82 genes were identified as candidate and key genes for ethanol-tolerance and subsequent fermentation under the stress. Among which, 36 genes were newly recognized by the present study. Most of the ethanol-tolerance candidate genes were found to share protein binding motifs of transcription factors Msn4p/Msn2p, Yap1p, Hsf1p and Pdr1p/Pdr3p. Conclusion Enriched background of transcription abundance

  15. Identification and Evolutionary Analysis of Potential Candidate Genes in a Human Eating Disorder

    Directory of Open Access Journals (Sweden)

    Ubadah Sabbagh

    2016-01-01

    Full Text Available The purpose of this study was to find genes linked with eating disorders and associated with both metabolic and neural systems. Our operating hypothesis was that there are genetic factors underlying some eating disorders resting in both those pathways. Specifically, we are interested in disorders that may rest in both sleep and metabolic function, generally called Night Eating Syndrome (NES. A meta-analysis of the Gene Expression Omnibus targeting the mammalian nervous system, sleep, and obesity studies was performed, yielding numerous genes of interest. Through a text-based analysis of the results, a number of potential candidate genes were identified. VGF, in particular, appeared to be relevant both to obesity and, broadly, to brain or neural development. VGF is a highly connected protein that interacts with numerous targets via proteolytically digested peptides. We examined VGF from an evolutionary perspective to determine whether other available evidence supported a role for the gene in human disease. We conclude that some of the already identified variants in VGF from human polymorphism studies may contribute to eating disorders and obesity. Our data suggest that there is enough evidence to warrant eGWAS and GWAS analysis of these genes in NES patients in a case-control study.

  16. Identification and Evolutionary Analysis of Potential Candidate Genes in a Human Eating Disorder.

    Science.gov (United States)

    Sabbagh, Ubadah; Mullegama, Saman; Wyckoff, Gerald J

    2016-01-01

    The purpose of this study was to find genes linked with eating disorders and associated with both metabolic and neural systems. Our operating hypothesis was that there are genetic factors underlying some eating disorders resting in both those pathways. Specifically, we are interested in disorders that may rest in both sleep and metabolic function, generally called Night Eating Syndrome (NES). A meta-analysis of the Gene Expression Omnibus targeting the mammalian nervous system, sleep, and obesity studies was performed, yielding numerous genes of interest. Through a text-based analysis of the results, a number of potential candidate genes were identified. VGF, in particular, appeared to be relevant both to obesity and, broadly, to brain or neural development. VGF is a highly connected protein that interacts with numerous targets via proteolytically digested peptides. We examined VGF from an evolutionary perspective to determine whether other available evidence supported a role for the gene in human disease. We conclude that some of the already identified variants in VGF from human polymorphism studies may contribute to eating disorders and obesity. Our data suggest that there is enough evidence to warrant eGWAS and GWAS analysis of these genes in NES patients in a case-control study.

  17. "Contrasting patterns of selection at Pinus pinaster Ait. Drought stress candidate genes as revealed by genetic differentiation analyses".

    Science.gov (United States)

    Eveno, Emmanuelle; Collada, Carmen; Guevara, M Angeles; Léger, Valérie; Soto, Alvaro; Díaz, Luis; Léger, Patrick; González-Martínez, Santiago C; Cervera, M Teresa; Plomion, Christophe; Garnier-Géré, Pauline H

    2008-02-01

    The importance of natural selection for shaping adaptive trait differentiation among natural populations of allogamous tree species has long been recognized. Determining the molecular basis of local adaptation remains largely unresolved, and the respective roles of selection and demography in shaping population structure are actively debated. Using a multilocus scan that aims to detect outliers from simulated neutral expectations, we analyzed patterns of nucleotide diversity and genetic differentiation at 11 polymorphic candidate genes for drought stress tolerance in phenotypically contrasted Pinus pinaster Ait. populations across its geographical range. We compared 3 coalescent-based methods: 2 frequentist-like, including 1 approach specifically developed for biallelic single nucleotide polymorphisms (SNPs) here and 1 Bayesian. Five genes showed outlier patterns that were robust across methods at the haplotype level for 2 of them. Two genes presented higher F(ST) values than expected (PR-AGP4 and erd3), suggesting that they could have been affected by the action of diversifying selection among populations. In contrast, 3 genes presented lower F(ST) values than expected (dhn-1, dhn2, and lp3-1), which could represent signatures of homogenizing selection among populations. A smaller proportion of outliers were detected at the SNP level suggesting the potential functional significance of particular combinations of sites in drought-response candidate genes. The Bayesian method appeared robust to low sample sizes, flexible to assumptions regarding migration rates, and powerful for detecting selection at the haplotype level, but the frequentist-like method adapted to SNPs was more efficient for the identification of outlier SNPs showing low differentiation. Population-specific effects estimated in the Bayesian method also revealed populations with lower immigration rates, which could have led to favorable situations for local adaptation. Outlier patterns are discussed

  18. A cohort of balanced reciprocal translocations associated with dyslexia: identification of two putative candidate genes at DYX1

    DEFF Research Database (Denmark)

    Buonincontri, Roberta; Bache, Iben; Silahtaroglu, Asli

    2011-01-01

    Dyslexia is one of the most common neurodevelopmental disorders where likely many genes are involved in the pathogenesis. So far six candidate dyslexia genes have been proposed, and two of these were identified by rare chromosomal translocations in affected individuals. By systematic re......-examination of all translocation carriers in Denmark, we have identified 16 different translocations associated with dyslexia. In four families, where the translocation co-segregated with the phenotype, one of the breakpoints concurred (at the cytogenetic level) with either a known dyslexia linkage region--at 15q21...... (DYX1), 2p13 (DYX3) and 1p36 (DYX8)--or an unpublished linkage region at 19q13. As a first exploitation of this unique cohort, we identify three novel candidate dyslexia genes, ZNF280D and TCF12 at 15q21, and PDE7B at 6q23.3, by molecular mapping of the familial translocation with the 15q21 breakpoint....

  19. Principal Angle Enrichment Analysis (PAEA): Dimensionally Reduced Multivariate Gene Set Enrichment Analysis Tool.

    Science.gov (United States)

    Clark, Neil R; Szymkiewicz, Maciej; Wang, Zichen; Monteiro, Caroline D; Jones, Matthew R; Ma'ayan, Avi

    2015-11-01

    Gene set analysis of differential expression, which identifies collectively differentially expressed gene sets, has become an important tool for biology. The power of this approach lies in its reduction of the dimensionality of the statistical problem and its incorporation of biological interpretation by construction. Many approaches to gene set analysis have been proposed, but benchmarking their performance in the setting of real biological data is difficult due to the lack of a gold standard. In a previously published work we proposed a geometrical approach to differential expression which performed highly in benchmarking tests and compared well to the most popular methods of differential gene expression. As reported, this approach has a natural extension to gene set analysis which we call Principal Angle Enrichment Analysis (PAEA). PAEA employs dimensionality reduction and a multivariate approach for gene set enrichment analysis. However, the performance of this method has not been assessed nor its implementation as a web-based tool. Here we describe new benchmarking protocols for gene set analysis methods and find that PAEA performs highly. The PAEA method is implemented as a user-friendly web-based tool, which contains 70 gene set libraries and is freely available to the community.

  20. Differential SPL gene expression patterns reveal candidate genes underlying flowering time and architectural differences in Mimulus and Arabidopsis.

    Science.gov (United States)

    Jorgensen, Stacy A; Preston, Jill C

    2014-04-01

    Evolutionary transitions in growth habit and flowering time responses to variable environmental signals have occurred multiple times independently across angiosperms and have major impacts on plant fitness. Proteins in the SPL family of transcription factors collectively regulate flowering time genes that have been implicated in interspecific shifts in annuality/perenniality. However, their potential importance in the evolution of angiosperm growth habit has not been extensively investigated. Here we identify orthologs representative of the major SPL gene clades in annual Arabidopsis thaliana and Mimulus guttatus IM767, and perennial A. lyrata and M. guttatus PR, and characterize their expression. Spatio-temporal expression patterns are complex across both diverse tissues of the same taxa and comparable tissues of different taxa, consistent with genic sub- or neo-functionalization. However, our data are consistent with a general role for several SPL genes in the promotion of juvenile to adult phase change and/or flowering time in Mimulus and Arabidopsis. Furthermore, several candidate genes were identified for future study whose differential expression correlates with growth habit and architectural variation in annual versus perennial taxa. Copyright © 2014 Elsevier Inc. All rights reserved.

  1. Identification of single nucleotide polymorphisms (SNPs at candidate genes involved in abiotic stress in two Prosopis species of hybrids

    Directory of Open Access Journals (Sweden)

    Maria F. Pomponio

    2014-12-01

    Full Text Available Aim of the study: Identify and compare SNPs on candidate genes related to abiotic stress in Prosopis chilensis, Prosopis flexuosa and interspecific hybridsArea of the study: Chaco árido, Argentina. Material and Methods: Fragments from 6 candidate genes were sequenced in 60 genotypes. DNA polymorphisms were analyzed.Main Results: The analysis revealed that the hybrids had the highest rate of polymorphism, followed by P. flexuosa and P. chilensis, the values found are comparable to other forest tree species.Research highlights: This approach will help to study genetic diversity variation on natural populations for assessing the effects of environmental changes.Keywords: SNPs; abiotic stress; interspecific variation; molecular markers. 

  2. Genome-Wide Association Studies Identify Candidate Genes for Coat Color and Mohair Traits in the Iranian Markhoz Goat.

    Science.gov (United States)

    Nazari-Ghadikolaei, Anahit; Mehrabani-Yeganeh, Hassan; Miarei-Aashtiani, Seyed R; Staiger, Elizabeth A; Rashidi, Amir; Huson, Heather J

    2018-01-01

    The Markhoz goat provides an opportunity to study the genetics underlying coat color and mohair traits of an Angora type goat using genome-wide association studies (GWAS). This indigenous Iranian breed is valued for its quality mohair used in ceremonial garments and has the distinction of exhibiting an array of coat colors including black, brown, and white. Here, we performed 16 GWAS for different fleece (mohair) traits and coat color in 228 Markhoz goats sampled from the Markhoz Goat Research Station in Sanandaj, Kurdistan province, located in western Iran using the Illumina Caprine 50K beadchip. The Efficient Mixed Model Linear analysis was used to identify genomic regions with potential candidate genes contributing to coat color and mohair characteristics while correcting for population structure. Significant associations to coat color were found within or near the ASIP, ITCH, AHCY , and RALY genes on chromosome 13 for black and brown coat color and the KIT and PDGFRA genes on chromosome 6 for white coat color. Individual mohair traits were analyzed for genetic association along with principal components that allowed for a broader perspective of combined traits reflecting overall mohair quality and volume. A multitude of markers demonstrated significant association to mohair traits highlighting potential candidate genes of POU1F1 on chromosome 1 for mohair quality, MREG on chromosome 2 for mohair volume, DUOX1 on chromosome 10 for yearling fleece weight, and ADGRV1 on chromosome 7 for grease percentage. Variation in allele frequencies and haplotypes were identified for coat color and differentiated common markers associated with both brown and black coat color. This demonstrates the potential for genetic markers to be used in future breeding programs to improve selection for coat color and mohair traits. Putative candidate genes, both novel and previously identified in other species or breeds, require further investigation to confirm phenotypic causality and

  3. Integration of liver gene co-expression networks and eGWAs analyses highlighted candidate regulators implicated in lipid metabolism in pigs.

    Science.gov (United States)

    Ballester, Maria; Ramayo-Caldas, Yuliaxis; Revilla, Manuel; Corominas, Jordi; Castelló, Anna; Estellé, Jordi; Fernández, Ana I; Folch, Josep M

    2017-04-19

    In the present study, liver co-expression networks and expression Genome Wide Association Study (eGWAS) were performed to identify DNA variants and molecular pathways implicated in the functional regulatory mechanisms of meat quality traits in pigs. With this purpose, the liver mRNA expression of 44 candidates genes related with lipid metabolism was analysed in 111 Iberian x Landrace backcross animals. The eGWAS identified 92 eSNPs located in seven chromosomal regions and associated with eight genes: CROT, CYP2U1, DGAT1, EGF, FABP1, FABP5, PLA2G12A, and PPARA. Remarkably, cis-eSNPs associated with FABP1 gene expression which may be determining the C18:2(n-6)/C18:3(n-3) ratio in backfat through the multiple interaction of DNA variants and genes were identified. Furthermore, a hotspot on SSC8 associated with the gene expression of eight genes was identified and the TBCK gene was pointed out as candidate gene regulating it. Our results also suggested that the PI3K-Akt-mTOR pathway plays an important role in the control of the analysed genes highlighting nuclear receptors as the NR3C1 or PPARA. Finally, sex-dimorphism associated with hepatic lipid metabolism was identified with over-representation of female-biased genes. These results increase our knowledge of the genetic architecture underlying fat composition traits.

  4. Deep sequencing analysis of the transcriptomes of peanut aerial and subterranean young pods identifies candidate genes related to early embryo abortion.

    Science.gov (United States)

    Chen, Xiaoping; Zhu, Wei; Azam, Sarwar; Li, Heying; Zhu, Fanghe; Li, Haifen; Hong, Yanbin; Liu, Haiyan; Zhang, Erhua; Wu, Hong; Yu, Shanlin; Zhou, Guiyuan; Li, Shaoxiong; Zhong, Ni; Wen, Shijie; Li, Xingyu; Knapp, Steve J; Ozias-Akins, Peggy; Varshney, Rajeev K; Liang, Xuanqiang

    2013-01-01

    The failure of peg penetration into the soil leads to seed abortion in peanut. Knowledge of genes involved in these processes is comparatively deficient. Here, we used RNA-seq to gain insights into transcriptomes of aerial and subterranean pods. More than 2 million transcript reads with an average length of 396 bp were generated from one aerial (AP) and two subterranean (SP1 and SP2) pod libraries using pyrosequencing technology. After assembly, sets of 49 632, 49 952 and 50 494 from a total of 74 974 transcript assembly contigs (TACs) were identified in AP, SP1 and SP2, respectively. A clear linear relationship in the gene expression level was observed between these data sets. In brief, 2194 differentially expressed TACs with a 99.0% true-positive rate were identified, among which 859 and 1068 TACs were up-regulated in aerial and subterranean pods, respectively. Functional analysis showed that putative function based on similarity with proteins catalogued in UniProt and gene ontology term classification could be determined for 59 342 (79.2%) and 42 955 (57.3%) TACs, respectively. A total of 2968 TACs were mapped to 174 KEGG pathways, of which 168 were shared by aerial and subterranean transcriptomes. TACs involved in photosynthesis were significantly up-regulated and enriched in the aerial pod. In addition, two senescence-associated genes were identified as significantly up-regulated in the aerial pod, which potentially contribute to embryo abortion in aerial pods, and in turn, to cessation of swelling. The data set generated in this study provides evidence for some functional genes as robust candidates underlying aerial and subterranean pod development and contributes to an elucidation of the evolutionary implications resulting from fruit development under light and dark conditions. © 2012 The Authors Plant Biotechnology Journal © 2012 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd.

  5. Carotenoid content and root color of cultivated carrot: a candidate-gene association study using an original broad unstructured population.

    Directory of Open Access Journals (Sweden)

    Matthieu Jourdan

    Full Text Available Accumulated in large amounts in carrot, carotenoids are an important product quality attribute and therefore a major breeding trait. However, the knowledge of carotenoid accumulation genetic control in this root vegetable is still limited. In order to identify the genetic variants linked to this character, we performed an association mapping study with a candidate gene approach. We developed an original unstructured population with a broad genetic basis to avoid the pitfall of false positive detection due to population stratification. We genotyped 109 SNPs located in 17 candidate genes – mostly carotenoid biosynthesis genes – on 380 individuals, and tested the association with carotenoid contents and color components. Total carotenoids and β-carotene contents were significantly associated with genes zeaxanthin epoxydase (ZEP, phytoene desaturase (PDS and carotenoid isomerase (CRTISO while α-carotene was associated with CRTISO and plastid terminal oxidase (PTOX genes. Color components were associated most significantly with ZEP. Our results suggest the involvement of the couple PDS/PTOX and ZEP in carotenoid accumulation, as the result of the metabolic and catabolic activities respectively. This study brings new insights in the understanding of the carotenoid pathway in non-photosynthetic organs.

  6. Candidate genes for chronic obstructive pulmonary disease in two large data sets

    DEFF Research Database (Denmark)

    Bakke, P S; Zhu, G; Gulsvik, A

    2011-01-01

    Lack of reproducibility of findings has been a criticism of genetic association studies in complex diseases like chronic obstructive pulmonary disease (COPD). We selected 257 polymorphisms of 16 genes with reported or potential relationshipsto COPD and genotyped these variants in a case......-control study which included 953 COPD cases and 956 control subjects. We explored the association of these polymorphisms to three COPD phenotypes: a COPD binary phenotype and two quantitative traits (post bronchodilator FEV1 in percent predicted and FEV1/FVC). The polymorphisms significantly associated...... to these phenotypes in this first study were tested in a second, family based, study that included 635 pedigrees with 1910 individuals. Significant associations to the binary COPD phenotype in both populations were seen for STAT1 (rs13010343) and NFKBIB/SIRT2 (rs2241704) (p

  7. Transcriptomic identification of candidate genes involved in sunflower responses to chilling and salt stresses based on cDNA microarray analysis

    Directory of Open Access Journals (Sweden)

    Paniego Norma

    2008-01-01

    Full Text Available Abstract Background Considering that sunflower production is expanding to arid regions, tolerance to abiotic stresses as drought, low temperatures and salinity arises as one of the main constrains nowadays. Differential organ-specific sunflower ESTs (expressed sequence tags were previously generated by a subtractive hybridization method that included a considerable number of putative abiotic stress associated sequences. The objective of this work is to analyze concerted gene expression profiles of organ-specific ESTs by fluorescence microarray assay, in response to high sodium chloride concentration and chilling treatments with the aim to identify and follow up candidate genes for early responses to abiotic stress in sunflower. Results Abiotic-related expressed genes were the target of this characterization through a gene expression analysis using an organ-specific cDNA fluorescence microarray approach in response to high salinity and low temperatures. The experiment included three independent replicates from leaf samples. We analyzed 317 unigenes previously isolated from differential organ-specific cDNA libraries from leaf, stem and flower at R1 and R4 developmental stage. A statistical analysis based on mean comparison by ANOVA and ordination by Principal Component Analysis allowed the detection of 80 candidate genes for either salinity and/or chilling stresses. Out of them, 50 genes were up or down regulated under both stresses, supporting common regulatory mechanisms and general responses to chilling and salinity. Interestingly 15 and 12 sequences were up regulated or down regulated specifically in one stress but not in the other, respectively. These genes are potentially involved in different regulatory mechanisms including transcription/translation/protein degradation/protein folding/ROS production or ROS-scavenging. Differential gene expression patterns were confirmed by qRT-PCR for 12.5% of the microarray candidate sequences. Conclusion

  8. Loci and candidate genes conferring resistance to soybean cyst nematode HG type 2.5.7.

    Science.gov (United States)

    Zhao, Xue; Teng, Weili; Li, Yinghui; Liu, Dongyuan; Cao, Guanglu; Li, Dongmei; Qiu, Lijuan; Zheng, Hongkun; Han, Yingpeng; Li, Wenbin

    2017-06-14

    Soybean (Glycine max L. Merr.) cyst nematode (SCN, Heterodera glycines I,) is a major pest of soybean worldwide. The most effective strategy to control this pest involves the use of resistant cultivars. The aim of the present study was to investigate the genome-wide genetic architecture of resistance to SCN HG Type 2.5.7 (race 1) in landrace and elite cultivated soybeans. A total of 200 diverse soybean accessions were screened for resistance to SCN HG Type 2.5.7 and genotyped through sequencing using the Specific Locus Amplified Fragment Sequencing (SLAF-seq) approach with a 6.14-fold average sequencing depth. A total of 33,194 SNPs were identified with minor allele frequencies (MAF) over 4%, covering 97% of all the genotypes. Genome-wide association mapping (GWAS) revealed thirteen SNPs associated with resistance to SCN HG Type 2.5.7. These SNPs were distributed on five chromosomes (Chr), including Chr7, 8, 14, 15 and 18. Four SNPs were novel resistance loci and nine SNPs were located near known QTL. A total of 30 genes were identified as candidate genes underlying SCN resistance. A total of sixteen novel soybean accessions were identified with significant resistance to HG Type 2.5.7. The beneficial alleles and candidate genes identified by GWAS might be valuable for improving marker-assisted breeding efficiency and exploring the molecular mechanisms underlying SCN resistance.

  9. Uniform approximation is more appropriate for Wilcoxon Rank-Sum Test in gene set analysis.

    Directory of Open Access Journals (Sweden)

    Zhide Fang

    Full Text Available Gene set analysis is widely used to facilitate biological interpretations in the analyses of differential expression from high throughput profiling data. Wilcoxon Rank-Sum (WRS test is one of the commonly used methods in gene set enrichment analysis. It compares the ranks of genes in a gene set against those of genes outside the gene set. This method is easy to implement and it eliminates the dichotomization of genes into significant and non-significant in a competitive hypothesis testing. Due to the large number of genes being examined, it is impractical to calculate the exact null distribution for the WRS test. Therefore, the normal distribution is commonly used as an approximation. However, as we demonstrate in this paper, the normal approximation is problematic when a gene set with relative small number of genes is tested against the large number of genes in the complementary set. In this situation, a uniform approximation is substantially more powerful, more accurate, and less intensive in computation. We demonstrate the advantage of the uniform approximations in Gene Ontology (GO term analysis using simulations and real data sets.

  10. Quantitative trait loci affecting the 3D skull shape and size in mouse and prioritization of candidate genes in-silico.

    Directory of Open Access Journals (Sweden)

    A. Murat eMaga

    2015-03-01

    Full Text Available We describe the first application of high-resolution 3D micro-computed tomography, together with 3D landmarks and geometric morphometrics, to map QTL responsible for variation in skull shape and size using a backcross between C57BL/6J and A/J inbred strains. Using 433 animals, 53 3D landmarks, and 882 SNPs from autosomes, we identified seven QTL responsible for the skull size (SCS.qtl and 30 QTL responsible for the skull shape (SSH.qtl. Size, sex and direction-of-cross were all significant factors and included in the analysis as covariates. All autosomes harbored at least one SSH.qtl, sometimes up to three. Effect sizes of SSH.qtl appeared to be small, rarely exceeding 1% of the overall shape variation. However, they account for significant amount of variation in some specific directions of the shape space. Many QTL have stronger effect on the neurocranium than expected from a random vector that will parcellate uniformly across the four cranial regions. On the contrary, most of QTL have an effect on the palate weaker than expected. Combined interval length of 30 SSH.qtl was about 315MB and contained 2,476 known protein coding genes. We used a bioinformatics approach to filter these candidate genes and identified 16 high-priority candidates that are likely to play a role in the craniofacial development and disorders. Thus, coupling the QTL mapping approach in model organisms with candidate gene enrichment approaches appears to be a feasible way to identify high-priority candidates genes related to the structure or tissue of interest.

  11. Evidence of novel fine-scale structural variation at autism spectrum disorder candidate loci

    Directory of Open Access Journals (Sweden)

    Hedges Dale J

    2012-04-01

    Full Text Available Abstract Background Autism spectrum disorders (ASD represent a group of neurodevelopmental disorders characterized by a core set of social-communicative and behavioral impairments. Gamma-aminobutyric acid (GABA is the major inhibitory neurotransmitter in the brain, acting primarily via the GABA receptors (GABR. Multiple lines of evidence, including altered GABA and GABA receptor expression in autistic patients, indicate that the GABAergic system may be involved in the etiology of autism. Methods As copy number variations (CNVs, particularly rare and de novo CNVs, have now been implicated in ASD risk, we examined the GABA receptors and genes in related pathways for structural variation that may be associated with autism. We further extended our candidate gene set to include 19 genes and regions that had either been directly implicated in the autism literature or were directly related (via function or ancestry to these primary candidates. For the high resolution CNV screen we employed custom-designed 244 k comparative genomic hybridization (CGH arrays. Collectively, our probes spanned a total of 11 Mb of GABA-related and additional candidate regions with a density of approximately one probe every 200 nucleotides, allowing a theoretical resolution for detection of CNVs of approximately 1 kb or greater on average. One hundred and sixty-eight autism cases and 149 control individuals were screened for structural variants. Prioritized CNV events were confirmed using quantitative PCR, and confirmed loci were evaluated on an additional set of 170 cases and 170 control individuals that were not included in the original discovery set. Loci that remained interesting were subsequently screened via quantitative PCR on an additional set of 755 cases and 1,809 unaffected family members. Results Results include rare deletions in autistic individuals at JAKMIP1, NRXN1, Neuroligin4Y, OXTR, and ABAT. Common insertion/deletion polymorphisms were detected at several

  12. Candidate Gene Identification with SNP Marker-Based Fine Mapping of Anthracnose Resistance Gene Co-4 in Common Bean.

    Science.gov (United States)

    Burt, Andrew J; William, H Manilal; Perry, Gregory; Khanal, Raja; Pauls, K Peter; Kelly, James D; Navabi, Alireza

    2015-01-01

    Anthracnose, caused by Colletotrichum lindemuthianum, is an important fungal disease of common bean (Phaseolus vulgaris). Alleles at the Co-4 locus confer resistance to a number of races of C. lindemuthianum. A population of 94 F4:5 recombinant inbred lines of a cross between resistant black bean genotype B09197 and susceptible navy bean cultivar Nautica was used to identify markers associated with resistance in bean chromosome 8 (Pv08) where Co-4 is localized. Three SCAR markers with known linkage to Co-4 and a panel of single nucleotide markers were used for genotyping. A refined physical region on Pv08 with significant association with anthracnose resistance identified by markers was used in BLAST searches with the genomic sequence of common bean accession G19833. Thirty two unique annotated candidate genes were identified that spanned a physical region of 936.46 kb. A majority of the annotated genes identified had functional similarity to leucine rich repeats/receptor like kinase domains. Three annotated genes had similarity to 1, 3-β-glucanase domains. There were sequence similarities between some of the annotated genes found in the study and the genes associated with phosphoinositide-specific phosphilipases C associated with Co-x and the COK-4 loci found in previous studies. It is possible that the Co-4 locus is structured as a group of genes with functional domains dominated by protein tyrosine kinase along with leucine rich repeats/nucleotide binding site, phosphilipases C as well as β-glucanases.

  13. Expressed sequence tags from larval gut of the European corn borer (Ostrinia nubilalis: Exploring candidate genes potentially involved in Bacillus thuringiensis toxicity and resistance

    Directory of Open Access Journals (Sweden)

    Crespo Andre LB

    2009-06-01

    Full Text Available Abstract Background Lepidoptera represents more than 160,000 insect species which include some of the most devastating pests of crops, forests, and stored products. However, the genomic information on lepidopteran insects is very limited. Only a few studies have focused on developing expressed sequence tag (EST libraries from the guts of lepidopteran larvae. Knowledge of the genes that are expressed in the insect gut are crucial for understanding basic physiology of food digestion, their interactions with Bacillus thuringiensis (Bt toxins, and for discovering new targets for novel toxins for use in pest management. This study analyzed the ESTs generated from the larval gut of the European corn borer (ECB, Ostrinia nubilalis, one of the most destructive pests of corn in North America and the western world. Our goals were to establish an ECB larval gut-specific EST database as a genomic resource for future research and to explore candidate genes potentially involved in insect-Bt interactions and Bt resistance in ECB. Results We constructed two cDNA libraries from the guts of the fifth-instar larvae of ECB and sequenced a total of 15,000 ESTs from these libraries. A total of 12,519 ESTs (83.4% appeared to be high quality with an average length of 656 bp. These ESTs represented 2,895 unique sequences, including 1,738 singletons and 1,157 contigs. Among the unique sequences, 62.7% encoded putative proteins that shared significant sequence similarities (E-value ≤ 10-3with the sequences available in GenBank. Our EST analysis revealed 52 candidate genes that potentially have roles in Bt toxicity and resistance. These genes encode 18 trypsin-like proteases, 18 chymotrypsin-like proteases, 13 aminopeptidases, 2 alkaline phosphatases and 1 cadherin-like protein. Comparisons of expression profiles of 41 selected candidate genes between Cry1Ab-susceptible and resistant strains of ECB by RT-PCR showed apparently decreased expressions in 2 trypsin-like and 2

  14. Genome-wide association study and annotating candidate gene networks affecting age at first calving in Nellore cattle.

    Science.gov (United States)

    Mota, R R; Guimarães, S E F; Fortes, M R S; Hayes, B; Silva, F F; Verardo, L L; Kelly, M J; de Campos, C F; Guimarães, J D; Wenceslau, R R; Penitente-Filho, J M; Garcia, J F; Moore, S

    2017-12-01

    We performed a genome-wide mapping for the age at first calving (AFC) with the goal of annotating candidate genes that regulate fertility in Nellore cattle. Phenotypic data from 762 cows and 777k SNP genotypes from 2,992 bulls and cows were used. Single nucleotide polymorphism (SNP) effects based on the single-step GBLUP methodology were blocked into adjacent windows of 1 Megabase (Mb) to explain the genetic variance. SNP windows explaining more than 0.40% of the AFC genetic variance were identified on chromosomes 2, 8, 9, 14, 16 and 17. From these windows, we identified 123 coding protein genes that were used to build gene networks. From the association study and derived gene networks, putative candidate genes (e.g., PAPPA, PREP, FER1L6, TPR, NMNAT1, ACAD10, PCMTD1, CRH, OPKR1, NPBWR1 and NCOA2) and transcription factors (TF) (STAT1, STAT3, RELA, E2F1 and EGR1) were strongly associated with female fertility (e.g., negative regulation of luteinizing hormone secretion, folliculogenesis and establishment of uterine receptivity). Evidence suggests that AFC inheritance is complex and controlled by multiple loci across the genome. As several windows explaining higher proportion of the genetic variance were identified on chromosome 14, further studies investigating the interaction across haplotypes to better understand the molecular architecture behind AFC in Nellore cattle should be undertaken. © 2017 Blackwell Verlag GmbH.

  15. Secretome Characterization and Correlation Analysis Reveal Putative Pathogenicity Mechanisms and Identify Candidate Avirulence Genes in the Wheat Stripe Rust Fungus Puccinia striiformis f. sp. tritici.

    Science.gov (United States)

    Xia, Chongjing; Wang, Meinan; Cornejo, Omar E; Jiwan, Derick A; See, Deven R; Chen, Xianming

    2017-01-01

    Stripe (yellow) rust, caused by Puccinia striiformis f. sp. tritici ( Pst ), is one of the most destructive diseases of wheat worldwide. Planting resistant cultivars is an effective way to control this disease, but race-specific resistance can be overcome quickly due to the rapid evolving Pst population. Studying the pathogenicity mechanisms is critical for understanding how Pst virulence changes and how to develop wheat cultivars with durable resistance to stripe rust. We re-sequenced 7 Pst isolates and included additional 7 previously sequenced isolates to represent balanced virulence/avirulence profiles for several avirulence loci in seretome analyses. We observed an uneven distribution of heterozygosity among the isolates. Secretome comparison of Pst with other rust fungi identified a large portion of species-specific secreted proteins, suggesting that they may have specific roles when interacting with the wheat host. Thirty-two effectors of Pst were identified from its secretome. We identified candidates for Avr genes corresponding to six Yr genes by correlating polymorphisms for effector genes to the virulence/avirulence profiles of the 14 Pst isolates. The putative AvYr76 was present in the avirulent isolates, but absent in the virulent isolates, suggesting that deleting the coding region of the candidate avirulence gene has produced races virulent to resistance gene Yr76 . We conclude that incorporating avirulence/virulence phenotypes into correlation analysis with variations in genomic structure and secretome, particularly presence/absence polymorphisms of effectors, is an efficient way to identify candidate Avr genes in Pst . The candidate effector genes provide a rich resource for further studies to determine the evolutionary history of Pst populations and the co-evolutionary arms race between Pst and wheat. The Avr candidates identified in this study will lead to cloning avirulence genes in Pst , which will enable us to understand molecular mechanisms

  16. No Association between Variation in Longevity Candidate Genes and Aging-related Phenotypes in Oldest-old Danes.

    Science.gov (United States)

    Soerensen, Mette; Nygaard, Marianne; Debrabant, Birgit; Mengel-From, Jonas; Dato, Serena; Thinggaard, Mikael; Christensen, Kaare; Christiansen, Lene

    2016-06-01

    In this study we explored the association between aging-related phenotypes previously reported to predict survival in old age and variation in 77 genes from the DNA repair pathway, 32 genes from the growth hormone 1/ insulin-like growth factor 1/insulin (GH/IGF-1/INS) signalling pathway and 16 additional genes repeatedly considered as candidates for human longevity: APOE, APOA4, APOC3, ACE, CETP, HFE, IL6, IL6R, MTHFR, TGFB1, SIRTs 1, 3, 6; and HSPAs 1A, 1L, 14. Altogether, 1,049 single nucleotide polymorphisms (SNPs) were genotyped in 1,088 oldest-old (age 92-93 years) Danes and analysed with phenotype data on physical functioning (hand grip strength), cognitive functioning (mini mental state examination and a cognitive composite score), activity of daily living and self-rated health. Five SNPs showed association to one of the phenotypes; however, none of these SNPs were associated with a change in the relevant phenotype over time (7 years of follow-up) and none of the SNPs could be confirmed in a replication sample of 1,281 oldest-old Danes (age 94-100). Hence, our study does not support association between common variation in the investigated longevity candidate genes and aging-related phenotypes consistently shown to predict survival. It is possible that larger sample sizes are needed to robustly reveal associations with small effect sizes. Copyright © 2016 Elsevier Inc. All rights reserved.

  17. Identifying the candidate genes involved in the calyx abscission process of 'Kuerlexiangli' (Pyrus sinkiangensis Yu) by digital transcript abundance measurements.

    Science.gov (United States)

    Qi, Xiaoxiao; Wu, Jun; Wang, Lifen; Li, Leiting; Cao, Yufen; Tian, Luming; Dong, Xingguang; Zhang, Shaoling

    2013-10-23

    'Kuerlexiangli' (Pyrus sinkiangensis Yu), a native pear of Xinjiang, China, is an important agricultural fruit and primary export to the international market. However, fruit with persistent calyxes affect fruit shape and quality. Although several studies have looked into the physiological aspects of the calyx abscission process, the underlying molecular mechanisms remain unknown. In order to better understand the molecular basis of the process of calyx abscission, materials at three critical stages of regulation, with 6000 × Flusilazole plus 300 × PBO treatment (calyx abscising treatment) and 50 mg.L-1GA3 treatment (calyx persisting treatment), were collected and cDNA fragments were sequenced using digital transcript abundance measurements to identify candidate genes. Digital transcript abundance measurements was performed using high-throughput Illumina GAII sequencing on seven samples that were collected at three important stages of the calyx abscission process with chemical agent treatments promoting calyx abscission and persistence. Altogether more than 251,123,845 high quality reads were obtained with approximately 8.0 M raw data for each library. The values of 69.85%-71.90% of clean data in the digital transcript abundance measurements could be mapped to the pear genome database. There were 12,054 differentially expressed genes having Gene Ontology (GO) terms and associating with 251 Kyoto Encyclopedia of Genes and Genomes (KEGG) defined pathways. The differentially expressed genes correlated with calyx abscission were mainly involved in photosynthesis, plant hormone signal transduction, cell wall modification, transcriptional regulation, and carbohydrate metabolism. Furthermore, candidate calyx abscission-specific genes, e.g. Inflorescence deficient in abscission gene, were identified. Quantitative real-time PCR was used to confirm the digital transcript abundance measurements results. We identified candidate genes that showed highly dynamic changes in

  18. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    NARCIS (Netherlands)

    K.M. Hettne (Kristina); J. Boorsma (Jeffrey); D.A.M. van Dartel (Dorien A M); J.J. Goeman (Jelle); E.C. de Jong (Esther); A.H. Piersma (Aldert); R.H. Stierum (Rob); J. Kleinjans (Jos); J.A. Kors (Jan)

    2013-01-01

    textabstractBackground: Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with

  19. Transcriptome analysis reveals candidate genes involved in luciferin metabolism in Luciola aquatilis (Coleoptera: Lampyridae

    Directory of Open Access Journals (Sweden)

    Wanwipa Vongsangnak

    2016-10-01

    Full Text Available Bioluminescence, which living organisms such as fireflies emit light, has been studied extensively for over half a century. This intriguing reaction, having its origins in nature where glowing insects can signal things such as attraction or defense, is now widely used in biotechnology with applications of bioluminescence and chemiluminescence. Luciferase, a key enzyme in this reaction, has been well characterized; however, the enzymes involved in the biosynthetic pathway of its substrate, luciferin, remains unsolved at present. To elucidate the luciferin metabolism, we performed a de novo transcriptome analysis using larvae of the firefly species, Luciola aquatilis. Here, a comparative analysis is performed with the model coleopteran insect Tribolium casteneum to elucidate the metabolic pathways in L. aquatilis. Based on a template luciferin biosynthetic pathway, combined with a range of protein and pathway databases, and various prediction tools for functional annotation, the candidate genes, enzymes, and biochemical reactions involved in luciferin metabolism are proposed for L. aquatilis. The candidate gene expression is validated in the adult L. aquatilis using reverse transcription PCR (RT-PCR. This study provides useful information on the bio-production of luciferin in the firefly and will benefit to future applications of the valuable firefly bioluminescence system.

  20. Resistance gene candidates identified by PCR with degenerate oligonucleotide primers map to clusters of resistance genes in lettuce.

    Science.gov (United States)

    Shen, K A; Meyers, B C; Islam-Faridi, M N; Chin, D B; Stelly, D M; Michelmore, R W

    1998-08-01

    The recent cloning of genes for resistance against diverse pathogens from a variety of plants has revealed that many share conserved sequence motifs. This provides the possibility of isolating numerous additional resistance genes by polymerase chain reaction (PCR) with degenerate oligonucleotide primers. We amplified resistance gene candidates (RGCs) from lettuce with multiple combinations of primers with low degeneracy designed from motifs in the nucleotide binding sites (NBSs) of RPS2 of Arabidopsis thaliana and N of tobacco. Genomic DNA, cDNA, and bacterial artificial chromosome (BAC) clones were successfully used as templates. Four families of sequences were identified that had the same similarity to each other as to resistance genes from other species. The relationship of the amplified products to resistance genes was evaluated by several sequence and genetic criteria. The amplified products contained open reading frames with additional sequences characteristic of NBSs. Hybridization of RGCs to genomic DNA and to BAC clones revealed large numbers of related sequences. Genetic analysis demonstrated the existence of clustered multigene families for each of the four RGC sequences. This parallels classical genetic data on clustering of disease resistance genes. Two of the four families mapped to known clusters of resistance genes; these two families were therefore studied in greater detail. Additional evidence that these RGCs could be resistance genes was gained by the identification of leucine-rich repeat (LRR) regions in sequences adjoining the NBS similar to those in RPM1 and RPS2 of A. thaliana. Fluorescent in situ hybridization confirmed the clustered genomic distribution of these sequences. The use of PCR with degenerate oligonucleotide primers is therefore an efficient method to identify numerous RGCs in plants.

  1. Identification of Quantitative Trait Loci (QTL) and Candidate Genes for Cadmium Tolerance in Populus

    Energy Technology Data Exchange (ETDEWEB)

    Induri, Brahma R [West Virginia University; Ellis, Danielle R [West Virginia University; Slavov, Gancho [West Virginia University; Yin, Tongming [ORNL; Muchero, Wellington [ORNL; Tuskan, Gerald A [ORNL; DiFazio, Stephen P [West Virginia University

    2012-01-01

    Knowledge of genetic variation in response of Populus to heavy metals like cadmium (Cd) is an important step in understanding the underlying mechanisms of tolerance. In this study, a pseudo-backcross pedigree of Populus trichocarpa and Populus deltoides was characterized for Cd exposure. The pedigree showed significant variation for Cd tolerance thus enabling the identification of relatively tolerant and susceptible genotypes for intensive characterization. A total of 16 QTLs at logarithm of odds (LOD) ratio > 2.5, were found to be associated with total dry weight, its components, and root volume. Four major QTLs for total dry weight were mapped to different linkage groups in control (LG III) and Cd conditions (LG XVI) and had opposite allelic effects on Cd tolerance, suggesting that these genomic regions were differentially controlled. The phenotypic variation explained by Cd QTL for all traits under study varied from 5.9% to 11.6% and averaged 8.2% across all QTL. Leaf Cd contents also showed significant variation suggesting the phytoextraction potential of Populus genotypes, though heritability of this trait was low (0.22). A whole-genome microarray study was conducted by using two genotypes with extreme responses for Cd tolerance in the above study and differentially expressed genes were identified. Candidate genes including CAD2 (CADMIUM SENSITIVE 2), HMA5 (HEAVY METAL ATPase5), ATGTST1 (Arabidopsis thaliana Glutathione S-Transferase1), ATGPX6 (Glutathione peroxidase 6), and ATMRP 14 (Arabidopsis thaliana Multidrug Resistance associated Protein 14) were identified from QTL intervals and microarray study. Functional characterization of these candidate genes could enhance phytoremediation capabilities of Populus.

  2. LOD score exclusion analyses for candidate QTLs using random population samples.

    Science.gov (United States)

    Deng, Hong-Wen

    2003-11-01

    While extensive analyses have been conducted to test for, no formal analyses have been conducted to test against, the importance of candidate genes as putative QTLs using random population samples. Previously, we developed an LOD score exclusion mapping approach for candidate genes for complex diseases. Here, we extend this LOD score approach for exclusion analyses of candidate genes for quantitative traits. Under this approach, specific genetic effects (as reflected by heritability) and inheritance models at candidate QTLs can be analyzed and if an LOD score is < or = -2.0, the locus can be excluded from having a heritability larger than that specified. Simulations show that this approach has high power to exclude a candidate gene from having moderate genetic effects if it is not a QTL and is robust to population admixture. Our exclusion analysis complements association analysis for candidate genes as putative QTLs in random population samples. The approach is applied to test the importance of Vitamin D receptor (VDR) gene as a potential QTL underlying the variation of bone mass, an important determinant of osteoporosis.

  3. Gene expression analysis identifies new candidate genes associated with the development of black skin spots in Corriedale sheep.

    Science.gov (United States)

    Peñagaricano, Francisco; Zorrilla, Pilar; Naya, Hugo; Robello, Carlos; Urioste, Jorge I

    2012-02-01

    The white coat colour of sheep is an important economic trait. For unknown reasons, some animals are born with, and others develop with time, black skin spots that can also produce pigmented fibres. The presence of pigmented fibres in the white wool significantly decreases the fibre quality. The aim of this work was to study gene expression in black spots (with and without pigmented fibres) and white skin by microarray techniques, in order to identify the possible genes involved in the development of this trait. Five unrelated Corriedale sheep were used and, for each animal, the three possible comparisons (three different hybridisations) between the three samples of interest were performed. Differential gene expression patterns were analysed using different t-test approaches. Most of the major genes with well-known roles in skin pigmentation, e.g. ASIP, MC1R and C-KIT, showed no significant difference in the gene expression between white skin and black spots. On the other hand, many of the differentially expressed genes (raw P-value spots. The gene expression of C-FOS and KLF4, transcription factors involved in the cellular response to external factors such as ultraviolet light, was validated by quantitative polymerase chain reaction (PCR). This exploratory study provides a list of candidate genes that could be associated with the development of black skin spots that should be studied in more detail. Characterisation of these genes will enable us to discern the molecular mechanisms involved in the development of this feature and, hence, increase our understanding of melanocyte biology and skin pigmentation. In sheep, understanding this phenomenon is a first step towards developing molecular tools to assist in the selection against the presence of pigmented fibres in white wool.

  4. Candidate Gene Identification of Feed Efficiency and Coat Color Traits in a C57BL/6J × Kunming F2 Mice Population Using Genome-Wide Association Study.

    Science.gov (United States)

    Miao, Yuanxin; Soudy, Fathia; Xu, Zhong; Liao, Mingxing; Zhao, Shuhong; Li, Xinyun

    2017-01-01

    Feed efficiency (FE) is a very important trait in livestock industry. Identification of the candidate genes could be of benefit for the improvement of FE trait. Mouse is used as the model for many studies in mammals. In this study, the candidate genes related to FE and coat color were identified using C57BL/6J (C57) × Kunming (KM) F2 mouse population. GWAS results showed that 61 and 2 SNPs were genome-wise suggestive significantly associated with feed conversion ratio (FCR) and feed intake (FI) traits, respectively. Moreover, the Erbin, Msrb2, Ptf1a, and Fgf10 were considered as the candidate genes of FE. The Lpl was considered as the candidate gene of FI. Further, the coat color trait was studied. KM mice are white and C57 ones are black. The GWAS results showed that the most significant SNP was located at chromosome 7, and the closely linked gene was Tyr. Therefore, our study offered useful target genes related to FE in mice; these genes may play similar roles in FE of livestock. Also, we identified the major gene of coat color in mice, which would be useful for better understanding of natural mutation of the coat color in mice.

  5. Candidate Gene Identification of Feed Efficiency and Coat Color Traits in a C57BL/6J × Kunming F2 Mice Population Using Genome-Wide Association Study

    Directory of Open Access Journals (Sweden)

    Yuanxin Miao

    2017-01-01

    Full Text Available Feed efficiency (FE is a very important trait in livestock industry. Identification of the candidate genes could be of benefit for the improvement of FE trait. Mouse is used as the model for many studies in mammals. In this study, the candidate genes related to FE and coat color were identified using C57BL/6J (C57 × Kunming (KM F2 mouse population. GWAS results showed that 61 and 2 SNPs were genome-wise suggestive significantly associated with feed conversion ratio (FCR and feed intake (FI traits, respectively. Moreover, the Erbin, Msrb2, Ptf1a, and Fgf10 were considered as the candidate genes of FE. The Lpl was considered as the candidate gene of FI. Further, the coat color trait was studied. KM mice are white and C57 ones are black. The GWAS results showed that the most significant SNP was located at chromosome 7, and the closely linked gene was Tyr. Therefore, our study offered useful target genes related to FE in mice; these genes may play similar roles in FE of livestock. Also, we identified the major gene of coat color in mice, which would be useful for better understanding of natural mutation of the coat color in mice.

  6. Using the candidate gene approach for detecting genes underlying seed oil concentration and yield in soybean.

    Science.gov (United States)

    Eskandari, Mehrzad; Cober, Elroy R; Rajcan, Istvan

    2013-07-01

    Increasing the oil concentration in soybean seeds has been given more attention in recent years because of demand for both edible oil and biodiesel production. Oil concentration in soybean is a complex quantitative trait regulated by many genes as well as environmental conditions. To identify genes governing seed oil concentration in soybean, 16 putative candidate genes of three important gene families (GPAT: acyl-CoA:sn-glycerol-3-phosphate acyltransferase, DGAT: acyl-CoA:diacylglycerol acyltransferase, and PDAT: phospholipid:diacylglycerol acyltransferase) involved in triacylglycerol (TAG) biosynthesis pathways were selected and their sequences retrieved from the soybean database ( http://www.phytozome.net/soybean ). Three sequence mutations were discovered in either coding or noncoding regions of three DGAT soybean isoforms when comparing the parents of a 203 recombinant inbreed line (RIL) population; OAC Wallace and OAC Glencoe. The RIL population was used to study the effects of these mutations on seed oil concentration and other important agronomic and seed composition traits, including seed yield and protein concentration across three field locations in Ontario, Canada, in 2009 and 2010. An insertion/deletion (indel) mutation in the GmDGAT2B gene in OAC Wallace was significantly associated with reduced seed oil concentration across three environments and reduced seed yield at Woodstock in 2010. A mutation in the 3' untranslated (3'UTR) region of GmDGAT2C was associated with seed yield at Woodstock in 2009. A mutation in the intronic region of GmDGAR1B was associated with seed yield and protein concentration at Ottawa in 2010. The genes identified in this study had minor effects on either seed yield or oil concentration, which was in agreement with the quantitative nature of the traits. However, the novel gene-specific markers designed in the present study can be used in soybean breeding for marker-assisted selection aimed at increasing seed yield and oil

  7. Regulatory Mechanisms of a Highly Pectinolytic Mutant of Penicillium occitanis and Functional Analysis of a Candidate Gene in the Plant Pathogen Fusarium oxysporum

    Directory of Open Access Journals (Sweden)

    Gustavo Bravo-Ruiz

    2017-09-01

    Full Text Available Penicillium occitanis is a model system for enzymatic regulation. A mutant strain exhibiting constitutive overproduction of different pectinolytic enzymes both under inducing (pectin or repressing conditions (glucose was previously isolated after chemical mutagenesis. In order to identify the molecular basis of this regulatory mechanism, the genomes of the wild type and the derived mutant strain were sequenced and compared, providing the first reference genome for this species. We used a phylogenomic approach to compare P. occitanis with other pectinolytic fungi and to trace expansions of gene families involved in carbohydrate degradation. Genome comparison between wild type and mutant identified seven mutations associated with predicted proteins. The most likely candidate was a mutation in a highly conserved serine residue of a conserved fungal protein containing a GAL4-like Zn2Cys6 binuclear cluster DNA-binding domain and a fungus-specific transcription factor regulatory middle homology region. To functionally characterize the role of this candidate gene, the mutation was recapitulated in the predicted orthologue Fusarium oxysporum, a vascular wilt pathogen which secretes a wide array of plant cell wall degrading enzymes, including polygalacturonases, pectate lyases, xylanases and proteases, all of which contribute to infection. However, neither the null mutant nor a mutant carrying the analogous point mutation exhibited a deregulation of pectinolytic enzymes. The availability, annotation and phylogenomic analysis of the P. occitanis genome sequence represents an important resource for understanding the evolution and biology of this species, and sets the basis for the discovery of new genes of biotechnological interest for the degradation of complex polysaccharides.

  8. Identification of a robust gene signature that predicts breast cancer outcome in independent data sets

    International Nuclear Information System (INIS)

    Korkola, James E; Waldman, Frederic M; Blaveri, Ekaterina; DeVries, Sandy; Moore, Dan H II; Hwang, E Shelley; Chen, Yunn-Yi; Estep, Anne LH; Chew, Karen L; Jensen, Ronald H

    2007-01-01

    Breast cancer is a heterogeneous disease, presenting with a wide range of histologic, clinical, and genetic features. Microarray technology has shown promise in predicting outcome in these patients. We profiled 162 breast tumors using expression microarrays to stratify tumors based on gene expression. A subset of 55 tumors with extensive follow-up was used to identify gene sets that predicted outcome. The predictive gene set was further tested in previously published data sets. We used different statistical methods to identify three gene sets associated with disease free survival. A fourth gene set, consisting of 21 genes in common to all three sets, also had the ability to predict patient outcome. To validate the predictive utility of this derived gene set, it was tested in two published data sets from other groups. This gene set resulted in significant separation of patients on the basis of survival in these data sets, correctly predicting outcome in 62–65% of patients. By comparing outcome prediction within subgroups based on ER status, grade, and nodal status, we found that our gene set was most effective in predicting outcome in ER positive and node negative tumors. This robust gene selection with extensive validation has identified a predictive gene set that may have clinical utility for outcome prediction in breast cancer patients

  9. Organization and annotation of the Xcat critical region: elimination of seven positional candidate genes.

    Science.gov (United States)

    Huang, Kristen M; Geunes-Boyer, Scarlett; Wu, Sufen; Dutra, Amalia; Favor, Jack; Stambolian, Dwight

    2004-05-01

    Xcat mice display X-linked congenital cataracts and are a mouse model for the human X-linked cataract disease Nance Horan syndrome (NHS). The genetic defect in Xcat mice and NHS patients is not known. We isolated and sequenced a BAC contig representing a portion of the Xcat critical region. We combined our sequencing data with the most recent mouse sequence assemblies from both Celera and public databases. The sequence of the 2.2-Mb Xcat critical region was then analyzed for potential Xcat candidate genes. The coding regions of the seven known genes within this area (Rai2, Rbbp7, Ctps2, Calb3, Grpr, Reps2, and Syap1) were sequenced in Xcat mice and no mutations were detected. The expression of Rai2 was quantitatively identical in wild-type and Xcat mutant eyes. These results indicate that the Xcat mutation is within a novel, undiscovered gene.

  10. Combined serial analysis of gene expression and transcription factor binding site prediction identifies novel-candidate-target genes of Nr2e1 in neocortex development.

    Science.gov (United States)

    Schmouth, Jean-François; Arenillas, David; Corso-Díaz, Ximena; Xie, Yuan-Yun; Bohacec, Slavita; Banks, Kathleen G; Bonaguro, Russell J; Wong, Siaw H; Jones, Steven J M; Marra, Marco A; Simpson, Elizabeth M; Wasserman, Wyeth W

    2015-07-24

    Nr2e1 (nuclear receptor subfamily 2, group e, member 1) encodes a transcription factor important in neocortex development. Previous work has shown that nuclear receptors can have hundreds of target genes, and bind more than 300 co-interacting proteins. However, recognition of the critical role of Nr2e1 in neural stem cells and neocortex development is relatively recent, thus the molecular mechanisms involved for this nuclear receptor are only beginning to be understood. Serial analysis of gene expression (SAGE), has given researchers both qualitative and quantitative information pertaining to biological processes. Thus, in this work, six LongSAGE mouse libraries were generated from laser microdissected tissue samples of dorsal VZ/SVZ (ventricular zone and subventricular zone) from the telencephalon of wild-type (Wt) and Nr2e1-null embryos at the critical development ages E13.5, E15.5, and E17.5. We then used a novel approach, implementing multiple computational methods followed by biological validation to further our understanding of Nr2e1 in neocortex development. In this work, we have generated a list of 1279 genes that are differentially expressed in response to altered Nr2e1 expression during in vivo neocortex development. We have refined this list to 64 candidate direct-targets of NR2E1. Our data suggested distinct roles for Nr2e1 during different neocortex developmental stages. Most importantly, our results suggest a possible novel pathway by which Nr2e1 regulates neurogenesis, which includes Lhx2 as one of the candidate direct-target genes, and SOX9 as a co-interactor. In conclusion, we have provided new candidate interacting partners and numerous well-developed testable hypotheses for understanding the pathways by which Nr2e1 functions to regulate neocortex development.

  11. Confirming candidate genes for longevity in Drosophila melanogaster using two different genetic backgrounds and selection methods

    DEFF Research Database (Denmark)

    Wit, Janneke; Frydenberg, Jane; Sarup, Pernille Merete

    2013-01-01

    usually focussed on one sex and on flies originating from one genetic background, and results from different studies often do not overlap. Using D. melanogaster selected for increased longevity we aimed to find robust longevity related genes by examining gene expression in both sexes of flies originating......Elucidating genes that affect life span or that can be used as biomarkers for ageing has received attention in diverse studies in recent years. Using model organisms and various approaches several genes have been linked to the longevity phenotype. For Drosophila melanogaster those studies have...... from different genetic backgrounds. Further, we compared expression changes across three ages, when flies were young, middle aged or old, to examine how candidate gene expression changes with the onset of ageing. We selected 10 genes based on their expression differences in prior microarray studies...

  12. A Bayesian variable selection procedure for ranking overlapping gene sets

    DEFF Research Database (Denmark)

    Skarman, Axel; Mahdi Shariati, Mohammad; Janss, Luc

    2012-01-01

    Background Genome-wide expression profiling using microarrays or sequence-based technologies allows us to identify genes and genetic pathways whose expression patterns influence complex traits. Different methods to prioritize gene sets, such as the genes in a given molecular pathway, have been de...

  13. Delimiting Coalescence Genes (C-Genes) in Phylogenomic Data Sets.

    Science.gov (United States)

    Springer, Mark S; Gatesy, John

    2018-02-26

    coalescence methods have emerged as a popular alternative for inferring species trees with large genomic datasets, because these methods explicitly account for incomplete lineage sorting. However, statistical consistency of summary coalescence methods is not guaranteed unless several model assumptions are true, including the critical assumption that recombination occurs freely among but not within coalescence genes (c-genes), which are the fundamental units of analysis for these methods. Each c-gene has a single branching history, and large sets of these independent gene histories should be the input for genome-scale coalescence estimates of phylogeny. By contrast, numerous studies have reported the results of coalescence analyses in which complete protein-coding sequences are treated as c-genes even though exons for these loci can span more than a megabase of DNA. Empirical estimates of recombination breakpoints suggest that c-genes may be much shorter, especially when large clades with many species are the focus of analysis. Although this idea has been challenged recently in the literature, the inverse relationship between c-gene size and increased taxon sampling in a dataset-the 'recombination ratchet'-is a fundamental property of c-genes. For taxonomic groups characterized by genes with long intron sequences, complete protein-coding sequences are likely not valid c-genes and are inappropriate units of analysis for summary coalescence methods unless they occur in recombination deserts that are devoid of incomplete lineage sorting (ILS). Finally, it has been argued that coalescence methods are robust when the no-recombination within loci assumption is violated, but recombination must matter at some scale because ILS, a by-product of recombination, is the raison d'etre for coalescence methods. That is, extensive recombination is required to yield the large number of independently segregating c-genes used to infer a species tree. If coalescent methods are powerful

  14. An Independent Filter for Gene Set Testing Based on Spectral Enrichment

    NARCIS (Netherlands)

    Frost, H Robert; Li, Zhigang; Asselbergs, Folkert W; Moore, Jason H

    2015-01-01

    Gene set testing has become an indispensable tool for the analysis of high-dimensional genomic data. An important motivation for testing gene sets, rather than individual genomic variables, is to improve statistical power by reducing the number of tested hypotheses. Given the dramatic growth in

  15. Neurodevelopmental disorders associated with dosage imbalance of ZBTB20 correlate with the morbidity spectrum of ZBTB20 candidate target genes

    DEFF Research Database (Denmark)

    Rasmussen, Malene B; Nielsen, Jakob V; Lourenço, Charles M

    2014-01-01

    (SRO) involved five RefSeq genes, including the transcription factor gene ZBTB20 and the dopamine receptor gene DRD3, considered as candidate genes for the syndrome. METHODS AND RESULTS: We used array comparative genomic hybridization and next-generation mate-pair sequencing to identify key structural...... patient with developmental delay and autism, we detected the first microdeletion at 3q13.31, which truncated ZBTB20 but did not involve DRD3 or the other genes within the previously defined SRO. Zbtb20 directly represses 346 genes in the developing murine brain. Of the 342 human orthologous ZBTB20...

  16. Application of biclustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials

    Directory of Open Access Journals (Sweden)

    Andrew Williams

    2015-12-01

    Full Text Available Background: The presence of diverse types of nanomaterials (NMs in commerce is growing at an exponential pace. As a result, human exposure to these materials in the environment is inevitable, necessitating the need for rapid and reliable toxicity testing methods to accurately assess the potential hazards associated with NMs. In this study, we applied biclustering and gene set enrichment analysis methods to derive essential features of altered lung transcriptome following exposure to NMs that are associated with lung-specific diseases. Several datasets from public microarray repositories describing pulmonary diseases in mouse models following exposure to a variety of substances were examined and functionally related biclusters of genes showing similar expression profiles were identified. The identified biclusters were then used to conduct a gene set enrichment analysis on pulmonary gene expression profiles derived from mice exposed to nano-titanium dioxide (nano-TiO2, carbon black (CB or carbon nanotubes (CNTs to determine the disease significance of these data-driven gene sets.Results: Biclusters representing inflammation (chemokine activity, DNA binding, cell cycle, apoptosis, reactive oxygen species (ROS and fibrosis processes were identified. All of the NM studies were significant with respect to the bicluster related to chemokine activity (DAVID; FDR p-value = 0.032. The bicluster related to pulmonary fibrosis was enriched in studies where toxicity induced by CNT and CB studies was investigated, suggesting the potential for these materials to induce lung fibrosis. The pro-fibrogenic potential of CNTs is well established. Although CB has not been shown to induce fibrosis, it induces stronger inflammatory, oxidative stress and DNA damage responses than nano-TiO2 particles.Conclusion: The results of the analysis correctly identified all NMs to be inflammogenic and only CB and CNTs as potentially fibrogenic. In addition to identifying several

  17. Model-based gene set analysis for Bioconductor.

    Science.gov (United States)

    Bauer, Sebastian; Robinson, Peter N; Gagneur, Julien

    2011-07-01

    Gene Ontology and other forms of gene-category analysis play a major role in the evaluation of high-throughput experiments in molecular biology. Single-category enrichment analysis procedures such as Fisher's exact test tend to flag large numbers of redundant categories as significant, which can complicate interpretation. We have recently developed an approach called model-based gene set analysis (MGSA), that substantially reduces the number of redundant categories returned by the gene-category analysis. In this work, we present the Bioconductor package mgsa, which makes the MGSA algorithm available to users of the R language. Our package provides a simple and flexible application programming interface for applying the approach. The mgsa package has been made available as part of Bioconductor 2.8. It is released under the conditions of the Artistic license 2.0. peter.robinson@charite.de; julien.gagneur@embl.de.

  18. Candidate Gene Identification with SNP Marker-Based Fine Mapping of Anthracnose Resistance Gene Co-4 in Common Bean.

    Directory of Open Access Journals (Sweden)

    Andrew J Burt

    Full Text Available Anthracnose, caused by Colletotrichum lindemuthianum, is an important fungal disease of common bean (Phaseolus vulgaris. Alleles at the Co-4 locus confer resistance to a number of races of C. lindemuthianum. A population of 94 F4:5 recombinant inbred lines of a cross between resistant black bean genotype B09197 and susceptible navy bean cultivar Nautica was used to identify markers associated with resistance in bean chromosome 8 (Pv08 where Co-4 is localized. Three SCAR markers with known linkage to Co-4 and a panel of single nucleotide markers were used for genotyping. A refined physical region on Pv08 with significant association with anthracnose resistance identified by markers was used in BLAST searches with the genomic sequence of common bean accession G19833. Thirty two unique annotated candidate genes were identified that spanned a physical region of 936.46 kb. A majority of the annotated genes identified had functional similarity to leucine rich repeats/receptor like kinase domains. Three annotated genes had similarity to 1, 3-β-glucanase domains. There were sequence similarities between some of the annotated genes found in the study and the genes associated with phosphoinositide-specific phosphilipases C associated with Co-x and the COK-4 loci found in previous studies. It is possible that the Co-4 locus is structured as a group of genes with functional domains dominated by protein tyrosine kinase along with leucine rich repeats/nucleotide binding site, phosphilipases C as well as β-glucanases.

  19. Gene set analysis for interpreting genetic studies

    DEFF Research Database (Denmark)

    Pers, Tune H

    2016-01-01

    Interpretation of genome-wide association study (GWAS) results is lacking behind the discovery of new genetic associations. Consequently, there is an urgent need for data-driven methods for interpreting genetic association studies. Gene set analysis (GSA) can identify aetiologic pathways...

  20. Exome sequencing in 53 sporadic cases of schizophrenia identifies 18 putative candidate genes.

    Directory of Open Access Journals (Sweden)

    Michel Guipponi

    Full Text Available Schizophrenia (SCZ is a severe, debilitating mental illness which has a significant genetic component. The identification of genetic factors related to SCZ has been challenging and these factors remain largely unknown. To evaluate the contribution of de novo variants (DNVs to SCZ, we sequenced the exomes of 53 individuals with sporadic SCZ and of their non-affected parents. We identified 49 DNVs, 18 of which were predicted to alter gene function, including 13 damaging missense mutations, 2 conserved splice site mutations, 2 nonsense mutations, and 1 frameshift deletion. The average number of exonic DNV per proband was 0.88, which corresponds to an exonic point mutation rate of 1.7×10(-8 per nucleotide per generation. The non-synonymous-to-synonymous mutation ratio of 2.06 did not differ from neutral expectations. Overall, this study provides a list of 18 putative candidate genes for sporadic SCZ, and when combined with the results of similar reports, identifies a second proband carrying a non-synonymous DNV in the RGS12 gene.

  1. Assessment of PALB2 as a candidate melanoma susceptibility gene.

    Directory of Open Access Journals (Sweden)

    Lauren G Aoude

    Full Text Available Partner and localizer of BRCA2 (PALB2 interacts with BRCA2 to enable double strand break repair through homologous recombination. Similar to BRCA2, germline mutations in PALB2 have been shown to predispose to Fanconi anaemia as well as pancreatic and breast cancer. The PALB2/BRCA2 protein interaction, as well as the increased melanoma risk observed in families harbouring BRCA2 mutations, makes PALB2 a candidate for melanoma susceptibility. In order to assess PALB2 as a melanoma predisposition gene, we sequenced the entire protein-coding sequence of PALB2 in probands from 182 melanoma families lacking pathogenic mutations in known high penetrance melanoma susceptibility genes: CDKN2A, CDK4, and BAP1. In addition, we interrogated whole-genome and exome data from another 19 kindreds with a strong family history of melanoma for deleterious mutations in PALB2. Here we report a rare known deleterious PALB2 mutation (rs118203998 causing a premature truncation of the protein (p.Y1183X in an individual who had developed four different cancer types, including melanoma. Three other family members affected with melanoma did not carry the variant. Overall our data do not support a case for PALB2 being associated with melanoma predisposition.

  2. Finding new genes for non-syndromic hearing loss through an in silico prioritization study.

    Directory of Open Access Journals (Sweden)

    Matteo Accetturo

    Full Text Available At present, 51 genes are already known to be responsible for Non-Syndromic hereditary Hearing Loss (NSHL, but the knowledge of 121 NSHL-linked chromosomal regions brings to the hypothesis that a number of disease genes have still to be uncovered. To help scientists to find new NSHL genes, we built a gene-scoring system, integrating Gene Ontology, NCBI Gene and Map Viewer databases, which prioritizes the candidate genes according to their probability to cause NSHL. We defined a set of candidates and measured their functional similarity with respect to the disease gene set, computing a score ( S S M avg that relies on the assumption that functionally related genes might contribute to the same (disease phenotype. A Kolmogorov-Smirnov test, comparing the pair-wise distribution on the disease gene set with the distribution on the remaining human genes, provided a statistical assessment of this assumption. We found at a p-value 0.99. The twenty top-scored genes were finally examined to evaluate their possible involvement in NSHL. We found that half of them are known to be expressed in human inner ear or cochlea and are mainly involved in remodeling and organization of actin formation and maintenance of the cilia and the endocochlear potential. These findings strongly indicate that our metric was able to suggest excellent NSHL candidates to be screened in patients and controls for causative mutations.

  3. Distinguishing between cancer driver and passenger gene alteration candidates via cross-species comparison: a pilot study

    International Nuclear Information System (INIS)

    Ji, Xinglai; Tang, Jie; Halberg, Richard; Busam, Dana; Ferriera, Steve; Peña, Maria Marjorette O; Venkataramu, Chinnambally; Yeatman, Timothy J; Zhao, Shaying

    2010-01-01

    We are developing a cross-species comparison strategy to distinguish between cancer driver- and passenger gene alteration candidates, by utilizing the difference in genomic location of orthologous genes between the human and other mammals. As an initial test of this strategy, we conducted a pilot study with human colorectal cancer (CRC) and its mouse model C57BL/6J Apc Min/+ , focusing on human 5q22.2 and 18q21.1-q21.2. We first performed bioinformatics analysis on the evolution of 5q22.2 and 18q21.1-q21.2 regions. Then, we performed exon-targeted sequencing, real time quantitative polymerase chain reaction (qPCR), and real time quantitative reverse transcriptase PCR (qRT-PCR) analyses on a number of genes of both regions with both human and mouse colon tumors. These two regions (5q22.2 and 18q21.1-q21.2) are frequently deleted in human CRCs and encode genuine colorectal tumor suppressors APC and SMAD4. They also encode genes such as MCC (mutated in colorectal cancer) with their role in CRC etiology unknown. We have discovered that both regions are evolutionarily unstable, resulting in genes that are clustered in each human region being found scattered at several distinct loci in the genome of many other species. For instance, APC and MCC are within 200 kb apart in human 5q22.2 but are 10 Mb apart in the mouse genome. Importantly, our analyses revealed that, while known CRC driver genes APC and SMAD4 were disrupted in both human colorectal tumors and tumors from Apc Min/+ mice, the questionable MCC gene was disrupted in human tumors but appeared to be intact in mouse tumors. These results indicate that MCC may not actually play any causative role in early colorectal tumorigenesis. We also hypothesize that its disruption in human CRCs is likely a mere result of its close proximity to APC in the human genome. Expanding this pilot study to the entire genome may identify more questionable genes like MCC, facilitating the discovery of new CRC driver gene candidates

  4. Distinguishing between cancer driver and passenger gene alteration candidates via cross-species comparison: a pilot study.

    Science.gov (United States)

    Ji, Xinglai; Tang, Jie; Halberg, Richard; Busam, Dana; Ferriera, Steve; Peña, Maria Marjorette O; Venkataramu, Chinnambally; Yeatman, Timothy J; Zhao, Shaying

    2010-08-13

    We are developing a cross-species comparison strategy to distinguish between cancer driver- and passenger gene alteration candidates, by utilizing the difference in genomic location of orthologous genes between the human and other mammals. As an initial test of this strategy, we conducted a pilot study with human colorectal cancer (CRC) and its mouse model C57BL/6J ApcMin/+, focusing on human 5q22.2 and 18q21.1-q21.2. We first performed bioinformatics analysis on the evolution of 5q22.2 and 18q21.1-q21.2 regions. Then, we performed exon-targeted sequencing, real time quantitative polymerase chain reaction (qPCR), and real time quantitative reverse transcriptase PCR (qRT-PCR) analyses on a number of genes of both regions with both human and mouse colon tumors. These two regions (5q22.2 and 18q21.1-q21.2) are frequently deleted in human CRCs and encode genuine colorectal tumor suppressors APC and SMAD4. They also encode genes such as MCC (mutated in colorectal cancer) with their role in CRC etiology unknown. We have discovered that both regions are evolutionarily unstable, resulting in genes that are clustered in each human region being found scattered at several distinct loci in the genome of many other species. For instance, APC and MCC are within 200 kb apart in human 5q22.2 but are 10 Mb apart in the mouse genome. Importantly, our analyses revealed that, while known CRC driver genes APC and SMAD4 were disrupted in both human colorectal tumors and tumors from ApcMin/+ mice, the questionable MCC gene was disrupted in human tumors but appeared to be intact in mouse tumors. These results indicate that MCC may not actually play any causative role in early colorectal tumorigenesis. We also hypothesize that its disruption in human CRCs is likely a mere result of its close proximity to APC in the human genome. Expanding this pilot study to the entire genome may identify more questionable genes like MCC, facilitating the discovery of new CRC driver gene candidates.

  5. Expression stabilities of candidate reference genes for RT-qPCR under different stress conditions in soybean.

    Directory of Open Access Journals (Sweden)

    Shuhua Ma

    Full Text Available Due to its accuracy, sensitivity and high throughput, real time quantitative PCR (RT-qPCR has been widely used in analysing gene expression. The quality of data from such analyses is affected by the quality of reference genes used. Expression stabilities for nine candidate reference genes widely used in soybean were evaluated under different stresses in this study. Our results showed that EF1A and ACT11 were the best under salinity stress, TUB4, TUA5 and EF1A were the best under drought stress, ACT11 and UKN2 were the best under dark treatment, and EF1B and UKN2 were the best under virus infection. EF1B and UKN2 were the top two genes which can be reliably used in all of the stress conditions assessed.

  6. QTL Mapping by Whole Genome Re-sequencing and Analysis of Candidate Genes for Nitrogen Use Efficiency in Rice

    Directory of Open Access Journals (Sweden)

    Xinghai Yang

    2017-09-01

    Full Text Available Nitrogen is a major nutritional element in rice production. However, excessive application of nitrogen fertilizer has caused severe environmental pollution. Therefore, development of rice varieties with improved nitrogen use efficiency (NUE is urgent for sustainable agriculture. In this study, bulked segregant analysis (BSA combined with whole genome re-sequencing (WGS technology was applied to finely map quantitative trait loci (QTL for NUE. A key QTL, designated as qNUE6 was identified on chromosome 6 and further validated by Insertion/Deletion (InDel marker-based substitutional mapping in recombinants from F2 population (NIL-13B4 × GH998. Forty-four genes were identified in this 266.5-kb region. According to detection and annotation analysis of variation sites, 39 genes with large-effect single-nucleotide polymorphisms (SNPs and large-effect InDels were selected as candidates and their expression levels were analyzed by qRT-PCR. Significant differences in the expression levels of LOC_Os06g15370 (peptide transporter PTR2 and LOC_Os06g15420 (asparagine synthetase were observed between two parents (Y11 and GH998. Phylogenetic analysis in Arabidopsis thaliana identified two closely related homologs, AT1G68570 (AtNPF3.1 and AT5G65010 (ASN2, which share 72.3 and 87.5% amino acid similarity with LOC_Os06g15370 and LOC_Os06g15420, respectively. Taken together, our results suggested that qNUE6 is a possible candidate gene for NUE in rice. The fine mapping and candidate gene analysis of qNUE6 provide the basis of molecular breeding for genetic improvement of rice varieties with high NUE, and lay the foundation for further cloning and functional analysis.

  7. Evaluation of 6 candidate genes on chromosome 11q23 for coeliac disease susceptibility: a case control study.

    LENUS (Irish Health Repository)

    Brophy, Karen

    2010-01-01

    BACKGROUND: Recent whole genome analysis and follow-up studies have identified many new risk variants for coeliac disease (CD, gluten intolerance). The majority of newly associated regions encode candidate genes with a clear functional role in T-cell regulation. Furthermore, the newly discovered risk loci, together with the well established HLA locus, account for less than 50% of the heritability of CD, suggesting that numerous additional loci remain undiscovered. Linkage studies have identified some well-replicated risk regions, most notably chromosome 5q31 and 11q23. METHODS: We have evaluated six candidate genes in one of these regions (11q23), namely CD3E, CD3D, CD3G, IL10RA, THY1 and IL18, as risk factors for CD using a 2-phase candidate gene approach directed at chromosome 11q. 377 CD cases and 349 ethnically matched controls were used in the initial screening, followed by an extended sample of 171 additional coeliac cases and 536 additional controls. RESULTS: Promotor SNPs (-607, -137) in the IL18 gene, which has shown association with several autoimmune diseases, initially suggested association with CD (P < 0.05). Follow-up analyses of an extended sample supported the same, moderate effect (P < 0.05) for one of these. Haplotype analysis of IL18-137\\/-607 also supported this effect, primarily due to one relatively rare haplotype IL18-607C\\/-137C (P < 0.0001), which was independently associated in two case-control comparisons. This same haplotype has been noted in rheumatoid arthritis. CONCLUSION: Haplotypes of the IL18 promotor region may contribute to CD risk, consistent with this cytokine\\'s role in maintaining inflammation in active CD.

  8. Evaluation of 6 candidate genes on chromosome 11q23 for coeliac disease susceptibility: a case control study

    LENUS (Irish Health Repository)

    Brophy, Karen

    2010-05-17

    Abstract Background Recent whole genome analysis and follow-up studies have identified many new risk variants for coeliac disease (CD, gluten intolerance). The majority of newly associated regions encode candidate genes with a clear functional role in T-cell regulation. Furthermore, the newly discovered risk loci, together with the well established HLA locus, account for less than 50% of the heritability of CD, suggesting that numerous additional loci remain undiscovered. Linkage studies have identified some well-replicated risk regions, most notably chromosome 5q31 and 11q23. Methods We have evaluated six candidate genes in one of these regions (11q23), namely CD3E, CD3D, CD3G, IL10RA, THY1 and IL18, as risk factors for CD using a 2-phase candidate gene approach directed at chromosome 11q. 377 CD cases and 349 ethnically matched controls were used in the initial screening, followed by an extended sample of 171 additional coeliac cases and 536 additional controls. Results Promotor SNPs (-607, -137) in the IL18 gene, which has shown association with several autoimmune diseases, initially suggested association with CD (P < 0.05). Follow-up analyses of an extended sample supported the same, moderate effect (P < 0.05) for one of these. Haplotype analysis of IL18-137\\/-607 also supported this effect, primarily due to one relatively rare haplotype IL18-607C\\/-137C (P < 0.0001), which was independently associated in two case-control comparisons. This same haplotype has been noted in rheumatoid arthritis. Conclusion Haplotypes of the IL18 promotor region may contribute to CD risk, consistent with this cytokine\\'s role in maintaining inflammation in active CD.

  9. Genome-Wide Association Studies Identify Candidate Genes for Coat Color and Mohair Traits in the Iranian Markhoz Goat

    Directory of Open Access Journals (Sweden)

    Anahit Nazari-Ghadikolaei

    2018-04-01

    Full Text Available The Markhoz goat provides an opportunity to study the genetics underlying coat color and mohair traits of an Angora type goat using genome-wide association studies (GWAS. This indigenous Iranian breed is valued for its quality mohair used in ceremonial garments and has the distinction of exhibiting an array of coat colors including black, brown, and white. Here, we performed 16 GWAS for different fleece (mohair traits and coat color in 228 Markhoz goats sampled from the Markhoz Goat Research Station in Sanandaj, Kurdistan province, located in western Iran using the Illumina Caprine 50K beadchip. The Efficient Mixed Model Linear analysis was used to identify genomic regions with potential candidate genes contributing to coat color and mohair characteristics while correcting for population structure. Significant associations to coat color were found within or near the ASIP, ITCH, AHCY, and RALY genes on chromosome 13 for black and brown coat color and the KIT and PDGFRA genes on chromosome 6 for white coat color. Individual mohair traits were analyzed for genetic association along with principal components that allowed for a broader perspective of combined traits reflecting overall mohair quality and volume. A multitude of markers demonstrated significant association to mohair traits highlighting potential candidate genes of POU1F1 on chromosome 1 for mohair quality, MREG on chromosome 2 for mohair volume, DUOX1 on chromosome 10 for yearling fleece weight, and ADGRV1 on chromosome 7 for grease percentage. Variation in allele frequencies and haplotypes were identified for coat color and differentiated common markers associated with both brown and black coat color. This demonstrates the potential for genetic markers to be used in future breeding programs to improve selection for coat color and mohair traits. Putative candidate genes, both novel and previously identified in other species or breeds, require further investigation to confirm phenotypic

  10. A Genome-Wide Association Study for Culm Cellulose Content in Barley Reveals Candidate Genes Co-Expressed with Members of the CELLULOSE SYNTHASE A Gene Family

    Science.gov (United States)

    Houston, Kelly; Burton, Rachel A.; Sznajder, Beata; Rafalski, Antoni J.; Dhugga, Kanwarpal S.; Mather, Diane E.; Taylor, Jillian; Steffenson, Brian J.; Waugh, Robbie; Fincher, Geoffrey B.

    2015-01-01

    Cellulose is a fundamentally important component of cell walls of higher plants. It provides a scaffold that allows the development and growth of the plant to occur in an ordered fashion. Cellulose also provides mechanical strength, which is crucial for both normal development and to enable the plant to withstand both abiotic and biotic stresses. We quantified the cellulose concentration in the culm of 288 two – rowed and 288 six – rowed spring type barley accessions that were part of the USDA funded barley Coordinated Agricultural Project (CAP) program in the USA. When the population structure of these accessions was analysed we identified six distinct populations, four of which we considered to be comprised of a sufficient number of accessions to be suitable for genome-wide association studies (GWAS). These lines had been genotyped with 3072 SNPs so we combined the trait and genetic data to carry out GWAS. The analysis allowed us to identify regions of the genome containing significant associations between molecular markers and cellulose concentration data, including one region cross-validated in multiple populations. To identify candidate genes we assembled the gene content of these regions and used these to query a comprehensive RNA-seq based gene expression atlas. This provided us with gene annotations and associated expression data across multiple tissues, which allowed us to formulate a supported list of candidate genes that regulate cellulose biosynthesis. Several regions identified by our analysis contain genes that are co-expressed with CELLULOSE SYNTHASE A (HvCesA) across a range of tissues and developmental stages. These genes are involved in both primary and secondary cell wall development. In addition, genes that have been previously linked with cellulose synthesis by biochemical methods, such as HvCOBRA, a gene of unknown function, were also associated with cellulose levels in the association panel. Our analyses provide new insights into the

  11. Fructan accumulation and transcription of candidate genes during cold acclimation in three varieties of Poa pratensis

    DEFF Research Database (Denmark)

    Rao, R Shyama Prasad; Andersen, Jeppe Reitan; Dionisio, Giuseppe

    2011-01-01

    Poa pratensis, a type species for the grass family (Poaceae), is an important cool season grass that accumulates fructans as a polysaccharide reserve. We studied fructan contents and expression of candidate fructan metabolism genes during cold acclimation in three varieties of P. pratensis adapted...... to different environments: Northern Norway, Denmark, and the Netherlands. Fructan content increased significantly during cold acclimation and varieties showed significant differences in the level of fructan accumulation. cDNA sequences of putative fructosyltransferase (FT), fructan exohydrolase (FEH), and cold...... acclimation protein (CAP) genes were identified and cloned. In agreement with a function in fructan biosynthesis, transcription of a putative sucrose:fructan 6-fructosyltransferase (Pp6-SFT) gene was induced during cold acclimation and fructan accumulation in all three P. pratensis varieties. Transcription...

  12. Association Analysis Suggests SOD2 as a Newly Identified Candidate Gene Associated With Leprosy Susceptibility.

    Science.gov (United States)

    Ramos, Geovana Brotto; Salomão, Heloisa; Francio, Angela Schneider; Fava, Vinícius Medeiros; Werneck, Renata Iani; Mira, Marcelo Távora

    2016-08-01

    Genetic studies have identified several genes and genomic regions contributing to the control of host susceptibility to leprosy. Here, we test variants of the positional and functional candidate gene SOD2 for association with leprosy in 2 independent population samples. Family-based analysis revealed an association between leprosy and allele G of marker rs295340 (P = .042) and borderline evidence of an association between leprosy and alleles C and A of markers rs4880 (P = .077) and rs5746136 (P = .071), respectively. Findings were validated in an independent case-control sample for markers rs295340 (P = .049) and rs4880 (P = .038). These results suggest SOD2 as a newly identified gene conferring susceptibility to leprosy. © The Author 2016. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail journals.permissions@oup.com.

  13. Effect of some candidate genes on meat characteristics of three cattle breeds

    Directory of Open Access Journals (Sweden)

    Alessio Valentini

    2010-01-01

    Full Text Available With the aim to assess if some molecular markers can help to select animals for meat characteristics, we studied 84 individuals equally representing the Marchigiana, Maremmana, and Holstein Friesian cattle breeds genotyped at 288 SNPs located within candidate genes. Several SNPs were found associated with meat quality parameters but with P which was higher than the Bonferroni threshold. However, several SNPs had a low P at different times during meat maturation, suggesting their involvement in the meat quality variation. Of particular interest for the biological role and potential for selection were: cathepsin G affecting MFI, IGF1R affecting pH and collagen XVIII affecting colour.

  14. Genome-wide survey and developmental expression mapping of zebrafish SET domain-containing genes.

    Directory of Open Access Journals (Sweden)

    Xiao-Jian Sun

    Full Text Available SET domain-containing proteins represent an evolutionarily conserved family of epigenetic regulators, which are responsible for most histone lysine methylation. Since some of these genes have been revealed to be essential for embryonic development, we propose that the zebrafish, a vertebrate model organism possessing many advantages for developmental studies, can be utilized to study the biological functions of these genes and the related epigenetic mechanisms during early development. To this end, we have performed a genome-wide survey of zebrafish SET domain genes. 58 genes total have been identified. Although gene duplication events give rise to several lineage-specific paralogs, clear reciprocal orthologous relationship reveals high conservation between zebrafish and human SET domain genes. These data were further subject to an evolutionary analysis ranging from yeast to human, leading to the identification of putative clusters of orthologous groups (COGs of this gene family. By means of whole-mount mRNA in situ hybridization strategy, we have also carried out a developmental expression mapping of these genes. A group of maternal SET domain genes, which are implicated in the programming of histone modification states in early development, have been identified and predicted to be responsible for all known sites of SET domain-mediated histone methylation. Furthermore, some genes show specific expression patterns in certain tissues at certain stages, suggesting the involvement of epigenetic mechanisms in the development of these systems. These results provide a global view of zebrafish SET domain histone methyltransferases in evolutionary and developmental dimensions and pave the way for using zebrafish to systematically study the roles of these genes during development.

  15. The Association Study between Twenty One Polymorphisms in Seven Candidate Genes and Coronary Heart Diseases in Chinese Han Population.

    Directory of Open Access Journals (Sweden)

    Barrak F Alobeidy

    Full Text Available Previous genome-wide association studies (GWAS in multiple populations identified several genetic loci for coronary heart diseases (CHD. Here we utilized a 2-stage candidate gene association strategy in Chinese Han population to shed light on the putative association between several metabolic-related candidate genes and CHD. At the 1(st stage, 190 patients with CHD and 190 controls were genotyped through the MassARRAY platform. At the 2(nd stage, a larger sample including 400 patients and 392 controls was genotyped by the High Resolution Melt (HRM method to confirm or rule out the associations with CHD. MLXIP expression level was quantified by the real time PCR in 65 peripheral blood samples. From the 21 studied single nucleotide polymorphisms (SNPs of seven candidate genes: MLXIPL, MLXIP, MLX, ADIPOR1, VDR, SREBF1 and NR1H3, only one tag SNP rs4758685 (T→C was found to be statistically associated with CHD (P-value = 0.02, Odds ratio (OR of 0.83. After adjustment for the age, sex, lipid levels and diabetes, the association remained significant (P-value = 0.03. After adjustment for the hypertension, P-value became 0.20 although there was a significant difference in the allele distribution between the CHD patients with hypertension and the controls (P-value = 0.04, 406 vs 582. In conclusion, among the 21 tested SNPs, we identified a novel association between rs4758685 of MLXIP gene and CHD. The C allele of common variant rs4758685 interacted with hypertension, and was found to be protective against CHD in both allelic and genotypic models in Chinese Han population.

  16. Candidate gene analyses of 3-dimensional dentoalveolar phenotypes in subjects with malocclusion.

    Science.gov (United States)

    Weaver, Cole A; Miller, Steven F; da Fontoura, Clarissa S G; Wehby, George L; Amendt, Brad A; Holton, Nathan E; Allareddy, Veeratrishul; Southard, Thomas E; Moreno Uribe, Lina M

    2017-03-01

    Genetic studies of malocclusion etiology have identified 4 deleterious mutations in genes DUSP6,ARHGAP21, FGF23, and ADAMTS1 in familial Class III cases. Although these variants may have large impacts on Class III phenotypic expression, their low frequency (common genetic variations in craniofacial candidate genes and 3-dimensional dentoalveolar phenotypes in patients with malocclusion. Pretreatment dental casts or cone-beam computed tomographic images from 300 healthy subjects were digitized with 48 landmarks. The 3-dimensional coordinate data were submitted to a geometric morphometric approach along with principal component analysis to generate continuous phenotypes including symmetric and asymmetric components of dentoalveolar shape variation, fluctuating asymmetry, and size. The subjects were genotyped for 222 single-nucleotide polymorphisms in 82 genes/loci, and phenotpye-genotype associations were tested via multivariate linear regression. Principal component analysis of symmetric variation identified 4 components that explained 68% of the total variance and depicted anteroposterior, vertical, and transverse dentoalveolar discrepancies. Suggestive associations (P centroid size, a proxy for dentoalveolar size variation with 4p16.1 and SNAI1. Specific genetic pathways associated with 3-dimensional dentoalveolar phenotypic variation in malocclusions were identified. Copyright © 2016 American Association of Orthodontists. Published by Elsevier Inc. All rights reserved.

  17. Large-scale evaluation of candidate genes identifies associations between VEGF polymorphisms and bladder cancer risk.

    Directory of Open Access Journals (Sweden)

    Montserrat García-Closas

    2007-02-01

    Full Text Available Common genetic variation could alter the risk for developing bladder cancer. We conducted a large-scale evaluation of single nucleotide polymorphisms (SNPs in candidate genes for cancer to identify common variants that influence bladder cancer risk. An Illumina GoldenGate assay was used to genotype 1,433 SNPs within or near 386 genes in 1,086 cases and 1,033 controls in Spain. The most significant finding was in the 5' UTR of VEGF (rs25648, p for likelihood ratio test, 2 degrees of freedom = 1 x 10(-5. To further investigate the region, we analyzed 29 additional SNPs in VEGF, selected to saturate the promoter and 5' UTR and to tag common genetic variation in this gene. Three additional SNPs in the promoter region (rs833052, rs1109324, and rs1547651 were associated with increased risk for bladder cancer: odds ratio (95% confidence interval: 2.52 (1.06-5.97, 2.74 (1.26-5.98, and 3.02 (1.36-6.63, respectively; and a polymorphism in intron 2 (rs3024994 was associated with reduced risk: 0.65 (0.46-0.91. Two of the promoter SNPs and the intron 2 SNP showed linkage disequilibrium with rs25648. Haplotype analyses revealed three blocks of linkage disequilibrium with significant associations for two blocks including the promoter and 5' UTR (global p = 0.02 and 0.009, respectively. These findings are biologically plausible since VEGF is critical in angiogenesis, which is important for tumor growth, its elevated expression in bladder tumors correlates with tumor progression, and specific 5' UTR haplotypes have been shown to influence promoter activity. Associations between bladder cancer risk and other genes in this report were not robust based on false discovery rate calculations. In conclusion, this large-scale evaluation of candidate cancer genes has identified common genetic variants in the regulatory regions of VEGF that could be associated with bladder cancer risk.

  18. Integrating genome-wide association study and expression quantitative trait loci data identifies multiple genes and gene set associated with neuroticism.

    Science.gov (United States)

    Fan, Qianrui; Wang, Wenyu; Hao, Jingcan; He, Awen; Wen, Yan; Guo, Xiong; Wu, Cuiyan; Ning, Yujie; Wang, Xi; Wang, Sen; Zhang, Feng

    2017-08-01

    Neuroticism is a fundamental personality trait with significant genetic determinant. To identify novel susceptibility genes for neuroticism, we conducted an integrative analysis of genomic and transcriptomic data of genome wide association study (GWAS) and expression quantitative trait locus (eQTL) study. GWAS summary data was driven from published studies of neuroticism, totally involving 170,906 subjects. eQTL dataset containing 927,753 eQTLs were obtained from an eQTL meta-analysis of 5311 samples. Integrative analysis of GWAS and eQTL data was conducted by summary data-based Mendelian randomization (SMR) analysis software. To identify neuroticism associated gene sets, the SMR analysis results were further subjected to gene set enrichment analysis (GSEA). The gene set annotation dataset (containing 13,311 annotated gene sets) of GSEA Molecular Signatures Database was used. SMR single gene analysis identified 6 significant genes for neuroticism, including MSRA (p value=2.27×10 -10 ), MGC57346 (p value=6.92×10 -7 ), BLK (p value=1.01×10 -6 ), XKR6 (p value=1.11×10 -6 ), C17ORF69 (p value=1.12×10 -6 ) and KIAA1267 (p value=4.00×10 -6 ). Gene set enrichment analysis observed significant association for Chr8p23 gene set (false discovery rate=0.033). Our results provide novel clues for the genetic mechanism studies of neuroticism. Copyright © 2017. Published by Elsevier Inc.

  19. Computational analysis of TRAPPC9: candidate gene for autosomal recessive non-syndromic mental retardation.

    Science.gov (United States)

    Khattak, Naureen Aslam; Mir, Asif

    2014-01-01

    Mental retardation (MR)/ intellectual disability (ID) is a neuro-developmental disorder characterized by a low intellectual quotient (IQ) and deficits in adaptive behavior related to everyday life tasks such as delayed language acquisition, social skills or self-help skills with onset before age 18. To date, a few genes (PRSS12, CRBN, CC2D1A, GRIK2, TUSC3, TRAPPC9, TECR, ST3GAL3, MED23, MAN1B1, NSUN1) for autosomal-recessive forms of non syndromic MR (NS-ARMR) have been identified and established in various families with ID. The recently reported candidate gene TRAPPC9 was selected for computational analysis to explore its potentially important role in pathology as it is the only gene for ID reported in more than five different familial cases worldwide. YASARA (12.4.1) was utilized to generate three dimensional structures of the candidate gene TRAPPC9. Hybrid structure prediction was employed. Crystal Structure of a Conserved Metalloprotein From Bacillus Cereus (3D19-C) was selected as best suitable template using position-specific iteration-BLAST. Template (3D19-C) parameters were based on E-value, Z-score and resolution and quality score of 0.32, -1.152, 2.30°A and 0.684 respectively. Model reliability showed 93.1% residues placed in the most favored region with 96.684 quality factor, and overall 0.20 G-factor (dihedrals 0.06 and covalent 0.39 respectively). Protein-Protein docking analysis demonstrated that TRAPPC9 showed strong interactions of the amino acid residues S(253), S(251), Y(256), G(243), D(131) with R(105), Q(425), W(226), N(255), S(233), its functional partner 1KBKB. Protein-protein interacting residues could facilitate the exploration of structural and functional outcomes of wild type and mutated TRAPCC9 protein. Actively involved residues can be used to elucidate the binding properties of the protein, and to develop drug therapy for NS-ARMR patients.

  20. QTL-seq for rapid identification of candidate genes for flowering time in broccoli × cabbage.

    Science.gov (United States)

    Shu, Jinshuai; Liu, Yumei; Zhang, Lili; Li, Zhansheng; Fang, Zhiyuan; Yang, Limei; Zhuang, Mu; Zhang, Yangyong; Lv, Honghao

    2018-04-01

    A major QTL controlling early flowering in broccoli × cabbage was identified by marker analysis and next-generation sequencing, corresponding to GRF6 gene conditioning flowering time in Arabidopsis. Flowering is an important agronomic trait for hybrid production in broccoli and cabbage, but the genetic mechanism underlying this process is unknown. In this study, segregation analysis with BC 1 P1, BC 1 P2, F 2 , and F 2:3 populations derived from a cross between two inbred lines "195" (late-flowering) and "93219" (early flowering) suggested that flowering time is a quantitative trait. Next, employing a next-generation sequencing-based whole-genome QTL-seq strategy, we identified a major genomic region harboring a robust flowering time QTL using an F 2 mapping population, designated Ef2.1 on cabbage chromosome 2 for early flowering. Ef2.1 was further validated by indel (insertion or deletion) marker-based classical QTL mapping, explaining 51.5% (LOD = 37.67) and 54.0% (LOD = 40.5) of the phenotypic variation in F 2 and F 2:3 populations, respectively. Combined QTL-seq and classical QTL analysis narrowed down Ef1.1 to a 228-kb genomic region containing 29 genes. A cabbage gene, Bol024659, was identified in this region, which is a homolog of GRF6, a major gene regulating flowering in Arabidopsis, and was designated BolGRF6. qRT-PCR study of the expression level of BolGRF6 revealed significantly higher expression in the early flowering genotypes. Taken together, our results provide support for BolGRF6 as a possible candidate gene for early flowering in the broccoli line 93219. The identified candidate genomic regions and genes may be useful for molecular breeding to improve broccoli and cabbage flowering times.

  1. A physical map of the heterozygous grapevine 'Cabernet Sauvignon' allows mapping candidate genes for disease resistance

    Directory of Open Access Journals (Sweden)

    Scalabrin Simone

    2008-06-01

    Full Text Available Abstract Background Whole-genome physical maps facilitate genome sequencing, sequence assembly, mapping of candidate genes, and the design of targeted genetic markers. An automated protocol was used to construct a Vitis vinifera 'Cabernet Sauvignon' physical map. The quality of the result was addressed with regard to the effect of high heterozygosity on the accuracy of contig assembly. Its usefulness for the genome-wide mapping of genes for disease resistance, which is an important trait for grapevine, was then assessed. Results The physical map included 29,727 BAC clones assembled into 1,770 contigs, spanning 715,684 kbp, and corresponding to 1.5-fold the genome size. Map inflation was due to high heterozygosity, which caused either the separation of allelic BACs in two different contigs, or local mis-assembly in contigs containing BACs from the two haplotypes. Genetic markers anchored 395 contigs or 255,476 kbp to chromosomes. The fully automated assembly and anchorage procedures were validated by BAC-by-BAC blast of the end sequences against the grape genome sequence, unveiling 7.3% of chimerical contigs. The distribution across the physical map of candidate genes for non-host and host resistance, and for defence signalling pathways was then studied. NBS-LRR and RLK genes for host resistance were found in 424 contigs, 133 of them (32% were assigned to chromosomes, on which they are mostly organised in clusters. Non-host and defence signalling genes were found in 99 contigs dispersed without a discernable pattern across the genome. Conclusion Despite some limitations that interfere with the correct assembly of heterozygous clones into contigs, the 'Cabernet Sauvignon' physical map is a useful and reliable intermediary step between a genetic map and the genome sequence. This tool was successfully exploited for a quick mapping of complex families of genes, and it strengthened previous clues of co-localisation of major NBS-LRR clusters and

  2. Transcription status of vaccine candidate genes of Plasmodium falciparum during the hepatic phase of its life cycle.

    NARCIS (Netherlands)

    Bodescot, M.; Silvie, O.; Siau, A.; Refour, P.; Pino, P.; Franetich, J.F.; Hannoun, L.; Sauerwein, R.W.; Mazier, D.

    2004-01-01

    The CSP, EMP2/MESA, MSP2, MSP3, MSP5, RAP1, RAP2, RESA1, SERA1 and SSP2/TRAP genes of Plasmodium falciparum are vaccine candidates. The hepatic phase of the infection is of major interest due to the protection induced by immunization with radiation-attenuated sporozoites. We therefore performed

  3. Molecular cloning of the potato Gro1-4 gene conferring resistance to pathotype Ro1 of the root cyst nematode Globodera rostochiensis, based on a candidate gene approach.

    Science.gov (United States)

    Paal, Jürgen; Henselewski, Heike; Muth, Jost; Meksem, Khalid; Menéndez, Cristina M; Salamini, Francesco; Ballvora, Agim; Gebhardt, Christiane

    2004-04-01

    The endoparasitic root cyst nematode Globodera rostochiensis causes considerable damage in potato cultivation. In the past, major genes for nematode resistance have been introgressed from related potato species into cultivars. Elucidating the molecular basis of resistance will contribute to the understanding of nematode-plant interactions and assist in breeding nematode-resistant cultivars. The Gro1 resistance locus to G. rostochiensis on potato chromosome VII co-localized with a resistance-gene-like (RGL) DNA marker. This marker was used to isolate from genomic libraries 15 members of a closely related candidate gene family. Analysis of inheritance, linkage mapping, and sequencing reduced the number of candidate genes to three. Complementation analysis by stable potato transformation showed that the gene Gro1-4 conferred resistance to G. rostochiensis pathotype Ro1. Gro1-4 encodes a protein of 1136 amino acids that contains Toll-interleukin 1 receptor (TIR), nucleotide-binding (NB), leucine-rich repeat (LRR) homology domains and a C-terminal domain with unknown function. The deduced Gro1-4 protein differed by 29 amino acid changes from susceptible members of the Gro1 gene family. Sequence characterization of 13 members of the Gro1 gene family revealed putative regulatory elements and a variable microsatellite in the promoter region, insertion of a retrotransposon-like element in the first intron, and a stop codon in the NB coding region of some genes. Sequence analysis of RT-PCR products showed that Gro1-4 is expressed, among other members of the family including putative pseudogenes, in non-infected roots of nematode-resistant plants. RT-PCR also demonstrated that members of the Gro1 gene family are expressed in most potato tissues.

  4. Construction of an American mink Bacterial Artificial Chromosome (BAC library and sequencing candidate genes important for the fur industry

    Directory of Open Access Journals (Sweden)

    Christensen Knud

    2011-07-01

    Full Text Available Abstract Background Bacterial artificial chromosome (BAC libraries continue to be invaluable tools for the genomic analysis of complex organisms. Complemented by the newly and fast growing deep sequencing technologies, they provide an excellent source of information in genomics projects. Results Here, we report the construction and characterization of the CHORI-231 BAC library constructed from a Danish-farmed, male American mink (Neovison vison. The library contains approximately 165,888 clones with an average insert size of 170 kb, representing approximately 10-fold coverage. High-density filters, each consisting of 18,432 clones spotted in duplicate, have been produced for hybridization screening and are publicly available. Overgo probes derived from expressed sequence tags (ESTs, representing 21 candidate genes for traits important for the mink industry, were used to screen the BAC library. These included candidate genes for coat coloring, hair growth and length, coarseness, and some receptors potentially involved in viral diseases in mink. The extensive screening yielded positive results for 19 of these genes. Thirty-five clones corresponding to 19 genes were sequenced using 454 Roche, and large contigs (184 kb in average were assembled. Knowing the complete sequences of these candidate genes will enable confirmation of the association with a phenotype and the finding of causative mutations for the targeted phenotypes. Additionally, 1577 BAC clones were end sequenced; 2505 BAC end sequences (80% of BACs were obtained. An excess of 2 Mb has been analyzed, thus giving a snapshot of the mink genome. Conclusions The availability of the CHORI-321 American mink BAC library will aid in identification of genes and genomic regions of interest. We have demonstrated how the library can be used to identify specific genes of interest, develop genetic markers, and for BAC end sequencing and deep sequencing of selected clones. To our knowledge, this is the

  5. The WRKY Transcription Factor Family in Citrus: Valuable and Useful Candidate Genes for Citrus Breeding.

    Science.gov (United States)

    Ayadi, M; Hanana, M; Kharrat, N; Merchaoui, H; Marzoug, R Ben; Lauvergeat, V; Rebaï, A; Mzid, R

    2016-10-01

    WRKY transcription factors belong to a large family of plant transcriptional regulators whose members have been reported to be involved in a wide range of biological roles including plant development, adaptation to environmental constraints and response to several diseases. However, little or poor information is available about WRKY's in Citrus. The recent release of completely assembled genomes sequences of Citrus sinensis and Citrus clementina and the availability of ESTs sequences from other citrus species allowed us to perform a genome survey for Citrus WRKY proteins. In the present study, we identified 100 WRKY members from C. sinensis (51), C. clementina (48) and Citrus unshiu (1), and analyzed their chromosomal distribution, gene structure, gene duplication, syntenic relation and phylogenetic analysis. A phylogenetic tree of 100 Citrus WRKY sequences with their orthologs from Arabidopsis has distinguished seven groups. The CsWRKY genes were distributed across all ten sweet orange chromosomes. A comprehensive approach and an integrative analysis of Citrus WRKY gene expression revealed variable profiles of expression within tissues and stress conditions indicating functional diversification. Thus, candidate Citrus WRKY genes have been proposed as potentially involved in fruit acidification, essential oil biosynthesis and abiotic/biotic stress tolerance. Our results provided essential prerequisites for further WRKY genes cloning and functional analysis with an aim of citrus crop improvement.

  6. Genome-wide association and pathway analysis of feed efficiency in pigs reveal candidate genes and pathways for residual feed intake

    DEFF Research Database (Denmark)

    Do, Duy Ngoc; Strathe, Anders Bjerring; Ostersen, Tage

    2014-01-01

    Residual feed intake (RFI) is a complex trait that is economically important for livestock production; however, the genetic and biological mechanisms regulating RFI are largely unknown in pigs. Therefore, the study aimed to identify single nucleotide polymorphisms (SNPs), candidate genes and biol...... revealed key genes and genetic variants that control feed efficiency that could potentially be useful for genetic selection of more feed efficient pigs....

  7. Genome-wide transcriptome profiling of black poplar (Populus nigra L.) under boron toxicity revealed candidate genes responsible in boron uptake, transport and detoxification.

    Science.gov (United States)

    Yıldırım, Kubilay; Uylaş, Senem

    2016-12-01

    Boron (B) is an essential nutrient for normal growth of plants. Despite its low abundance in soils, it could be highly toxic to plants in especially arid and semi-arid environments. Poplars are known to be tolerant species to B toxicity and accumulation. However, physiological and gene regulation responses of these trees to B toxicity have not been investigated yet. Here, B accumulation and tolerance level of black poplar clones were firstly tested in the current study. Rooted cutting of these clones were treated with elevated B toxicity to select the most B accumulator and tolerant genotype. Then we carried out a microarray based transcriptome experiment on the leaves and roots of this genotype to find out transcriptional networks, genes and molecular mechanisms behind B toxicity tolerance. The results of the study indicated that black poplar is quite suitable for phytoremediation of B pollution. It could resist 15 ppm soil B content and >1500 ppm B accumulation in leaves, which are highly toxic concentrations for almost all agricultural plants. Transcriptomics results of study revealed totally 1625 and 1419 altered probe sets under 15 ppm B toxicity in leaf and root tissues, respectively. The highest induction were recorded for the probes sets annotated to tyrosine aminotransferase, ATP binding cassette transporters, glutathione S transferases and metallochaperone proteins. Strong up regulation of these genes attributed to internal excretion of B into the cell vacuole and existence of B detoxification processes in black poplar. Many other candidate genes functional in signalling, gene regulation, antioxidation, B uptake and transport processes were also identified in this hyper B accumulator plant for the first time with the current study. Copyright © 2016 Elsevier Masson SAS. All rights reserved.

  8. Patterns of population differentiation of candidate genes for cardiovascular disease

    Directory of Open Access Journals (Sweden)

    Ding Keyue

    2007-07-01

    Full Text Available Abstract Background The basis for ethnic differences in cardiovascular disease (CVD susceptibility is not fully understood. We investigated patterns of population differentiation (FST of a set of genes in etiologic pathways of CVD among 3 ethnic groups: Yoruba in Nigeria (YRI, Utah residents with European ancestry (CEU, and Han Chinese (CHB + Japanese (JPT. We identified 37 pathways implicated in CVD based on the PANTHER classification and 416 genes in these pathways were further studied; these genes belonged to 6 biological processes (apoptosis, blood circulation and gas exchange, blood clotting, homeostasis, immune response, and lipoprotein metabolism. Genotype data were obtained from the HapMap database. Results We calculated FST for 15,559 common SNPs (minor allele frequency ≥ 0.10 in at least one population in genes that co-segregated among the populations, as well as an average-weighted FST for each gene. SNPs were classified as putatively functional (non-synonymous and untranslated regions or non-functional (intronic and synonymous sites. Mean FST values for common putatively functional variants were significantly higher than FST values for nonfunctional variants. A significant variation in FST was also seen based on biological processes; the processes of 'apoptosis' and 'lipoprotein metabolism' showed an excess of genes with high FST. Thus, putative functional SNPs in genes in etiologic pathways for CVD show greater population differentiation than non-functional SNPs and a significant variance of FST values was noted among pairwise population comparisons for different biological processes. Conclusion These results suggest a possible basis for varying susceptibility to CVD among ethnic groups.

  9. Gene set of nuclear-encoded mitochondrial regulators is enriched for common inherited variation in obesity.

    Directory of Open Access Journals (Sweden)

    Nadja Knoll

    Full Text Available There are hints of an altered mitochondrial function in obesity. Nuclear-encoded genes are relevant for mitochondrial function (3 gene sets of known relevant pathways: (1 16 nuclear regulators of mitochondrial genes, (2 91 genes for oxidative phosphorylation and (3 966 nuclear-encoded mitochondrial genes. Gene set enrichment analysis (GSEA showed no association with type 2 diabetes mellitus in these gene sets. Here we performed a GSEA for the same gene sets for obesity. Genome wide association study (GWAS data from a case-control approach on 453 extremely obese children and adolescents and 435 lean adult controls were used for GSEA. For independent confirmation, we analyzed 705 obesity GWAS trios (extremely obese child and both biological parents and a population-based GWAS sample (KORA F4, n = 1,743. A meta-analysis was performed on all three samples. In each sample, the distribution of significance levels between the respective gene set and those of all genes was compared using the leading-edge-fraction-comparison test (cut-offs between the 50(th and 95(th percentile of the set of all gene-wise corrected p-values as implemented in the MAGENTA software. In the case-control sample, significant enrichment of associations with obesity was observed above the 50(th percentile for the set of the 16 nuclear regulators of mitochondrial genes (p(GSEA,50 = 0.0103. This finding was not confirmed in the trios (p(GSEA,50 = 0.5991, but in KORA (p(GSEA,50 = 0.0398. The meta-analysis again indicated a trend for enrichment (p(MAGENTA,50 = 0.1052, p(MAGENTA,75 = 0.0251. The GSEA revealed that weak association signals for obesity might be enriched in the gene set of 16 nuclear regulators of mitochondrial genes.

  10. Refinement of the NHS locus on chromosome Xp22.13 and analysis of five candidate genes.

    Science.gov (United States)

    Toutain, Annick; Dessay, Benoît; Ronce, Nathalie; Ferrante, Maria-Immacolata; Tranchemontagne, Julie; Newbury-Ecob, Ruth; Wallgren-Pettersson, Carina; Burn, John; Kaplan, Josseline; Rossi, Annick; Russo, Silvia; Walpole, Ian; Hartsfield, James K; Oyen, Nina; Nemeth, Andrea; Bitoun, Pierre; Trump, Dorothy; Moraine, Claude; Franco, Brunella

    2002-09-01

    Nance-Horan syndrome (NHS) is an X-linked condition characterised by congenital cataracts, dental abnormalities, dysmorphic features, and mental retardation in some cases. Previous studies have mapped the disease gene to a 2 cM interval on Xp22.2 between DXS43 and DXS999. We report additional linkage data resulting from the analysis of eleven independent NHS families. A maximum lod score of 9.94 (theta=0.00) was obtained at the RS1 locus and a recombination with locus DXS1195 on the telomeric side was observed in two families, thus refining the location of the gene to an interval of around 1 Mb on Xp22.13. Direct sequencing or SSCP analysis of the coding exons of five genes (SCML1, SCML2, STK9, RS1 and PPEF1), considered as candidate genes on the basis of their location in the critical interval, failed to detect any mutation in 12 unrelated NHS patients, thus making it highly unlikely that these genes are implicated in NHS.

  11. Dynamic compression of chondrocyte-agarose constructs reveals new candidate mechanosensitive genes.

    Directory of Open Access Journals (Sweden)

    Carole Bougault

    Full Text Available Articular cartilage is physiologically exposed to repeated loads. The mechanical properties of cartilage are due to its extracellular matrix, and homeostasis is maintained by the sole cell type found in cartilage, the chondrocyte. Although mechanical forces clearly control the functions of articular chondrocytes, the biochemical pathways that mediate cellular responses to mechanical stress have not been fully characterised. The aim of our study was to examine early molecular events triggered by dynamic compression in chondrocytes. We used an experimental system consisting of primary mouse chondrocytes embedded within an agarose hydrogel; embedded cells were pre-cultured for one week and subjected to short-term compression experiments. Using Western blots, we demonstrated that chondrocytes maintain a differentiated phenotype in this model system and reproduce typical chondrocyte-cartilage matrix interactions. We investigated the impact of dynamic compression on the phosphorylation state of signalling molecules and genome-wide gene expression. After 15 min of dynamic compression, we observed transient activation of ERK1/2 and p38 (members of the mitogen-activated protein kinase (MAPK pathways and Smad2/3 (members of the canonical transforming growth factor (TGF-β pathways. A microarray analysis performed on chondrocytes compressed for 30 min revealed that only 20 transcripts were modulated more than 2-fold. A less conservative list of 325 modulated genes included genes related to the MAPK and TGF-β pathways and/or known to be mechanosensitive in other biological contexts. Of these candidate mechanosensitive genes, 85% were down-regulated. Down-regulation may therefore represent a general control mechanism for a rapid response to dynamic compression. Furthermore, modulation of transcripts corresponding to different aspects of cellular physiology was observed, such as non-coding RNAs or primary cilium. This study provides new insight into how

  12. Association between SNPs within candidate genes and compounds related to boar taint and reproduction

    DEFF Research Database (Denmark)

    Moe, Maren; Lien, Sigbjørn; Aasmundstad, Torunn

    2009-01-01

    BACKGROUND: Boar taint is an unpleasant odour and flavour of the meat from some uncastrated male pigs primarily caused by elevated levels of androstenone and skatole in adipose tissue. Androstenone is produced in the same biochemical pathway as testosterone and estrogens, which represents...... of this study was to detect SNPs in boar taint candidate genes and to perform association studies for both single SNPs and haplotypes with levels of boar taint compounds and phenotypes related to reproduction. RESULTS: An association study involving 275 SNPs in 121 genes and compounds related to boar taint...... and reproduction were carried out in Duroc and Norwegian Landrace boars. Phenotypes investigated were levels of androstenone, skatole and indole in adipose tissue, levels of androstenone, testosterone, estrone sulphate and 17beta-estradiol in plasma, and length of bulbo urethralis gland. The SNPs were genotyped...

  13. Optimal structural inference of signaling pathways from unordered and overlapping gene sets.

    Science.gov (United States)

    Acharya, Lipi R; Judeh, Thair; Wang, Guangdi; Zhu, Dongxiao

    2012-02-15

    A plethora of bioinformatics analysis has led to the discovery of numerous gene sets, which can be interpreted as discrete measurements emitted from latent signaling pathways. Their potential to infer signaling pathway structures, however, has not been sufficiently exploited. Existing methods accommodating discrete data do not explicitly consider signal cascading mechanisms that characterize a signaling pathway. Novel computational methods are thus needed to fully utilize gene sets and broaden the scope from focusing only on pairwise interactions to the more general cascading events in the inference of signaling pathway structures. We propose a gene set based simulated annealing (SA) algorithm for the reconstruction of signaling pathway structures. A signaling pathway structure is a directed graph containing up to a few hundred nodes and many overlapping signal cascades, where each cascade represents a chain of molecular interactions from the cell surface to the nucleus. Gene sets in our context refer to discrete sets of genes participating in signal cascades, the basic building blocks of a signaling pathway, with no prior information about gene orderings in the cascades. From a compendium of gene sets related to a pathway, SA aims to search for signal cascades that characterize the optimal signaling pathway structure. In the search process, the extent of overlap among signal cascades is used to measure the optimality of a structure. Throughout, we treat gene sets as random samples from a first-order Markov chain model. We evaluated the performance of SA in three case studies. In the first study conducted on 83 KEGG pathways, SA demonstrated a significantly better performance than Bayesian network methods. Since both SA and Bayesian network methods accommodate discrete data, use a 'search and score' network learning strategy and output a directed network, they can be compared in terms of performance and computational time. In the second study, we compared SA and

  14. Gene expression signature analysis identifies vorinostat as a candidate therapy for gastric cancer.

    Directory of Open Access Journals (Sweden)

    Sofie Claerhout

    Full Text Available Gastric cancer continues to be one of the deadliest cancers in the world and therefore identification of new drugs targeting this type of cancer is thus of significant importance. The purpose of this study was to identify and validate a therapeutic agent which might improve the outcomes for gastric cancer patients in the future.Using microarray technology, we generated a gene expression profile of human gastric cancer-specific genes from human gastric cancer tissue samples. We used this profile in the Broad Institute's Connectivity Map analysis to identify candidate therapeutic compounds for gastric cancer. We found the histone deacetylase inhibitor vorinostat as the lead compound and thus a potential therapeutic drug for gastric cancer. Vorinostat induced both apoptosis and autophagy in gastric cancer cell lines. Pharmacological and genetic inhibition of autophagy however, increased the therapeutic efficacy of vorinostat, indicating that a combination of vorinostat with autophagy inhibitors may therapeutically be more beneficial. Moreover, gene expression analysis of gastric cancer identified a collection of genes (ITGB5, TYMS, MYB, APOC1, CBX5, PLA2G2A, and KIF20A whose expression was elevated in gastric tumor tissue and downregulated more than 2-fold by vorinostat treatment in gastric cancer cell lines. In contrast, SCGB2A1, TCN1, CFD, APLP1, and NQO1 manifested a reversed pattern.We showed that analysis of gene expression signature may represent an emerging approach to discover therapeutic agents for gastric cancer, such as vorinostat. The observation of altered gene expression after vorinostat treatment may provide the clue to identify the molecular mechanism of vorinostat and those patients likely to benefit from vorinostat treatment.

  15. Developing Potential Candidates of Preclinical Preeclampsia

    Directory of Open Access Journals (Sweden)

    Sandra Founds

    2015-11-01

    Full Text Available The potential for developing molecules of interest in preclinical preeclampsia from candidate genes that were discovered on gene expression microarray analysis has been challenged by limited access to additional first trimester trophoblast and decidual tissues. The question of whether these candidates encode secreted proteins that may be detected in maternal circulation early in pregnancy has been investigated using various proteomic methods. Pilot studies utilizing mass spectrometry based proteomic assays, along with enzyme linked immunosorbent assays (ELISAs, and Western immunoblotting in first trimester samples are reported. The novel targeted mass spectrometry methods led to robust multiple reaction monitoring assays. Despite detection of several candidates in early gestation, challenges persist. Future antibody-based studies may lead to a novel multiplex protein panel for screening or detection to prevent or mitigate preeclampsia.

  16. QTL mapping and transcriptome analysis of cowpea reveals candidate genes for root-knot nematode resistance.

    Science.gov (United States)

    Santos, Jansen Rodrigo Pereira; Ndeve, Arsenio Daniel; Huynh, Bao-Lam; Matthews, William Charles; Roberts, Philip Alan

    2018-01-01

    Cowpea is one of the most important food and forage legumes in drier regions of the tropics and subtropics. However, cowpea yield worldwide is markedly below the known potential due to abiotic and biotic stresses, including parasitism by root-knot nematodes (Meloidogyne spp., RKN). Two resistance genes with dominant effect, Rk and Rk2, have been reported to provide resistance against RKN in cowpea. Despite their description and use in breeding for resistance to RKN and particularly genetic mapping of the Rk locus, the exact genes conferring resistance to RKN remain unknown. In the present work, QTL mapping using recombinant inbred line (RIL) population 524B x IT84S-2049 segregating for a newly mapped locus and analysis of the transcriptome changes in two cowpea near-isogenic lines (NIL) were used to identify candidate genes for Rk and the newly mapped locus. A major QTL, designated QRk-vu9.1, associated with resistance to Meloidogyne javanica reproduction, was detected and mapped on linkage group LG9 at position 13.37 cM using egg production data. Transcriptome analysis on resistant and susceptible NILs 3 and 9 days after inoculation revealed up-regulation of 109 and 98 genes and down-regulation of 110 and 89 genes, respectively, out of 19,922 unique genes mapped to the common bean reference genome. Among the differentially expressed genes, four and nine genes were found within the QRk-vu9.1 and QRk-vu11.1 QTL intervals, respectively. Six of these genes belong to the TIR-NBS-LRR family of resistance genes and three were upregulated at one or more time-points. Quantitative RT-PCR validated gene expression to be positively correlated with RNA-seq expression pattern for eight genes. Future functional analysis of these cowpea genes will enhance our understanding of Rk-mediated resistance and identify the specific gene responsible for the resistance.

  17. Mechanism-based biomarker gene sets for glutathione depletion-related hepatotoxicity in rats

    International Nuclear Information System (INIS)

    Gao Weihua; Mizukawa, Yumiko; Nakatsu, Noriyuki; Minowa, Yosuke; Yamada, Hiroshi; Ohno, Yasuo; Urushidani, Tetsuro

    2010-01-01

    Chemical-induced glutathione depletion is thought to be caused by two types of toxicological mechanisms: PHO-type glutathione depletion [glutathione conjugated with chemicals such as phorone (PHO) or diethyl maleate (DEM)], and BSO-type glutathione depletion [i.e., glutathione synthesis inhibited by chemicals such as L-buthionine-sulfoximine (BSO)]. In order to identify mechanism-based biomarker gene sets for glutathione depletion in rat liver, male SD rats were treated with various chemicals including PHO (40, 120 and 400 mg/kg), DEM (80, 240 and 800 mg/kg), BSO (150, 450 and 1500 mg/kg), and bromobenzene (BBZ, 10, 100 and 300 mg/kg). Liver samples were taken 3, 6, 9 and 24 h after administration and examined for hepatic glutathione content, physiological and pathological changes, and gene expression changes using Affymetrix GeneChip Arrays. To identify differentially expressed probe sets in response to glutathione depletion, we focused on the following two courses of events for the two types of mechanisms of glutathione depletion: a) gene expression changes occurring simultaneously in response to glutathione depletion, and b) gene expression changes after glutathione was depleted. The gene expression profiles of the identified probe sets for the two types of glutathione depletion differed markedly at times during and after glutathione depletion, whereas Srxn1 was markedly increased for both types as glutathione was depleted, suggesting that Srxn1 is a key molecule in oxidative stress related to glutathione. The extracted probe sets were refined and verified using various compounds including 13 additional positive or negative compounds, and they established two useful marker sets. One contained three probe sets (Akr7a3, Trib3 and Gstp1) that could detect conjugation-type glutathione depletors any time within 24 h after dosing, and the other contained 14 probe sets that could detect glutathione depletors by any mechanism. These two sets, with appropriate scoring

  18. Presymptomatic Diagnosis of Celiac Disease in Predisposed Children: The Role of Gene Expression Profile.

    Science.gov (United States)

    Galatola, Martina; Cielo, Donatella; Panico, Camilla; Stellato, Pio; Malamisura, Basilio; Carbone, Lorenzo; Gianfrani, Carmen; Troncone, Riccardo; Greco, Luigi; Auricchio, Renata

    2017-09-01

    The prevalence of celiac disease (CD) has increased significantly in recent years, and risk prediction and early diagnosis have become imperative especially in at-risk families. In a previous study, we identified individuals with CD based on the expression profile of a set of candidate genes in peripheral blood monocytes. Here we evaluated the expression of a panel of CD candidate genes in peripheral blood mononuclear cells from at-risk infants long time before any symptom or production of antibodies. We analyzed the gene expression of a set of 9 candidate genes, associated with CD, in 22 human leukocyte antigen predisposed children from at-risk families for CD, studied from birth to 6 years of age. Nine of them developed CD (patients) and 13 did not (controls). We analyzed gene expression at 3 different time points (age matched in the 2 groups): 4-19 months before diagnosis, at the time of CD diagnosis, and after at least 1 year of a gluten-free diet. At similar age points, controls were also evaluated. Three genes (KIAA, TAGAP [T-cell Activation GTPase Activating Protein], and SH2B3 [SH2B Adaptor Protein 3]) were overexpressed in patients, compared with controls, at least 9 months before CD diagnosis. At a stepwise discriminant analysis, 4 genes (RGS1 [Regulator of G-protein signaling 1], TAGAP, TNFSF14 [Tumor Necrosis Factor (Ligand) Superfamily member 14], and SH2B3) differentiate patients from controls before serum antibodies production and clinical symptoms. Multivariate equation correctly classified CD from non-CD children in 95.5% of patients. The expression of a small set of candidate genes in peripheral blood mononuclear cells can predict CD at least 9 months before the appearance of any clinical and serological signs of the disease.

  19. Fine mapping and candidate gene search of quantitative trait loci for growth and obesity using mouse intersubspecific subcongenic intercrosses and exome sequencing.

    Directory of Open Access Journals (Sweden)

    Akira Ishikawa

    Full Text Available Although growth and body composition traits are quantitative traits of medical and agricultural importance, the genetic and molecular basis of those traits remains elusive. Our previous genome-wide quantitative trait locus (QTL analyses in an intersubspecific backcross population between C57BL/6JJcl (B6 and wild Mus musculus castaneus mice revealed a major growth QTL (named Pbwg1 on a proximal region of mouse chromosome 2. Using the B6.Cg-Pbwg1 intersubspecific congenic strain created, we revealed 12 closely linked QTLs for body weight and body composition traits on an approximately 44.1-Mb wild-derived congenic region. In this study, we narrowed down genomic regions harboring three (Pbwg1.12, Pbwg1.3 and Pbwg1.5 of the 12 linked QTLs and searched for possible candidate genes for the QTLs. By phenotypic analyses of F2 intercross populations between B6 and each of four B6.Cg-Pbwg1 subcongenic strains with overlapping and non-overlapping introgressed regions, we physically defined Pbwg1.12 affecting body weight to a 3.8-Mb interval (61.5-65.3 Mb on chromosome 2. We fine-mapped Pbwg1.3 for body length to an 8.0-Mb interval (57.3-65.3 and Pbwg1.5 for abdominal white fat weight to a 2.1-Mb interval (59.4-61.5. The wild-derived allele at Pbwg1.12 and Pbwg1.3 uniquely increased body weight and length despite the fact that the wild mouse has a smaller body size than that of B6, whereas it decreased fat weight at Pbwg1.5. Exome sequencing and candidate gene prioritization suggested that Gcg and Grb14 are putative candidate genes for Pbwg1.12 and that Ly75 and Itgb6 are putative candidate genes for Pbwg1.5. These genes had nonsynonymous SNPs, but the SNPs were predicted to be not harmful to protein functions. These results provide information helpful to identify wild-derived quantitative trait genes causing enhanced growth and resistance to obesity.

  20. Back to the sea twice: identifying candidate plant genes for molecular evolution to marine life

    Directory of Open Access Journals (Sweden)

    Reusch Thorsten BH

    2011-01-01

    Full Text Available Abstract Background Seagrasses are a polyphyletic group of monocotyledonous angiosperms that have adapted to a completely submerged lifestyle in marine waters. Here, we exploit two collections of expressed sequence tags (ESTs of two wide-spread and ecologically important seagrass species, the Mediterranean seagrass Posidonia oceanica (L. Delile and the eelgrass Zostera marina L., which have independently evolved from aquatic ancestors. This replicated, yet independent evolutionary history facilitates the identification of traits that may have evolved in parallel and are possible instrumental candidates for adaptation to a marine habitat. Results In our study, we provide the first quantitative perspective on molecular adaptations in two seagrass species. By constructing orthologous gene clusters shared between two seagrasses (Z. marina and P. oceanica and eight distantly related terrestrial angiosperm species, 51 genes could be identified with detection of positive selection along the seagrass branches of the phylogenetic tree. Characterization of these positively selected genes using KEGG pathways and the Gene Ontology uncovered that these genes are mostly involved in translation, metabolism, and photosynthesis. Conclusions These results provide first insights into which seagrass genes have diverged from their terrestrial counterparts via an initial aquatic stage characteristic of the order and to the derived fully-marine stage characteristic of seagrasses. We discuss how adaptive changes in these processes may have contributed to the evolution towards an aquatic and marine existence.

  1. Back to the sea twice: identifying candidate plant genes for molecular evolution to marine life.

    Science.gov (United States)

    Wissler, Lothar; Codoñer, Francisco M; Gu, Jenny; Reusch, Thorsten B H; Olsen, Jeanine L; Procaccini, Gabriele; Bornberg-Bauer, Erich

    2011-01-12

    Seagrasses are a polyphyletic group of monocotyledonous angiosperms that have adapted to a completely submerged lifestyle in marine waters. Here, we exploit two collections of expressed sequence tags (ESTs) of two wide-spread and ecologically important seagrass species, the Mediterranean seagrass Posidonia oceanica (L.) Delile and the eelgrass Zostera marina L., which have independently evolved from aquatic ancestors. This replicated, yet independent evolutionary history facilitates the identification of traits that may have evolved in parallel and are possible instrumental candidates for adaptation to a marine habitat. In our study, we provide the first quantitative perspective on molecular adaptations in two seagrass species. By constructing orthologous gene clusters shared between two seagrasses (Z. marina and P. oceanica) and eight distantly related terrestrial angiosperm species, 51 genes could be identified with detection of positive selection along the seagrass branches of the phylogenetic tree. Characterization of these positively selected genes using KEGG pathways and the Gene Ontology uncovered that these genes are mostly involved in translation, metabolism, and photosynthesis. These results provide first insights into which seagrass genes have diverged from their terrestrial counterparts via an initial aquatic stage characteristic of the order and to the derived fully-marine stage characteristic of seagrasses. We discuss how adaptive changes in these processes may have contributed to the evolution towards an aquatic and marine existence.

  2. Attenuation and efficacy of human parainfluenza virus type 1 (HPIV1 vaccine candidates containing stabilized mutations in the P/C and L genes

    Directory of Open Access Journals (Sweden)

    Skiadopoulos Mario H

    2007-07-01

    Full Text Available Abstract Background Two recombinant, live attenuated human parainfluenza virus type 1 (rHPIV1 mutant viruses have been developed, using a reverse genetics system, for evaluation as potential intranasal vaccine candidates. These rHPIV1 vaccine candidates have two non-temperature sensitive (non-ts attenuating (att mutations primarily in the P/C gene, namely CR84GHNT553A (two point mutations used together as a set and CΔ170 (a short deletion mutation, and two ts att mutations in the L gene, namely LY942A (a point mutation, and LΔ1710–11 (a short deletion, the last of which has not been previously described. The latter three mutations were specifically designed for increased genetic and phenotypic stability. These mutations were evaluated on the HPIV1 backbone, both individually and in combination, for attenuation, immunogenicity, and protective efficacy in African green monkeys (AGMs. Results The rHPIV1 mutant bearing the novel LΔ1710–11 mutation was highly ts and attenuated in AGMs and was immunogenic and efficacious against HPIV1 wt challenge. The rHPIV1-CR84G/Δ170HNT553ALY942A and rHPIV1-CR84G/Δ170HNT553ALΔ1710–11 vaccine candidates were highly ts, with shut-off temperatures of 38°C and 35°C, respectively, and were highly attenuated in AGMs. Immunization with rHPIV1-CR84G/Δ170HNT553ALY942A protected against HPIV1 wt challenge in both the upper and lower respiratory tracts. In contrast, rHPIV1-CR84G/Δ170HNT553ALΔ1710–11 was not protective in AGMs due to over-attenuation, but it is expected to replicate more efficiently and be more immunogenic in the natural human host. Conclusion The rHPIV1-CR84G/Δ170HNT553ALY942A and rHPIV1-CR84G/Δ170HNT553ALΔ1710–11 vaccine candidates are clearly highly attenuated in AGMs and clinical trials are planned to address safety and immunogenicity in humans.

  3. Gene expression profiling and candidate gene resequencing identifies pathways and mutations important for malignant transformation caused by leukemogenic fusion genes.

    Science.gov (United States)

    Novak, Rachel L; Harper, David P; Caudell, David; Slape, Christopher; Beachy, Sarah H; Aplan, Peter D

    2012-12-01

    NUP98-HOXD13 (NHD13) and CALM-AF10 (CA10) are oncogenic fusion proteins produced by recurrent chromosomal translocations in patients with acute myeloid leukemia (AML). Transgenic mice that express these fusions develop AML with a long latency and incomplete penetrance, suggesting that collaborating genetic events are required for leukemic transformation. We employed genetic techniques to identify both preleukemic abnormalities in healthy transgenic mice as well as collaborating events leading to leukemic transformation. Candidate gene resequencing revealed that 6 of 27 (22%) CA10 AMLs spontaneously acquired a Ras pathway mutation and 8 of 27 (30%) acquired an Flt3 mutation. Two CA10 AMLs acquired an Flt3 internal-tandem duplication, demonstrating that these mutations can be acquired in murine as well as human AML. Gene expression profiles revealed a marked upregulation of Hox genes, particularly Hoxa5, Hoxa9, and Hoxa10 in both NHD13 and CA10 mice. Furthermore, mir196b, which is embedded within the Hoxa locus, was overexpressed in both CA10 and NHD13 samples. In contrast, the Hox cofactors Meis1 and Pbx3 were differentially expressed; Meis1 was increased in CA10 AMLs but not NHD13 AMLs, whereas Pbx3 was consistently increased in NHD13 but not CA10 AMLs. Silencing of Pbx3 in NHD13 cells led to decreased proliferation, increased apoptosis, and decreased colony formation in vitro, suggesting a previously unexpected role for Pbx3 in leukemic transformation. Published by Elsevier Inc.

  4. Glutamatergic and GABAergic gene sets in attention-deficit/hyperactivity disorder

    DEFF Research Database (Denmark)

    Naaijen, Jill; Bralten, Janita; Poelmans, Geert

    2017-01-01

    Attention-deficit/hyperactivity disorder (ADHD) and autism spectrum disorders (ASD) often co-occur. Both are highly heritable; however, it has been difficult to discover genetic risk variants. Glutamate and GABA are main excitatory and inhibitory neurotransmitters in the brain; their balance...... within glutamatergic and GABAergic genes were investigated using the MAGMA software in an ADHD case-only sample (n=931), in which we assessed ASD symptoms and response inhibition on a Stop task. Gene set analysis for ADHD symptom severity, divided into inattention and hyperactivity/impulsivity symptoms...... is essential for proper brain development and functioning. In this study we investigated the role of glutamate and GABA genetics in ADHD severity, autism symptom severity and inhibitory performance, based on gene set analysis, an approach to investigate multiple genetic variants simultaneously. Common variants...

  5. A possible genetic association with chronic fatigue in primary Sjögren's syndrome: a candidate gene study.

    Science.gov (United States)

    Norheim, Katrine Brække; Le Hellard, Stephanie; Nordmark, Gunnel; Harboe, Erna; Gøransson, Lasse; Brun, Johan G; Wahren-Herlenius, Marie; Jonsson, Roland; Omdal, Roald

    2014-02-01

    Fatigue is prevalent and disabling in primary Sjögren's syndrome (pSS). Results from studies in chronic fatigue syndrome (CFS) indicate that genetic variation may influence fatigue. The aim of this study was to investigate single nucleotide polymorphism (SNP) variations in pSS patients with high and low fatigue. A panel of 85 SNPs in 12 genes was selected based on previous studies in CFS. A total of 207 pSS patients and 376 healthy controls were genotyped. One-hundred and ninety-three patients and 70 SNPs in 11 genes were available for analysis after quality control. Patients were dichotomized based on fatigue visual analogue scale (VAS) scores, with VAS fatigue" (n = 53) and VAS ≥50 denominated "high fatigue" (n = 140). We detected signals of association with pSS for one SNP in SLC25A40 (unadjusted p = 0.007) and two SNPs in PKN1 (both p = 0.03) in our pSS case versus control analysis. The association with SLC25A40 was stronger when only pSS high fatigue patients were analysed versus controls (p = 0.002). One SNP in PKN1 displayed an association in the case-only analysis of pSS high fatigue versus pSS low fatigue (p = 0.005). This candidate gene study in pSS did reveal a trend for associations between genetic variation in candidate genes and fatigue. The results will need to be replicated. More research on genetic associations with fatigue is warranted, and future trials should include larger cohorts and multicentre collaborations with sharing of genetic material to increase the statistical power.

  6. Isolation of Resistance Gene Candidates (RGCs) and characterization of an RGC cluster in cassava.

    Science.gov (United States)

    López, C E; Zuluaga, A P; Cooke, R; Delseny, M; Tohme, J; Verdier, V

    2003-08-01

    Plant disease resistance genes (R genes) show significant similarity amongst themselves in terms of both their DNA sequences and structural motifs present in their protein products. Oligonucleotide primers designed from NBS (Nucleotide Binding Site) domains encoded by several R-genes have been used to amplify NBS sequences from the genomic DNA of various plant species, which have been called Resistance Gene Analogues (RGAs) or Resistance Gene Candidates (RGCs). Using specific primers from the NBS and TIR (Toll/Interleukin-1 Receptor) regions, we identified twelve classes of RGCs in cassava (Manihot esculenta Crantz). Two classes were obtained from the PCR-amplification of the TIR domain. The other 10 classes correspond to the NBS sequences and were grouped into two subfamilies. Classes RCa1 to RCa5 are part of the first subfamily and were linked to a TIR domain in the N terminus. Classes RCa6 to RCa10 corresponded to non-TIR NBS-LRR encoding sequences. BAC library screening with the 12 RGC classes as probes allowed the identification of 42 BAC clones that were assembled into 10 contigs and 19 singletons. Members of the two TIR and non-TIR NBS-LRR subfamilies occurred together within individual BAC clones. The BAC screening and Southern hybridization analyses showed that all RGCs were single copy sequences except RCa6 that represented a large and diverse gene family. One BAC contained five NBS sequences and sequence analysis allowed the identification of two complete RGCs encoding two highly similar proteins. This BAC was located on linkage group J with three other RGC-containing BACs. At least one of these genes, RGC2, is expressed constitutively in cassava tissues.

  7. Gene Expression Signature Analysis Identifies Vorinostat as a Candidate Therapy for Gastric Cancer

    Science.gov (United States)

    Choi, Woonyoung; Park, Yun-Yong; Kim, KyoungHyun; Kim, Sang-Bae; Lee, Ju-Seog; Mills, Gordon B.; Cho, Jae Yong

    2011-01-01

    Background Gastric cancer continues to be one of the deadliest cancers in the world and therefore identification of new drugs targeting this type of cancer is thus of significant importance. The purpose of this study was to identify and validate a therapeutic agent which might improve the outcomes for gastric cancer patients in the future. Methodology/Principal Findings Using microarray technology, we generated a gene expression profile of human gastric cancer–specific genes from human gastric cancer tissue samples. We used this profile in the Broad Institute's Connectivity Map analysis to identify candidate therapeutic compounds for gastric cancer. We found the histone deacetylase inhibitor vorinostat as the lead compound and thus a potential therapeutic drug for gastric cancer. Vorinostat induced both apoptosis and autophagy in gastric cancer cell lines. Pharmacological and genetic inhibition of autophagy however, increased the therapeutic efficacy of vorinostat, indicating that a combination of vorinostat with autophagy inhibitors may therapeutically be more beneficial. Moreover, gene expression analysis of gastric cancer identified a collection of genes (ITGB5, TYMS, MYB, APOC1, CBX5, PLA2G2A, and KIF20A) whose expression was elevated in gastric tumor tissue and downregulated more than 2-fold by vorinostat treatment in gastric cancer cell lines. In contrast, SCGB2A1, TCN1, CFD, APLP1, and NQO1 manifested a reversed pattern. Conclusions/Significance We showed that analysis of gene expression signature may represent an emerging approach to discover therapeutic agents for gastric cancer, such as vorinostat. The observation of altered gene expression after vorinostat treatment may provide the clue to identify the molecular mechanism of vorinostat and those patients likely to benefit from vorinostat treatment. PMID:21931799

  8. GeneAnalytics: An Integrative Gene Set Analysis Tool for Next Generation Sequencing, RNAseq and Microarray Data.

    Science.gov (United States)

    Ben-Ari Fuchs, Shani; Lieder, Iris; Stelzer, Gil; Mazor, Yaron; Buzhor, Ella; Kaplan, Sergey; Bogoch, Yoel; Plaschkes, Inbar; Shitrit, Alina; Rappaport, Noa; Kohn, Asher; Edgar, Ron; Shenhav, Liraz; Safran, Marilyn; Lancet, Doron; Guan-Golan, Yaron; Warshawsky, David; Shtrichman, Ronit

    2016-03-01

    Postgenomics data are produced in large volumes by life sciences and clinical applications of novel omics diagnostics and therapeutics for precision medicine. To move from "data-to-knowledge-to-innovation," a crucial missing step in the current era is, however, our limited understanding of biological and clinical contexts associated with data. Prominent among the emerging remedies to this challenge are the gene set enrichment tools. This study reports on GeneAnalytics™ ( geneanalytics.genecards.org ), a comprehensive and easy-to-apply gene set analysis tool for rapid contextualization of expression patterns and functional signatures embedded in the postgenomics Big Data domains, such as Next Generation Sequencing (NGS), RNAseq, and microarray experiments. GeneAnalytics' differentiating features include in-depth evidence-based scoring algorithms, an intuitive user interface and proprietary unified data. GeneAnalytics employs the LifeMap Science's GeneCards suite, including the GeneCards®--the human gene database; the MalaCards-the human diseases database; and the PathCards--the biological pathways database. Expression-based analysis in GeneAnalytics relies on the LifeMap Discovery®--the embryonic development and stem cells database, which includes manually curated expression data for normal and diseased tissues, enabling advanced matching algorithm for gene-tissue association. This assists in evaluating differentiation protocols and discovering biomarkers for tissues and cells. Results are directly linked to gene, disease, or cell "cards" in the GeneCards suite. Future developments aim to enhance the GeneAnalytics algorithm as well as visualizations, employing varied graphical display items. Such attributes make GeneAnalytics a broadly applicable postgenomics data analyses and interpretation tool for translation of data to knowledge-based innovation in various Big Data fields such as precision medicine, ecogenomics, nutrigenomics, pharmacogenomics, vaccinomics

  9. Gene Expression Profiling of Human Vaginal Cells In Vitro Discriminates Compounds with Pro-Inflammatory and Mucosa-Altering Properties: Novel Biomarkers for Preclinical Testing of HIV Microbicide Candidates.

    Directory of Open Access Journals (Sweden)

    Irina A Zalenskaya

    Full Text Available Inflammation and immune activation of the cervicovaginal mucosa are considered factors that increase susceptibility to HIV infection. Therefore, it is essential to screen candidate anti-HIV microbicides for potential mucosal immunomodulatory/inflammatory effects prior to further clinical development. The goal of this study was to develop an in vitro method for preclinical evaluation of the inflammatory potential of new candidate microbicides using a microarray gene expression profiling strategy.To this end, we compared transcriptomes of human vaginal cells (Vk2/E6E7 treated with well-characterized pro-inflammatory (PIC and non-inflammatory (NIC compounds. PICs included compounds with different mechanisms of action. Gene expression was analyzed using Affymetrix U133 Plus 2 arrays. Data processing was performed using GeneSpring 11.5 (Agilent Technologies, Santa Clara, CA.Microarraray comparative analysis allowed us to generate a panel of 20 genes that were consistently deregulated by PICs compared to NICs, thus distinguishing between these two groups. Functional analysis mapped 14 of these genes to immune and inflammatory responses. This was confirmed by the fact that PICs induced NFkB pathway activation in Vk2 cells. By testing microbicide candidates previously characterized in clinical trials we demonstrated that the selected PIC-associated genes properly identified compounds with mucosa-altering effects. The discriminatory power of these genes was further demonstrated after culturing vaginal cells with vaginal bacteria. Prevotella bivia, prevalent bacteria in the disturbed microbiota of bacterial vaginosis, induced strong upregulation of seven selected PIC-associated genes, while a commensal Lactobacillus gasseri associated to vaginal health did not cause any changes.In vitro evaluation of the immunoinflammatory potential of microbicides using the PIC-associated genes defined in this study could help in the initial screening of candidates prior

  10. Comparative genomics reveals candidate carotenoid pathway regulators of ripening watermelon fruit

    Science.gov (United States)

    2013-01-01

    Background Many fruits, including watermelon, are proficient in carotenoid accumulation during ripening. While most genes encoding steps in the carotenoid biosynthetic pathway have been cloned, few transcriptional regulators of these genes have been defined to date. Here we describe the identification of a set of putative carotenoid-related transcription factors resulting from fresh watermelon carotenoid and transcriptome analysis during fruit development and ripening. Our goal is to both clarify the expression profiles of carotenoid pathway genes and to identify candidate regulators and molecular targets for crop improvement. Results Total carotenoids progressively increased during fruit ripening up to ~55 μg g-1 fw in red-ripe fruits. Trans-lycopene was the carotenoid that contributed most to this increase. Many of the genes related to carotenoid metabolism displayed changing expression levels during fruit ripening generating a metabolic flux toward carotenoid synthesis. Constitutive low expression of lycopene cyclase genes resulted in lycopene accumulation. RNA-seq expression profiling of watermelon fruit development yielded a set of transcription factors whose expression was correlated with ripening and carotenoid accumulation. Nineteen putative transcription factor genes from watermelon and homologous to tomato carotenoid-associated genes were identified. Among these, six were differentially expressed in the flesh of both species during fruit development and ripening. Conclusions Taken together the data suggest that, while the regulation of a common set of metabolic genes likely influences carotenoid synthesis and accumulation in watermelon and tomato fruits during development and ripening, specific and limiting regulators may differ between climacteric and non-climacteric fruits, possibly related to their differential susceptibility to and use of ethylene during ripening. PMID:24219562

  11. ADAGE signature analysis: differential expression analysis with data-defined gene sets.

    Science.gov (United States)

    Tan, Jie; Huyck, Matthew; Hu, Dongbo; Zelaya, René A; Hogan, Deborah A; Greene, Casey S

    2017-11-22

    Gene set enrichment analysis and overrepresentation analyses are commonly used methods to determine the biological processes affected by a differential expression experiment. This approach requires biologically relevant gene sets, which are currently curated manually, limiting their availability and accuracy in many organisms without extensively curated resources. New feature learning approaches can now be paired with existing data collections to directly extract functional gene sets from big data. Here we introduce a method to identify perturbed processes. In contrast with methods that use curated gene sets, this approach uses signatures extracted from public expression data. We first extract expression signatures from public data using ADAGE, a neural network-based feature extraction approach. We next identify signatures that are differentially active under a given treatment. Our results demonstrate that these signatures represent biological processes that are perturbed by the experiment. Because these signatures are directly learned from data without supervision, they can identify uncurated or novel biological processes. We implemented ADAGE signature analysis for the bacterial pathogen Pseudomonas aeruginosa. For the convenience of different user groups, we implemented both an R package (ADAGEpath) and a web server ( http://adage.greenelab.com ) to run these analyses. Both are open-source to allow easy expansion to other organisms or signature generation methods. We applied ADAGE signature analysis to an example dataset in which wild-type and ∆anr mutant cells were grown as biofilms on the Cystic Fibrosis genotype bronchial epithelial cells. We mapped active signatures in the dataset to KEGG pathways and compared with pathways identified using GSEA. The two approaches generally return consistent results; however, ADAGE signature analysis also identified a signature that revealed the molecularly supported link between the MexT regulon and Anr. We designed

  12. Mutation analysis of suppressor of cytokine signalling 3, a candidate gene in Type 1 diabetes and insulin sensitivity

    DEFF Research Database (Denmark)

    Gylvin, T; Nolsøe, R; Hansen, T

    2004-01-01

    Beta cell loss in Type 1 and Type 2 diabetes mellitus may result from apoptosis and necrosis induced by inflammatory mediators. The suppressor of cytokine signalling (SOCS)-3 is a natural inhibitor of cytokine signalling and also influences insulin signalling. SOCS3 could therefore be a candidate...... gene in the development of Type 1 and Type 2 diabetes mellitus....

  13. Microbial Dark Matter: Unusual intervening sequences in 16S rRNA genes of candidate phyla from the deep subsurface

    Energy Technology Data Exchange (ETDEWEB)

    Jarett, Jessica; Stepanauskas, Ramunas; Kieft, Thomas; Onstott, Tullis; Woyke, Tanja

    2014-03-17

    The Microbial Dark Matter project has sequenced genomes from over 200 single cells from candidate phyla, greatly expanding our knowledge of the ecology, inferred metabolism, and evolution of these widely distributed, yet poorly understood lineages. The second phase of this project aims to sequence an additional 800 single cells from known as well as potentially novel candidate phyla derived from a variety of environments. In order to identify whole genome amplified single cells, screening based on phylogenetic placement of 16S rRNA gene sequences is being conducted. Briefly, derived 16S rRNA gene sequences are aligned to a custom version of the Greengenes reference database and added to a reference tree in ARB using parsimony. In multiple samples from deep subsurface habitats but not from other habitats, a large number of sequences proved difficult to align and therefore to place in the tree. Based on comparisons to reference sequences and structural alignments using SSU-ALIGN, many of these ?difficult? sequences appear to originate from candidate phyla, and contain intervening sequences (IVSs) within the 16S rRNA genes. These IVSs are short (39 - 79 nt) and do not appear to be self-splicing or to contain open reading frames. IVSs were found in the loop regions of stem-loop structures in several different taxonomic groups. Phylogenetic placement of sequences is strongly affected by IVSs; two out of three groups investigated were classified as different phyla after their removal. Based on data from samples screened in this project, IVSs appear to be more common in microbes occurring in deep subsurface habitats, although the reasons for this remain elusive.

  14. Can survival prediction be improved by merging gene expression data sets?

    Directory of Open Access Journals (Sweden)

    Haleh Yasrebi

    Full Text Available BACKGROUND: High-throughput gene expression profiling technologies generating a wealth of data, are increasingly used for characterization of tumor biopsies for clinical trials. By applying machine learning algorithms to such clinically documented data sets, one hopes to improve tumor diagnosis, prognosis, as well as prediction of treatment response. However, the limited number of patients enrolled in a single trial study limits the power of machine learning approaches due to over-fitting. One could partially overcome this limitation by merging data from different studies. Nevertheless, such data sets differ from each other with regard to technical biases, patient selection criteria and follow-up treatment. It is therefore not clear at all whether the advantage of increased sample size outweighs the disadvantage of higher heterogeneity of merged data sets. Here, we present a systematic study to answer this question specifically for breast cancer data sets. We use survival prediction based on Cox regression as an assay to measure the added value of merged data sets. RESULTS: Using time-dependent Receiver Operating Characteristic-Area Under the Curve (ROC-AUC and hazard ratio as performance measures, we see in overall no significant improvement or deterioration of survival prediction with merged data sets as compared to individual data sets. This apparently was due to the fact that a few genes with strong prognostic power were not available on all microarray platforms and thus were not retained in the merged data sets. Surprisingly, we found that the overall best performance was achieved with a single-gene predictor consisting of CYB5D1. CONCLUSIONS: Merging did not deteriorate performance on average despite (a The diversity of microarray platforms used. (b The heterogeneity of patients cohorts. (c The heterogeneity of breast cancer disease. (d Substantial variation of time to death or relapse. (e The reduced number of genes in the merged data

  15. Transcriptome sequencing of Mycosphaerella fijiensis during association with Musa acuminata reveals candidate pathogenicity genes.

    Science.gov (United States)

    Noar, Roslyn D; Daub, Margaret E

    2016-08-30

    Mycosphaerella fijiensis, causative agent of the black Sigatoka disease of banana, is considered the most economically damaging banana disease. Despite its importance, the genetics of pathogenicity are poorly understood. Previous studies have characterized polyketide pathways with possible roles in pathogenicity. To identify additional candidate pathogenicity genes, we compared the transcriptome of this fungus during the necrotrophic phase of infection with that during saprophytic growth in medium. Transcriptome analysis was conducted, and the functions of differentially expressed genes were predicted by identifying conserved domains, Gene Ontology (GO) annotation and GO enrichment analysis, Carbohydrate-Active EnZymes (CAZy) annotation, and identification of genes encoding effector-like proteins. The analysis showed that genes commonly involved in secondary metabolism have higher expression in infected leaf tissue, including genes encoding cytochrome P450s, short-chain dehydrogenases, and oxidoreductases in the 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily. Other pathogenicity-related genes with higher expression in infected leaf tissue include genes encoding salicylate hydroxylase-like proteins, hydrophobic surface binding proteins, CFEM domain-containing proteins, and genes encoding secreted cysteine-rich proteins characteristic of effectors. More genes encoding amino acid transporters, oligopeptide transporters, peptidases, proteases, proteinases, sugar transporters, and proteins containing Domain of Unknown Function (DUF) 3328 had higher expression in infected leaf tissue, while more genes encoding inhibitors of peptidases and proteinases had higher expression in medium. Sixteen gene clusters with higher expression in leaf tissue were identified including clusters for the synthesis of a non-ribosomal peptide. A cluster encoding a novel fusicoccane was also identified. Two putative dispensable scaffolds were identified with a large proportion of

  16. RNA-Seq analysis and annotation of a draft blueberry genome assembly identifies candidate genes involved in fruit ripening, biosynthesis of bioactive compounds, and stage-specific alternative splicing.

    Science.gov (United States)

    Gupta, Vikas; Estrada, April D; Blakley, Ivory; Reid, Rob; Patel, Ketan; Meyer, Mason D; Andersen, Stig Uggerhøj; Brown, Allan F; Lila, Mary Ann; Loraine, Ann E

    2015-01-01

    Blueberries are a rich source of antioxidants and other beneficial compounds that can protect against disease. Identifying genes involved in synthesis of bioactive compounds could enable the breeding of berry varieties with enhanced health benefits. Toward this end, we annotated a previously sequenced draft blueberry genome assembly using RNA-Seq data from five stages of berry fruit development and ripening. Genome-guided assembly of RNA-Seq read alignments combined with output from ab initio gene finders produced around 60,000 gene models, of which more than half were similar to proteins from other species, typically the grape Vitis vinifera. Comparison of gene models to the PlantCyc database of metabolic pathway enzymes identified candidate genes involved in synthesis of bioactive compounds, including bixin, an apocarotenoid with potential disease-fighting properties, and defense-related cyanogenic glycosides, which are toxic. Cyanogenic glycoside (CG) biosynthetic enzymes were highly expressed in green fruit, and a candidate CG detoxification enzyme was up-regulated during fruit ripening. Candidate genes for ethylene, anthocyanin, and 400 other biosynthetic pathways were also identified. Homology-based annotation using Blast2GO and InterPro assigned Gene Ontology terms to around 15,000 genes. RNA-Seq expression profiling showed that blueberry growth, maturation, and ripening involve dynamic gene expression changes, including coordinated up- and down-regulation of metabolic pathway enzymes and transcriptional regulators. Analysis of RNA-seq alignments identified developmentally regulated alternative splicing, promoter use, and 3' end formation. We report genome sequence, gene models, functional annotations, and RNA-Seq expression data that provide an important new resource enabling high throughput studies in blueberry.

  17. Fine-scale linkage mapping reveals a small set of candidate genes influencing honey bee grooming behavior in response to Varroa mites.

    Directory of Open Access Journals (Sweden)

    Miguel E Arechavaleta-Velasco

    Full Text Available Populations of honey bees in North America have been experiencing high annual colony mortality for 15-20 years. Many apicultural researchers believe that introduced parasites called Varroa mites (V. destructor are the most important factor in colony deaths. One important resistance mechanism that limits mite population growth in colonies is the ability of some lines of honey bees to groom mites from their bodies. To search for genes influencing this trait, we used an Illumina Bead Station genotyping array to determine the genotypes of several hundred worker bees at over a thousand single-nucleotide polymorphisms in a family that was apparently segregating for alleles influencing this behavior. Linkage analyses provided a genetic map with 1,313 markers anchored to genome sequence. Genotypes were analyzed for association with grooming behavior, measured as the time that individual bees took to initiate grooming after mites were placed on their thoraces. Quantitative-trait-locus interval mapping identified a single chromosomal region that was significant at the chromosome-wide level (p<0.05 on chromosome 5 with a LOD score of 2.72. The 95% confidence interval for quantitative trait locus location contained only 27 genes (honey bee official gene annotation set 2 including Atlastin, Ataxin and Neurexin-1 (AmNrx1, which have potential neurodevelopmental and behavioral effects. Atlastin and Ataxin homologs are associated with neurological diseases in humans. AmNrx1 codes for a presynaptic protein with many alternatively spliced isoforms. Neurexin-1 influences the growth, maintenance and maturation of synapses in the brain, as well as the type of receptors most prominent within synapses. Neurexin-1 has also been associated with autism spectrum disorder and schizophrenia in humans, and self-grooming behavior in mice.

  18. Candidate gene analyses of 3-dimensional dentoalveolar phenotypes in subjects with malocclusion

    Science.gov (United States)

    Weaver, Cole A.; Miller, Steven F.; da Fontoura, Clarissa S. G.; Wehby, George L.; Amendt, Brad A.; Holton, Nathan E.; Allareddy, Veeratrishul; Southard, Thomas E.; Moreno Uribe, Lina M.

    2017-01-01

    Introduction Genetic studies of malocclusion etiology have identified 4 deleterious mutations in genes, DUSP6, ARHGAP21, FGF23, and ADAMTS1 in familial Class III cases. Although these variants may have large impacts on Class III phenotypic expression, their low frequency (malocclusions. Thus, much of the genetic variation underlying the dentofacial phenotypic variation associated with malocclusion remains unknown. In this study, we evaluated associations between common genetic variations in craniofacial candidate genes and 3-dimensional dentoalveolar phenotypes in patients with malocclusion. Methods Pretreatment dental casts or cone-beam computed tomographic images from 300 healthy subjects were digitized with 48 landmarks. The 3-dimensional coordinate data were submitted to a geometric morphometric approach along with principal component analysis to generate continuous phenotypes including symmetric and asymmetric components of dentoalveolar shape variation, fluctuating asymmetry, and size. The subjects were genotyped for 222 single-nucleotide polymorphisms in 82 genes/loci, and phenotpye-genotype associations were tested via multivariate linear regression. Results Principal component analysis of symmetric variation identified 4 components that explained 68% of the total variance and depicted anteroposterior, vertical, and transverse dentoalveolar discrepancies. Suggestive associations (P eruptions. Suggestive associations were found with TBX1 AJUBA, SNAI3 SATB2, TP63, and 1p22.1. Fluctuating asymmetry was associated with BMP3 and LATS1. Associations for SATB2 and BMP3 with asymmetric variations remained significant after the Bonferroni correction (P malocclusions were identified. PMID:28257739

  19. Converging evidence that sequence variations in the novel candidate gene MAP2K7 (MKK7) are functionally associated with schizophrenia.

    Science.gov (United States)

    Winchester, Catherine L; Ohzeki, Hiromitsu; Vouyiouklis, Demetrius A; Thompson, Rhiannon; Penninger, Josef M; Yamagami, Keiji; Norrie, John D; Hunter, Robert; Pratt, Judith A; Morris, Brian J

    2012-11-15

    Schizophrenia is a debilitating psychiatric disease with a strong genetic contribution, potentially linked to altered glutamatergic function in brain regions such as the prefrontal cortex (PFC). Here, we report converging evidence to support a functional candidate gene for schizophrenia. In post-mortem PFC from patients with schizophrenia, we detected decreased expression of MKK7/MAP2K7-a kinase activated by glutamatergic activity. While mice lacking one copy of the Map2k7 gene were overtly normal in a variety of behavioural tests, these mice showed a schizophrenia-like cognitive phenotype of impaired working memory. Additional support for MAP2K7 as a candidate gene came from a genetic association study. A substantial effect size (odds ratios: ~1.9) was observed for a common variant in a cohort of case and control samples collected in the Glasgow area and also in a replication cohort of samples of Northern European descent (most significant P-value: 3 × 10(-4)). While some caution is warranted until these association data are further replicated, these results are the first to implicate the candidate gene MAP2K7 in genetic risk for schizophrenia. Complete sequencing of all MAP2K7 exons did not reveal any non-synonymous mutations. However, the MAP2K7 haplotype appeared to have functional effects, in that it influenced the level of expression of MAP2K7 mRNA in human PFC. Taken together, the results imply that reduced function of the MAP2K7-c-Jun N-terminal kinase (JNK) signalling cascade may underlie some of the neurochemical changes and core symptoms in schizophrenia.

  20. Genome-wide data-mining of candidate human splice translational efficiency polymorphisms (STEPs and an online database.

    Directory of Open Access Journals (Sweden)

    Christopher A Raistrick

    2010-10-01

    Full Text Available Variation in pre-mRNA splicing is common and in some cases caused by genetic variants in intronic splicing motifs. Recent studies into the insulin gene (INS discovered a polymorphism in a 5' non-coding intron that influences the likelihood of intron retention in the final mRNA, extending the 5' untranslated region and maintaining protein quality. Retention was also associated with increased insulin levels, suggesting that such variants--splice translational efficiency polymorphisms (STEPs--may relate to disease phenotypes through differential protein expression. We set out to explore the prevalence of STEPs in the human genome and validate this new category of protein quantitative trait loci (pQTL using publicly available data.Gene transcript and variant data were collected and mined for candidate STEPs in motif regions. Sequences from transcripts containing potential STEPs were analysed for evidence of splice site recognition and an effect in expressed sequence tags (ESTs. 16 publicly released genome-wide association data sets of common diseases were searched for association to candidate polymorphisms with HapMap frequency data. Our study found 3324 candidate STEPs lying in motif sequences of 5' non-coding introns and further mining revealed 170 with transcript evidence of intron retention. 21 potential STEPs had EST evidence of intron retention or exon extension, as well as population frequency data for comparison.Results suggest that the insulin STEP was not a unique example and that many STEPs may occur genome-wide with potentially causal effects in complex disease. An online database of STEPs is freely accessible at http://dbstep.genes.org.uk/.

  1. Children’s Hospital of Pittsburgh and Diabetes Institute of the Walter Reed Health Care System Genetic Screening in Diabetes: Candidate Gene Analysis for Diabetic Retinopathy

    Science.gov (United States)

    2010-05-01

    Screening in Diabetes : Candidate Gene Analysis for Diabetic Retinopathy PRINCIPAL INVESTIGATOR: Robert A. Vigersky, COL MC CONTRACTING ORGANIZATION... Diabetes Institute of the Walter Reed Health Care System Genetic Screening in Diabetes : Candidate Gene Analysis for Diabetic Retinopathy 5c. PROGRAM... diabetic  neuropathy, and  diabetic   retinopathy .  This was an observational study in which the investigators obtained DNA samples from the blood of

  2. Whole genome sequencing resource identifies 18 new candidate genes for autism spectrum disorder

    Science.gov (United States)

    Yuen, Ryan KC; Merico, Daniele; Bookman, Matt; Howe, Jennifer L; Thiruvahindrapuram, Bhooma; Patel, Rohan V; Whitney, Joe; Deflaux, Nicole; Bingham, Jonathan; Wang, Zhuozhi; Pellecchia, Giovanna; Buchanan, Janet A; Walker, Susan; Marshall, Christian R; Uddin, Mohammed; Zarrei, Mehdi; Deneault, Eric; D’Abate, Lia; Chan, Ada JS; Koyanagi, Stephanie; Paton, Tara; Pereira, Sergio L; Hoang, Ny; Engchuan, Worrawat; Higginbotham, Edward J; Ho, Karen; Lamoureux, Sylvia; Li, Weili; MacDonald, Jeffrey R; Nalpathamkalam, Thomas; Sung, Wilson WL; Tsoi, Fiona J; Wei, John; Xu, Lizhen; Tasse, Anne-Marie; Kirby, Emily; Van Etten, William; Twigger, Simon; Roberts, Wendy; Drmic, Irene; Jilderda, Sanne; Modi, Bonnie MacKinnon; Kellam, Barbara; Szego, Michael; Cytrynbaum, Cheryl; Weksberg, Rosanna; Zwaigenbaum, Lonnie; Woodbury-Smith, Marc; Brian, Jessica; Senman, Lili; Iaboni, Alana; Doyle-Thomas, Krissy; Thompson, Ann; Chrysler, Christina; Leef, Jonathan; Savion-Lemieux, Tal; Smith, Isabel M; Liu, Xudong; Nicolson, Rob; Seifer, Vicki; Fedele, Angie; Cook, Edwin H; Dager, Stephen; Estes, Annette; Gallagher, Louise; Malow, Beth A; Parr, Jeremy R; Spence, Sarah J; Vorstman, Jacob; Frey, Brendan J; Robinson, James T; Strug, Lisa J; Fernandez, Bridget A; Elsabbagh, Mayada; Carter, Melissa T; Hallmayer, Joachim; Knoppers, Bartha M; Anagnostou, Evdokia; Szatmari, Peter; Ring, Robert H; Glazer, David; Pletcher, Mathew T; Scherer, Stephen W

    2017-01-01

    We are performing whole genome sequencing (WGS) of families with Autism Spectrum Disorder (ASD) to build a resource, named MSSNG, to enable the sub-categorization of phenotypes and underlying genetic factors involved. Here, we report WGS of 5,205 samples from families with ASD, accompanied by clinical information, creating a database accessible in a cloud platform, and through an internet portal with controlled access. We found an average of 73.8 de novo single nucleotide variants and 12.6 de novo insertion/deletions (indels) or copy number variations (CNVs) per ASD subject. We identified 18 new candidate ASD-risk genes such as MED13 and PHF3, and found that participants bearing mutations in susceptibility genes had significantly lower adaptive ability (p=6×10−4). In 294/2,620 (11.2%) of ASD cases, a molecular basis could be determined and 7.2% of these carried CNV/chromosomal abnormalities, emphasizing the importance of detecting all forms of genetic variation as diagnostic and therapeutic targets in ASD. PMID:28263302

  3. Evaluation of suitable reference genes for gene expression studies in bovine muscular tissue

    Directory of Open Access Journals (Sweden)

    Dunner Susana

    2008-09-01

    Full Text Available Abstract Background Real-time reverse transcriptase quantitative polymerase chain reaction (real-time RTqPCR is a technique used to measure mRNA species copy number as a way to determine key genes involved in different biological processes. However, the expression level of these key genes may vary among tissues or cells not only as a consequence of differential expression but also due to different factors, including choice of reference genes to normalize the expression levels of the target genes; thus the selection of reference genes is critical for expression studies. For this purpose, ten candidate reference genes were investigated in bovine muscular tissue. Results The value of stability of ten candidate reference genes included in three groups was estimated: the so called 'classical housekeeping' genes (18S, GAPDH and ACTB, a second set of genes used in expression studies conducted on other tissues (B2M, RPII, UBC and HMBS and a third set of novel genes (SF3A1, EEF1A2 and CASC3. Three different statistical algorithms were used to rank the genes by their stability measures as produced by geNorm, NormFinder and Bestkeeper. The three methods tend to agree on the most stably expressed genes and the least in muscular tissue. EEF1A2 and HMBS followed by SF3A1, ACTB, and CASC3 can be considered as stable reference genes, and B2M, RPII, UBC and GAPDH would not be appropriate. Although the rRNA-18S stability measure seems to be within the range of acceptance, its use is not recommended because its synthesis regulation is not representative of mRNA levels. Conclusion Based on geNorm algorithm, we propose the use of three genes SF3A1, EEF1A2 and HMBS as references for normalization of real-time RTqPCR in muscle expression studies.

  4. Systematic evaluation of candidate blood markers for detecting ovarian cancer.

    Directory of Open Access Journals (Sweden)

    Chana Palmer

    2008-07-01

    Full Text Available Epithelial ovarian cancer is a significant cause of mortality both in the United States and worldwide, due largely to the high proportion of cases that present at a late stage, when survival is extremely poor. Early detection of epithelial ovarian cancer, and of the serous subtype in particular, is a promising strategy for saving lives. The low prevalence of ovarian cancer makes the development of an adequately sensitive and specific test based on blood markers very challenging. We evaluated the performance of a set of candidate blood markers and combinations of these markers in detecting serous ovarian cancer.We selected 14 candidate blood markers of serous ovarian cancer for which assays were available to measure their levels in serum or plasma, based on our analysis of global gene expression data and on literature searches. We evaluated the performance of these candidate markers individually and in combination by measuring them in overlapping sets of serum (or plasma samples from women with clinically detectable ovarian cancer and women without ovarian cancer. Based on sensitivity at high specificity, we determined that 4 of the 14 candidate markers--MUC16, WFDC2, MSLN and MMP7--warrant further evaluation in precious serum specimens collected months to years prior to clinical diagnosis to assess their utility in early detection. We also reported differences in the performance of these candidate blood markers across histological types of epithelial ovarian cancer.By systematically analyzing the performance of candidate blood markers of ovarian cancer in distinguishing women with clinically apparent ovarian cancer from women without ovarian cancer, we identified a set of serum markers with adequate performance to warrant testing for their ability to identify ovarian cancer months to years prior to clinical diagnosis. We argued for the importance of sensitivity at high specificity and of magnitude of difference in marker levels between cases and

  5. Expression of SET Protein in the Ovaries of Patients with Polycystic Ovary Syndrome

    OpenAIRE

    Xu Boqun; Dai Xiaonan; Cui YuGui; Gao Lingling; Dai Xue; Chao Gao; Diao Feiyang; Liu Jiayin; Li Gao; Mei Li; Yuan Zhang; Xiang Ma

    2013-01-01

    Background. We previously found that expression of SET gene was up-regulated in polycystic ovaries by using microarray. It suggested that SET may be an attractive candidate regulator involved in the pathophysiology of polycystic ovary syndrome (PCOS). In this study, expression and cellular localization of SET protein were investigated in human polycystic and normal ovaries. Method. Ovarian tissues, six normal ovaries and six polycystic ovaries, were collected during transsexual operation and ...

  6. QTL mapping and transcriptome analysis of cowpea reveals candidate genes for root-knot nematode resistance.

    Directory of Open Access Journals (Sweden)

    Jansen Rodrigo Pereira Santos

    Full Text Available Cowpea is one of the most important food and forage legumes in drier regions of the tropics and subtropics. However, cowpea yield worldwide is markedly below the known potential due to abiotic and biotic stresses, including parasitism by root-knot nematodes (Meloidogyne spp., RKN. Two resistance genes with dominant effect, Rk and Rk2, have been reported to provide resistance against RKN in cowpea. Despite their description and use in breeding for resistance to RKN and particularly genetic mapping of the Rk locus, the exact genes conferring resistance to RKN remain unknown. In the present work, QTL mapping using recombinant inbred line (RIL population 524B x IT84S-2049 segregating for a newly mapped locus and analysis of the transcriptome changes in two cowpea near-isogenic lines (NIL were used to identify candidate genes for Rk and the newly mapped locus. A major QTL, designated QRk-vu9.1, associated with resistance to Meloidogyne javanica reproduction, was detected and mapped on linkage group LG9 at position 13.37 cM using egg production data. Transcriptome analysis on resistant and susceptible NILs 3 and 9 days after inoculation revealed up-regulation of 109 and 98 genes and down-regulation of 110 and 89 genes, respectively, out of 19,922 unique genes mapped to the common bean reference genome. Among the differentially expressed genes, four and nine genes were found within the QRk-vu9.1 and QRk-vu11.1 QTL intervals, respectively. Six of these genes belong to the TIR-NBS-LRR family of resistance genes and three were upregulated at one or more time-points. Quantitative RT-PCR validated gene expression to be positively correlated with RNA-seq expression pattern for eight genes. Future functional analysis of these cowpea genes will enhance our understanding of Rk-mediated resistance and identify the specific gene responsible for the resistance.

  7. Combined Analysis of the Fruit Metabolome and Transcriptome Reveals Candidate Genes Involved in Flavonoid Biosynthesis in Actinidia arguta.

    Science.gov (United States)

    Li, Yukuo; Fang, Jinbao; Qi, Xiujuan; Lin, Miaomiao; Zhong, Yunpeng; Sun, Leiming; Cui, Wen

    2018-05-15

    To assess the interrelation between the change of metabolites and the change of fruit color, we performed a combined metabolome and transcriptome analysis of the flesh in two different Actinidia arguta cultivars: "HB" ("Hongbaoshixing") and "YF" ("Yongfengyihao") at two different fruit developmental stages: 70d (days after full bloom) and 100d (days after full bloom). Metabolite and transcript profiling was obtained by ultra-performance liquid chromatography quadrupole time-of-flight tandem mass spectrometer and high-throughput RNA sequencing, respectively. The identification and quantification results of metabolites showed that a total of 28,837 metabolites had been obtained, of which 13,715 were annotated. In comparison of HB100 vs. HB70, 41 metabolites were identified as being flavonoids, 7 of which, with significant difference, were identified as bracteatin, luteolin, dihydromyricetin, cyanidin, pelargonidin, delphinidin and (-)-epigallocatechin. Association analysis between metabolome and transcriptome revealed that there were two metabolic pathways presenting significant differences during fruit development, one of which was flavonoid biosynthesis, in which 14 structural genes were selected to conduct expression analysis, as well as 5 transcription factor genes obtained by transcriptome analysis. RT-qPCR results and cluster analysis revealed that AaF3H , AaLDOX , AaUFGT , AaMYB , AabHLH , and AaHB2 showed the best possibility of being candidate genes. A regulatory network of flavonoid biosynthesis was established to illustrate differentially expressed candidate genes involved in accumulation of metabolites with significant differences, inducing red coloring during fruit development. Such a regulatory network linking genes and flavonoids revealed a system involved in the pigmentation of all-red-fleshed and all-green-fleshed A. arguta , suggesting this conjunct analysis approach is not only useful in understanding the relationship between genotype and phenotype

  8. Evolutionary signatures amongst disease genes permit novel methods for gene prioritization and construction of informative gene-based networks.

    Directory of Open Access Journals (Sweden)

    Nolan Priedigkeit

    2015-02-01

    Full Text Available Genes involved in the same function tend to have similar evolutionary histories, in that their rates of evolution covary over time. This coevolutionary signature, termed Evolutionary Rate Covariation (ERC, is calculated using only gene sequences from a set of closely related species and has demonstrated potential as a computational tool for inferring functional relationships between genes. To further define applications of ERC, we first established that roughly 55% of genetic diseases posses an ERC signature between their contributing genes. At a false discovery rate of 5% we report 40 such diseases including cancers, developmental disorders and mitochondrial diseases. Given these coevolutionary signatures between disease genes, we then assessed ERC's ability to prioritize known disease genes out of a list of unrelated candidates. We found that in the presence of an ERC signature, the true disease gene is effectively prioritized to the top 6% of candidates on average. We then apply this strategy to a melanoma-associated region on chromosome 1 and identify MCL1 as a potential causative gene. Furthermore, to gain global insight into disease mechanisms, we used ERC to predict molecular connections between 310 nominally distinct diseases. The resulting "disease map" network associates several diseases with related pathogenic mechanisms and unveils many novel relationships between clinically distinct diseases, such as between Hirschsprung's disease and melanoma. Taken together, these results demonstrate the utility of molecular evolution as a gene discovery platform and show that evolutionary signatures can be used to build informative gene-based networks.

  9. Nuclear-Cytoplasmic Conflict in Pea (Pisum sativum L.) Is Associated with Nuclear and Plastidic Candidate Genes Encoding Acetyl-CoA Carboxylase Subunits

    Science.gov (United States)

    Bogdanova, Vera S.; Zaytseva, Olga O.; Mglinets, Anatoliy V.; Shatskaya, Natalia V.; Kosterin, Oleg E.; Vasiliev, Gennadiy V.

    2015-01-01

    In crosses of wild and cultivated peas (Pisum sativum L.), nuclear-cytoplasmic incompatibility frequently occurs manifested as decreased pollen fertility, male gametophyte lethality, sporophyte lethality. High-throughput sequencing of plastid genomes of one cultivated and four wild pea accessions differing in cross-compatibility was performed. Candidate genes for involvement in the nuclear-plastid conflict were searched in the reconstructed plastid genomes. In the annotated Medicago truncatula genome, nuclear candidate genes were searched in the portion syntenic to the pea chromosome region known to harbor a locus involved in the conflict. In the plastid genomes, a substantial variability of the accD locus represented by nucleotide substitutions and indels was found to correspond to the pattern of cross-compatibility among the accessions analyzed. Amino acid substitutions in the polypeptides encoded by the alleles of a nuclear locus, designated as Bccp3, with a complementary function to accD, fitted the compatibility pattern. The accD locus in the plastid genome encoding beta subunit of the carboxyltransferase of acetyl-coA carboxylase and the nuclear locus Bccp3 encoding biotin carboxyl carrier protein of the same multi-subunit enzyme were nominated as candidate genes for main contribution to nuclear-cytoplasmic incompatibility in peas. Existence of another nuclear locus involved in the accD-mediated conflict is hypothesized. PMID:25789472

  10. Nuclear-cytoplasmic conflict in pea (Pisum sativum L. is associated with nuclear and plastidic candidate genes encoding acetyl-CoA carboxylase subunits.

    Directory of Open Access Journals (Sweden)

    Vera S Bogdanova

    Full Text Available In crosses of wild and cultivated peas (Pisum sativum L., nuclear-cytoplasmic incompatibility frequently occurs manifested as decreased pollen fertility, male gametophyte lethality, sporophyte lethality. High-throughput sequencing of plastid genomes of one cultivated and four wild pea accessions differing in cross-compatibility was performed. Candidate genes for involvement in the nuclear-plastid conflict were searched in the reconstructed plastid genomes. In the annotated Medicago truncatula genome, nuclear candidate genes were searched in the portion syntenic to the pea chromosome region known to harbor a locus involved in the conflict. In the plastid genomes, a substantial variability of the accD locus represented by nucleotide substitutions and indels was found to correspond to the pattern of cross-compatibility among the accessions analyzed. Amino acid substitutions in the polypeptides encoded by the alleles of a nuclear locus, designated as Bccp3, with a complementary function to accD, fitted the compatibility pattern. The accD locus in the plastid genome encoding beta subunit of the carboxyltransferase of acetyl-coA carboxylase and the nuclear locus Bccp3 encoding biotin carboxyl carrier protein of the same multi-subunit enzyme were nominated as candidate genes for main contribution to nuclear-cytoplasmic incompatibility in peas. Existence of another nuclear locus involved in the accD-mediated conflict is hypothesized.

  11. Expression map of a complete set of gustatory receptor genes in chemosensory organs of Bombyx mori.

    Science.gov (United States)

    Guo, Huizhen; Cheng, Tingcai; Chen, Zhiwei; Jiang, Liang; Guo, Youbing; Liu, Jianqiu; Li, Shenglong; Taniai, Kiyoko; Asaoka, Kiyoshi; Kadono-Okuda, Keiko; Arunkumar, Kallare P; Wu, Jiaqi; Kishino, Hirohisa; Zhang, Huijie; Seth, Rakesh K; Gopinathan, Karumathil P; Montagné, Nicolas; Jacquin-Joly, Emmanuelle; Goldsmith, Marian R; Xia, Qingyou; Mita, Kazuei

    2017-03-01

    Most lepidopteran species are herbivores, and interaction with host plants affects their gene expression and behavior as well as their genome evolution. Gustatory receptors (Grs) are expected to mediate host plant selection, feeding, oviposition and courtship behavior. However, due to their high diversity, sequence divergence and extremely low level of expression it has been difficult to identify precisely a complete set of Grs in Lepidoptera. By manual annotation and BAC sequencing, we improved annotation of 43 gene sequences compared with previously reported Grs in the most studied lepidopteran model, the silkworm, Bombyx mori, and identified 7 new tandem copies of BmGr30 on chromosome 7, bringing the total number of BmGrs to 76. Among these, we mapped 68 genes to chromosomes in a newly constructed chromosome distribution map and 8 genes to scaffolds; we also found new evidence for large clusters of BmGrs, especially from the bitter receptor family. RNA-seq analysis of diverse BmGr expression patterns in chemosensory organs of larvae and adults enabled us to draw a precise organ specific map of BmGr expression. Interestingly, most of the clustered genes were expressed in the same tissues and more than half of the genes were expressed in larval maxillae, larval thoracic legs and adult legs. For example, BmGr63 showed high expression levels in all organs in both larval and adult stages. By contrast, some genes showed expression limited to specific developmental stages or organs and tissues. BmGr19 was highly expressed in larval chemosensory organs (especially antennae and thoracic legs), the single exon genes BmGr53 and BmGr67 were expressed exclusively in larval tissues, the BmGr27-BmGr31 gene cluster on chr7 displayed a high expression level limited to adult legs and the candidate CO 2 receptor BmGr2 was highly expressed in adult antennae, where few other Grs were expressed. Transcriptional analysis of the Grs in B. mori provides a valuable new reference for

  12. Assessing CPR training: The willingness of teaching credential candidates to provide CPR in a school setting.

    Science.gov (United States)

    Winkelman, Jack L; Fischbach, Ronald; Spinello, Elio F

    2009-12-01

    The study explores the anticipated willingness of teacher credential candidates at one California public university in the U.S. to perform cardiopulmonary resuscitation (CPR) or foreign body airway obstruction (FBAO) skills in a school setting. Objectives included (1) identifying reasons that credential candidates would elect or decline to perform CPR, (2) assisting schools to remediate cardiac/respiratory emergency preparedness, and (3) assessing CPR training courses to determine how they may influence teachers' willingness to perform CPR. Participants included 582 teacher credential candidates, who were 95.2% of those surveyed after completion of a health science course and CPR certification. Participants described their attitudes regarding the importance of CPR, the CPR training course, and their willingness to perform CPR in a school environment. Based upon chi-square analysis, an association was found between the willingness to perform CPR and the presence of any one concern regarding training, with 68.6% of those expressing concerns willing to perform CPR compared to 81.9% of those expressing no concerns (pteachers (76.9% vs. 43.5%, pteachers' willingness to perform CPR. Recommendations based on these findings include pedagogical changes to CPR curricula, focusing on the importance of CPR as a teacher skill and additional time for hands-on practice. Future research should include U.S. and international participants from a broader geographic area and assessment of both learning and affective outcomes.

  13. Three gene expression vector sets for concurrently expressing multiple genes in Saccharomyces cerevisiae.

    Science.gov (United States)

    Ishii, Jun; Kondo, Takashi; Makino, Harumi; Ogura, Akira; Matsuda, Fumio; Kondo, Akihiko

    2014-05-01

    Yeast has the potential to be used in bulk-scale fermentative production of fuels and chemicals due to its tolerance for low pH and robustness for autolysis. However, expression of multiple external genes in one host yeast strain is considerably labor-intensive due to the lack of polycistronic transcription. To promote the metabolic engineering of yeast, we generated systematic and convenient genetic engineering tools to express multiple genes in Saccharomyces cerevisiae. We constructed a series of multi-copy and integration vector sets for concurrently expressing two or three genes in S. cerevisiae by embedding three classical promoters. The comparative expression capabilities of the constructed vectors were monitored with green fluorescent protein, and the concurrent expression of genes was monitored with three different fluorescent proteins. Our multiple gene expression tool will be helpful to the advanced construction of genetically engineered yeast strains in a variety of research fields other than metabolic engineering. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.

  14. Cogena, a novel tool for co-expressed gene-set enrichment analysis, applied to drug repositioning and drug mode of action discovery.

    Science.gov (United States)

    Jia, Zhilong; Liu, Ying; Guan, Naiyang; Bo, Xiaochen; Luo, Zhigang; Barnes, Michael R

    2016-05-27

    Drug repositioning, finding new indications for existing drugs, has gained much recent attention as a potentially efficient and economical strategy for accelerating new therapies into the clinic. Although improvement in the sensitivity of computational drug repositioning methods has identified numerous credible repositioning opportunities, few have been progressed. Arguably the "black box" nature of drug action in a new indication is one of the main blocks to progression, highlighting the need for methods that inform on the broader target mechanism in the disease context. We demonstrate that the analysis of co-expressed genes may be a critical first step towards illumination of both disease pathology and mode of drug action. We achieve this using a novel framework, co-expressed gene-set enrichment analysis (cogena) for co-expression analysis of gene expression signatures and gene set enrichment analysis of co-expressed genes. The cogena framework enables simultaneous, pathway driven, disease and drug repositioning analysis. Cogena can be used to illuminate coordinated changes within disease transcriptomes and identify drugs acting mechanistically within this framework. We illustrate this using a psoriatic skin transcriptome, as an exemplar, and recover two widely used Psoriasis drugs (Methotrexate and Ciclosporin) with distinct modes of action. Cogena out-performs the results of Connectivity Map and NFFinder webservers in similar disease transcriptome analyses. Furthermore, we investigated the literature support for the other top-ranked compounds to treat psoriasis and showed how the outputs of cogena analysis can contribute new insight to support the progression of drugs into the clinic. We have made cogena freely available within Bioconductor or https://github.com/zhilongjia/cogena . In conclusion, by targeting co-expressed genes within disease transcriptomes, cogena offers novel biological insight, which can be effectively harnessed for drug discovery and

  15. Candidate gene resequencing to identify rare, pedigree-specific variants influencing healthy aging phenotypes in the long life family study

    DEFF Research Database (Denmark)

    Druley, Todd E; Wang, Lihua; Lin, Shiow J

    2016-01-01

    from six pedigrees. OBFC1 (chromosome 10) is involved in telomere maintenance, and falls within a linkage peak recently reported from an analysis of telomere length in LLFS families. Two different algorithms for single gene associations identified three genes with an enrichment of variation......BACKGROUND: The Long Life Family Study (LLFS) is an international study to identify the genetic components of various healthy aging phenotypes. We hypothesized that pedigree-specific rare variants at longevity-associated genes could have a similar functional impact on healthy phenotypes. METHODS......: We performed custom hybridization capture sequencing to identify the functional variants in 464 candidate genes for longevity or the major diseases of aging in 615 pedigrees (4,953 individuals) from the LLFS, using a multiplexed, custom hybridization capture. Variants were analyzed individually...

  16. Candidate gene investigation of spinal degenerative osteoarthritis in Greek population.

    Science.gov (United States)

    Liva, Eleni; Panagiotou, Irene; Palikyras, Spyros; Parpa, Efi; Tsilika, Eleni; Paschou, Peristera; Mystakidou, Kyriaki

    2017-12-01

    Few data exist concerning the natural history of degenerative osteoarthritis (OA) of the spine and its associated gene investigation. Degenerative spinal OA demonstrates an international prevalence of 15% in the general population. The aim of this Greek case-control study is to examine gene polymorphisms that have been previously shown or hypothesized to be correlated to degenerative OA. Gene polymorphisms, especially for OA, have never been previously studied in the Greek population. The study was conducted from May 2009 to December 2012. Eligible subjects who agreed to take part in the study were Greek adults from all of Greece, referred for consultation to the Palliative Care and Pain Relief Unit of Aretaieion University Hospital, in Athens, Greece. A total of 601 matched pairs (cases and controls) participated in the study, 258 patients (188 women and 70 men) with clinically and radiologically confirmed degenerative OA and 243 control subjects (138 women and 105 men). All patients presented with chronic pain at the spine (cervical, thoracic or lumbar) caused by sympomatic osteophytes or disc narrowing, whereas clinical diagnosis of OA was based on the presence of both joint symptoms and evidence of structural changes seen on plain conventional X-rays. We investigated genetic variation across candidate OA gene GDF5, CDMP1, CDMP2, Asporin, SMAD3, and chromosomal region 7q22, in a sample of 258 patients with clinically and radiologically confirmed degenerative OA, and 243 control subjects from the Greek population. All subjects (patients and controls) were subsequently matched for the epidemiologic, demographic, and clinical risk factors, to prevent selection biases. A tagging single nucleotide polymorphism (SNP) approach was pursued to cover variation across all targeted loci. Single marker tests as well as haplotypic tests of association were performed. There is no conflict of interest, and also, there are no study funding sources. We found significant

  17. A comprehensive approach to identify reliable reference gene candidates to investigate the link between alcoholism and endocrinology in Sprague-Dawley rats.

    Directory of Open Access Journals (Sweden)

    Faten A Taki

    Full Text Available Gender and hormonal differences are often correlated with alcohol dependence and related complications like addiction and breast cancer. Estrogen (E2 is an important sex hormone because it serves as a key protein involved in organism level signaling pathways. Alcoholism has been reported to affect estrogen receptor signaling; however, identifying the players involved in such multi-faceted syndrome is complex and requires an interdisciplinary approach. In many situations, preliminary investigations included a straight forward, yet informative biotechniques such as gene expression analyses using quantitative real time PCR (qRT-PCR. The validity of qRT-PCR-based conclusions is affected by the choice of reliable internal controls. With this in mind, we compiled a list of 15 commonly used housekeeping genes (HKGs as potential reference gene candidates in rat biological models. A comprehensive comparison among 5 statistical approaches (geNorm, dCt method, NormFinder, BestKeeper, and RefFinder was performed to identify the minimal number as well the most stable reference genes required for reliable normalization in experimental rat groups that comprised sham operated (SO, ovariectomized rats in the absence (OVX or presence of E2 (OVXE2. These rat groups were subdivided into subgroups that received alcohol in liquid diet or isocalroic control liquid diet for 12 weeks. Our results showed that U87, 5S rRNA, GAPDH, and U5a were the most reliable gene candidates for reference genes in heart and brain tissue. However, different gene stability ranking was specific for each tissue input combination. The present preliminary findings highlight the variability in reference gene rankings across different experimental conditions and analytic methods and constitute a fundamental step for gene expression assays.

  18. Drosophila mutants of the autism candidate gene neurobeachin (rugose) exhibit neuro-developmental disorders, aberrant synaptic properties, altered locomotion, impaired adult social behavior and activity patterns

    OpenAIRE

    Wise, Alexandra; Tenezaca, Luis; Fernandez, Robert W.; Schatoff, Emma; Flores, Julian; Ueda, Atsushi; Zhong, Xiaotian; Wu, Chun-Fang; Simon, Anne F.; Venkatesh, Tadmiri

    2015-01-01

    Autism spectrum disorder (ASD) is a neurodevelopmental disorder in humans characterized by complex behavioral deficits, including intellectual disability, impaired social interactions and hyperactivity. ASD exhibits a strong genetic component with underlying multi-gene interactions. Candidate gene studies have shown that the neurobeachin gene is disrupted in human patients with idiopathic autism (Castermans et al., 2003). The gene for neurobeachin (NBEA) spans the common fragile site FRA 13A ...

  19. Identifying Stable Reference Genes for qRT-PCR Normalisation in Gene Expression Studies of Narrow-Leafed Lupin (Lupinus angustifolius L..

    Directory of Open Access Journals (Sweden)

    Candy M Taylor

    Full Text Available Quantitative Reverse Transcription PCR (qRT-PCR is currently one of the most popular, high-throughput and sensitive technologies available for quantifying gene expression. Its accurate application depends heavily upon normalisation of gene-of-interest data with reference genes that are uniformly expressed under experimental conditions. The aim of this study was to provide the first validation of reference genes for Lupinus angustifolius (narrow-leafed lupin, a significant grain legume crop using a selection of seven genes previously trialed as reference genes for the model legume, Medicago truncatula. In a preliminary evaluation, the seven candidate reference genes were assessed on the basis of primer specificity for their respective targeted region, PCR amplification efficiency, and ability to discriminate between cDNA and gDNA. Following this assessment, expression of the three most promising candidates [Ubiquitin C (UBC, Helicase (HEL, and Polypyrimidine tract-binding protein (PTB] was evaluated using the NormFinder and RefFinder statistical algorithms in two narrow-leafed lupin lines, both with and without vernalisation treatment, and across seven organ types (cotyledons, stem, leaves, shoot apical meristem, flowers, pods and roots encompassing three developmental stages. UBC was consistently identified as the most stable candidate and has sufficiently uniform expression that it may be used as a sole reference gene under the experimental conditions tested here. However, as organ type and developmental stage were associated with greater variability in relative expression, it is recommended using UBC and HEL as a pair to achieve optimal normalisation. These results highlight the importance of rigorously assessing candidate reference genes for each species across a diverse range of organs and developmental stages. With emerging technologies, such as RNAseq, and the completion of valuable transcriptome data sets, it is possible that other

  20. Validation of candidate genes putatively associated with resistance to SCMV and MDMV in maize (Zea mays L.) by expression profiling

    DEFF Research Database (Denmark)

    Uzarowska, Anna; Dionisio, Giuseppe; Sarholz, Barbara

    2009-01-01

    Background The potyviruses sugarcane mosaic virus (SCMV) and maize dwarf mosaic virus (MDMV) are major pathogens of maize worldwide. Two loci, Scmv1 and Scmv2, have ealier been shown to confer complete resistance to SCMV. Custom-made microarrays containing previously identified SCMV resistance...... the effectiveness and reliability of the combination of different expression profiling approaches for the identification and validation of candidate genes. Genes identified in this study represent possible future targets for manipulation of SCMV resistance in maize....

  1. Quantitative Trait Locus (QTL meta-analysis and comparative genomics for candidate gene prediction in perennial ryegrass (Lolium perenne L.

    Directory of Open Access Journals (Sweden)

    Shinozuka Hiroshi

    2012-11-01

    Full Text Available Abstract Background In crop species, QTL analysis is commonly used for identification of factors contributing to variation of agronomically important traits. As an important pasture species, a large number of QTLs have been reported for perennial ryegrass based on analysis of biparental mapping populations. Further characterisation of those QTLs is, however, essential for utilisation in varietal improvement programs. Results A bibliographic survey of perennial ryegrass trait-dissection studies identified a total of 560 QTLs from previously published papers, of which 189, 270 and 101 were classified as morphology-, physiology- and resistance/tolerance-related loci, respectively. The collected dataset permitted a subsequent meta-QTL study and implementation of a cross-species candidate gene identification approach. A meta-QTL analysis based on use of the BioMercator software was performed to identify two consensus regions for pathogen resistance traits. Genes that are candidates for causal polymorphism underpinning perennial ryegrass QTLs were identified through in silico comparative mapping using rice databases, and 7 genes were assigned to the p150/112 reference map. Markers linked to the LpDGL1, LpPh1 and LpPIPK1 genes were located close to plant size, leaf extension time and heading date-related QTLs, respectively, suggesting that these genes may be functionally associated with important agronomic traits in perennial ryegrass. Conclusions Functional markers are valuable for QTL meta-analysis and comparative genomics. Enrichment of such genetic markers may permit further detailed characterisation of QTLs. The outcomes of QTL meta-analysis and comparative genomics studies may be useful for accelerated development of novel perennial ryegrass cultivars with desirable traits.

  2. Association Mapping and Nucleotide Sequence Variation in Five Drought Tolerance Candidate Genes in Spring Wheat

    Directory of Open Access Journals (Sweden)

    Erena A. Edae

    2013-07-01

    Full Text Available Functional markers are needed for key genes involved in drought tolerance to improve selection for crop yield under moisture stress conditions. The objectives of this study were to (i characterize five drought tolerance candidate genes, namely dehydration responsive element binding 1A (, enhanced response to abscisic acid ( and , and fructan 1-exohydrolase ( and , in wheat ( L. for nucleotide and haplotype diversity, Tajima’s D value, and linkage disequilibrium (LD and (ii associate within-gene single nucleotide polymorphisms (SNPs with phenotypic traits in a spring wheat association mapping panel ( = 126. Field trials were grown under contrasting moisture regimes in Greeley, CO, and Melkassa, Ethiopia, in 2010 and 2011. Genome-specific amplification and DNA sequence analysis of the genes identified SNPs and revealed differences in nucleotide and haplotype diversity, Tajima’s D, and patterns of LD. showed associations (false discovery rate adjusted probability value = 0.1 with normalized difference vegetation index, heading date, biomass, and spikelet number. Both and were associated with harvest index, flag leaf width, and leaf senescence. was associated with grain yield, and was associated with thousand kernel weight and test weight. If validated in relevant genetic backgrounds, the identified marker–trait associations may be applied to functional marker-assisted selection.

  3. Distilling a Visual Network of Retinitis Pigmentosa Gene-Protein Interactions to Uncover New Disease Candidates.

    Directory of Open Access Journals (Sweden)

    Daniel Boloc

    Full Text Available Retinitis pigmentosa (RP is a highly heterogeneous genetic visual disorder with more than 70 known causative genes, some of them shared with other non-syndromic retinal dystrophies (e.g. Leber congenital amaurosis, LCA. The identification of RP genes has increased steadily during the last decade, and the 30% of the cases that still remain unassigned will soon decrease after the advent of exome/genome sequencing. A considerable amount of genetic and functional data on single RD genes and mutations has been gathered, but a comprehensive view of the RP genes and their interacting partners is still very fragmentary. This is the main gap that needs to be filled in order to understand how mutations relate to progressive blinding disorders and devise effective therapies.We have built an RP-specific network (RPGeNet by merging data from different sources: high-throughput data from BioGRID and STRING databases, manually curated data for interactions retrieved from iHOP, as well as interactions filtered out by syntactical parsing from up-to-date abstracts and full-text papers related to the RP research field. The paths emerging when known RP genes were used as baits over the whole interactome have been analysed, and the minimal number of connections among the RP genes and their close neighbors were distilled in order to simplify the search space.In contrast to the analysis of single isolated genes, finding the networks linking disease genes renders powerful etiopathological insights. We here provide an interactive interface, RPGeNet, for the molecular biologist to explore the network centered on the non-syndromic and syndromic RP and LCA causative genes. By integrating tissue-specific expression levels and phenotypic data on top of that network, a more comprehensive biological view will highlight key molecular players of retinal degeneration and unveil new RP disease candidates.

  4. Are TMEM genes potential candidate genes for panic disorder?

    DEFF Research Database (Denmark)

    NO, Gregersen; Buttenschøn, Henriette Nørmølle; Hedemand, Anne

    2014-01-01

    We analysed single nucleotide polymorphisms in two transmembrane genes (TMEM98 and TMEM132E) in panic disorder (PD) patients and control individuals from the Faroe Islands, Denmark and Germany. The genes encode single-pass membrane proteins and are located within chromosome 17q11.2-q12...

  5. Reconstruction of gene regulatory modules from RNA silencing of IFN-α modulators: experimental set-up and inference method.

    Science.gov (United States)

    Grassi, Angela; Di Camillo, Barbara; Ciccarese, Francesco; Agnusdei, Valentina; Zanovello, Paola; Amadori, Alberto; Finesso, Lorenzo; Indraccolo, Stefano; Toffolo, Gianna Maria

    2016-03-12

    Inference of gene regulation from expression data may help to unravel regulatory mechanisms involved in complex diseases or in the action of specific drugs. A challenging task for many researchers working in the field of systems biology is to build up an experiment with a limited budget and produce a dataset suitable to reconstruct putative regulatory modules worth of biological validation. Here, we focus on small-scale gene expression screens and we introduce a novel experimental set-up and a customized method of analysis to make inference on regulatory modules starting from genetic perturbation data, e.g. knockdown and overexpression data. To illustrate the utility of our strategy, it was applied to produce and analyze a dataset of quantitative real-time RT-PCR data, in which interferon-α (IFN-α) transcriptional response in endothelial cells is investigated by RNA silencing of two candidate IFN-α modulators, STAT1 and IFIH1. A putative regulatory module was reconstructed by our method, revealing an intriguing feed-forward loop, in which STAT1 regulates IFIH1 and they both negatively regulate IFNAR1. STAT1 regulation on IFNAR1 was object of experimental validation at the protein level. Detailed description of the experimental set-up and of the analysis procedure is reported, with the intent to be of inspiration for other scientists who want to realize similar experiments to reconstruct gene regulatory modules starting from perturbations of possible regulators. Application of our approach to the study of IFN-α transcriptional response modulators in endothelial cells has led to many interesting novel findings and new biological hypotheses worth of validation.

  6. Comparison of Expression Profiles in Ovarian Epithelium In Vivo and Ovarian Cancer Identifies Novel Candidate Genes Involved in Disease Pathogenesis

    Science.gov (United States)

    Emmanuel, Catherine; Gava, Natalie; Kennedy, Catherine; Balleine, Rosemary L.; Sharma, Raghwa; Wain, Gerard; Brand, Alison; Hogg, Russell; Etemadmoghadam, Dariush; George, Joshy; Birrer, Michael J.; Clarke, Christine L.; Chenevix-Trench, Georgia; Bowtell, David D. L.; Harnett, Paul R.; deFazio, Anna

    2011-01-01

    Molecular events leading to epithelial ovarian cancer are poorly understood but ovulatory hormones and a high number of life-time ovulations with concomitant proliferation, apoptosis, and inflammation, increases risk. We identified genes that are regulated during the estrous cycle in murine ovarian surface epithelium and analysed these profiles to identify genes dysregulated in human ovarian cancer, using publically available datasets. We identified 338 genes that are regulated in murine ovarian surface epithelium during the estrous cycle and dysregulated in ovarian cancer. Six of seven candidates selected for immunohistochemical validation were expressed in serous ovarian cancer, inclusion cysts, ovarian surface epithelium and in fallopian tube epithelium. Most were overexpressed in ovarian cancer compared with ovarian surface epithelium and/or inclusion cysts (EpCAM, EZH2, BIRC5) although BIRC5 and EZH2 were expressed as highly in fallopian tube epithelium as in ovarian cancer. We prioritised the 338 genes for those likely to be important for ovarian cancer development by in silico analyses of copy number aberration and mutation using publically available datasets and identified genes with established roles in ovarian cancer as well as novel genes for which we have evidence for involvement in ovarian cancer. Chromosome segregation emerged as an important process in which genes from our list of 338 were over-represented including two (BUB1, NCAPD2) for which there is evidence of amplification and mutation. NUAK2, upregulated in ovarian surface epithelium in proestrus and predicted to have a driver mutation in ovarian cancer, was examined in a larger cohort of serous ovarian cancer where patients with lower NUAK2 expression had shorter overall survival. In conclusion, defining genes that are activated in normal epithelium in the course of ovulation that are also dysregulated in cancer has identified a number of pathways and novel candidate genes that may contribute

  7. Fine Mapping and Transcriptome Analysis Reveal Candidate Genes Associated with Hybrid Lethality in Cabbage (Brassica Oleracea).

    Science.gov (United States)

    Xiao, Zhiliang; Hu, Yang; Zhang, Xiaoli; Xue, Yuqian; Fang, Zhiyuan; Yang, Limei; Zhang, Yangyong; Liu, Yumei; Li, Zhansheng; Liu, Xing; Liu, Zezhou; Lv, Honghao; Zhuang, Mu

    2017-06-05

    Hybrid lethality is a deleterious phenotype that is vital to species evolution. We previously reported hybrid lethality in cabbage ( Brassica oleracea ) and performed preliminary mapping of related genes. In the present study, the fine mapping of hybrid lethal genes revealed that BoHL1 was located on chromosome C1 between BoHLTO124 and BoHLTO130, with an interval of 101 kb. BoHL2 was confirmed to be between insertion-deletion (InDels) markers HL234 and HL235 on C4, with a marker interval of 70 kb. Twenty-eight and nine annotated genes were found within the two intervals of BoHL1 and BoHL2 , respectively. We also applied RNA-Seq to analyze hybrid lethality in cabbage. In the region of BoHL1 , seven differentially expressed genes (DEGs) and five resistance (R)-related genes (two in common, i.e., Bo1g153320 and Bo1g153380 ) were found, whereas in the region of BoHL2 , two DEGs and four R-related genes (two in common, i.e., Bo4g173780 and Bo4g173810 ) were found. Along with studies in which R genes were frequently involved in hybrid lethality in other plants, these interesting R-DEGs may be good candidates associated with hybrid lethality. We also used SNP/InDel analyses and quantitative real-time PCR to confirm the results. This work provides new insight into the mechanisms of hybrid lethality in cabbage.

  8. Selection of Housekeeping Genes for Transgene Expression Analysis in Eucommia ulmoides Oliver Using Real-Time RT-PCR

    Directory of Open Access Journals (Sweden)

    Ren Chen

    2010-01-01

    Full Text Available In order to select appropriate housekeeping genes for accurate calibration of experimental variations in real-time (RT- PCR results in transgene expression analysis, particularly with respect to the influence of transgene on stability of endogenous housekeeping gene expression in transgenic plants, we outline a reliable strategy to identify the optimal housekeeping genes from a set of candidates by combining statistical analyses of their (RT- PCR amplification efficiency, gene expression stability, and transgene influences. We used the strategy to select two genes, ACTα and EF1α, from 10 candidate housekeeping genes, as the optimal housekeeping genes to evaluate transgenic Eucommia ulmoides Oliver root lines overexpressing IPPI or FPPS1 genes, which are involved in isoprenoid biosynthesis.

  9. Genetic analysis and fine mapping of LH1 and LH2, a set of complementary genes controlling late heading in rice (Oryza sativa L.).

    Science.gov (United States)

    Liu, Shuang; Wang, Feng; Gao, Li Jun; Li, Jin Hua; Li, Rong Bai; Gao, Han Liang; Deng, Guo Fu; Yang, Jin Shui; Luo, Xiao Jin

    2012-12-01

    Heading date in rice (Oryza sativa L.) is a critical agronomic trait with a complex inheritance. To investigate the genetic basis and mechanism of gene interaction in heading date, we conducted genetic analysis on segregation populations derived from crosses among the indica cultivars Bo B, Yuefeng B and Baoxuan 2. A set of dominant complementary genes controlling late heading, designated LH1 and LH2, were detected by molecular marker mapping. Genetic analysis revealed that Baoxuan 2 contains both dominant genes, while Bo B and Yuefeng B each possess either LH1 or LH2. Using larger populations with segregant ratios of 3 : 1, we fine-mapped LH1 to a 63-kb region near the centromere of chromosome 7 flanked by markers RM5436 and RM8034, and LH2 to a 177-kb region on the short arm of chromosome 8 between flanking markers Indel22468-3 and RM25. Some candidate genes were identified through sequencing of Bo B and Yuefeng B in these target regions. Our work provides a solid foundation for further study on gene interaction in heading date and has application in marker-assisted breeding of photosensitive hybrid rice in China.

  10. Gene set-based module discovery in the breast cancer transcriptome

    Directory of Open Access Journals (Sweden)

    Zhang Michael Q

    2009-02-01

    Full Text Available Abstract Background Although microarray-based studies have revealed global view of gene expression in cancer cells, we still have little knowledge about regulatory mechanisms underlying the transcriptome. Several computational methods applied to yeast data have recently succeeded in identifying expression modules, which is defined as co-expressed gene sets under common regulatory mechanisms. However, such module discovery methods are not applied cancer transcriptome data. Results In order to decode oncogenic regulatory programs in cancer cells, we developed a novel module discovery method termed EEM by extending a previously reported module discovery method, and applied it to breast cancer expression data. Starting from seed gene sets prepared based on cis-regulatory elements, ChIP-chip data, and gene locus information, EEM identified 10 principal expression modules in breast cancer based on their expression coherence. Moreover, EEM depicted their activity profiles, which predict regulatory programs in each subtypes of breast tumors. For example, our analysis revealed that the expression module regulated by the Polycomb repressive complex 2 (PRC2 is downregulated in triple negative breast cancers, suggesting similarity of transcriptional programs between stem cells and aggressive breast cancer cells. We also found that the activity of the PRC2 expression module is negatively correlated to the expression of EZH2, a component of PRC2 which belongs to the E2F expression module. E2F-driven EZH2 overexpression may be responsible for the repression of the PRC2 expression modules in triple negative tumors. Furthermore, our network analysis predicts regulatory circuits in breast cancer cells. Conclusion These results demonstrate that the gene set-based module discovery approach is a powerful tool to decode regulatory programs in cancer cells.

  11. Molecular mapping and candidate gene analysis for resistance to powdery mildew in Cucumis sativus stem.

    Science.gov (United States)

    Liu, P N; Miao, H; Lu, H W; Cui, J Y; Tian, G L; Wehner, T C; Gu, X F; Zhang, S P

    2017-08-31

    Powdery mildew (PM) of cucumber (Cucumis sativus), caused by Podosphaera xanthii, is a major foliar disease worldwide and resistance is one of the main objectives in cucumber breeding programs. The resistance to PM in cucumber stem is important to the resistance for the whole plant. In this study, genetic analysis and gene mapping were implemented with cucumber inbred lines NCG-122 (with resistance to PM in the stem) and NCG-121 (with susceptibility in the stem). Genetic analysis showed that resistance to PM in the stem of NCG-122 was qualitative and controlled by a single-recessive nuclear gene (pm-s). Susceptibility was dominant to resistance. In the initial genetic mapping of the pm-s gene, 10 SSR markers were discovered to be linked to pm-s, which was mapped to chromosome 5 (Chr.5) of cucumber. The pm-s gene's closest flanking markers were SSR20486 and SSR06184/SSR13237 with genetic distances of 0.9 and 1.8 cM, respectively. One hundred and fifty-seven pairs of new SSR primers were exploited by the sequence information in the initial mapping region of pm-s. The analysis on the F 2 mapping population using the new molecular markers showed that 17 SSR markers were confirmed to be linked to the pm-s gene. The two closest flanking markers, pmSSR27and pmSSR17, were 0.1 and 0.7 cM from pm-s, respectively, confirming the location of this gene on Chr.5. The physical length of the genomic region containing pm-s was 135.7 kb harboring 21 predicted genes. Among these genes, the gene Csa5G623470 annotated as encoding Mlo-related protein was defined as the most probable candidate gene for the pm-s. The results of this study will provide a basis for marker-assisted selection, and make the benefit for the cloning of the resistance gene.

  12. A candidate-gene association study for berry colour and anthocyanin content in Vitis vinifera L.

    Directory of Open Access Journals (Sweden)

    Silvana Cardoso

    Full Text Available Anthocyanin content is a trait of major interest in Vitis vinifera L. These compounds affect grape and wine quality, and have beneficial effects on human health. A candidate-gene approach was used to identify genetic variants associated with anthocyanin content in grape berries. A total of 445 polymorphisms were identified in 5 genes encoding transcription factors and 10 genes involved in either the biosynthetic pathway or transport of anthocyanins. A total of 124 SNPs were selected to examine association with a wide range of phenotypes based on RP-HPLC analysis and visual characterization. The phenotypes were total skin anthocyanin (TSA concentration but also specific types of anthocyanins and relative abundance. The visual assessment was based on OIV (Organisation Internationale de la Vigne et du Vin descriptors for berry and skin colour. The genes encoding the transcription factors MYB11, MYBCC and MYC(B were significantly associated with TSA concentration. UFGT and MRP were associated with several different types of anthocyanins. Skin and pulp colour were associated with nine genes (MYB11, MYBCC, MYC(B, UFGT, MRP, DFR, LDOX, CHI and GST. Pulp colour was associated with a similar group of 11 genes (MYB11, MYBCC, MYC(B, MYC(A, UFGT, MRP, GST, DFR, LDOX, CHI and CHS(A. Statistical interactions were observed between SNPs within the transcription factors MYB11, MYBCC and MYC(B. SNPs within LDOX interacted with MYB11 and MYC(B, while SNPs within CHI interacted with MYB11 only. Together, these findings suggest the involvement of these genes in anthocyanin content and on the regulation of anthocyanin biosynthesis. This work forms a benchmark for replication and functional studies.

  13. Genome association study through nonlinear mixed models revealed new candidate genes for pig growth curves

    Directory of Open Access Journals (Sweden)

    Fabyano Fonseca e Silva

    Full Text Available ABSTRACT: Genome association analyses have been successful in identifying quantitative trait loci (QTLs for pig body weights measured at a single age. However, when considering the whole weight trajectories over time in the context of genome association analyses, it is important to look at the markers that affect growth curve parameters. The easiest way to consider them is via the two-step method, in which the growth curve parameters and marker effects are estimated separately, thereby resulting in a reduction of the statistical power and the precision of estimates. One efficient solution is to adopt nonlinear mixed models (NMM, which enables a joint modeling of the individual growth curves and marker effects. Our aim was to propose a genome association analysis for growth curves in pigs based on NMM as well as to compare it with the traditional two-step method. In addition, we also aimed to identify the nearest candidate genes related to significant SNP (single nucleotide polymorphism markers. The NMM presented a higher number of significant SNPs for adult weight (A and maturity rate (K, and provided a direct way to test SNP significance simultaneously for both the A and K parameters. Furthermore, all significant SNPs from the two-step method were also reported in the NMM analysis. The ontology of the three candidate genes (SH3BGRL2, MAPK14, and MYL9 derived from significant SNPs (simultaneously affecting A and K allows us to make inferences with regards to their contribution to the pig growth process in the population studied.

  14. High-density polymorphisms analysis of 23 candidate genes for association with bone mineral density.

    Science.gov (United States)

    Giroux, Sylvie; Elfassihi, Latifa; Clément, Valérie; Bussières, Johanne; Bureau, Alexandre; Cole, David E C; Rousseau, François

    2010-11-01

    Osteoporosis is a bone disease characterized by low bone mineral density (BMD), a highly heritable and polygenic trait. Women are more prone than men to develop osteoporosis due to a lower peak bone mass and accelerated bone loss at menopause. Peak bone mass has been convincingly shown to be due to genetic factors with heritability up to 80%. Menopausal bone loss has been shown to have around 38% to 49% heritability depending on the site studied. To have more statistical power to detect small genetic effects we focused on premenopausal women. We studied 23 candidate genes, some involved in calcium and vitamin-D regulation and others because estrogens strongly induced their gene expression in mice where it was correlated with humerus trabecular bone density. High-density polymorphisms were selected to cover the entire gene variability and 231 polymorphisms were genotyped in a first sample of 709 premenopausal women. Positive associations were retested in a second, independent, sample of 673 premenopausal women. Ten polymorphisms remained associated with BMD in the combined samples and one was further associated in a large sample of postmenopausal women (1401 women). This associated polymorphism was located in the gene CSF3R (granulocyte colony stimulating factor receptor) that had never been associated with BMD before. The results reported in this study suggest a role for CSF3R in the determination of bone density in women. Copyright © 2010 Elsevier Inc. All rights reserved.

  15. Genetic basis of qualitative and quantitative resistance to powdery mildew in wheat: from consensus regions to candidate genes.

    Science.gov (United States)

    Marone, Daniela; Russo, Maria A; Laidò, Giovanni; De Vita, Pasquale; Papa, Roberto; Blanco, Antonio; Gadaleta, Agata; Rubiales, Diego; Mastrangelo, Anna M

    2013-08-19

    Powdery mildew (Blumeria graminis f. sp. tritici) is one of the most damaging diseases of wheat. The objective of this study was to identify the wheat genomic regions that are involved in the control of powdery mildew resistance through a quantitative trait loci (QTL) meta-analysis approach. This meta-analysis allows the use of collected QTL data from different published studies to obtain consensus QTL across different genetic backgrounds, thus providing a better definition of the regions responsible for the trait, and the possibility to obtain molecular markers that will be suitable for marker-assisted selection. Five QTL for resistance to powdery mildew were identified under field conditions in the durum-wheat segregating population Creso × Pedroso. An integrated map was developed for the projection of resistance genes/ alleles and the QTL from the present study and the literature, and to investigate their distribution in the wheat genome. Molecular markers that correspond to candidate genes for plant responses to pathogens were also projected onto the map, particularly considering NBS-LRR and receptor-like protein kinases. More than 80 independent QTL and 51 resistance genes from 62 different mapping populations were projected onto the consensus map using the Biomercator statistical software. Twenty-four MQTL that comprised 2-6 initial QTL that had widely varying confidence intervals were found on 15 chromosomes. The co-location of the resistance QTL and genes was investigated. Moreover, from analysis of the sequences of DArT markers, 28 DArT clones mapped on wheat chromosomes have been shown to be associated with the NBS-LRR genes and positioned in the same regions as the MQTL for powdery mildew resistance. The results from the present study provide a detailed analysis of the genetic basis of resistance to powdery mildew in wheat. The study of the Creso × Pedroso durum-wheat population has revealed some QTL that had not been previously identified. Furthermore

  16. SNP-by-fitness and SNP-by-BMI interactions from seven candidate genes and incident hypertension after 20 years of follow-up: the CARDIA Fitness Study.

    Science.gov (United States)

    Sarzynski, M A; Rankinen, T; Sternfeld, B; Fornage, M; Sidney, S; Bouchard, C

    2011-08-01

    The association of single nucleotide polymorphisms (SNPs) from seven candidate genes, including genotype-by-baseline fitness and genotype-by-baseline body mass index (BMI) interactions, with incident hypertension over 20 years was investigated in 2663 participants (1301 blacks, 1362 whites) of the Coronary Artery Risk Development in Young Adults Study (CARDIA). Baseline cardiorespiratory fitness was determined from duration of a modified Balke treadmill test. A total of 98 SNPs in blacks and 89 SNPs in whites from seven candidate genes were genotyped. Participants that became hypertensive (295 blacks and 146 whites) had significantly higher blood pressure and BMI (both races), and lower fitness (blacks only) at baseline than those who remained normotensive. Markers at the peroxisome proliferative activated receptor gamma coactivator 1α (PPARGC1A) and bradykinin β2 receptor (BDKRB2) genes were nominally associated with greater risk of hypertension, although one marker each at the BDKRB2 and endothelial nitric oxide synthase-3 (NOS3) genes were nominally associated with lower risk. The association of baseline fitness with risk of hypertension was nominally modified by genotype at markers within the angiotensin converting enzyme, angiotensinogen, BDKRB2 and NOS3 genes in blacks and the BDKRB2, endothelin-1 and PPARGC1A genes in whites. BDKRB2 rs4900318 showed nominal interactions with baseline fitness on the risk of hypertension in both races. The association of baseline BMI with risk of hypertension was nominally modified by GNB3 rs2301339 genotype in whites. None of the above associations were statistically significant after correcting for multiple testing. We found that SNPs in these candidate genes did not modify the association between baseline fitness or BMI and risk of hypertension in CARDIA participants.

  17. Exclusion of candidate genes from the chromosome 1q juvenile glaucoma region and mapping of the peripheral cannabis receptor gene (CNR2) to chromosome 1

    Energy Technology Data Exchange (ETDEWEB)

    Sunden, S.L.F.; Nichols, B.E.; Alward, W.L.M. [Univ. of Iowa, Iowa City, IA (United States)] [and others

    1994-09-01

    Juvenile onset primary open angle glaucoma has been mapped by linkage to 1q21-q31. Several candidate genes were evaluated in the same family used to identify the primary linkage. Atrionatriuretic peptide receptor A (NPR1) and laminin C1 (LAMC1) have been previously mapped to this region and could putatively play a role in the pathogenesis of glaucoma. A third gene, the peripheral cannabis receptor (CNR2) was not initially mapped in humans but was a candidate because of the relief that cannabis affords some patients with primary open angle glaucoma. Microsatellites associated with NPR1 and LAMC1 revealed multiple recombinations in affected members of this pedigree. CNR2 was shown to be on chromosome 1 by PCR amplification of a 150 bp fragment of the 3{prime} untranslated region in monochromosomal somatic cell hybrids (NIGMS panel No. 2). These primers also revealed a two allele single strand conformation polymorphism which showed multiple recombinants with juvenile onset primary open angle glaucoma in large pedigrees, segregating this disorder. The marker was then mapped to 1p34-p36 by linkage, with the most likely location between liver alkaline phosphatase (ALPL) and alpha-L-1 fucosidase (FUCA1).

  18. Characterization of Genes for Beef Marbling Based on Applying Gene Coexpression Network

    Directory of Open Access Journals (Sweden)

    Dajeong Lim

    2014-01-01

    Full Text Available Marbling is an important trait in characterization beef quality and a major factor for determining the price of beef in the Korean beef market. In particular, marbling is a complex trait and needs a system-level approach for identifying candidate genes related to the trait. To find the candidate gene associated with marbling, we used a weighted gene coexpression network analysis from the expression value of bovine genes. Hub genes were identified; they were topologically centered with large degree and BC values in the global network. We performed gene expression analysis to detect candidate genes in M. longissimus with divergent marbling phenotype (marbling scores 2 to 7 using qRT-PCR. The results demonstrate that transmembrane protein 60 (TMEM60 and dihydropyrimidine dehydrogenase (DPYD are associated with increasing marbling fat. We suggest that the network-based approach in livestock may be an important method for analyzing the complex effects of candidate genes associated with complex traits like marbling or tenderness.

  19. Identification of candidate genes associated with porcine meat color traits by genome-wide transcriptome analysis.

    Science.gov (United States)

    Li, Bojiang; Dong, Chao; Li, Pinghua; Ren, Zhuqing; Wang, Han; Yu, Fengxiang; Ning, Caibo; Liu, Kaiqing; Wei, Wei; Huang, Ruihua; Chen, Jie; Wu, Wangjun; Liu, Honglin

    2016-10-17

    Meat color is considered to be the most important indicator of meat quality, however, the molecular mechanisms underlying traits related to meat color remain mostly unknown. In this study, to elucidate the molecular basis of meat color, we constructed six cDNA libraries from biceps femoris (Bf) and soleus (Sol), which exhibit obvious differences in meat color, and analyzed the whole-transcriptome differences between Bf (white muscle) and Sol (red muscle) using high-throughput sequencing technology. Using DEseq2 method, we identified 138 differentially expressed genes (DEGs) between Bf and Sol. Using DEGseq method, we identified 770, 810, and 476 DEGs in comparisons between Bf and Sol in three separate animals. Of these DEGs, 52 were overlapping DEGs. Using these data, we determined the enriched GO terms, metabolic pathways and candidate genes associated with meat color traits. Additionally, we mapped 114 non-redundant DEGs to the meat color QTLs via a comparative analysis with the porcine quantitative trait loci (QTL) database. Overall, our data serve as a valuable resource for identifying genes whose functions are critical for meat color traits and can accelerate studies of the molecular mechanisms of meat color formation.

  20. mRNA expression pattern of selected candidate genes differs in bovine oviductal epithelial cells in vitro compared with the in vivo state and during cell culture passages.

    Science.gov (United States)

    Danesh Mesgaran, Sadjad; Sharbati, Jutta; Einspanier, Ralf; Gabler, Christoph

    2016-08-15

    The mammalian oviduct provides the optimal environment for gamete maturation including sperm capacitation, fertilization, and development of the early embryo. Various cell culture models for primary bovine oviductal epithelial cells (BOEC) were established to reveal such physiological events. The aim of this study was to evaluate 17 candidate mRNA expression patterns in oviductal epithelial cells (1) in transition from in vivo cells to in vitro cells; (2) during three consecutive cell culture passages; (3) affected by the impact of LOW or HIGH glucose content media; and (4) influenced by different phases of the estrous cycle in vivo and in vitro. In addition, the release of a metabolite and proteins from BOEC at two distinct cell culture passage numbers was estimated to monitor the functionality. BOEC from 8 animals were isolated and cultured for three consecutive passages. Total RNA was extracted from in vivo and in vitro samples and subjected to reverse transcription quantitative polymerase chain reaction to reveal mRNA expression of selected candidate genes. The release of prostaglandin E2 (PGE2), oviduct-specific glycoprotein 1 (OVGP1) and interleukin 8 (IL8) by BOEC was measured by EIA or ELISA after 24 h. Almost all candidate genes (prostaglandin synthases, enzymes of cellular metabolism and mucins) mRNA expression pattern differed compared in vivo with in vitro state. In addition, transcription of most candidate genes was influenced by the number of cell culture passages. Different glucose medium content did not affect mRNA expression of most candidate genes. The phase of the estrous cycle altered some candidate mRNA expression in BOEC in vitro at later passages. The release of PGE2 and OVGP1 between passages did not differ. However, BOEC in passage 3 released significantly higher amount of IL8 compared with cells in passage 0. This study supports the hypothesis that candidate mRNA expression in BOEC was influenced by transition from the in vivo situation

  1. The null hypothesis of GSEA, and a novel statistical model for competitive gene set analysis

    DEFF Research Database (Denmark)

    Debrabant, Birgit

    2017-01-01

    MOTIVATION: Competitive gene set analysis intends to assess whether a specific set of genes is more associated with a trait than the remaining genes. However, the statistical models assumed to date to underly these methods do not enable a clear cut formulation of the competitive null hypothesis....... This is a major handicap to the interpretation of results obtained from a gene set analysis. RESULTS: This work presents a hierarchical statistical model based on the notion of dependence measures, which overcomes this problem. The two levels of the model naturally reflect the modular structure of many gene set...... analysis methods. We apply the model to show that the popular GSEA method, which recently has been claimed to test the self-contained null hypothesis, actually tests the competitive null if the weight parameter is zero. However, for this result to hold strictly, the choice of the dependence measures...

  2. A new web-based data mining tool for the identification of candidate genes for human genetic disorders

    NARCIS (Netherlands)

    Driel, van M.A.; Cuelenaere, K.; Kemmeren, P.P.C.W.; Leunissen, J.A.M.; Brunner, H.G.

    2003-01-01

    To identify the gene underlying a human genetic disorder can be difficult and time-consuming. Typically, positional data delimit a chromosomal region that contains between 20 and 200 genes. The choice then lies between sequencing large numbers of genes, or setting priorities by combining positional

  3. Novel candidate genes and regions for childhood apraxia of speech identified by array comparative genomic hybridization.

    Science.gov (United States)

    Laffin, Jennifer J S; Raca, Gordana; Jackson, Craig A; Strand, Edythe A; Jakielski, Kathy J; Shriberg, Lawrence D

    2012-11-01

    The goal of this study was to identify new candidate genes and genomic copy-number variations associated with a rare, severe, and persistent speech disorder termed childhood apraxia of speech. Childhood apraxia of speech is the speech disorder segregating with a mutation in FOXP2 in a multigenerational London pedigree widely studied for its role in the development of speech-language in humans. A total of 24 participants who were suspected to have childhood apraxia of speech were assessed using a comprehensive protocol that samples speech in challenging contexts. All participants met clinical-research criteria for childhood apraxia of speech. Array comparative genomic hybridization analyses were completed using a customized 385K Nimblegen array (Roche Nimblegen, Madison, WI) with increased coverage of genes and regions previously associated with childhood apraxia of speech. A total of 16 copy-number variations with potential consequences for speech-language development were detected in 12 or half of the 24 participants. The copy-number variations occurred on 10 chromosomes, 3 of which had two to four candidate regions. Several participants were identified with copy-number variations in two to three regions. In addition, one participant had a heterozygous FOXP2 mutation and a copy-number variation on chromosome 2, and one participant had a 16p11.2 microdeletion and copy-number variations on chromosomes 13 and 14. Findings support the likelihood of heterogeneous genomic pathways associated with childhood apraxia of speech.

  4. Genome-wide association study to identify candidate loci and genes for Mn toxicity tolerance in rice.

    Directory of Open Access Journals (Sweden)

    Asis Shrestha

    Full Text Available Manganese (Mn is an essential micro-nutrient for plants, but flooded rice fields can accumulate high levels of Mn2+ leading to Mn toxicity. Here, we present a genome-wide association study (GWAS to identify candidate loci conferring Mn toxicity tolerance in rice (Oryza sativa L.. A diversity panel of 288 genotypes was grown in hydroponic solutions in a greenhouse under optimal and toxic Mn concentrations. We applied a Mn toxicity treatment (5 ppm Mn2+, 3 weeks at twelve days after transplanting. Mn toxicity caused moderate damage in rice in terms of biomass loss and symptom formation despite extremely high shoot Mn concentrations ranging from 2.4 to 17.4 mg g-1. The tropical japonica subpopulation was more sensitive to Mn toxicity than other subpopulations. Leaf damage symptoms were significantly correlated with Mn uptake into shoots. Association mapping was conducted for seven traits using 416741 single nucleotide polymorphism (SNP markers using a mixed linear model, and detected six significant associations for the traits shoot manganese concentration and relative shoot length. Candidate regions contained genes coding for a heavy metal transporter, peroxidase precursor and Mn2+ ion binding proteins. The significant marker SNP-2.22465867 caused an amino acid change in a gene (LOC_Os02g37170 with unknown function. This study demonstrated significant natural variation in rice for Mn toxicity tolerance and the possibility of using GWAS to unravel genetic factors responsible for such complex traits.

  5. Cumulative Impact of Polychlorinated Biphenyl and Large Chromosomal Duplications on DNA Methylation, Chromatin, and Expression of Autism Candidate Genes.

    Science.gov (United States)

    Dunaway, Keith W; Islam, M Saharul; Coulson, Rochelle L; Lopez, S Jesse; Vogel Ciernia, Annie; Chu, Roy G; Yasui, Dag H; Pessah, Isaac N; Lott, Paul; Mordaunt, Charles; Meguro-Horike, Makiko; Horike, Shin-Ichi; Korf, Ian; LaSalle, Janine M

    2016-12-13

    Rare variants enriched for functions in chromatin regulation and neuronal synapses have been linked to autism. How chromatin and DNA methylation interact with environmental exposures at synaptic genes in autism etiologies is currently unclear. Using whole-genome bisulfite sequencing in brain tissue and a neuronal cell culture model carrying a 15q11.2-q13.3 maternal duplication, we find that significant global DNA hypomethylation is enriched over autism candidate genes and affects gene expression. The cumulative effect of multiple chromosomal duplications and exposure to the pervasive persistent organic pollutant PCB 95 altered methylation of more than 1,000 genes. Hypomethylated genes were enriched for H2A.Z, increased maternal UBE3A in Dup15q corresponded to reduced levels of RING1B, and bivalently modified H2A.Z was altered by PCB 95 and duplication. These results demonstrate the compounding effects of genetic and environmental insults on the neuronal methylome that converge upon dysregulation of chromatin and synaptic genes. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.

  6. Constellation Map: Downstream visualization and interpretation of gene set enrichment results [version 1; referees: 2 approved

    Directory of Open Access Journals (Sweden)

    Yan Tan

    2015-06-01

    Full Text Available Summary: Gene set enrichment analysis (GSEA approaches are widely used to identify coordinately regulated genes associated with phenotypes of interest. Here, we present Constellation Map, a tool to visualize and interpret the results when enrichment analyses yield a long list of significantly enriched gene sets. Constellation Map identifies commonalities that explain the enrichment of multiple top-scoring gene sets and maps the relationships between them. Constellation Map can help investigators take full advantage of GSEA and facilitates the biological interpretation of enrichment results. Availability: Constellation Map is freely available as a GenePattern module at http://www.genepattern.org.

  7. Fine-Mapping Resolves Eae23 into Two QTLs and Implicates ZEB1 as a Candidate Gene Regulating Experimental Neuroinflammation in Rat

    OpenAIRE

    Stridh, Pernilla; Thessen Hedreul, Melanie; Beyeen, Amennai Daniel; Adzemovic, Milena Z.; Laaksonen, Hannes; Gillett, Alan; ?ckinger, Johan; Marta, Monica; Lassmann, Hans; Becanovic, Kristina; Jagodic, Maja; Olsson, Tomas

    2010-01-01

    BACKGROUND: To elucidate mechanisms involved in multiple sclerosis (MS), we studied genetic regulation of experimental autoimmune encephalomyelitis (EAE) in rats, assuming a conservation of pathogenic pathways. In this study, we focused on Eae23, originally identified to regulate EAE in a (LEW.1AV1xPVG.1AV1)F2 cross. Our aim was to determine whether one or more genes within the 67 Mb region regulate EAE and to define candidate risk genes. METHODOLOGY/PRINCIPAL FINDINGS: We used high resolutio...

  8. A transcriptomic scan for potential candidate genes involved in osmoregulation in an obligate freshwater palaemonid prawn (Macrobrachium australiense

    Directory of Open Access Journals (Sweden)

    Azam Moshtaghi

    2016-10-01

    Full Text Available Background Understanding the genomic basis of osmoregulation (candidate genes and/or molecular mechanisms controlling the phenotype addresses one of the fundamental questions in evolutionary ecology. Species distributions and adaptive radiations are thought to be controlled by environmental salinity levels, and efficient osmoregulatory (ionic balance ability is the main mechanism to overcome the problems related to environmental salinity gradients. Methods To better understand how osmoregulatory performance in freshwater (FW crustaceans allow individuals to acclimate and adapt to raised salinity conditions, here we (i, reviewed the literature on genes that have been identified to be associated with osmoregulation in FW crustaceans, and (ii, performed a transcriptomic analysis using cDNA libraries developed from mRNA isolated from three important osmoregulatory tissues (gill, antennal gland, hepatopancreas and total mRNA from post larvae taken from the freshwater prawn, Macrobrachium australiense using Illumina deep sequencing technology. This species was targeted because it can complete its life cycle totally in freshwater but, like many Macrobrachium sp., can also tolerate brackish water conditions and hence should have genes associated with tolerance of both FW and saline conditions. Results We obtained between 55.4 and 65.2 million Illumina read pairs from four cDNA libraries. Overall, paired end sequences assembled into a total of 125,196 non-redundant contigs (≥200 bp with an N50 length of 2,282 bp and an average contig length of 968 bp. Transcriptomic analysis of M. australiense identified 32 different gene families that were potentially involved with osmoregulatory capacity. A total of 32,597 transcripts were specified with gene ontology (GO terms identified on the basis of GO categories. Abundance estimation of expressed genes based on TPM (transcript per million ≥20 showed 1625 transcripts commonly expressed in all four libraries

  9. Genome-Wide Association Studies Candidate Gene to Dual Modifier of Nonalcoholic Steatohepatitis and Atherosclerosis

    Directory of Open Access Journals (Sweden)

    Clint L. Miller, PhD

    2016-12-01

    Full Text Available Nonalcoholic steatohepatitis is a common disease involving chronic accumulation of fat and inflammation in the liver, often leading to advanced fibrosis, cirrhosis, and cancer. It is known that nonalcoholic steatohepatitis shares many features with atherosclerosis; however, there are still no effective therapeutics. In a recent study published in Nature, investigators demonstrated that mice lacking a high-density lipoprotein–associated gene were surprisingly protected from both steatohepatitis and atherosclerosis through the stabilization of the liver X receptor. This work reveals a timely candidate target for 2 highly prevalent cardiovascular diseases.

  10. Porcine Is a Positional Candidate Gene Associated with Growth and Fat Deposition

    Directory of Open Access Journals (Sweden)

    Bong Hwan Choi

    2012-12-01

    Full Text Available Crosses between Korean and Landrace pigs have revealed a large quantitative trait loci (QTL region for fat deposition in a region (89 cM of porcine chromosome 4 (SSC4. To more finely map this QTL region and identify candidate genes for this trait, comparative mapping of pig and human chromosomes was performed in the present study. A region in the human genome that corresponds to the porcine QTL region was identified in HSA1q21. Furthermore, the LMNA gene, which is tightly associated with fat augmentation in humans, was localized to this region. Radiation hybrid (RH mapping using a Sus scrofa RH panel localized LMNA to a region of 90.3 cM in the porcine genome, distinct from microsatellite marker S0214 (87.3 cM. Two-point analysis showed that LMNA was linked to S0214, SW1996, and S0073 on SSC4 with logarithm (base 10 of odds scores of 20.98, 17.78, and 16.73, respectively. To clone the porcine LMNA gene and to delineate the genomic structure and sequences, including the 3′untranslated region (UTR, rapid amplification of cDNA ends was performed. The coding sequence of porcine LMNA consisted of 1,719 bp, flanked by a 5’UTR and a 3’UTR. Two synonymous single nucleotide polymorphisms (SNPs were identified in exons 3 and 7. Association tests showed that the SNP located in exon 3 (A193A was significantly associated with weight at 30 wks (p<0.01 and crude fat content (p<0.05. This association suggests that SNPs located in LMNA could be used for marker-assisted selection in pigs.

  11. Dopaminergic, Serotonergic, and Oxytonergic Candidate Genes Associated with Infant Attachment Security and Disorganization? In Search of Main and Interaction Effects

    Science.gov (United States)

    Luijk, Maartje P. C. M.; Roisman, Glenn I.; Haltigan, John D.; Tiemeier, Henning; Booth-LaForce, Cathryn; van IJzendoorn, Marinus H.; Belsky, Jay; Uitterlinden, Andre G.; Jaddoe, Vincent W. V.; Hofman, Albert; Verhulst, Frank C.; Tharner, Anne; Bakermans-Kranenburg, Marian J.

    2011-01-01

    Background and methods: In two birth cohort studies with genetic, sensitive parenting, and attachment data of more than 1,000 infants in total, we tested main and interaction effects of candidate genes involved in the dopamine, serotonin, and oxytocin systems ("DRD4", "DRD2", "COMT", "5-HTT", "OXTR") on attachment security and disorganization.…

  12. Synergistic interactions between Drosophila orthologues of genes spanned by de novo human CNVs support multiple-hit models of autism.

    Science.gov (United States)

    Grice, Stuart J; Liu, Ji-Long; Webber, Caleb

    2015-03-01

    Autism spectrum disorders (ASDs) are highly heritable and characterised by deficits in social interaction and communication, as well as restricted and repetitive behaviours. Although a number of highly penetrant ASD gene variants have been identified, there is growing evidence to support a causal role for combinatorial effects arising from the contributions of multiple loci. By examining synaptic and circadian neurological phenotypes resulting from the dosage variants of unique human:fly orthologues in Drosophila, we observe numerous synergistic interactions between pairs of informatically-identified candidate genes whose orthologues are jointly affected by large de novo copy number variants (CNVs). These CNVs were found in the genomes of individuals with autism, including a patient carrying a 22q11.2 deletion. We first demonstrate that dosage alterations of the unique Drosophila orthologues of candidate genes from de novo CNVs that harbour only a single candidate gene display neurological defects similar to those previously reported in Drosophila models of ASD-associated variants. We then considered pairwise dosage changes within the set of orthologues of candidate genes that were affected by the same single human de novo CNV. For three of four CNVs with complete orthologous relationships, we observed significant synergistic effects following the simultaneous dosage change of gene pairs drawn from a single CNV. The phenotypic variation observed at the Drosophila synapse that results from these interacting genetic variants supports a concordant phenotypic outcome across all interacting gene pairs following the direction of human gene copy number change. We observe both specificity and transitivity between interactors, both within and between CNV candidate gene sets, supporting shared and distinct genetic aetiologies. We then show that different interactions affect divergent synaptic processes, demonstrating distinct molecular aetiologies. Our study illustrates

  13. Identification and Comparison of Candidate Olfactory Genes in the Olfactory and Non-Olfactory Organs of Elm Pest Ambrostoma quadriimpressum (Coleoptera: Chrysomelidae) Based on Transcriptome Analysis.

    Science.gov (United States)

    Wang, Yinliang; Chen, Qi; Zhao, Hanbo; Ren, Bingzhong

    2016-01-01

    The leaf beetle Ambrostoma quadriimpressum (Coleoptera: Chrysomelidae) is a predominant forest pest that causes substantial damage to the lumber industry and city management. However, no effective and environmentally friendly chemical method has been discovered to control this pest. Until recently, the molecular basis of the olfactory system in A. quadriimpressum was completely unknown. In this study, antennae and leg transcriptomes were analyzed and compared using deep sequencing data to identify the olfactory genes in A. quadriimpressum. Moreover, the expression profiles of both male and female candidate olfactory genes were analyzed and validated by bioinformatics, motif analysis, homology analysis, semi-quantitative RT-PCR and RT-qPCR experiments in antennal and non-olfactory organs to explore the candidate olfactory genes that might play key roles in the life cycle of A. quadriimpressum. As a result, approximately 102.9 million and 97.3 million clean reads were obtained from the libraries created from the antennas and legs, respectively. Annotation led to 34344 Unigenes, which were matched to known proteins. Annotation data revealed that the number of genes in antenna with binding functions and receptor activity was greater than that of legs. Furthermore, many pathway genes were differentially expressed in the two organs. Sixteen candidate odorant binding proteins (OBPs), 10 chemosensory proteins (CSPs), 34 odorant receptors (ORs), 20 inotropic receptors [1] and 2 sensory neuron membrane proteins (SNMPs) and their isoforms were identified. Additionally, 15 OBPs, 9 CSPs, 18 ORs, 6 IRs and 2 SNMPs were predicted to be complete ORFs. Using RT-PCR, RT-qPCR and homology analysis, AquaOBP1/2/4/7/C1/C6, AquaCSP3/9, AquaOR8/9/10/14/15/18/20/26/29/33, AquaIR8a/13/25a showed olfactory-specific expression, indicating that these genes might play a key role in olfaction-related behaviors in A. quadriimpressum such as foraging and seeking. AquaOBP4/C5, AquaOBP4/C5, AquaCSP7

  14. Identification and Comparison of Candidate Olfactory Genes in the Olfactory and Non-Olfactory Organs of Elm Pest Ambrostoma quadriimpressum (Coleoptera: Chrysomelidae Based on Transcriptome Analysis.

    Directory of Open Access Journals (Sweden)

    Yinliang Wang

    Full Text Available The leaf beetle Ambrostoma quadriimpressum (Coleoptera: Chrysomelidae is a predominant forest pest that causes substantial damage to the lumber industry and city management. However, no effective and environmentally friendly chemical method has been discovered to control this pest. Until recently, the molecular basis of the olfactory system in A. quadriimpressum was completely unknown. In this study, antennae and leg transcriptomes were analyzed and compared using deep sequencing data to identify the olfactory genes in A. quadriimpressum. Moreover, the expression profiles of both male and female candidate olfactory genes were analyzed and validated by bioinformatics, motif analysis, homology analysis, semi-quantitative RT-PCR and RT-qPCR experiments in antennal and non-olfactory organs to explore the candidate olfactory genes that might play key roles in the life cycle of A. quadriimpressum. As a result, approximately 102.9 million and 97.3 million clean reads were obtained from the libraries created from the antennas and legs, respectively. Annotation led to 34344 Unigenes, which were matched to known proteins. Annotation data revealed that the number of genes in antenna with binding functions and receptor activity was greater than that of legs. Furthermore, many pathway genes were differentially expressed in the two organs. Sixteen candidate odorant binding proteins (OBPs, 10 chemosensory proteins (CSPs, 34 odorant receptors (ORs, 20 inotropic receptors [1] and 2 sensory neuron membrane proteins (SNMPs and their isoforms were identified. Additionally, 15 OBPs, 9 CSPs, 18 ORs, 6 IRs and 2 SNMPs were predicted to be complete ORFs. Using RT-PCR, RT-qPCR and homology analysis, AquaOBP1/2/4/7/C1/C6, AquaCSP3/9, AquaOR8/9/10/14/15/18/20/26/29/33, AquaIR8a/13/25a showed olfactory-specific expression, indicating that these genes might play a key role in olfaction-related behaviors in A. quadriimpressum such as foraging and seeking. AquaOBP4/C5, Aqua

  15. Dissecting a QTL into Candidate Genes Highlighted the Key Role of Pectinesterases in Regulating the Ascorbic Acid Content in Tomato Fruit

    Directory of Open Access Journals (Sweden)

    Valentino Ruggieri

    2015-07-01

    Full Text Available Tomato ( is a crucial component of the human diet because of its high nutritional value and the antioxidant content of its fruit. As a member of the Solanaceae family, it is considered a model species for genomic studies in this family, especially since its genome has been completely sequenced. Among genomic resources available, introgression lines represent a valuable tool to mine the genetic diversity present in wild species. One introgression line, IL12-4, was previously selected for high ascorbic acid (AsA content, and a transcriptomic analysis indicated the involvement of genes controlling pectin degradation in AsA accumulation. In this study the integration of data from different “omics” platforms has been exploited to identify candidate genes that increase AsA belonging to the wild region 12-4. Thirty-two genes potentially involved in pathways controlling AsA levels were analyzed with bioinformatic tools. Two hundred-fifty nonsynonymous polymorphisms were detected in their coding regions, and 11.6% revealed deleterious effects on predicted protein function. To reduce the number of genes that had to be functionally validated, introgression sublines of the region 12–4 were selected using species-specific polymorphic markers between the two species. Four sublines were obtained and we demonstrated that a subregion of around 1 Mbp includes 12 candidate genes potentially involved in AsA accumulation. Among these, only five exhibited structural deleterious variants, and one of the 12 was differentially expressed between the two species. We have highlighted the role of three polymorphic pectinesterases and inhibitors of pectinesterases that merit further investigation.

  16. A storied-identity analysis approach to teacher candidates learning to teach in an urban setting

    Science.gov (United States)

    Ibourk, Amal

    While many studies have investigated the relationship between teachers' identity work and their developing practices, few of these identity focused studies have honed in on teacher candidates' learning to teach in an urban setting. Drawing upon narrative inquiry methodology and a "storied identity" analytic framework, I examined how the storied identities of science learning and becoming a science teacher shape teacher candidates' developing practice. In particular, I examined the stories of three interns, Becky, David, and Ashley, and I tell about their own experiences as science learners, their transitions to science teachers, and the implications this has for the identity work they did as they navigated the challenges of learning to teach in high-needs schools. Initially, each of the interns highlighted a feeling of being an outsider, and having a difficult time becoming a fully valued member of their classroom community in their storied identities of becoming a science teacher in the beginning of their internship year. While the interns named specific challenges, such as limited lab materials and different math abilities, I present how they adapted their lesson plans to address these challenges while drawing from their storied identities of science learning. My study reveals that the storied identities of becoming a science teacher informed how they framed their initial experiences teaching in an urban context. In addition, my findings reveal that the more their storied identities of science learning and becoming a science teacher overlapped, the more they leveraged their storied identity of science learning in order to implement teaching strategies that helped them make sense of the challenges that surfaced in their classroom contexts. Both Becky and Ashley leveraged their storied identities of science learning more than David did in their lesson planning and learning to teach. David's initial storied identity of becoming a science teacher revealed how he

  17. Sequence analysis of the Ras-MAPK pathway genes SOS1, EGFR & GRB2 in silver foxes (Vulpes vulpes): candidate genes for hereditary hyperplastic gingivitis.

    Science.gov (United States)

    Clark, Jo-Anna B J; Tully, Sara J; Dawn Marshall, H

    2014-12-01

    Hereditary hyperplastic gingivitis (HHG) is an autosomal recessive disease that presents with progressive gingival proliferation in farmed silver foxes. Hereditary gingival fibromatosis (HGF) is an analogous condition in humans that is genetically heterogeneous with several known autosomal dominant loci. For one locus the causative mutation is in the Son of sevenless homologue 1 (SOS1) gene. For the remaining loci, the molecular mechanisms are unknown but Ras pathway involvement is suspected. Here we compare sequences for the SOS1 gene, and two adjacent genes in the Ras pathway, growth receptor bound protein 2 (GRB2) and epidermal growth factor receptor (EGFR), between HHG-affected and unaffected foxes. We conclude that the known HGF causative mutation does not cause HHG in foxes, nor do the coding regions or intron-exon boundaries of these three genes contain any candidate mutations for fox gum disease. Patterns of molecular evolution among foxes and other mammals reflect high conservation and strong functional constraints for SOS1 and GRB2 but reveal a lineage-specific pattern of variability in EGFR consistent with mutational rate differences, relaxed functional constraints, and possibly positive selection.

  18. Exploiting proteomic data for genome annotation and gene model validation in Aspergillus niger

    OpenAIRE

    Wright, James C.; Sugden, Deana; Francis-McIntyre, Sue; Riba Garcia, Isabel; Gaskell, Simon J.; Grigoriev, Igor V.; Baker, Scott E.; Beynon, Robert J.; Hubbard, Simon J.

    2009-01-01

    Abstract Background Proteomic data is a potentially rich, but arguably unexploited, data source for genome annotation. Peptide identifications from tandem mass spectrometry provide prima facie evidence for gene predictions and can discriminate over a set of candidate gene models. Here we apply this to the recently sequenced Aspergillus niger fungal genome from the Joint Genome Institutes (JGI) and another predicted protein set from another A.niger sequence. Tandem mass spectra (MS/MS) were ac...

  19. Use of meta-analysis to combine candidate gene association studies: application to study the relationship between the ESR PvuII polymorphism and sow litter size

    Directory of Open Access Journals (Sweden)

    Alfonso Leopoldo

    2005-07-01

    Full Text Available Abstract This article investigates the application of meta-analysis on livestock candidate gene effects. The PvuII polymorphism of the ESR gene is used as an example. The association among ESR PvuII alleles with the number of piglets born alive and total born in the first (NBA1, TNB1 and later parities (NBA, TNB is reviewed by conducting a meta-analysis of 15 published studies including 9329 sows. Under a fixed effects model, litter size values were significantly lower in the "AA" genotype groups when compared with "AB" and "BB" homozygotes. Under the random effects model, the results were similar although differences between "AA" and "AB" genotype groups were not clearly significant for NBA and TNB. Nevertheless, the most noticeable result was the high and significant heterogeneity estimated among studies. This heterogeneity could be assigned to error sampling, genotype by environment interaction, linkage or epistasis, as referred to in the literature, but also to the hypothesis of population admixture/stratification. It is concluded that meta-analysis can be considered as a helpful analytical tool to synthesise and discuss livestock candidate gene effects. The main difficulty found was the insufficient information on the standard errors of the estimated genotype effects in several publications. Consequently, the convenience of publishing the standard errors or the concrete P-values instead of the test significance level should be recommended to guarantee the quality of candidate gene effect meta-analyses.

  20. Xanthine urolithiasis in a cat: a case report and evaluation of a candidate gene for xanthine dehydrogenase.

    Science.gov (United States)

    Tsuchida, Shuichi; Kagi, Akiko; Koyama, Hidekazu; Tagawa, Masahiro

    2007-12-01

    Xanthine urolithiasis was found in a 4-year-old spayed female Himalayan cat with a 10-month history of intermittent haematuria and dysuria. Ultrasonographs indicated the existence of several calculi in the bladder that were undetectable by survey radiographic examination. Four bladder stones were removed by cystotomy. The stones were spherical brownish-yellow and their surface was smooth and glossy. Quantitative mineral analysis showed a representative urolith to be composed of more than 95% xanthine. Ultrasonographic examination of the bladder 4.5 months postoperatively indicated the recurrence of urolithiasis. Analysis of purine concentration in urine and blood showed that the cat excreted excessive amounts of xanthine. In order to test the hypothesis that xanthinuria was caused by a homozygote of the inherited mutant allele of a gene responsible for deficiency of enzyme activity in purine degradation pathway, the allele composition of xanthine dehydrogenase (XDH) gene (one of the candidate genes for hereditary xanthinuria) was evaluated. The cat with xanthinuria was a heterozygote of the polymorphism. A single nucleotide polymorphism analysis of the cat XDH gene strongly indicated that the XDH gene of the patient cat was composed of two kinds of alleles and ruled out the hypothesis that the cat inherited the same recessive XDH allele suggesting no activity from a single ancestor.

  1. A family-based association study identified CYP17 as a candidate gene for obesity susceptibility in Caucasians.

    Science.gov (United States)

    Yan, H; Guo, Y; Yang, T-L; Zhao, L-J; Deng, H-W

    2012-08-06

    The cytochrome P450c17α gene (CYP17) encodes a key biosynthesis enzyme of estrogen, which is critical in regulating adipogenesis and adipocyte development in humans. We therefore hypothesized that CYP17 is a candidate gene for predicting obesity. In order to test this hypothesis, we performed a family-based association test to investigate the relationship between the CYP17 gene and obesity phenotypes in a large sample comprising 1873 subjects from 405 Caucasian nuclear families of European origin recruited by the Osteoporosis Research Center of Creighton University, USA. Both single SNPs and haplotypes were tested for associations with obesity-related phenotypes, including body mass index (BMI) and fat mass. We identified three SNPs to be significantly associated with BMI, including rs3740397, rs6163, and rs619824. We further characterized the linkage disequilibrium structure for CYP17 and found that the whole CYP17 gene was located in a single-linkage disequilibrium block. This block was observed to be significantly associated with BMI. A major haplotype in this block was significantly associated with both BMI and fat mass. In conclusion, we suggest that the CYP17 gene has an effect on obesity in the Caucasian population. Further independent studies will be needed to confirm our findings.

  2. Identification of astrocytoma associated genes including cell surface markers

    International Nuclear Information System (INIS)

    Boon, Kathy; Edwards, Jennifer B; Eberhart, Charles G; Riggins, Gregory J

    2004-01-01

    Despite intense effort the treatment options for the invasive astrocytic tumors are still limited to surgery and radiation therapy, with chemotherapy showing little or no increase in survival. The generation of Serial Analysis of Gene Expression (SAGE) profiles is expected to aid in the identification of astrocytoma-associated genes and highly expressed cell surface genes as molecular therapeutic targets. SAGE tag counts can be easily added to public expression databases and quickly disseminated to research efforts worldwide. We generated and analyzed the SAGE transcription profiles of 25 primary grade II, III and IV astrocytomas [1]. These profiles were produced as part of the Cancer Genome Anatomy Project's SAGE Genie [2], and were used in an in silico search for candidate therapeutic targets by comparing astrocytoma to normal brain transcription. Real-time PCR and immunohistochemistry were used for the validation of selected candidate target genes in 2 independent sets of primary tumors. A restricted set of tumor-associated genes was identified for each grade that included genes not previously associated with astrocytomas (e.g. VCAM1, SMOC1, and thymidylate synthetase), with a high percentage of cell surface genes. Two genes with available antibodies, Aquaporin 1 and Topoisomerase 2A, showed protein expression consistent with transcript level predictions. This survey of transcription in malignant and normal brain tissues reveals a small subset of human genes that are activated in malignant astrocytomas. In addition to providing insights into pathway biology, we have revealed and quantified expression for a significant portion of cell surface and extra-cellular astrocytoma genes

  3. A random set scoring model for prioritization of disease candidate genes using protein complexes and data-mining of GeneRIF, OMIM and PubMed records

    DEFF Research Database (Denmark)

    Jiang, Li; Edwards, Stefan M.; Thomsen, Bo

    2014-01-01

    from PubMed abstracts, OMIM, and GeneRIF records. We also investigated the validity of several vocabulary filters and different likelihood thresholds for predicted protein-protein interactions in terms of their effect on the network-based gene-prioritization approach, which relies on text-mining......Background: Prioritizing genetic variants is a challenge because disease susceptibility loci are often located in genes of unknown function or the relationship with the corresponding phenotype is unclear. A global data-mining exercise on the biomedical literature can establish the phenotypic...

  4. Gene expression profiles in prostate cancer: identification of candidate non-invasive diagnostic markers.

    Science.gov (United States)

    Mengual, L; Ars, E; Lozano, J J; Burset, M; Izquierdo, L; Ingelmo-Torres, M; Gaya, J M; Algaba, F; Villavicencio, H; Ribal, M J; Alcaraz, A

    2014-04-01

    To analyze gene expression profiles of prostate cancer (PCa) with the aim of determining the relevant differentially expressed genes and subsequently ascertain whether this differential expression is maintained in post-prostatic massage (PPM) urine samples. Forty-six tissue specimens (36 from PCa patients and 10 controls) and 158 urine PPM-urines (113 from PCa patients and 45 controls) were collected between December 2003 and May 2007. DNA microarrays were used to identify genes differentially expressed between tumour and control samples. Ten genes were technically validated in the same tissue samples by quantitative RT-PCR (RT-qPCR). Forty two selected differentially expressed genes were validated in an independent set of PPM-urines by qRT-PCR. Multidimensional scaling plot according to the expression of all the microarray genes showed a clear distinction between control and tumour samples. A total of 1047 differentially expressed genes (FDR≤.1) were indentified between both groups of samples. We found a high correlation in the comparison of microarray and RT-qPCR gene expression levels (r=.928, P<.001). Thirteen genes maintained the same fold change direction when analyzed in PPM-urine samples and in four of them (HOXC6, PCA3, PDK4 and TMPRSS2-ERG), these differences were statistically significant (P<.05). The analysis of PCa by DNA microarrays provides new putative mRNA markers for PCa diagnosis that, with caution, can be extrapolated to PPM-urines. Copyright © 2013 AEU. Published by Elsevier Espana. All rights reserved.

  5. Association mapping of starch chain length distribution and amylose content in pea (Pisum sativum L.) using carbohydrate metabolism candidate genes.

    Science.gov (United States)

    Carpenter, Margaret A; Shaw, Martin; Cooper, Rebecca D; Frew, Tonya J; Butler, Ruth C; Murray, Sarah R; Moya, Leire; Coyne, Clarice J; Timmerman-Vaughan, Gail M

    2017-08-01

    Although starch consists of large macromolecules composed of glucose units linked by α-1,4-glycosidic linkages with α-1,6-glycosidic branchpoints, variation in starch structural and functional properties is found both within and between species. Interest in starch genetics is based on the importance of starch in food and industrial processes, with the potential of genetics to provide novel starches. The starch metabolic pathway is complex but has been characterized in diverse plant species, including pea. To understand how allelic variation in the pea starch metabolic pathway affects starch structure and percent amylose, partial sequences of 25 candidate genes were characterized for polymorphisms using a panel of 92 diverse pea lines. Variation in the percent amylose composition of extracted seed starch and (amylopectin) chain length distribution, one measure of starch structure, were characterized for these lines. Association mapping was undertaken to identify polymorphisms associated with the variation in starch chain length distribution and percent amylose, using a mixed linear model that incorporated population structure and kinship. Associations were found for polymorphisms in seven candidate genes plus Mendel's r locus (which conditions the round versus wrinkled seed phenotype). The genes with associated polymorphisms are involved in the substrate supply, chain elongation and branching stages of the pea carbohydrate and starch metabolic pathways. The association of polymorphisms in carbohydrate and starch metabolic genes with variation in amylopectin chain length distribution and percent amylose may help to guide manipulation of pea seed starch structural and functional properties through plant breeding.

  6. Mapping of five candidate sex-determining loci in rainbow trout (Oncorhynchus mykiss

    Directory of Open Access Journals (Sweden)

    Drew Robert E

    2009-01-01

    Full Text Available Abstract Background Rainbow trout have an XX/XY genetic mechanism of sex determination where males are the heterogametic sex. The homology of the sex-determining gene (SDG in medaka to Dmrt1 suggested that SDGs evolve from downstream genes by gene duplication. Orthologous sequences of the major genes of the mammalian sex determination pathway have been reported in the rainbow trout but the map position for the majority of these genes has not been assigned. Results Five loci of four candidate genes (Amh, Dax1, Dmrt1 and Sox6 were tested for linkage to the Y chromosome of rainbow trout. We exclude the role of all these loci as candidates for the primary SDG in this species. Sox6i and Sox6ii, duplicated copies of Sox6, mapped to homeologous linkage groups 10 and 18 respectively. Genotyping fishes of the OSU × Arlee mapping family for Sox6i and Sox6ii alleles indicated that Sox6i locus might be deleted in the Arlee lineage. Conclusion Additional candidate genes should be tested for their linkage to the Y chromosome. Mapping data of duplicated Sox6 loci supports previously suggested homeology between linkage groups 10 and 18. Enrichment of the rainbow trout genomic map with known gene markers allows map comparisons with other salmonids. Mapping of candidate sex-determining loci is important for analyses of potential autosomal modifiers of sex-determination in rainbow trout.

  7. Combined analysis of DNA methylome and transcriptome reveal novel candidate genes with susceptibility to bovine Staphylococcus aureus subclinical mastitis.

    Science.gov (United States)

    Song, Minyan; He, Yanghua; Zhou, Huangkai; Zhang, Yi; Li, Xizhi; Yu, Ying

    2016-07-14

    Subclinical mastitis is a widely spread disease of lactating cows. Its major pathogen is Staphylococcus aureus (S. aureus). In this study, we performed genome-wide integrative analysis of DNA methylation and transcriptional expression to identify candidate genes and pathways relevant to bovine S. aureus subclinical mastitis. The genome-scale DNA methylation profiles of peripheral blood lymphocytes in cows with S. aureus subclinical mastitis (SA group) and healthy controls (CK) were generated by methylated DNA immunoprecipitation combined with microarrays. We identified 1078 differentially methylated genes in SA cows compared with the controls. By integrating DNA methylation and transcriptome data, 58 differentially methylated genes were shared with differently expressed genes, in which 20.7% distinctly hypermethylated genes showed down-regulated expression in SA versus CK, whereas 14.3% dramatically hypomethylated genes showed up-regulated expression. Integrated pathway analysis suggested that these genes were related to inflammation, ErbB signalling pathway and mismatch repair. Further functional analysis revealed that three genes, NRG1, MST1 and NAT9, were strongly correlated with the progression of S. aureus subclinical mastitis and could be used as powerful biomarkers for the improvement of bovine mastitis resistance. Our studies lay the groundwork for epigenetic modification and mechanistic studies on susceptibility of bovine mastitis.

  8. FBLN4 as candidate gene associated with long-term and short-term survival with primary glioblastoma

    Directory of Open Access Journals (Sweden)

    Li F

    2017-01-01

    Full Text Available Fubin Li,1,* Yiping Li,1,* Kewei Zhang,1,* Ye Li,1,* Ping He,1,* Yujia Liu,1,* Hongyan Yuan,2,* Honghua Lu,1,* Jinxiang Liu,1,* Songtian Che,3,* Zhenju Li,4,* Li Bie1,5 1Department of Neurosurgery of the First Clinical Hospital, 2Department of Immunology, Norman Bethune College of Medicine, 3Department of Neurosurgery of the Second Clinical Hospital, 4Department of Neurosurgery of the Fourth Clinical Hospital, Jilin University, Changchun, People’s Republic of China; 5Department of Pathology and Laboratory Medicine, School of Medicine, University of California – Irvine, Irvine, CA, USA *These authors contributed equally to this work Background: Glioblastoma multiforme (GBM is the most common malignant and lethal type of primary central nervous system tumor in humans. In spite of its high lethality, a small percentage of patients have a relatively good prognosis, with median survival times of 36 months or longer. The identification of clinical subsets of GBM associated with distinct molecular genetic profiles has made it possible to design therapies tailored to treat individual patients. Methods: We compared microarray data sets from long-term survivors (LTSs and short-term survivors (STSs to screen for prognostic biomarkers in GBM patients using the WebArrayDB platform. We focused on FBLN4, IGFBP-2, and CHI3L1, all members of a group of 10 of the most promising, differentially regulated gene candidates. Using formalin-fixed paraffin-embedded GBM samples, we corroborated the relationship between these genes and patient outcomes using methylation-specific polymerase chain reaction (PCR for MGMT methylation status and quantitative reverse transcription PCR for expression of these genes. Results: Expression levels of the mRNAs of these 3 genes were higher in the GBM samples than in normal brain samples and these 3 genes were significantly upregulated in STSs compared to the levels in LTS samples (P<0.01. Furthermore, Kaplan–Meier analysis

  9. RNA-Seq reveals seven promising candidate genes affecting the proportion of thick egg albumen in layer-type chickens.

    Science.gov (United States)

    Wan, Yi; Jin, Sihua; Ma, Chendong; Wang, Zhicheng; Fang, Qi; Jiang, Runshen

    2017-12-22

    Eggs with a much higher proportion of thick albumen are preferred in the layer industry, as they are favoured by consumers. However, the genetic factors affecting the thick egg albumen trait have not been elucidated. Using RNA sequencing, we explored the magnum transcriptome in 9 Rhode Island white layers: four layers with phenotypes of extremely high ratios of thick to thin albumen (high thick albumen, HTA) and five with extremely low ratios (low thick albumen, LTA). A total of 220 genes were differentially expressed, among which 150 genes were up-regulated and 70 were down-regulated in the HTA group compared with the LTA group. Gene Ontology (GO) analysis revealed that the up-regulated genes in HTA were mainly involved in a wide range of regulatory functions. In addition, a large number of these genes were related to glycosphingolipid biosynthesis, focal adhesion, ECM-receptor interactions and cytokine-cytokine receptor interactions. Based on functional analysis, ST3GAL4, FUT4, ITGA2, SDC3, PRLR, CDH4 and GALNT9 were identified as promising candidate genes for thick albumen synthesis and metabolism during egg formation. These results provide new insights into the molecular mechanisms of egg albumen traits and may contribute to future breeding strategies that optimise the proportion of thick egg albumen.

  10. A literature search tool for intelligent extraction of disease-associated genes.

    Science.gov (United States)

    Jung, Jae-Yoon; DeLuca, Todd F; Nelson, Tristan H; Wall, Dennis P

    2014-01-01

    To extract disorder-associated genes from the scientific literature in PubMed with greater sensitivity for literature-based support than existing methods. We developed a PubMed query to retrieve disorder-related, original research articles. Then we applied a rule-based text-mining algorithm with keyword matching to extract target disorders, genes with significant results, and the type of study described by the article. We compared our resulting candidate disorder genes and supporting references with existing databases. We demonstrated that our candidate gene set covers nearly all genes in manually curated databases, and that the references supporting the disorder-gene link are more extensive and accurate than other general purpose gene-to-disorder association databases. We implemented a novel publication search tool to find target articles, specifically focused on links between disorders and genotypes. Through comparison against gold-standard manually updated gene-disorder databases and comparison with automated databases of similar functionality we show that our tool can search through the entirety of PubMed to extract the main gene findings for human diseases rapidly and accurately.

  11. Using OWL reasoning to support the generation of novel gene sets for enrichment analysis.

    Science.gov (United States)

    Osumi-Sutherland, David J; Ponta, Enrico; Courtot, Melanie; Parkinson, Helen; Badi, Laura

    2018-02-14

    The Gene Ontology (GO) consists of over 40,000 terms for biological processes, cell components and gene product activities linked into a graph structure by over 90,000 relationships. It has been used to annotate the functions and cellular locations of several million gene products. The graph structure is used by a variety of tools to group annotated genes into sets whose products share function or location. These gene sets are widely used to interpret the results of genomics experiments by assessing which sets are significantly over- or under-represented in results lists. F Hoffmann-La Roche Ltd. has developed a bespoke, manually maintained controlled vocabulary (RCV) for use in over-representation analysis. Many terms in this vocabulary group GO terms in novel ways that cannot easily be derived using the graph structure of the GO. For example, some RCV terms group GO terms by the cell, chemical or tissue type they refer to. Recent improvements in the content and formal structure of the GO make it possible to use logical queries in Web Ontology Language (OWL) to automatically map these cross-cutting classifications to sets of GO terms. We used this approach to automate mapping between RCV and GO, largely replacing the increasingly unsustainable manual mapping process. We then tested the utility of the resulting groupings for over-representation analysis. We successfully mapped 85% of RCV terms to logical OWL definitions and showed that these could be used to recapitulate and extend manual mappings between RCV terms and the sets of GO terms subsumed by them. We also show that gene sets derived from the resulting GO terms sets can be used to detect the signatures of cell and tissue types in whole genome expression data. The rich formal structure of the GO makes it possible to use reasoning to dynamically generate novel, biologically relevant groupings of GO terms. GO term groupings generated with this approach can be used in. over-representation analysis to detect

  12. A random set scoring model for prioritization of disease candidate genes using protein complexes and data-mining of GeneRIF, OMIM and PubMed records

    DEFF Research Database (Denmark)

    Jiang, Li; Edwards, Stefan M.; Thomsen, Bo

    2014-01-01

    from PubMed abstracts, OMIM, and GeneRIF records. We also investigated the validity of several vocabulary filters and different likelihood thresholds for predicted protein-protein interactions in terms of their effect on the network-based gene-prioritization approach, which relies on text...

  13. Candidate Genes for Testicular Cancer Evaluated by In Situ Protein Expression Analyses on Tissue Microarrays

    Directory of Open Access Journals (Sweden)

    Rolf I. Skotheim

    2003-09-01

    Full Text Available By the use of high-throughput molecular technologies, the number of genes and proteins potentially relevant to testicular germ cell tumor (TGCT and other diseases will increase rapidly. In a recent transcriptional profiling, we demonstrated the overexpression of GRB7 and JUP in TGCTs, confirmed the reported overexpression of CCND2. We also have recent evidences for frequent genetic alterations of FHIT and epigenetic alterations of MGMT. To evaluate whether the expression of these genes is related to any clinicopathological variables, we constructed a tissue microarray with 510 testicular tissue cores from 279 patients diagnosed with TGCT, covering various histological subgroups and clinical stages. By immunohistochemistry, we found that JUP, GRB7, CCND2 proteins were rarely present in normal testis, but frequently expressed at high levels in TGCT. Additionally, all premalignant intratubular germ cell neoplasias were JUP-immunopositive. MGMT and FHIT were expressed by normal testicular tissues, but at significantly lower frequencies in TGCT. Except for CCND2, the expressions of all markers were significantly associated with various TGCT subtypes. In summary, we have developed a high-throughput tool for the evaluation of TGCT markers, utilized this to validate five candidate genes whose protein expressions were indeed deregulated in TGCT.

  14. Identification and validation of reference genes for quantitative RT-PCR normalization in wheat

    Directory of Open Access Journals (Sweden)

    Porceddu Enrico

    2009-02-01

    Full Text Available Abstract Background Usually the reference genes used in gene expression analysis have been chosen for their known or suspected housekeeping roles, however the variation observed in most of them hinders their effective use. The assessed lack of validated reference genes emphasizes the importance of a systematic study for their identification. For selecting candidate reference genes we have developed a simple in silico method based on the data publicly available in the wheat databases Unigene and TIGR. Results The expression stability of 32 genes was assessed by qRT-PCR using a set of cDNAs from 24 different plant samples, which included different tissues, developmental stages and temperature stresses. The selected sequences included 12 well-known HKGs representing different functional classes and 20 genes novel with reference to the normalization issue. The expression stability of the 32 candidate genes was tested by the computer programs geNorm and NormFinder using five different data-sets. Some discrepancies were detected in the ranking of the candidate reference genes, but there was substantial agreement between the groups of genes with the most and least stable expression. Three new identified reference genes appear more effective than the well-known and frequently used HKGs to normalize gene expression in wheat. Finally, the expression study of a gene encoding a PDI-like protein showed that its correct evaluation relies on the adoption of suitable normalization genes and can be negatively affected by the use of traditional HKGs with unstable expression, such as actin and α-tubulin. Conclusion The present research represents the first wide screening aimed to the identification of reference genes and of the corresponding primer pairs specifically designed for gene expression studies in wheat, in particular for qRT-PCR analyses. Several of the new identified reference genes outperformed the traditional HKGs in terms of expression stability

  15. Genome-wide scans for delineation of candidate genes regulating seed-protein content in chickpea

    Directory of Open Access Journals (Sweden)

    Hari Deo eUpadhyaya

    2016-03-01

    Full Text Available Identification of potential genes/alleles governing complex seed-protein content (SPC trait is essential in marker-assisted breeding for quality trait improvement of chickpea. Henceforth, the present study utilized an integrated genomics-assisted breeding strategy encompassing trait association analysis, selective genotyping in traditional bi-parental mapping population and differential expression profiling for the first-time to understand the complex genetic architecture of quantitative SPC trait in chickpea. For GWAS (genome-wide association study, high-throughput genotyping information of 16376 genome-based SNPs (single nucleotide polymorphism discovered from a structured population of 336 sequenced desi and kabuli accessions [with 150-200 kb LD (linkage disequilibrium decay] was utilized. This led to identification of seven most effective genomic loci (genes associated [10 to 20% with 41% combined PVE (phenotypic variation explained] with SPC trait in chickpea. Regardless of the diverse desi and kabuli genetic backgrounds, a comparable level of association potential of the identified seven genomic loci with SPC trait was observed. Five SPC-associated genes were validated successfully in parental accessions and homozygous individuals of an intra-specific desi RIL (recombinant inbred line mapping population (ICC 12299 x ICC 4958 by selective genotyping. The seed-specific expression, including differential up-regulation (> 4-fold of six SPC-associated genes particularly in accessions, parents and homozygous individuals of the aforementioned mapping population with high level of contrasting seed-protein content (21-22% was evident. Collectively, the integrated genomic approach delineated diverse naturally occurring novel functional SNP allelic variants in six potential candidate genes regulating SPC trait in chickpea. Of these, a non-synonymous SNP allele-carrying zinc finger transcription factor gene exhibiting strong association with SPC trait

  16. Significant linkage to chromosome 12q24.32-q24.33 and identification of SFRS8 as a possible asthma susceptibility gene

    DEFF Research Database (Denmark)

    brasch-andersen, c; Tan, Q; Børglum, A D

    2006-01-01

    -wide scan in one set of families followed by (2) fine scale mapping in an independent set of families in candidate regions with a maximum likelihood score (MLS) of > or =1.5 in the genome-wide scan. Polymorphisms in a candidate gene in the region on 12q24.33 were tested for association with asthma...... 12q, and suggests a candidate region distal to most previously reported regions. Three single nucleotide polymorphisms in splicing factor, arginine/serine-rich 8 (SFRS8) had an association with asthma (p ..., a protein which, through alternative splice variants, has an essential role in activating T cells. T cells are involved in the pathogenesis of atopic diseases such as asthma, so SFRS8 is a very interesting candidate gene in the region. CONCLUSIONS: Linkage and simulation studies show that the very distal...

  17. Identification of candidate genes involved in Witches' broom disease resistance in a segregating mapping population of Theobroma cacao L. in Brazil.

    Science.gov (United States)

    Royaert, Stefan; Jansen, Johannes; da Silva, Daniela Viana; de Jesus Branco, Samuel Martins; Livingstone, Donald S; Mustiga, Guiliana; Marelli, Jean-Philippe; Araújo, Ioná Santos; Corrêa, Ronan Xavier; Motamayor, Juan Carlos

    2016-02-11

    Witches' broom disease (WBD) caused by the fungus Moniliophthora perniciosa is responsible for considerable economic losses for cacao producers. One of the ways to combat WBD is to plant resistant cultivars. Resistance may be governed by a few genetic factors, mainly found in wild germplasm. We developed a dense genetic linkage map with a length of 852.8 cM that contains 3,526 SNPs and is based on the MP01 mapping population, which counts 459 trees from a cross between the resistant 'TSH 1188' and the tolerant 'CCN 51' at the Mars Center for Cocoa Science in Barro Preto, Bahia, Brazil. Seven quantitative trait loci (QTL) that are associated with WBD were identified on five different chromosomes using a multi-trait QTL analysis for outbreeders. Phasing of the haplotypes at the major QTL region on chromosome IX on a diversity panel of genotypes clearly indicates that the major resistance locus comes from a well-known source of WBD resistance, the clone 'SCAVINA 6'. Various potential candidate genes identified within all QTL may be involved in different steps leading to disease resistance. Preliminary expression data indicate that at least three of these candidate genes may play a role during the first 12 h after infection, with clear differences between 'CCN 51' and 'TSH 1188'. We combined the information from a large mapping population with very distinct parents that segregate for WBD, a dense set of mapped markers, rigorous phenotyping capabilities and the availability of a sequenced genome to identify several genomic regions that are involved in WBD resistance. We also identified a novel source of resistance that most likely comes from the 'CCN 51' parent. Thanks to the large population size of the MP01 population, we were able to pick up QTL and markers with relatively small effects that can contribute to the creation and selection of more tolerant/resistant plant material.

  18. Integrated Metabolo-Transcriptomics Reveals Fusarium Head Blight Candidate Resistance Genes in Wheat QTL-Fhb2.

    Directory of Open Access Journals (Sweden)

    Dhananjay Dhokane

    Full Text Available Fusarium head blight (FHB caused by Fusarium graminearum not only causes severe losses in yield, but also reduces quality of wheat grain by accumulating mycotoxins. Breeding for host plant resistance is considered as the best strategy to manage FHB. Resistance in wheat to FHB is quantitative in nature, involving cumulative effects of many genes governing resistance. The poor understanding of genetics and lack of precise phenotyping has hindered the development of FHB resistant cultivars. Though more than 100 QTLs imparting FHB resistance have been reported, none discovered the specific genes localized within the QTL region, nor the underlying mechanisms of resistance.In our study recombinant inbred lines (RILs carrying resistant (R-RIL and susceptible (S-RIL alleles of QTL-Fhb2 were subjected to metabolome and transcriptome profiling to discover the candidate genes. Metabolome profiling detected a higher abundance of metabolites belonging to phenylpropanoid, lignin, glycerophospholipid, flavonoid, fatty acid, and terpenoid biosynthetic pathways in R-RIL than in S-RIL. Transcriptome analysis revealed up-regulation of several receptor kinases, transcription factors, signaling, mycotoxin detoxification and resistance related genes. The dissection of QTL-Fhb2 using flanking marker sequences, integrating metabolomic and transcriptomic datasets, identified 4-Coumarate: CoA ligase (4CL, callose synthase (CS, basic Helix Loop Helix (bHLH041 transcription factor, glutathione S-transferase (GST, ABC transporter-4 (ABC4 and cinnamyl alcohol dehydrogenase (CAD as putative resistance genes localized within the QTL-Fhb2 region.Some of the identified genes within the QTL region are associated with structural resistance through cell wall reinforcement, reducing the spread of pathogen through rachis within a spike and few other genes that detoxify DON, the virulence factor, thus eventually reducing disease severity. In conclusion, we report that the wheat

  19. Detecting Horizontal Gene Transfer between Closely Related Taxa.

    Directory of Open Access Journals (Sweden)

    Orit Adato

    2015-10-01

    Full Text Available Horizontal gene transfer (HGT, the transfer of genetic material between organisms, is crucial for genetic innovation and the evolution of genome architecture. Existing HGT detection algorithms rely on a strong phylogenetic signal distinguishing the transferred sequence from ancestral (vertically derived genes in its recipient genome. Detecting HGT between closely related species or strains is challenging, as the phylogenetic signal is usually weak and the nucleotide composition is normally nearly identical. Nevertheless, there is a great importance in detecting HGT between congeneric species or strains, especially in clinical microbiology, where understanding the emergence of new virulent and drug-resistant strains is crucial, and often time-sensitive. We developed a novel, self-contained technique named Near HGT, based on the synteny index, to measure the divergence of a gene from its native genomic environment and used it to identify candidate HGT events between closely related strains. The method confirms candidate transferred genes based on the constant relative mutability (CRM. Using CRM, the algorithm assigns a confidence score based on "unusual" sequence divergence. A gene exhibiting exceptional deviations according to both synteny and mutability criteria, is considered a validated HGT product. We first employed the technique to a set of three E. coli strains and detected several highly probable horizontally acquired genes. We then compared the method to existing HGT detection tools using a larger strain data set. When combined with additional approaches our new algorithm provides richer picture and brings us closer to the goal of detecting all newly acquired genes in a particular strain.

  20. Sporulation genes associated with sporulation efficiency in natural isolates of yeast.

    Science.gov (United States)

    Tomar, Parul; Bhatia, Aatish; Ramdas, Shweta; Diao, Liyang; Bhanot, Gyan; Sinha, Himanshu

    2013-01-01

    Yeast sporulation efficiency is a quantitative trait and is known to vary among experimental populations and natural isolates. Some studies have uncovered the genetic basis of this variation and have identified the role of sporulation genes (IME1, RME1) and sporulation-associated genes (FKH2, PMS1, RAS2, RSF1, SWS2), as well as non-sporulation pathway genes (MKT1, TAO3) in maintaining this variation. However, these studies have been done mostly in experimental populations. Sporulation is a response to nutrient deprivation. Unlike laboratory strains, natural isolates have likely undergone multiple selections for quick adaptation to varying nutrient conditions. As a result, sporulation efficiency in natural isolates may have different genetic factors contributing to phenotypic variation. Using Saccharomyces cerevisiae strains in the genetically and environmentally diverse SGRP collection, we have identified genetic loci associated with sporulation efficiency variation in a set of sporulation and sporulation-associated genes. Using two independent methods for association mapping and correcting for population structure biases, our analysis identified two linked clusters containing 4 non-synonymous mutations in genes - HOS4, MCK1, SET3, and SPO74. Five regulatory polymorphisms in five genes such as MLS1 and CDC10 were also identified as putative candidates. Our results provide candidate genes contributing to phenotypic variation in the sporulation efficiency of natural isolates of yeast.