WorldWideScience

Sample records for candidate gene microarray

  1. Using microarrays to identify positional candidate genes for QTL: the case study of ACTH response in pigs.

    Science.gov (United States)

    Jouffe, Vincent; Rowe, Suzanne; Liaubet, Laurence; Buitenhuis, Bart; Hornshøj, Henrik; SanCristobal, Magali; Mormède, Pierre; de Koning, D J

    2009-07-16

    Microarray studies can supplement QTL studies by suggesting potential candidate genes in the QTL regions, which by themselves are too large to provide a limited selection of candidate genes. Here we provide a case study where we explore ways to integrate QTL data and microarray data for the pig, which has only a partial genome sequence. We outline various procedures to localize differentially expressed genes on the pig genome and link this with information on published QTL. The starting point is a set of 237 differentially expressed cDNA clones in adrenal tissue from two pig breeds, before and after treatment with adrenocorticotropic hormone (ACTH). Different approaches to localize the differentially expressed (DE) genes to the pig genome showed different levels of success and a clear lack of concordance for some genes between the various approaches. For a focused analysis on 12 genes, overlapping QTL from the public domain were presented. Also, differentially expressed genes underlying QTL for ACTH response were described. Using the latest version of the draft sequence, the differentially expressed genes were mapped to the pig genome. This enabled co-location of DE genes and previously studied QTL regions, but the draft genome sequence is still incomplete and will contain many errors. A further step to explore links between DE genes and QTL at the pathway level was largely unsuccessful due to the lack of annotation of the pig genome. This could be improved by further comparative mapping analyses but this would be time consuming. This paper provides a case study for the integration of QTL data and microarray data for a species with limited genome sequence information and annotation. The results illustrate the challenges that must be addressed but also provide a roadmap for future work that is applicable to other non-model species.

  2. Transcriptomic identification of candidate genes involved in sunflower responses to chilling and salt stresses based on cDNA microarray analysis

    Directory of Open Access Journals (Sweden)

    Paniego Norma

    2008-01-01

    Full Text Available Abstract Background Considering that sunflower production is expanding to arid regions, tolerance to abiotic stresses as drought, low temperatures and salinity arises as one of the main constrains nowadays. Differential organ-specific sunflower ESTs (expressed sequence tags were previously generated by a subtractive hybridization method that included a considerable number of putative abiotic stress associated sequences. The objective of this work is to analyze concerted gene expression profiles of organ-specific ESTs by fluorescence microarray assay, in response to high sodium chloride concentration and chilling treatments with the aim to identify and follow up candidate genes for early responses to abiotic stress in sunflower. Results Abiotic-related expressed genes were the target of this characterization through a gene expression analysis using an organ-specific cDNA fluorescence microarray approach in response to high salinity and low temperatures. The experiment included three independent replicates from leaf samples. We analyzed 317 unigenes previously isolated from differential organ-specific cDNA libraries from leaf, stem and flower at R1 and R4 developmental stage. A statistical analysis based on mean comparison by ANOVA and ordination by Principal Component Analysis allowed the detection of 80 candidate genes for either salinity and/or chilling stresses. Out of them, 50 genes were up or down regulated under both stresses, supporting common regulatory mechanisms and general responses to chilling and salinity. Interestingly 15 and 12 sequences were up regulated or down regulated specifically in one stress but not in the other, respectively. These genes are potentially involved in different regulatory mechanisms including transcription/translation/protein degradation/protein folding/ROS production or ROS-scavenging. Differential gene expression patterns were confirmed by qRT-PCR for 12.5% of the microarray candidate sequences. Conclusion

  3. Serious limitations of the QTL/Microarray approach for QTL gene discovery

    Directory of Open Access Journals (Sweden)

    Warden Craig H

    2010-07-01

    Full Text Available Abstract Background It has been proposed that the use of gene expression microarrays in nonrecombinant parental or congenic strains can accelerate the process of isolating individual genes underlying quantitative trait loci (QTL. However, the effectiveness of this approach has not been assessed. Results Thirty-seven studies that have implemented the QTL/microarray approach in rodents were reviewed. About 30% of studies showed enrichment for QTL candidates, mostly in comparisons between congenic and background strains. Three studies led to the identification of an underlying QTL gene. To complement the literature results, a microarray experiment was performed using three mouse congenic strains isolating the effects of at least 25 biometric QTL. Results show that genes in the congenic donor regions were preferentially selected. However, within donor regions, the distribution of differentially expressed genes was homogeneous once gene density was accounted for. Genes within identical-by-descent (IBD regions were less likely to be differentially expressed in chromosome 2, but not in chromosomes 11 and 17. Furthermore, expression of QTL regulated in cis (cis eQTL showed higher expression in the background genotype, which was partially explained by the presence of single nucleotide polymorphisms (SNP. Conclusions The literature shows limited successes from the QTL/microarray approach to identify QTL genes. Our own results from microarray profiling of three congenic strains revealed a strong tendency to select cis-eQTL over trans-eQTL. IBD regions had little effect on rate of differential expression, and we provide several reasons why IBD should not be used to discard eQTL candidates. In addition, mismatch probes produced false cis-eQTL that could not be completely removed with the current strains genotypes and low probe density microarrays. The reviewed studies did not account for lack of coverage from the platforms used and therefore removed genes

  4. How does exposure to nickel and cadmium affect the transcriptome of yellow perch (Perca flavescens) – Results from a 1000 candidate-gene microarray

    International Nuclear Information System (INIS)

    Bougas, Bérénice; Normandeau, Eric; Pierron, Fabien; Campbell, Peter G.C.; Bernatchez, Louis; Couture, Patrice

    2013-01-01

    Highlights: •The transcriptional responses of Perca flavescens to both metal and non metal stressors were measured with a 1000 candidate-gene microarray. •475, 287 and 176 genes were differentially transcribed depending on temperature, Ni and Cd concentrations, respectively. •Genes involved in iron metabolism, transcriptional and translational processes, vitamin metabolism, blood coagulation, and calcium transport were impacted by metals. •The developed microarray contributes to a better characterization of the impact of different stressors on the transcriptome. -- Abstract: The molecular mechanisms underlying nickel (Ni) and cadmium (Cd) toxicity and their specific effects on fish are poorly understood. Documenting gene transcription profiles offers a powerful approach toward identifying the molecular mechanisms affected by these metals and to discover biomarkers of their toxicity. However, confounding environmental factors can complicate the interpretation of the results and the detection of biomarkers for fish captured in their natural environment. In the present study, a 1000 candidate-gene microarray, developed from a previous RNA-seq study on a subset of individual fish from contrasting level of metal contamination, was used to investigate the transcriptional response to metal (Ni and Cd) and non metal (temperature, oxygen, and diet) stressors in yellow perch (Perca flavescens). Specifically, we aimed at (1) identifying transcriptional signatures specific to Ni and Cd exposure, (2) investigating the mechanisms of their toxicity, and (3) developing a predictive tool to identify the sublethal effects of Ni and Cd contaminants in fish sampled from natural environments. A total of 475 genes displayed significantly different transcription levels when temperature varied while 287 and 176 genes were differentially transcribed at different concentrations of Ni and Cd, respectively. These metals were found to mainly affect the transcription level of genes

  5. How does exposure to nickel and cadmium affect the transcriptome of yellow perch (Perca flavescens) – Results from a 1000 candidate-gene microarray

    Energy Technology Data Exchange (ETDEWEB)

    Bougas, Bérénice, E-mail: Berenice.Bougas@ete.inrs.ca [Institut National de la Recherche Scientifique, Centre INRS Eau Terre et Environnement, 490, rue de la Couronne, Québec, Québec G1K 9A9 (Canada); Département de biologie, Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Québec G1V 0A6 (Canada); Normandeau, Eric [Département de biologie, Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Québec G1V 0A6 (Canada); Pierron, Fabien [Université de Bordeaux, EPOC, UMR 5805, F-33400 Talence (France); CNRS, EPOC, UMR 5805, F-33400 Talence (France); Campbell, Peter G.C. [Institut National de la Recherche Scientifique, Centre INRS Eau Terre et Environnement, 490, rue de la Couronne, Québec, Québec G1K 9A9 (Canada); Bernatchez, Louis [Département de biologie, Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Québec G1V 0A6 (Canada); Couture, Patrice [Institut National de la Recherche Scientifique, Centre INRS Eau Terre et Environnement, 490, rue de la Couronne, Québec, Québec G1K 9A9 (Canada)

    2013-10-15

    Highlights: •The transcriptional responses of Perca flavescens to both metal and non metal stressors were measured with a 1000 candidate-gene microarray. •475, 287 and 176 genes were differentially transcribed depending on temperature, Ni and Cd concentrations, respectively. •Genes involved in iron metabolism, transcriptional and translational processes, vitamin metabolism, blood coagulation, and calcium transport were impacted by metals. •The developed microarray contributes to a better characterization of the impact of different stressors on the transcriptome. -- Abstract: The molecular mechanisms underlying nickel (Ni) and cadmium (Cd) toxicity and their specific effects on fish are poorly understood. Documenting gene transcription profiles offers a powerful approach toward identifying the molecular mechanisms affected by these metals and to discover biomarkers of their toxicity. However, confounding environmental factors can complicate the interpretation of the results and the detection of biomarkers for fish captured in their natural environment. In the present study, a 1000 candidate-gene microarray, developed from a previous RNA-seq study on a subset of individual fish from contrasting level of metal contamination, was used to investigate the transcriptional response to metal (Ni and Cd) and non metal (temperature, oxygen, and diet) stressors in yellow perch (Perca flavescens). Specifically, we aimed at (1) identifying transcriptional signatures specific to Ni and Cd exposure, (2) investigating the mechanisms of their toxicity, and (3) developing a predictive tool to identify the sublethal effects of Ni and Cd contaminants in fish sampled from natural environments. A total of 475 genes displayed significantly different transcription levels when temperature varied while 287 and 176 genes were differentially transcribed at different concentrations of Ni and Cd, respectively. These metals were found to mainly affect the transcription level of genes

  6. Identification of novel candidate target genes in amplicons of Glioblastoma multiforme tumors detected by expression and CGH microarray profiling

    Directory of Open Access Journals (Sweden)

    Hernández-Moneo Jose-Luis

    2006-09-01

    Full Text Available Abstract Background Conventional cytogenetic and comparative genomic hybridization (CGH studies in brain malignancies have shown that glioblastoma multiforme (GBM is characterized by complex structural and numerical alterations. However, the limited resolution of these techniques has precluded the precise identification of detailed specific gene copy number alterations. Results We performed a genome-wide survey of gene copy number changes in 20 primary GBMs by CGH on cDNA microarrays. A novel amplicon at 4p15, and previously uncharacterized amplicons at 13q32-34 and 1q32 were detected and are analyzed here. These amplicons contained amplified genes not previously reported. Other amplified regions containg well-known oncogenes in GBMs were also detected at 7p12 (EGFR, 7q21 (CDK6, 4q12 (PDGFRA, and 12q13-15 (MDM2 and CDK4. In order to identify the putative target genes of the amplifications, and to determine the changes in gene expression levels associated with copy number change events, we carried out parallel gene expression profiling analyses using the same cDNA microarrays. We detected overexpression of the novel amplified genes SLA/LP and STIM2 (4p15, and TNFSF13B and COL4A2 (13q32-34. Some of the candidate target genes of amplification (EGFR, CDK6, MDM2, CDK4, and TNFSF13B were tested in an independent set of 111 primary GBMs by using FISH and immunohistological assays. The novel candidate 13q-amplification target TNFSF13B was amplified in 8% of the tumors, and showed protein expression in 20% of the GBMs. Conclusion This high-resolution analysis allowed us to propose novel candidate target genes such as STIM2 at 4p15, and TNFSF13B or COL4A2 at 13q32-34 that could potentially contribute to the pathogenesis of these tumors and which would require futher investigations. We showed that overexpression of the amplified genes could be attributable to gene dosage and speculate that deregulation of those genes could be important in the development

  7. FiGS: a filter-based gene selection workbench for microarray data

    Directory of Open Access Journals (Sweden)

    Yun Taegyun

    2010-01-01

    Full Text Available Abstract Background The selection of genes that discriminate disease classes from microarray data is widely used for the identification of diagnostic biomarkers. Although various gene selection methods are currently available and some of them have shown excellent performance, no single method can retain the best performance for all types of microarray datasets. It is desirable to use a comparative approach to find the best gene selection result after rigorous test of different methodological strategies for a given microarray dataset. Results FiGS is a web-based workbench that automatically compares various gene selection procedures and provides the optimal gene selection result for an input microarray dataset. FiGS builds up diverse gene selection procedures by aligning different feature selection techniques and classifiers. In addition to the highly reputed techniques, FiGS diversifies the gene selection procedures by incorporating gene clustering options in the feature selection step and different data pre-processing options in classifier training step. All candidate gene selection procedures are evaluated by the .632+ bootstrap errors and listed with their classification accuracies and selected gene sets. FiGS runs on parallelized computing nodes that capacitate heavy computations. FiGS is freely accessible at http://gexp.kaist.ac.kr/figs. Conclusion FiGS is an web-based application that automates an extensive search for the optimized gene selection analysis for a microarray dataset in a parallel computing environment. FiGS will provide both an efficient and comprehensive means of acquiring optimal gene sets that discriminate disease states from microarray datasets.

  8. Uropathogenic Escherichia coli virulence genes: invaluable approaches for designing DNA microarray probes.

    Science.gov (United States)

    Jahandeh, Nadia; Ranjbar, Reza; Behzadi, Payam; Behzadi, Elham

    2015-01-01

    The pathotypes of uropathogenic Escherichia coli (UPEC) cause different types of urinary tract infections (UTIs). The presence of a wide range of virulence genes in UPEC enables us to design appropriate DNA microarray probes. These probes, which are used in DNA microarray technology, provide us with an accurate and rapid diagnosis and definitive treatment in association with UTIs caused by UPEC pathotypes. The main goal of this article is to introduce the UPEC virulence genes as invaluable approaches for designing DNA microarray probes. Main search engines such as Google Scholar and databases like NCBI were searched to find and study several original pieces of literature, review articles, and DNA gene sequences. In parallel with in silico studies, the experiences of the authors were helpful for selecting appropriate sources and writing this review article. There is a significant variety of virulence genes among UPEC strains. The DNA sequences of virulence genes are fabulous patterns for designing microarray probes. The location of virulence genes and their sequence lengths influence the quality of probes. The use of selected virulence genes for designing microarray probes gives us a wide range of choices from which the best probe candidates can be chosen. DNA microarray technology provides us with an accurate, rapid, cost-effective, sensitive, and specific molecular diagnostic method which is facilitated by designing microarray probes. Via these tools, we are able to have an accurate diagnosis and a definitive treatment regarding UTIs caused by UPEC pathotypes.

  9. from microarrays and quantitative trait loci to candidate genes

    Indian Academy of Sciences (India)

    Unknown

    2004-10-15

    Oct 15, 2004 ... to candidate genes – A research plan and preliminary results using Drosophila as a model organism and climatic ... Recent developments in molecular genetics ..... scientists in agriculture, medicine and psychology for test-.

  10. Candidate Genes for Testicular Cancer Evaluated by In Situ Protein Expression Analyses on Tissue Microarrays

    Directory of Open Access Journals (Sweden)

    Rolf I. Skotheim

    2003-09-01

    Full Text Available By the use of high-throughput molecular technologies, the number of genes and proteins potentially relevant to testicular germ cell tumor (TGCT and other diseases will increase rapidly. In a recent transcriptional profiling, we demonstrated the overexpression of GRB7 and JUP in TGCTs, confirmed the reported overexpression of CCND2. We also have recent evidences for frequent genetic alterations of FHIT and epigenetic alterations of MGMT. To evaluate whether the expression of these genes is related to any clinicopathological variables, we constructed a tissue microarray with 510 testicular tissue cores from 279 patients diagnosed with TGCT, covering various histological subgroups and clinical stages. By immunohistochemistry, we found that JUP, GRB7, CCND2 proteins were rarely present in normal testis, but frequently expressed at high levels in TGCT. Additionally, all premalignant intratubular germ cell neoplasias were JUP-immunopositive. MGMT and FHIT were expressed by normal testicular tissues, but at significantly lower frequencies in TGCT. Except for CCND2, the expressions of all markers were significantly associated with various TGCT subtypes. In summary, we have developed a high-throughput tool for the evaluation of TGCT markers, utilized this to validate five candidate genes whose protein expressions were indeed deregulated in TGCT.

  11. Gene targeting associated with the radiation sensitivity in squamous cell carcinoma by using microarray analysis

    International Nuclear Information System (INIS)

    Nimura, Yoshinori; Kumagai, Ken; Kouzu, Yoshinao; Higo, Morihiro; Kato, Yoshikuni; Seki, Naohiko; Yamada, Shigeru

    2005-01-01

    In order to identify a set of genes related to radiation sensitivity of squamous cell carcinoma (SCC) and establish a predictive method, we compared expression profiles of radio-sensitive/radio-resistant SCC cell lines, using the in-house cDNA microarray consisting of 2,201 human genes derived from full-length enriched SCC cDNA libraries and the Human oligo chip 30 K (Hitachi Software Engineering). Surviving fractions (SF) after irradiation of heavy iron were calculated by colony formation assay. Three pairs (TE2-TE13, YES5-YES6, and HSC3-HSC2), sensitive (SF1 0.6), were selected for the microarray analysis. The results of cDNA microarray analysis showed that 20 genes in resistant cell lines and 5 genes in sensitive cell lines were up regulated more than 1.5-fold compared with sensitive and resistant cell lines respectively. Fourteen out of 25 genes were confirmed the gene expression profiles by real-time polymerase chain reaction (PCR). Twenty-seven genes identified by Human oligo chip 30 K are candidate for the markers to distinguish radio-sensitive from radio-resistant. These results suggest that the isolated 27 genes are the candidates that might be used as specific molecular markers to predict radiation sensitivity. (author)

  12. Dissecting the organ specificity of insecticide resistance candidate genes in Anopheles gambiae: known and novel candidate genes.

    Science.gov (United States)

    Ingham, Victoria A; Jones, Christopher M; Pignatelli, Patricia; Balabanidou, Vasileia; Vontas, John; Wagstaff, Simon C; Moore, Jonathan D; Ranson, Hilary

    2014-11-25

    The elevated expression of enzymes with insecticide metabolism activity can lead to high levels of insecticide resistance in the malaria vector, Anopheles gambiae. In this study, adult female mosquitoes from an insecticide susceptible and resistant strain were dissected into four different body parts. RNA from each of these samples was used in microarray analysis to determine the enrichment patterns of the key detoxification gene families within the mosquito and to identify additional candidate insecticide resistance genes that may have been overlooked in previous experiments on whole organisms. A general enrichment in the transcription of genes from the four major detoxification gene families (carboxylesterases, glutathione transferases, UDP glucornyltransferases and cytochrome P450s) was observed in the midgut and malpighian tubules. Yet the subset of P450 genes that have previously been implicated in insecticide resistance in An gambiae, show a surprisingly varied profile of tissue enrichment, confirmed by qPCR and, for three candidates, by immunostaining. A stringent selection process was used to define a list of 105 genes that are significantly (p ≤0.001) over expressed in body parts from the resistant versus susceptible strain. Over half of these, including all the cytochrome P450s on this list, were identified in previous whole organism comparisons between the strains, but several new candidates were detected, notably from comparisons of the transcriptomes from dissected abdomen integuments. The use of RNA extracted from the whole organism to identify candidate insecticide resistance genes has a risk of missing candidates if key genes responsible for the phenotype have restricted expression within the body and/or are over expression only in certain tissues. However, as transcription of genes implicated in metabolic resistance to insecticides is not enriched in any one single organ, comparison of the transcriptome of individual dissected body parts cannot

  13. Microarray Gene Expression Analysis to Evaluate Cell Type Specific Expression of Targets Relevant for Immunotherapy of Hematological Malignancies.

    Directory of Open Access Journals (Sweden)

    M J Pont

    Full Text Available Cellular immunotherapy has proven to be effective in the treatment of hematological cancers by donor lymphocyte infusion after allogeneic hematopoietic stem cell transplantation and more recently by targeted therapy with chimeric antigen or T-cell receptor-engineered T cells. However, dependent on the tissue distribution of the antigens that are targeted, anti-tumor responses can be accompanied by undesired side effects. Therefore, detailed tissue distribution analysis is essential to estimate potential efficacy and toxicity of candidate targets for immunotherapy of hematological malignancies. We performed microarray gene expression analysis of hematological malignancies of different origins, healthy hematopoietic cells and various non-hematopoietic cell types from organs that are often targeted in detrimental immune responses after allogeneic stem cell transplantation leading to graft-versus-host disease. Non-hematopoietic cells were also cultured in the presence of IFN-γ to analyze gene expression under inflammatory circumstances. Gene expression was investigated by Illumina HT12.0 microarrays and quality control analysis was performed to confirm the cell-type origin and exclude contamination of non-hematopoietic cell samples with peripheral blood cells. Microarray data were validated by quantitative RT-PCR showing strong correlations between both platforms. Detailed gene expression profiles were generated for various minor histocompatibility antigens and B-cell surface antigens to illustrate the value of the microarray dataset to estimate efficacy and toxicity of candidate targets for immunotherapy. In conclusion, our microarray database provides a relevant platform to analyze and select candidate antigens with hematopoietic (lineage-restricted expression as potential targets for immunotherapy of hematological cancers.

  14. Principles of gene microarray data analysis.

    Science.gov (United States)

    Mocellin, Simone; Rossi, Carlo Riccardo

    2007-01-01

    The development of several gene expression profiling methods, such as comparative genomic hybridization (CGH), differential display, serial analysis of gene expression (SAGE), and gene microarray, together with the sequencing of the human genome, has provided an opportunity to monitor and investigate the complex cascade of molecular events leading to tumor development and progression. The availability of such large amounts of information has shifted the attention of scientists towards a nonreductionist approach to biological phenomena. High throughput technologies can be used to follow changing patterns of gene expression over time. Among them, gene microarray has become prominent because it is easier to use, does not require large-scale DNA sequencing, and allows for the parallel quantification of thousands of genes from multiple samples. Gene microarray technology is rapidly spreading worldwide and has the potential to drastically change the therapeutic approach to patients affected with tumor. Therefore, it is of paramount importance for both researchers and clinicians to know the principles underlying the analysis of the huge amount of data generated with microarray technology.

  15. Candidate gene identification of ovulation-inducing genes by RNA sequencing with an in vivo assay in zebrafish.

    Directory of Open Access Journals (Sweden)

    Wanlada Klangnurak

    Full Text Available We previously reported the microarray-based selection of three ovulation-related genes in zebrafish. We used a different selection method in this study, RNA sequencing analysis. An additional eight up-regulated candidates were found as specifically up-regulated genes in ovulation-induced samples. Changes in gene expression were confirmed by qPCR analysis. Furthermore, up-regulation prior to ovulation during natural spawning was verified in samples from natural pairing. Gene knock-out zebrafish strains of one of the candidates, the starmaker gene (stm, were established by CRISPR genome editing techniques. Unexpectedly, homozygous mutants were fertile and could spawn eggs. However, a high percentage of unfertilized eggs and abnormal embryos were produced from these homozygous females. The results suggest that the stm gene is necessary for fertilization. In this study, we selected additional ovulation-inducing candidate genes, and a novel function of the stm gene was investigated.

  16. Screening key candidate genes and pathways involved in insulinoma by microarray analysis.

    Science.gov (United States)

    Zhou, Wuhua; Gong, Li; Li, Xuefeng; Wan, Yunyan; Wang, Xiangfei; Li, Huili; Jiang, Bin

    2018-06-01

    Insulinoma is a rare type tumor and its genetic features remain largely unknown. This study aimed to search for potential key genes and relevant enriched pathways of insulinoma.The gene expression data from GSE73338 were downloaded from Gene Expression Omnibus database. Differentially expressed genes (DEGs) were identified between insulinoma tissues and normal pancreas tissues, followed by pathway enrichment analysis, protein-protein interaction (PPI) network construction, and module analysis. The expressions of candidate key genes were validated by quantitative real-time polymerase chain reaction (RT-PCR) in insulinoma tissues.A total of 1632 DEGs were obtained, including 1117 upregulated genes and 514 downregulated genes. Pathway enrichment results showed that upregulated DEGs were significantly implicated in insulin secretion, and downregulated DEGs were mainly enriched in pancreatic secretion. PPI network analysis revealed 7 hub genes with degrees more than 10, including GCG (glucagon), GCGR (glucagon receptor), PLCB1 (phospholipase C, beta 1), CASR (calcium sensing receptor), F2R (coagulation factor II thrombin receptor), GRM1 (glutamate metabotropic receptor 1), and GRM5 (glutamate metabotropic receptor 5). DEGs involved in the significant modules were enriched in calcium signaling pathway, protein ubiquitination, and platelet degranulation. Quantitative RT-PCR data confirmed that the expression trends of these hub genes were similar to the results of bioinformatic analysis.The present study demonstrated that candidate DEGs and enriched pathways were the potential critical molecule events involved in the development of insulinoma, and these findings were useful for better understanding of insulinoma genesis.

  17. Identification of Arabidopsis candidate genes in response to biotic and abiotic stresses using comparative microarrays.

    Directory of Open Access Journals (Sweden)

    Arjun Sham

    Full Text Available Plants have evolved with intricate mechanisms to cope with multiple environmental stresses. To adapt with biotic and abiotic stresses, plant responses involve changes at the cellular and molecular levels. The current study was designed to investigate the effects of combinations of different environmental stresses on the transcriptome level of Arabidopsis genome using public microarray databases. We investigated the role of cyclopentenones in mediating plant responses to environmental stress through TGA (TGACG motif-binding factor transcription factor, independently from jasmonic acid. Candidate genes were identified by comparing plants inoculated with Botrytis cinerea or treated with heat, salt or osmotic stress with non-inoculated or non-treated tissues. About 2.5% heat-, 19% salinity- and 41% osmotic stress-induced genes were commonly upregulated by B. cinerea-treatment; and 7.6%, 19% and 48% of genes were commonly downregulated by B. cinerea-treatment, respectively. Our results indicate that plant responses to biotic and abiotic stresses are mediated by several common regulatory genes. Comparisons between transcriptome data from Arabidopsis stressed-plants support our hypothesis that some molecular and biological processes involved in biotic and abiotic stress response are conserved. Thirteen of the common regulated genes to abiotic and biotic stresses were studied in detail to determine their role in plant resistance to B. cinerea. Moreover, a T-DNA insertion mutant of the Responsive to Dehydration gene (rd20, encoding for a member of the caleosin (lipid surface protein family, showed an enhanced sensitivity to B. cinerea infection and drought. Overall, the overlapping of plant responses to abiotic and biotic stresses, coupled with the sensitivity of the rd20 mutant, may provide new interesting programs for increased plant resistance to multiple environmental stresses, and ultimately increases its chances to survive. Future research

  18. Transcriptomic Analysis Using Olive Varieties and Breeding Progenies Identifies Candidate Genes Involved in Plant Architecture.

    Science.gov (United States)

    González-Plaza, Juan J; Ortiz-Martín, Inmaculada; Muñoz-Mérida, Antonio; García-López, Carmen; Sánchez-Sevilla, José F; Luque, Francisco; Trelles, Oswaldo; Bejarano, Eduardo R; De La Rosa, Raúl; Valpuesta, Victoriano; Beuzón, Carmen R

    2016-01-01

    Plant architecture is a critical trait in fruit crops that can significantly influence yield, pruning, planting density and harvesting. Little is known about how plant architecture is genetically determined in olive, were most of the existing varieties are traditional with an architecture poorly suited for modern growing and harvesting systems. In the present study, we have carried out microarray analysis of meristematic tissue to compare expression profiles of olive varieties displaying differences in architecture, as well as seedlings from their cross pooled on the basis of their sharing architecture-related phenotypes. The microarray used, previously developed by our group has already been applied to identify candidates genes involved in regulating juvenile to adult transition in the shoot apex of seedlings. Varieties with distinct architecture phenotypes and individuals from segregating progenies displaying opposite architecture features were used to link phenotype to expression. Here, we identify 2252 differentially expressed genes (DEGs) associated to differences in plant architecture. Microarray results were validated by quantitative RT-PCR carried out on genes with functional annotation likely related to plant architecture. Twelve of these genes were further analyzed in individual seedlings of the corresponding pool. We also examined Arabidopsis mutants in putative orthologs of these targeted candidate genes, finding altered architecture for most of them. This supports a functional conservation between species and potential biological relevance of the candidate genes identified. This study is the first to identify genes associated to plant architecture in olive, and the results obtained could be of great help in future programs aimed at selecting phenotypes adapted to modern cultivation practices in this species.

  19. Microarray profiling of mononuclear peripheral blood cells identifies novel candidate genes related to chemoradiation response in rectal cancer.

    Directory of Open Access Journals (Sweden)

    Pablo Palma

    Full Text Available Preoperative chemoradiation significantly improves oncological outcome in locally advanced rectal cancer. However there is no effective method of predicting tumor response to chemoradiation in these patients. Peripheral blood mononuclear cells have emerged recently as pathology markers of cancer and other diseases, making possible their use as therapy predictors. Furthermore, the importance of the immune response in radiosensivity of solid organs led us to hypothesized that microarray gene expression profiling of peripheral blood mononuclear cells could identify patients with response to chemoradiation in rectal cancer. Thirty five 35 patients with locally advanced rectal cancer were recruited initially to perform the study. Peripheral blood samples were obtained before neaodjuvant treatment. RNA was extracted and purified to obtain cDNA and cRNA for hybridization of microarrays included in Human WG CodeLink bioarrays. Quantitative real time PCR was used to validate microarray experiment data. Results were correlated with pathological response, according to Mandard´s criteria and final UICC Stage (patients with tumor regression grade 1-2 and downstaging being defined as responders and patients with grade 3-5 and no downstaging as non-responders. Twenty seven out of 35 patients were finally included in the study. We performed a multiple t-test using Significance Analysis of Microarrays, to find those genes differing significantly in expression, between responders (n = 11 and non-responders (n = 16 to CRT. The differently expressed genes were: BC 035656.1, CIR, PRDM2, CAPG, FALZ, HLA-DPB2, NUPL2, and ZFP36. The measurement of FALZ (p = 0.029 gene expression level determined by qRT-PCR, showed statistically significant differences between the two groups. Gene expression profiling reveals novel genes in peripheral blood samples of mononuclear cells that could predict responders and non-responders to chemoradiation in patients with

  20. Emerging use of gene expression microarrays in plant physiology.

    Science.gov (United States)

    Wullschleger, Stan D; Difazio, Stephen P

    2003-01-01

    Microarrays have become an important technology for the global analysis of gene expression in humans, animals, plants, and microbes. Implemented in the context of a well-designed experiment, cDNA and oligonucleotide arrays can provide highthroughput, simultaneous analysis of transcript abundance for hundreds, if not thousands, of genes. However, despite widespread acceptance, the use of microarrays as a tool to better understand processes of interest to the plant physiologist is still being explored. To help illustrate current uses of microarrays in the plant sciences, several case studies that we believe demonstrate the emerging application of gene expression arrays in plant physiology were selected from among the many posters and presentations at the 2003 Plant and Animal Genome XI Conference. Based on this survey, microarrays are being used to assess gene expression in plants exposed to the experimental manipulation of air temperature, soil water content and aluminium concentration in the root zone. Analysis often includes characterizing transcript profiles for multiple post-treatment sampling periods and categorizing genes with common patterns of response using hierarchical clustering techniques. In addition, microarrays are also providing insights into developmental changes in gene expression associated with fibre and root elongation in cotton and maize, respectively. Technical and analytical limitations of microarrays are discussed and projects attempting to advance areas of microarray design and data analysis are highlighted. Finally, although much work remains, we conclude that microarrays are a valuable tool for the plant physiologist interested in the characterization and identification of individual genes and gene families with potential application in the fields of agriculture, horticulture and forestry.

  1. Investigation of the molecular relationship between breast cancer and obesity by candidate gene prioritization methods

    Directory of Open Access Journals (Sweden)

    Saba Garshasbi

    2015-10-01

    Full Text Available Background: Cancer and obesity are two major public health concerns. More than 12 million cases of cancer are reported annually. Many reports confirmed obesity as a risk factor for cancer. The molecular relationship between obesity and breast cancer has not been clear yet. The purpose of this study was to investigate priorities of effective genes in the molecular relationship between obesity and breast cancer. Methods: In this study, computer simulation method was used for prioritizing the genes that involved in the molecular links between obesity and breast cancer in laboratory of systems biology and bioinformatics (LBB, Tehran University, Tehran, Iran, from March to July 2014. In this study, ENDEAVOUR software was used for prioritizing the genes and integrating multiple data sources was used for data analysis. Training genes were selected from effective genes in obesity and/or breast cancer. Two groups of candidate genes were selected. The first group was included the existential genes in 5 common region chromosomes (between obesity and breast cancer and the second group was included the results of genes microarray data analysis of research Creighton, et al (In 2012 on patients with breast cancer. The microarray data were analyzed with GER2 software (R online software on GEO website. Finally, both training and candidate genes were entered in ENDEAVOUR software package. Results: The candidate genes were prioritized to four style and five genes in ten of the first priorities were repeated twice. In other word, the outcome of prioritizing of 72 genes (Product of microarray data analysis and genes of 5 common chromosome regions (Between obesity and breast cancer showed, 5 genes (TNFRSF10B, F2, IGFALS, NTRK3 and HSP90B1 were the priorities in the molecular connection between obesity and breast cancer. Conclusion: There are some common genes between breast cancer and obesity. So, molecular relationship is confirmed. In this study the possible effect

  2. Emerging Use of Gene Expression Microarrays in Plant Physiology

    Directory of Open Access Journals (Sweden)

    Stephen P. Difazio

    2006-04-01

    Full Text Available Microarrays have become an important technology for the global analysis of gene expression in humans, animals, plants, and microbes. Implemented in the context of a well-designed experiment, cDNA and oligonucleotide arrays can provide highthroughput, simultaneous analysis of transcript abundance for hundreds, if not thousands, of genes. However, despite widespread acceptance, the use of microarrays as a tool to better understand processes of interest to the plant physiologist is still being explored. To help illustrate current uses of microarrays in the plant sciences, several case studies that we believe demonstrate the emerging application of gene expression arrays in plant physiology were selected from among the many posters and presentations at the 2003 Plant and Animal Genome XI Conference. Based on this survey, microarrays are being used to assess gene expression in plants exposed to the experimental manipulation of air temperature, soil water content and aluminium concentration in the root zone. Analysis often includes characterizing transcript profiles for multiple post-treatment sampling periods and categorizing genes with common patterns of response using hierarchical clustering techniques. In addition, microarrays are also providing insights into developmental changes in gene expression associated with fibre and root elongation in cotton and maize, respectively. Technical and analytical limitations of microarrays are discussed and projects attempting to advance areas of microarray design and data analysis are highlighted. Finally, although much work remains, we conclude that microarrays are a valuable tool for the plant physiologist interested in the characterization and identification of individual genes and gene families with potential application in the fields of agriculture, horticulture and forestry.

  3. Integrated olfactory receptor and microarray gene expression databases

    Directory of Open Access Journals (Sweden)

    Crasto Chiquito J

    2007-06-01

    Full Text Available Abstract Background Gene expression patterns of olfactory receptors (ORs are an important component of the signal encoding mechanism in the olfactory system since they determine the interactions between odorant ligands and sensory neurons. We have developed the Olfactory Receptor Microarray Database (ORMD to house OR gene expression data. ORMD is integrated with the Olfactory Receptor Database (ORDB, which is a key repository of OR gene information. Both databases aim to aid experimental research related to olfaction. Description ORMD is a Web-accessible database that provides a secure data repository for OR microarray experiments. It contains both publicly available and private data; accessing the latter requires authenticated login. The ORMD is designed to allow users to not only deposit gene expression data but also manage their projects/experiments. For example, contributors can choose whether to make their datasets public. For each experiment, users can download the raw data files and view and export the gene expression data. For each OR gene being probed in a microarray experiment, a hyperlink to that gene in ORDB provides access to genomic and proteomic information related to the corresponding olfactory receptor. Individual ORs archived in ORDB are also linked to ORMD, allowing users access to the related microarray gene expression data. Conclusion ORMD serves as a data repository and project management system. It facilitates the study of microarray experiments of gene expression in the olfactory system. In conjunction with ORDB, ORMD integrates gene expression data with the genomic and functional data of ORs, and is thus a useful resource for both olfactory researchers and the public.

  4. AffyMiner: mining differentially expressed genes and biological knowledge in GeneChip microarray data

    Directory of Open Access Journals (Sweden)

    Xia Yuannan

    2006-12-01

    Full Text Available Abstract Background DNA microarrays are a powerful tool for monitoring the expression of tens of thousands of genes simultaneously. With the advance of microarray technology, the challenge issue becomes how to analyze a large amount of microarray data and make biological sense of them. Affymetrix GeneChips are widely used microarrays, where a variety of statistical algorithms have been explored and used for detecting significant genes in the experiment. These methods rely solely on the quantitative data, i.e., signal intensity; however, qualitative data are also important parameters in detecting differentially expressed genes. Results AffyMiner is a tool developed for detecting differentially expressed genes in Affymetrix GeneChip microarray data and for associating gene annotation and gene ontology information with the genes detected. AffyMiner consists of the functional modules, GeneFinder for detecting significant genes in a treatment versus control experiment and GOTree for mapping genes of interest onto the Gene Ontology (GO space; and interfaces to run Cluster, a program for clustering analysis, and GenMAPP, a program for pathway analysis. AffyMiner has been used for analyzing the GeneChip data and the results were presented in several publications. Conclusion AffyMiner fills an important gap in finding differentially expressed genes in Affymetrix GeneChip microarray data. AffyMiner effectively deals with multiple replicates in the experiment and takes into account both quantitative and qualitative data in identifying significant genes. AffyMiner reduces the time and effort needed to compare data from multiple arrays and to interpret the possible biological implications associated with significant changes in a gene's expression.

  5. Comparison of gene coverage of mouse oligonucleotide microarray platforms

    Directory of Open Access Journals (Sweden)

    Medrano Juan F

    2006-03-01

    Full Text Available Abstract Background The increasing use of DNA microarrays for genetical genomics studies generates a need for platforms with complete coverage of the genome. We have compared the effective gene coverage in the mouse genome of different commercial and noncommercial oligonucleotide microarray platforms by performing an in-house gene annotation of probes. We only used information about probes that is available from vendors and followed a process that any researcher may take to find the gene targeted by a given probe. In order to make consistent comparisons between platforms, probes in each microarray were annotated with an Entrez Gene id and the chromosomal position for each gene was obtained from the UCSC Genome Browser Database. Gene coverage was estimated as the percentage of Entrez Genes with a unique position in the UCSC Genome database that is tested by a given microarray platform. Results A MySQL relational database was created to store the mapping information for 25,416 mouse genes and for the probes in five microarray platforms (gene coverage level in parenthesis: Affymetrix430 2.0 (75.6%, ABI Genome Survey (81.24%, Agilent (79.33%, Codelink (78.09%, Sentrix (90.47%; and four array-ready oligosets: Sigma (47.95%, Operon v.3 (69.89%, Operon v.4 (84.03%, and MEEBO (84.03%. The differences in coverage between platforms were highly conserved across chromosomes. Differences in the number of redundant and unspecific probes were also found among arrays. The database can be queried to compare specific genomic regions using a web interface. The software used to create, update and query the database is freely available as a toolbox named ArrayGene. Conclusion The software developed here allows researchers to create updated custom databases by using public or proprietary information on genes for any organisms. ArrayGene allows easy comparisons of gene coverage between microarray platforms for any region of the genome. The comparison presented here

  6. The application of DNA microarrays in gene expression analysis.

    Science.gov (United States)

    van Hal, N L; Vorst, O; van Houwelingen, A M; Kok, E J; Peijnenburg, A; Aharoni, A; van Tunen, A J; Keijer, J

    2000-03-31

    DNA microarray technology is a new and powerful technology that will substantially increase the speed of molecular biological research. This paper gives a survey of DNA microarray technology and its use in gene expression studies. The technical aspects and their potential improvements are discussed. These comprise array manufacturing and design, array hybridisation, scanning, and data handling. Furthermore, it is discussed how DNA microarrays can be applied in the working fields of: safety, functionality and health of food and gene discovery and pathway engineering in plants.

  7. Quantitative multiplex quantum dot in-situ hybridisation based gene expression profiling in tissue microarrays identifies prognostic genes in acute myeloid leukaemia

    Energy Technology Data Exchange (ETDEWEB)

    Tholouli, Eleni [Department of Haematology, Manchester Royal Infirmary, Oxford Road, Manchester, M13 9WL (United Kingdom); MacDermott, Sarah [The Medical School, The University of Manchester, Oxford Road, M13 9PT Manchester (United Kingdom); Hoyland, Judith [School of Biomedicine, Faculty of Medical and Human Sciences, The University of Manchester, Oxford Road, M13 9PT Manchester (United Kingdom); Yin, John Liu [Department of Haematology, Manchester Royal Infirmary, Oxford Road, Manchester, M13 9WL (United Kingdom); Byers, Richard, E-mail: richard.byers@cmft.nhs.uk [School of Cancer and Enabling Sciences, Faculty of Medical and Human Sciences, The University of Manchester, Stopford Building, Oxford Road, M13 9PT Manchester (United Kingdom)

    2012-08-24

    Highlights: Black-Right-Pointing-Pointer Development of a quantitative high throughput in situ expression profiling method. Black-Right-Pointing-Pointer Application to a tissue microarray of 242 AML bone marrow samples. Black-Right-Pointing-Pointer Identification of HOXA4, HOXA9, Meis1 and DNMT3A as prognostic markers in AML. -- Abstract: Measurement and validation of microarray gene signatures in routine clinical samples is problematic and a rate limiting step in translational research. In order to facilitate measurement of microarray identified gene signatures in routine clinical tissue a novel method combining quantum dot based oligonucleotide in situ hybridisation (QD-ISH) and post-hybridisation spectral image analysis was used for multiplex in-situ transcript detection in archival bone marrow trephine samples from patients with acute myeloid leukaemia (AML). Tissue-microarrays were prepared into which white cell pellets were spiked as a standard. Tissue microarrays were made using routinely processed bone marrow trephines from 242 patients with AML. QD-ISH was performed for six candidate prognostic genes using triplex QD-ISH for DNMT1, DNMT3A, DNMT3B, and for HOXA4, HOXA9, Meis1. Scrambled oligonucleotides were used to correct for background staining followed by normalisation of expression against the expression values for the white cell pellet standard. Survival analysis demonstrated that low expression of HOXA4 was associated with poorer overall survival (p = 0.009), whilst high expression of HOXA9 (p < 0.0001), Meis1 (p = 0.005) and DNMT3A (p = 0.04) were associated with early treatment failure. These results demonstrate application of a standardised, quantitative multiplex QD-ISH method for identification of prognostic markers in formalin-fixed paraffin-embedded clinical samples, facilitating measurement of gene expression signatures in routine clinical samples.

  8. Mining biological databases for candidate disease genes

    Science.gov (United States)

    Braun, Terry A.; Scheetz, Todd; Webster, Gregg L.; Casavant, Thomas L.

    2001-07-01

    The publicly-funded effort to sequence the complete nucleotide sequence of the human genome, the Human Genome Project (HGP), has currently produced more than 93% of the 3 billion nucleotides of the human genome into a preliminary `draft' format. In addition, several valuable sources of information have been developed as direct and indirect results of the HGP. These include the sequencing of model organisms (rat, mouse, fly, and others), gene discovery projects (ESTs and full-length), and new technologies such as expression analysis and resources (micro-arrays or gene chips). These resources are invaluable for the researchers identifying the functional genes of the genome that transcribe and translate into the transcriptome and proteome, both of which potentially contain orders of magnitude more complexity than the genome itself. Preliminary analyses of this data identified approximately 30,000 - 40,000 human `genes.' However, the bulk of the effort still remains -- to identify the functional and structural elements contained within the transcriptome and proteome, and to associate function in the transcriptome and proteome to genes. A fortuitous consequence of the HGP is the existence of hundreds of databases containing biological information that may contain relevant data pertaining to the identification of disease-causing genes. The task of mining these databases for information on candidate genes is a commercial application of enormous potential. We are developing a system to acquire and mine data from specific databases to aid our efforts to identify disease genes. A high speed cluster of Linux of workstations is used to analyze sequence and perform distributed sequence alignments as part of our data mining and processing. This system has been used to mine GeneMap99 sequences within specific genomic intervals to identify potential candidate disease genes associated with Bardet-Biedle Syndrome (BBS).

  9. Gene selection for microarray data classification via subspace learning and manifold regularization.

    Science.gov (United States)

    Tang, Chang; Cao, Lijuan; Zheng, Xiao; Wang, Minhui

    2017-12-19

    With the rapid development of DNA microarray technology, large amount of genomic data has been generated. Classification of these microarray data is a challenge task since gene expression data are often with thousands of genes but a small number of samples. In this paper, an effective gene selection method is proposed to select the best subset of genes for microarray data with the irrelevant and redundant genes removed. Compared with original data, the selected gene subset can benefit the classification task. We formulate the gene selection task as a manifold regularized subspace learning problem. In detail, a projection matrix is used to project the original high dimensional microarray data into a lower dimensional subspace, with the constraint that the original genes can be well represented by the selected genes. Meanwhile, the local manifold structure of original data is preserved by a Laplacian graph regularization term on the low-dimensional data space. The projection matrix can serve as an importance indicator of different genes. An iterative update algorithm is developed for solving the problem. Experimental results on six publicly available microarray datasets and one clinical dataset demonstrate that the proposed method performs better when compared with other state-of-the-art methods in terms of microarray data classification. Graphical Abstract The graphical abstract of this work.

  10. Defining the Human Macula Transcriptome and Candidate Retinal Disease Genes UsingEyeSAGE

    Science.gov (United States)

    Rickman, Catherine Bowes; Ebright, Jessica N.; Zavodni, Zachary J.; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P.; Wistow, Graeme; Boon, Kathy; Hauser, Michael A.

    2009-01-01

    Purpose To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Methods Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Results Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. Conclusions The EyeSAGE database, combining three different gene-profiling platforms including the authors’ multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions. PMID:16723438

  11. A Critical Perspective On Microarray Breast Cancer Gene Expression Profiling

    NARCIS (Netherlands)

    Sontrop, H.M.J.

    2015-01-01

    Microarrays offer biologists an exciting tool that allows the simultaneous assessment of gene expression levels for thousands of genes at once. At the time of their inception, microarrays were hailed as the new dawn in cancer biology and oncology practice with the hope that within a decade diseases

  12. Genome-wide prediction and analysis of human tissue-selective genes using microarray expression data

    Directory of Open Access Journals (Sweden)

    Teng Shaolei

    2013-01-01

    Full Text Available Abstract Background Understanding how genes are expressed specifically in particular tissues is a fundamental question in developmental biology. Many tissue-specific genes are involved in the pathogenesis of complex human diseases. However, experimental identification of tissue-specific genes is time consuming and difficult. The accurate predictions of tissue-specific gene targets could provide useful information for biomarker development and drug target identification. Results In this study, we have developed a machine learning approach for predicting the human tissue-specific genes using microarray expression data. The lists of known tissue-specific genes for different tissues were collected from UniProt database, and the expression data retrieved from the previously compiled dataset according to the lists were used for input vector encoding. Random Forests (RFs and Support Vector Machines (SVMs were used to construct accurate classifiers. The RF classifiers were found to outperform SVM models for tissue-specific gene prediction. The results suggest that the candidate genes for brain or liver specific expression can provide valuable information for further experimental studies. Our approach was also applied for identifying tissue-selective gene targets for different types of tissues. Conclusions A machine learning approach has been developed for accurately identifying the candidate genes for tissue specific/selective expression. The approach provides an efficient way to select some interesting genes for developing new biomedical markers and improve our knowledge of tissue-specific expression.

  13. Microarray labeling extension values: laboratory signatures for Affymetrix GeneChips

    Science.gov (United States)

    Lee, Yun-Shien; Chen, Chun-Houh; Tsai, Chi-Neu; Tsai, Chia-Lung; Chao, Angel; Wang, Tzu-Hao

    2009-01-01

    Interlaboratory comparison of microarray data, even when using the same platform, imposes several challenges to scientists. RNA quality, RNA labeling efficiency, hybridization procedures and data-mining tools can all contribute variations in each laboratory. In Affymetrix GeneChips, about 11–20 different 25-mer oligonucleotides are used to measure the level of each transcript. Here, we report that ‘labeling extension values (LEVs)’, which are correlation coefficients between probe intensities and probe positions, are highly correlated with the gene expression levels (GEVs) on eukayotic Affymetrix microarray data. By analyzing LEVs and GEVs in the publicly available 2414 cel files of 20 Affymetrix microarray types covering 13 species, we found that correlations between LEVs and GEVs only exist in eukaryotic RNAs, but not in prokaryotic ones. Surprisingly, Affymetrix results of the same specimens that were analyzed in different laboratories could be clearly differentiated only by LEVs, leading to the identification of ‘laboratory signatures’. In the examined dataset, GSE10797, filtering out high-LEV genes did not compromise the discovery of biological processes that are constructed by differentially expressed genes. In conclusion, LEVs provide a new filtering parameter for microarray analysis of gene expression and it may improve the inter- and intralaboratory comparability of Affymetrix GeneChips data. PMID:19295132

  14. Classification across gene expression microarray studies

    Directory of Open Access Journals (Sweden)

    Kuner Ruprecht

    2009-12-01

    Full Text Available Abstract Background The increasing number of gene expression microarray studies represents an important resource in biomedical research. As a result, gene expression based diagnosis has entered clinical practice for patient stratification in breast cancer. However, the integration and combined analysis of microarray studies remains still a challenge. We assessed the potential benefit of data integration on the classification accuracy and systematically evaluated the generalization performance of selected methods on four breast cancer studies comprising almost 1000 independent samples. To this end, we introduced an evaluation framework which aims to establish good statistical practice and a graphical way to monitor differences. The classification goal was to correctly predict estrogen receptor status (negative/positive and histological grade (low/high of each tumor sample in an independent study which was not used for the training. For the classification we chose support vector machines (SVM, predictive analysis of microarrays (PAM, random forest (RF and k-top scoring pairs (kTSP. Guided by considerations relevant for classification across studies we developed a generalization of kTSP which we evaluated in addition. Our derived version (DV aims to improve the robustness of the intrinsic invariance of kTSP with respect to technologies and preprocessing. Results For each individual study the generalization error was benchmarked via complete cross-validation and was found to be similar for all classification methods. The misclassification rates were substantially higher in classification across studies, when each single study was used as an independent test set while all remaining studies were combined for the training of the classifier. However, with increasing number of independent microarray studies used in the training, the overall classification performance improved. DV performed better than the average and showed slightly less variance. In

  15. A Fisheye Viewer for microarray-based gene expression data.

    Science.gov (United States)

    Wu, Min; Thao, Cheng; Mu, Xiangming; Munson, Ethan V

    2006-10-13

    Microarray has been widely used to measure the relative amounts of every mRNA transcript from the genome in a single scan. Biologists have been accustomed to reading their experimental data directly from tables. However, microarray data are quite large and are stored in a series of files in a machine-readable format, so direct reading of the full data set is not feasible. The challenge is to design a user interface that allows biologists to usefully view large tables of raw microarray-based gene expression data. This paper presents one such interface--an electronic table (E-table) that uses fisheye distortion technology. The Fisheye Viewer for microarray-based gene expression data has been successfully developed to view MIAME data stored in the MAGE-ML format. The viewer can be downloaded from the project web site http://polaris.imt.uwm.edu:7777/fisheye/. The fisheye viewer was implemented in Java so that it could run on multiple platforms. We implemented the E-table by adapting JTable, a default table implementation in the Java Swing user interface library. Fisheye views use variable magnification to balance magnification for easy viewing and compression for maximizing the amount of data on the screen. This Fisheye Viewer is a lightweight but useful tool for biologists to quickly overview the raw microarray-based gene expression data in an E-table.

  16. The application of DNA microarrays in gene expression analysis

    NARCIS (Netherlands)

    Hal, van N.L.W.; Vorst, O.; Houwelingen, van A.M.M.L.; Kok, E.J.; Peijnenburg, A.A.C.M.; Aharoni, A.; Tunen, van A.J.; Keijer, J.

    2000-01-01

    DNA microarray technology is a new and powerful technology that will substantially increase the speed of molecular biological research. This paper gives a survey of DNA microarray technology and its use in gene expression studies. The technical aspects and their potential improvements are discussed.

  17. Discovering biological progression underlying microarray samples.

    Directory of Open Access Journals (Sweden)

    Peng Qiu

    2011-04-01

    Full Text Available In biological systems that undergo processes such as differentiation, a clear concept of progression exists. We present a novel computational approach, called Sample Progression Discovery (SPD, to discover patterns of biological progression underlying microarray gene expression data. SPD assumes that individual samples of a microarray dataset are related by an unknown biological process (i.e., differentiation, development, cell cycle, disease progression, and that each sample represents one unknown point along the progression of that process. SPD aims to organize the samples in a manner that reveals the underlying progression and to simultaneously identify subsets of genes that are responsible for that progression. We demonstrate the performance of SPD on a variety of microarray datasets that were generated by sampling a biological process at different points along its progression, without providing SPD any information of the underlying process. When applied to a cell cycle time series microarray dataset, SPD was not provided any prior knowledge of samples' time order or of which genes are cell-cycle regulated, yet SPD recovered the correct time order and identified many genes that have been associated with the cell cycle. When applied to B-cell differentiation data, SPD recovered the correct order of stages of normal B-cell differentiation and the linkage between preB-ALL tumor cells with their cell origin preB. When applied to mouse embryonic stem cell differentiation data, SPD uncovered a landscape of ESC differentiation into various lineages and genes that represent both generic and lineage specific processes. When applied to a prostate cancer microarray dataset, SPD identified gene modules that reflect a progression consistent with disease stages. SPD may be best viewed as a novel tool for synthesizing biological hypotheses because it provides a likely biological progression underlying a microarray dataset and, perhaps more importantly, the

  18. Complete gene expression profiling of Saccharopolyspora erythraea using GeneChip DNA microarrays

    Directory of Open Access Journals (Sweden)

    Bordoni Roberta

    2007-11-01

    Full Text Available Abstract Background The Saccharopolyspora erythraea genome sequence, recently published, presents considerable divergence from those of streptomycetes in gene organization and function, confirming the remarkable potential of S. erythraea for producing many other secondary metabolites in addition to erythromycin. In order to investigate, at whole transcriptome level, how S. erythraea genes are modulated, a DNA microarray was specifically designed and constructed on the S. erythraea strain NRRL 2338 genome sequence, and the expression profiles of 6494 ORFs were monitored during growth in complex liquid medium. Results The transcriptional analysis identified a set of 404 genes, whose transcriptional signals vary during growth and characterize three distinct phases: a rapid growth until 32 h (Phase A; a growth slowdown until 52 h (Phase B; and another rapid growth phase from 56 h to 72 h (Phase C before the cells enter the stationary phase. A non-parametric statistical method, that identifies chromosomal regions with transcriptional imbalances, determined regional organization of transcription along the chromosome, highlighting differences between core and non-core regions, and strand specific patterns of expression. Microarray data were used to characterize the temporal behaviour of major functional classes and of all the gene clusters for secondary metabolism. The results confirmed that the ery cluster is up-regulated during Phase A and identified six additional clusters (for terpenes and non-ribosomal peptides that are clearly regulated in later phases. Conclusion The use of a S. erythraea DNA microarray improved specificity and sensitivity of gene expression analysis, allowing a global and at the same time detailed picture of how S. erythraea genes are modulated. This work underlines the importance of using DNA microarrays, coupled with an exhaustive statistical and bioinformatic analysis of the results, to understand the transcriptional

  19. Clustering approaches to identifying gene expression patterns from DNA microarray data.

    Science.gov (United States)

    Do, Jin Hwan; Choi, Dong-Kug

    2008-04-30

    The analysis of microarray data is essential for large amounts of gene expression data. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weakness and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.

  20. Microarray Data Analysis of Space Grown Arabidopsis Leaves for Genes Important in Vascular Patterning. Analysis of Space Grown Arabidopsis with Microarray Data from GeneLab: Identification of Genes Important in Vascular Patterning

    Science.gov (United States)

    Weitzel, A. J.; Wyatt, S. E.; Parsons-Wingerter, P.

    2016-01-01

    Venation patterning in leaves is a major determinant of photosynthesis efficiency because of its dependency on vascular transport of photo-assimilates, water, and minerals. Arabidopsis thaliana grown in microgravity show delayed growth and leaf maturation. Gene expression data from the roots, hypocotyl, and leaves of A. thaliana grown during spaceflight vs. ground control analyzed by Affymetrix microarray are available through NASA's GeneLab (GLDS-7). We analyzed the data for differential expression of genes in leaves resulting from the effects of spaceflight on vascular patterning. Two genes were found by preliminary analysis to be up-regulated during spaceflight that may be related to vascular formation. The genes are responsible for coding an ARGOS (Auxin-Regulated Gene Involved in Organ Size)-like protein (potentially affecting cell elongation in the leaves), and an F-box/kelch-repeat protein (possibly contributing to protoxylem specification). Further analysis that will focus on raw data quality assessment and a moderated t-test may further confirm up-regulation of the two genes and/or identify other gene candidates. Plants defective in these genes will then be assessed for phenotype by the mapping and quantification of leaf vascular patterning by NASA's VESsel GENeration (VESGEN) software to model specific vascular differences of plants grown in spaceflight.

  1. Expression microarray identifies the unliganded glucocorticoid receptor as a regulator of gene expression in mammary epithelial cells

    International Nuclear Information System (INIS)

    Ritter, Heather D; Mueller, Christopher R

    2014-01-01

    While glucocorticoids and the liganded glucocorticoid receptor (GR) have a well-established role in the maintenance of differentiation and suppression of apoptosis in breast tissue, the involvement of unliganded GR in cellular processes is less clear. Our previous studies implicated unliganded GR as a positive regulator of the BRCA1 tumour suppressor gene in the absence of glucocorticoid hormone, which suggested it could play a similar role in the regulation of other genes. An shRNA vector directed against GR was used to create mouse mammary cell lines with depleted endogenous levels of this receptor in order to further characterize the role of GR in breast cells. An expression microarray screen for targets of unliganded GR was performed using our GR-depleted cell lines maintained in the absence of glucocorticoids. Candidate genes positively regulated by unliganded GR were identified, classified by Gene Ontology and Ingenuity Pathway Analysis, and validated using quantitative real-time reverse transcriptase PCR. Chromatin immunoprecipitation and dual luciferase expression assays were conducted to further investigate the mechanism through which unliganded GR regulates these genes. Expression microarray analysis revealed 260 targets negatively regulated and 343 targets positively regulated by unliganded GR. A number of the positively regulated targets were involved in pro-apoptotic networks, possibly opposing the activity of liganded GR targets. Validation and further analysis of five candidates from the microarray indicated that two of these, Hsd11b1 and Ch25h, were regulated by unliganded GR in a manner similar to Brca1 during glucocorticoid treatment. Furthermore, GR was shown to interact directly with and upregulate the Ch25h promoter in the absence, but not the presence, of hydrocortisone (HC), confirming our previously described model of gene regulation by unliganded GR. This work presents the first identification of targets of unliganded GR. We propose that

  2. Evaluation of gene expression data generated from expired Affymetrix GeneChip® microarrays using MAQC reference RNA samples

    Directory of Open Access Journals (Sweden)

    Tong Weida

    2010-10-01

    Full Text Available Abstract Background The Affymetrix GeneChip® system is a commonly used platform for microarray analysis but the technology is inherently expensive. Unfortunately, changes in experimental planning and execution, such as the unavailability of previously anticipated samples or a shift in research focus, may render significant numbers of pre-purchased GeneChip® microarrays unprocessed before their manufacturer’s expiration dates. Researchers and microarray core facilities wonder whether expired microarrays are still useful for gene expression analysis. In addition, it was not clear whether the two human reference RNA samples established by the MAQC project in 2005 still maintained their transcriptome integrity over a period of four years. Experiments were conducted to answer these questions. Results Microarray data were generated in 2009 in three replicates for each of the two MAQC samples with either expired Affymetrix U133A or unexpired U133Plus2 microarrays. These results were compared with data obtained in 2005 on the U133Plus2 microarray. The percentage of overlap between the lists of differentially expressed genes (DEGs from U133Plus2 microarray data generated in 2009 and in 2005 was 97.44%. While there was some degree of fold change compression in the expired U133A microarrays, the percentage of overlap between the lists of DEGs from the expired and unexpired microarrays was as high as 96.99%. Moreover, the microarray data generated using the expired U133A microarrays in 2009 were highly concordant with microarray and TaqMan® data generated by the MAQC project in 2005. Conclusions Our results demonstrated that microarray data generated using U133A microarrays, which were more than four years past the manufacturer’s expiration date, were highly specific and consistent with those from unexpired microarrays in identifying DEGs despite some appreciable fold change compression and decrease in sensitivity. Our data also suggested that the

  3. A fisheye viewer for microarray-based gene expression data

    Directory of Open Access Journals (Sweden)

    Munson Ethan V

    2006-10-01

    Full Text Available Abstract Background Microarray has been widely used to measure the relative amounts of every mRNA transcript from the genome in a single scan. Biologists have been accustomed to reading their experimental data directly from tables. However, microarray data are quite large and are stored in a series of files in a machine-readable format, so direct reading of the full data set is not feasible. The challenge is to design a user interface that allows biologists to usefully view large tables of raw microarray-based gene expression data. This paper presents one such interface – an electronic table (E-table that uses fisheye distortion technology. Results The Fisheye Viewer for microarray-based gene expression data has been successfully developed to view MIAME data stored in the MAGE-ML format. The viewer can be downloaded from the project web site http://polaris.imt.uwm.edu:7777/fisheye/. The fisheye viewer was implemented in Java so that it could run on multiple platforms. We implemented the E-table by adapting JTable, a default table implementation in the Java Swing user interface library. Fisheye views use variable magnification to balance magnification for easy viewing and compression for maximizing the amount of data on the screen. Conclusion This Fisheye Viewer is a lightweight but useful tool for biologists to quickly overview the raw microarray-based gene expression data in an E-table.

  4. Genotyping microarray (gene chip) for the ABCR (ABCA4) gene.

    Science.gov (United States)

    Jaakson, K; Zernant, J; Külm, M; Hutchinson, A; Tonisson, N; Glavac, D; Ravnik-Glavac, M; Hawlina, M; Meltzer, M R; Caruso, R C; Testa, F; Maugeri, A; Hoyng, C B; Gouras, P; Simonelli, F; Lewis, R A; Lupski, J R; Cremers, F P M; Allikmets, R

    2003-11-01

    Genetic variation in the ABCR (ABCA4) gene has been associated with five distinct retinal phenotypes, including Stargardt disease/fundus flavimaculatus (STGD/FFM), cone-rod dystrophy (CRD), and age-related macular degeneration (AMD). Comparative genetic analyses of ABCR variation and diagnostics have been complicated by substantial allelic heterogeneity and by differences in screening methods. To overcome these limitations, we designed a genotyping microarray (gene chip) for ABCR that includes all approximately 400 disease-associated and other variants currently described, enabling simultaneous detection of all known ABCR variants. The ABCR genotyping microarray (the ABCR400 chip) was constructed by the arrayed primer extension (APEX) technology. Each sequence change in ABCR was included on the chip by synthesis and application of sequence-specific oligonucleotides. We validated the chip by screening 136 confirmed STGD patients and 96 healthy controls, each of whom we had analyzed previously by single strand conformation polymorphism (SSCP) technology and/or heteroduplex analysis. The microarray was >98% effective in determining the existing genetic variation and was comparable to direct sequencing in that it yielded many sequence changes undetected by SSCP. In STGD patient cohorts, the efficiency of the array to detect disease-associated alleles was between 54% and 78%, depending on the ethnic composition and degree of clinical and molecular characterization of a cohort. In addition, chip analysis suggested a high carrier frequency (up to 1:10) of ABCR variants in the general population. The ABCR genotyping microarray is a robust, cost-effective, and comprehensive screening tool for variation in one gene in which mutations are responsible for a substantial fraction of retinal disease. The ABCR chip is a prototype for the next generation of screening and diagnostic tools in ophthalmic genetics, bridging clinical and scientific research. Copyright 2003 Wiley

  5. Sensitivity and fidelity of DNA microarray improved with integration of Amplified Differential Gene Expression (ADGE

    Directory of Open Access Journals (Sweden)

    Ile Kristina E

    2003-07-01

    Full Text Available Abstract Background The ADGE technique is a method designed to magnify the ratios of gene expression before detection. It improves the detection sensitivity to small change of gene expression and requires small amount of starting material. However, the throughput of ADGE is low. We integrated ADGE with DNA microarray (ADGE microarray and compared it with regular microarray. Results When ADGE was integrated with DNA microarray, a quantitative relationship of a power function between detected and input ratios was found. Because of ratio magnification, ADGE microarray was better able to detect small changes in gene expression in a drug resistant model cell line system. The PCR amplification of templates and efficient labeling reduced the requirement of starting material to as little as 125 ng of total RNA for one slide hybridization and enhanced the signal intensity. Integration of ratio magnification, template amplification and efficient labeling in ADGE microarray reduced artifacts in microarray data and improved detection fidelity. The results of ADGE microarray were less variable and more reproducible than those of regular microarray. A gene expression profile generated with ADGE microarray characterized the drug resistant phenotype, particularly with reference to glutathione, proliferation and kinase pathways. Conclusion ADGE microarray magnified the ratios of differential gene expression in a power function, improved the detection sensitivity and fidelity and reduced the requirement for starting material while maintaining high throughput. ADGE microarray generated a more informative expression pattern than regular microarray.

  6. Gene Expression and Microarray Investigation of Dendrobium ...

    African Journals Online (AJOL)

    blood glucose > 16.7 mmol/L were used as the model group and treated with Dendrobium mixture. (DEN ... Keywords: Diabetes, Gene expression, Dendrobium mixture, Microarray testing ..... homeostasis in airway smooth muscle. Am J.

  7. Prediction potential of candidate biomarker sets identified and validated on gene expression data from multiple datasets

    Directory of Open Access Journals (Sweden)

    Karacali Bilge

    2007-10-01

    Full Text Available Abstract Background Independently derived expression profiles of the same biological condition often have few genes in common. In this study, we created populations of expression profiles from publicly available microarray datasets of cancer (breast, lymphoma and renal samples linked to clinical information with an iterative machine learning algorithm. ROC curves were used to assess the prediction error of each profile for classification. We compared the prediction error of profiles correlated with molecular phenotype against profiles correlated with relapse-free status. Prediction error of profiles identified with supervised univariate feature selection algorithms were compared to profiles selected randomly from a all genes on the microarray platform and b a list of known disease-related genes (a priori selection. We also determined the relevance of expression profiles on test arrays from independent datasets, measured on either the same or different microarray platforms. Results Highly discriminative expression profiles were produced on both simulated gene expression data and expression data from breast cancer and lymphoma datasets on the basis of ER and BCL-6 expression, respectively. Use of relapse-free status to identify profiles for prognosis prediction resulted in poorly discriminative decision rules. Supervised feature selection resulted in more accurate classifications than random or a priori selection, however, the difference in prediction error decreased as the number of features increased. These results held when decision rules were applied across-datasets to samples profiled on the same microarray platform. Conclusion Our results show that many gene sets predict molecular phenotypes accurately. Given this, expression profiles identified using different training datasets should be expected to show little agreement. In addition, we demonstrate the difficulty in predicting relapse directly from microarray data using supervised machine

  8. Evaluation of gene importance in microarray data based upon probability of selection

    Directory of Open Access Journals (Sweden)

    Fu Li M

    2005-03-01

    Full Text Available Abstract Background Microarray devices permit a genome-scale evaluation of gene function. This technology has catalyzed biomedical research and development in recent years. As many important diseases can be traced down to the gene level, a long-standing research problem is to identify specific gene expression patterns linking to metabolic characteristics that contribute to disease development and progression. The microarray approach offers an expedited solution to this problem. However, it has posed a challenging issue to recognize disease-related genes expression patterns embedded in the microarray data. In selecting a small set of biologically significant genes for classifier design, the nature of high data dimensionality inherent in this problem creates substantial amount of uncertainty. Results Here we present a model for probability analysis of selected genes in order to determine their importance. Our contribution is that we show how to derive the P value of each selected gene in multiple gene selection trials based on different combinations of data samples and how to conduct a reliability analysis accordingly. The importance of a gene is indicated by its associated P value in that a smaller value implies higher information content from information theory. On the microarray data concerning the subtype classification of small round blue cell tumors, we demonstrate that the method is capable of finding the smallest set of genes (19 genes with optimal classification performance, compared with results reported in the literature. Conclusion In classifier design based on microarray data, the probability value derived from gene selection based on multiple combinations of data samples enables an effective mechanism for reducing the tendency of fitting local data particularities.

  9. GenePublisher: automated analysis of DNA microarray data

    DEFF Research Database (Denmark)

    Knudsen, Steen; Workman, Christopher; Sicheritz-Ponten, T.

    2003-01-01

    GenePublisher, a system for automatic analysis of data from DNA microarray experiments, has been implemented with a web interface at http://www.cbs.dtu.dk/services/GenePublisher. Raw data are uploaded to the server together with aspecification of the data. The server performs normalization...

  10. DNA microarray revealed and RNAi plants confirmed key genes conferring low Cd accumulation in barley grains

    DEFF Research Database (Denmark)

    Sun, Hongyan; Chen, Zhong-Hua; Chen, Fei

    2015-01-01

    Background Understanding the mechanism of low Cd accumulation in crops is crucial for sustainable safe food production in Cd-contaminated soils. Results Confocal microscopy, atomic absorption spectrometry, gas exchange and chlorophyll fluorescence analyses revealed a distinct difference in Cd...... with a substantial difference between the two genotypes. Cd stress led to higher expression of genes involved in transport, carbohydrate metabolism and signal transduction in the low-grain-Cd-accumulating genotype. Novel transporter genes such as zinc transporter genes were identified as being associated with low Cd...... accumulation. Quantitative RT-PCR confirmed our microarray data. Furthermore, suppression of the zinc transporter genes HvZIP3 and HvZIP8 by RNAi silencing showed increased Cd accumulation and reduced Zn and Mn concentrations in barley grains. Thus, HvZIP3 and HvZIP8 could be candidate genes related to low...

  11. Robust gene selection methods using weighting schemes for microarray data analysis.

    Science.gov (United States)

    Kang, Suyeon; Song, Jongwoo

    2017-09-02

    A common task in microarray data analysis is to identify informative genes that are differentially expressed between two different states. Owing to the high-dimensional nature of microarray data, identification of significant genes has been essential in analyzing the data. However, the performances of many gene selection techniques are highly dependent on the experimental conditions, such as the presence of measurement error or a limited number of sample replicates. We have proposed new filter-based gene selection techniques, by applying a simple modification to significance analysis of microarrays (SAM). To prove the effectiveness of the proposed method, we considered a series of synthetic datasets with different noise levels and sample sizes along with two real datasets. The following findings were made. First, our proposed methods outperform conventional methods for all simulation set-ups. In particular, our methods are much better when the given data are noisy and sample size is small. They showed relatively robust performance regardless of noise level and sample size, whereas the performance of SAM became significantly worse as the noise level became high or sample size decreased. When sufficient sample replicates were available, SAM and our methods showed similar performance. Finally, our proposed methods are competitive with traditional methods in classification tasks for microarrays. The results of simulation study and real data analysis have demonstrated that our proposed methods are effective for detecting significant genes and classification tasks, especially when the given data are noisy or have few sample replicates. By employing weighting schemes, we can obtain robust and reliable results for microarray data analysis.

  12. NMD Microarray Analysis for Rapid Genome-Wide Screen of Mutated Genes in Cancer

    Directory of Open Access Journals (Sweden)

    Maija Wolf

    2005-01-01

    Full Text Available Gene mutations play a critical role in cancer development and progression, and their identification offers possibilities for accurate diagnostics and therapeutic targeting. Finding genes undergoing mutations is challenging and slow, even in the post-genomic era. A new approach was recently developed by Noensie and Dietz to prioritize and focus the search, making use of nonsense-mediated mRNA decay (NMD inhibition and microarray analysis (NMD microarrays in the identification of transcripts containing nonsense mutations. We combined NMD microarrays with array-based CGH (comparative genomic hybridization in order to identify inactivation of tumor suppressor genes in cancer. Such a “mutatomics” screening of prostate cancer cell lines led to the identification of inactivating mutations in the EPHB2 gene. Up to 8% of metastatic uncultured prostate cancers also showed mutations of this gene whose loss of function may confer loss of tissue architecture. NMD microarray analysis could turn out to be a powerful research method to identify novel mutated genes in cancer cell lines, providing targets that could then be further investigated for their clinical relevance and therapeutic potential.

  13. Identification of Quantitative Trait Loci (QTL) and Candidate Genes for Cadmium Tolerance in Populus

    Energy Technology Data Exchange (ETDEWEB)

    Induri, Brahma R [West Virginia University; Ellis, Danielle R [West Virginia University; Slavov, Gancho [West Virginia University; Yin, Tongming [ORNL; Muchero, Wellington [ORNL; Tuskan, Gerald A [ORNL; DiFazio, Stephen P [West Virginia University

    2012-01-01

    Knowledge of genetic variation in response of Populus to heavy metals like cadmium (Cd) is an important step in understanding the underlying mechanisms of tolerance. In this study, a pseudo-backcross pedigree of Populus trichocarpa and Populus deltoides was characterized for Cd exposure. The pedigree showed significant variation for Cd tolerance thus enabling the identification of relatively tolerant and susceptible genotypes for intensive characterization. A total of 16 QTLs at logarithm of odds (LOD) ratio > 2.5, were found to be associated with total dry weight, its components, and root volume. Four major QTLs for total dry weight were mapped to different linkage groups in control (LG III) and Cd conditions (LG XVI) and had opposite allelic effects on Cd tolerance, suggesting that these genomic regions were differentially controlled. The phenotypic variation explained by Cd QTL for all traits under study varied from 5.9% to 11.6% and averaged 8.2% across all QTL. Leaf Cd contents also showed significant variation suggesting the phytoextraction potential of Populus genotypes, though heritability of this trait was low (0.22). A whole-genome microarray study was conducted by using two genotypes with extreme responses for Cd tolerance in the above study and differentially expressed genes were identified. Candidate genes including CAD2 (CADMIUM SENSITIVE 2), HMA5 (HEAVY METAL ATPase5), ATGTST1 (Arabidopsis thaliana Glutathione S-Transferase1), ATGPX6 (Glutathione peroxidase 6), and ATMRP 14 (Arabidopsis thaliana Multidrug Resistance associated Protein 14) were identified from QTL intervals and microarray study. Functional characterization of these candidate genes could enhance phytoremediation capabilities of Populus.

  14. Exploring matrix factorization techniques for significant genes identification of Alzheimer’s disease microarray gene expression data

    Directory of Open Access Journals (Sweden)

    Hu Xiaohua

    2011-07-01

    Full Text Available Abstract Background The wide use of high-throughput DNA microarray technology provide an increasingly detailed view of human transcriptome from hundreds to thousands of genes. Although biomedical researchers typically design microarray experiments to explore specific biological contexts, the relationships between genes are hard to identified because they are complex and noisy high-dimensional data and are often hindered by low statistical power. The main challenge now is to extract valuable biological information from the colossal amount of data to gain insight into biological processes and the mechanisms of human disease. To overcome the challenge requires mathematical and computational methods that are versatile enough to capture the underlying biological features and simple enough to be applied efficiently to large datasets. Methods Unsupervised machine learning approaches provide new and efficient analysis of gene expression profiles. In our study, two unsupervised knowledge-based matrix factorization methods, independent component analysis (ICA and nonnegative matrix factorization (NMF are integrated to identify significant genes and related pathways in microarray gene expression dataset of Alzheimer’s disease. The advantage of these two approaches is they can be performed as a biclustering method by which genes and conditions can be clustered simultaneously. Furthermore, they can group genes into different categories for identifying related diagnostic pathways and regulatory networks. The difference between these two method lies in ICA assume statistical independence of the expression modes, while NMF need positivity constrains to generate localized gene expression profiles. Results In our work, we performed FastICA and non-smooth NMF methods on DNA microarray gene expression data of Alzheimer’s disease respectively. The simulation results shows that both of the methods can clearly classify severe AD samples from control samples, and

  15. Hierarchical information representation and efficient classification of gene expression microarray data

    OpenAIRE

    Bosio, Mattia

    2014-01-01

    In the field of computational biology, microarryas are used to measure the activity of thousands of genes at once and create a global picture of cellular function. Microarrays allow scientists to analyze expression of many genes in a single experiment quickly and eficiently. Even if microarrays are a consolidated research technology nowadays and the trends in high-throughput data analysis are shifting towards new technologies like Next Generation Sequencing (NGS), an optimum method for sample...

  16. Microarray analysis of gene expression profiles of Schistosoma japonicum derived from less-susceptible host water buffalo and susceptible host goat.

    Directory of Open Access Journals (Sweden)

    Jianmei Yang

    Full Text Available BACKGROUND: Water buffalo and goats are natural hosts for S. japonicum in endemic areas of China. The susceptibility of these two hosts to schistosome infection is different, as water buffalo are less conducive to S. japonicum growth and development. To identify genes that may affect schistosome development and survival, we compared gene expression profiles of schistosomes derived from these two natural hosts using high-throughput microarray technology. RESULTS: The worm recovery rate was lower and the length and width of worms from water buffalo were smaller compared to those from goats following S. japonicum infection for 7 weeks. Besides obvious morphological difference between the schistosomes derived from the two hosts, differences were also observed by scanning and transmission electron microscopy. Microarray analysis showed differentially expressed gene patterns for parasites from the two hosts, which revealed that genes related to lipid and nucleotide metabolism, as well as protein folding, sorting, and degradation were upregulated, while others associated with signal transduction, endocrine function, development, immune function, endocytosis, and amino acid/carbohydrate/glycan metabolism were downregulated in schistosomes from water buffalo. KEGG pathway analysis deduced that the differentially expressed genes mainly involved lipid metabolism, the MAPK and ErbB signaling pathways, progesterone-mediated oocyte maturation, dorso-ventral axis formation, reproduction, and endocytosis, etc. CONCLUSION: The microarray gene analysis in schistosomes derived from water buffalo and goats provide a useful platform to disclose differences determining S. japonicum host compatibility to better understand the interplay between natural hosts and parasites, and identify schistosome target genes associated with susceptibility to screen vaccine candidates.

  17. Microarray analysis of the gene expression profile in triethylene ...

    African Journals Online (AJOL)

    Microarray analysis of the gene expression profile in triethylene glycol dimethacrylate-treated human dental pulp cells. ... Conclusions: Our results suggest that TEGDMA can change the many functions of hDPCs through large changes in gene expression levels and complex interactions with different signaling pathways.

  18. Microarray analysis of gene expression profiles in ripening pineapple fruits.

    Science.gov (United States)

    Koia, Jonni H; Moyle, Richard L; Botella, Jose R

    2012-12-18

    Pineapple (Ananas comosus) is a tropical fruit crop of significant commercial importance. Although the physiological changes that occur during pineapple fruit development have been well characterized, little is known about the molecular events that occur during the fruit ripening process. Understanding the molecular basis of pineapple fruit ripening will aid the development of new varieties via molecular breeding or genetic modification. In this study we developed a 9277 element pineapple microarray and used it to profile gene expression changes that occur during pineapple fruit ripening. Microarray analyses identified 271 unique cDNAs differentially expressed at least 1.5-fold between the mature green and mature yellow stages of pineapple fruit ripening. Among these 271 sequences, 184 share significant homology with genes encoding proteins of known function, 53 share homology with genes encoding proteins of unknown function and 34 share no significant homology with any database accession. Of the 237 pineapple sequences with homologs, 160 were up-regulated and 77 were down-regulated during pineapple fruit ripening. DAVID Functional Annotation Cluster (FAC) analysis of all 237 sequences with homologs revealed confident enrichment scores for redox activity, organic acid metabolism, metalloenzyme activity, glycolysis, vitamin C biosynthesis, antioxidant activity and cysteine peptidase activity, indicating the functional significance and importance of these processes and pathways during pineapple fruit development. Quantitative real-time PCR analysis validated the microarray expression results for nine out of ten genes tested. This is the first report of a microarray based gene expression study undertaken in pineapple. Our bioinformatic analyses of the transcript profiles have identified a number of genes, processes and pathways with putative involvement in the pineapple fruit ripening process. This study extends our knowledge of the molecular basis of pineapple fruit

  19. Radioactive cDNA microarray in neurospsychiatry

    International Nuclear Information System (INIS)

    Choe, Jae Gol; Shin, Kyung Ho; Lee, Min Soo; Kim, Meyoung Kon

    2003-01-01

    Microarray technology allows the simultaneous analysis of gene expression patterns of thousands of genes, in a systematic fashion, under a similar set of experimental conditions, thus making the data highly comparable. In some cases arrays are used simply as a primary screen leading to downstream molecular characterization of individual gene candidates. In other cases, the goal of expression profiling is to begin to identify complex regulatory networks underlying developmental processes and disease states. Microarrays were originally used with cell lines or other simple model systems. More recently, microarrays have been used in the analysis of more complex biological tissues including neural systems and the brain. The application of cDNA arrays in neuropsychiatry has lagged behind other fields for a number of reasons. These include a requirement for a large amount of input probe RNA in fluorescent-glass based array systems and the cellular complexity introduced by multicellular brain and neural tissues. An additional factor that impacts the general use of microarrays in neuropsychiatry is the lack of availability of sequenced clone sets from model systems. While human cDNA clones have been widely available, high quality rat, mouse, and drosophilae, among others are just becoming widely available. A final factor in the application of cDNA microarrays in neuropsychiatry is cost of commercial arrays. As academic microarray facilitates become more commonplace custom made arrays will become more widely available at a lower cost allowing more widespread applications. In summary, microarray technology is rapidly having an impact on many areas of biomedical research. Radioisotope-nylon based microarrays offer alternatives that may in some cases be more sensitive, flexible, inexpensive, and universal as compared to other array formats, such as fluorescent-glass arrays. In some situations of limited RNA or exotic species, radioactive membrane microarrays may be the most

  20. Radioactive cDNA microarray in neurospsychiatry

    Energy Technology Data Exchange (ETDEWEB)

    Choe, Jae Gol; Shin, Kyung Ho; Lee, Min Soo; Kim, Meyoung Kon [Korea University Medical School, Seoul (Korea, Republic of)

    2003-02-01

    Microarray technology allows the simultaneous analysis of gene expression patterns of thousands of genes, in a systematic fashion, under a similar set of experimental conditions, thus making the data highly comparable. In some cases arrays are used simply as a primary screen leading to downstream molecular characterization of individual gene candidates. In other cases, the goal of expression profiling is to begin to identify complex regulatory networks underlying developmental processes and disease states. Microarrays were originally used with cell lines or other simple model systems. More recently, microarrays have been used in the analysis of more complex biological tissues including neural systems and the brain. The application of cDNA arrays in neuropsychiatry has lagged behind other fields for a number of reasons. These include a requirement for a large amount of input probe RNA in fluorescent-glass based array systems and the cellular complexity introduced by multicellular brain and neural tissues. An additional factor that impacts the general use of microarrays in neuropsychiatry is the lack of availability of sequenced clone sets from model systems. While human cDNA clones have been widely available, high quality rat, mouse, and drosophilae, among others are just becoming widely available. A final factor in the application of cDNA microarrays in neuropsychiatry is cost of commercial arrays. As academic microarray facilitates become more commonplace custom made arrays will become more widely available at a lower cost allowing more widespread applications. In summary, microarray technology is rapidly having an impact on many areas of biomedical research. Radioisotope-nylon based microarrays offer alternatives that may in some cases be more sensitive, flexible, inexpensive, and universal as compared to other array formats, such as fluorescent-glass arrays. In some situations of limited RNA or exotic species, radioactive membrane microarrays may be the most

  1. Microarray Data Analysis of Space Grown Arabidopsis Leaves for Genes Important in Vascular Patterning

    Science.gov (United States)

    Weitzeal, A. J.; Wyatt, S. E.; Parsons-Wingerter, P.

    2016-01-01

    Venation patterning in leaves is a major determinant of photosynthesis efficiency because of its dependency on vascular transport of photoassimilates, water, and minerals. Arabidopsis thaliana grown in microgravity show delayed growth and leaf maturation. Gene expression data from the roots, hypocotyl, and leaves of A. thaliana grown during spaceflight vs. ground control analyzed by Affymetrix microarray are available through NASAs GeneLab (GLDS-7). We analyzed the data for differential expression of genes in leaves resulting from the effects of spaceflight on vascular patterning. Two genes were found by preliminary analysis to be upregulated during spaceflight that may be related to vascular formation. The genes are responsible for coding an ARGOS like protein (potentially affecting cell elongation in the leaves), and an F-boxkelch-repeat protein (possibly contributing to protoxylem specification). Further analysis that will focus on raw data quality assessment and a moderated t-test may further confirm upregulation of the two genes and/or identify other gene candidates. Plants defective in these genes will then be assessed for phenotype by the mapping and quantification of leaf vascular patterning by NASAs VESsel GENeration (VESGEN) software to model specific vascular differences of plants grown in spaceflight.

  2. Microarray analysis and scale-free gene networks identify candidate regulators in drought-stressed roots of loblolly pine (P. taeda L.

    Directory of Open Access Journals (Sweden)

    Bordeaux John M

    2011-05-01

    Full Text Available Abstract Background Global transcriptional analysis of loblolly pine (Pinus taeda L. is challenging due to limited molecular tools. PtGen2, a 26,496 feature cDNA microarray, was fabricated and used to assess drought-induced gene expression in loblolly pine propagule roots. Statistical analysis of differential expression and weighted gene correlation network analysis were used to identify drought-responsive genes and further characterize the molecular basis of drought tolerance in loblolly pine. Results Microarrays were used to interrogate root cDNA populations obtained from 12 genotype × treatment combinations (four genotypes, three watering regimes. Comparison of drought-stressed roots with roots from the control treatment identified 2445 genes displaying at least a 1.5-fold expression difference (false discovery rate = 0.01. Genes commonly associated with drought response in pine and other plant species, as well as a number of abiotic and biotic stress-related genes, were up-regulated in drought-stressed roots. Only 76 genes were identified as differentially expressed in drought-recovered roots, indicating that the transcript population can return to the pre-drought state within 48 hours. Gene correlation analysis predicts a scale-free network topology and identifies eleven co-expression modules that ranged in size from 34 to 938 members. Network topological parameters identified a number of central nodes (hubs including those with significant homology (E-values ≤ 2 × 10-30 to 9-cis-epoxycarotenoid dioxygenase, zeatin O-glucosyltransferase, and ABA-responsive protein. Identified hubs also include genes that have been associated previously with osmotic stress, phytohormones, enzymes that detoxify reactive oxygen species, and several genes of unknown function. Conclusion PtGen2 was used to evaluate transcriptome responses in loblolly pine and was leveraged to identify 2445 differentially expressed genes responding to severe drought stress in

  3. Microarray analysis and scale-free gene networks identify candidate regulators in drought-stressed roots of loblolly pine (P. taeda L.)

    Science.gov (United States)

    2011-01-01

    Background Global transcriptional analysis of loblolly pine (Pinus taeda L.) is challenging due to limited molecular tools. PtGen2, a 26,496 feature cDNA microarray, was fabricated and used to assess drought-induced gene expression in loblolly pine propagule roots. Statistical analysis of differential expression and weighted gene correlation network analysis were used to identify drought-responsive genes and further characterize the molecular basis of drought tolerance in loblolly pine. Results Microarrays were used to interrogate root cDNA populations obtained from 12 genotype × treatment combinations (four genotypes, three watering regimes). Comparison of drought-stressed roots with roots from the control treatment identified 2445 genes displaying at least a 1.5-fold expression difference (false discovery rate = 0.01). Genes commonly associated with drought response in pine and other plant species, as well as a number of abiotic and biotic stress-related genes, were up-regulated in drought-stressed roots. Only 76 genes were identified as differentially expressed in drought-recovered roots, indicating that the transcript population can return to the pre-drought state within 48 hours. Gene correlation analysis predicts a scale-free network topology and identifies eleven co-expression modules that ranged in size from 34 to 938 members. Network topological parameters identified a number of central nodes (hubs) including those with significant homology (E-values ≤ 2 × 10-30) to 9-cis-epoxycarotenoid dioxygenase, zeatin O-glucosyltransferase, and ABA-responsive protein. Identified hubs also include genes that have been associated previously with osmotic stress, phytohormones, enzymes that detoxify reactive oxygen species, and several genes of unknown function. Conclusion PtGen2 was used to evaluate transcriptome responses in loblolly pine and was leveraged to identify 2445 differentially expressed genes responding to severe drought stress in roots. Many of the

  4. Amygdala-enriched genes identified by microarray technology are restricted to specific amygdaloid subnuclei

    OpenAIRE

    Zirlinger, M.; Kreiman, Gabriel; Anderson, D. J.

    2001-01-01

    Microarray technology represents a potentially powerful method for identifying cell type- and regionally restricted genes expressed in the brain. Here we have combined a microarray analysis of differential gene expression among five selected brain regions, including the amygdala, cerebellum, hippocampus, olfactory bulb, and periaqueductal gray, with in situ hybridization. On average, 0.3% of the 34,000 genes interrogated were highly enriched in each of the five regions...

  5. Development of a porcine skeletal muscle cDNA microarray: analysis of differential transcript expression in phenotypically distinct muscles

    Directory of Open Access Journals (Sweden)

    Stear Michael

    2003-03-01

    Full Text Available Abstract Background Microarray profiling has the potential to illuminate the molecular processes that govern the phenotypic characteristics of porcine skeletal muscles, such as hypertrophy or atrophy, and the expression of specific fibre types. This information is not only important for understanding basic muscle biology but also provides underpinning knowledge for enhancing the efficiency of livestock production. Results We report on the de novo development of a composite skeletal muscle cDNA microarray, comprising 5500 clones from two developmentally distinct cDNA libraries (longissimus dorsi of a 50-day porcine foetus and the gastrocnemius of a 3-day-old pig. Clones selected for the microarray assembly were of low to moderate abundance, as indicated by colony hybridisation. We profiled the differential expression of genes between the psoas (red muscle and the longissimus dorsi (white muscle, by co-hybridisation of Cy3 and Cy5 labelled cDNA derived from these two muscles. Results from seven microarray slides (replicates correctly identified genes that were expected to be differentially expressed, as well as a number of novel candidate regulatory genes. Quantitative real-time RT-PCR on selected genes was used to confirm the results from the microarray. Conclusion We have developed a porcine skeletal muscle cDNA microarray and have identified a number of candidate genes that could be involved in muscle phenotype determination, including several members of the casein kinase 2 signalling pathway.

  6. Observation of intermittency in gene expression on cDNA microarrays

    CERN Document Server

    Peterson, L E

    2002-01-01

    We used scaled factorial moments to search for intermittency in the log expression ratios (LERs) for thousands of genes spotted on cDNA microarrays (gene chips). Results indicate varying levels of intermittency in gene expression. The observation of intermittency in the data analyzed provides a complimentary handle on moderately expressed genes, generally not tackled by conventional techniques.

  7. Identification of cytokinin-responsive genes using microarray meta-analysis and RNA-Seq in Arabidopsis.

    Science.gov (United States)

    Bhargava, Apurva; Clabaugh, Ivory; To, Jenn P; Maxwell, Bridey B; Chiang, Yi-Hsuan; Schaller, G Eric; Loraine, Ann; Kieber, Joseph J

    2013-05-01

    Cytokinins are N(6)-substituted adenine derivatives that play diverse roles in plant growth and development. We sought to define a robust set of genes regulated by cytokinin as well as to query the response of genes not represented on microarrays. To this end, we performed a meta-analysis of microarray data from a variety of cytokinin-treated samples and used RNA-seq to examine cytokinin-regulated gene expression in Arabidopsis (Arabidopsis thaliana). Microarray meta-analysis using 13 microarray experiments combined with empirically defined filtering criteria identified a set of 226 genes differentially regulated by cytokinin, a subset of which has previously been validated by other methods. RNA-seq validated about 73% of the up-regulated genes identified by this meta-analysis. In silico promoter analysis indicated an overrepresentation of type-B Arabidopsis response regulator binding elements, consistent with the role of type-B Arabidopsis response regulators as primary mediators of cytokinin-responsive gene expression. RNA-seq analysis identified 73 cytokinin-regulated genes that were not represented on the ATH1 microarray. Representative genes were verified using quantitative reverse transcription-polymerase chain reaction and NanoString analysis. Analysis of the genes identified reveals a substantial effect of cytokinin on genes encoding proteins involved in secondary metabolism, particularly those acting in flavonoid and phenylpropanoid biosynthesis, as well as in the regulation of redox state of the cell, particularly a set of glutaredoxin genes. Novel splicing events were found in members of some gene families that are known to play a role in cytokinin signaling or metabolism. The genes identified in this analysis represent a robust set of cytokinin-responsive genes that are useful in the analysis of cytokinin function in plants.

  8. Investigating the effect of paralogs on microarray gene-set analysis

    LENUS (Irish Health Repository)

    Faure, Andre J

    2011-01-24

    Abstract Background In order to interpret the results obtained from a microarray experiment, researchers often shift focus from analysis of individual differentially expressed genes to analyses of sets of genes. These gene-set analysis (GSA) methods use previously accumulated biological knowledge to group genes into sets and then aim to rank these gene sets in a way that reflects their relative importance in the experimental situation in question. We suspect that the presence of paralogs affects the ability of GSA methods to accurately identify the most important sets of genes for subsequent research. Results We show that paralogs, which typically have high sequence identity and similar molecular functions, also exhibit high correlation in their expression patterns. We investigate this correlation as a potential confounding factor common to current GSA methods using Indygene http:\\/\\/www.cbio.uct.ac.za\\/indygene, a web tool that reduces a supplied list of genes so that it includes no pairwise paralogy relationships above a specified sequence similarity threshold. We use the tool to reanalyse previously published microarray datasets and determine the potential utility of accounting for the presence of paralogs. Conclusions The Indygene tool efficiently removes paralogy relationships from a given dataset and we found that such a reduction, performed prior to GSA, has the ability to generate significantly different results that often represent novel and plausible biological hypotheses. This was demonstrated for three different GSA approaches when applied to the reanalysis of previously published microarray datasets and suggests that the redundancy and non-independence of paralogs is an important consideration when dealing with GSA methodologies.

  9. Cross-platform analysis of cancer microarray data improves gene expression based classification of phenotypes

    Directory of Open Access Journals (Sweden)

    Eils Roland

    2005-11-01

    Full Text Available Abstract Background The extensive use of DNA microarray technology in the characterization of the cell transcriptome is leading to an ever increasing amount of microarray data from cancer studies. Although similar questions for the same type of cancer are addressed in these different studies, a comparative analysis of their results is hampered by the use of heterogeneous microarray platforms and analysis methods. Results In contrast to a meta-analysis approach where results of different studies are combined on an interpretative level, we investigate here how to directly integrate raw microarray data from different studies for the purpose of supervised classification analysis. We use median rank scores and quantile discretization to derive numerically comparable measures of gene expression from different platforms. These transformed data are then used for training of classifiers based on support vector machines. We apply this approach to six publicly available cancer microarray gene expression data sets, which consist of three pairs of studies, each examining the same type of cancer, i.e. breast cancer, prostate cancer or acute myeloid leukemia. For each pair, one study was performed by means of cDNA microarrays and the other by means of oligonucleotide microarrays. In each pair, high classification accuracies (> 85% were achieved with training and testing on data instances randomly chosen from both data sets in a cross-validation analysis. To exemplify the potential of this cross-platform classification analysis, we use two leukemia microarray data sets to show that important genes with regard to the biology of leukemia are selected in an integrated analysis, which are missed in either single-set analysis. Conclusion Cross-platform classification of multiple cancer microarray data sets yields discriminative gene expression signatures that are found and validated on a large number of microarray samples, generated by different laboratories and

  10. Microarray-Based Identification of Transcription Factor Target Genes

    NARCIS (Netherlands)

    Gorte, M.; Horstman, A.; Page, R.B.; Heidstra, R.; Stromberg, A.; Boutilier, K.A.

    2011-01-01

    Microarray analysis is widely used to identify transcriptional changes associated with genetic perturbation or signaling events. Here we describe its application in the identification of plant transcription factor target genes with emphasis on the design of suitable DNA constructs for controlling TF

  11. Preparation of oligonucleotide microarray for radiation-associated gene expression detection and its application in lung cancer cell lines

    International Nuclear Information System (INIS)

    Guo Wanfeng; Lin Ruxian; Huang Jian; Guo Guozhen; Wang Shengqi

    2005-01-01

    Objective: The response of tumor cell to radiation is accompanied by complex change in patterns of gene expression. It is highly probable that a better understanding of molecular and genetic changes can help to sensitize the radioresistant tumor cells. Methods: Oligonucleotide microarray provides a powerful tool for high-throughput identifying a wider range of genes involved in the radioresistance. Therefore, the authors designed one oligonucleotide microarray according to the biological effect of IR. By using different radiosensitive lung cancer cell lines, the authors identified genes showing altered expression in lung cancer cell lines. To provide independent confirmation of microarray data, semi-quantitative RT-PCR was performed on a selection of genes. Results: In radioresistant A549 cell lines, a total of 18 genes were selected as having significant fold-changes compared to NCI-H446, 8 genes were up-regulated and 10 genes were down-regulated. Subsequently, A549 and NCI-H446 cells were delivered by ionizing radiation. In A549 cell line, we found 22 (19 up-regulated and 3 down-regulated) and 26 (8 up-regulated and 18 down-regulated) differentially expressed genes at 6h and 24h after ionizing radiation. In NCI-H446 cell line, we identified 17 (9 up-regulated and 8 down-regulated) and 18 (6 up-regulated and 12 down-regulated) differentially expressed genes at 6 h and 24 h after ionizing radiation. The authors tested seven genes (MDM2, p53, XRCC5, Bcl-2, PIM2, NFKBIA and Cyclin B1) for RT-PCR, and found that the results were in good agreement with those from the microarray data except for NFKBIA gene, even though the value for each mRNA level might be different between the two measurements. In present study, the authors identified some genes with cell proliferation and anti-apoptosis, such as MdM2, BCL-2, PKCz and PIM2 expression levels increased in A549 cells and decreased in NCI-H446 cells after radiation, and other genes with DNA repair, such as XRCC5, ERCC5

  12. Evaluation of artificial time series microarray data for dynamic gene regulatory network inference.

    Science.gov (United States)

    Xenitidis, P; Seimenis, I; Kakolyris, S; Adamopoulos, A

    2017-08-07

    High-throughput technology like microarrays is widely used in the inference of gene regulatory networks (GRNs). We focused on time series data since we are interested in the dynamics of GRNs and the identification of dynamic networks. We evaluated the amount of information that exists in artificial time series microarray data and the ability of an inference process to produce accurate models based on them. We used dynamic artificial gene regulatory networks in order to create artificial microarray data. Key features that characterize microarray data such as the time separation of directly triggered genes, the percentage of directly triggered genes and the triggering function type were altered in order to reveal the limits that are imposed by the nature of microarray data on the inference process. We examined the effect of various factors on the inference performance such as the network size, the presence of noise in microarray data, and the network sparseness. We used a system theory approach and examined the relationship between the pole placement of the inferred system and the inference performance. We examined the relationship between the inference performance in the time domain and the true system parameter identification. Simulation results indicated that time separation and the percentage of directly triggered genes are crucial factors. Also, network sparseness, the triggering function type and noise in input data affect the inference performance. When two factors were simultaneously varied, it was found that variation of one parameter significantly affects the dynamic response of the other. Crucial factors were also examined using a real GRN and acquired results confirmed simulation findings with artificial data. Different initial conditions were also used as an alternative triggering approach. Relevant results confirmed that the number of datasets constitutes the most significant parameter with regard to the inference performance. Copyright © 2017 Elsevier

  13. Empirical Bayes ranking and selection methods via semiparametric hierarchical mixture models in microarray studies.

    Science.gov (United States)

    Noma, Hisashi; Matsui, Shigeyuki

    2013-05-20

    The main purpose of microarray studies is screening of differentially expressed genes as candidates for further investigation. Because of limited resources in this stage, prioritizing genes are relevant statistical tasks in microarray studies. For effective gene selections, parametric empirical Bayes methods for ranking and selection of genes with largest effect sizes have been proposed (Noma et al., 2010; Biostatistics 11: 281-289). The hierarchical mixture model incorporates the differential and non-differential components and allows information borrowing across differential genes with separation from nuisance, non-differential genes. In this article, we develop empirical Bayes ranking methods via a semiparametric hierarchical mixture model. A nonparametric prior distribution, rather than parametric prior distributions, for effect sizes is specified and estimated using the "smoothing by roughening" approach of Laird and Louis (1991; Computational statistics and data analysis 12: 27-37). We present applications to childhood and infant leukemia clinical studies with microarrays for exploring genes related to prognosis or disease progression. Copyright © 2012 John Wiley & Sons, Ltd.

  14. Validation of candidate genes putatively associated with resistance to SCMV and MDMV in maize (Zea mays L.) by expression profiling

    DEFF Research Database (Denmark)

    Uzarowska, Anna; Dionisio, Giuseppe; Sarholz, Barbara

    2009-01-01

    Background The potyviruses sugarcane mosaic virus (SCMV) and maize dwarf mosaic virus (MDMV) are major pathogens of maize worldwide. Two loci, Scmv1 and Scmv2, have ealier been shown to confer complete resistance to SCMV. Custom-made microarrays containing previously identified SCMV resistance...... the effectiveness and reliability of the combination of different expression profiling approaches for the identification and validation of candidate genes. Genes identified in this study represent possible future targets for manipulation of SCMV resistance in maize....

  15. Improving the scaling normalization for high-density oligonucleotide GeneChip expression microarrays

    Directory of Open Access Journals (Sweden)

    Lu Chao

    2004-07-01

    Full Text Available Abstract Background Normalization is an important step for microarray data analysis to minimize biological and technical variations. Choosing a suitable approach can be critical. The default method in GeneChip expression microarray uses a constant factor, the scaling factor (SF, for every gene on an array. The SF is obtained from a trimmed average signal of the array after excluding the 2% of the probe sets with the highest and the lowest values. Results Among the 76 U34A GeneChip experiments, the total signals on each array showed 25.8% variations in terms of the coefficient of variation, although all microarrays were hybridized with the same amount of biotin-labeled cRNA. The 2% of the probe sets with the highest signals that were normally excluded from SF calculation accounted for 34% to 54% of the total signals (40.7% ± 4.4%, mean ± sd. In comparison with normalization factors obtained from the median signal or from the mean of the log transformed signal, SF showed the greatest variation. The normalization factors obtained from log transformed signals showed least variation. Conclusions Eliminating 40% of the signal data during SF calculation failed to show any benefit. Normalization factors obtained with log transformed signals performed the best. Thus, it is suggested to use the mean of the logarithm transformed data for normalization, rather than the arithmetic mean of signals in GeneChip gene expression microarrays.

  16. Development and validation of a flax (Linum usitatissimum L.) gene expression oligo microarray.

    Science.gov (United States)

    Fenart, Stéphane; Ndong, Yves-Placide Assoumou; Duarte, Jorge; Rivière, Nathalie; Wilmer, Jeroen; van Wuytswinkel, Olivier; Lucau, Anca; Cariou, Emmanuelle; Neutelings, Godfrey; Gutierrez, Laurent; Chabbert, Brigitte; Guillot, Xavier; Tavernier, Reynald; Hawkins, Simon; Thomasset, Brigitte

    2010-10-21

    Flax (Linum usitatissimum L.) has been cultivated for around 9,000 years and is therefore one of the oldest cultivated species. Today, flax is still grown for its oil (oil-flax or linseed cultivars) and its cellulose-rich fibres (fibre-flax cultivars) used for high-value linen garments and composite materials. Despite the wide industrial use of flax-derived products, and our actual understanding of the regulation of both wood fibre production and oil biosynthesis more information must be acquired in both domains. Recent advances in genomics are now providing opportunities to improve our fundamental knowledge of these complex processes. In this paper we report the development and validation of a high-density oligo microarray platform dedicated to gene expression analyses in flax. Nine different RNA samples obtained from flax inner- and outer-stems, seeds, leaves and roots were used to generate a collection of 1,066,481 ESTs by massive parallel pyrosequencing. Sequences were assembled into 59,626 unigenes and 48,021 sequences were selected for oligo design and high-density microarray (Nimblegen 385K) fabrication with eight, non-overlapping 25-mers oligos per unigene. 18 independent experiments were used to evaluate the hybridization quality, precision, specificity and accuracy and all results confirmed the high technical quality of our microarray platform. Cross-validation of microarray data was carried out using quantitative qRT-PCR. Nine target genes were selected on the basis of microarray results and reflected the whole range of fold change (both up-regulated and down-regulated genes in different samples). A statistically significant positive correlation was obtained comparing expression levels for each target gene across all biological replicates both in qRT-PCR and microarray results. Further experiments illustrated the capacity of our arrays to detect differential gene expression in a variety of flax tissues as well as between two contrasted flax varieties

  17. Development and validation of a flax (Linum usitatissimum L. gene expression oligo microarray

    Directory of Open Access Journals (Sweden)

    Gutierrez Laurent

    2010-10-01

    Full Text Available Abstract Background Flax (Linum usitatissimum L. has been cultivated for around 9,000 years and is therefore one of the oldest cultivated species. Today, flax is still grown for its oil (oil-flax or linseed cultivars and its cellulose-rich fibres (fibre-flax cultivars used for high-value linen garments and composite materials. Despite the wide industrial use of flax-derived products, and our actual understanding of the regulation of both wood fibre production and oil biosynthesis more information must be acquired in both domains. Recent advances in genomics are now providing opportunities to improve our fundamental knowledge of these complex processes. In this paper we report the development and validation of a high-density oligo microarray platform dedicated to gene expression analyses in flax. Results Nine different RNA samples obtained from flax inner- and outer-stems, seeds, leaves and roots were used to generate a collection of 1,066,481 ESTs by massive parallel pyrosequencing. Sequences were assembled into 59,626 unigenes and 48,021 sequences were selected for oligo design and high-density microarray (Nimblegen 385K fabrication with eight, non-overlapping 25-mers oligos per unigene. 18 independent experiments were used to evaluate the hybridization quality, precision, specificity and accuracy and all results confirmed the high technical quality of our microarray platform. Cross-validation of microarray data was carried out using quantitative qRT-PCR. Nine target genes were selected on the basis of microarray results and reflected the whole range of fold change (both up-regulated and down-regulated genes in different samples. A statistically significant positive correlation was obtained comparing expression levels for each target gene across all biological replicates both in qRT-PCR and microarray results. Further experiments illustrated the capacity of our arrays to detect differential gene expression in a variety of flax tissues as well

  18. Examination of gene expression in mice exposed to low dose radiation using affymetrix cDNA microarrays

    Energy Technology Data Exchange (ETDEWEB)

    Morris, D.; Knox, D.; Lavoie, J.; Lemon, J.; Boreham, D. [McMaster Univ., Hamilton, Ontario (Canada)

    2005-07-01

    'Full text:' Gamma radiation acts via the indirect effect to damage cells by producing reactive oxygen species (ROS). These ROS are capable damaging macromolecules and, altering signal pathways and gene transcription. Cells have evolved enzymes and mechanisms to scavenge ROS and repair oxidative damage. Microarrays allow the survey of the gene transcription activity of thousands of genes simultaneously. Messenger RNA is extracted from cells, hybridized with the complementary DNA (cDNA) of a microarray chip, and examined with a chip reader. Affymetrix microarray chips have been produced by the CSCHAH in Winnipeg containing 26000 murine genes. Groups of female mice have been exposed to low dose whole body chronic gamma radiation exposures of 0,50,100, and 120 mGy, corresponding to 15,30,60, and 75 weeks, respectively. MRNA from mice brain tissue has been extracted, isolated, converted to cDNA and labeled. Gene expression in each irradiated mouse was compared to the pooled expression of the control mice. Analysis of gene expression levels are performed with microarray analytical software, Array Pro by Media Cybernetics, and powerful statistical software, BRB microarray tools. Differences in gene expressions, focusing on genes for cytokines, DNA repair mechanisms, immuno-modulators, apoptosis pathways, and enzymatic anti-oxidant systems, are being examined and will be reported. (author)

  19. Xylella fastidiosa gene expression analysis by DNA microarrays

    OpenAIRE

    Travensolo,Regiane F.; Carareto-Alves,Lucia M.; Costa,Maria V.C.G.; Lopes,Tiago J.S.; Carrilho,Emanuel; Lemos,Eliana G.M.

    2009-01-01

    Xylella fastidiosa genome sequencing has generated valuable data by identifying genes acting either on metabolic pathways or in associated pathogenicity and virulence. Based on available information on these genes, new strategies for studying their expression patterns, such as microarray technology, were employed. A total of 2,600 primer pairs were synthesized and then used to generate fragments using the PCR technique. The arrays were hybridized against cDNAs labeled during reverse transcrip...

  20. mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling

    Directory of Open Access Journals (Sweden)

    Hala Alshamlan

    2015-01-01

    Full Text Available An artificial bee colony (ABC is a relatively recent swarm intelligence optimization approach. In this paper, we propose the first attempt at applying ABC algorithm in analyzing a microarray gene expression profile. In addition, we propose an innovative feature selection algorithm, minimum redundancy maximum relevance (mRMR, and combine it with an ABC algorithm, mRMR-ABC, to select informative genes from microarray profile. The new approach is based on a support vector machine (SVM algorithm to measure the classification accuracy for selected genes. We evaluate the performance of the proposed mRMR-ABC algorithm by conducting extensive experiments on six binary and multiclass gene expression microarray datasets. Furthermore, we compare our proposed mRMR-ABC algorithm with previously known techniques. We reimplemented two of these techniques for the sake of a fair comparison using the same parameters. These two techniques are mRMR when combined with a genetic algorithm (mRMR-GA and mRMR when combined with a particle swarm optimization algorithm (mRMR-PSO. The experimental results prove that the proposed mRMR-ABC algorithm achieves accurate classification performance using small number of predictive genes when tested using both datasets and compared to previously suggested methods. This shows that mRMR-ABC is a promising approach for solving gene selection and cancer classification problems.

  1. mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling.

    Science.gov (United States)

    Alshamlan, Hala; Badr, Ghada; Alohali, Yousef

    2015-01-01

    An artificial bee colony (ABC) is a relatively recent swarm intelligence optimization approach. In this paper, we propose the first attempt at applying ABC algorithm in analyzing a microarray gene expression profile. In addition, we propose an innovative feature selection algorithm, minimum redundancy maximum relevance (mRMR), and combine it with an ABC algorithm, mRMR-ABC, to select informative genes from microarray profile. The new approach is based on a support vector machine (SVM) algorithm to measure the classification accuracy for selected genes. We evaluate the performance of the proposed mRMR-ABC algorithm by conducting extensive experiments on six binary and multiclass gene expression microarray datasets. Furthermore, we compare our proposed mRMR-ABC algorithm with previously known techniques. We reimplemented two of these techniques for the sake of a fair comparison using the same parameters. These two techniques are mRMR when combined with a genetic algorithm (mRMR-GA) and mRMR when combined with a particle swarm optimization algorithm (mRMR-PSO). The experimental results prove that the proposed mRMR-ABC algorithm achieves accurate classification performance using small number of predictive genes when tested using both datasets and compared to previously suggested methods. This shows that mRMR-ABC is a promising approach for solving gene selection and cancer classification problems.

  2. Genome-Wide Screening of Genes Showing Altered Expression in Liver Metastases of Human Colorectal Cancers by cDNA Microarray

    Directory of Open Access Journals (Sweden)

    Rempei Yanagawa

    2001-01-01

    Full Text Available In spite of intensive and increasingly successful attempts to determine the multiple steps involved in colorectal carcinogenesis, the mechanisms responsible for metastasis of colorectal tumors to the liver remain to be clarified. To identify genes that are candidates for involvement in the metastatic process, we analyzed genome-wide expression profiles of 10 primary colorectal cancers and their corresponding metastatic lesions by means of a cDNA microarray consisting of 9121 human genes. This analysis identified 40 genes whose expression was commonly upregulated in metastatic lesions, and 7 that were commonly downregulated. The upregulated genes encoded proteins involved in cell adhesion, or remodeling of the actin cytoskeleton. Investigation of the functions of more of the altered genes should improve our understanding of metastasis and may identify diagnostic markers and/or novel molecular targets for prevention or therapy of metastatic lesions.

  3. Normalization and gene p-value estimation: issues in microarray data processing.

    Science.gov (United States)

    Fundel, Katrin; Küffner, Robert; Aigner, Thomas; Zimmer, Ralf

    2008-05-28

    Numerous methods exist for basic processing, e.g. normalization, of microarray gene expression data. These methods have an important effect on the final analysis outcome. Therefore, it is crucial to select methods appropriate for a given dataset in order to assure the validity and reliability of expression data analysis. Furthermore, biological interpretation requires expression values for genes, which are often represented by several spots or probe sets on a microarray. How to best integrate spot/probe set values into gene values has so far been a somewhat neglected problem. We present a case study comparing different between-array normalization methods with respect to the identification of differentially expressed genes. Our results show that it is feasible and necessary to use prior knowledge on gene expression measurements to select an adequate normalization method for the given data. Furthermore, we provide evidence that combining spot/probe set p-values into gene p-values for detecting differentially expressed genes has advantages compared to combining expression values for spots/probe sets into gene expression values. The comparison of different methods suggests to use Stouffer's method for this purpose. The study has been conducted on gene expression experiments investigating human joint cartilage samples of osteoarthritis related groups: a cDNA microarray (83 samples, four groups) and an Affymetrix (26 samples, two groups) data set. The apparently straight forward steps of gene expression data analysis, e.g. between-array normalization and detection of differentially regulated genes, can be accomplished by numerous different methods. We analyzed multiple methods and the possible effects and thereby demonstrate the importance of the single decisions taken during data processing. We give guidelines for evaluating normalization outcomes. An overview of these effects via appropriate measures and plots compared to prior knowledge is essential for the biological

  4. Strategies for comparing gene expression profiles from different microarray platforms: application to a case-control experiment.

    Science.gov (United States)

    Severgnini, Marco; Bicciato, Silvio; Mangano, Eleonora; Scarlatti, Francesca; Mezzelani, Alessandra; Mattioli, Michela; Ghidoni, Riccardo; Peano, Clelia; Bonnal, Raoul; Viti, Federica; Milanesi, Luciano; De Bellis, Gianluca; Battaglia, Cristina

    2006-06-01

    Meta-analysis of microarray data is increasingly important, considering both the availability of multiple platforms using disparate technologies and the accumulation in public repositories of data sets from different laboratories. We addressed the issue of comparing gene expression profiles from two microarray platforms by devising a standardized investigative strategy. We tested this procedure by studying MDA-MB-231 cells, which undergo apoptosis on treatment with resveratrol. Gene expression profiles were obtained using high-density, short-oligonucleotide, single-color microarray platforms: GeneChip (Affymetrix) and CodeLink (Amersham). Interplatform analyses were carried out on 8414 common transcripts represented on both platforms, as identified by LocusLink ID, representing 70.8% and 88.6% of annotated GeneChip and CodeLink features, respectively. We identified 105 differentially expressed genes (DEGs) on CodeLink and 42 DEGs on GeneChip. Among them, only 9 DEGs were commonly identified by both platforms. Multiple analyses (BLAST alignment of probes with target sequences, gene ontology, literature mining, and quantitative real-time PCR) permitted us to investigate the factors contributing to the generation of platform-dependent results in single-color microarray experiments. An effective approach to cross-platform comparison involves microarrays of similar technologies, samples prepared by identical methods, and a standardized battery of bioinformatic and statistical analyses.

  5. Radioactive cDNA microarray (II): Gene expression profiling of antidepressant treatment by human cDNA microarray

    Energy Technology Data Exchange (ETDEWEB)

    Lee, Ji Hye; Kang, Rhee Hun; Ham, Byung Joo; Lee, Min Su; Shin, Kyung Ho; Choe, Jae Gol; Kim, Meyoung Kon [College of Medicine, Univ. of Korea, Seoul (Korea, Republic of)

    2003-07-01

    Major depressive disorder is a prevalent psychiatric disorder in primary care, associated with impaired patient functioning and well-being. Fluoxetine is a selective serotonin-reuptake inhibitors (SSRIs) and is a commonly prescribed antidepressant compound. Its action is primarily attributed to selective inhibition of the reuptake of serotonin (5-hydroxytryptamine) in the central nervous system. Objectives ; the aims of this study were two-fold: (1) to determine the usefulness for investigation of the transcription profiles in depression patients, and (2) to assess the differences in gene expression profiles between positive response group and negative response groups by fluoxetine treatment. This study included 53 patients with major depression (26 in positive response group with antidepressant treatment, 27 in negative response group with antidepressant treatment), and 53 healthy controls. To examine the difference of gene expression profile in depression patients, radioactive complementary DNA microarrays were used to evaluate changes in the expression of 1,152 genes in total. Using 33p-labeled probes, this method provided highly sensitive gene expression profiles including brain receptors, drug metabolism, and cellular signaling. Gene transcription profiles were classified into several categories in accordance with the antidepressant gene-regulation. The gene profiles were significantly up-(22 genes) and down-(16 genes) regulated in the positive response group when compared to the control group. Also, in the negative response group, 35 genes were up-regulated and 8 genes were down-regulated when compared to the control group. Consequently, we demonstrated that radioactive human cDNA microarray is highly likely to be an efficient technology for evaluating the gene regulation of antidepressants, such as selective serotonin-reuptake inhibitors (SSRIs), by using high-throughput biotechnology.

  6. Radioactive cDNA microarray (II): Gene expression profiling of antidepressant treatment by human cDNA microarray

    International Nuclear Information System (INIS)

    Lee, Ji Hye; Kang, Rhee Hun; Ham, Byung Joo; Lee, Min Su; Shin, Kyung Ho; Choe, Jae Gol; Kim, Meyoung Kon

    2003-01-01

    Major depressive disorder is a prevalent psychiatric disorder in primary care, associated with impaired patient functioning and well-being. Fluoxetine is a selective serotonin-reuptake inhibitors (SSRIs) and is a commonly prescribed antidepressant compound. Its action is primarily attributed to selective inhibition of the reuptake of serotonin (5-hydroxytryptamine) in the central nervous system. Objectives ; the aims of this study were two-fold: (1) to determine the usefulness for investigation of the transcription profiles in depression patients, and (2) to assess the differences in gene expression profiles between positive response group and negative response groups by fluoxetine treatment. This study included 53 patients with major depression (26 in positive response group with antidepressant treatment, 27 in negative response group with antidepressant treatment), and 53 healthy controls. To examine the difference of gene expression profile in depression patients, radioactive complementary DNA microarrays were used to evaluate changes in the expression of 1,152 genes in total. Using 33p-labeled probes, this method provided highly sensitive gene expression profiles including brain receptors, drug metabolism, and cellular signaling. Gene transcription profiles were classified into several categories in accordance with the antidepressant gene-regulation. The gene profiles were significantly up-(22 genes) and down-(16 genes) regulated in the positive response group when compared to the control group. Also, in the negative response group, 35 genes were up-regulated and 8 genes were down-regulated when compared to the control group. Consequently, we demonstrated that radioactive human cDNA microarray is highly likely to be an efficient technology for evaluating the gene regulation of antidepressants, such as selective serotonin-reuptake inhibitors (SSRIs), by using high-throughput biotechnology

  7. Study of hepatitis B virus gene mutations with enzymatic colorimetry-based DNA microarray.

    Science.gov (United States)

    Mao, Hailei; Wang, Huimin; Zhang, Donglei; Mao, Hongju; Zhao, Jianlong; Shi, Jian; Cui, Zhichu

    2006-01-01

    To establish a modified microarray method for detecting HBV gene mutations in the clinic. Site-specific oligonucleotide probes were immobilized to microarray slides and hybridized to biotin-labeled HBV gene fragments amplified from two-step PCR. Hybridized targets were transferred to nitrocellulose membranes, followed by intensity measurement using BCIP/NBT colorimetry. HBV genes from 99 Hepatitis B patients and 40 healthy blood donors were analyzed. Mutation frequencies of HBV pre-core/core and basic core promoter (BCP) regions were found to be significantly higher in the patient group (42%, 40% versus 2.5%, 5%, P colorimetry method exhibited the same level of sensitivity and reproducibility. An enzymatic colorimetry-based DNA microarray assay was successfully established to monitor HBV mutations. Pre-core/core and BCP mutations of HBV genes could be major causes of HBV infection in HBeAg-negative patients and could also be relevant to chronicity and aggravation of hepatitis B.

  8. Integrated microarray and ChIP analysis identifies multiple Foxa2 dependent target genes in the notochord.

    Science.gov (United States)

    Tamplin, Owen J; Cox, Brian J; Rossant, Janet

    2011-12-15

    The node and notochord are key tissues required for patterning of the vertebrate body plan. Understanding the gene regulatory network that drives their formation and function is therefore important. Foxa2 is a key transcription factor at the top of this genetic hierarchy and finding its targets will help us to better understand node and notochord development. We performed an extensive microarray-based gene expression screen using sorted embryonic notochord cells to identify early notochord-enriched genes. We validated their specificity to the node and notochord by whole mount in situ hybridization. This provides the largest available resource of notochord-expressed genes, and therefore candidate Foxa2 target genes in the notochord. Using existing Foxa2 ChIP-seq data from adult liver, we were able to identify a set of genes expressed in the notochord that had associated regions of Foxa2-bound chromatin. Given that Foxa2 is a pioneer transcription factor, we reasoned that these sites might represent notochord-specific enhancers. Candidate Foxa2-bound regions were tested for notochord specific enhancer function in a zebrafish reporter assay and 7 novel notochord enhancers were identified. Importantly, sequence conservation or predictive models could not have readily identified these regions. Mutation of putative Foxa2 binding elements in two of these novel enhancers abrogated reporter expression and confirmed their Foxa2 dependence. The combination of highly specific gene expression profiling and genome-wide ChIP analysis is a powerful means of understanding developmental pathways, even for small cell populations such as the notochord. Copyright © 2011 Elsevier Inc. All rights reserved.

  9. Can subtle changes in gene expression be consistently detected with different microarray platforms?

    Directory of Open Access Journals (Sweden)

    Kuiper Rowan

    2008-03-01

    Full Text Available Abstract Background The comparability of gene expression data generated with different microarray platforms is still a matter of concern. Here we address the performance and the overlap in the detection of differentially expressed genes for five different microarray platforms in a challenging biological context where differences in gene expression are few and subtle. Results Gene expression profiles in the hippocampus of five wild-type and five transgenic δC-doublecortin-like kinase mice were evaluated with five microarray platforms: Applied Biosystems, Affymetrix, Agilent, Illumina, LGTC home-spotted arrays. Using a fixed false discovery rate of 10% we detected surprising differences between the number of differentially expressed genes per platform. Four genes were selected by ABI, 130 by Affymetrix, 3,051 by Agilent, 54 by Illumina, and 13 by LGTC. Two genes were found significantly differentially expressed by all platforms and the four genes identified by the ABI platform were found by at least three other platforms. Quantitative RT-PCR analysis confirmed 20 out of 28 of the genes detected by two or more platforms and 8 out of 15 of the genes detected by Agilent only. We observed improved correlations between platforms when ranking the genes based on the significance level than with a fixed statistical cut-off. We demonstrate significant overlap in the affected gene sets identified by the different platforms, although biological processes were represented by only partially overlapping sets of genes. Aberrances in GABA-ergic signalling in the transgenic mice were consistently found by all platforms. Conclusion The different microarray platforms give partially complementary views on biological processes affected. Our data indicate that when analyzing samples with only subtle differences in gene expression the use of two different platforms might be more attractive than increasing the number of replicates. Commercial two-color platforms seem to

  10. Cross-platform comparison of SYBR® Green real-time PCR with TaqMan PCR, microarrays and other gene expression measurement technologies evaluated in the MicroArray Quality Control (MAQC study

    Directory of Open Access Journals (Sweden)

    Dial Stacey L

    2008-07-01

    Full Text Available Abstract Background The MicroArray Quality Control (MAQC project evaluated the inter- and intra-platform reproducibility of seven microarray platforms and three quantitative gene expression assays in profiling the expression of two commercially available Reference RNA samples (Nat Biotechnol 24:1115-22, 2006. The tested microarrays were the platforms from Affymetrix, Agilent Technologies, Applied Biosystems, GE Healthcare, Illumina, Eppendorf and the National Cancer Institute, and quantitative gene expression assays included TaqMan® Gene Expression PCR Assay, Standardized (Sta RT-PCR™ and QuantiGene®. The data showed great consistency in gene expression measurements across different microarray platforms, different technologies and test sites. However, SYBR® Green real-time PCR, another common technique utilized by half of all real-time PCR users for gene expression measurement, was not addressed in the MAQC study. In the present study, we compared the performance of SYBR Green PCR with TaqMan PCR, microarrays and other quantitative technologies using the same two Reference RNA samples as the MAQC project. We assessed SYBR Green real-time PCR using commercially available RT2 Profiler™ PCR Arrays from SuperArray, containing primer pairs that have been experimentally validated to ensure gene-specificity and high amplification efficiency. Results The SYBR Green PCR Arrays exhibit good reproducibility among different users, PCR instruments and test sites. In addition, the SYBR Green PCR Arrays have the highest concordance with TaqMan PCR, and a high level of concordance with other quantitative methods and microarrays that were evaluated in this study in terms of fold-change correlation and overlap of lists of differentially expressed genes. Conclusion These data demonstrate that SYBR Green real-time PCR delivers highly comparable results in gene expression measurement with TaqMan PCR and other high-density microarrays.

  11. Evaluation of a gene information summarization system by users during the analysis process of microarray datasets

    Directory of Open Access Journals (Sweden)

    Cohen Aaron

    2009-02-01

    Full Text Available Abstract Background Summarization of gene information in the literature has the potential to help genomics researchers translate basic research into clinical benefits. Gene expression microarrays have been used to study biomarkers for disease and discover novel types of therapeutics and the task of finding information in journal articles on sets of genes is common for translational researchers working with microarray data. However, manually searching and scanning the literature references returned from PubMed is a time-consuming task for scientists. We built and evaluated an automatic summarizer of information on genes studied in microarray experiments. The Gene Information Clustering and Summarization System (GICSS is a system that integrates two related steps of the microarray data analysis process: functional gene clustering and gene information gathering. The system evaluation was conducted during the process of genomic researchers analyzing their own experimental microarray datasets. Results The clusters generated by GICSS were validated by scientists during their microarray analysis process. In addition, presenting sentences in the abstract provided significantly more important information to the users than just showing the title in the default PubMed format. Conclusion The evaluation results suggest that GICSS can be useful for researchers in genomic area. In addition, the hybrid evaluation method, partway between intrinsic and extrinsic system evaluation, may enable researchers to gauge the true usefulness of the tool for the scientists in their natural analysis workflow and also elicit suggestions for future enhancements. Availability GICSS can be accessed online at: http://ir.ohsu.edu/jianji/index.html

  12. Microarray and RT-PCR screening for white spot syndrome virus immediate-early genes in cycloheximide-treated shrimp

    International Nuclear Information System (INIS)

    Liu Wangjing; Chang Yunshiang; Wang Chunghsiung; Kou, Guang-Hsiung; Lo Chufang

    2005-01-01

    Here, we report for the first time the successful use of cycloheximide (CHX) as an inhibitor to block de novo viral protein synthesis during WSSV (white spot syndrome virus) infection. Sixty candidate IE (immediate-early) genes were identified using a global analysis microarray technique. RT-PCR showed that the genes corresponding to ORF126, ORF242 and ORF418 in the Taiwan isolate were consistently CHX-insensitive, and these genes were designated ie1, ie2 and ie3, respectively. The sequences for these IE genes also appear in the two other WSSV isolates that have been sequenced. Three corresponding ORFs were identified in the China WSSV isolate, but only an ORF corresponding to ie1 was predicted in the Thailand isolate. In a promoter activity assay in Sf9 insect cells using EGFP (enhanced green fluorescence protein) as a reporter, ie1 showed very strong promoter activity, producing higher EGFP signals than the insect Orgyia pseudotsugata multicapsid nuclear polyhedrosis virus (OpMNPV) ie2 promoter

  13. Confirming candidate genes for longevity in Drosophila melanogaster using two different genetic backgrounds and selection methods

    DEFF Research Database (Denmark)

    Wit, Janneke; Frydenberg, Jane; Sarup, Pernille Merete

    2013-01-01

    usually focussed on one sex and on flies originating from one genetic background, and results from different studies often do not overlap. Using D. melanogaster selected for increased longevity we aimed to find robust longevity related genes by examining gene expression in both sexes of flies originating......Elucidating genes that affect life span or that can be used as biomarkers for ageing has received attention in diverse studies in recent years. Using model organisms and various approaches several genes have been linked to the longevity phenotype. For Drosophila melanogaster those studies have...... from different genetic backgrounds. Further, we compared expression changes across three ages, when flies were young, middle aged or old, to examine how candidate gene expression changes with the onset of ageing. We selected 10 genes based on their expression differences in prior microarray studies...

  14. Knowledge-based analysis of microarrays for the discovery of transcriptional regulation relationships.

    Science.gov (United States)

    Seok, Junhee; Kaushal, Amit; Davis, Ronald W; Xiao, Wenzhong

    2010-01-18

    The large amount of high-throughput genomic data has facilitated the discovery of the regulatory relationships between transcription factors and their target genes. While early methods for discovery of transcriptional regulation relationships from microarray data often focused on the high-throughput experimental data alone, more recent approaches have explored the integration of external knowledge bases of gene interactions. In this work, we develop an algorithm that provides improved performance in the prediction of transcriptional regulatory relationships by supplementing the analysis of microarray data with a new method of integrating information from an existing knowledge base. Using a well-known dataset of yeast microarrays and the Yeast Proteome Database, a comprehensive collection of known information of yeast genes, we show that knowledge-based predictions demonstrate better sensitivity and specificity in inferring new transcriptional interactions than predictions from microarray data alone. We also show that comprehensive, direct and high-quality knowledge bases provide better prediction performance. Comparison of our results with ChIP-chip data and growth fitness data suggests that our predicted genome-wide regulatory pairs in yeast are reasonable candidates for follow-up biological verification. High quality, comprehensive, and direct knowledge bases, when combined with appropriate bioinformatic algorithms, can significantly improve the discovery of gene regulatory relationships from high throughput gene expression data.

  15. Gene ARMADA: an integrated multi-analysis platform for microarray data implemented in MATLAB.

    Science.gov (United States)

    Chatziioannou, Aristotelis; Moulos, Panagiotis; Kolisis, Fragiskos N

    2009-10-27

    The microarray data analysis realm is ever growing through the development of various tools, open source and commercial. However there is absence of predefined rational algorithmic analysis workflows or batch standardized processing to incorporate all steps, from raw data import up to the derivation of significantly differentially expressed gene lists. This absence obfuscates the analytical procedure and obstructs the massive comparative processing of genomic microarray datasets. Moreover, the solutions provided, heavily depend on the programming skills of the user, whereas in the case of GUI embedded solutions, they do not provide direct support of various raw image analysis formats or a versatile and simultaneously flexible combination of signal processing methods. We describe here Gene ARMADA (Automated Robust MicroArray Data Analysis), a MATLAB implemented platform with a Graphical User Interface. This suite integrates all steps of microarray data analysis including automated data import, noise correction and filtering, normalization, statistical selection of differentially expressed genes, clustering, classification and annotation. In its current version, Gene ARMADA fully supports 2 coloured cDNA and Affymetrix oligonucleotide arrays, plus custom arrays for which experimental details are given in tabular form (Excel spreadsheet, comma separated values, tab-delimited text formats). It also supports the analysis of already processed results through its versatile import editor. Besides being fully automated, Gene ARMADA incorporates numerous functionalities of the Statistics and Bioinformatics Toolboxes of MATLAB. In addition, it provides numerous visualization and exploration tools plus customizable export data formats for seamless integration by other analysis tools or MATLAB, for further processing. Gene ARMADA requires MATLAB 7.4 (R2007a) or higher and is also distributed as a stand-alone application with MATLAB Component Runtime. Gene ARMADA provides a

  16. A Combinatory Approach for Selecting Prognostic Genes in Microarray Studies of Tumour Survivals

    Directory of Open Access Journals (Sweden)

    Qihua Tan

    2009-01-01

    Full Text Available Different from significant gene expression analysis which looks for genes that are differentially regulated, feature selection in the microarray-based prognostic gene expression analysis aims at finding a subset of marker genes that are not only differentially expressed but also informative for prediction. Unfortunately feature selection in literature of microarray study is predominated by the simple heuristic univariate gene filter paradigm that selects differentially expressed genes according to their statistical significances. We introduce a combinatory feature selection strategy that integrates differential gene expression analysis with the Gram-Schmidt process to identify prognostic genes that are both statistically significant and highly informative for predicting tumour survival outcomes. Empirical application to leukemia and ovarian cancer survival data through-within- and cross-study validations shows that the feature space can be largely reduced while achieving improved testing performances.

  17. Gene Expression Browser: Large-Scale and Cross-Experiment Microarray Data Management, Search & Visualization

    Science.gov (United States)

    The amount of microarray gene expression data in public repositories has been increasing exponentially for the last couple of decades. High-throughput microarray data integration and analysis has become a critical step in exploring the large amount of expression data for biological discovery. Howeve...

  18. Microarray-based genomic surveying of gene polymorphisms in Chlamydia trachomatis

    OpenAIRE

    Brunelle, Brian W; Nicholson, Tracy L; Stephens, Richard S

    2004-01-01

    By comparing two fully sequenced genomes of Chlamydia trachomatis using competitive hybridization on DNA microarrays, a logarithmic correlation was demonstrated between the signal ratio of the arrays and the 75-99% range of nucleotide identities of the genes. Variable genes within 14 uncharacterized strains of C. trachomatis were identified by array analysis and verified by DNA sequencing. These genes may be crucial for understanding chlamydial virulence and pathogenesis.

  19. Evaluating historical candidate genes for schizophrenia

    DEFF Research Database (Denmark)

    Farrell, M S; Werge, T; Sklar, P

    2015-01-01

    Prior to the genome-wide association era, candidate gene studies were a major approach in schizophrenia genetics. In this invited review, we consider the current status of 25 historical candidate genes for schizophrenia (for example, COMT, DISC1, DTNBP1 and NRG1). The initial study for 24 of thes...

  20. Fast gene ontology based clustering for microarray experiments.

    Science.gov (United States)

    Ovaska, Kristian; Laakso, Marko; Hautaniemi, Sampsa

    2008-11-21

    Analysis of a microarray experiment often results in a list of hundreds of disease-associated genes. In order to suggest common biological processes and functions for these genes, Gene Ontology annotations with statistical testing are widely used. However, these analyses can produce a very large number of significantly altered biological processes. Thus, it is often challenging to interpret GO results and identify novel testable biological hypotheses. We present fast software for advanced gene annotation using semantic similarity for Gene Ontology terms combined with clustering and heat map visualisation. The methodology allows rapid identification of genes sharing the same Gene Ontology cluster. Our R based semantic similarity open-source package has a speed advantage of over 2000-fold compared to existing implementations. From the resulting hierarchical clustering dendrogram genes sharing a GO term can be identified, and their differences in the gene expression patterns can be seen from the heat map. These methods facilitate advanced annotation of genes resulting from data analysis.

  1. A stratified transcriptomics analysis of polygenic fat and lean mouse adipose tissues identifies novel candidate obesity genes.

    Directory of Open Access Journals (Sweden)

    Nicholas M Morton

    Full Text Available Obesity and metabolic syndrome results from a complex interaction between genetic and environmental factors. In addition to brain-regulated processes, recent genome wide association studies have indicated that genes highly expressed in adipose tissue affect the distribution and function of fat and thus contribute to obesity. Using a stratified transcriptome gene enrichment approach we attempted to identify adipose tissue-specific obesity genes in the unique polygenic Fat (F mouse strain generated by selective breeding over 60 generations for divergent adiposity from a comparator Lean (L strain.To enrich for adipose tissue obesity genes a 'snap-shot' pooled-sample transcriptome comparison of key fat depots and non adipose tissues (muscle, liver, kidney was performed. Known obesity quantitative trait loci (QTL information for the model allowed us to further filter genes for increased likelihood of being causal or secondary for obesity. This successfully identified several genes previously linked to obesity (C1qr1, and Np3r as positional QTL candidate genes elevated specifically in F line adipose tissue. A number of novel obesity candidate genes were also identified (Thbs1, Ppp1r3d, Tmepai, Trp53inp2, Ttc7b, Tuba1a, Fgf13, Fmr that have inferred roles in fat cell function. Quantitative microarray analysis was then applied to the most phenotypically divergent adipose depot after exaggerating F and L strain differences with chronic high fat feeding which revealed a distinct gene expression profile of line, fat depot and diet-responsive inflammatory, angiogenic and metabolic pathways. Selected candidate genes Npr3 and Thbs1, as well as Gys2, a non-QTL gene that otherwise passed our enrichment criteria were characterised, revealing novel functional effects consistent with a contribution to obesity.A focussed candidate gene enrichment strategy in the unique F and L model has identified novel adipose tissue-enriched genes contributing to obesity.

  2. Developing Potential Candidates of Preclinical Preeclampsia

    Directory of Open Access Journals (Sweden)

    Sandra Founds

    2015-11-01

    Full Text Available The potential for developing molecules of interest in preclinical preeclampsia from candidate genes that were discovered on gene expression microarray analysis has been challenged by limited access to additional first trimester trophoblast and decidual tissues. The question of whether these candidates encode secreted proteins that may be detected in maternal circulation early in pregnancy has been investigated using various proteomic methods. Pilot studies utilizing mass spectrometry based proteomic assays, along with enzyme linked immunosorbent assays (ELISAs, and Western immunoblotting in first trimester samples are reported. The novel targeted mass spectrometry methods led to robust multiple reaction monitoring assays. Despite detection of several candidates in early gestation, challenges persist. Future antibody-based studies may lead to a novel multiplex protein panel for screening or detection to prevent or mitigate preeclampsia.

  3. Washing scaling of GeneChip microarray expression

    Directory of Open Access Journals (Sweden)

    Krohn Knut

    2010-05-01

    Full Text Available Abstract Background Post-hybridization washing is an essential part of microarray experiments. Both the quality of the experimental washing protocol and adequate consideration of washing in intensity calibration ultimately affect the quality of the expression estimates extracted from the microarray intensities. Results We conducted experiments on GeneChip microarrays with altered protocols for washing, scanning and staining to study the probe-level intensity changes as a function of the number of washing cycles. For calibration and analysis of the intensity data we make use of the 'hook' method which allows intensity contributions due to non-specific and specific hybridization of perfect match (PM and mismatch (MM probes to be disentangled in a sequence specific manner. On average, washing according to the standard protocol removes about 90% of the non-specific background and about 30-50% and less than 10% of the specific targets from the MM and PM, respectively. Analysis of the washing kinetics shows that the signal-to-noise ratio doubles roughly every ten stringent washing cycles. Washing can be characterized by time-dependent rate constants which reflect the heterogeneous character of target binding to microarray probes. We propose an empirical washing function which estimates the survival of probe bound targets. It depends on the intensity contribution due to specific and non-specific hybridization per probe which can be estimated for each probe using existing methods. The washing function allows probe intensities to be calibrated for the effect of washing. On a relative scale, proper calibration for washing markedly increases expression measures, especially in the limit of small and large values. Conclusions Washing is among the factors which potentially distort expression measures. The proposed first-order correction method allows direct implementation in existing calibration algorithms for microarray data. We provide an experimental

  4. Gene Expression Profiling and Identification of Resistance Genes to Aspergillus flavus Infection in Peanut through EST and Microarray Strategies

    Directory of Open Access Journals (Sweden)

    Baozhu Guo

    2011-06-01

    Full Text Available Aspergillus flavus and A. parasiticus infect peanut seeds and produce aflatoxins, which are associated with various diseases in domestic animals and humans throughout the world. The most cost-effective strategy to minimize aflatoxin contamination involves the development of peanut cultivars that are resistant to fungal infection and/or aflatoxin production. To identify peanut Aspergillus-interactive and peanut Aspergillus-resistance genes, we carried out a large scale peanut Expressed Sequence Tag (EST project which we used to construct a peanut glass slide oligonucleotide microarray. The fabricated microarray represents over 40% of the protein coding genes in the peanut genome. For expression profiling, resistant and susceptible peanut cultivars were infected with a mixture of Aspergillus flavus and parasiticus spores. The subsequent microarray analysis identified 62 genes in resistant cultivars that were up-expressed in response to Aspergillus infection. In addition, we identified 22 putative Aspergillus-resistance genes that were constitutively up-expressed in the resistant cultivar in comparison to the susceptible cultivar. Some of these genes were homologous to peanut, corn, and soybean genes that were previously shown to confer resistance to fungal infection. This study is a first step towards a comprehensive genome-scale platform for developing Aspergillus-resistant peanut cultivars through targeted marker-assisted breeding and genetic engineering.

  5. ZODET: software for the identification, analysis and visualisation of outlier genes in microarray expression data.

    Directory of Open Access Journals (Sweden)

    Daniel L Roden

    Full Text Available Complex human diseases can show significant heterogeneity between patients with the same phenotypic disorder. An outlier detection strategy was developed to identify variants at the level of gene transcription that are of potential biological and phenotypic importance. Here we describe a graphical software package (z-score outlier detection (ZODET that enables identification and visualisation of gross abnormalities in gene expression (outliers in individuals, using whole genome microarray data. Mean and standard deviation of expression in a healthy control cohort is used to detect both over and under-expressed probes in individual test subjects. We compared the potential of ZODET to detect outlier genes in gene expression datasets with a previously described statistical method, gene tissue index (GTI, using a simulated expression dataset and a publicly available monocyte-derived macrophage microarray dataset. Taken together, these results support ZODET as a novel approach to identify outlier genes of potential pathogenic relevance in complex human diseases. The algorithm is implemented using R packages and Java.The software is freely available from http://www.ucl.ac.uk/medicine/molecular-medicine/publications/microarray-outlier-analysis.

  6. Evaluation of an expanded microarray for detecting antibiotic resistance genes in a broad range of gram-negative bacterial pathogens.

    Science.gov (United States)

    Card, Roderick; Zhang, Jiancheng; Das, Priya; Cook, Charlotte; Woodford, Neil; Anjum, Muna F

    2013-01-01

    A microarray capable of detecting genes for resistance to 75 clinically relevant antibiotics encompassing 19 different antimicrobial classes was tested on 132 Gram-negative bacteria. Microarray-positive results correlated >91% with antimicrobial resistance phenotypes, assessed using British Society for Antimicrobial Chemotherapy clinical breakpoints; the overall test specificity was >83%. Microarray-positive results without a corresponding resistance phenotype matched 94% with PCR results, indicating accurate detection of genes present in the respective bacteria by microarray when expression was low or absent and, hence, undetectable by susceptibility testing. The low sensitivity and negative predictive values of the microarray results for identifying resistance to some antimicrobial resistance classes are likely due to the limited number of resistance genes present on the current microarray for those antimicrobial agents or to mutation-based resistance mechanisms. With regular updates, this microarray can be used for clinical diagnostics to help accurate therapeutic options to be taken following infection with multiple-antibiotic-resistant Gram-negative bacteria and prevent treatment failure.

  7. Using rule-based machine learning for candidate disease gene prioritization and sample classification of cancer gene expression data.

    Science.gov (United States)

    Glaab, Enrico; Bacardit, Jaume; Garibaldi, Jonathan M; Krasnogor, Natalio

    2012-01-01

    Microarray data analysis has been shown to provide an effective tool for studying cancer and genetic diseases. Although classical machine learning techniques have successfully been applied to find informative genes and to predict class labels for new samples, common restrictions of microarray analysis such as small sample sizes, a large attribute space and high noise levels still limit its scientific and clinical applications. Increasing the interpretability of prediction models while retaining a high accuracy would help to exploit the information content in microarray data more effectively. For this purpose, we evaluate our rule-based evolutionary machine learning systems, BioHEL and GAssist, on three public microarray cancer datasets, obtaining simple rule-based models for sample classification. A comparison with other benchmark microarray sample classifiers based on three diverse feature selection algorithms suggests that these evolutionary learning techniques can compete with state-of-the-art methods like support vector machines. The obtained models reach accuracies above 90% in two-level external cross-validation, with the added value of facilitating interpretation by using only combinations of simple if-then-else rules. As a further benefit, a literature mining analysis reveals that prioritizations of informative genes extracted from BioHEL's classification rule sets can outperform gene rankings obtained from a conventional ensemble feature selection in terms of the pointwise mutual information between relevant disease terms and the standardized names of top-ranked genes.

  8. Fast Gene Ontology based clustering for microarray experiments

    Directory of Open Access Journals (Sweden)

    Ovaska Kristian

    2008-11-01

    Full Text Available Abstract Background Analysis of a microarray experiment often results in a list of hundreds of disease-associated genes. In order to suggest common biological processes and functions for these genes, Gene Ontology annotations with statistical testing are widely used. However, these analyses can produce a very large number of significantly altered biological processes. Thus, it is often challenging to interpret GO results and identify novel testable biological hypotheses. Results We present fast software for advanced gene annotation using semantic similarity for Gene Ontology terms combined with clustering and heat map visualisation. The methodology allows rapid identification of genes sharing the same Gene Ontology cluster. Conclusion Our R based semantic similarity open-source package has a speed advantage of over 2000-fold compared to existing implementations. From the resulting hierarchical clustering dendrogram genes sharing a GO term can be identified, and their differences in the gene expression patterns can be seen from the heat map. These methods facilitate advanced annotation of genes resulting from data analysis.

  9. Microarray Analysis of Iris Gene Expression in Mice with Mutations Influencing Pigmentation

    Science.gov (United States)

    Trantow, Colleen M.; Cuffy, Tryphena L.; Fingert, John H.; Kuehn, Markus H.

    2011-01-01

    Purpose. Several ocular diseases involve the iris, notably including oculocutaneous albinism, pigment dispersion syndrome, and exfoliation syndrome. To screen for candidate genes that may contribute to the pathogenesis of these diseases, genome-wide iris gene expression patterns were comparatively analyzed from mouse models of these conditions. Methods. Iris samples from albino mice with a Tyr mutation, pigment dispersion–prone mice with Tyrp1 and Gpnmb mutations, and mice resembling exfoliation syndrome with a Lyst mutation were compared with samples from wild-type mice. All mice were strain (C57BL/6J), age (60 days old), and sex (female) matched. Microarrays were used to compare transcriptional profiles, and differentially expressed transcripts were described by functional annotation clustering using DAVID Bioinformatics Resources. Quantitative real-time PCR was performed to validate a subset of identified changes. Results. Compared with wild-type C57BL/6J mice, each disease context exhibited a large number of statistically significant changes in gene expression, including 685 transcripts differentially expressed in albino irides, 403 in pigment dispersion–prone irides, and 460 in exfoliative-like irides. Conclusions. Functional annotation clusterings were particularly striking among the overrepresented genes, with albino and pigment dispersion–prone irides both exhibiting overall evidence of crystallin-mediated stress responses. Exfoliative-like irides from mice with a Lyst mutation showed overall evidence of involvement of genes that influence immune system processes, lytic vacuoles, and lysosomes. These findings have several biologically relevant implications, particularly with respect to secondary forms of glaucoma, and represent a useful resource as a hypothesis-generating dataset. PMID:20739468

  10. Normal uniform mixture differential gene expression detection for cDNA microarrays

    Directory of Open Access Journals (Sweden)

    Raftery Adrian E

    2005-07-01

    Full Text Available Abstract Background One of the primary tasks in analysing gene expression data is finding genes that are differentially expressed in different samples. Multiple testing issues due to the thousands of tests run make some of the more popular methods for doing this problematic. Results We propose a simple method, Normal Uniform Differential Gene Expression (NUDGE detection for finding differentially expressed genes in cDNA microarrays. The method uses a simple univariate normal-uniform mixture model, in combination with new normalization methods for spread as well as mean that extend the lowess normalization of Dudoit, Yang, Callow and Speed (2002 1. It takes account of multiple testing, and gives probabilities of differential expression as part of its output. It can be applied to either single-slide or replicated experiments, and it is very fast. Three datasets are analyzed using NUDGE, and the results are compared to those given by other popular methods: unadjusted and Bonferroni-adjusted t tests, Significance Analysis of Microarrays (SAM, and Empirical Bayes for microarrays (EBarrays with both Gamma-Gamma and Lognormal-Normal models. Conclusion The method gives a high probability of differential expression to genes known/suspected a priori to be differentially expressed and a low probability to the others. In terms of known false positives and false negatives, the method outperforms all multiple-replicate methods except for the Gamma-Gamma EBarrays method to which it offers comparable results with the added advantages of greater simplicity, speed, fewer assumptions and applicability to the single replicate case. An R package called nudge to implement the methods in this paper will be made available soon at http://www.bioconductor.org.

  11. Gene expression profiles in prostate cancer: identification of candidate non-invasive diagnostic markers.

    Science.gov (United States)

    Mengual, L; Ars, E; Lozano, J J; Burset, M; Izquierdo, L; Ingelmo-Torres, M; Gaya, J M; Algaba, F; Villavicencio, H; Ribal, M J; Alcaraz, A

    2014-04-01

    To analyze gene expression profiles of prostate cancer (PCa) with the aim of determining the relevant differentially expressed genes and subsequently ascertain whether this differential expression is maintained in post-prostatic massage (PPM) urine samples. Forty-six tissue specimens (36 from PCa patients and 10 controls) and 158 urine PPM-urines (113 from PCa patients and 45 controls) were collected between December 2003 and May 2007. DNA microarrays were used to identify genes differentially expressed between tumour and control samples. Ten genes were technically validated in the same tissue samples by quantitative RT-PCR (RT-qPCR). Forty two selected differentially expressed genes were validated in an independent set of PPM-urines by qRT-PCR. Multidimensional scaling plot according to the expression of all the microarray genes showed a clear distinction between control and tumour samples. A total of 1047 differentially expressed genes (FDR≤.1) were indentified between both groups of samples. We found a high correlation in the comparison of microarray and RT-qPCR gene expression levels (r=.928, P<.001). Thirteen genes maintained the same fold change direction when analyzed in PPM-urine samples and in four of them (HOXC6, PCA3, PDK4 and TMPRSS2-ERG), these differences were statistically significant (P<.05). The analysis of PCa by DNA microarrays provides new putative mRNA markers for PCa diagnosis that, with caution, can be extrapolated to PPM-urines. Copyright © 2013 AEU. Published by Elsevier Espana. All rights reserved.

  12. An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients

    DEFF Research Database (Denmark)

    Györffy, B; Lanczky, A; Eklund, Aron Charles

    2010-01-01

    Validating prognostic or predictive candidate genes in appropriately powered breast cancer cohorts are of utmost interest. Our aim was to develop an online tool to draw survival plots, which can be used to assess the relevance of the expression levels of various genes on the clinical outcome both...... this integrative data analysis tool to confirm the prognostic power of the proliferation-related genes TOP2A and TOP2B, MKI67, CCND2, CCND3, CCNDE2, as well as CDKN1A, and TK2. We also validated the capability of microarrays to determine estrogen receptor status in 1,231 patients. The tool is highly valuable...

  13. Genes involved in immunity and apoptosis are associated with human presbycusis based on microarray analysis.

    Science.gov (United States)

    Dong, Yang; Li, Ming; Liu, Puzhao; Song, Haiyan; Zhao, Yuping; Shi, Jianrong

    2014-06-01

    Genes involved in immunity and apoptosis were associated with human presbycusis. CCR3 and GILZ played an important role in the pathogenesis of presbycusis, probably through regulating chemokine receptor, T-cell apoptosis, or T-cell activation pathways. To identify genes associated with human presbycusis and explore the molecular mechanism of presbycusis. Hearing function was tested by pure-tone audiometry. Microarray analysis was performed to identify presbycusis-correlated genes by Illumina Human-6 BeadChip using the peripheral blood samples of subjects. To identify biological process categories and pathways associated with presbycusis-correlated genes, bioinformatics analysis was carried out by Gene Ontology Tree Machine (GOTM) and database for annotation, visualization, and integrated discovery (DAVID). Quantitative RT-PCR (qRT-PCR) was used to validate the microarray data. Microarray analysis identified 469 up-regulated genes and 323 down-regulated genes. Both the dominant biological processes by Gene Ontology (GO) analysis and the enriched pathways by Kyoto encyclopedia of genes and genomes (KEGG) and BIOCARTA showed that genes involved in immunity and apoptosis were associated with presbycusis. In addition, CCR3, GILZ, CXCL10, and CX3CR1 genes showed consistent difference between groups for both the gene chip and qRT-PCR data. The differences of CCR3 and GILZ between presbycusis patients and controls were statistically significant (p < 0.05).

  14. GeneRank: Using search engine technology for the analysis of microarray experiments

    Directory of Open Access Journals (Sweden)

    Breitling Rainer

    2005-09-01

    Full Text Available Abstract Background Interpretation of simple microarray experiments is usually based on the fold-change of gene expression between a reference and a "treated" sample where the treatment can be of many types from drug exposure to genetic variation. Interpretation of the results usually combines lists of differentially expressed genes with previous knowledge about their biological function. Here we evaluate a method – based on the PageRank algorithm employed by the popular search engine Google – that tries to automate some of this procedure to generate prioritized gene lists by exploiting biological background information. Results GeneRank is an intuitive modification of PageRank that maintains many of its mathematical properties. It combines gene expression information with a network structure derived from gene annotations (gene ontologies or expression profile correlations. Using both simulated and real data we find that the algorithm offers an improved ranking of genes compared to pure expression change rankings. Conclusion Our modification of the PageRank algorithm provides an alternative method of evaluating microarray experimental results which combines prior knowledge about the underlying network. GeneRank offers an improvement compared to assessing the importance of a gene based on its experimentally observed fold-change alone and may be used as a basis for further analytical developments.

  15. GeneRank: using search engine technology for the analysis of microarray experiments.

    Science.gov (United States)

    Morrison, Julie L; Breitling, Rainer; Higham, Desmond J; Gilbert, David R

    2005-09-21

    Interpretation of simple microarray experiments is usually based on the fold-change of gene expression between a reference and a "treated" sample where the treatment can be of many types from drug exposure to genetic variation. Interpretation of the results usually combines lists of differentially expressed genes with previous knowledge about their biological function. Here we evaluate a method--based on the PageRank algorithm employed by the popular search engine Google--that tries to automate some of this procedure to generate prioritized gene lists by exploiting biological background information. GeneRank is an intuitive modification of PageRank that maintains many of its mathematical properties. It combines gene expression information with a network structure derived from gene annotations (gene ontologies) or expression profile correlations. Using both simulated and real data we find that the algorithm offers an improved ranking of genes compared to pure expression change rankings. Our modification of the PageRank algorithm provides an alternative method of evaluating microarray experimental results which combines prior knowledge about the underlying network. GeneRank offers an improvement compared to assessing the importance of a gene based on its experimentally observed fold-change alone and may be used as a basis for further analytical developments.

  16. Candidate genes in panic disorder

    DEFF Research Database (Denmark)

    Howe, A. S.; Buttenschön, Henriette N; Bani-Fatemi, A.

    2016-01-01

    The utilization of molecular genetics approaches in examination of panic disorder (PD) has implicated several variants as potential susceptibility factors for panicogenesis. However, the identification of robust PD susceptibility genes has been complicated by phenotypic diversity, underpowered...... association studies and ancestry-specific effects. In the present study, we performed a succinct review of case-control association studies published prior to April 2015. Meta-analyses were performed for candidate gene variants examined in at least three studies using the Cochrane Mantel-Haenszel fixed......-effect model. Secondary analyses were also performed to assess the influences of sex, agoraphobia co-morbidity and ancestry-specific effects on panicogenesis. Meta-analyses were performed on 23 variants in 20 PD candidate genes. Significant associations after correction for multiple testing were observed...

  17. No Evidence That Schizophrenia Candidate Genes Are More Associated With Schizophrenia Than Noncandidate Genes.

    Science.gov (United States)

    Johnson, Emma C; Border, Richard; Melroy-Greif, Whitney E; de Leeuw, Christiaan A; Ehringer, Marissa A; Keller, Matthew C

    2017-11-15

    A recent analysis of 25 historical candidate gene polymorphisms for schizophrenia in the largest genome-wide association study conducted to date suggested that these commonly studied variants were no more associated with the disorder than would be expected by chance. However, the same study identified other variants within those candidate genes that demonstrated genome-wide significant associations with schizophrenia. As such, it is possible that variants within historic schizophrenia candidate genes are associated with schizophrenia at levels above those expected by chance, even if the most-studied specific polymorphisms are not. The present study used association statistics from the largest schizophrenia genome-wide association study conducted to date as input to a gene set analysis to investigate whether variants within schizophrenia candidate genes are enriched for association with schizophrenia. As a group, variants in the most-studied candidate genes were no more associated with schizophrenia than were variants in control sets of noncandidate genes. While a small subset of candidate genes did appear to be significantly associated with schizophrenia, these genes were not particularly noteworthy given the large number of more strongly associated noncandidate genes. The history of schizophrenia research should serve as a cautionary tale to candidate gene investigators examining other phenotypes: our findings indicate that the most investigated candidate gene hypotheses of schizophrenia are not well supported by genome-wide association studies, and it is likely that this will be the case for other complex traits as well. Copyright © 2017 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.

  18. Production of DNA microarray and expression analysis of genes from Xylella fastidiosa in different culture media

    Directory of Open Access Journals (Sweden)

    Regiane de Fátima Travensolo

    2009-06-01

    Full Text Available DNA Microarray was developed to monitor the expression of many genes from Xylella fastidiosa, allowing the side by-side comparison of two situations in a single experiment. The experiments were performed using X. fastidiosa cells grown in two culture media: BCYE and XDM2. The primers were synthesized, spotted onto glass slides and the array was hybridized against fluorescently labeled cDNAs. The emitted signals were quantified, normalized and the data were statistically analyzed to verify the differentially expressed genes. According to the data, 104 genes were differentially expressed in XDM2 and 30 genes in BCYE media. The present study showed that DNA microarray technique efficiently differentiate the expressed genes under different conditions.DNA Microarray foi desenvolvida para monitorar a expressão de muitos genes de Xylella fastidiosa, permitindo a comparação de duas situações distintas em um único experimento. Os experimentos foram feitos utilizando células de X. fastidiosa cultivada em dois meios de cultura: BCYE e XDM2. Pares de oligonucleotídeos iniciadores foram sintetizados, depositados em lâminas de vidro e o arranjo foi hibridizado contra cDNAs marcados fluorescentemente. Os sinais emitidos foram quantificados, normalizados e os dados foram estatisticamente analisados para verificar os genes diferencialmente expressos. De acordo com nossos dados, 104 genes foram diferencialmente expressos para o meio de cultura XDM2 e 30 genes para o BCYE. No presente estudo, nós demonstramos que a técnica de DNA microarrays eficientemente diferencia genes expressos sob diferentes condições de cultivo.

  19. Density based pruning for identification of differentially expressed genes from microarray data

    Directory of Open Access Journals (Sweden)

    Xu Jia

    2010-11-01

    Full Text Available Abstract Motivation Identification of differentially expressed genes from microarray datasets is one of the most important analyses for microarray data mining. Popular algorithms such as statistical t-test rank genes based on a single statistics. The false positive rate of these methods can be improved by considering other features of differentially expressed genes. Results We proposed a pattern recognition strategy for identifying differentially expressed genes. Genes are mapped to a two dimension feature space composed of average difference of gene expression and average expression levels. A density based pruning algorithm (DB Pruning is developed to screen out potential differentially expressed genes usually located in the sparse boundary region. Biases of popular algorithms for identifying differentially expressed genes are visually characterized. Experiments on 17 datasets from Gene Omnibus Database (GEO with experimentally verified differentially expressed genes showed that DB pruning can significantly improve the prediction accuracy of popular identification algorithms such as t-test, rank product, and fold change. Conclusions Density based pruning of non-differentially expressed genes is an effective method for enhancing statistical testing based algorithms for identifying differentially expressed genes. It improves t-test, rank product, and fold change by 11% to 50% in the numbers of identified true differentially expressed genes. The source code of DB pruning is freely available on our website http://mleg.cse.sc.edu/degprune

  20. Array2BIO: from microarray expression data to functional annotation of co-regulated genes

    Directory of Open Access Journals (Sweden)

    Rasley Amy

    2006-06-01

    Full Text Available Abstract Background There are several isolated tools for partial analysis of microarray expression data. To provide an integrative, easy-to-use and automated toolkit for the analysis of Affymetrix microarray expression data we have developed Array2BIO, an application that couples several analytical methods into a single web based utility. Results Array2BIO converts raw intensities into probe expression values, automatically maps those to genes, and subsequently identifies groups of co-expressed genes using two complementary approaches: (1 comparative analysis of signal versus control and (2 clustering analysis of gene expression across different conditions. The identified genes are assigned to functional categories based on Gene Ontology classification and KEGG protein interaction pathways. Array2BIO reliably handles low-expressor genes and provides a set of statistical methods for quantifying expression levels, including Benjamini-Hochberg and Bonferroni multiple testing corrections. An automated interface with the ECR Browser provides evolutionary conservation analysis for the identified gene loci while the interconnection with Crème allows prediction of gene regulatory elements that underlie observed expression patterns. Conclusion We have developed Array2BIO – a web based tool for rapid comprehensive analysis of Affymetrix microarray expression data, which also allows users to link expression data to Dcode.org comparative genomics tools and integrates a system for translating co-expression data into mechanisms of gene co-regulation. Array2BIO is publicly available at http://array2bio.dcode.org.

  1. Identification of a novel uromodulin-like gene related to predator-induced bulgy morph in anuran tadpoles by functional microarray analysis.

    Directory of Open Access Journals (Sweden)

    Tsukasa Mori

    2009-06-01

    Full Text Available Tadpoles of the anuran species Rana pirica can undergo predator-specific morphological responses. Exposure to a predation threat by larvae of the salamander Hynobius retardatus results in formation of a bulgy body (bulgy morph with a higher tail. The tadpoles revert to a normal phenotype upon removal of the larval salamander threat. Although predator-induced phenotypic plasticity is of major interest to evolutionary ecologists, the molecular and physiological mechanisms that control this response have yet to be elucidated. In a previous study, we identified various genes that are expressed in the skin of the bulgy morph. However, it proved difficult to determine which of these were key genes in the control of gene expression associated with the bulgy phenotype. Here, we show that a novel gene plays an important role in the phenotypic plasticity producing the bulgy morph. A functional microarray analysis using facial tissue samples of control and bulgy morph tadpoles identified candidate functional genes for predator-specific morphological responses. A larger functional microarray was prepared than in the previous study and used to analyze mRNAs extracted from facial and brain tissues of tadpoles from induction-reversion experiments. We found that a novel uromodulin-like gene, which we name here pirica, was up-regulated and that keratin genes were down-regulated as the period of exposure to larval salamanders increased. Pirica consists of a 1296 bp open reading frame, which is putatively translated into a protein of 432 amino acids. The protein contains a zona pellucida domain similar to that of proteins that function to control water permeability. We found that the gene was expressed in the superficial epidermis of the tadpole skin.

  2. Using rule-based machine learning for candidate disease gene prioritization and sample classification of cancer gene expression data.

    Directory of Open Access Journals (Sweden)

    Enrico Glaab

    Full Text Available Microarray data analysis has been shown to provide an effective tool for studying cancer and genetic diseases. Although classical machine learning techniques have successfully been applied to find informative genes and to predict class labels for new samples, common restrictions of microarray analysis such as small sample sizes, a large attribute space and high noise levels still limit its scientific and clinical applications. Increasing the interpretability of prediction models while retaining a high accuracy would help to exploit the information content in microarray data more effectively. For this purpose, we evaluate our rule-based evolutionary machine learning systems, BioHEL and GAssist, on three public microarray cancer datasets, obtaining simple rule-based models for sample classification. A comparison with other benchmark microarray sample classifiers based on three diverse feature selection algorithms suggests that these evolutionary learning techniques can compete with state-of-the-art methods like support vector machines. The obtained models reach accuracies above 90% in two-level external cross-validation, with the added value of facilitating interpretation by using only combinations of simple if-then-else rules. As a further benefit, a literature mining analysis reveals that prioritizations of informative genes extracted from BioHEL's classification rule sets can outperform gene rankings obtained from a conventional ensemble feature selection in terms of the pointwise mutual information between relevant disease terms and the standardized names of top-ranked genes.

  3. A salmonid EST genomic study: genes, duplications, phylogeny and microarrays

    Directory of Open Access Journals (Sweden)

    Brahmbhatt Sonal

    2008-11-01

    Full Text Available Abstract Background Salmonids are of interest because of their relatively recent genome duplication, and their extensive use in wild fisheries and aquaculture. A comprehensive gene list and a comparison of genes in some of the different species provide valuable genomic information for one of the most widely studied groups of fish. Results 298,304 expressed sequence tags (ESTs from Atlantic salmon (69% of the total, 11,664 chinook, 10,813 sockeye, 10,051 brook trout, 10,975 grayling, 8,630 lake whitefish, and 3,624 northern pike ESTs were obtained in this study and have been deposited into the public databases. Contigs were built and putative full-length Atlantic salmon clones have been identified. A database containing ESTs, assemblies, consensus sequences, open reading frames, gene predictions and putative annotation is available. The overall similarity between Atlantic salmon ESTs and those of rainbow trout, chinook, sockeye, brook trout, grayling, lake whitefish, northern pike and rainbow smelt is 93.4, 94.2, 94.6, 94.4, 92.5, 91.7, 89.6, and 86.2% respectively. An analysis of 78 transcript sets show Salmo as a sister group to Oncorhynchus and Salvelinus within Salmoninae, and Thymallinae as a sister group to Salmoninae and Coregoninae within Salmonidae. Extensive gene duplication is consistent with a genome duplication in the common ancestor of salmonids. Using all of the available EST data, a new expanded salmonid cDNA microarray of 32,000 features was created. Cross-species hybridizations to this cDNA microarray indicate that this resource will be useful for studies of all 68 salmonid species. Conclusion An extensive collection and analysis of salmonid RNA putative transcripts indicate that Pacific salmon, Atlantic salmon and charr are 94–96% similar while the more distant whitefish, grayling, pike and smelt are 93, 92, 89 and 86% similar to salmon. The salmonid transcriptome reveals a complex history of gene duplication that is

  4. A high-density transcript linkage map with 1,845 expressed genes positioned by microarray-based Single Feature Polymorphisms (SFP) in Eucalyptus

    Science.gov (United States)

    2011-01-01

    Background Technological advances are progressively increasing the application of genomics to a wider array of economically and ecologically important species. High-density maps enriched for transcribed genes facilitate the discovery of connections between genes and phenotypes. We report the construction of a high-density linkage map of expressed genes for the heterozygous genome of Eucalyptus using Single Feature Polymorphism (SFP) markers. Results SFP discovery and mapping was achieved using pseudo-testcross screening and selective mapping to simultaneously optimize linkage mapping and microarray costs. SFP genotyping was carried out by hybridizing complementary RNA prepared from 4.5 year-old trees xylem to an SFP array containing 103,000 25-mer oligonucleotide probes representing 20,726 unigenes derived from a modest size expressed sequence tags collection. An SFP-mapping microarray with 43,777 selected candidate SFP probes representing 15,698 genes was subsequently designed and used to genotype SFPs in a larger subset of the segregating population drawn by selective mapping. A total of 1,845 genes were mapped, with 884 of them ordered with high likelihood support on a framework map anchored to 180 microsatellites with average density of 1.2 cM. Using more probes per unigene increased by two-fold the likelihood of detecting segregating SFPs eventually resulting in more genes mapped. In silico validation showed that 87% of the SFPs map to the expected location on the 4.5X draft sequence of the Eucalyptus grandis genome. Conclusions The Eucalyptus 1,845 gene map is the most highly enriched map for transcriptional information for any forest tree species to date. It represents a major improvement on the number of genes previously positioned on Eucalyptus maps and provides an initial glimpse at the gene space for this global tree genome. A general protocol is proposed to build high-density transcript linkage maps in less characterized plant species by SFP genotyping

  5. Development, characterization and experimental validation of a cultivated sunflower (Helianthus annuus L. gene expression oligonucleotide microarray.

    Directory of Open Access Journals (Sweden)

    Paula Fernandez

    Full Text Available Oligonucleotide-based microarrays with accurate gene coverage represent a key strategy for transcriptional studies in orphan species such as sunflower, H. annuus L., which lacks full genome sequences. The goal of this study was the development and functional annotation of a comprehensive sunflower unigene collection and the design and validation of a custom sunflower oligonucleotide-based microarray. A large scale EST (>130,000 ESTs curation, assembly and sequence annotation was performed using Blast2GO (www.blast2go.de. The EST assembly comprises 41,013 putative transcripts (12,924 contigs and 28,089 singletons. The resulting Sunflower Unigen Resource (SUR version 1.0 was used to design an oligonucleotide-based Agilent microarray for cultivated sunflower. This microarray includes a total of 42,326 features: 1,417 Agilent controls, 74 control probes for sunflower replicated 10 times (740 controls and 40,169 different non-control probes. Microarray performance was validated using a model experiment examining the induction of senescence by water deficit. Pre-processing and differential expression analysis of Agilent microarrays was performed using the Bioconductor limma package. The analyses based on p-values calculated by eBayes (p<0.01 allowed the detection of 558 differentially expressed genes between water stress and control conditions; from these, ten genes were further validated by qPCR. Over-represented ontologies were identified using FatiScan in the Babelomics suite. This work generated a curated and trustable sunflower unigene collection, and a custom, validated sunflower oligonucleotide-based microarray using Agilent technology. Both the curated unigene collection and the validated oligonucleotide microarray provide key resources for sunflower genome analysis, transcriptional studies, and molecular breeding for crop improvement.

  6. Data Integration for Microarrays: Enhanced Inference for Gene Regulatory Networks

    Directory of Open Access Journals (Sweden)

    Alina Sîrbu

    2015-05-01

    Full Text Available Microarray technologies have been the basis of numerous important findings regarding gene expression in the few last decades. Studies have generated large amounts of data describing various processes, which, due to the existence of public databases, are widely available for further analysis. Given their lower cost and higher maturity compared to newer sequencing technologies, these data continue to be produced, even though data quality has been the subject of some debate. However, given the large volume of data generated, integration can help overcome some issues related, e.g., to noise or reduced time resolution, while providing additional insight on features not directly addressed by sequencing methods. Here, we present an integration test case based on public Drosophila melanogaster datasets (gene expression, binding site affinities, known interactions. Using an evolutionary computation framework, we show how integration can enhance the ability to recover transcriptional gene regulatory networks from these data, as well as indicating which data types are more important for quantitative and qualitative network inference. Our results show a clear improvement in performance when multiple datasets are integrated, indicating that microarray data will remain a valuable and viable resource for some time to come.

  7. Data Integration for Microarrays: Enhanced Inference for Gene Regulatory Networks.

    Science.gov (United States)

    Sîrbu, Alina; Crane, Martin; Ruskin, Heather J

    2015-05-14

    Microarray technologies have been the basis of numerous important findings regarding gene expression in the few last decades. Studies have generated large amounts of data describing various processes, which, due to the existence of public databases, are widely available for further analysis. Given their lower cost and higher maturity compared to newer sequencing technologies, these data continue to be produced, even though data quality has been the subject of some debate. However, given the large volume of data generated, integration can help overcome some issues related, e.g., to noise or reduced time resolution, while providing additional insight on features not directly addressed by sequencing methods. Here, we present an integration test case based on public Drosophila melanogaster datasets (gene expression, binding site affinities, known interactions). Using an evolutionary computation framework, we show how integration can enhance the ability to recover transcriptional gene regulatory networks from these data, as well as indicating which data types are more important for quantitative and qualitative network inference. Our results show a clear improvement in performance when multiple datasets are integrated, indicating that microarray data will remain a valuable and viable resource for some time to come.

  8. Methods for interpreting lists of affected genes obtained in a DNA microarray experiment

    Directory of Open Access Journals (Sweden)

    Hedegaard Jakob

    2009-07-01

    Full Text Available Abstract Background The aim of this paper was to describe and compare the methods used and the results obtained by the participants in a joint EADGENE (European Animal Disease Genomic Network of Excellence and SABRE (Cutting Edge Genomics for Sustainable Animal Breeding workshop focusing on post analysis of microarray data. The participating groups were provided with identical lists of microarray probes, including test statistics for three different contrasts, and the normalised log-ratios for each array, to be used as the starting point for interpreting the affected probes. The data originated from a microarray experiment conducted to study the host reactions in broilers occurring shortly after a secondary challenge with either a homologous or heterologous species of Eimeria. Results Several conceptually different analytical approaches, using both commercial and public available software, were applied by the participating groups. The following tools were used: Ingenuity Pathway Analysis, MAPPFinder, LIMMA, GOstats, GOEAST, GOTM, Globaltest, TopGO, ArrayUnlock, Pathway Studio, GIST and AnnotationDbi. The main focus of the approaches was to utilise the relation between probes/genes and their gene ontology and pathways to interpret the affected probes/genes. The lack of a well-annotated chicken genome did though limit the possibilities to fully explore the tools. The main results from these analyses showed that the biological interpretation is highly dependent on the statistical method used but that some common biological conclusions could be reached. Conclusion It is highly recommended to test different analytical methods on the same data set and compare the results to obtain a reliable biological interpretation of the affected genes in a DNA microarray experiment.

  9. Identification of differentially expressed genes in cutaneous squamous cell carcinoma by microarray expression profiling

    Directory of Open Access Journals (Sweden)

    Sterry Wolfram

    2006-08-01

    Full Text Available Abstract Background Carcinogenesis is a multi-step process indicated by several genes up- or down-regulated during tumor progression. This study examined and identified differentially expressed genes in cutaneous squamous cell carcinoma (SCC. Results Three different biopsies of 5 immunosuppressed organ-transplanted recipients each normal skin (all were pooled, actinic keratosis (AK (two were pooled, and invasive SCC and additionally 5 normal skin tissues from immunocompetent patients were analyzed. Thus, total RNA of 15 specimens were used for hybridization with Affymetrix HG-U133A microarray technology containing 22,283 genes. Data analyses were performed by prediction analysis of microarrays using nearest shrunken centroids with the threshold 3.5 and ANOVA analysis was independently performed in order to identify differentially expressed genes (p vs. AK and SCC were observed for 118 genes. Conclusion The majority of identified differentially expressed genes in cutaneous SCC were previously not described.

  10. Consistent Differential Expression Pattern (CDEP) on microarray to identify genes related to metastatic behavior.

    Science.gov (United States)

    Tsoi, Lam C; Qin, Tingting; Slate, Elizabeth H; Zheng, W Jim

    2011-11-11

    To utilize the large volume of gene expression information generated from different microarray experiments, several meta-analysis techniques have been developed. Despite these efforts, there remain significant challenges to effectively increasing the statistical power and decreasing the Type I error rate while pooling the heterogeneous datasets from public resources. The objective of this study is to develop a novel meta-analysis approach, Consistent Differential Expression Pattern (CDEP), to identify genes with common differential expression patterns across different datasets. We combined False Discovery Rate (FDR) estimation and the non-parametric RankProd approach to estimate the Type I error rate in each microarray dataset of the meta-analysis. These Type I error rates from all datasets were then used to identify genes with common differential expression patterns. Our simulation study showed that CDEP achieved higher statistical power and maintained low Type I error rate when compared with two recently proposed meta-analysis approaches. We applied CDEP to analyze microarray data from different laboratories that compared transcription profiles between metastatic and primary cancer of different types. Many genes identified as differentially expressed consistently across different cancer types are in pathways related to metastatic behavior, such as ECM-receptor interaction, focal adhesion, and blood vessel development. We also identified novel genes such as AMIGO2, Gem, and CXCL11 that have not been shown to associate with, but may play roles in, metastasis. CDEP is a flexible approach that borrows information from each dataset in a meta-analysis in order to identify genes being differentially expressed consistently. We have shown that CDEP can gain higher statistical power than other existing approaches under a variety of settings considered in the simulation study, suggesting its robustness and insensitivity to data variation commonly associated with microarray

  11. Reranking candidate gene models with cross-species comparison for improved gene prediction

    Directory of Open Access Journals (Sweden)

    Pereira Fernando CN

    2008-10-01

    Full Text Available Abstract Background Most gene finders score candidate gene models with state-based methods, typically HMMs, by combining local properties (coding potential, splice donor and acceptor patterns, etc. Competing models with similar state-based scores may be distinguishable with additional information. In particular, functional and comparative genomics datasets may help to select among competing models of comparable probability by exploiting features likely to be associated with the correct gene models, such as conserved exon/intron structure or protein sequence features. Results We have investigated the utility of a simple post-processing step for selecting among a set of alternative gene models, using global scoring rules to rerank competing models for more accurate prediction. For each gene locus, we first generate the K best candidate gene models using the gene finder Evigan, and then rerank these models using comparisons with putative orthologous genes from closely-related species. Candidate gene models with lower scores in the original gene finder may be selected if they exhibit strong similarity to probable orthologs in coding sequence, splice site location, or signal peptide occurrence. Experiments on Drosophila melanogaster demonstrate that reranking based on cross-species comparison outperforms the best gene models identified by Evigan alone, and also outperforms the comparative gene finders GeneWise and Augustus+. Conclusion Reranking gene models with cross-species comparison improves gene prediction accuracy. This straightforward method can be readily adapted to incorporate additional lines of evidence, as it requires only a ranked source of candidate gene models.

  12. Feature selection and classification of MAQC-II breast cancer and multiple myeloma microarray gene expression data.

    Directory of Open Access Journals (Sweden)

    Qingzhong Liu

    Full Text Available Microarray data has a high dimension of variables but available datasets usually have only a small number of samples, thereby making the study of such datasets interesting and challenging. In the task of analyzing microarray data for the purpose of, e.g., predicting gene-disease association, feature selection is very important because it provides a way to handle the high dimensionality by exploiting information redundancy induced by associations among genetic markers. Judicious feature selection in microarray data analysis can result in significant reduction of cost while maintaining or improving the classification or prediction accuracy of learning machines that are employed to sort out the datasets. In this paper, we propose a gene selection method called Recursive Feature Addition (RFA, which combines supervised learning and statistical similarity measures. We compare our method with the following gene selection methods: Support Vector Machine Recursive Feature Elimination (SVMRFE, Leave-One-Out Calculation Sequential Forward Selection (LOOCSFS, Gradient based Leave-one-out Gene Selection (GLGS. To evaluate the performance of these gene selection methods, we employ several popular learning classifiers on the MicroArray Quality Control phase II on predictive modeling (MAQC-II breast cancer dataset and the MAQC-II multiple myeloma dataset. Experimental results show that gene selection is strictly paired with learning classifier. Overall, our approach outperforms other compared methods. The biological functional analysis based on the MAQC-II breast cancer dataset convinced us to apply our method for phenotype prediction. Additionally, learning classifiers also play important roles in the classification of microarray data and our experimental results indicate that the Nearest Mean Scale Classifier (NMSC is a good choice due to its prediction reliability and its stability across the three performance measurements: Testing accuracy, MCC values, and

  13. [Study of generational risk in deafness inflicted couples using deafness gene microarray technique].

    Science.gov (United States)

    Wang, Ping; Zhao, Jia; Yu, Shu-yuan; Jin, Peng; Zhu, Wei; DU, Bo

    2011-06-01

    To explored the significance of screening the gene mutations of deafness related in deaf-mute (deaf & dumb) family using DNA microarray. Total of 52 couples of deaf-mute were recruited from Changchun deaf-mute community. With an average age of (58.3 ± 6.7) years old (x(-) ± s). Blood samples were obtained with informed consent. Their genomic DNA was extracted from peripheral blood and PCR was performed. Nine of hot spot mutations in four most common deafness pathologic gene were examined with the DNA microarray, including GJB2, GJB3, PDS and mtDNA 12S rRNA genes. At the same time, the results were verified with the traditional methods of sequencing. Fifty of normal people served as a control group. All patients were diagnosed non-syndromic sensorineural hearing loss by subjective pure tone audiometry. Thirty-two of 104 cases appeared GJB2 gene mutation (30.7%), the mutation sites included 35delG, 176del16, 235delC and 299delAT. Eighteen of 32 cases of GJB2 mutations were 235delC (59.1%). Seven of 104 cases appeared SLC26A4 gene IVS7-2 A > G mutation. Questionnaire survey and gene diagnosis revealed that four of 52 families have deaf offspring (7.6%). When a couple carries the same gene mutation, the risk of their children deafness was 100%. The results were confirmed with the traditional methods of sequencing. There is a high risk of deafness if a deaf-mute family is planning to have a new baby. It is very important and helpful to avoid deaf newborns again in deaf-mute family by DNA microarray.

  14. 16S rRNA gene-based phylogenetic microarray for simultaneous identification of members of the genus Burkholderia.

    Science.gov (United States)

    Schönmann, Susan; Loy, Alexander; Wimmersberger, Céline; Sobek, Jens; Aquino, Catharine; Vandamme, Peter; Frey, Beat; Rehrauer, Hubert; Eberl, Leo

    2009-04-01

    For cultivation-independent and highly parallel analysis of members of the genus Burkholderia, an oligonucleotide microarray (phylochip) consisting of 131 hierarchically nested 16S rRNA gene-targeted oligonucleotide probes was developed. A novel primer pair was designed for selective amplification of a 1.3 kb 16S rRNA gene fragment of Burkholderia species prior to microarray analysis. The diagnostic performance of the microarray for identification and differentiation of Burkholderia species was tested with 44 reference strains of the genera Burkholderia, Pandoraea, Ralstonia and Limnobacter. Hybridization patterns based on presence/absence of probe signals were interpreted semi-automatically using the novel likelihood-based strategy of the web-tool Phylo- Detect. Eighty-eight per cent of the reference strains were correctly identified at the species level. The evaluated microarray was applied to investigate shifts in the Burkholderia community structure in acidic forest soil upon addition of cadmium, a condition that selected for Burkholderia species. The microarray results were in agreement with those obtained from phylogenetic analysis of Burkholderia 16S rRNA gene sequences recovered from the same cadmiumcontaminated soil, demonstrating the value of the Burkholderia phylochip for determinative and environmental studies.

  15. Implementation of plaid model biclustering method on microarray of carcinoma and adenoma tumor gene expression data

    Science.gov (United States)

    Ardaneswari, Gianinna; Bustamam, Alhadi; Sarwinda, Devvi

    2017-10-01

    A Tumor is an abnormal growth of cells that serves no purpose. Carcinoma is a tumor that grows from the top of the cell membrane and the organ adenoma is a benign tumor of the gland-like cells or epithelial tissue. In the field of molecular biology, the development of microarray technology is used in the data store of disease genetic expression. For each of microarray gene, an amount of information is stored for each trait or condition. In gene expression data clustering can be done with a bicluster algorithm, thats clustering method which not only the objects to be clustered, but also the properties or condition of the object. This research proposed Plaid Model Biclustering as one of biclustering method. In this study, we discuss the implementation of Plaid Model Biclustering Method on microarray of Carcinoma and Adenoma tumor gene expression data. From the experimental results, we found three biclusters are formed by Carcinoma gene expression data and four biclusters are formed by Adenoma gene expression data.

  16. SoFoCles: feature filtering for microarray classification based on gene ontology.

    Science.gov (United States)

    Papachristoudis, Georgios; Diplaris, Sotiris; Mitkas, Pericles A

    2010-02-01

    Marker gene selection has been an important research topic in the classification analysis of gene expression data. Current methods try to reduce the "curse of dimensionality" by using statistical intra-feature set calculations, or classifiers that are based on the given dataset. In this paper, we present SoFoCles, an interactive tool that enables semantic feature filtering in microarray classification problems with the use of external, well-defined knowledge retrieved from the Gene Ontology. The notion of semantic similarity is used to derive genes that are involved in the same biological path during the microarray experiment, by enriching a feature set that has been initially produced with legacy methods. Among its other functionalities, SoFoCles offers a large repository of semantic similarity methods that are used in order to derive feature sets and marker genes. The structure and functionality of the tool are discussed in detail, as well as its ability to improve classification accuracy. Through experimental evaluation, SoFoCles is shown to outperform other classification schemes in terms of classification accuracy in two real datasets using different semantic similarity computation approaches.

  17. Mutation Analysis of Consanguineous Moroccan Patients with Parkinson’s Disease Combining Microarray and Gene Panel

    Directory of Open Access Journals (Sweden)

    Ahmed Bouhouche

    2017-10-01

    Full Text Available During the last two decades, 15 different genes have been reported to be responsible for the monogenic form of Parkinson’s disease (PD, representing a worldwide frequency of 5–10%. Among them, 10 genes have been associated with autosomal recessive PD, with PRKN and PINK1 being the most frequent. In a cohort of 145 unrelated Moroccan PD patients enrolled since 2013, 19 patients were born from a consanguineous marriage, of which 15 were isolated cases and 4 familial. One patient was homozygous for the common LRRK2 G2019S mutation and the 18 others who did not carry this mutation were screened for exon rearrangements in the PRKN gene using Affymetrix Cytoscan HD microarray. Two patients were determined homozygous for PRKN exon-deletions, while another patient presented with compound heterozygous inheritance (3/18, 17%. Two other patients showed a region of homozygosity covering the 1p36.12 locus and were sequenced for the candidate PINK1 gene, which revealed two homozygous point mutations: the known Q456X mutation in exon 7 and a novel L539F variation in exon 8. The 13 remaining patients were subjected to next-generation sequencing (NGS that targeted a panel of 22 PD-causing genes and overlapping phenotypes. NGS data showed that two unrelated consanguineous patients with juvenile-onset PD (12 and 13 years carried the same homozygous stop mutation W258X in the ATP13A2 gene, possibly resulting from a founder effect; and one patient with late onset (76 years carried a novel heterozygous frameshift mutation in SYNJ1. Clinical analysis showed that patients with the ATP13A2 mutation developed juvenile-onset PD with a severe phenotype, whereas patients having either PRKN or PINK1 mutations displayed early-onset PD with a relatively mild phenotype. By identifying pathogenic mutations in 45% (8/18 of our consanguineous Moroccan PD series, we demonstrate that the combination of chromosomal microarray analysis and NGS is a powerful approach to

  18. Disease candidate gene identification and prioritization using protein interaction networks

    Directory of Open Access Journals (Sweden)

    Aronow Bruce J

    2009-02-01

    Full Text Available Abstract Background Although most of the current disease candidate gene identification and prioritization methods depend on functional annotations, the coverage of the gene functional annotations is a limiting factor. In the current study, we describe a candidate gene prioritization method that is entirely based on protein-protein interaction network (PPIN analyses. Results For the first time, extended versions of the PageRank and HITS algorithms, and the K-Step Markov method are applied to prioritize disease candidate genes in a training-test schema. Using a list of known disease-related genes from our earlier study as a training set ("seeds", and the rest of the known genes as a test list, we perform large-scale cross validation to rank the candidate genes and also evaluate and compare the performance of our approach. Under appropriate settings – for example, a back probability of 0.3 for PageRank with Priors and HITS with Priors, and step size 6 for K-Step Markov method – the three methods achieved a comparable AUC value, suggesting a similar performance. Conclusion Even though network-based methods are generally not as effective as integrated functional annotation-based methods for disease candidate gene prioritization, in a one-to-one comparison, PPIN-based candidate gene prioritization performs better than all other gene features or annotations. Additionally, we demonstrate that methods used for studying both social and Web networks can be successfully used for disease candidate gene prioritization.

  19. Gene selection and classification for cancer microarray data based on machine learning and similarity measures

    Directory of Open Access Journals (Sweden)

    Liu Qingzhong

    2011-12-01

    Full Text Available Abstract Background Microarray data have a high dimension of variables and a small sample size. In microarray data analyses, two important issues are how to choose genes, which provide reliable and good prediction for disease status, and how to determine the final gene set that is best for classification. Associations among genetic markers mean one can exploit information redundancy to potentially reduce classification cost in terms of time and money. Results To deal with redundant information and improve classification, we propose a gene selection method, Recursive Feature Addition, which combines supervised learning and statistical similarity measures. To determine the final optimal gene set for prediction and classification, we propose an algorithm, Lagging Prediction Peephole Optimization. By using six benchmark microarray gene expression data sets, we compared Recursive Feature Addition with recently developed gene selection methods: Support Vector Machine Recursive Feature Elimination, Leave-One-Out Calculation Sequential Forward Selection and several others. Conclusions On average, with the use of popular learning machines including Nearest Mean Scaled Classifier, Support Vector Machine, Naive Bayes Classifier and Random Forest, Recursive Feature Addition outperformed other methods. Our studies also showed that Lagging Prediction Peephole Optimization is superior to random strategy; Recursive Feature Addition with Lagging Prediction Peephole Optimization obtained better testing accuracies than the gene selection method varSelRF.

  20. GTI: a novel algorithm for identifying outlier gene expression profiles from integrated microarray datasets.

    Directory of Open Access Journals (Sweden)

    John Patrick Mpindi

    Full Text Available BACKGROUND: Meta-analysis of gene expression microarray datasets presents significant challenges for statistical analysis. We developed and validated a new bioinformatic method for the identification of genes upregulated in subsets of samples of a given tumour type ('outlier genes', a hallmark of potential oncogenes. METHODOLOGY: A new statistical method (the gene tissue index, GTI was developed by modifying and adapting algorithms originally developed for statistical problems in economics. We compared the potential of the GTI to detect outlier genes in meta-datasets with four previously defined statistical methods, COPA, the OS statistic, the t-test and ORT, using simulated data. We demonstrated that the GTI performed equally well to existing methods in a single study simulation. Next, we evaluated the performance of the GTI in the analysis of combined Affymetrix gene expression data from several published studies covering 392 normal samples of tissue from the central nervous system, 74 astrocytomas, and 353 glioblastomas. According to the results, the GTI was better able than most of the previous methods to identify known oncogenic outlier genes. In addition, the GTI identified 29 novel outlier genes in glioblastomas, including TYMS and CDKN2A. The over-expression of these genes was validated in vivo by immunohistochemical staining data from clinical glioblastoma samples. Immunohistochemical data were available for 65% (19 of 29 of these genes, and 17 of these 19 genes (90% showed a typical outlier staining pattern. Furthermore, raltitrexed, a specific inhibitor of TYMS used in the therapy of tumour types other than glioblastoma, also effectively blocked cell proliferation in glioblastoma cell lines, thus highlighting this outlier gene candidate as a potential therapeutic target. CONCLUSIONS/SIGNIFICANCE: Taken together, these results support the GTI as a novel approach to identify potential oncogene outliers and drug targets. The algorithm is

  1. Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes

    Directory of Open Access Journals (Sweden)

    Paules Richard S

    2007-11-01

    Full Text Available Abstract Background A common observation in the analysis of gene expression data is that many genes display similarity in their expression patterns and therefore appear to be co-regulated. However, the variation associated with microarray data and the complexity of the experimental designs make the acquisition of co-expressed genes a challenge. We developed a novel method for Extracting microarray gene expression Patterns and Identifying co-expressed Genes, designated as EPIG. The approach utilizes the underlying structure of gene expression data to extract patterns and identify co-expressed genes that are responsive to experimental conditions. Results Through evaluation of the correlations among profiles, the magnitude of variation in gene expression profiles, and profile signal-to-noise ratio's, EPIG extracts a set of patterns representing co-expressed genes. The method is shown to work well with a simulated data set and microarray data obtained from time-series studies of dauer recovery and L1 starvation in C. elegans and after ultraviolet (UV or ionizing radiation (IR-induced DNA damage in diploid human fibroblasts. With the simulated data set, EPIG extracted the appropriate number of patterns which were more stable and homogeneous than the set of patterns that were determined using the CLICK or CAST clustering algorithms. However, CLICK performed better than EPIG and CAST with respect to the average correlation between clusters/patterns of the simulated data. With real biological data, EPIG extracted more dauer-specific patterns than CLICK. Furthermore, analysis of the IR/UV data revealed 18 unique patterns and 2661 genes out of approximately 17,000 that were identified as significantly expressed and categorized to the patterns by EPIG. The time-dependent patterns displayed similar and dissimilar responses between IR and UV treatments. Gene Ontology analysis applied to each pattern-related subset of co-expressed genes revealed underlying

  2. Replicate high-density rat genome oligonucleotide microarrays reveal hundreds of regulated genes in the dorsal root ganglion after peripheral nerve injury.

    Directory of Open Access Journals (Sweden)

    Mannion James W

    2002-10-01

    Full Text Available Abstract Background Rat oligonucleotide microarrays were used to detect changes in gene expression in the dorsal root ganglion (DRG 3 days following sciatic nerve transection (axotomy. Two comparisons were made using two sets of triplicate microarrays, naïve versus naïve and naïve versus axotomy. Results Microarray variability was assessed using the naïve versus naïve comparison. These results support use of a P 1.5-fold expression change and P 1.5-fold and P in situ hybridization verified the expression of 24 transcripts. These data showed an 83% concordance rate with the arrays; most mismatches represent genes with low expression levels reflecting limits of array sensitivity. A significant correlation was found between actual mRNA differences and relative changes between microarrays (r2 = 0.8567. Temporal patterns of individual genes regulation varied. Conclusions We identify parameters for microarray analysis which reduce error while identifying many putatively regulated genes. Functional classification of these genes suggest reorganization of cell structural components, activation of genes expressed by immune and inflammatory cells and down-regulation of genes involved in neurotransmission.

  3. The 'PUCE CAFE' Project: the first 15K coffee microarray, a new tool for discovering candidate genes correlated to agronomic and quality traits.

    Science.gov (United States)

    Privat, Isabelle; Bardil, Amélie; Gomez, Aureliano Bombarely; Severac, Dany; Dantec, Christelle; Fuentes, Ivanna; Mueller, Lukas; Joët, Thierry; Pot, David; Foucrier, Séverine; Dussert, Stéphane; Leroy, Thierry; Journot, Laurent; de Kochko, Alexandre; Campa, Claudine; Combes, Marie-Christine; Lashermes, Philippe; Bertrand, Benoit

    2011-01-05

    Understanding the genetic elements that contribute to key aspects of coffee biology will have an impact on future agronomical improvements for this economically important tree. During the past years, EST collections were generated in Coffee, opening the possibility to create new tools for functional genomics. The "PUCE CAFE" Project, organized by the scientific consortium NESTLE/IRD/CIRAD, has developed an oligo-based microarray using 15,721 unigenes derived from published coffee EST sequences mostly obtained from different stages of fruit development and leaves in Coffea Canephora (Robusta). Hybridizations for two independent experiments served to compare global gene expression profiles in three types of tissue matter (mature beans, leaves and flowers) in C. canephora as well as in the leaves of three different coffee species (C. canephora, C. eugenoides and C. arabica). Microarray construction, statistical analyses and validation by Q-PCR analysis are presented in this study. We have generated the first 15 K coffee array during this PUCE CAFE project, granted by Génoplante (the French consortium for plant genomics). This new tool will help study functional genomics in a wide range of experiments on various plant tissues, such as analyzing bean maturation or resistance to pathogens or drought. Furthermore, the use of this array has proven to be valid in different coffee species (diploid or tetraploid), drastically enlarging its impact for high-throughput gene expression in the community of coffee research.

  4. The 'PUCE CAFE' Project: the First 15K Coffee Microarray, a New Tool for Discovering Candidate Genes correlated to Agronomic and Quality Traits

    Directory of Open Access Journals (Sweden)

    Leroy Thierry

    2011-01-01

    Full Text Available Abstract Background Understanding the genetic elements that contribute to key aspects of coffee biology will have an impact on future agronomical improvements for this economically important tree. During the past years, EST collections were generated in Coffee, opening the possibility to create new tools for functional genomics. Results The "PUCE CAFE" Project, organized by the scientific consortium NESTLE/IRD/CIRAD, has developed an oligo-based microarray using 15,721 unigenes derived from published coffee EST sequences mostly obtained from different stages of fruit development and leaves in Coffea Canephora (Robusta. Hybridizations for two independent experiments served to compare global gene expression profiles in three types of tissue matter (mature beans, leaves and flowers in C. canephora as well as in the leaves of three different coffee species (C. canephora, C. eugenoides and C. arabica. Microarray construction, statistical analyses and validation by Q-PCR analysis are presented in this study. Conclusion We have generated the first 15 K coffee array during this PUCE CAFE project, granted by Génoplante (the French consortium for plant genomics. This new tool will help study functional genomics in a wide range of experiments on various plant tissues, such as analyzing bean maturation or resistance to pathogens or drought. Furthermore, the use of this array has proven to be valid in different coffee species (diploid or tetraploid, drastically enlarging its impact for high-throughput gene expression in the community of coffee research.

  5. Reproducibility of gene expression across generations of Affymetrix microarrays

    Directory of Open Access Journals (Sweden)

    Haslett Judith N

    2003-06-01

    Full Text Available Abstract Background The development of large-scale gene expression profiling technologies is rapidly changing the norms of biological investigation. But the rapid pace of change itself presents challenges. Commercial microarrays are regularly modified to incorporate new genes and improved target sequences. Although the ability to compare datasets across generations is crucial for any long-term research project, to date no means to allow such comparisons have been developed. In this study the reproducibility of gene expression levels across two generations of Affymetrix GeneChips® (HuGeneFL and HG-U95A was measured. Results Correlation coefficients were computed for gene expression values across chip generations based on different measures of similarity. Comparing the absolute calls assigned to the individual probe sets across the generations found them to be largely unchanged. Conclusion We show that experimental replicates are highly reproducible, but that reproducibility across generations depends on the degree of similarity of the probe sets and the expression level of the corresponding transcript.

  6. Identification of a radiosensitivity signature using integrative metaanalysis of published microarray data for NCI-60 cancer cells

    Directory of Open Access Journals (Sweden)

    Kim Han

    2012-07-01

    Full Text Available Abstract Background In the postgenome era, a prediction of response to treatment could lead to better dose selection for patients in radiotherapy. To identify a radiosensitive gene signature and elucidate related signaling pathways, four different microarray experiments were reanalyzed before radiotherapy. Results Radiosensitivity profiling data using clonogenic assay and gene expression profiling data from four published microarray platforms applied to NCI-60 cancer cell panel were used. The survival fraction at 2 Gy (SF2, range from 0 to 1 was calculated as a measure of radiosensitivity and a linear regression model was applied to identify genes or a gene set with a correlation between expression and radiosensitivity (SF2. Radiosensitivity signature genes were identified using significant analysis of microarrays (SAM and gene set analysis was performed using a global test using linear regression model. Using the radiation-related signaling pathway and identified genes, a genetic network was generated. According to SAM, 31 genes were identified as common to all the microarray platforms and therefore a common radiosensitivity signature. In gene set analysis, functions in the cell cycle, DNA replication, and cell junction, including adherence and gap junctions were related to radiosensitivity. The integrin, VEGF, MAPK, p53, JAK-STAT and Wnt signaling pathways were overrepresented in radiosensitivity. Significant genes including ACTN1, CCND1, HCLS1, ITGB5, PFN2, PTPRC, RAB13, and WAS, which are adhesion-related molecules that were identified by both SAM and gene set analysis, and showed interaction in the genetic network with the integrin signaling pathway. Conclusions Integration of four different microarray experiments and gene selection using gene set analysis discovered possible target genes and pathways relevant to radiosensitivity. Our results suggested that the identified genes are candidates for radiosensitivity biomarkers and that

  7. Candidate gene studies and the quest for the entrepreneurial gene

    NARCIS (Netherlands)

    M.J.H.M. van der Loos (Matthijs); Ph.D. Koellinger (Philipp); P.J.F. Groenen (Patrick); C.A. Rietveld (Niels); F. Rivadeneira Ramirez (Fernando); F.J.A. van Rooij (Frank); A.G. Uitterlinden (André); A. Hofman (Albert); A.R. Thurik (Roy)

    2011-01-01

    textabstractCandidate gene studies of human behavior are gaining interest in economics and entrepreneurship research. Performing and interpreting these studies is not straightforward because the selection of candidates influences the interpretation of the results. As an example, Nicolaou et al.

  8. The prediction of candidate genes for cervix related cancer through gene ontology and graph theoretical approach.

    Science.gov (United States)

    Hindumathi, V; Kranthi, T; Rao, S B; Manimaran, P

    2014-06-01

    With rapidly changing technology, prediction of candidate genes has become an indispensable task in recent years mainly in the field of biological research. The empirical methods for candidate gene prioritization that succors to explore the potential pathway between genetic determinants and complex diseases are highly cumbersome and labor intensive. In such a scenario predicting potential targets for a disease state through in silico approaches are of researcher's interest. The prodigious availability of protein interaction data coupled with gene annotation renders an ease in the accurate determination of disease specific candidate genes. In our work we have prioritized the cervix related cancer candidate genes by employing Csaba Ortutay and his co-workers approach of identifying the candidate genes through graph theoretical centrality measures and gene ontology. With the advantage of the human protein interaction data, cervical cancer gene sets and the ontological terms, we were able to predict 15 novel candidates for cervical carcinogenesis. The disease relevance of the anticipated candidate genes was corroborated through a literature survey. Also the presence of the drugs for these candidates was detected through Therapeutic Target Database (TTD) and DrugMap Central (DMC) which affirms that they may be endowed as potential drug targets for cervical cancer.

  9. Age-Specific Gene Expression Profiles of Rhesus Monkey Ovaries Detected by Microarray Analysis

    Directory of Open Access Journals (Sweden)

    Hengxi Wei

    2015-01-01

    Full Text Available The biological function of human ovaries declines with age. To identify the potential molecular changes in ovarian aging, we performed genome-wide gene expression analysis by microarray of ovaries from young, middle-aged, and old rhesus monkeys. Microarray data was validated by quantitative real-time PCR. Results showed that a total of 503 (60 upregulated, 443 downregulated and 84 (downregulated genes were differentially expressed in old ovaries compared to young and middle-aged groups, respectively. No difference in gene expression was found between middle-aged and young groups. Differentially expressed genes were mainly enriched in cell and organelle, cellular and physiological process, binding, and catalytic activity. These genes were primarily associated with KEGG pathways of cell cycle, DNA replication and repair, oocyte meiosis and maturation, MAPK, TGF-beta, and p53 signaling pathway. Genes upregulated were involved in aging, defense response, oxidation reduction, and negative regulation of cellular process; genes downregulated have functions in reproduction, cell cycle, DNA and RNA process, macromolecular complex assembly, and positive regulation of macromolecule metabolic process. These findings show that monkey ovary undergoes substantial change in global transcription with age. Gene expression profiles are useful in understanding the mechanisms underlying ovarian aging and age-associated infertility in primates.

  10. Microarray analysis of gene expression by skeletal muscle of three mouse models of Kennedy disease/spinal bulbar muscular atrophy.

    Directory of Open Access Journals (Sweden)

    Kaiguo Mo

    2010-09-01

    Full Text Available Emerging evidence implicates altered gene expression within skeletal muscle in the pathogenesis of Kennedy disease/spinal bulbar muscular atrophy (KD/SBMA. We therefore broadly characterized gene expression in skeletal muscle of three independently generated mouse models of this disease. The mouse models included a polyglutamine expanded (polyQ AR knock-in model (AR113Q, a polyQ AR transgenic model (AR97Q, and a transgenic mouse that overexpresses wild type AR solely in skeletal muscle (HSA-AR. HSA-AR mice were included because they substantially reproduce the KD/SBMA phenotype despite the absence of polyQ AR.We performed microarray analysis of lower hindlimb muscles taken from these three models relative to wild type controls using high density oligonucleotide arrays. All microarray comparisons were made with at least 3 animals in each condition, and only those genes having at least 2-fold difference and whose coefficient of variance was less than 100% were considered to be differentially expressed. When considered globally, there was a similar overlap in gene changes between the 3 models: 19% between HSA-AR and AR97Q, 21% between AR97Q and AR113Q, and 17% between HSA-AR and AR113Q, with 8% shared by all models. Several patterns of gene expression relevant to the disease process were observed. Notably, patterns of gene expression typical of loss of AR function were observed in all three models, as were alterations in genes involved in cell adhesion, energy balance, muscle atrophy and myogenesis. We additionally measured changes similar to those observed in skeletal muscle of a mouse model of Huntington's Disease, and to those common to muscle atrophy from diverse causes.By comparing patterns of gene expression in three independent models of KD/SBMA, we have been able to identify candidate genes that might mediate the core myogenic features of KD/SBMA.

  11. Xylella fastidiosa gene expression analysis by DNA microarrays

    Directory of Open Access Journals (Sweden)

    Regiane F. Travensolo

    2009-01-01

    Full Text Available Xylella fastidiosa genome sequencing has generated valuable data by identifying genes acting either on metabolic pathways or in associated pathogenicity and virulence. Based on available information on these genes, new strategies for studying their expression patterns, such as microarray technology, were employed. A total of 2,600 primer pairs were synthesized and then used to generate fragments using the PCR technique. The arrays were hybridized against cDNAs labeled during reverse transcription reactions and which were obtained from bacteria grown under two different conditions (liquid XDM2 and liquid BCYE. All data were statistically analyzed to verify which genes were differentially expressed. In addition to exploring conditions for X. fastidiosa genome-wide transcriptome analysis, the present work observed the differential expression of several classes of genes (energy, protein, amino acid and nucleotide metabolism, transport, degradation of substances, toxins and hypothetical proteins, among others. The understanding of expressed genes in these two different media will be useful in comprehending the metabolic characteristics of X. fastidiosa, and in evaluating how important certain genes are for the functioning and survival of these bacteria in plants.

  12. Biomarker Identification for Prostate Cancer and Lymph Node Metastasis from Microarray Data and Protein Interaction Network Using Gene Prioritization Method

    Directory of Open Access Journals (Sweden)

    Carlos Roberto Arias

    2012-01-01

    Full Text Available Finding a genetic disease-related gene is not a trivial task. Therefore, computational methods are needed to present clues to the biomedical community to explore genes that are more likely to be related to a specific disease as biomarker. We present biomarker identification problem using gene prioritization method called gene prioritization from microarray data based on shortest paths, extended with structural and biological properties and edge flux using voting scheme (GP-MIDAS-VXEF. The method is based on finding relevant interactions on protein interaction networks, then scoring the genes using shortest paths and topological analysis, integrating the results using a voting scheme and a biological boosting. We applied two experiments, one is prostate primary and normal samples and the other is prostate primary tumor with and without lymph nodes metastasis. We used 137 truly prostate cancer genes as benchmark. In the first experiment, GP-MIDAS-VXEF outperforms all the other state-of-the-art methods in the benchmark by retrieving the truest related genes from the candidate set in the top 50 scores found. We applied the same technique to infer the significant biomarkers in prostate cancer with lymph nodes metastasis which is not established well.

  13. Functional microarray analysis of nitrogen and carbon cycling genes across an Antarctic latitudinal transect.

    NARCIS (Netherlands)

    Yergeau, E.; Kang, S.; He, Z.; Zhou, J.; Kowalchuk, G.A.

    2007-01-01

    Soil-borne microbial communities were examined via a functional gene microarray approach across a southern polar latitudinal gradient to gain insight into the environmental factors steering soil N- and C-cycling in terrestrial Antarctic ecosystems. The abundance and diversity of functional gene

  14. Previously unidentified changes in renal cell carcinoma gene expression identified by parametric analysis of microarray data

    International Nuclear Information System (INIS)

    Lenburg, Marc E; Liou, Louis S; Gerry, Norman P; Frampton, Garrett M; Cohen, Herbert T; Christman, Michael F

    2003-01-01

    Renal cell carcinoma is a common malignancy that often presents as a metastatic-disease for which there are no effective treatments. To gain insights into the mechanism of renal cell carcinogenesis, a number of genome-wide expression profiling studies have been performed. Surprisingly, there is very poor agreement among these studies as to which genes are differentially regulated. To better understand this lack of agreement we profiled renal cell tumor gene expression using genome-wide microarrays (45,000 probe sets) and compare our analysis to previous microarray studies. We hybridized total RNA isolated from renal cell tumors and adjacent normal tissue to Affymetrix U133A and U133B arrays. We removed samples with technical defects and removed probesets that failed to exhibit sequence-specific hybridization in any of the samples. We detected differential gene expression in the resulting dataset with parametric methods and identified keywords that are overrepresented in the differentially expressed genes with the Fisher-exact test. We identify 1,234 genes that are more than three-fold changed in renal tumors by t-test, 800 of which have not been previously reported to be altered in renal cell tumors. Of the only 37 genes that have been identified as being differentially expressed in three or more of five previous microarray studies of renal tumor gene expression, our analysis finds 33 of these genes (89%). A key to the sensitivity and power of our analysis is filtering out defective samples and genes that are not reliably detected. The widespread use of sample-wise voting schemes for detecting differential expression that do not control for false positives likely account for the poor overlap among previous studies. Among the many genes we identified using parametric methods that were not previously reported as being differentially expressed in renal cell tumors are several oncogenes and tumor suppressor genes that likely play important roles in renal cell

  15. Finding gene regulatory network candidates using the gene expression knowledge base.

    Science.gov (United States)

    Venkatesan, Aravind; Tripathi, Sushil; Sanz de Galdeano, Alejandro; Blondé, Ward; Lægreid, Astrid; Mironov, Vladimir; Kuiper, Martin

    2014-12-10

    Network-based approaches for the analysis of large-scale genomics data have become well established. Biological networks provide a knowledge scaffold against which the patterns and dynamics of 'omics' data can be interpreted. The background information required for the construction of such networks is often dispersed across a multitude of knowledge bases in a variety of formats. The seamless integration of this information is one of the main challenges in bioinformatics. The Semantic Web offers powerful technologies for the assembly of integrated knowledge bases that are computationally comprehensible, thereby providing a potentially powerful resource for constructing biological networks and network-based analysis. We have developed the Gene eXpression Knowledge Base (GeXKB), a semantic web technology based resource that contains integrated knowledge about gene expression regulation. To affirm the utility of GeXKB we demonstrate how this resource can be exploited for the identification of candidate regulatory network proteins. We present four use cases that were designed from a biological perspective in order to find candidate members relevant for the gastrin hormone signaling network model. We show how a combination of specific query definitions and additional selection criteria derived from gene expression data and prior knowledge concerning candidate proteins can be used to retrieve a set of proteins that constitute valid candidates for regulatory network extensions. Semantic web technologies provide the means for processing and integrating various heterogeneous information sources. The GeXKB offers biologists such an integrated knowledge resource, allowing them to address complex biological questions pertaining to gene expression. This work illustrates how GeXKB can be used in combination with gene expression results and literature information to identify new potential candidates that may be considered for extending a gene regulatory network.

  16. Supplementing High-Density SNP Microarrays for Additional Coverage of Disease-Related Genes: Addiction as a Paradigm

    Energy Technology Data Exchange (ETDEWEB)

    SacconePhD, Scott F [Washington University, St. Louis; Chesler, Elissa J [ORNL; Bierut, Laura J [Washington University, St. Louis; Kalivas, Peter J [Medical College of South Carolina, Charleston; Lerman, Caryn [University of Pennsylvania; Saccone, Nancy L [Washington University, St. Louis; Uhl, George R [Johns Hopkins University; Li, Chuan-Yun [Peking University; Philip, Vivek M [ORNL; Edenberg, Howard [Indiana University; Sherry, Steven [National Center for Biotechnology Information; Feolo, Michael [National Center for Biotechnology Information; Moyzis, Robert K [Johns Hopkins University; Rutter, Joni L [National Institute of Drug Abuse

    2009-01-01

    Commercial SNP microarrays now provide comprehensive and affordable coverage of the human genome. However, some diseases have biologically relevant genomic regions that may require additional coverage. Addiction, for example, is thought to be influenced by complex interactions among many relevant genes and pathways. We have assembled a list of 486 biologically relevant genes nominated by a panel of experts on addiction. We then added 424 genes that showed evidence of association with addiction phenotypes through mouse QTL mappings and gene co-expression analysis. We demonstrate that there are a substantial number of SNPs in these genes that are not well represented by commercial SNP platforms. We address this problem by introducing a publicly available SNP database for addiction. The database is annotated using numeric prioritization scores indicating the extent of biological relevance. The scores incorporate a number of factors such as SNP/gene functional properties (including synonymy and promoter regions), data from mouse systems genetics and measures of human/mouse evolutionary conservation. We then used HapMap genotyping data to determine if a SNP is tagged by a commercial microarray through linkage disequilibrium. This combination of biological prioritization scores and LD tagging annotation will enable addiction researchers to supplement commercial SNP microarrays to ensure comprehensive coverage of biologically relevant regions.

  17. Training ANFIS structure using genetic algorithm for liver cancer classification based on microarray gene expression data

    Directory of Open Access Journals (Sweden)

    Bülent Haznedar

    2017-02-01

    Full Text Available Classification is an important data mining technique, which is used in many fields mostly exemplified as medicine, genetics and biomedical engineering. The number of studies about classification of the datum on DNA microarray gene expression is specifically increased in recent years. However, because of the reasons as the abundance of gene numbers in the datum as microarray gene expressions and the nonlinear relations mostly across those datum, the success of conventional classification algorithms can be limited. Because of these reasons, the interest on classification methods which are based on artificial intelligence to solve the problem on classification has been gradually increased in recent times. In this study, a hybrid approach which is based on Adaptive Neuro-Fuzzy Inference System (ANFIS and Genetic Algorithm (GA are suggested in order to classify liver microarray cancer data set. Simulation results are compared with the results of other methods. According to the results obtained, it is seen that the recommended method is better than the other methods.

  18. Identification of candidate genes associated with leaf senescence in cultivated sunflower (Helianthus annuus L..

    Directory of Open Access Journals (Sweden)

    Sebastian Moschen

    Full Text Available Cultivated sunflower (Helianthus annuus L., an important source of edible vegetable oil, shows rapid onset of senescence, which limits production by reducing photosynthetic capacity under specific growing conditions. Carbon for grain filling depends strongly on light interception by green leaf area, which diminishes during grain filling due to leaf senescence. Transcription factors (TFs regulate the progression of leaf senescence in plants and have been well explored in model systems, but information for many agronomic crops remains limited. Here, we characterize the expression profiles of a set of putative senescence associated genes (SAGs identified by a candidate gene approach and sunflower microarray expression studies. We examined a time course of sunflower leaves undergoing natural senescence and used quantitative PCR (qPCR to measure the expression of 11 candidate genes representing the NAC, WRKY, MYB and NF-Y TF families. In addition, we measured physiological parameters such as chlorophyll, total soluble sugars and nitrogen content. The expression of Ha-NAC01, Ha-NAC03, Ha-NAC04, Ha-NAC05 and Ha-MYB01 TFs increased before the remobilization rate increased and therefore, before the appearance of the first physiological symptoms of senescence, whereas Ha-NAC02 expression decreased. In addition, we also examined the trifurcate feed-forward pathway (involving ORE1, miR164, and ethylene insensitive 2 previously reported for Arabidopsis. We measured transcription of Ha-NAC01 (the sunflower homolog of ORE1 and Ha-EIN2, along with the levels of miR164, in two leaves from different stem positions, and identified differences in transcription between basal and upper leaves. Interestingly, Ha-NAC01 and Ha-EIN2 transcription profiles showed an earlier up-regulation in upper leaves of plants close to maturity, compared with basal leaves of plants at pre-anthesis stages. These results suggest that the H. annuus TFs characterized in this work could

  19. Identification of candidate genes associated with leaf senescence in cultivated sunflower (Helianthus annuus L.).

    Science.gov (United States)

    Moschen, Sebastian; Bengoa Luoni, Sofia; Paniego, Norma B; Hopp, H Esteban; Dosio, Guillermo A A; Fernandez, Paula; Heinz, Ruth A

    2014-01-01

    Cultivated sunflower (Helianthus annuus L.), an important source of edible vegetable oil, shows rapid onset of senescence, which limits production by reducing photosynthetic capacity under specific growing conditions. Carbon for grain filling depends strongly on light interception by green leaf area, which diminishes during grain filling due to leaf senescence. Transcription factors (TFs) regulate the progression of leaf senescence in plants and have been well explored in model systems, but information for many agronomic crops remains limited. Here, we characterize the expression profiles of a set of putative senescence associated genes (SAGs) identified by a candidate gene approach and sunflower microarray expression studies. We examined a time course of sunflower leaves undergoing natural senescence and used quantitative PCR (qPCR) to measure the expression of 11 candidate genes representing the NAC, WRKY, MYB and NF-Y TF families. In addition, we measured physiological parameters such as chlorophyll, total soluble sugars and nitrogen content. The expression of Ha-NAC01, Ha-NAC03, Ha-NAC04, Ha-NAC05 and Ha-MYB01 TFs increased before the remobilization rate increased and therefore, before the appearance of the first physiological symptoms of senescence, whereas Ha-NAC02 expression decreased. In addition, we also examined the trifurcate feed-forward pathway (involving ORE1, miR164, and ethylene insensitive 2) previously reported for Arabidopsis. We measured transcription of Ha-NAC01 (the sunflower homolog of ORE1) and Ha-EIN2, along with the levels of miR164, in two leaves from different stem positions, and identified differences in transcription between basal and upper leaves. Interestingly, Ha-NAC01 and Ha-EIN2 transcription profiles showed an earlier up-regulation in upper leaves of plants close to maturity, compared with basal leaves of plants at pre-anthesis stages. These results suggest that the H. annuus TFs characterized in this work could play important

  20. Ranking candidate disease genes from gene expression and protein interaction: a Katz-centrality based approach.

    Directory of Open Access Journals (Sweden)

    Jing Zhao

    Full Text Available Many diseases have complex genetic causes, where a set of alleles can affect the propensity of getting the disease. The identification of such disease genes is important to understand the mechanistic and evolutionary aspects of pathogenesis, improve diagnosis and treatment of the disease, and aid in drug discovery. Current genetic studies typically identify chromosomal regions associated specific diseases. But picking out an unknown disease gene from hundreds of candidates located on the same genomic interval is still challenging. In this study, we propose an approach to prioritize candidate genes by integrating data of gene expression level, protein-protein interaction strength and known disease genes. Our method is based only on two, simple, biologically motivated assumptions--that a gene is a good disease-gene candidate if it is differentially expressed in cases and controls, or that it is close to other disease-gene candidates in its protein interaction network. We tested our method on 40 diseases in 58 gene expression datasets of the NCBI Gene Expression Omnibus database. On these datasets our method is able to predict unknown disease genes as well as identifying pleiotropic genes involved in the physiological cellular processes of many diseases. Our study not only provides an effective algorithm for prioritizing candidate disease genes but is also a way to discover phenotypic interdependency, cooccurrence and shared pathophysiology between different disorders.

  1. Analysis of gene expression in resynthesized Brassica napus Allopolyploids using arabidopsis 70mer oligo microarrays.

    Directory of Open Access Journals (Sweden)

    Robert T Gaeta

    Full Text Available BACKGROUND: Studies in resynthesized Brassica napus allopolyploids indicate that homoeologous chromosome exchanges in advanced generations (S(5ratio6 alter gene expression through the loss and doubling of homoeologous genes within the rearrangements. Rearrangements may also indirectly affect global gene expression if homoeologous copies of gene regulators within rearrangements have differential affects on the transcription of genes in networks. METHODOLOGY/PRINCIPAL FINDINGS: We utilized Arabidopsis 70mer oligonucleotide microarrays for exploring gene expression in three resynthesized B. napus lineages at the S(0ratio1 and S(5ratio6 generations as well as their diploid progenitors B. rapa and B. oleracea. Differential gene expression between the progenitors and additive (midparent expression in the allopolyploids were tested. The S(5ratio6 lines differed in the number of genetic rearrangements, allowing us to test if the number of genes displaying nonadditive expression was related to the number of rearrangements. Estimates using per-gene and common variance ANOVA models indicated that 6-15% of 26,107 genes were differentially expressed between the progenitors. Individual allopolyploids showed nonadditive expression for 1.6-32% of all genes. Less than 0.3% of genes displayed nonadditive expression in all S(0ratio1 lines and 0.1-0.2% were nonadditive among all S(5ratio6 lines. Differentially expressed genes in the polyploids were over-represented by genes differential between the progenitors. The total number of differentially expressed genes was correlated with the number of genetic changes in S(5ratio6 lines under the common variance model; however, there was no relationship using a per-gene variance model, and many genes showed nonadditive expression in S(0ratio1 lines. CONCLUSIONS/SIGNIFICANCE: Few genes reproducibly demonstrated nonadditive expression among lineages, suggesting few changes resulted from a general response to polyploidization

  2. Microarray-Based Gene Expression Analysis for Veterinary Pathologists: A Review.

    Science.gov (United States)

    Raddatz, Barbara B; Spitzbarth, Ingo; Matheis, Katja A; Kalkuhl, Arno; Deschl, Ulrich; Baumgärtner, Wolfgang; Ulrich, Reiner

    2017-09-01

    High-throughput, genome-wide transcriptome analysis is now commonly used in all fields of life science research and is on the cusp of medical and veterinary diagnostic application. Transcriptomic methods such as microarrays and next-generation sequencing generate enormous amounts of data. The pathogenetic expertise acquired from understanding of general pathology provides veterinary pathologists with a profound background, which is essential in translating transcriptomic data into meaningful biological knowledge, thereby leading to a better understanding of underlying disease mechanisms. The scientific literature concerning high-throughput data-mining techniques usually addresses mathematicians or computer scientists as the target audience. In contrast, the present review provides the reader with a clear and systematic basis from a veterinary pathologist's perspective. Therefore, the aims are (1) to introduce the reader to the necessary methodological background; (2) to introduce the sequential steps commonly performed in a microarray analysis including quality control, annotation, normalization, selection of differentially expressed genes, clustering, gene ontology and pathway analysis, analysis of manually selected genes, and biomarker discovery; and (3) to provide references to publically available and user-friendly software suites. In summary, the data analysis methods presented within this review will enable veterinary pathologists to analyze high-throughput transcriptome data obtained from their own experiments, supplemental data that accompany scientific publications, or public repositories in order to obtain a more in-depth insight into underlying disease mechanisms.

  3. Gene Network Construction from Microarray Data Identifies a Key Network Module and Several Candidate Hub Genes in Age-Associated Spatial Learning Impairment.

    Science.gov (United States)

    Uddin, Raihan; Singh, Shiva M

    2017-01-01

    As humans age many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI related genes and their networks based on co-expression relationship of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in "learning and memory" related functions and pathways. Subsequent differential network analysis of this "learning and memory" module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known function of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. Taken together, they

  4. Candidate genes in ocular dominance plasticity

    NARCIS (Netherlands)

    Rietman, M.L.; Sommeijer, J.-P.; Levelt, C.N.; Heimel, J.A.; Brussaard, A.B.; Borst, J.G.G.; Elgersma, Y.; Galjart, N.; van der Horst, G.T.; Pennartz, C.M.; Smit, A.B.; Spruijt, B.M.; Verhage, M.; de Zeeuw, C.I.

    2012-01-01

    Many studies have been devoted to the identification of genes involved in experience-dependent plasticity in the visual cortex. To discover new candidate genes, we have reexamined data from one such study on ocular dominance (OD) plasticity in recombinant inbred BXD mouse strains. We have correlated

  5. Discrimination of phytoplasmas using an oligonucleotide microarray targeting rps3, rpl22, and rps19 genes

    Czech Academy of Sciences Publication Activity Database

    Lenz, Ondřej; Marková, J.; Sarkisova, Tatiana; Fránová, Jana; Přibylová, Jaroslava

    2015-01-01

    Roč. 70, January 2015 (2015), s. 47-52 ISSN 0261-2194 Institutional support: RVO:60077344 Keywords : DNA microarray * rpl22 gene * rps19 gene * rps3 gene Subject RIV: EE - Microbiology, Virology Impact factor: 1.652, year: 2015

  6. Microarray analysis identified Puccinia striiformis f. sp. tritici genes involved in infection and sporulation.

    Science.gov (United States)

    Puccinia striiformis f. sp. tritici (Pst) causes stripe rust, one of the most important diseases of wheat worldwide. To identify Pst genes involved in infection and sporulation, a custom oligonucleotide Genechip was made using sequences of 442 genes selected from Pst cDNA libraries. Microarray analy...

  7. Gene expression profiling in gill tissues of White spot syndrome virus infected black tiger shrimp Penaeus monodon by DNA microarray.

    Science.gov (United States)

    Shekhar, M S; Gomathi, A; Gopikrishna, G; Ponniah, A G

    2015-06-01

    White spot syndrome virus (WSSV) continues to be the most devastating viral pathogen infecting penaeid shrimp the world over. The genome of WSSV has been deciphered and characterized from three geographical isolates and significant progress has been made in developing various molecular diagnostic methods to detect the virus. However, the information on host immune gene response to WSSV pathogenesis is limited. Microarray analysis was carried out as an approach to analyse the gene expression in black tiger shrimp Penaeus monodon in response to WSSV infection. Gill tissues collected from the WSSV infected shrimp at 6, 24, 48 h and moribund stage were analysed for differential gene expression. Shrimp cDNAs of 40,059 unique sequences were considered for designing the microarray chip. The Cy3-labeled cRNA derived from healthy and WSSV-infected shrimp was subjected to hybridization with all the DNA spots in the microarray which revealed 8,633 and 11,147 as up- and down-regulated genes respectively at different time intervals post infection. The altered expression of these numerous genes represented diverse functions such as immune response, osmoregulation, apoptosis, nucleic acid binding, energy and metabolism, signal transduction, stress response and molting. The changes in gene expression profiles observed by microarray analysis provides molecular insights and framework of genes which are up- and down-regulated at different time intervals during WSSV infection in shrimp. The microarray data was validated by Real Time analysis of four differentially expressed genes involved in apoptosis (translationally controlled tumor protein, inhibitor of apoptosis protein, ubiquitin conjugated enzyme E2 and caspase) for gene expression levels. The role of apoptosis related genes in WSSV infected shrimp is discussed herein.

  8. ENU Mutagenesis in Mice Identifies Candidate Genes For Hypogonadism

    Science.gov (United States)

    Weiss, Jeffrey; Hurley, Lisa A.; Harris, Rebecca M.; Finlayson, Courtney; Tong, Minghan; Fisher, Lisa A.; Moran, Jennifer L.; Beier, David R.; Mason, Christopher; Jameson, J. Larry

    2012-01-01

    Genome-wide mutagenesis was performed in mice to identify candidate genes for male infertility, for which the predominant causes remain idiopathic. Mice were mutagenized using N-ethyl-N-nitrosourea (ENU), bred, and screened for phenotypes associated with the male urogenital system. Fifteen heritable lines were isolated and chromosomal loci were assigned using low density genome-wide SNP arrays. Ten of the fifteen lines were pursued further using higher resolution SNP analysis to narrow the candidate gene regions. Exon sequencing of candidate genes identified mutations in mice with cystic kidneys (Bicc1), cryptorchidism (Rxfp2), restricted germ cell deficiency (Plk4), and severe germ cell deficiency (Prdm9). In two other lines with severe hypogonadism candidate sequencing failed to identify mutations, suggesting defects in genes with previously undocumented roles in gonadal function. These genomic intervals were sequenced in their entirety and a candidate mutation was identified in SnrpE in one of the two lines. The line harboring the SnrpE variant retains substantial spermatogenesis despite small testis size, an unusual phenotype. In addition to the reproductive defects, heritable phenotypes were observed in mice with ataxia (Myo5a), tremors (Pmp22), growth retardation (unknown gene), and hydrocephalus (unknown gene). These results demonstrate that the ENU screen is an effective tool for identifying potential causes of male infertility. PMID:22258617

  9. Generalization of DNA microarray dispersion properties: microarray equivalent of t-distribution

    DEFF Research Database (Denmark)

    Novak, Jaroslav P; Kim, Seon-Young; Xu, Jun

    2006-01-01

    BACKGROUND: DNA microarrays are a powerful technology that can provide a wealth of gene expression data for disease studies, drug development, and a wide scope of other investigations. Because of the large volume and inherent variability of DNA microarray data, many new statistical methods have...

  10. Genetic Bee Colony (GBC) algorithm: A new gene selection method for microarray cancer classification.

    Science.gov (United States)

    Alshamlan, Hala M; Badr, Ghada H; Alohali, Yousef A

    2015-06-01

    Naturally inspired evolutionary algorithms prove effectiveness when used for solving feature selection and classification problems. Artificial Bee Colony (ABC) is a relatively new swarm intelligence method. In this paper, we propose a new hybrid gene selection method, namely Genetic Bee Colony (GBC) algorithm. The proposed algorithm combines the used of a Genetic Algorithm (GA) along with Artificial Bee Colony (ABC) algorithm. The goal is to integrate the advantages of both algorithms. The proposed algorithm is applied to a microarray gene expression profile in order to select the most predictive and informative genes for cancer classification. In order to test the accuracy performance of the proposed algorithm, extensive experiments were conducted. Three binary microarray datasets are use, which include: colon, leukemia, and lung. In addition, another three multi-class microarray datasets are used, which are: SRBCT, lymphoma, and leukemia. Results of the GBC algorithm are compared with our recently proposed technique: mRMR when combined with the Artificial Bee Colony algorithm (mRMR-ABC). We also compared the combination of mRMR with GA (mRMR-GA) and Particle Swarm Optimization (mRMR-PSO) algorithms. In addition, we compared the GBC algorithm with other related algorithms that have been recently published in the literature, using all benchmark datasets. The GBC algorithm shows superior performance as it achieved the highest classification accuracy along with the lowest average number of selected genes. This proves that the GBC algorithm is a promising approach for solving the gene selection problem in both binary and multi-class cancer classification. Copyright © 2015 Elsevier Ltd. All rights reserved.

  11. Analysis of Temporal-spatial Co-variation within Gene Expression Microarray Data in an Organogenesis Model

    Science.gov (United States)

    Ehler, Martin; Rajapakse, Vinodh; Zeeberg, Barry; Brooks, Brian; Brown, Jacob; Czaja, Wojciech; Bonner, Robert F.

    The gene networks underlying closure of the optic fissure during vertebrate eye development are poorly understood. We used a novel clustering method based on Laplacian Eigenmaps, a nonlinear dimension reduction method, to analyze microarray data from laser capture microdissected (LCM) cells at the site and developmental stages (days 10.5 to 12.5) of optic fissure closure. Our new method provided greater biological specificity than classical clustering algorithms in terms of identifying more biological processes and functions related to eye development as defined by Gene Ontology at lower false discovery rates. This new methodology builds on the advantages of LCM to isolate pure phenotypic populations within complex tissues and allows improved ability to identify critical gene products expressed at lower copy number. The combination of LCM of embryonic organs, gene expression microarrays, and extracting spatial and temporal co-variations appear to be a powerful approach to understanding the gene regulatory networks that specify mammalian organogenesis.

  12. Microarray BASICA: Background Adjustment, Segmentation, Image Compression and Analysis of Microarray Images

    Directory of Open Access Journals (Sweden)

    Jianping Hua

    2004-01-01

    Full Text Available This paper presents microarray BASICA: an integrated image processing tool for background adjustment, segmentation, image compression, and analysis of cDNA microarray images. BASICA uses a fast Mann-Whitney test-based algorithm to segment cDNA microarray images, and performs postprocessing to eliminate the segmentation irregularities. The segmentation results, along with the foreground and background intensities obtained with the background adjustment, are then used for independent compression of the foreground and background. We introduce a new distortion measurement for cDNA microarray image compression and devise a coding scheme by modifying the embedded block coding with optimized truncation (EBCOT algorithm (Taubman, 2000 to achieve optimal rate-distortion performance in lossy coding while still maintaining outstanding lossless compression performance. Experimental results show that the bit rate required to ensure sufficiently accurate gene expression measurement varies and depends on the quality of cDNA microarray images. For homogeneously hybridized cDNA microarray images, BASICA is able to provide from a bit rate as low as 5 bpp the gene expression data that are 99% in agreement with those of the original 32 bpp images.

  13. Multi-gene detection and identification of mosquito-borne RNA viruses using an oligonucleotide microarray.

    Directory of Open Access Journals (Sweden)

    Nathan D Grubaugh

    Full Text Available BACKGROUND: Arthropod-borne viruses are important emerging pathogens world-wide. Viruses transmitted by mosquitoes, such as dengue, yellow fever, and Japanese encephalitis viruses, infect hundreds of millions of people and animals each year. Global surveillance of these viruses in mosquito vectors using molecular based assays is critical for prevention and control of the associated diseases. Here, we report an oligonucleotide DNA microarray design, termed ArboChip5.1, for multi-gene detection and identification of mosquito-borne RNA viruses from the genera Flavivirus (family Flaviviridae, Alphavirus (Togaviridae, Orthobunyavirus (Bunyaviridae, and Phlebovirus (Bunyaviridae. METHODOLOGY/PRINCIPAL FINDINGS: The assay utilizes targeted PCR amplification of three genes from each virus genus for electrochemical detection on a portable, field-tested microarray platform. Fifty-two viruses propagated in cell-culture were used to evaluate the specificity of the PCR primer sets and the ArboChip5.1 microarray capture probes. The microarray detected all of the tested viruses and differentiated between many closely related viruses such as members of the dengue, Japanese encephalitis, and Semliki Forest virus clades. Laboratory infected mosquitoes were used to simulate field samples and to determine the limits of detection. Additionally, we identified dengue virus type 3, Japanese encephalitis virus, Tembusu virus, Culex flavivirus, and a Quang Binh-like virus from mosquitoes collected in Thailand in 2011 and 2012. CONCLUSIONS/SIGNIFICANCE: We demonstrated that the described assay can be utilized in a comprehensive field surveillance program by the broad-range amplification and specific identification of arboviruses from infected mosquitoes. Furthermore, the microarray platform can be deployed in the field and viral RNA extraction to data analysis can occur in as little as 12 h. The information derived from the ArboChip5.1 microarray can help to establish

  14. Statistical Redundancy Testing for Improved Gene Selection in Cancer Classification Using Microarray Data

    Directory of Open Access Journals (Sweden)

    J. Sunil Rao

    2007-01-01

    Full Text Available In gene selection for cancer classifi cation using microarray data, we define an eigenvalue-ratio statistic to measure a gene’s contribution to the joint discriminability when this gene is included into a set of genes. Based on this eigenvalueratio statistic, we define a novel hypothesis testing for gene statistical redundancy and propose two gene selection methods. Simulation studies illustrate the agreement between statistical redundancy testing and gene selection methods. Real data examples show the proposed gene selection methods can select a compact gene subset which can not only be used to build high quality cancer classifiers but also show biological relevance.

  15. ESTs, cDNA microarrays, and gene expression profiling: tools for dissecting plant physiology and development.

    Science.gov (United States)

    Alba, Rob; Fei, Zhangjun; Payton, Paxton; Liu, Yang; Moore, Shanna L; Debbie, Paul; Cohn, Jonathan; D'Ascenzo, Mark; Gordon, Jeffrey S; Rose, Jocelyn K C; Martin, Gregory; Tanksley, Steven D; Bouzayen, Mondher; Jahn, Molly M; Giovannoni, Jim

    2004-09-01

    Gene expression profiling holds tremendous promise for dissecting the regulatory mechanisms and transcriptional networks that underlie biological processes. Here we provide details of approaches used by others and ourselves for gene expression profiling in plants with emphasis on cDNA microarrays and discussion of both experimental design and downstream analysis. We focus on methods and techniques emphasizing fabrication of cDNA microarrays, fluorescent labeling, cDNA hybridization, experimental design, and data processing. We include specific examples that demonstrate how this technology can be used to further our understanding of plant physiology and development (specifically fruit development and ripening) and for comparative genomics by comparing transcriptome activity in tomato and pepper fruit.

  16. Evaluation of DNA microarray results in the Toxicogenomics Project (TGP) consortium in Japan.

    Science.gov (United States)

    Noriyuki, Nakatsu; Igarashi, Yoshinobu; Ono, Atsushi; Yamada, Hiroshi; Ohno, Yasuo; Urushidani, Tetsuro

    2012-01-01

    An important technology used in toxicogenomic drug discovery research is the microarray, which enables researchers to simultaneously analyze the expression of a large number of genes. To build a database and data analysis system for use in assessing the safety of drugs and drug candidates, in 2002 we conducted a 5-year collaborative study in the Toxicogenomics Project (TGP1) in Japan. Experimental data generated by such studies must be validated by different laboratories for robust and accurate analysis. For this purpose, we conducted intra- and inter-laboratory validation studies with participating companies in the second collaborative study in the Toxicogenomics Project (TGP2). Gene expression in the liver of rats treated with acetaminophen (APAP) was independently examined by the participating companies using Affymetrix GeneChip microarrays. The intra- and inter-laboratory reproducibility of the data was evaluated using hierarchical clustering analysis. The toxicogenomics results were highly reproducible, indicating that the gene expression data generated in our TGP1 project is reliable and compatible with the data generated by the participating laboratories.

  17. Comparative analysis of gene expression by microarray analysis of male and female flowers of Asparagus officinalis.

    Science.gov (United States)

    Gao, Wu-Jun; Li, Shu-Fen; Zhang, Guo-Jun; Wang, Ning-Na; Deng, Chuan-Liang; Lu, Long-Dou

    2013-01-01

    To identify rapidly a number of genes probably involved in sex determination and differentiation of the dioecious plant Asparagus officinalis, gene expression profiles in early flower development for male and female plants were investigated by microarray assay with 8,665 probes. In total, 638 male-biased and 543 female-biased genes were identified. These genes with biased-expression for male and female were involved in a variety of processes associated with molecular functions, cellular components, and biological processes, suggesting that a complex mechanism underlies the sex development of asparagus. Among the differentially expressed genes involved in the reproductive process, a number of genes associated with floral development were identified. Reverse transcription-PCR was performed for validation, and the results were largely consistent with those obtained by microarray analysis. The findings of this study might contribute to understanding of the molecular mechanisms of sex determination and differentiation in dioecious asparagus and provide a foundation for further studies of this plant.

  18. [Obesity studies in candidate genes].

    Science.gov (United States)

    Ochoa, María del Carmen; Martí, Amelia; Martínez, J Alfredo

    2004-04-17

    There are more than 430 chromosomic regions with gene variants involved in body weight regulation and obesity development. Polymorphisms in genes related to energy expenditure--uncoupling proteins (UCPs), related to adipogenesis and insulin resistance--hormone-sensitive lipase (HLS), peroxisome proliferator-activated receptor gamma (PPAR gamma), beta adrenergic receptors (ADRB2,3), and alfa tumor necrosis factor (TNF-alpha), and related to food intake--ghrelin (GHRL)--appear to be associated with obesity phenotypes. Obesity risk depends on two factors: a) genetic variants in candidate genes, and b) biographical exposure to environmental risk factors. It is necessary to perform new studies, with appropriate control groups and designs, in order to reach relevant conclusions with regard to gene/environmental (diet, lifestyle) interactions.

  19. Plant-pathogen interactions: what microarray tells about it?

    Science.gov (United States)

    Lodha, T D; Basak, J

    2012-01-01

    Plant defense responses are mediated by elementary regulatory proteins that affect expression of thousands of genes. Over the last decade, microarray technology has played a key role in deciphering the underlying networks of gene regulation in plants that lead to a wide variety of defence responses. Microarray is an important tool to quantify and profile the expression of thousands of genes simultaneously, with two main aims: (1) gene discovery and (2) global expression profiling. Several microarray technologies are currently in use; most include a glass slide platform with spotted cDNA or oligonucleotides. Till date, microarray technology has been used in the identification of regulatory genes, end-point defence genes, to understand the signal transduction processes underlying disease resistance and its intimate links to other physiological pathways. Microarray technology can be used for in-depth, simultaneous profiling of host/pathogen genes as the disease progresses from infection to resistance/susceptibility at different developmental stages of the host, which can be done in different environments, for clearer understanding of the processes involved. A thorough knowledge of plant disease resistance using successful combination of microarray and other high throughput techniques, as well as biochemical, genetic, and cell biological experiments is needed for practical application to secure and stabilize yield of many crop plants. This review starts with a brief introduction to microarray technology, followed by the basics of plant-pathogen interaction, the use of DNA microarrays over the last decade to unravel the mysteries of plant-pathogen interaction, and ends with the future prospects of this technology.

  20. Gene Expression Signature in Endemic Osteoarthritis by Microarray Analysis

    Directory of Open Access Journals (Sweden)

    Xi Wang

    2015-05-01

    Full Text Available Kashin-Beck Disease (KBD is an endemic osteochondropathy with an unknown pathogenesis. Diagnosis of KBD is effective only in advanced cases, which eliminates the possibility of early treatment and leads to an inevitable exacerbation of symptoms. Therefore, we aim to identify an accurate blood-based gene signature for the detection of KBD. Previously published gene expression profile data on cartilage and peripheral blood mononuclear cells (PBMCs from adults with KBD were compared to select potential target genes. Microarray analysis was conducted to evaluate the expression of the target genes in a cohort of 100 KBD patients and 100 healthy controls. A gene expression signature was identified using a training set, which was subsequently validated using an independent test set with a minimum redundancy maximum relevance (mRMR algorithm and support vector machine (SVM algorithm. Fifty unique genes were differentially expressed between KBD patients and healthy controls. A 20-gene signature was identified that distinguished between KBD patients and controls with 90% accuracy, 85% sensitivity, and 95% specificity. This study identified a 20-gene signature that accurately distinguishes between patients with KBD and controls using peripheral blood samples. These results promote the further development of blood-based genetic biomarkers for detection of KBD.

  1. OpWise: Operons aid the identification of differentially expressed genes in bacterial microarray experiments

    Directory of Open Access Journals (Sweden)

    Arkin Adam P

    2006-01-01

    Full Text Available Abstract Background Differentially expressed genes are typically identified by analyzing the variation between replicate measurements. These procedures implicitly assume that there are no systematic errors in the data even though several sources of systematic error are known. Results OpWise estimates the amount of systematic error in bacterial microarray data by assuming that genes in the same operon have matching expression patterns. OpWise then performs a Bayesian analysis of a linear model to estimate significance. In simulations, OpWise corrects for systematic error and is robust to deviations from its assumptions. In several bacterial data sets, significant amounts of systematic error are present, and replicate-based approaches overstate the confidence of the changers dramatically, while OpWise does not. Finally, OpWise can identify additional changers by assigning genes higher confidence if they are consistent with other genes in the same operon. Conclusion Although microarray data can contain large amounts of systematic error, operons provide an external standard and allow for reasonable estimates of significance. OpWise is available at http://microbesonline.org/OpWise.

  2. Microarray-based DNA methylation study of Ewing's sarcoma of the bone.

    Science.gov (United States)

    Park, Hye-Rim; Jung, Woon-Won; Kim, Hyun-Sook; Park, Yong-Koo

    2014-10-01

    Alterations in DNA methylation patterns are a hallmark of malignancy. However, the majority of epigenetic studies of Ewing's sarcoma have focused on the analysis of only a few candidate genes. Comprehensive studies are thus lacking and are required. The aim of the present study was to identify novel methylation markers in Ewing's sarcoma using microarray analysis. The current study reports the microarray-based DNA methylation study of 1,505 CpG sites of 807 cancer-related genes from 69 Ewing's sarcoma samples. The Illumina GoldenGate Methylation Cancer Panel I microarray was used, and with the appropriate controls (n=14), a total of 92 hypermethylated genes were identified in the Ewing's sarcoma samples. The majority of the hypermethylated genes were associated with cell adhesion, cell regulation, development and signal transduction. The overall methylation mean values were compared between patients who survived and those that did not. The overall methylation mean was significantly higher in the patients who did not survive (0.25±0.03) than in those who did (0.22±0.05) (P=0.0322). However, the overall methylation mean was not found to significantly correlate with age, gender or tumor location. GDF10 , OSM , APC and HOXA11 were the most significant differentially-methylated genes, however, their methylation levels were not found to significantly correlate with the survival rate. The DNA methylation profile of Ewing's sarcoma was characterized and 92 genes that were significantly hypermethylated were detected. A trend towards a more aggressive behavior was identified in the methylated group. The results of this study indicated that methylation may be significant in the development of Ewing's sarcoma.

  3. DNA microarrays of baculovirus genomes: differential expression of viral genes in two susceptible insect cell lines.

    Science.gov (United States)

    Yamagishi, J; Isobe, R; Takebuchi, T; Bando, H

    2003-03-01

    We describe, for the first time, the generation of a viral DNA chip for simultaneous expression measurements of nearly all known open reading frames (ORFs) in the best-studied members of the family Baculoviridae, Autographa californica multiple nucleopolyhedrovirus (AcMNPV) and Bombyx mori nucleopolyhedrovirus (BmNPV). In this study, a viral DNA chip (Ac-BmNPV chip) was fabricated and used to characterize the viral gene expression profile for AcMNPV in different cell types. The viral chip is composed of microarrays of viral DNA prepared by robotic deposition of PCR-amplified viral DNA fragments on glass for ORFs in the NPV genome. Viral gene expression was monitored by hybridization to the DNA fragment microarrays with fluorescently labeled cDNAs prepared from infected Spodoptera frugiperda, Sf9 cells and Trichoplusia ni, TnHigh-Five cells, the latter a major producer of baculovirus and recombinant proteins. A comparison of expression profiles of known ORFs in AcMNPV elucidated six genes (ORF150, p10, pk2, and three late gene expression factor genes lef-3, p35 and lef- 6) the expression of each of which was regulated differently in the two cell lines. Most of these genes are known to be closely involved in the viral life cycle such as in DNA replication, late gene expression and the release of polyhedra from infected cells. These results imply that the differential expression of these viral genes accounts for the differences in viral replication between these two cell lines. Thus, these fabricated microarrays of NPV DNA which allow a rapid analysis of gene expression at the viral genome level should greatly speed the functional analysis of large genomes of NPV.

  4. Cross-species microarray hybridization to identify developmentally regulated genes in the filamentous fungus Sordaria macrospora.

    Science.gov (United States)

    Nowrousian, Minou; Ringelberg, Carol; Dunlap, Jay C; Loros, Jennifer J; Kück, Ulrich

    2005-04-01

    The filamentous fungus Sordaria macrospora forms complex three-dimensional fruiting bodies that protect the developing ascospores and ensure their proper discharge. Several regulatory genes essential for fruiting body development were previously isolated by complementation of the sterile mutants pro1, pro11 and pro22. To establish the genetic relationships between these genes and to identify downstream targets, we have conducted cross-species microarray hybridizations using cDNA arrays derived from the closely related fungus Neurospora crassa and RNA probes prepared from wild-type S. macrospora and the three developmental mutants. Of the 1,420 genes which gave a signal with the probes from all the strains used, 172 (12%) were regulated differently in at least one of the three mutants compared to the wild type, and 17 (1.2%) were regulated differently in all three mutant strains. Microarray data were verified by Northern analysis or quantitative real time PCR. Among the genes that are up- or down-regulated in the mutant strains are genes encoding the pheromone precursors, enzymes involved in melanin biosynthesis and a lectin-like protein. Analysis of gene expression in double mutants revealed a complex network of interaction between the pro gene products.

  5. Gene expression analysis identifies new candidate genes associated with the development of black skin spots in Corriedale sheep.

    Science.gov (United States)

    Peñagaricano, Francisco; Zorrilla, Pilar; Naya, Hugo; Robello, Carlos; Urioste, Jorge I

    2012-02-01

    The white coat colour of sheep is an important economic trait. For unknown reasons, some animals are born with, and others develop with time, black skin spots that can also produce pigmented fibres. The presence of pigmented fibres in the white wool significantly decreases the fibre quality. The aim of this work was to study gene expression in black spots (with and without pigmented fibres) and white skin by microarray techniques, in order to identify the possible genes involved in the development of this trait. Five unrelated Corriedale sheep were used and, for each animal, the three possible comparisons (three different hybridisations) between the three samples of interest were performed. Differential gene expression patterns were analysed using different t-test approaches. Most of the major genes with well-known roles in skin pigmentation, e.g. ASIP, MC1R and C-KIT, showed no significant difference in the gene expression between white skin and black spots. On the other hand, many of the differentially expressed genes (raw P-value spots. The gene expression of C-FOS and KLF4, transcription factors involved in the cellular response to external factors such as ultraviolet light, was validated by quantitative polymerase chain reaction (PCR). This exploratory study provides a list of candidate genes that could be associated with the development of black skin spots that should be studied in more detail. Characterisation of these genes will enable us to discern the molecular mechanisms involved in the development of this feature and, hence, increase our understanding of melanocyte biology and skin pigmentation. In sheep, understanding this phenomenon is a first step towards developing molecular tools to assist in the selection against the presence of pigmented fibres in white wool.

  6. Degrees of separation as a statistical tool for evaluating candidate genes.

    Science.gov (United States)

    Nelson, Ronald M; Pettersson, Mats E

    2014-12-01

    Selection of candidate genes is an important step in the exploration of complex genetic architecture. The number of gene networks available is increasing and these can provide information to help with candidate gene selection. It is currently common to use the degree of connectedness in gene networks as validation in Genome Wide Association (GWA) and Quantitative Trait Locus (QTL) mapping studies. However, it can cause misleading results if not validated properly. Here we present a method and tool for validating the gene pairs from GWA studies given the context of the network they co-occur in. It ensures that proposed interactions and gene associations are not statistical artefacts inherent to the specific gene network architecture. The CandidateBacon package provides an easy and efficient method to calculate the average degree of separation (DoS) between pairs of genes to currently available gene networks. We show how these empirical estimates of average connectedness are used to validate candidate gene pairs. Validation of interacting genes by comparing their connectedness with the average connectedness in the gene network will provide support for said interactions by utilising the growing amount of gene network information available. Copyright © 2014 Elsevier Ltd. All rights reserved.

  7. Identification of late O{sub 3}-responsive genes in Arabidopsis thaliana by cDNA microarray analysis

    Energy Technology Data Exchange (ETDEWEB)

    D' Haese, D. [Univ. of Antwerp, Dept. of Biology, Antwerp (BE) and Univ. of Newcastle, School of Biology and Psychology, Div. of Biology, Newcastle-Upon-Tyne (United Kingdom); Horemans, N.; Coen, W. De; Guisez, Y. [Univ. of Antwerp, Dept. of Biology, Antwerp (Belgium)

    2006-09-15

    To better understand the response of a plant to 0{sub 3} stress, an integrated microarray analysis was performed on Arabidopsis plants exposed during 2 days to purified air or 150 nl l{sup -1} O{sub 3}, 8 h day-l. Agilent Arabidopsis 2 Oligo Microarrays were used of which the reliability was confirmed by quantitative real-time PCR of nine randomly selected genes. We confirmed the O{sub 3} responsiveness of heat shock proteins (HSPs), glutathione-S-tranferases and genes involved in cell wall stiffening and microbial defence. Whereas, a previous study revealed that during an early stage of the O{sub 3} stress response, gene expression was strongly dependent on jasmonic acid and ethylene, we report that at a later stage (48 h) synthesis of jasrnonic acid and ethylene was downregulated. In addition, we observed the simultaneous induction of salicylic acid synthesis and genes involved in programmed cell death and senescence. Also typically, the later stage of the response to O{sub 3} appeared to be the induction of the complete pathway leading to the biosynthesis of anthocyanin diglucosides and the induction of thioredoxin-based redox control. Surprisingly absent in the list of induced genes were genes involved in ASC-dependent antioxidation, few of which were found to be induced after 12 h of 0{sub 3} exposure in another study. We discuss these and other particular results of the microarray analysis and provide a map depicting significantly affected genes and their pathways highlighting their interrelationships and subcellular localization. (au)

  8. Candidate genes detected in transcriptome studies are strongly dependent on genetic background.

    Directory of Open Access Journals (Sweden)

    Pernille Sarup

    2011-01-01

    Full Text Available Whole genome transcriptomic studies can point to potential candidate genes for organismal traits. However, the importance of potential candidates is rarely followed up through functional studies and/or by comparing results across independent studies. We have analysed the overlap of candidate genes identified from studies of gene expression in Drosophila melanogaster using similar technical platforms. We found little overlap across studies between putative candidate genes for the same traits in the same sex. Instead there was a high degree of overlap between different traits and sexes within the same genetic backgrounds. Putative candidates found using transcriptomics therefore appear very sensitive to genetic background and this can mask or override effects of treatments. The functional importance of putative candidate genes emerging from transcriptome studies needs to be validated through additional experiments and in future studies we suggest a focus on the genes, networks and pathways affecting traits in a consistent manner across backgrounds.

  9. Microarray meta-analysis to explore abiotic stress-specific gene expression patterns in Arabidopsis.

    Science.gov (United States)

    Shen, Po-Chih; Hour, Ai-Ling; Liu, Li-Yu Daisy

    2017-12-01

    Abiotic stresses are the major limiting factors that affect plant growth, development, yield and final quality. Deciphering the underlying mechanisms of plants' adaptations to stresses using few datasets might overlook the different aspects of stress tolerance in plants, which might be simultaneously and consequently operated in the system. Fortunately, the accumulated microarray expression data offer an opportunity to infer abiotic stress-specific gene expression patterns through meta-analysis. In this study, we propose to combine microarray gene expression data under control, cold, drought, heat, and salt conditions and determined modules (gene sets) of genes highly associated with each other according to the observed expression data. By analyzing the expression variations of the Eigen genes from different conditions, we had identified two, three, and five gene modules as cold-, heat-, and salt-specific modules, respectively. Most of the cold- or heat-specific modules were differentially expressed to a particular degree in shoot samples, while most of the salt-specific modules were differentially expressed to a particular degree in root samples. A gene ontology (GO) analysis on the stress-specific modules suggested that the gene modules exclusively enriched stress-related GO terms and that different genes under the same GO terms may be alternatively disturbed in different conditions. The gene regulatory events for two genes, DREB1A and DEAR1, in the cold-specific gene module had also been validated, as evidenced through the literature search. Our protocols study the specificity of the gene modules that were specifically activated under a particular type of abiotic stress. The biplot can also assist to visualize the stress-specific gene modules. In conclusion, our approach has the potential to further elucidate mechanisms in plants and beneficial for future experiments design under different abiotic stresses.

  10. Identification and optimization of classifier genes from multi-class earthworm microarray dataset.

    Directory of Open Access Journals (Sweden)

    Ying Li

    Full Text Available Monitoring, assessment and prediction of environmental risks that chemicals pose demand rapid and accurate diagnostic assays. A variety of toxicological effects have been associated with explosive compounds TNT and RDX. One important goal of microarray experiments is to discover novel biomarkers for toxicity evaluation. We have developed an earthworm microarray containing 15,208 unique oligo probes and have used it to profile gene expression in 248 earthworms exposed to TNT, RDX or neither. We assembled a new machine learning pipeline consisting of several well-established feature filtering/selection and classification techniques to analyze the 248-array dataset in order to construct classifier models that can separate earthworm samples into three groups: control, TNT-treated, and RDX-treated. First, a total of 869 genes differentially expressed in response to TNT or RDX exposure were identified using a univariate statistical algorithm of class comparison. Then, decision tree-based algorithms were applied to select a subset of 354 classifier genes, which were ranked by their overall weight of significance. A multiclass support vector machine (MC-SVM method and an unsupervised K-mean clustering method were applied to independently refine the classifier, producing a smaller subset of 39 and 30 classifier genes, separately, with 11 common genes being potential biomarkers. The combined 58 genes were considered the refined subset and used to build MC-SVM and clustering models with classification accuracy of 83.5% and 56.9%, respectively. This study demonstrates that the machine learning approach can be used to identify and optimize a small subset of classifier/biomarker genes from high dimensional datasets and generate classification models of acceptable precision for multiple classes.

  11. Functional validation of candidate genes detected by genomic feature models

    DEFF Research Database (Denmark)

    Rohde, Palle Duun; Østergaard, Solveig; Kristensen, Torsten Nygaard

    2018-01-01

    Understanding the genetic underpinnings of complex traits requires knowledge of the genetic variants that contribute to phenotypic variability. Reliable statistical approaches are needed to obtain such knowledge. In genome-wide association studies, variants are tested for association with trait...... then functionally assessed whether the identified candidate genes affected locomotor activity by reducing gene expression using RNA interference. In five of the seven candidate genes tested, reduced gene expression altered the phenotype. The ranking of genes within the predictive GO term was highly correlated...

  12. Microarray analysis identifies a common set of cellular genes modulated by different HCV replicon clones

    Directory of Open Access Journals (Sweden)

    Gerosolimo Germano

    2008-06-01

    Full Text Available Abstract Background Hepatitis C virus (HCV RNA synthesis and protein expression affect cell homeostasis by modulation of gene expression. The impact of HCV replication on global cell transcription has not been fully evaluated. Thus, we analysed the expression profiles of different clones of human hepatoma-derived Huh-7 cells carrying a self-replicating HCV RNA which express all viral proteins (HCV replicon system. Results First, we compared the expression profile of HCV replicon clone 21-5 with both the Huh-7 parental cells and the 21-5 cured (21-5c cells. In these latter, the HCV RNA has been eliminated by IFN-α treatment. To confirm data, we also analyzed microarray results from both the 21-5 and two other HCV replicon clones, 22-6 and 21-7, compared to the Huh-7 cells. The study was carried out by using the Applied Biosystems (AB Human Genome Survey Microarray v1.0 which provides 31,700 probes that correspond to 27,868 human genes. Microarray analysis revealed a specific transcriptional program induced by HCV in replicon cells respect to both IFN-α-cured and Huh-7 cells. From the original datasets of differentially expressed genes, we selected by Venn diagrams a final list of 38 genes modulated by HCV in all clones. Most of the 38 genes have never been described before and showed high fold-change associated with significant p-value, strongly supporting data reliability. Classification of the 38 genes by Panther System identified functional categories that were significantly enriched in this gene set, such as histones and ribosomal proteins as well as extracellular matrix and intracellular protein traffic. The dataset also included new genes involved in lipid metabolism, extracellular matrix and cytoskeletal network, which may be critical for HCV replication and pathogenesis. Conclusion Our data provide a comprehensive analysis of alterations in gene expression induced by HCV replication and reveal modulation of new genes potentially useful

  13. Gene expression profiling in respond to TBT exposure in small abalone Haliotis diversicolor.

    Science.gov (United States)

    Jia, Xiwei; Zou, Zhihua; Wang, Guodong; Wang, Shuhong; Wang, Yilei; Zhang, Ziping

    2011-10-01

    In this study, we investigated the gene expression profiling of small abalone, Haliotis diversicolor by tributyltin (TBT) exposure using a cDNA microarray containing 2473 unique transcripts. Totally, 107 up-regulated genes and 41 down-regulated genes were found. For further investigation of candidate genes from microarray data and EST analysis, quantitative real-time PCR was performed at 6 h, 24 h, 48 h, 96 h and 192 h TBT exposure. 26 genes were found to be significantly differentially expressed in different time course, 3 of them were unknown. Some gene homologues like cellulose, endo-beta-1,4-glucanase, ferritin subunit 1 and thiolester containing protein II CG7052-PB might be the good biomarker candidate for TBT monitor. The identification of stress response genes and their expression profiles will permit detailed investigation of the defense responses of small abalone genes. Published by Elsevier Ltd.

  14. Candidate Gene Identification of Flowering Time Genes in Cotton

    Directory of Open Access Journals (Sweden)

    Corrinne E. Grover

    2015-07-01

    Full Text Available Flowering time control is critically important to all sexually reproducing angiosperms in both natural ecological and agronomic settings. Accordingly, there is much interest in defining the genes involved in the complex flowering-time network and how these respond to natural and artificial selection, the latter often entailing transitions in day-length responses. Here we describe a candidate gene analysis in the cotton genus , which uses homologs from the well-described flowering network to bioinformatically and phylogenetically identify orthologs in the published genome sequence from Ulbr., one of the two model diploid progenitors of the commercially important allopolyploid cottons, L. and L. Presence and patterns of expression were evaluated from 13 aboveground tissues related to flowering for each of the candidate genes using allopolyploid as a model. Furthermore, we use a comparative context to determine copy number variability of each key gene family across 10 published angiosperm genomes. Data suggest a pattern of repeated loss of duplicates following ancient whole-genome doubling events in diverse lineages. The data presented here provide a foundation for understanding both the parallel evolution of day-length neutrality in domesticated cottons and the flowering-time network, in general, in this important crop plant.

  15. Candidate genes and molecular markers associated with heat tolerance in colonial Bentgrass.

    Science.gov (United States)

    Jespersen, David; Belanger, Faith C; Huang, Bingru

    2017-01-01

    Elevated temperature is a major abiotic stress limiting the growth of cool-season grasses during the summer months. The objectives of this study were to determine the genetic variation in the expression patterns of selected genes involved in several major metabolic pathways regulating heat tolerance for two genotypes contrasting in heat tolerance to confirm their status as potential candidate genes, and to identify PCR-based markers associated with candidate genes related to heat tolerance in a colonial (Agrostis capillaris L.) x creeping bentgrass (Agrostis stolonifera L.) hybrid backcross population. Plants were subjected to heat stress in controlled-environmental growth chambers for phenotypic evaluation and determination of genetic variation in candidate gene expression. Molecular markers were developed for genes involved in protein degradation (cysteine protease), antioxidant defense (catalase and glutathione-S-transferase), energy metabolism (glyceraldehyde-3-phosphate dehydrogenase), cell expansion (expansin), and stress protection (heat shock proteins HSP26, HSP70, and HSP101). Kruskal-Wallis analysis, a commonly used non-parametric test used to compare population individuals with or without the gene marker, found the physiological traits of chlorophyll content, electrolyte leakage, normalized difference vegetative index, and turf quality were associated with all candidate gene markers with the exception of HSP101. Differential gene expression was frequently found for the tested candidate genes. The development of candidate gene markers for important heat tolerance genes may allow for the development of new cultivars with increased abiotic stress tolerance using marker-assisted selection.

  16. Candidate genes and molecular markers associated with heat tolerance in colonial Bentgrass.

    Directory of Open Access Journals (Sweden)

    David Jespersen

    Full Text Available Elevated temperature is a major abiotic stress limiting the growth of cool-season grasses during the summer months. The objectives of this study were to determine the genetic variation in the expression patterns of selected genes involved in several major metabolic pathways regulating heat tolerance for two genotypes contrasting in heat tolerance to confirm their status as potential candidate genes, and to identify PCR-based markers associated with candidate genes related to heat tolerance in a colonial (Agrostis capillaris L. x creeping bentgrass (Agrostis stolonifera L. hybrid backcross population. Plants were subjected to heat stress in controlled-environmental growth chambers for phenotypic evaluation and determination of genetic variation in candidate gene expression. Molecular markers were developed for genes involved in protein degradation (cysteine protease, antioxidant defense (catalase and glutathione-S-transferase, energy metabolism (glyceraldehyde-3-phosphate dehydrogenase, cell expansion (expansin, and stress protection (heat shock proteins HSP26, HSP70, and HSP101. Kruskal-Wallis analysis, a commonly used non-parametric test used to compare population individuals with or without the gene marker, found the physiological traits of chlorophyll content, electrolyte leakage, normalized difference vegetative index, and turf quality were associated with all candidate gene markers with the exception of HSP101. Differential gene expression was frequently found for the tested candidate genes. The development of candidate gene markers for important heat tolerance genes may allow for the development of new cultivars with increased abiotic stress tolerance using marker-assisted selection.

  17. Shared probe design and existing microarray reanalysis using PICKY

    Directory of Open Access Journals (Sweden)

    Chou Hui-Hsien

    2010-04-01

    Full Text Available Abstract Background Large genomes contain families of highly similar genes that cannot be individually identified by microarray probes. This limitation is due to thermodynamic restrictions and cannot be resolved by any computational method. Since gene annotations are updated more frequently than microarrays, another common issue facing microarray users is that existing microarrays must be routinely reanalyzed to determine probes that are still useful with respect to the updated annotations. Results PICKY 2.0 can design shared probes for sets of genes that cannot be individually identified using unique probes. PICKY 2.0 uses novel algorithms to track sharable regions among genes and to strictly distinguish them from other highly similar but nontarget regions during thermodynamic comparisons. Therefore, PICKY does not sacrifice the quality of shared probes when choosing them. The latest PICKY 2.1 includes the new capability to reanalyze existing microarray probes against updated gene sets to determine probes that are still valid to use. In addition, more precise nonlinear salt effect estimates and other improvements are added, making PICKY 2.1 more versatile to microarray users. Conclusions Shared probes allow expressed gene family members to be detected; this capability is generally more desirable than not knowing anything about these genes. Shared probes also enable the design of cross-genome microarrays, which facilitate multiple species identification in environmental samples. The new nonlinear salt effect calculation significantly increases the precision of probes at a lower buffer salt concentration, and the probe reanalysis function improves existing microarray result interpretations.

  18. Microarray-based analysis of IncA/C plasmid-associated genes from multidrug-resistant Salmonella enterica.

    Science.gov (United States)

    Lindsey, Rebecca L; Frye, Jonathan G; Fedorka-Cray, Paula J; Meinersmann, Richard J

    2011-10-01

    In the family Enterobacteriaceae, plasmids have been classified according to 27 incompatibility (Inc) or replicon types that are based on the inability of different plasmids with the same replication mechanism to coexist in the same cell. Certain replicon types such as IncA/C are associated with multidrug resistance (MDR). We developed a microarray that contains 286 unique 70-mer oligonucleotide probes based on sequences from five IncA/C plasmids: pYR1 (Yersinia ruckeri), pPIP1202 (Yersinia pestis), pP99-018 (Photobacterium damselae), pSN254 (Salmonella enterica serovar Newport), and pP91278 (Photobacterium damselae). DNA from 59 Salmonella enterica isolates was hybridized to the microarray and analyzed for the presence or absence of genes. These isolates represented 17 serovars from 14 different animal hosts and from different geographical regions in the United States. Qualitative cluster analysis was performed using CLUSTER 3.0 to group microarray hybridization results. We found that IncA/C plasmids occurred in two lineages distinguished by a major insertion-deletion (indel) region that contains genes encoding mostly hypothetical proteins. The most variable genes were represented by transposon-associated genes as well as four antimicrobial resistance genes (aphA, merP, merA, and aadA). Sixteen mercury resistance genes were identified and highly conserved, suggesting that mercury ion-related exposure is a stronger pressure than anticipated. We used these data to construct a core IncA/C genome and an accessory genome. The results of our studies suggest that the transfer of antimicrobial resistance determinants by transfer of IncA/C plasmids is somewhat less common than exchange within the plasmids orchestrated by transposable elements, such as transposons, integrating and conjugative elements (ICEs), and insertion sequence common regions (ISCRs), and thus pose less opportunity for exchange of antimicrobial resistance.

  19. ArraySolver: an algorithm for colour-coded graphical display and Wilcoxon signed-rank statistics for comparing microarray gene expression data.

    Science.gov (United States)

    Khan, Haseeb Ahmad

    2004-01-01

    The massive surge in the production of microarray data poses a great challenge for proper analysis and interpretation. In recent years numerous computational tools have been developed to extract meaningful interpretation of microarray gene expression data. However, a convenient tool for two-groups comparison of microarray data is still lacking and users have to rely on commercial statistical packages that might be costly and require special skills, in addition to extra time and effort for transferring data from one platform to other. Various statistical methods, including the t-test, analysis of variance, Pearson test and Mann-Whitney U test, have been reported for comparing microarray data, whereas the utilization of the Wilcoxon signed-rank test, which is an appropriate test for two-groups comparison of gene expression data, has largely been neglected in microarray studies. The aim of this investigation was to build an integrated tool, ArraySolver, for colour-coded graphical display and comparison of gene expression data using the Wilcoxon signed-rank test. The results of software validation showed similar outputs with ArraySolver and SPSS for large datasets. Whereas the former program appeared to be more accurate for 25 or fewer pairs (n < or = 25), suggesting its potential application in analysing molecular signatures that usually contain small numbers of genes. The main advantages of ArraySolver are easy data selection, convenient report format, accurate statistics and the familiar Excel platform.

  20. Candidate genes for cross-resistance against DNA-damaging drugs

    DEFF Research Database (Denmark)

    Wittig, Rainer; Nessling, Michelle; Will, Rainer D

    2002-01-01

    Drug resistance of tumor cells leads to major drawbacks in the treatment of cancer. To identify candidate genes for drug resistance, we compared the expression patterns of the drug-sensitive human malignant melanoma cell line MeWo and three derived sublines with acquired resistance to the DNA...... as several apoptosis-related genes, in particular STK17A and CRYAB. As MPP1 and CRYAB are also among the 14 genes differentially expressed in all three of the drug-resistant sublines, they represent the strongest candidates for resistance against DNA-damaging drugs....

  1. Development, characterization and experimental validation of a cultivated sunflower (Helianthus annuus L.) gene expression oligonucleotide microarray.

    Science.gov (United States)

    Fernandez, Paula; Soria, Marcelo; Blesa, David; DiRienzo, Julio; Moschen, Sebastian; Rivarola, Maximo; Clavijo, Bernardo Jose; Gonzalez, Sergio; Peluffo, Lucila; Príncipi, Dario; Dosio, Guillermo; Aguirrezabal, Luis; García-García, Francisco; Conesa, Ana; Hopp, Esteban; Dopazo, Joaquín; Heinz, Ruth Amelia; Paniego, Norma

    2012-01-01

    Oligonucleotide-based microarrays with accurate gene coverage represent a key strategy for transcriptional studies in orphan species such as sunflower, H. annuus L., which lacks full genome sequences. The goal of this study was the development and functional annotation of a comprehensive sunflower unigene collection and the design and validation of a custom sunflower oligonucleotide-based microarray. A large scale EST (>130,000 ESTs) curation, assembly and sequence annotation was performed using Blast2GO (www.blast2go.de). The EST assembly comprises 41,013 putative transcripts (12,924 contigs and 28,089 singletons). The resulting Sunflower Unigen Resource (SUR version 1.0) was used to design an oligonucleotide-based Agilent microarray for cultivated sunflower. This microarray includes a total of 42,326 features: 1,417 Agilent controls, 74 control probes for sunflower replicated 10 times (740 controls) and 40,169 different non-control probes. Microarray performance was validated using a model experiment examining the induction of senescence by water deficit. Pre-processing and differential expression analysis of Agilent microarrays was performed using the Bioconductor limma package. The analyses based on p-values calculated by eBayes (psunflower unigene collection, and a custom, validated sunflower oligonucleotide-based microarray using Agilent technology. Both the curated unigene collection and the validated oligonucleotide microarray provide key resources for sunflower genome analysis, transcriptional studies, and molecular breeding for crop improvement.

  2. Gene expression signature analysis identifies vorinostat as a candidate therapy for gastric cancer.

    Directory of Open Access Journals (Sweden)

    Sofie Claerhout

    Full Text Available Gastric cancer continues to be one of the deadliest cancers in the world and therefore identification of new drugs targeting this type of cancer is thus of significant importance. The purpose of this study was to identify and validate a therapeutic agent which might improve the outcomes for gastric cancer patients in the future.Using microarray technology, we generated a gene expression profile of human gastric cancer-specific genes from human gastric cancer tissue samples. We used this profile in the Broad Institute's Connectivity Map analysis to identify candidate therapeutic compounds for gastric cancer. We found the histone deacetylase inhibitor vorinostat as the lead compound and thus a potential therapeutic drug for gastric cancer. Vorinostat induced both apoptosis and autophagy in gastric cancer cell lines. Pharmacological and genetic inhibition of autophagy however, increased the therapeutic efficacy of vorinostat, indicating that a combination of vorinostat with autophagy inhibitors may therapeutically be more beneficial. Moreover, gene expression analysis of gastric cancer identified a collection of genes (ITGB5, TYMS, MYB, APOC1, CBX5, PLA2G2A, and KIF20A whose expression was elevated in gastric tumor tissue and downregulated more than 2-fold by vorinostat treatment in gastric cancer cell lines. In contrast, SCGB2A1, TCN1, CFD, APLP1, and NQO1 manifested a reversed pattern.We showed that analysis of gene expression signature may represent an emerging approach to discover therapeutic agents for gastric cancer, such as vorinostat. The observation of altered gene expression after vorinostat treatment may provide the clue to identify the molecular mechanism of vorinostat and those patients likely to benefit from vorinostat treatment.

  3. Kernel-imbedded Gaussian processes for disease classification using microarray gene expression data

    Directory of Open Access Journals (Sweden)

    Cheung Leo

    2007-02-01

    Full Text Available Abstract Background Designing appropriate machine learning methods for identifying genes that have a significant discriminating power for disease outcomes has become more and more important for our understanding of diseases at genomic level. Although many machine learning methods have been developed and applied to the area of microarray gene expression data analysis, the majority of them are based on linear models, which however are not necessarily appropriate for the underlying connection between the target disease and its associated explanatory genes. Linear model based methods usually also bring in false positive significant features more easily. Furthermore, linear model based algorithms often involve calculating the inverse of a matrix that is possibly singular when the number of potentially important genes is relatively large. This leads to problems of numerical instability. To overcome these limitations, a few non-linear methods have recently been introduced to the area. Many of the existing non-linear methods have a couple of critical problems, the model selection problem and the model parameter tuning problem, that remain unsolved or even untouched. In general, a unified framework that allows model parameters of both linear and non-linear models to be easily tuned is always preferred in real-world applications. Kernel-induced learning methods form a class of approaches that show promising potentials to achieve this goal. Results A hierarchical statistical model named kernel-imbedded Gaussian process (KIGP is developed under a unified Bayesian framework for binary disease classification problems using microarray gene expression data. In particular, based on a probit regression setting, an adaptive algorithm with a cascading structure is designed to find the appropriate kernel, to discover the potentially significant genes, and to make the optimal class prediction accordingly. A Gibbs sampler is built as the core of the algorithm to make

  4. Combined analysis of DNA methylome and transcriptome reveal novel candidate genes with susceptibility to bovine Staphylococcus aureus subclinical mastitis.

    Science.gov (United States)

    Song, Minyan; He, Yanghua; Zhou, Huangkai; Zhang, Yi; Li, Xizhi; Yu, Ying

    2016-07-14

    Subclinical mastitis is a widely spread disease of lactating cows. Its major pathogen is Staphylococcus aureus (S. aureus). In this study, we performed genome-wide integrative analysis of DNA methylation and transcriptional expression to identify candidate genes and pathways relevant to bovine S. aureus subclinical mastitis. The genome-scale DNA methylation profiles of peripheral blood lymphocytes in cows with S. aureus subclinical mastitis (SA group) and healthy controls (CK) were generated by methylated DNA immunoprecipitation combined with microarrays. We identified 1078 differentially methylated genes in SA cows compared with the controls. By integrating DNA methylation and transcriptome data, 58 differentially methylated genes were shared with differently expressed genes, in which 20.7% distinctly hypermethylated genes showed down-regulated expression in SA versus CK, whereas 14.3% dramatically hypomethylated genes showed up-regulated expression. Integrated pathway analysis suggested that these genes were related to inflammation, ErbB signalling pathway and mismatch repair. Further functional analysis revealed that three genes, NRG1, MST1 and NAT9, were strongly correlated with the progression of S. aureus subclinical mastitis and could be used as powerful biomarkers for the improvement of bovine mastitis resistance. Our studies lay the groundwork for epigenetic modification and mechanistic studies on susceptibility of bovine mastitis.

  5. Sparse canonical correlation analysis for identifying, connecting and completing gene-expression networks

    NARCIS (Netherlands)

    Waaijenborg, S.; Zwinderman, A.H.

    2009-01-01

    ABSTRACT: BACKGROUND: We generalized penalized canonical correlation analysis for analyzing microarray gene-expression measurements for checking completeness of known metabolic pathways and identifying candidate genes for incorporation in the pathway. We used Wold's method for calculation of the

  6. Evaluation of toxicity of the mycotoxin citrinin using yeast ORF DNA microarray and Oligo DNA microarray

    Directory of Open Access Journals (Sweden)

    Nobumasa Hitoshi

    2007-04-01

    Full Text Available Abstract Background Mycotoxins are fungal secondary metabolites commonly present in feed and food, and are widely regarded as hazardous contaminants. Citrinin, one of the very well known mycotoxins that was first isolated from Penicillium citrinum, is produced by more than 10 kinds of fungi, and is possibly spread all over the world. However, the information on the action mechanism of the toxin is limited. Thus, we investigated the citrinin-induced genomic response for evaluating its toxicity. Results Citrinin inhibited growth of yeast cells at a concentration higher than 100 ppm. We monitored the citrinin-induced mRNA expression profiles in yeast using the ORF DNA microarray and Oligo DNA microarray, and the expression profiles were compared with those of the other stress-inducing agents. Results obtained from both microarray experiments clustered together, but were different from those of the mycotoxin patulin. The oxidative stress response genes – AADs, FLR1, OYE3, GRE2, and MET17 – were significantly induced. In the functional category, expression of genes involved in "metabolism", "cell rescue, defense and virulence", and "energy" were significantly activated. In the category of "metabolism", genes involved in the glutathione synthesis pathway were activated, and in the category of "cell rescue, defense and virulence", the ABC transporter genes were induced. To alleviate the induced stress, these cells might pump out the citrinin after modification with glutathione. While, the citrinin treatment did not induce the genes involved in the DNA repair. Conclusion Results from both microarray studies suggest that citrinin treatment induced oxidative stress in yeast cells. The genotoxicity was less severe than the patulin, suggesting that citrinin is less toxic than patulin. The reproducibility of the expression profiles was much better with the Oligo DNA microarray. However, the Oligo DNA microarray did not completely overcome cross

  7. Annotating breast cancer microarray samples using ontologies

    Science.gov (United States)

    Liu, Hongfang; Li, Xin; Yoon, Victoria; Clarke, Robert

    2008-01-01

    As the most common cancer among women, breast cancer results from the accumulation of mutations in essential genes. Recent advance in high-throughput gene expression microarray technology has inspired researchers to use the technology to assist breast cancer diagnosis, prognosis, and treatment prediction. However, the high dimensionality of microarray experiments and public access of data from many experiments have caused inconsistencies which initiated the development of controlled terminologies and ontologies for annotating microarray experiments, such as the standard microarray Gene Expression Data (MGED) ontology (MO). In this paper, we developed BCM-CO, an ontology tailored specifically for indexing clinical annotations of breast cancer microarray samples from the NCI Thesaurus. Our research showed that the coverage of NCI Thesaurus is very limited with respect to i) terms used by researchers to describe breast cancer histology (covering 22 out of 48 histology terms); ii) breast cancer cell lines (covering one out of 12 cell lines); and iii) classes corresponding to the breast cancer grading and staging. By incorporating a wider range of those terms into BCM-CO, we were able to indexed breast cancer microarray samples from GEO using BCM-CO and MGED ontology and developed a prototype system with web interface that allows the retrieval of microarray data based on the ontology annotations. PMID:18999108

  8. Candidate genes for COPD: current evidence and research

    Directory of Open Access Journals (Sweden)

    Kim WJ

    2015-10-01

    Full Text Available Woo Jin Kim,1 Sang Do Lee2 1Department of Internal Medicine and Environmental Health Center, Kangwon National University, Chuncheon, 2Department of Pulmonary and Critical Care Medicine, Clinical Research Center for Chronic Obstructive Airway Diseases, Asan Medical Center, University of Ulsan College of Medicine, Seoul, South Korea Abstract: COPD is a common complex disease characterized by progressive airflow limitation. Several genome-wide association studies (GWASs have discovered genes that are associated with COPD. Recently, candidate genes for COPD identified by GWASs include CHRNA3/5 (cholinergic nicotine receptor alpha 3/5, IREB2 (iron regulatory binding protein 2, HHIP (hedgehog-interacting protein, FAM13A (family with sequence similarity 13, member A, and AGER (advanced glycosylation end product–specific receptor. Their association with COPD susceptibility has been replicated in multiple populations. Since these candidate genes have not been considered in COPD, their pathological roles are still largely unknown. Herein, we review some evidences that they can be effective drug targets or serve as biomarkers for diagnosis or subtyping. However, more study is required to understand the functional roles of these candidate genes. Future research is needed to characterize the effect of genetic variants, validate gene function in humans and model systems, and elucidate the genes’ transcriptional and posttranscriptional regulatory mechanisms. Keywords: chronic obstructive pulmonary disease, genetics, genome-wide association study

  9. Characterization of Gene Candidates for Vacuolar Sodium Transport from Hordeum Vulgare

    KAUST Repository

    Scheu, Arne Hagen August

    2017-05-01

    Soil salinity is a major abiotic stress for land plants, and multiple mechanisms of salt tolerance have evolved. Tissue tolerance is one of these mechanisms, which involves the sequestration of sodium into the vacuole to retain low cytosolic sodium concentrations. This enables the plant to maintain cellular functions, and ultimately maintain growth and yield. However, the molecular components involved in tissue tolerance remain elusive. Several candidate genes for vacuolar sodium sequestration have recently been identified by proteome analysis of vacuolar membranes purified from the salt-tolerant cereal Hordeum vulgare (barley). In this study, I aimed to characterize these candidates in more detail. I successfully cloned coding sequences for the majority of candidate genes with primers designed based on the barley reference genome sequence. During the course of this study a newer genome sequence with improved annotations was published, to which I also compared my observations. To study the candidate genes, I used the heterologous expression system Saccharomyces cerevisiae (yeast). I used several salt sensitive yeast strains (deficient in intrinsic sodium transporters) to test whether the candidate genes would affect their salt tolerance by mediating the sequestration of sodium into the yeast vacuole. I observed a reduction in growth upon expression for several of the gene candidate under salt-stress conditions. However, confocal microscopy suggests that most gene products are subject to degradation, and did not localize to the vacuolar membrane (tonoplast). Therefore, growth effects cannot be linked to protein function without further evidence. Various potential causes are discussed, including inaccuracies in the genome resource used as reference for primer design and issues inherent to the model system. Finally, I make suggestions on how to proceed to further characterize the candidate genes and hopefully identify novel sodium transporters from barley.

  10. Inferring Gene Regulatory Networks Using Conditional Regulation Pattern to Guide Candidate Genes.

    Directory of Open Access Journals (Sweden)

    Fei Xiao

    Full Text Available Combining path consistency (PC algorithms with conditional mutual information (CMI are widely used in reconstruction of gene regulatory networks. CMI has many advantages over Pearson correlation coefficient in measuring non-linear dependence to infer gene regulatory networks. It can also discriminate the direct regulations from indirect ones. However, it is still a challenge to select the conditional genes in an optimal way, which affects the performance and computation complexity of the PC algorithm. In this study, we develop a novel conditional mutual information-based algorithm, namely RPNI (Regulation Pattern based Network Inference, to infer gene regulatory networks. For conditional gene selection, we define the co-regulation pattern, indirect-regulation pattern and mixture-regulation pattern as three candidate patterns to guide the selection of candidate genes. To demonstrate the potential of our algorithm, we apply it to gene expression data from DREAM challenge. Experimental results show that RPNI outperforms existing conditional mutual information-based methods in both accuracy and time complexity for different sizes of gene samples. Furthermore, the robustness of our algorithm is demonstrated by noisy interference analysis using different types of noise.

  11. An Efficient Ensemble Learning Method for Gene Microarray Classification

    Directory of Open Access Journals (Sweden)

    Alireza Osareh

    2013-01-01

    Full Text Available The gene microarray analysis and classification have demonstrated an effective way for the effective diagnosis of diseases and cancers. However, it has been also revealed that the basic classification techniques have intrinsic drawbacks in achieving accurate gene classification and cancer diagnosis. On the other hand, classifier ensembles have received increasing attention in various applications. Here, we address the gene classification issue using RotBoost ensemble methodology. This method is a combination of Rotation Forest and AdaBoost techniques which in turn preserve both desirable features of an ensemble architecture, that is, accuracy and diversity. To select a concise subset of informative genes, 5 different feature selection algorithms are considered. To assess the efficiency of the RotBoost, other nonensemble/ensemble techniques including Decision Trees, Support Vector Machines, Rotation Forest, AdaBoost, and Bagging are also deployed. Experimental results have revealed that the combination of the fast correlation-based feature selection method with ICA-based RotBoost ensemble is highly effective for gene classification. In fact, the proposed method can create ensemble classifiers which outperform not only the classifiers produced by the conventional machine learning but also the classifiers generated by two widely used conventional ensemble learning methods, that is, Bagging and AdaBoost.

  12. A microarray analysis of sex- and gonad-biased gene expression in the zebrafish: Evidence for masculinization of the transcriptome

    Directory of Open Access Journals (Sweden)

    Mo Qianxing

    2009-12-01

    Full Text Available Abstract Background In many taxa, males and females are very distinct phenotypically, and these differences often reflect divergent selective pressures acting on the sexes. Phenotypic sexual dimorphism almost certainly reflects differing patterns of gene expression between the sexes, and microarray studies have documented widespread sexually dimorphic gene expression. Although the evolutionary significance of sexual dimorphism in gene expression remains unresolved, these studies have led to the formulation of a hypothesis that male-driven evolution has resulted in the masculinization of animal transcriptomes. Here we use a microarray assessment of sex- and gonad-biased gene expression to test this hypothesis in zebrafish. Results By using zebrafish Affymetrix microarrays to compare gene expression patterns in male and female somatic and gonadal tissues, we identified a large number of genes (5899 demonstrating differences in transcript abundance between male and female Danio rerio. Under conservative statistical significance criteria, all sex-biases in gene expression were due to differences between testes and ovaries. Male-enriched genes were more abundant than female-enriched genes, and expression bias for male-enriched genes was greater in magnitude than that for female-enriched genes. We also identified a large number of genes demonstrating elevated transcript abundance in testes and ovaries relative to male body and female body, respectively. Conclusion Overall our results support the hypothesis that male-biased evolutionary pressures have resulted in male-biased patterns of gene expression. Interestingly, our results seem to be at odds with a handful of other microarray-based studies of sex-specific gene expression patterns in zebrafish. However, ours was the only study designed to address this specific hypothesis, and major methodological differences among studies could explain the discrepancies. Regardless, all of these studies agree

  13. Systematic identification and validation of candidate genes for detection of circulating tumor cells in peripheral blood specimens of colorectal cancer patients.

    Science.gov (United States)

    Findeisen, Peter; Röckel, Matthias; Nees, Matthias; Röder, Christian; Kienle, Peter; Von Knebel Doeberitz, Magnus; Kalthoff, Holger; Neumaier, Michael

    2008-11-01

    The presence of tumor cells in peripheral blood is being regarded increasingly as a clinically relevant prognostic factor for colorectal cancer patients. Current molecular methods are very sensitive but due to low specificity their diagnostic value is limited. This study was undertaken in order to systematically identify and validate new colorectal cancer (CRC) marker genes for improved detection of minimal residual disease in peripheral blood mononuclear cells of colorectal cancer patients. Marker genes with upregulated gene expression in colorectal cancer tissue and cell lines were identified using microarray experiments and publicly available gene expression data. A systematic iterative approach was used to reduce a set of 346 candidate genes, reportedly associated with CRC to a selection of candidate genes that were then further validated by relative quantitative real-time RT-PCR. Analytical sensitivity of RT-PCR assays was determined by spiking experiments with CRC cells. Diagnostic sensitivity as well as specificity was tested on a control group consisting of 18 CRC patients compared to 12 individuals without malignant disease. From a total of 346-screened genes only serine (or cysteine) proteinase inhibitor, clade B (ovalbumin), member 5 (SERPINB5) showed significantly elevated transcript levels in peripheral venous blood specimens of tumor patients when compared to the nonmalignant control group. These results were confirmed by analysis of an enlarged collective consisting of 63 CRC patients and 36 control individuals without malignant disease. In conclusion SERPINB5 seems to be a promising marker for detection of circulating tumor cells in peripheral blood of colorectal cancer patients.

  14. Global microarray analysis of carbohydrate use in alkaliphilic hemicellulolytic bacterium Bacillus sp. N16-5.

    Directory of Open Access Journals (Sweden)

    Yajian Song

    Full Text Available The alkaliphilic hemicellulolytic bacterium Bacillus sp. N16-5 has a broad substrate spectrum and exhibits the capacity to utilize complex carbohydrates such as galactomannan, xylan, and pectin. In the monosaccharide mixture, sequential utilization by Bacillus sp. N16-5 was observed. Glucose appeared to be its preferential monosaccharide, followed by fructose, mannose, arabinose, xylose, and galactose. Global transcription profiles of the strain were determined separately for growth on six monosaccharides (glucose, fructose, mannose, galactose, arabinose, and xylose and four polysaccharides (galactomannan, xylan, pectin, and sodium carboxymethylcellulose using one-color microarrays. Numerous genes potentially related to polysaccharide degradation, sugar transport, and monosaccharide metabolism were found to respond to a specific substrate. Putative gene clusters for different carbohydrates were identified according to transcriptional patterns and genome annotation. Identification and analysis of these gene clusters contributed to pathway reconstruction for carbohydrate utilization in Bacillus sp. N16-5. Several genes encoding putative sugar transporters were highly expressed during growth on specific sugars, suggesting their functional roles. Two phosphoenolpyruvate-dependent phosphotransferase systems were identified as candidate transporters for mannose and fructose, and a major facilitator superfamily transporter was identified as a candidate transporter for arabinose and xylose. Five carbohydrate uptake transporter 1 family ATP-binding cassette transporters were predicted to participate in the uptake of hemicellulose and pectin degradation products. Collectively, microarray data improved the pathway reconstruction involved in carbohydrate utilization of Bacillus sp. N16-5 and revealed that the organism precisely regulates gene transcription in response to fluctuations in energy resources.

  15. Global Microarray Analysis of Carbohydrate Use in Alkaliphilic Hemicellulolytic Bacterium Bacillus sp. N16-5

    Science.gov (United States)

    Song, Yajian; Xue, Yanfen; Ma, Yanhe

    2013-01-01

    The alkaliphilic hemicellulolytic bacterium Bacillus sp. N16-5 has a broad substrate spectrum and exhibits the capacity to utilize complex carbohydrates such as galactomannan, xylan, and pectin. In the monosaccharide mixture, sequential utilization by Bacillus sp. N16-5 was observed. Glucose appeared to be its preferential monosaccharide, followed by fructose, mannose, arabinose, xylose, and galactose. Global transcription profiles of the strain were determined separately for growth on six monosaccharides (glucose, fructose, mannose, galactose, arabinose, and xylose) and four polysaccharides (galactomannan, xylan, pectin, and sodium carboxymethylcellulose) using one-color microarrays. Numerous genes potentially related to polysaccharide degradation, sugar transport, and monosaccharide metabolism were found to respond to a specific substrate. Putative gene clusters for different carbohydrates were identified according to transcriptional patterns and genome annotation. Identification and analysis of these gene clusters contributed to pathway reconstruction for carbohydrate utilization in Bacillus sp. N16-5. Several genes encoding putative sugar transporters were highly expressed during growth on specific sugars, suggesting their functional roles. Two phosphoenolpyruvate-dependent phosphotransferase systems were identified as candidate transporters for mannose and fructose, and a major facilitator superfamily transporter was identified as a candidate transporter for arabinose and xylose. Five carbohydrate uptake transporter 1 family ATP-binding cassette transporters were predicted to participate in the uptake of hemicellulose and pectin degradation products. Collectively, microarray data improved the pathway reconstruction involved in carbohydrate utilization of Bacillus sp. N16-5 and revealed that the organism precisely regulates gene transcription in response to fluctuations in energy resources. PMID:23326578

  16. Dynamic compression of chondrocyte-agarose constructs reveals new candidate mechanosensitive genes.

    Directory of Open Access Journals (Sweden)

    Carole Bougault

    Full Text Available Articular cartilage is physiologically exposed to repeated loads. The mechanical properties of cartilage are due to its extracellular matrix, and homeostasis is maintained by the sole cell type found in cartilage, the chondrocyte. Although mechanical forces clearly control the functions of articular chondrocytes, the biochemical pathways that mediate cellular responses to mechanical stress have not been fully characterised. The aim of our study was to examine early molecular events triggered by dynamic compression in chondrocytes. We used an experimental system consisting of primary mouse chondrocytes embedded within an agarose hydrogel; embedded cells were pre-cultured for one week and subjected to short-term compression experiments. Using Western blots, we demonstrated that chondrocytes maintain a differentiated phenotype in this model system and reproduce typical chondrocyte-cartilage matrix interactions. We investigated the impact of dynamic compression on the phosphorylation state of signalling molecules and genome-wide gene expression. After 15 min of dynamic compression, we observed transient activation of ERK1/2 and p38 (members of the mitogen-activated protein kinase (MAPK pathways and Smad2/3 (members of the canonical transforming growth factor (TGF-β pathways. A microarray analysis performed on chondrocytes compressed for 30 min revealed that only 20 transcripts were modulated more than 2-fold. A less conservative list of 325 modulated genes included genes related to the MAPK and TGF-β pathways and/or known to be mechanosensitive in other biological contexts. Of these candidate mechanosensitive genes, 85% were down-regulated. Down-regulation may therefore represent a general control mechanism for a rapid response to dynamic compression. Furthermore, modulation of transcripts corresponding to different aspects of cellular physiology was observed, such as non-coding RNAs or primary cilium. This study provides new insight into how

  17. Generating Genome-Scale Candidate Gene Lists for Pharmacogenomics

    DEFF Research Database (Denmark)

    Hansen, Niclas Tue; Brunak, Søren; Altman, R. B.

    2009-01-01

    A critical task in pharmacogenomics is identifying genes that may be important modulators of drug response. High-throughput experimental methods are often plagued by false positives and do not take advantage of existing knowledge. Candidate gene lists can usefully summarize existing knowledge...

  18. Carbon nanoparticles as detection labels in antibody microarrays. Detection of genes encoding virulence factors in Shiga toxin-producing Escherichia coli.

    NARCIS (Netherlands)

    Noguera, P.S.; Posthuma-Trumpie, G.A.; Tuil, Van M.; Wal, van der F.J.; Boer, De A.; Moers, A.P.H.A.; Amerongen, Van A.

    2011-01-01

    The present study demonstrates that carbon nanoparticles (CNPs) can be used as labels in microarrays. CNPs were used in nucleic acid microarray immunoassays (NAMIAs) for the detection of different Shiga toxin-producing Escherichia coli (STEC) virulence factors: four genes specific for STEC (vt1,

  19. The use of microarrays in microbial ecology

    Energy Technology Data Exchange (ETDEWEB)

    Andersen, G.L.; He, Z.; DeSantis, T.Z.; Brodie, E.L.; Zhou, J.

    2009-09-15

    Microarrays have proven to be a useful and high-throughput method to provide targeted DNA sequence information for up to many thousands of specific genetic regions in a single test. A microarray consists of multiple DNA oligonucleotide probes that, under high stringency conditions, hybridize only to specific complementary nucleic acid sequences (targets). A fluorescent signal indicates the presence and, in many cases, the abundance of genetic regions of interest. In this chapter we will look at how microarrays are used in microbial ecology, especially with the recent increase in microbial community DNA sequence data. Of particular interest to microbial ecologists, phylogenetic microarrays are used for the analysis of phylotypes in a community and functional gene arrays are used for the analysis of functional genes, and, by inference, phylotypes in environmental samples. A phylogenetic microarray that has been developed by the Andersen laboratory, the PhyloChip, will be discussed as an example of a microarray that targets the known diversity within the 16S rRNA gene to determine microbial community composition. Using multiple, confirmatory probes to increase the confidence of detection and a mismatch probe for every perfect match probe to minimize the effect of cross-hybridization by non-target regions, the PhyloChip is able to simultaneously identify any of thousands of taxa present in an environmental sample. The PhyloChip is shown to reveal greater diversity within a community than rRNA gene sequencing due to the placement of the entire gene product on the microarray compared with the analysis of up to thousands of individual molecules by traditional sequencing methods. A functional gene array that has been developed by the Zhou laboratory, the GeoChip, will be discussed as an example of a microarray that dynamically identifies functional activities of multiple members within a community. The recent version of GeoChip contains more than 24,000 50mer

  20. Missing value imputation for microarray gene expression data using histone acetylation information

    Directory of Open Access Journals (Sweden)

    Feng Jihua

    2008-05-01

    Full Text Available Abstract Background It is an important pre-processing step to accurately estimate missing values in microarray data, because complete datasets are required in numerous expression profile analysis in bioinformatics. Although several methods have been suggested, their performances are not satisfactory for datasets with high missing percentages. Results The paper explores the feasibility of doing missing value imputation with the help of gene regulatory mechanism. An imputation framework called histone acetylation information aided imputation method (HAIimpute method is presented. It incorporates the histone acetylation information into the conventional KNN(k-nearest neighbor and LLS(local least square imputation algorithms for final prediction of the missing values. The experimental results indicated that the use of acetylation information can provide significant improvements in microarray imputation accuracy. The HAIimpute methods consistently improve the widely used methods such as KNN and LLS in terms of normalized root mean squared error (NRMSE. Meanwhile, the genes imputed by HAIimpute methods are more correlated with the original complete genes in terms of Pearson correlation coefficients. Furthermore, the proposed methods also outperform GOimpute, which is one of the existing related methods that use the functional similarity as the external information. Conclusion We demonstrated that the using of histone acetylation information could greatly improve the performance of the imputation especially at high missing percentages. This idea can be generalized to various imputation methods to facilitate the performance. Moreover, with more knowledge accumulated on gene regulatory mechanism in addition to histone acetylation, the performance of our approach can be further improved and verified.

  1. Quantitative profiling of housekeeping and Epstein-Barr virus gene transcription in Burkitt lymphoma cell lines using an oligonucleotide microarray

    Directory of Open Access Journals (Sweden)

    Niggli Felix K

    2006-06-01

    Full Text Available Abstract Background The Epstein-Barr virus (EBV is associated with lymphoid malignancies, including Burkitt's lymphoma (BL, and can transform human B cells in vitro. EBV-harboring cell lines are widely used to investigate lymphocyte transformation and oncogenesis. Qualitative EBV gene expression has been extensively described, but knowledge of quantitative transcription is lacking. We hypothesized that transcription levels of EBNA1, the gene essential for EBV persistence within an infected cell, are similar in BL cell lines. Results To compare quantitative gene transcription in the BL cell lines Namalwa, Raji, Akata, Jijoye, and P3HR1, we developed an oligonucleotide microarray chip, including 17 housekeeping genes, six latent EBV genes (EBNA1, EBNA2, EBNA3A, EBNA3C, LMP1, LMP2, and four lytic EBV genes (BZLF1, BXLF2, BKRF2, BZLF2, and used the cell line B95.8 as a reference for EBV gene transcription. Quantitative polymerase chain reaction assays were used to validate microarray results. We found that transcription levels of housekeeping genes differed considerably among BL cell lines. Using a selection of housekeeping genes with similar quantitative transcription in the tested cell lines to normalize EBV gene transcription data, we showed that transcription levels of EBNA1 were quite similar in very different BL cell lines, in contrast to transcription levels of other EBV genes. As demonstrated with Akata cells, the chip allowed us to accurately measure EBV gene transcription changes triggered by treatment interventions. Conclusion Our results suggest uniform EBNA1 transcription levels in BL and that microarray profiling can reveal novel insights on quantitative EBV gene transcription and its impact on lymphocyte biology.

  2. Characterisation of five candidate genes within the ETEC F4ab/ac candidate region in pigs

    DEFF Research Database (Denmark)

    Jacobsen, Mette Juul; Cirera Salicio, Susanna; Joller, David

    2011-01-01

    by haplotype sharing to a 2.5 Mb region on pig chromosome 13, a region containing 18 annotated genes. FINDINGS: The coding regions of five candidate genes for susceptibility to ETEC F4ab/ac infection (TFRC, ACK1, MUC20, MUC4 and KIAA0226), all located in the 2.5 Mb region, were investigated for the presence...... polymorphism in exon 22 of KIAA0226. Transcriptional profiles of the five genes were investigated in a porcine tissue panel including various intestinal tissues. All five genes were expressed in intestinal tissues at different levels but none of the genes were found differentially expressed between ETEC F4ab/ac...... of the amino acids composition. However, we cannot exclude that the five tested genes are bona fide candidate genes for susceptibility to ETEC F4ab/ac infection since the identified polymorphism might affect the translational apparatus, alternative splice forms may exist and post translational mechanisms might...

  3. Web tools for the prioritization of candidate disease genes.

    NARCIS (Netherlands)

    Oti, M.O.; Ballouz, S.; Wouters, M.A.

    2011-01-01

    Despite increasing sequencing capacity, genetic disease investigation still frequently results in the identification of loci containing multiple candidate disease genes that need to be tested for involvement in the disease. This process can be expedited by prioritizing the candidates prior to

  4. Microarray Analysis of Gene Expression Alteration in Human Middle Ear Epithelial Cells Induced by Asian Sand Dust.

    Science.gov (United States)

    Go, Yoon Young; Park, Moo Kyun; Kwon, Jee Young; Seo, Young Rok; Chae, Sung-Won; Song, Jae-Jun

    2015-12-01

    The primary aim of this study is to evaluate the gene expression profile of Asian sand dust (ASD)-treated human middle ear epithelial cell (HMEEC) using microarray analysis. The HMEEC was treated with ASD (400 µg/mL) and total RNA was extracted for microarray analysis. Molecular pathways among differentially expressed genes were further analyzed. For selected genes, the changes in gene expression were confirmed by real-time polymerase chain reaction. A total of 1,274 genes were differentially expressed by ASD. Among them, 1,138 genes were 2 folds up-regulated, whereas 136 genes were 2 folds down-regulated. Up-regulated genes were mainly involved in cellular processes, including apoptosis, cell differentiation, and cell proliferation. Down-regulated genes affected cellular processes, including apoptosis, cell cycle, cell differentiation, and cell proliferation. The 10 genes including ADM, CCL5, EDN1, EGR1, FOS, GHRL, JUN, SOCS3, TNF, and TNFSF10 were identified as main modulators in up-regulated genes. A total of 11 genes including CSF3, DKK1, FOSL1, FST, TERT, MMP13, PTHLH, SPRY2, TGFBR2, THBS1, and TIMP1 acted as main components of pathway associated with 2-fold down regulated genes. We identified the differentially expressed genes in ASD-treated HMEEC. Our work indicates that air pollutant like ASD, may play an important role in the pathogenesis of otitis media.

  5. Gene Expression Signature Analysis Identifies Vorinostat as a Candidate Therapy for Gastric Cancer

    Science.gov (United States)

    Choi, Woonyoung; Park, Yun-Yong; Kim, KyoungHyun; Kim, Sang-Bae; Lee, Ju-Seog; Mills, Gordon B.; Cho, Jae Yong

    2011-01-01

    Background Gastric cancer continues to be one of the deadliest cancers in the world and therefore identification of new drugs targeting this type of cancer is thus of significant importance. The purpose of this study was to identify and validate a therapeutic agent which might improve the outcomes for gastric cancer patients in the future. Methodology/Principal Findings Using microarray technology, we generated a gene expression profile of human gastric cancer–specific genes from human gastric cancer tissue samples. We used this profile in the Broad Institute's Connectivity Map analysis to identify candidate therapeutic compounds for gastric cancer. We found the histone deacetylase inhibitor vorinostat as the lead compound and thus a potential therapeutic drug for gastric cancer. Vorinostat induced both apoptosis and autophagy in gastric cancer cell lines. Pharmacological and genetic inhibition of autophagy however, increased the therapeutic efficacy of vorinostat, indicating that a combination of vorinostat with autophagy inhibitors may therapeutically be more beneficial. Moreover, gene expression analysis of gastric cancer identified a collection of genes (ITGB5, TYMS, MYB, APOC1, CBX5, PLA2G2A, and KIF20A) whose expression was elevated in gastric tumor tissue and downregulated more than 2-fold by vorinostat treatment in gastric cancer cell lines. In contrast, SCGB2A1, TCN1, CFD, APLP1, and NQO1 manifested a reversed pattern. Conclusions/Significance We showed that analysis of gene expression signature may represent an emerging approach to discover therapeutic agents for gastric cancer, such as vorinostat. The observation of altered gene expression after vorinostat treatment may provide the clue to identify the molecular mechanism of vorinostat and those patients likely to benefit from vorinostat treatment. PMID:21931799

  6. Identification of genes highly downregulated in pancreatic cancer through a meta-analysis of microarray datasets: implications for discovery of novel tumor-suppressor genes and therapeutic targets.

    Science.gov (United States)

    Goonesekere, Nalin C W; Andersen, Wyatt; Smith, Alex; Wang, Xiaosheng

    2018-02-01

    The lack of specific symptoms at early tumor stages, together with a high biological aggressiveness of the tumor contribute to the high mortality rate for pancreatic cancer (PC), which has a 5-year survival rate of about 7%. Recent failures of targeted therapies inhibiting kinase activity in clinical trials have highlighted the need for new approaches towards combating this deadly disease. In this study, we have identified genes that are significantly downregulated in PC, through a meta-analysis of large number of microarray datasets. We have used qRT-PCR to confirm the downregulation of selected genes in a panel of PC cell lines. This study has yielded several novel candidate tumor-suppressor genes (TSGs) including GNMT, CEL, PLA2G1B and SERPINI2. We highlight the role of GNMT, a methyl transferase associated with the methylation potential of the cell, and CEL, a lipase, as potential therapeutic targets. We have uncovered genetic links to risk factors associated with PC such as smoking and obesity. Genes important for patient survival and prognosis are also discussed, and we confirm the dysregulation of metabolic pathways previously observed in PC. While many of the genes downregulated in our dataset are associated with protein products normally produced by the pancreas for excretion, we have uncovered some genes whose downregulation appear to play a more causal role in PC. These genes will assist in providing a better understanding of the disease etiology of PC, and in the search for new therapeutic targets and biomarkers.

  7. Functional validation of candidate genes detected by genomic feature models

    DEFF Research Database (Denmark)

    Rohde, Palle Duun; Østergaard, Solveig; Kristensen, Torsten Nygaard

    2018-01-01

    to investigate locomotor activity, and applied genomic feature prediction models to identify gene ontology (GO) cate- gories predictive of this phenotype. Next, we applied the covariance association test to partition the genomic variance of the predictive GO terms to the genes within these terms. We...... then functionally assessed whether the identified candidate genes affected locomotor activity by reducing gene expression using RNA interference. In five of the seven candidate genes tested, reduced gene expression altered the phenotype. The ranking of genes within the predictive GO term was highly correlated......Understanding the genetic underpinnings of complex traits requires knowledge of the genetic variants that contribute to phenotypic variability. Reliable statistical approaches are needed to obtain such knowledge. In genome-wide association studies, variants are tested for association with trait...

  8. Microarray-based analysis of differential gene expression between infective and noninfective larvae of Strongyloides stercoralis.

    Directory of Open Access Journals (Sweden)

    Roshan Ramanathan

    2011-05-01

    Full Text Available Differences between noninfective first-stage (L1 and infective third-stage (L3i larvae of parasitic nematode Strongyloides stercoralis at the molecular level are relatively uncharacterized. DNA microarrays were developed and utilized for this purpose.Oligonucleotide hybridization probes for the array were designed to bind 3,571 putative mRNA transcripts predicted by analysis of 11,335 expressed sequence tags (ESTs obtained as part of the Nematode EST project. RNA obtained from S. stercoralis L3i and L1 was co-hybridized to each array after labeling the individual samples with different fluorescent tags. Bioinformatic predictions of gene function were developed using a novel cDNA Annotation System software. We identified 935 differentially expressed genes (469 L3i-biased; 466 L1-biased having two-fold expression differences or greater and microarray signals with a p value<0.01. Based on a functional analysis, L1 larvae have a larger number of genes putatively involved in transcription (p = 0.004, and L3i larvae have biased expression of putative heat shock proteins (such as hsp-90. Genes with products known to be immunoreactive in S. stercoralis-infected humans (such as SsIR and NIE had L3i biased expression. Abundantly expressed L3i contigs of interest included S. stercoralis orthologs of cytochrome oxidase ucr 2.1 and hsp-90, which may be potential chemotherapeutic targets. The S. stercoralis ortholog of fatty acid and retinol binding protein-1, successfully used in a vaccine against Ancylostoma ceylanicum, was identified among the 25 most highly expressed L3i genes. The sperm-containing glycoprotein domain, utilized in a vaccine against the nematode Cooperia punctata, was exclusively found in L3i biased genes and may be a valuable S. stercoralis target of interest.A new DNA microarray tool for the examination of S. stercoralis biology has been developed and provides new and valuable insights regarding differences between infective and

  9. Single-cell multiple gene expression analysis based on single-molecule-detection microarray assay for multi-DNA determination

    Energy Technology Data Exchange (ETDEWEB)

    Li, Lu [School of Chemistry and Chemical Engineering, Shandong University, Jinan 250100 (China); Wang, Xianwei [School of Life Sciences, Shandong University, Jinan 250100 (China); Zhang, Xiaoli [School of Chemistry and Chemical Engineering, Shandong University, Jinan 250100 (China); Wang, Jinxing [School of Life Sciences, Shandong University, Jinan 250100 (China); Jin, Wenrui, E-mail: jwr@sdu.edu.cn [School of Chemistry and Chemical Engineering, Shandong University, Jinan 250100 (China)

    2015-01-07

    Highlights: • A single-molecule-detection (SMD) microarray for 10 samples is fabricated. • The based-SMD microarray assay (SMA) can determine 8 DNAs for each sample. • The limit of detection of SMA is as low as 1.3 × 10{sup −16} mol L{sup −1}. • The SMA can be applied in single-cell multiple gene expression analysis. - Abstract: We report a novel ultra-sensitive and high-selective single-molecule-detection microarray assay (SMA) for multiple DNA determination. In the SMA, a capture DNA (DNAc) microarray consisting of 10 subarrays with 9 spots for each subarray is fabricated on a silanized glass coverslip as the substrate. On the subarrays, the spot-to-spot spacing is 500 μm and each spot has a diameter of ∼300 μm. The sequence of the DNAcs on the 9 spots of a subarray is different, to determine 8 types of target DNAs (DNAts). Thus, 8 types of DNAts are captured to their complementary DNAcs at 8 spots of a subarray, respectively, and then labeled with quantum dots (QDs) attached to 8 types of detection DNAs (DNAds) with different sequences. The ninth spot is used to detect the blank value. In order to determine the same 8 types of DNAts in 10 samples, the 10 DNAc-modified subarrays on the microarray are identical. Fluorescence single-molecule images of the QD-labeled DNAts on each spot of the subarray are acquired using a home-made single-molecule microarray reader. The amounts of the DNAts are quantified by counting the bright dots from the QDs. For a microarray, 8 types of DNAts in 10 samples can be quantified in parallel. The limit of detection of the SMA for DNA determination is as low as 1.3 × 10{sup −16} mol L{sup −1}. The SMA for multi-DNA determination can also be applied in single-cell multiple gene expression analysis through quantification of complementary DNAs (cDNAs) corresponding to multiple messenger RNAs (mRNAs) in single cells. To do so, total RNA in single cells is extracted and reversely transcribed into their cDNAs. Three

  10. ArraySolver: An Algorithm for Colour-Coded Graphical Display and Wilcoxon Signed-Rank Statistics for Comparing Microarray Gene Expression Data

    OpenAIRE

    Khan, Haseeb Ahmad

    2004-01-01

    The massive surge in the production of microarray data poses a great challenge for proper analysis and interpretation. In recent years numerous computational tools have been developed to extract meaningful interpretation of microarray gene expression data. However, a convenient tool for two-groups comparison of microarray data is still lacking and users have to rely on commercial statistical packages that might be costly and require special skills, in addition to extra time and effort for tra...

  11. Extending the scope of diagnostic chromosome analysis: detection of single gene defects using high-resolution SNP microarrays.

    Science.gov (United States)

    Bruno, Damien L; Stark, Zornitza; Amor, David J; Burgess, Trent; Butler, Kathy; Corrie, Sylvea; Francis, David; Ganesamoorthy, Devika; Hills, Louise; James, Paul A; O'Rielly, Darren; Oertel, Ralph; Savarirayan, Ravi; Prabhakara, Krishnamurthy; Salce, Nicholas; Slater, Howard R

    2011-12-01

    Microarray analysis has provided significant advances in the diagnosis of conditions resulting from submicroscopic chromosome abnormalities. It has been recommended that array testing should be a "first tier" test in the evaluation of individuals with intellectual disability, developmental delay, congenital anomalies, and autism. The availability of arrays with increasingly high probe coverage and resolution has increased the detection of decreasingly small copy number changes (CNCs) down to the intragenic or even exon level. Importantly, arrays that genotype SNPs also detect extended regions of homozygosity. We describe 14 examples of single gene disorders caused by intragenic changes from a consecutive set of 6,500 tests using high-resolution SNP microarrays. These cases illustrate the increased scope of cytogenetic testing beyond dominant chromosome rearrangements that typically contain many genes. Nine of the cases confirmed the clinical diagnosis, that is, followed a "phenotype to genotype" approach. Five were diagnosed by the laboratory analysis in the absence of a specific clinical diagnosis, that is, followed a "genotype to phenotype" approach. Two were clinically significant, incidental findings. The importance of astute clinical assessment and laboratory-clinician consultation is emphasized to optimize the value of microarrays in the diagnosis of disorders caused by single gene copy number and sequence mutations. © 2011 Wiley-Liss, Inc.

  12. Metric learning for DNA microarray data analysis

    International Nuclear Information System (INIS)

    Takeuchi, Ichiro; Nakagawa, Masao; Seto, Masao

    2009-01-01

    In many microarray studies, gene set selection is an important preliminary step for subsequent main task such as tumor classification, cancer subtype identification, etc. In this paper, we investigate the possibility of using metric learning as an alternative to gene set selection. We develop a simple metric learning algorithm aiming to use it for microarray data analysis. Exploiting a property of the algorithm, we introduce a novel approach for extending the metric learning to be adaptive. We apply the algorithm to previously studied microarray data on malignant lymphoma subtype identification.

  13. Microarray-Based Gene Expression Profiling to Elucidate Effectiveness of Fermented Codonopsis lanceolata in Mice

    Directory of Open Access Journals (Sweden)

    Woon Yong Choi

    2014-04-01

    Full Text Available In this study, the effect of Codonopsis lanceolata fermented by lactic acid on controlling gene expression levels related to obesity was observed in an oligonucleotide chip microarray. Among 8170 genes, 393 genes were up regulated and 760 genes were down regulated in feeding the fermented C. lanceolata (FCL. Another 374 genes were up regulated and 527 genes down regulated without feeding the sample. The genes were not affected by the FCL sample. It was interesting that among those genes, Chytochrome P450, Dmbt1, LOC76487, and thyroid hormones, etc., were mostly up or down regulated. These genes are more related to lipid synthesis. We could conclude that the FCL possibly controlled the gene expression levels related to lipid synthesis, which resulted in reducing obesity. However, more detailed protein expression experiments should be carried out.

  14. LOD score exclusion analyses for candidate genes using random population samples.

    Science.gov (United States)

    Deng, H W; Li, J; Recker, R R

    2001-05-01

    While extensive analyses have been conducted to test for, no formal analyses have been conducted to test against, the importance of candidate genes with random population samples. We develop a LOD score approach for exclusion analyses of candidate genes with random population samples. Under this approach, specific genetic effects and inheritance models at candidate genes can be analysed and if a LOD score is < or = - 2.0, the locus can be excluded from having an effect larger than that specified. Computer simulations show that, with sample sizes often employed in association studies, this approach has high power to exclude a gene from having moderate genetic effects. In contrast to regular association analyses, population admixture will not affect the robustness of our analyses; in fact, it renders our analyses more conservative and thus any significant exclusion result is robust. Our exclusion analysis complements association analysis for candidate genes in random population samples and is parallel to the exclusion mapping analyses that may be conducted in linkage analyses with pedigrees or relative pairs. The usefulness of the approach is demonstrated by an application to test the importance of vitamin D receptor and estrogen receptor genes underlying the differential risk to osteoporotic fractures.

  15. The cld mutation: narrowing the critical chromosomal region and selecting candidate genes.

    Science.gov (United States)

    Péterfy, Miklós; Mao, Hui Z; Doolittle, Mark H

    2006-10-01

    Combined lipase deficiency (cld) is a recessive, lethal mutation specific to the tw73 haplotype on mouse Chromosome 17. While the cld mutation results in lipase proteins that are inactive, aggregated, and retained in the endoplasmic reticulum (ER), it maps separately from the lipase structural genes. We have narrowed the gene critical region by about 50% using the tw18 haplotype for deletion mapping and a recombinant chromosome used originally to map cld with respect to the phenotypic marker tf. The region now extends from 22 to 25.6 Mbp on the wild-type chromosome, currently containing 149 genes and 50 expressed sequence tags (ESTs). To identify the affected gene, we have selected candidates based on their known role in associated biological processes, cellular components, and molecular functions that best fit with the predicted function of the cld gene. A secondary approach was based on differences in mRNA levels between mutant (cld/cld) and unaffected (+/cld) cells. Using both approaches, we have identified seven functional candidates with an ER localization and/or an involvement in protein maturation and folding that could explain the lipase deficiency, and six expression candidates that exhibit large differences in mRNA levels between mutant and unaffected cells. Significantly, two genes were found to be candidates with regard to both function and expression, thus emerging as the strongest candidates for cld. We discuss the implications of our mapping results and our selection of candidates with respect to other genes, deletions, and mutations occurring in the cld critical region.

  16. Identification of Inherited Retinal Disease-Associated Genetic Variants in 11 Candidate Genes.

    Science.gov (United States)

    Astuti, Galuh D N; van den Born, L Ingeborgh; Khan, M Imran; Hamel, Christian P; Bocquet, Béatrice; Manes, Gaël; Quinodoz, Mathieu; Ali, Manir; Toomes, Carmel; McKibbin, Martin; El-Asrag, Mohammed E; Haer-Wigman, Lonneke; Inglehearn, Chris F; Black, Graeme C M; Hoyng, Carel B; Cremers, Frans P M; Roosing, Susanne

    2018-01-10

    Inherited retinal diseases (IRDs) display an enormous genetic heterogeneity. Whole exome sequencing (WES) recently identified genes that were mutated in a small proportion of IRD cases. Consequently, finding a second case or family carrying pathogenic variants in the same candidate gene often is challenging. In this study, we searched for novel candidate IRD gene-associated variants in isolated IRD families, assessed their causality, and searched for novel genotype-phenotype correlations. Whole exome sequencing was performed in 11 probands affected with IRDs. Homozygosity mapping data was available for five cases. Variants with minor allele frequencies ≤ 0.5% in public databases were selected as candidate disease-causing variants. These variants were ranked based on their: (a) presence in a gene that was previously implicated in IRD; (b) minor allele frequency in the Exome Aggregation Consortium database (ExAC); (c) in silico pathogenicity assessment using the combined annotation dependent depletion (CADD) score; and (d) interaction of the corresponding protein with known IRD-associated proteins. Twelve unique variants were found in 11 different genes in 11 IRD probands. Novel autosomal recessive and dominant inheritance patterns were found for variants in Small Nuclear Ribonucleoprotein U5 Subunit 200 ( SNRNP200 ) and Zinc Finger Protein 513 ( ZNF513 ), respectively. Using our pathogenicity assessment, a variant in DEAH-Box Helicase 32 ( DHX32 ) was the top ranked novel candidate gene to be associated with IRDs, followed by eight medium and lower ranked candidate genes. The identification of candidate disease-associated sequence variants in 11 single families underscores the notion that the previously identified IRD-associated genes collectively carry > 90% of the defects implicated in IRDs. To identify multiple patients or families with variants in the same gene and thereby provide extra proof for pathogenicity, worldwide data sharing is needed.

  17. The Importance of Normalization on Large and Heterogeneous Microarray Datasets

    Science.gov (United States)

    DNA microarray technology is a powerful functional genomics tool increasingly used for investigating global gene expression in environmental studies. Microarrays can also be used in identifying biological networks, as they give insight on the complex gene-to-gene interactions, ne...

  18. MARS: Microarray analysis, retrieval, and storage system

    Directory of Open Access Journals (Sweden)

    Scheideler Marcel

    2005-04-01

    Full Text Available Abstract Background Microarray analysis has become a widely used technique for the study of gene-expression patterns on a genomic scale. As more and more laboratories are adopting microarray technology, there is a need for powerful and easy to use microarray databases facilitating array fabrication, labeling, hybridization, and data analysis. The wealth of data generated by this high throughput approach renders adequate database and analysis tools crucial for the pursuit of insights into the transcriptomic behavior of cells. Results MARS (Microarray Analysis and Retrieval System provides a comprehensive MIAME supportive suite for storing, retrieving, and analyzing multi color microarray data. The system comprises a laboratory information management system (LIMS, a quality control management, as well as a sophisticated user management system. MARS is fully integrated into an analytical pipeline of microarray image analysis, normalization, gene expression clustering, and mapping of gene expression data onto biological pathways. The incorporation of ontologies and the use of MAGE-ML enables an export of studies stored in MARS to public repositories and other databases accepting these documents. Conclusion We have developed an integrated system tailored to serve the specific needs of microarray based research projects using a unique fusion of Web based and standalone applications connected to the latest J2EE application server technology. The presented system is freely available for academic and non-profit institutions. More information can be found at http://genome.tugraz.at.

  19. A general framework for optimization of probes for gene expression microarray and its application to the fungus Podospora anserina.

    Science.gov (United States)

    Bidard, Frédérique; Imbeaud, Sandrine; Reymond, Nancie; Lespinet, Olivier; Silar, Philippe; Clavé, Corinne; Delacroix, Hervé; Berteaux-Lecellier, Véronique; Debuchy, Robert

    2010-06-18

    The development of new microarray technologies makes custom long oligonucleotide arrays affordable for many experimental applications, notably gene expression analyses. Reliable results depend on probe design quality and selection. Probe design strategy should cope with the limited accuracy of de novo gene prediction programs, and annotation up-dating. We present a novel in silico procedure which addresses these issues and includes experimental screening, as an empirical approach is the best strategy to identify optimal probes in the in silico outcome. We used four criteria for in silico probe selection: cross-hybridization, hairpin stability, probe location relative to coding sequence end and intron position. This latter criterion is critical when exon-intron gene structure predictions for intron-rich genes are inaccurate. For each coding sequence (CDS), we selected a sub-set of four probes. These probes were included in a test microarray, which was used to evaluate the hybridization behavior of each probe. The best probe for each CDS was selected according to three experimental criteria: signal-to-noise ratio, signal reproducibility, and representative signal intensities. This procedure was applied for the development of a gene expression Agilent platform for the filamentous fungus Podospora anserina and the selection of a single 60-mer probe for each of the 10,556 P. anserina CDS. A reliable gene expression microarray version based on the Agilent 44K platform was developed with four spot replicates of each probe to increase statistical significance of analysis.

  20. A general framework for optimization of probes for gene expression microarray and its application to the fungus Podospora anserina

    Directory of Open Access Journals (Sweden)

    Bidard Frédérique

    2010-06-01

    Full Text Available Abstract Background The development of new microarray technologies makes custom long oligonucleotide arrays affordable for many experimental applications, notably gene expression analyses. Reliable results depend on probe design quality and selection. Probe design strategy should cope with the limited accuracy of de novo gene prediction programs, and annotation up-dating. We present a novel in silico procedure which addresses these issues and includes experimental screening, as an empirical approach is the best strategy to identify optimal probes in the in silico outcome. Findings We used four criteria for in silico probe selection: cross-hybridization, hairpin stability, probe location relative to coding sequence end and intron position. This latter criterion is critical when exon-intron gene structure predictions for intron-rich genes are inaccurate. For each coding sequence (CDS, we selected a sub-set of four probes. These probes were included in a test microarray, which was used to evaluate the hybridization behavior of each probe. The best probe for each CDS was selected according to three experimental criteria: signal-to-noise ratio, signal reproducibility, and representative signal intensities. This procedure was applied for the development of a gene expression Agilent platform for the filamentous fungus Podospora anserina and the selection of a single 60-mer probe for each of the 10,556 P. anserina CDS. Conclusions A reliable gene expression microarray version based on the Agilent 44K platform was developed with four spot replicates of each probe to increase statistical significance of analysis.

  1. Integrated database for identifying candidate genes for Aspergillus flavus resistance in maize.

    Science.gov (United States)

    Kelley, Rowena Y; Gresham, Cathy; Harper, Jonathan; Bridges, Susan M; Warburton, Marilyn L; Hawkins, Leigh K; Pechanova, Olga; Peethambaran, Bela; Pechan, Tibor; Luthe, Dawn S; Mylroie, J E; Ankala, Arunkanth; Ozkan, Seval; Henry, W B; Williams, W P

    2010-10-07

    Aspergillus flavus Link:Fr, an opportunistic fungus that produces aflatoxin, is pathogenic to maize and other oilseed crops. Aflatoxin is a potent carcinogen, and its presence markedly reduces the value of grain. Understanding and enhancing host resistance to A. flavus infection and/or subsequent aflatoxin accumulation is generally considered an efficient means of reducing grain losses to aflatoxin. Different proteomic, genomic and genetic studies of maize (Zea mays L.) have generated large data sets with the goal of identifying genes responsible for conferring resistance to A. flavus, or aflatoxin. In order to maximize the usage of different data sets in new studies, including association mapping, we have constructed a relational database with web interface integrating the results of gene expression, proteomic (both gel-based and shotgun), Quantitative Trait Loci (QTL) genetic mapping studies, and sequence data from the literature to facilitate selection of candidate genes for continued investigation. The Corn Fungal Resistance Associated Sequences Database (CFRAS-DB) (http://agbase.msstate.edu/) was created with the main goal of identifying genes important to aflatoxin resistance. CFRAS-DB is implemented using MySQL as the relational database management system running on a Linux server, using an Apache web server, and Perl CGI scripts as the web interface. The database and the associated web-based interface allow researchers to examine many lines of evidence (e.g. microarray, proteomics, QTL studies, SNP data) to assess the potential role of a gene or group of genes in the response of different maize lines to A. flavus infection and subsequent production of aflatoxin by the fungus. CFRAS-DB provides the first opportunity to integrate data pertaining to the problem of A. flavus and aflatoxin resistance in maize in one resource and to support queries across different datasets. The web-based interface gives researchers different query options for mining the database

  2. Development of a miniaturised microarray-based assay for the rapid identification of antimicrobial resistance genes in Gram-negative bacteria

    DEFF Research Database (Denmark)

    Batchelor, Miranda; Hopkins, Katie L; Liebana, Ernesto

    2008-01-01

    We describe the development of a miniaturised microarray for the detection of antimicrobial resistance genes in Gram-negative bacteria. Included on the array are genes encoding resistance to aminoglycosides, trimethoprim, sulphonamides, tetracyclines and beta-lactams, including extended-spectrum ...

  3. Global methylation profiling of lymphoblastoid cell lines reveals epigenetic contributions to autism spectrum disorders and a novel autism candidate gene, RORA, whose protein product is reduced in autistic brain

    Science.gov (United States)

    Nguyen, AnhThu; Rauch, Tibor A.; Pfeifer, Gerd P.; Hu, Valerie W.

    2010-01-01

    Autism is currently considered a multigene disorder with epigenetic influences. To investigate the contribution of DNA methylation to autism spectrum disorders, we have recently completed large-scale methylation profiling by CpG island microarray analysis of lymphoblastoid cell lines derived from monozygotic twins discordant for diagnosis of autism and their nonautistic siblings. Methylation profiling revealed many candidate genes differentially methylated between discordant MZ twins as well as between both twins and nonautistic siblings. Bioinformatics analysis of the differentially methylated genes demonstrated enrichment for high-level functions including gene transcription, nervous system development, cell death/survival, and other biological processes implicated in autism. The methylation status of 2 of these candidate genes, BCL-2 and retinoic acid-related orphan receptor alpha (RORA), was further confirmed by bisulfite sequencing and methylation-specific PCR, respectively. Immunohistochemical analyses of tissue arrays containing slices of the cerebellum and frontal cortex of autistic and age- and sex-matched control subjects revealed decreased expression of RORA and BCL-2 proteins in the autistic brain. Our data thus confirm the role of epigenetic regulation of gene expression via differential DNA methylation in idiopathic autism, and furthermore link molecular changes in a peripheral cell model with brain pathobiology in autism.—Nguyen, A., Rauch, T. A., Pfeifer, G. P., Hu, V. W. Global methylation profiling of lymphoblastoid cell lines reveals epigenetic contributions to autism spectrum disorders and a novel autism candidate gene, RORA, whose protein product is reduced in autistic brain. PMID:20375269

  4. Bone health nutraceuticals alter microarray mRNA gene expression: A randomized, parallel, open-label clinical study.

    Science.gov (United States)

    Lin, Yumei; Kazlova, Valentina; Ramakrishnan, Shyam; Murray, Mary A; Fast, David; Chandra, Amitabh; Gellenbeck, Kevin W

    2016-01-15

    Dietary intake of fruits and vegetables has been suggested to have a role in promoting bone health. More specifically, the polyphenols they contain have been linked to physiological effects related to bone mineral density and bone metabolism. In this research, we use standard microarray analyses of peripheral whole blood from post-menopausal women treated with two fixed combinations of plant extracts standardized to polyphenol content to identify differentially expressed genes relevant to bone health. In this 28-day open-label study, healthy post-menopausal women were randomized into three groups, each receiving one of three investigational fixed combinations of plant extracts: an anti-resorptive (AR) combination of pomegranate fruit (Punica granatum L.) and grape seed (Vitis vinifera L.) extracts; a bone formation (BF) combination of quercetin (Dimorphandra mollis Benth) and licorice (Glycyrrhiza glabra L.) extracts; and a fixed combination of all four plant extracts (AR plus BF). Standard microarray analysis was performed on peripheral whole blood samples taken before and after each treatment. Annotated genes were analyzed for their association to bone health by comparison to a gene library. The AR combination down-regulated a number of genes involved in reduction of bone resorption including cathepsin G (CTSG) and tachykinin receptor 1 (TACR1). The AR combination also up-regulated genes associated with formation of extracellular matrix including heparan sulfate proteoglycan 2 (HSPG2) and hyaluronoglucosaminidase 1 (HYAL1). In contrast, treatment with the BF combination resulted in up-regulation of bone morphogenetic protein 2 (BMP-2) and COL1A1 (collagen type I α1) genes which are linked to bone and collagen formation while down-regulating genes linked to osteoclastogenesis. Treatment with a combination of all four plant extracts had a distinctly different effect on gene expression than the results of the AR and BF combinations individually. These results could

  5. Incorporation of gene-specific variability improves expression analysis using high-density DNA microarrays

    Directory of Open Access Journals (Sweden)

    Spitznagel Edward

    2003-11-01

    Full Text Available Abstract Background The assessment of data reproducibility is essential for application of microarray technology to exploration of biological pathways and disease states. Technical variability in data analysis largely depends on signal intensity. Within that context, the reproducibility of individual probe sets has not been hitherto addressed. Results We used an extraordinarily large replicate data set derived from human placental trophoblast to analyze probe-specific contribution to variability of gene expression. We found that signal variability, in addition to being signal-intensity dependant, is probe set-specific. Importantly, we developed a novel method to quantify the contribution of this probe set-specific variability. Furthermore, we devised a formula that incorporates a priori-computed, replicate-based information on probe set- and intensity-specific variability in determination of expression changes even without technical replicates. Conclusion The strategy of incorporating probe set-specific variability is superior to analysis based on arbitrary fold-change thresholds. We recommend its incorporation to any computation of gene expression changes using high-density DNA microarrays. A Java application implementing our T-score is available at http://www.sadovsky.wustl.edu/tscore.html.

  6. Characterization of fetal cells from the maternal circulation by microarray gene expression analysis - Could the extravillous trophoblasts be a target for future cell-based non-invasive prenatal diagnosis?

    DEFF Research Database (Denmark)

    Hatt, Lotte; Brinch, Marie; Singh, Ripudaman

    2014-01-01

    stem cell microarray analysis. Results: 39 genes were identified as candidates for unique fetal cell markers. More than half of these are genes known to be expressed in the placenta, especially in extravillous trophoblasts (EVTs). Immunohistochemical staining of placental tissue confirmed CD105......Introduction: Circulating fetal cells in maternal blood provide a tool for risk-free, non-invasive prenatal diagnosis. However, fetal cells in the maternal circulation are scarce, and to effectively isolate enough of them for reliable diagnostics, it is crucial to know which fetal cell type......(s) should be targeted. Materials and Methods: Fetal cells were enriched from maternal blood by magnetic-activated cell sorting using the endothelial cell marker CD105 and identified by XY fluorescence in situ hybridization. Expression pattern was compared between fetal cells and maternal blood cells using...

  7. Gene Expression Profiling of Human Vaginal Cells In Vitro Discriminates Compounds with Pro-Inflammatory and Mucosa-Altering Properties: Novel Biomarkers for Preclinical Testing of HIV Microbicide Candidates.

    Directory of Open Access Journals (Sweden)

    Irina A Zalenskaya

    Full Text Available Inflammation and immune activation of the cervicovaginal mucosa are considered factors that increase susceptibility to HIV infection. Therefore, it is essential to screen candidate anti-HIV microbicides for potential mucosal immunomodulatory/inflammatory effects prior to further clinical development. The goal of this study was to develop an in vitro method for preclinical evaluation of the inflammatory potential of new candidate microbicides using a microarray gene expression profiling strategy.To this end, we compared transcriptomes of human vaginal cells (Vk2/E6E7 treated with well-characterized pro-inflammatory (PIC and non-inflammatory (NIC compounds. PICs included compounds with different mechanisms of action. Gene expression was analyzed using Affymetrix U133 Plus 2 arrays. Data processing was performed using GeneSpring 11.5 (Agilent Technologies, Santa Clara, CA.Microarraray comparative analysis allowed us to generate a panel of 20 genes that were consistently deregulated by PICs compared to NICs, thus distinguishing between these two groups. Functional analysis mapped 14 of these genes to immune and inflammatory responses. This was confirmed by the fact that PICs induced NFkB pathway activation in Vk2 cells. By testing microbicide candidates previously characterized in clinical trials we demonstrated that the selected PIC-associated genes properly identified compounds with mucosa-altering effects. The discriminatory power of these genes was further demonstrated after culturing vaginal cells with vaginal bacteria. Prevotella bivia, prevalent bacteria in the disturbed microbiota of bacterial vaginosis, induced strong upregulation of seven selected PIC-associated genes, while a commensal Lactobacillus gasseri associated to vaginal health did not cause any changes.In vitro evaluation of the immunoinflammatory potential of microbicides using the PIC-associated genes defined in this study could help in the initial screening of candidates prior

  8. Microarray analysis of the gene expression profile in triethylene glycol dimethacrylate-treated human dental pulp cells.

    Science.gov (United States)

    Torun, D; Torun, Z Ö; Demirkaya, K; Sarper, M; Elçi, M P; Avcu, F

    2017-11-01

    Triethylene glycol dimethacrylate (TEGDMA) is an important resin monomer commonly used in the structure of dental restorative materials. Recent studies have shown that unpolymerized resin monomers may be released into the oral environment and cause harmful biological effects. We investigated changes in the gene expression profiles of TEGDMA-treated human dental pulp cells (hDPCs) following short- (1-day) and long-term (7-days) exposure. HDPCs were exposed to a noncytotoxic concentration of TEGDMA, and gene expression profiles were evaluated by microarray analysis. The results were confirmed by quantitative reverse-transcriptase PCR (qRT PCR). In total, 1282 and 1319 genes (up- or down-regulated) were differentially expressed compared with control group after the 1- and 7-day incubation periods, respectively. Biological ontology-based analyses revealed that metabolic, cellular, and developmental processes constituted the largest groups of biological functional processes. qRT-PCR analysis on bone morphogenetic protein-2 (BMP-2), BMP-4, secreted protein, acidic, cysteine-rich, collagen type I alpha 1, oxidative stress-induced growth inhibitor 1, MMP3, interleukin-6, and heme oxygenase-1 genes confirmed the changes in expression observed in the microarray analysis. Our results suggest that TEGDMA can change the many functions of hDPCs through large changes in gene expression levels and complex interactions with different signaling pathways.

  9. EXONSAMPLER: a computer program for genome-wide and candidate gene exon sampling for targeted next-generation sequencing.

    Science.gov (United States)

    Cosart, Ted; Beja-Pereira, Albano; Luikart, Gordon

    2014-11-01

    The computer program EXONSAMPLER automates the sampling of thousands of exon sequences from publicly available reference genome sequences and gene annotation databases. It was designed to provide exon sequences for the efficient, next-generation gene sequencing method called exon capture. The exon sequences can be sampled by a list of gene name abbreviations (e.g. IFNG, TLR1), or by sampling exons from genes spaced evenly across chromosomes. It provides a list of genomic coordinates (a bed file), as well as a set of sequences in fasta format. User-adjustable parameters for collecting exon sequences include a minimum and maximum acceptable exon length, maximum number of exonic base pairs (bp) to sample per gene, and maximum total bp for the entire collection. It allows for partial sampling of very large exons. It can preferentially sample upstream (5 prime) exons, downstream (3 prime) exons, both external exons, or all internal exons. It is written in the Python programming language using its free libraries. We describe the use of EXONSAMPLER to collect exon sequences from the domestic cow (Bos taurus) genome for the design of an exon-capture microarray to sequence exons from related species, including the zebu cow and wild bison. We collected ~10% of the exome (~3 million bp), including 155 candidate genes, and ~16,000 exons evenly spaced genomewide. We prioritized the collection of 5 prime exons to facilitate discovery and genotyping of SNPs near upstream gene regulatory DNA sequences, which control gene expression and are often under natural selection. © 2014 John Wiley & Sons Ltd.

  10. Gene expression of panaxydol-treated human melanoma cells using radioactive cDNA microarrays

    International Nuclear Information System (INIS)

    Cho, Joong Youn; Yu, Su Jin; Soh, Jeong Won; Kim, Meyoung Kon

    2001-01-01

    Polyacetylenic alcohols derived from Panax ginseng have been studied to be an anticancer reagent previously. One of the Panax ginseng polyacetylenic alcohols, i.e., panaxydol, has been studied to possess an antiproliferative effect on human melanoma cell line (SK-MEL-1). In ths study, radioactive cDNA microarrays enabled an efficient approach to analyze the pattern of gene expression (3.194 genes in a total) simultaneously. The bioinformatics selection of human cDNAs, which is specifically designed for immunology, apoptosis and signal transduction, were arrayed on nylon membranes. Using with 33 P labeled probes, this method provided highly sensitive gene expression profiles of our interest including apoptosis, cell proliferation, cell cycle, and signal transduction. Gene expression profiles were also classified into several categories in accordance with the duration of panaxydol treatment. Consequently, the gene profiles of our interest were significantly up (199 genes, > 2.0 of Z-ratio) or down-(196 genes, < 2.0 of Z-ratio) regulated in panaxydol-treated human melanoma cells

  11. Gene expression of panaxydol-treated human melanoma cells using radioactive cDNA microarrays

    Energy Technology Data Exchange (ETDEWEB)

    Cho, Joong Youn; Yu, Su Jin; Soh, Jeong Won; Kim, Meyoung Kon [College of Medicine, Korea Univ., Seoul (Korea, Republic of)

    2001-07-01

    Polyacetylenic alcohols derived from Panax ginseng have been studied to be an anticancer reagent previously. One of the Panax ginseng polyacetylenic alcohols, i.e., panaxydol, has been studied to possess an antiproliferative effect on human melanoma cell line (SK-MEL-1). In ths study, radioactive cDNA microarrays enabled an efficient approach to analyze the pattern of gene expression (3.194 genes in a total) simultaneously. The bioinformatics selection of human cDNAs, which is specifically designed for immunology, apoptosis and signal transduction, were arrayed on nylon membranes. Using with {sup 33}P labeled probes, this method provided highly sensitive gene expression profiles of our interest including apoptosis, cell proliferation, cell cycle, and signal transduction. Gene expression profiles were also classified into several categories in accordance with the duration of panaxydol treatment. Consequently, the gene profiles of our interest were significantly up (199 genes, > 2.0 of Z-ratio) or down-(196 genes, < 2.0 of Z-ratio) regulated in panaxydol-treated human melanoma cells.

  12. High-throughput analysis of candidate imprinted genes and allele-specific gene expression in the human term placenta

    Directory of Open Access Journals (Sweden)

    Clark Taane G

    2010-04-01

    Full Text Available Abstract Background Imprinted genes show expression from one parental allele only and are important for development and behaviour. This extreme mode of allelic imbalance has been described for approximately 56 human genes. Imprinting status is often disrupted in cancer and dysmorphic syndromes. More subtle variation of gene expression, that is not parent-of-origin specific, termed 'allele-specific gene expression' (ASE is more common and may give rise to milder phenotypic differences. Using two allele-specific high-throughput technologies alongside bioinformatics predictions, normal term human placenta was screened to find new imprinted genes and to ascertain the extent of ASE in this tissue. Results Twenty-three family trios of placental cDNA, placental genomic DNA (gDNA and gDNA from both parents were tested for 130 candidate genes with the Sequenom MassArray system. Six genes were found differentially expressed but none imprinted. The Illumina ASE BeadArray platform was then used to test 1536 SNPs in 932 genes. The array was enriched for the human orthologues of 124 mouse candidate genes from bioinformatics predictions and 10 human candidate imprinted genes from EST database mining. After quality control pruning, a total of 261 informative SNPs (214 genes remained for analysis. Imprinting with maternal expression was demonstrated for the lymphocyte imprinted gene ZNF331 in human placenta. Two potential differentially methylated regions (DMRs were found in the vicinity of ZNF331. None of the bioinformatically predicted candidates tested showed imprinting except for a skewed allelic expression in a parent-specific manner observed for PHACTR2, a neighbour of the imprinted PLAGL1 gene. ASE was detected for two or more individuals in 39 candidate genes (18%. Conclusions Both Sequenom and Illumina assays were sensitive enough to study imprinting and strong allelic bias. Previous bioinformatics approaches were not predictive of new imprinted genes

  13. CANDIDATE GENE ANALYSIS IN ISRAELI SOLDIERS WITH STRESS FRACTURES

    Directory of Open Access Journals (Sweden)

    Ran Yanovich

    2012-03-01

    Full Text Available To investigate the association of polymorphisms within candidate genes which we hypothesized may contribute to stress fracture predisposition, a case-control, cross- sectional study design was employed. Genotyping 268 Single Nucleotide Polymorphisms- SNPs within 17 genes in 385 Israeli young male and female recruits (182 with and 203 without stress fractures. Twenty-five polymorphisms within 9 genes (NR3C1, ANKH, VDR, ROR2, CALCR, IL6, COL1A2, CBG, and LRP4 showed statistically significant differences (p < 0.05 in the distribution between stress fracture cases and non stress fracture controls. Seventeen genetic variants were associated with an increased stress fracture risk, and eight variants with a decreased stress fracture risk. None of the SNP associations remained significant after correcting for multiple comparisons (false discovery rate- FDR. Our findings suggest that genes may be involved in stress fracture pathogenesis. Specifically, the CALCR and the VDR genes are intriguing candidates. The putative involvement of these genes in stress fracture predisposition requires analysis of more cases and controls and sequencing the relevant genomic regions, in order to define the specific gene mutations

  14. A hybrid gene selection approach for microarray data classification using cellular learning automata and ant colony optimization.

    Science.gov (United States)

    Vafaee Sharbaf, Fatemeh; Mosafer, Sara; Moattar, Mohammad Hossein

    2016-06-01

    This paper proposes an approach for gene selection in microarray data. The proposed approach consists of a primary filter approach using Fisher criterion which reduces the initial genes and hence the search space and time complexity. Then, a wrapper approach which is based on cellular learning automata (CLA) optimized with ant colony method (ACO) is used to find the set of features which improve the classification accuracy. CLA is applied due to its capability to learn and model complicated relationships. The selected features from the last phase are evaluated using ROC curve and the most effective while smallest feature subset is determined. The classifiers which are evaluated in the proposed framework are K-nearest neighbor; support vector machine and naïve Bayes. The proposed approach is evaluated on 4 microarray datasets. The evaluations confirm that the proposed approach can find the smallest subset of genes while approaching the maximum accuracy. Copyright © 2016 Elsevier Inc. All rights reserved.

  15. Whole genome homology-based identification of candidate genes ...

    African Journals Online (AJOL)

    Josephine Erhiakporeh

    2016-07-06

    Jul 6, 2016 ... candidate genes for drought tolerance in sesame. (Sesamum ... Our results provided genomic resources for further functional analysis and genetic engineering .... reverse transcribed using the Reverse Transcription System.

  16. Single-gene testing combined with single nucleotide polymorphism microarray preimplantation genetic diagnosis for aneuploidy: a novel approach in optimizing pregnancy outcome.

    Science.gov (United States)

    Brezina, Paul R; Benner, Andrew; Rechitsky, Svetlana; Kuliev, Anver; Pomerantseva, Ekaterina; Pauling, Dana; Kearns, William G

    2011-04-01

    To describe a method of amplifying DNA from blastocyst trophectoderm cells (two or three cells) and simultaneously performing 23-chromosome single nucleotide polymorphism microarrays and single-gene preimplantation genetic diagnosis. Case report. IVF clinic and preimplantation genetic diagnostic centers. A 36-year-old woman, gravida 2, para 1011, and her husband who both were carriers of GM(1) gangliosidosis. The couple wished to proceed with microarray analysis for aneuploidy detection coupled with DNA sequencing for GM(1) gangliosidosis. An IVF cycle was performed. Ten blastocyst-stage embryos underwent trophectoderm biopsy. Twenty-three-chromosome microarray analysis for aneuploidy and specific DNA sequencing for GM(1) gangliosidosis mutations were performed. Viable pregnancy. After testing, elective single embryo transfer was performed followed by an intrauterine pregnancy with documented fetal cardiac activity by ultrasound. Twenty-three-chromosome microarray analysis for aneuploidy detection and single-gene evaluation via specific DNA sequencing and linkage analysis are used for preimplantation diagnosis for single-gene disorders and aneuploidy. Because of the minimal amount of genetic material obtained from the day 3 to 5 embryos (up to 6 pg), these modalities have been used in isolation of each other. The use of preimplantation genetic diagnosis for aneuploidy coupled with testing for single-gene disorders via trophectoderm biopsy is a novel approach to maximize pregnancy outcomes. Although further investigation is warranted, preimplantation genetic diagnosis for aneuploidy and single-gene testing seem destined to be used increasingly to optimize ultimate pregnancy success. Copyright © 2011 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.

  17. Early Gene Expression in Wounded Human Keratinocytes Revealed by DNA Microarray Analysis

    Directory of Open Access Journals (Sweden)

    Pascal Barbry

    2006-04-01

    Full Text Available Wound healing involves several steps: spreading of the cells, migration and proliferation. We have profiled gene expression during the early events of wound healing in normal human keratinocytes with a home-made DNA microarray containing about 1000 relevant human probes. An original wounding machine was used, that allows the wounding of up to 40% of the surface of a confluent monolayer of cultured cells grown on a Petri dish (compared with 5% with a classical ‘scratch’ method. The two aims of the present study were: (a to validate a limited number of genes by comparing the expression levels obtained with this technique with those found in the literature; (b to combine the use of the wounding machine with DNA microarray analysis for large-scale detection of the molecular events triggered during the early stages of the wound-healing process. The time-courses of RNA expression observed at 0.5, 1.5, 3, 6 and 15 h after wounding for genes such as c-Fos, c-Jun, Egr1, the plasminogen activator PLAU (uPA and the signal transducer and transcription activator STAT3, were consistent with previously published data. This suggests that our methodologies are able to perform quantitative measurement of gene expression. Transcripts encoding two zinc finger proteins, ZFP36 and ZNF161, and the tumour necrosis factor α-induced protein TNFAIP3, were also overexpressed after wounding. The role of the p38 mitogen-activated protein kinase (p38MAPK in wound healing was shown after the inhibition of p38 by SB203580, but our results also suggest the existence of surrogate activating pathways.

  18. Herpesvirus of turkeys: microarray analysis of host gene responses to infection

    International Nuclear Information System (INIS)

    Karaca, Gamze; Anobile, Jonathan; Downs, Danielle; Burnside, Joan; Schmidt, Carl J.

    2004-01-01

    Herpesvirus of turkeys (HVT) provides an economically important live vaccine for prevention of Marek's disease (MD) of chickens. MD, characterized by both immunosuppression and T-cell lymphoma, is caused by another herpesvirus termed Marek's disease virus (MDV). Microarrays were used to investigate the response of chicken embryonic fibroblasts (CEF) to infection with HVT. Genes responding to HVT infection include several induced by interferon along with others modulating signal transduction, transcription, scaffolding proteins, and the cytoskeleton. Results are compared with earlier studies examining the responses of CEF cells to infection with MDV

  19. Test for positional candidate genes for body composition on pig chromosome 6

    Directory of Open Access Journals (Sweden)

    Pérez-Enciso Miguel

    2002-07-01

    Full Text Available Abstract One QTL affecting backfat thickness (BF, intramuscular fat content (IMF and eye muscle area (MA was previously localized on porcine chromosome 6 in an F2 cross between Iberian and Landrace pigs. This work was done to study the effect of two positional candidate genes on these traits: H-FABP and LEPR genes. The QTL mapping analysis was repeated with a regression method using genotypes for seven microsatellites and two PCR-RFLPs in the H-FABP and LEPR genes. H-FABP and LEPR genes were located at 85.4 and 107 cM respectively, by linkage analysis. The effects of the candidate gene polymorphisms were analyzed in two ways. When an animal model was fitted, both genes showed significant effects on fatness traits, the H-FABP polymorphism showed significant effects on IMF and MA, and the LEPR polymorphism on BF and IMF. But when the candidate gene effect was included in a QTL regression analysis these associations were not observed, suggesting that they must not be the causal mutations responsible for the effects found. Differences in the results of both analyses showed the inadequacy of the animal model approach for the evaluation of positional candidate genes in populations with linkage disequilibrium, when the probabilities of the parental origin of the QTL alleles are not included in the model.

  20. Dimension reduction methods for microarray data: a review

    Directory of Open Access Journals (Sweden)

    Rabia Aziz

    2017-03-01

    Full Text Available Dimension reduction has become inevitable for pre-processing of high dimensional data. “Gene expression microarray data” is an instance of such high dimensional data. Gene expression microarray data displays the maximum number of genes (features simultaneously at a molecular level with a very small number of samples. The copious numbers of genes are usually provided to a learning algorithm for producing a complete characterization of the classification task. However, most of the times the majority of the genes are irrelevant or redundant to the learning task. It will deteriorate the learning accuracy and training speed as well as lead to the problem of overfitting. Thus, dimension reduction of microarray data is a crucial preprocessing step for prediction and classification of disease. Various feature selection and feature extraction techniques have been proposed in the literature to identify the genes, that have direct impact on the various machine learning algorithms for classification and eliminate the remaining ones. This paper describes the taxonomy of dimension reduction methods with their characteristics, evaluation criteria, advantages and disadvantages. It also presents a review of numerous dimension reduction approaches for microarray data, mainly those methods that have been proposed over the past few years.

  1. Oral tongue cancer gene expression profiling: Identification of novel potential prognosticators by oligonucleotide microarray analysis

    International Nuclear Information System (INIS)

    Estilo, Cherry L; Boyle, Jay O; Kraus, Dennis H; Patel, Snehal; Shaha, Ashok R; Wong, Richard J; Huryn, Joseph M; Shah, Jatin P; Singh, Bhuvanesh; O-charoenrat, Pornchai; Talbot, Simon; Socci, Nicholas D; Carlson, Diane L; Ghossein, Ronald; Williams, Tijaana; Yonekawa, Yoshihiro; Ramanathan, Yegnanarayana

    2009-01-01

    The present study is aimed at identifying potential candidate genes as prognostic markers in human oral tongue squamous cell carcinoma (SCC) by large scale gene expression profiling. The gene expression profile of patients (n=37) with oral tongue SCC were analyzed using Affymetrix HG-U95Av2 high-density oligonucleotide arrays. Patients (n=20) from which there were available tumor and matched normal mucosa were grouped into stage (early vs. late) and nodal disease (node positive vs. node negative) subgroups and genes differentially expressed in tumor vs. normal and between the subgroups were identified. Three genes, GLUT3, HSAL2, and PACE4, were selected for their potential biological significance in a larger cohort of 49 patients via quantitative real-time RT-PCR. Hierarchical clustering analyses failed to show significant segregation of patients. In patients (n=20) with available tumor and matched normal mucosa, 77 genes were found to be differentially expressed (P< 0.05) in the tongue tumor samples compared to their matched normal controls. Among the 45 over-expressed genes, MMP-1 encoding interstitial collagenase showed the highest level of increase (average: 34.18 folds). Using the criterion of two-fold or greater as overexpression, 30.6%, 24.5% and 26.5% of patients showed high levels of GLUT3, HSAL2 and PACE4, respectively. Univariate analyses demonstrated that GLUT3 over-expression correlated with depth of invasion (P<0.0001), tumor size (P=0.024), pathological stage (P=0.009) and recurrence (P=0.038). HSAL2 was positively associated with depth of invasion (P=0.015) and advanced T stage (P=0.047). In survival studies, only GLUT3 showed a prognostic value with disease-free (P=0.049), relapse-free (P=0.002) and overall survival (P=0.003). PACE4 mRNA expression failed to show correlation with any of the relevant parameters. The characterization of genes identified to be significant predictors of prognosis by oligonucleotide microarray and further validation by

  2. Temporal gene expression profiling reveals CEBPD as a candidate regulator of brain disease in prosaposin deficient mice

    Directory of Open Access Journals (Sweden)

    Ran Huimin

    2008-08-01

    Full Text Available Abstract Background Prosaposin encodes, in tandem, four small acidic activator proteins (saposins with specificities for glycosphingolipid (GSL hydrolases in lysosomes. Extensive GSL storage occurs in various central nervous system regions in mammalian prosaposin deficiencies. Results Our hypomorphic prosaposin deficient mouse, PS-NA, exhibited 45% WT levels of brain saposins and showed neuropathology that included neuronal GSL storage and Purkinje cell loss. Impairment of neuronal function was observed as early as 6 wks as demonstrated by the narrow bridges tests. Temporal transcriptome microarray analyses of brain tissues were conducted with mRNA from three prosaposin deficient mouse models: PS-NA, prosaposin null (PS-/- and a V394L/V394L glucocerebrosidase mutation combined with PS-NA (4L/PS-NA. Gene expression alterations in cerebrum and cerebellum were detectable at birth preceding the neuronal deficits. Differentially expressed genes encompassed a broad spectrum of cellular functions. The number of down-regulated genes was constant, but up-regulated gene numbers increased with age. CCAAT/enhancer-binding protein delta (CEBPD was the only up-regulated transcription factor in these two brain regions of all three models. Network analyses revealed that CEBPD has functional relationships with genes in transcription, pro-inflammation, cell death, binding, myelin and transport. Conclusion These results show that: 1 Regionally specific gene expression abnormalities precede the brain histological and neuronal function changes, 2 Temporal gene expression profiles provide insights into the molecular mechanism during the GSL storage disease course, and 3 CEBPD is a candidate regulator of brain disease in prosaposin deficiency to participate in modulating disease acceleration or progression.

  3. Identification of self-consistent modulons from bacterial microarray expression data with the help of structured regulon gene sets

    KAUST Repository

    Permina, Elizaveta A.

    2013-01-01

    Identification of bacterial modulons from series of gene expression measurements on microarrays is a principal problem, especially relevant for inadequately studied but practically important species. Usage of a priori information on regulatory interactions helps to evaluate parameters for regulatory subnetwork inference. We suggest a procedure for modulon construction where a seed regulon is iteratively updated with genes having expression patterns similar to those for regulon member genes. A set of genes essential for a regulon is used to control modulon updating. Essential genes for a regulon were selected as a subset of regulon genes highly related by different measures to each other. Using Escherichia coli as a model, we studied how modulon identification depends on the data, including the microarray experiments set, the adopted relevance measure and the regulon itself. We have found that results of modulon identification are highly dependent on all parameters studied and thus the resulting modulon varies substantially depending on the identification procedure. Yet, modulons that were identified correctly displayed higher stability during iterations, which allows developing a procedure for reliable modulon identification in the case of less studied species where the known regulatory interactions are sparse. Copyright © 2013 Taylor & Francis.

  4. Comparative RNA-Seq and microarray analysis of gene expression changes in B-cell lymphomas of Canis familiaris.

    Directory of Open Access Journals (Sweden)

    Marie Mooney

    Full Text Available Comparative oncology is a developing research discipline that is being used to assist our understanding of human neoplastic diseases. Companion canines are a preferred animal oncology model due to spontaneous tumor development and similarity to human disease at the pathophysiological level. We use a paired RNA sequencing (RNA-Seq/microarray analysis of a set of four normal canine lymph nodes and ten canine lymphoma fine needle aspirates to identify technical biases and variation between the technologies and convergence on biological disease pathways. Surrogate Variable Analysis (SVA provides a formal multivariate analysis of the combined RNA-Seq/microarray data set. Applying SVA to the data allows us to decompose variation into contributions associated with transcript abundance, differences between the technology, and latent variation within each technology. A substantial and highly statistically significant component of the variation reflects transcript abundance, and RNA-Seq appeared more sensitive for detection of transcripts expressed at low levels. Latent random variation among RNA-Seq samples is also distinct in character from that impacting microarray samples. In particular, we observed variation between RNA-Seq samples that reflects transcript GC content. Platform-independent variable decomposition without a priori knowledge of the sources of variation using SVA represents a generalizable method for accomplishing cross-platform data analysis. We identified genes differentially expressed between normal lymph nodes of disease free dogs and a subset of the diseased dogs diagnosed with B-cell lymphoma using each technology. There is statistically significant overlap between the RNA-Seq and microarray sets of differentially expressed genes. Analysis of overlapping genes in the context of biological systems suggests elevated expression and activity of PI3K signaling in B-cell lymphoma biopsies compared with normal biopsies, consistent with

  5. Association of candidate genes with drought tolerance traits in diverse perennial ryegrass accessions

    Science.gov (United States)

    Xiaoqing Yu; Guihua Bai; Shuwei Liu; Na Luo; Ying Wang; Douglas S. Richmond; Paula M. Pijut; Scott A. Jackson; Jianming Yu; Yiwei. Jiang

    2013-01-01

    Drought is a major environmental stress limiting growth of perennial grasses in temperate regions. Plant drought tolerance is a complex trait that is controlled by multiple genes. Candidate gene association mapping provides a powerful tool for dissection of complex traits. Candidate gene association mapping of drought tolerance traits was conducted in 192 diverse...

  6. Identification of candidate genes for dyslexia susceptibility on chromosome 18.

    Directory of Open Access Journals (Sweden)

    Thomas S Scerri

    2010-10-01

    Full Text Available Six independent studies have identified linkage to chromosome 18 for developmental dyslexia or general reading ability. Until now, no candidate genes have been identified to explain this linkage. Here, we set out to identify the gene(s conferring susceptibility by a two stage strategy of linkage and association analysis.Linkage analysis: 264 UK families and 155 US families each containing at least one child diagnosed with dyslexia were genotyped with a dense set of microsatellite markers on chromosome 18. Association analysis: Using a discovery sample of 187 UK families, nearly 3000 SNPs were genotyped across the chromosome 18 dyslexia susceptibility candidate region. Following association analysis, the top ranking SNPs were then genotyped in the remaining samples. The linkage analysis revealed a broad signal that spans approximately 40 Mb from 18p11.2 to 18q12.2. Following the association analysis and subsequent replication attempts, we observed consistent association with the same SNPs in three genes; melanocortin 5 receptor (MC5R, dymeclin (DYM and neural precursor cell expressed, developmentally down-regulated 4-like (NEDD4L.Along with already published biological evidence, MC5R, DYM and NEDD4L make attractive candidates for dyslexia susceptibility genes. However, further replication and functional studies are still required.

  7. BEEF CATTLE MUSCULARITY CANDIDATE GENES

    Directory of Open Access Journals (Sweden)

    Irida Novianti

    2010-04-01

    Full Text Available Muscularity is a potential indicator for the selection of more productive cattle. Mapping quantitative trait loci (QTL for traits related to muscularity is useful to identify the genomic regions where the genes affecting muscularity reside. QTL analysis from a Limousin-Jersey double backcross herd was conducted using QTL Express software with cohort and breed as the fixed effects. Nine QTL suggested to have an association with muscularity were identified on cattle chromosomes BTA 1, 2, 3, 4, 5, 8, 12, 14 and 17. The myostatin gene is located at the centromeric end of chromosome 2 and not surprisingly, the Limousin myostatin F94L variant accounted for the QTL on BTA2. However, when the myostatin F94L genotype was included as an additional fixed effect, the QTL on BTA17 was also no longer significant. This result suggests that there may be gene(s that have epistatic effects with myostatin located on cattle chromosome 17. Based on the position of the QTL in base pairs, all the genes that reside in the region were determined using the Ensembl data base (www.ensembl.org. There were two potential candidate genes residing within these QTL regions were selected. They were Smad nuclear interacting protein 1 (SNIP1 and similar to follistatin-like 5 (FSTL5. (JIIPB 2010 Vol 20 No 1: 1-10

  8. Facilitating functional annotation of chicken microarray data

    Directory of Open Access Journals (Sweden)

    Gresham Cathy R

    2009-10-01

    Full Text Available Abstract Background Modeling results from chicken microarray studies is challenging for researchers due to little functional annotation associated with these arrays. The Affymetrix GenChip chicken genome array, one of the biggest arrays that serve as a key research tool for the study of chicken functional genomics, is among the few arrays that link gene products to Gene Ontology (GO. However the GO annotation data presented by Affymetrix is incomplete, for example, they do not show references linked to manually annotated functions. In addition, there is no tool that facilitates microarray researchers to directly retrieve functional annotations for their datasets from the annotated arrays. This costs researchers amount of time in searching multiple GO databases for functional information. Results We have improved the breadth of functional annotations of the gene products associated with probesets on the Affymetrix chicken genome array by 45% and the quality of annotation by 14%. We have also identified the most significant diseases and disorders, different types of genes, and known drug targets represented on Affymetrix chicken genome array. To facilitate functional annotation of other arrays and microarray experimental datasets we developed an Array GO Mapper (AGOM tool to help researchers to quickly retrieve corresponding functional information for their dataset. Conclusion Results from this study will directly facilitate annotation of other chicken arrays and microarray experimental datasets. Researchers will be able to quickly model their microarray dataset into more reliable biological functional information by using AGOM tool. The disease, disorders, gene types and drug targets revealed in the study will allow researchers to learn more about how genes function in complex biological systems and may lead to new drug discovery and development of therapies. The GO annotation data generated will be available for public use via AgBase website and

  9. Comparative analysis of methods for gene transcription profiling data derived from different microarray technologies in rat and mouse models of diabetes

    Directory of Open Access Journals (Sweden)

    Bihoreau Marie-Thérèse

    2009-02-01

    Full Text Available Abstract Background Microarray technologies are widely used to quantify the abundance of transcripts corresponding to thousands of genes. To maximise the robustness of transcriptome results, we have tested the performance and reproducibility of rat and mouse gene expression data obtained with Affymetrix, Illumina and Operon platforms. Results We present a thorough analysis of the degree of reproducibility provided by analysing the transcriptomic profile of the same animals of several experimental groups under different popular microarray technologies in different tissues. Concordant results from inter- and intra-platform comparisons were maximised by testing many popular computational methods for generating fold changes and significances and by only considering oligonucleotides giving high expression levels. The choice of Affymetrix signal extraction technique was shown to have the greatest effect on the concordance across platforms. In both species, when choosing optimal methods, the agreement between data generated on the Affymetrix and Illumina was excellent; this was verified using qRT-PCR on a selection of genes present on all platforms. Conclusion This study provides an extensive assessment of analytical methods best suited for processing data from different microarray technologies and can assist integration of technologically different gene expression datasets in biological systems.

  10. Application of four dyes in gene expression analyses by microarrays

    Directory of Open Access Journals (Sweden)

    van Schooten Frederik J

    2005-07-01

    Full Text Available Abstract Background DNA microarrays are widely used in gene expression analyses. To increase throughput and minimize costs without reducing gene expression data obtained, we investigated whether four mRNA samples can be analyzed simultaneously by applying four different fluorescent dyes. Results Following tests for cross-talk of fluorescence signals, Alexa 488, Alexa 594, Cyanine 3 and Cyanine 5 were selected for hybridizations. For self-hybridizations, a single RNA sample was labelled with all dyes and hybridized on commercial cDNA arrays or on in-house spotted oligonucleotide arrays. Correlation coefficients for all combinations of dyes were above 0.9 on the cDNA array. On the oligonucleotide array they were above 0.8, except combinations with Alexa 488, which were approximately 0.5. Standard deviation of expression differences for replicate spots were similar on the cDNA array for all dye combinations, but on the oligonucleotide array combinations with Alexa 488 showed a higher variation. Conclusion In conclusion, the four dyes can be used simultaneously for gene expression experiments on the tested cDNA array, but only three dyes can be used on the tested oligonucleotide array. This was confirmed by hybridizations of control with test samples, as all combinations returned similar numbers of differentially expressed genes with comparable effects on gene expression.

  11. Construction of a cDNA microarray derived from the ascidian Ciona intestinalis.

    Science.gov (United States)

    Azumi, Kaoru; Takahashi, Hiroki; Miki, Yasufumi; Fujie, Manabu; Usami, Takeshi; Ishikawa, Hisayoshi; Kitayama, Atsusi; Satou, Yutaka; Ueno, Naoto; Satoh, Nori

    2003-10-01

    A cDNA microarray was constructed from a basal chordate, the ascidian Ciona intestinalis. The draft genome of Ciona has been read and inferred to contain approximately 16,000 protein-coding genes, and cDNAs for transcripts of 13,464 genes have been characterized and compiled as the "Ciona intestinalis Gene Collection Release I". In the present study, we constructed a cDNA microarray of these 13,464 Ciona genes. A preliminary experiment with Cy3- and Cy5-labeled probes showed extensive differential gene expression between fertilized eggs and larvae. In addition, there was a good correlation between results obtained by the present microarray analysis and those from previous EST analyses. This first microarray of a large collection of Ciona intestinalis cDNA clones should facilitate the analysis of global gene expression and gene networks during the embryogenesis of basal chordates.

  12. Functional validation of GWAS gene candidates for abnormal liver function during zebrafish liver development

    Directory of Open Access Journals (Sweden)

    Leah Y. Liu

    2013-09-01

    Genome-wide association studies (GWAS have revealed numerous associations between many phenotypes and gene candidates. Frequently, however, further elucidation of gene function has not been achieved. A recent GWAS identified 69 candidate genes associated with elevated liver enzyme concentrations, which are clinical markers of liver disease. To investigate the role of these genes in liver homeostasis, we narrowed down this list to 12 genes based on zebrafish orthology, zebrafish liver expression and disease correlation. To assess the function of gene candidates during liver development, we assayed hepatic progenitors at 48 hours post fertilization (hpf and hepatocytes at 72 hpf using in situ hybridization following morpholino knockdown in zebrafish embryos. Knockdown of three genes (pnpla3, pklr and mapk10 decreased expression of hepatic progenitor cells, whereas knockdown of eight genes (pnpla3, cpn1, trib1, fads2, slc2a2, pklr, mapk10 and samm50 decreased cell-specific hepatocyte expression. We then induced liver injury in zebrafish embryos using acetaminophen exposure and observed changes in liver toxicity incidence in morphants. Prioritization of GWAS candidates and morpholino knockdown expedites the study of newly identified genes impacting liver development and represents a feasible method for initial assessment of candidate genes to instruct further mechanistic analyses. Our analysis can be extended to GWAS for additional disease-associated phenotypes.

  13. A cell spot microarray method for production of high density siRNA transfection microarrays

    Directory of Open Access Journals (Sweden)

    Mpindi John-Patrick

    2011-03-01

    Full Text Available Abstract Background High-throughput RNAi screening is widely applied in biological research, but remains expensive, infrastructure-intensive and conversion of many assays to HTS applications in microplate format is not feasible. Results Here, we describe the optimization of a miniaturized cell spot microarray (CSMA method, which facilitates utilization of the transfection microarray technique for disparate RNAi analyses. To promote rapid adaptation of the method, the concept has been tested with a panel of 92 adherent cell types, including primary human cells. We demonstrate the method in the systematic screening of 492 GPCR coding genes for impact on growth and survival of cultured human prostate cancer cells. Conclusions The CSMA method facilitates reproducible preparation of highly parallel cell microarrays for large-scale gene knockdown analyses. This will be critical towards expanding the cell based functional genetic screens to include more RNAi constructs, allow combinatorial RNAi analyses, multi-parametric phenotypic readouts or comparative analysis of many different cell types.

  14. Candidate luminal B breast cancer genes identified by genome, gene expression and DNA methylation profiling.

    Directory of Open Access Journals (Sweden)

    Stéphanie Cornen

    Full Text Available Breast cancers (BCs of the luminal B subtype are estrogen receptor-positive (ER+, highly proliferative, resistant to standard therapies and have a poor prognosis. To better understand this subtype we compared DNA copy number aberrations (CNAs, DNA promoter methylation, gene expression profiles, and somatic mutations in nine selected genes, in 32 luminal B tumors with those observed in 156 BCs of the other molecular subtypes. Frequent CNAs included 8p11-p12 and 11q13.1-q13.2 amplifications, 7q11.22-q34, 8q21.12-q24.23, 12p12.3-p13.1, 12q13.11-q24.11, 14q21.1-q23.1, 17q11.1-q25.1, 20q11.23-q13.33 gains and 6q14.1-q24.2, 9p21.3-p24,3, 9q21.2, 18p11.31-p11.32 losses. A total of 237 and 101 luminal B-specific candidate oncogenes and tumor suppressor genes (TSGs presented a deregulated expression in relation with their CNAs, including 11 genes previously reported associated with endocrine resistance. Interestingly, 88% of the potential TSGs are located within chromosome arm 6q, and seven candidate oncogenes are potential therapeutic targets. A total of 100 candidate oncogenes were validated in a public series of 5,765 BCs and the overexpression of 67 of these was associated with poor survival in luminal tumors. Twenty-four genes presented a deregulated expression in relation with a high DNA methylation level. FOXO3, PIK3CA and TP53 were the most frequent mutated genes among the nine tested. In a meta-analysis of next-generation sequencing data in 875 BCs, KCNB2 mutations were associated with luminal B cases while candidate TSGs MDN1 (6q15 and UTRN (6q24, were mutated in this subtype. In conclusion, we have reported luminal B candidate genes that may play a role in the development and/or hormone resistance of this aggressive subtype.

  15. Identification of Cell Surface Targets through Meta-analysis of Microarray Data

    Directory of Open Access Journals (Sweden)

    Henry Haeberle

    2012-07-01

    Full Text Available High-resolution image guidance for resection of residual tumor cells would enable more precise and complete excision for more effective treatment of cancers, such as medulloblastoma, the most common pediatric brain cancer. Numerous studies have shown that brain tumor patient outcomes correlate with the precision of resection. To enable guided resection with molecular specificity and cellular resolution, molecular probes that effectively delineate brain tumor boundaries are essential. Therefore, we developed a bioinformatics approach to analyze micro-array datasets for the identification of transcripts that encode candidate cell surface biomarkers that are highly enriched in medulloblastoma. The results identified 380 genes with greater than a two-fold increase in the expression in the medulloblastoma compared with that in the normal cerebellum. To enrich for targets with accessibility for extracellular molecular probes, we further refined this list by filtering it with gene ontology to identify genes with protein localization on, or within, the plasma membrane. To validate this meta-analysis, the top 10 candidates were evaluated with immunohistochemistry. We identified two targets, fibrillin 2 and EphA3, which specifically stain medulloblastoma. These results demonstrate a novel bioinformatics approach that successfully identified cell surface and extracellular candidate markers enriched in medulloblastoma versus adjacent cerebellum. These two proteins are high-value targets for the development of tumor-specific probes in medulloblastoma. This bioinformatics method has broad utility for the identification of accessible molecular targets in a variety of cancers and will enable probe development for guided resection.

  16. DNA Microarray Technology; TOPICAL

    International Nuclear Information System (INIS)

    WERNER-WASHBURNE, MARGARET; DAVIDSON, GEORGE S.

    2002-01-01

    Collaboration between Sandia National Laboratories and the University of New Mexico Biology Department resulted in the capability to train students in microarray techniques and the interpretation of data from microarray experiments. These studies provide for a better understanding of the role of stationary phase and the gene regulation involved in exit from stationary phase, which may eventually have important clinical implications. Importantly, this research trained numerous students and is the basis for three new Ph.D. projects

  17. Epidermal growth factor gene is a newly identified candidate gene for gout

    OpenAIRE

    Lin Han; Chunwei Cao; Zhaotong Jia; Shiguo Liu; Zhen Liu; Ruosai Xin; Can Wang; Xinde Li; Wei Ren; Xuefeng Wang; Changgui Li

    2016-01-01

    Chromosome 4q25 has been identified as a genomic region associated with gout. However, the associations of gout with the genes in this region have not yet been confirmed. Here, we performed two-stage analysis to determine whether variations in candidate genes in the 4q25 region are associated with gout in a male Chinese Han population. We first evaluated 96 tag single nucleotide polymorphisms (SNPs) in eight inflammatory/immune pathway- or glucose/lipid metabolism-related genes in the 4q25 re...

  18. Comparison of Nanostring nCounter® Data on FFPE Colon Cancer Samples and Affymetrix Microarray Data on Matched Frozen Tissues.

    Directory of Open Access Journals (Sweden)

    Xi Chen

    Full Text Available The prognosis of colorectal cancer (CRC stage II and III patients remains a challenge due to the difficulties of finding robust biomarkers suitable for testing clinical samples. The majority of published gene signatures of CRC have been generated on fresh frozen colorectal tissues. Because collection of frozen tissue is not practical for routine surgical pathology practice, a clinical test that improves prognostic capabilities beyond standard pathological staging of colon cancer will need to be designed for formalin-fixed paraffin-embedded (FFPE tissues. The NanoString nCounter® platform is a gene expression analysis tool developed for use with FFPE-derived samples. We designed a custom nCounter® codeset based on elements from multiple published fresh frozen tissue microarray-based prognostic gene signatures for colon cancer, and we used this platform to systematically compare gene expression data from FFPE with matched microarray array data from frozen tissues. Our results show moderate correlation of gene expression between two platforms and discovery of a small subset of genes as candidate biomarkers for colon cancer prognosis that are detectable and quantifiable in FFPE tissue sections.

  19. Feature Genes Selection Using Supervised Locally Linear Embedding and Correlation Coefficient for Microarray Classification.

    Science.gov (United States)

    Xu, Jiucheng; Mu, Huiyu; Wang, Yun; Huang, Fangzhou

    2018-01-01

    The selection of feature genes with high recognition ability from the gene expression profiles has gained great significance in biology. However, most of the existing methods have a high time complexity and poor classification performance. Motivated by this, an effective feature selection method, called supervised locally linear embedding and Spearman's rank correlation coefficient (SLLE-SC 2 ), is proposed which is based on the concept of locally linear embedding and correlation coefficient algorithms. Supervised locally linear embedding takes into account class label information and improves the classification performance. Furthermore, Spearman's rank correlation coefficient is used to remove the coexpression genes. The experiment results obtained on four public tumor microarray datasets illustrate that our method is valid and feasible.

  20. Association Study between BDNF Gene Polymorphisms and Autism by Three-Dimensional Gel-Based Microarray

    Directory of Open Access Journals (Sweden)

    Zuhong Lu

    2009-06-01

    Full Text Available Single nucleotide polymorphisms (SNPs are important markers which can be used in association studies searching for susceptible genes of complex diseases. High-throughput methods are needed for SNP genotyping in a large number of samples. In this study, we applied polyacrylamide gel-based microarray combined with dual-color hybridization for association study of four BDNF polymorphisms with autism. All the SNPs in both patients and controls could be analyzed quickly and correctly. Among four SNPs, only C270T polymorphism showed significant differences in the frequency of the allele (χ2 = 7.809, p = 0.005 and genotype (χ2 = 7.800, p = 0.020. In the haplotype association analysis, there was significant difference in global haplotype distribution between the groups (χ2 = 28.19,p = 3.44e-005. We suggest that BDNF has a possible role in the pathogenesis of autism. The study also show that the polyacrylamide gel-based microarray combined with dual-color hybridization is a rapid, simple and high-throughput method for SNPs genotyping, and can be used for association study of susceptible gene with disorders in large samples.

  1. Universal Reference RNA as a standard for microarray experiments

    Directory of Open Access Journals (Sweden)

    Fero Michael

    2004-03-01

    Full Text Available Abstract Background Obtaining reliable and reproducible two-color microarray gene expression data is critically important for understanding the biological significance of perturbations made on a cellular system. Microarray design, RNA preparation and labeling, hybridization conditions and data acquisition and analysis are variables difficult to simultaneously control. A useful tool for monitoring and controlling intra- and inter-experimental variation is Universal Reference RNA (URR, developed with the goal of providing hybridization signal at each microarray probe location (spot. Measuring signal at each spot as the ratio of experimental RNA to reference RNA targets, rather than relying on absolute signal intensity, decreases variability by normalizing signal output in any two-color hybridization experiment. Results Human, mouse and rat URR (UHRR, UMRR and URRR, respectively were prepared from pools of RNA derived from individual cell lines representing different tissues. A variety of microarrays were used to determine percentage of spots hybridizing with URR and producing signal above a user defined threshold (microarray coverage. Microarray coverage was consistently greater than 80% for all arrays tested. We confirmed that individual cell lines contribute their own unique set of genes to URR, arguing for a pool of RNA from several cell lines as a better configuration for URR as opposed to a single cell line source for URR. Microarray coverage comparing two separately prepared batches each of UHRR, UMRR and URRR were highly correlated (Pearson's correlation coefficients of 0.97. Conclusion Results of this study demonstrate that large quantities of pooled RNA from individual cell lines are reproducibly prepared and possess diverse gene representation. This type of reference provides a standard for reducing variation in microarray experiments and allows more reliable comparison of gene expression data within and between experiments and

  2. NF2 tumor suppressor gene: a comprehensive and efficient detection of somatic mutations by denaturing HPLC and microarray-CGH.

    Science.gov (United States)

    Szijan, Irene; Rochefort, Daniel; Bruder, Carl; Surace, Ezequiel; Machiavelli, Gloria; Dalamon, Viviana; Cotignola, Javier; Ferreiro, Veronica; Campero, Alvaro; Basso, Armando; Dumanski, Jan P; Rouleau, Guy A

    2003-01-01

    The NF2 tumor suppressor gene, located in chromosome 22q12, is involved in the development of multiple tumors of the nervous system, either associated with neurofibromatosis 2 or sporadic ones, mainly schwannomas and meningiomas. In order to evaluate the role of the NF2 gene in sporadic central nervous system (CNS) tumors, we analyzed NF2 mutations in 26 specimens: 14 meningiomas, 4 schwannomas, 4 metastases, and 4 other histopathological types of neoplasms. Denaturing high performance liquid chromatography (denaturing HPLC) and comparative genomic hybridization on a DNA microarray (microarray- CGH) were used as scanning methods for small mutations and gross rearrangements respectively. Small mutations were identified in six out of seventeen meningiomas and schwannomas, one mutation was novel. Large deletions were detected in six meningiomas. All mutations were predicted to result in truncated protein or in the absence of a large protein domain. No NF2 mutations were found in other histopathological types of CNS tumors. These results provide additional evidence that mutations in the NF2 gene play an important role in the development of sporadic meningiomas and schwannomas. Denaturing HPLC analysis of small mutations and microarray-CGH of large deletions are complementary, fast, and efficient methods for the detection of mutations in tumor tissues.

  3. Evaluation of Different Normalization and Analysis Procedures for Illumina Gene Expression Microarray Data Involving Small Changes

    Science.gov (United States)

    Johnstone, Daniel M.; Riveros, Carlos; Heidari, Moones; Graham, Ross M.; Trinder, Debbie; Berretta, Regina; Olynyk, John K.; Scott, Rodney J.; Moscato, Pablo; Milward, Elizabeth A.

    2013-01-01

    While Illumina microarrays can be used successfully for detecting small gene expression changes due to their high degree of technical replicability, there is little information on how different normalization and differential expression analysis strategies affect outcomes. To evaluate this, we assessed concordance across gene lists generated by applying different combinations of normalization strategy and analytical approach to two Illumina datasets with modest expression changes. In addition to using traditional statistical approaches, we also tested an approach based on combinatorial optimization. We found that the choice of both normalization strategy and analytical approach considerably affected outcomes, in some cases leading to substantial differences in gene lists and subsequent pathway analysis results. Our findings suggest that important biological phenomena may be overlooked when there is a routine practice of using only one approach to investigate all microarray datasets. Analytical artefacts of this kind are likely to be especially relevant for datasets involving small fold changes, where inherent technical variation—if not adequately minimized by effective normalization—may overshadow true biological variation. This report provides some basic guidelines for optimizing outcomes when working with Illumina datasets involving small expression changes. PMID:27605185

  4. Microarray expression analysis of genes involved in innate immune memory in peritoneal macrophages

    Directory of Open Access Journals (Sweden)

    Keisuke Yoshida

    2016-03-01

    Full Text Available Immunological memory has been believed to be a feature of the adaptive immune system for long period, but recent reports suggest that the innate immune system also exhibits memory-like reaction. Although evidence of innate immune memory is accumulating, no in vivo experimental data has clearly implicated a molecular mechanism, or even a cell-type, for this phenomenon. In this study of data deposited into Gene Expression Omnibus (GEO under GSE71111, we analyzed the expression profile of peritoneal macrophages isolated from mice pre-administrated with toll-like receptor (TLR ligands, mimicking pathogen infection. In these macrophages, increased expression of a group of innate immunity-related genes was sustained over a long period of time, and these genes overlapped with ATF7-regulated genes. We conclude that ATF7 plays an important role in innate immune memory in macrophages. Keywords: Macrophage, ATF7, Innate immune memory, Microarray

  5. Identification of potential biomarkers from microarray experiments using multiple criteria optimization

    International Nuclear Information System (INIS)

    Sánchez-Peña, Matilde L; Isaza, Clara E; Pérez-Morales, Jaileene; Rodríguez-Padilla, Cristina; Castro, José M; Cabrera-Ríos, Mauricio

    2013-01-01

    Microarray experiments are capable of determining the relative expression of tens of thousands of genes simultaneously, thus resulting in very large databases. The analysis of these databases and the extraction of biologically relevant knowledge from them are challenging tasks. The identification of potential cancer biomarker genes is one of the most important aims for microarray analysis and, as such, has been widely targeted in the literature. However, identifying a set of these genes consistently across different experiments, researches, microarray platforms, or cancer types is still an elusive endeavor. Besides the inherent difficulty of the large and nonconstant variability in these experiments and the incommensurability between different microarray technologies, there is the issue of the users having to adjust a series of parameters that significantly affect the outcome of the analyses and that do not have a biological or medical meaning. In this study, the identification of potential cancer biomarkers from microarray data is casted as a multiple criteria optimization (MCO) problem. The efficient solutions to this problem, found here through data envelopment analysis (DEA), are associated to genes that are proposed as potential cancer biomarkers. The method does not require any parameter adjustment by the user, and thus fosters repeatability. The approach also allows the analysis of different microarray experiments, microarray platforms, and cancer types simultaneously. The results include the analysis of three publicly available microarray databases related to cervix cancer. This study points to the feasibility of modeling the selection of potential cancer biomarkers from microarray data as an MCO problem and solve it using DEA. Using MCO entails a new optic to the identification of potential cancer biomarkers as it does not require the definition of a threshold value to establish significance for a particular gene and the selection of a normalization

  6.  DNA microarray-based gene expression profiling in diagnosis, assessing prognosis and predicting response to therapy in colorectal cancer

    Directory of Open Access Journals (Sweden)

    Przemysław Kwiatkowski

    2012-06-01

    Full Text Available  Colorectal cancer is the most common cancer of the gastrointestinal tract. It is considered as a biological model of a certain type of cancerogenesis process in which progression from an early to late stage adenoma and cancer is accompanied by distinct genetic alterations.Clinical and pathological parameters commonly used in clinical practice are often insufficient to determine groups of patients suitable for personalized treatment. Moreover, reliable molecular markers with high prognostic value have not yet been determined. Molecular studies using DNA-based microarrays have identified numerous genes involved in cell proliferation and differentiation during the process of cancerogenesis. Assessment of the genetic profile of colorectal cancer using the microarray technique might be a useful tool in determining the groups of patients with different clinical outcomes who would benefit from additional personalized treatment.The main objective of this study was to present the current state of knowledge on the practical application of gene profiling techniques using microarrays for determining diagnosis, prognosis and response to treatment in colorectal cancer.

  7. Defining a new candidate gene for amelogenesis imperfecta: from molecular genetics to biochemistry.

    Science.gov (United States)

    Urzúa, Blanca; Ortega-Pinto, Ana; Morales-Bozo, Irene; Rojas-Alcayaga, Gonzalo; Cifuentes, Víctor

    2011-02-01

    Amelogenesis imperfecta is a group of genetic conditions that affect the structure and clinical appearance of tooth enamel. The types (hypoplastic, hypocalcified, and hypomature) are correlated with defects in different stages of the process of enamel synthesis. Autosomal dominant, recessive, and X-linked types have been previously described. These disorders are considered clinically and genetically heterogeneous in etiology, involving a variety of genes, such as AMELX, ENAM, DLX3, FAM83H, MMP-20, KLK4, and WDR72. The mutations identified within these causal genes explain less than half of all cases of amelogenesis imperfecta. Most of the candidate and causal genes currently identified encode proteins involved in enamel synthesis. We think it is necessary to refocus the search for candidate genes using biochemical processes. This review provides theoretical evidence that the human SLC4A4 gene (sodium bicarbonate cotransporter) may be a new candidate gene.

  8. Computational analysis of candidate disease genes and variants for Salt-sensitive hypertension in indigenous Southern Africans

    KAUST Repository

    Tiffin, Nicki

    2010-09-27

    Multiple factors underlie susceptibility to essential hypertension, including a significant genetic and ethnic component, and environmental effects. Blood pressure response of hypertensive individuals to salt is heterogeneous, but salt sensitivity appears more prevalent in people of indigenous African origin. The underlying genetics of salt-sensitive hypertension, however, are poorly understood. In this study, computational methods including text- and data-mining have been used to select and prioritize candidate aetiological genes for salt-sensitive hypertension. Additionally, we have compared allele frequencies and copy number variation for single nucleotide polymorphisms in candidate genes between indigenous Southern African and Caucasian populations, with the aim of identifying candidate genes with significant variability between the population groups: identifying genetic variability between population groups can exploit ethnic differences in disease prevalence to aid with prioritisation of good candidate genes. Our top-ranking candidate genes include parathyroid hormone precursor (PTH) and type-1angiotensin II receptor (AGTR1). We propose that the candidate genes identified in this study warrant further investigation as potential aetiological genes for salt-sensitive hypertension. © 2010 Tiffin et al.

  9. A non-parametric meta-analysis approach for combining independent microarray datasets: application using two microarray datasets pertaining to chronic allograft nephropathy

    Directory of Open Access Journals (Sweden)

    Archer Kellie J

    2008-02-01

    Full Text Available Abstract Background With the popularity of DNA microarray technology, multiple groups of researchers have studied the gene expression of similar biological conditions. Different methods have been developed to integrate the results from various microarray studies, though most of them rely on distributional assumptions, such as the t-statistic based, mixed-effects model, or Bayesian model methods. However, often the sample size for each individual microarray experiment is small. Therefore, in this paper we present a non-parametric meta-analysis approach for combining data from independent microarray studies, and illustrate its application on two independent Affymetrix GeneChip studies that compared the gene expression of biopsies from kidney transplant recipients with chronic allograft nephropathy (CAN to those with normal functioning allograft. Results The simulation study comparing the non-parametric meta-analysis approach to a commonly used t-statistic based approach shows that the non-parametric approach has better sensitivity and specificity. For the application on the two CAN studies, we identified 309 distinct genes that expressed differently in CAN. By applying Fisher's exact test to identify enriched KEGG pathways among those genes called differentially expressed, we found 6 KEGG pathways to be over-represented among the identified genes. We used the expression measurements of the identified genes as predictors to predict the class labels for 6 additional biopsy samples, and the predicted results all conformed to their pathologist diagnosed class labels. Conclusion We present a new approach for combining data from multiple independent microarray studies. This approach is non-parametric and does not rely on any distributional assumptions. The rationale behind the approach is logically intuitive and can be easily understood by researchers not having advanced training in statistics. Some of the identified genes and pathways have been

  10. CDNA Microarray Based Comparative Gene Expression Analysis of Primary Breast Tumors Versus In Vitro Transformed Neoplastic Breast Epithelium

    National Research Council Canada - National Science Library

    Szallasi, Zoltan

    2001-01-01

    .... The first group of clones is being sorted by their ability to form tumors. We are currently performing cDNA microarray analysis quantifying the expression level of about 15,000 genes in these cell lines...

  11. Speeding disease gene discovery by sequence based candidate prioritization

    Directory of Open Access Journals (Sweden)

    Porteous David J

    2005-03-01

    Full Text Available Abstract Background Regions of interest identified through genetic linkage studies regularly exceed 30 centimorgans in size and can contain hundreds of genes. Traditionally this number is reduced by matching functional annotation to knowledge of the disease or phenotype in question. However, here we show that disease genes share patterns of sequence-based features that can provide a good basis for automatic prioritization of candidates by machine learning. Results We examined a variety of sequence-based features and found that for many of them there are significant differences between the sets of genes known to be involved in human hereditary disease and those not known to be involved in disease. We have created an automatic classifier called PROSPECTR based on those features using the alternating decision tree algorithm which ranks genes in the order of likelihood of involvement in disease. On average, PROSPECTR enriches lists for disease genes two-fold 77% of the time, five-fold 37% of the time and twenty-fold 11% of the time. Conclusion PROSPECTR is a simple and effective way to identify genes involved in Mendelian and oligogenic disorders. It performs markedly better than the single existing sequence-based classifier on novel data. PROSPECTR could save investigators looking at large regions of interest time and effort by prioritizing positional candidate genes for mutation detection and case-control association studies.

  12. Molecular genetic gene-environment studies using candidate genes in schizophrenia: a systematic review.

    Science.gov (United States)

    Modinos, Gemma; Iyegbe, Conrad; Prata, Diana; Rivera, Margarita; Kempton, Matthew J; Valmaggia, Lucia R; Sham, Pak C; van Os, Jim; McGuire, Philip

    2013-11-01

    The relatively high heritability of schizophrenia suggests that genetic factors play an important role in the etiology of the disorder. On the other hand, a number of environmental factors significantly influence its incidence. As few direct genetic effects have been demonstrated, and there is considerable inter-individual heterogeneity in the response to the known environmental factors, interactions between genetic and environmental factors may be important in determining whether an individual develops the disorder. To date, a considerable number of studies of gene-environment interactions (G×E) in schizophrenia have employed a hypothesis-based molecular genetic approach using candidate genes, which have led to a range of different findings. This systematic review aims to summarize the results from molecular genetic candidate studies and to review challenges and opportunities of this approach in psychosis research. Finally, we discuss the potential of future prospects, such as new studies that combine hypothesis-based molecular genetic candidate approaches with agnostic genome-wide association studies in determining schizophrenia risk. © 2013 Elsevier B.V. All rights reserved.

  13. Analysis of gene expression during odontogenic differentiation of cultured human dental pulp cells

    Directory of Open Access Journals (Sweden)

    Min-Seock Seo

    2012-08-01

    Full Text Available Objectives We analyzed gene-expression profiles after 14 day odontogenic induction of human dental pulp cells (DPCs using a DNA microarray and sought candidate genes possibly associated with mineralization. Materials and Methods Induced human dental pulp cells were obtained by culturing DPCs in odontogenic induction medium (OM for 14 day. Cells exposed to normal culture medium were used as controls. Total RNA was extracted from cells and analyzed by microarray analysis and the key results were confirmed selectively by reverse-transcriptase polymerase chain reaction (RT-PCR. We also performed a gene set enrichment analysis (GSEA of the microarray data. Results Six hundred and five genes among the 47,320 probes on the BeadChip differed by a factor of more than two-fold in the induced cells. Of these, 217 genes were upregulated, and 388 were down-regulated. GSEA revealed that in the induced cells, genes implicated in Apoptosis and Signaling by wingless MMTV integration (Wnt were significantly upregulated. Conclusions Genes implicated in Apoptosis and Signaling by Wnt are highly connected to the differentiation of dental pulp cells into odontoblast.

  14. Candidate innate immune system gene expression in the ecological model Daphnia.

    Science.gov (United States)

    Decaestecker, Ellen; Labbé, Pierrick; Ellegaard, Kirsten; Allen, Judith E; Little, Tom J

    2011-10-01

    The last ten years have witnessed increasing interest in host-pathogen interactions involving invertebrate hosts. The invertebrate innate immune system is now relatively well characterised, but in a limited range of genetic model organisms and under a limited number of conditions. Immune systems have been little studied under real-world scenarios of environmental variation and parasitism. Thus, we have investigated expression of candidate innate immune system genes in the water flea Daphnia, a model organism for ecological genetics, and whose capacity for clonal reproduction facilitates an exceptionally rigorous control of exposure dose or the study of responses at many time points. A unique characteristic of the particular Daphnia clones and pathogen strain combinations used presently is that they have been shown to be involved in specific host-pathogen coevolutionary interactions in the wild. We choose five genes, which are strong candidates to be involved in Daphnia-pathogen interactions, given that they have been shown to code for immune effectors in related organisms. Differential expression of these genes was quantified by qRT-PCR following exposure to the bacterial pathogen Pasteuria ramosa. Constitutive expression levels differed between host genotypes, and some genes appeared to show correlated expression. However, none of the genes appeared to show a major modification of expression level in response to Pasteuria exposure. By applying knowledge from related genetic model organisms (e.g. Drosophila) to models for the study of evolutionary ecology and coevolution (i.e. Daphnia), the candidate gene approach is temptingly efficient. However, our results show that detection of only weak patterns is likely if one chooses target genes for study based on previously identified genome sequences by comparison to homologues from other related organisms. Future work on the Daphnia-Pasteuria system will need to balance a candidate gene approach with more comprehensive

  15. Microarray analysis reveals key genes and pathways in Tetralogy of Fallot

    Science.gov (United States)

    He, Yue-E; Qiu, Hui-Xian; Jiang, Jian-Bing; Wu, Rong-Zhou; Xiang, Ru-Lian; Zhang, Yuan-Hai

    2017-01-01

    The aim of the present study was to identify key genes that may be involved in the pathogenesis of Tetralogy of Fallot (TOF) using bioinformatics methods. The GSE26125 microarray dataset, which includes cardiovascular tissue samples derived from 16 children with TOF and five healthy age-matched control infants, was downloaded from the Gene Expression Omnibus database. Differential expression analysis was performed between TOF and control samples to identify differentially expressed genes (DEGs) using Student's t-test, and the R/limma package, with a log2 fold-change of >2 and a false discovery rate of <0.01 set as thresholds. The biological functions of DEGs were analyzed using the ToppGene database. The ReactomeFIViz application was used to construct functional interaction (FI) networks, and the genes in each module were subjected to pathway enrichment analysis. The iRegulon plugin was used to identify transcription factors predicted to regulate the DEGs in the FI network, and the gene-transcription factor pairs were then visualized using Cytoscape software. A total of 878 DEGs were identified, including 848 upregulated genes and 30 downregulated genes. The gene FI network contained seven function modules, which were all comprised of upregulated genes. Genes enriched in Module 1 were enriched in the following three neurological disorder-associated signaling pathways: Parkinson's disease, Alzheimer's disease and Huntington's disease. Genes in Modules 0, 3 and 5 were dominantly enriched in pathways associated with ribosomes and protein translation. The Xbox binding protein 1 transcription factor was demonstrated to be involved in the regulation of genes encoding the subunits of cytoplasmic and mitochondrial ribosomes, as well as genes involved in neurodegenerative disorders. Therefore, dysfunction of genes involved in signaling pathways associated with neurodegenerative disorders, ribosome function and protein translation may contribute to the pathogenesis of TOF

  16. GENE EXPRESSION IN THE TESTES OF NORMOSPERMIC VERSUS TERATOSPERMIC DOMESTIC CATS USING HUMAN CDNA MICROARRAY ANALYSES

    Science.gov (United States)

    GENE EXPRESSION IN THE TESTES OF NORMOSPERMIC VERSUS TERATOSPERMIC DOMESTIC CATS USING HUMAN cDNA MICROARRAY ANALYSESB.S. Pukazhenthi1, J. C. Rockett2, M. Ouyang3, D.J. Dix2, J.G. Howard1, P. Georgopoulos4, W.J. J. Welsh3 and D. E. Wildt11Department of Reproductiv...

  17. Seeded Bayesian Networks: Constructing genetic networks from microarray data

    Directory of Open Access Journals (Sweden)

    Quackenbush John

    2008-07-01

    Full Text Available Abstract Background DNA microarrays and other genomics-inspired technologies provide large datasets that often include hidden patterns of correlation between genes reflecting the complex processes that underlie cellular metabolism and physiology. The challenge in analyzing large-scale expression data has been to extract biologically meaningful inferences regarding these processes – often represented as networks – in an environment where the datasets are often imperfect and biological noise can obscure the actual signal. Although many techniques have been developed in an attempt to address these issues, to date their ability to extract meaningful and predictive network relationships has been limited. Here we describe a method that draws on prior information about gene-gene interactions to infer biologically relevant pathways from microarray data. Our approach consists of using preliminary networks derived from the literature and/or protein-protein interaction data as seeds for a Bayesian network analysis of microarray results. Results Through a bootstrap analysis of gene expression data derived from a number of leukemia studies, we demonstrate that seeded Bayesian Networks have the ability to identify high-confidence gene-gene interactions which can then be validated by comparison to other sources of pathway data. Conclusion The use of network seeds greatly improves the ability of Bayesian Network analysis to learn gene interaction networks from gene expression data. We demonstrate that the use of seeds derived from the biomedical literature or high-throughput protein-protein interaction data, or the combination, provides improvement over a standard Bayesian Network analysis, allowing networks involving dynamic processes to be deduced from the static snapshots of biological systems that represent the most common source of microarray data. Software implementing these methods has been included in the widely used TM4 microarray analysis package.

  18. Patterns of gene expression in carp liver after exposure to a mixture of waterborne and dietary cadmium using a custom-made microarray

    International Nuclear Information System (INIS)

    Reynders, Hans; Ven, Karlijn van der; Moens, Lotte N.; Remortel, Piet van; De Coen, Wim M.; Blust, Ronny

    2006-01-01

    Gene expression changes in carp liver tissue were studied after acute (3 and 24 h) and subchronic (7 and 28 days) exposure to a mixture of waterborne (9, 105 and 480 μg/l) and dietary (9.5, 122 and 144 μg/g) cadmium, using a custom-made microarray. Suppression subtractive hybridization-PCR (SSH-PCR) was applied to isolate a set of 643 liver genes, involved in multiple biological pathways, such as energy metabolism (e.g. glucokinase), immune response (e.g. complement C3) and stress and detoxification (e.g. cytochrome P450 2F2, glutathione-S-transferase pi). These genes were subsequently spotted on glass-slides for the construction of a custom-made microarray. Resulting microarray hybridizations indicated a highly dynamic response to cadmium exposure. At low exposure concentrations (9 μg/l through water and 9.5 μg/g dry weight through food) mostly energy-related genes (e.g. glucokinase, elastase) were influenced, while a general stress response was obvious through induction of several stress-related genes, including hemopexin and cytochrome P450 2F2, at high cadmium concentrations. In addition, fish exposed to the highest cadmium concentrations showed liver damage after 7 days of exposure, as measured by elevated alanine transaminase activity in plasma and increased liver water content (wet-to-dry weight ratio). Moreover, decreased hematocrit and growth were found at the end of the experiment. Altogether this study clearly demonstrated the importance of varying exposure conditions for the characterization of the molecular impact of cadmium and showed that microarray results can provide important information, required to unravel the molecular events and responses related to cadmium exposure

  19. Discovery of possible gene relationships through the application of self-organizing maps to DNA microarray databases.

    Science.gov (United States)

    Chavez-Alvarez, Rocio; Chavoya, Arturo; Mendez-Vazquez, Andres

    2014-01-01

    DNA microarrays and cell cycle synchronization experiments have made possible the study of the mechanisms of cell cycle regulation of Saccharomyces cerevisiae by simultaneously monitoring the expression levels of thousands of genes at specific time points. On the other hand, pattern recognition techniques can contribute to the analysis of such massive measurements, providing a model of gene expression level evolution through the cell cycle process. In this paper, we propose the use of one of such techniques--an unsupervised artificial neural network called a Self-Organizing Map (SOM)-which has been successfully applied to processes involving very noisy signals, classifying and organizing them, and assisting in the discovery of behavior patterns without requiring prior knowledge about the process under analysis. As a test bed for the use of SOMs in finding possible relationships among genes and their possible contribution in some biological processes, we selected 282 S. cerevisiae genes that have been shown through biological experiments to have an activity during the cell cycle. The expression level of these genes was analyzed in five of the most cited time series DNA microarray databases used in the study of the cell cycle of this organism. With the use of SOM, it was possible to find clusters of genes with similar behavior in the five databases along two cell cycles. This result suggested that some of these genes might be biologically related or might have a regulatory relationship, as was corroborated by comparing some of the clusters obtained with SOMs against a previously reported regulatory network that was generated using biological knowledge, such as protein-protein interactions, gene expression levels, metabolism dynamics, promoter binding, and modification, regulation and transport of proteins. The methodology described in this paper could be applied to the study of gene relationships of other biological processes in different organisms.

  20. Identification of reference genes for quantitative expression analysis using large-scale RNA-seq data of Arabidopsis thaliana and model crop plants.

    Science.gov (United States)

    Kudo, Toru; Sasaki, Yohei; Terashima, Shin; Matsuda-Imai, Noriko; Takano, Tomoyuki; Saito, Misa; Kanno, Maasa; Ozaki, Soichi; Suwabe, Keita; Suzuki, Go; Watanabe, Masao; Matsuoka, Makoto; Takayama, Seiji; Yano, Kentaro

    2016-10-13

    In quantitative gene expression analysis, normalization using a reference gene as an internal control is frequently performed for appropriate interpretation of the results. Efforts have been devoted to exploring superior novel reference genes using microarray transcriptomic data and to evaluating commonly used reference genes by targeting analysis. However, because the number of specifically detectable genes is totally dependent on probe design in the microarray analysis, exploration using microarray data may miss some of the best choices for the reference genes. Recently emerging RNA sequencing (RNA-seq) provides an ideal resource for comprehensive exploration of reference genes since this method is capable of detecting all expressed genes, in principle including even unknown genes. We report the results of a comprehensive exploration of reference genes using public RNA-seq data from plants such as Arabidopsis thaliana (Arabidopsis), Glycine max (soybean), Solanum lycopersicum (tomato) and Oryza sativa (rice). To select reference genes suitable for the broadest experimental conditions possible, candidates were surveyed by the following four steps: (1) evaluation of the basal expression level of each gene in each experiment; (2) evaluation of the expression stability of each gene in each experiment; (3) evaluation of the expression stability of each gene across the experiments; and (4) selection of top-ranked genes, after ranking according to the number of experiments in which the gene was expressed stably. Employing this procedure, 13, 10, 12 and 21 top candidates for reference genes were proposed in Arabidopsis, soybean, tomato and rice, respectively. Microarray expression data confirmed that the expression of the proposed reference genes under broad experimental conditions was more stable than that of commonly used reference genes. These novel reference genes will be useful for analyzing gene expression profiles across experiments carried out under various

  1. Expressed sequence tags from larval gut of the European corn borer (Ostrinia nubilalis: Exploring candidate genes potentially involved in Bacillus thuringiensis toxicity and resistance

    Directory of Open Access Journals (Sweden)

    Crespo Andre LB

    2009-06-01

    chymotrypsin-like protease genes, and 1 aminopeptidase genes in the resistant strain as compared with the susceptible strain. In contrast, the expression of 3 trypsin- like and 3 chymotrypsin-like protease genes, 2 aminopeptidase genes, and 2 alkaline phosphatase genes were increased in the resistant strain. Such differential expressions of the candidate genes may suggest their involvement in Cry1Ab resistance. Indeed, certain trypsin-like and chymotrypsin-like proteases have previously been found to activate or degrade Bt protoxins and toxins, whereas several aminopeptidases, cadherin-like proteins and alkaline phosphatases have been demonstrated to serve as Bt receptor proteins in other insect species. Conclusion We developed a relatively large EST database consisting of 12,519 high-quality sequences from a total of 15,000 cDNAs from the larval gut of ECB. To our knowledge, this database represents the largest gut-specific EST database from a lepidopteran pest. Our work provides a foundation for future research to develop an ECB gut-specific DNA microarray which can be used to analyze the global changes of gene expression in response to Bt protoxins/toxins and the genetic difference(s between Bt- resistant and susceptible strains. Furthermore, we identified 52 candidate genes that may potentially be involved in Bt toxicity and resistance. Differential expressions of 15 out of the 41 selected candidate genes examined by RT-PCR, including 5 genes with apparently decreased expression and 10 with increased expression in Cry1Ab-resistant strain, may help us conclusively identify the candidate genes involved in Bt resistance and provide us with new insights into the mechanism of Cry1Ab resistance in ECB.

  2. Exploring the key genes and pathways in enchondromas using a gene expression microarray.

    Science.gov (United States)

    Shi, Zhongju; Zhou, Hengxing; Pan, Bin; Lu, Lu; Kang, Yi; Liu, Lu; Wei, Zhijian; Feng, Shiqing

    2017-07-04

    Enchondromas are the most common primary benign osseous neoplasms that occur in the medullary bone; they can undergo malignant transformation into chondrosarcoma. However, enchondromas are always undetected in patients, and the molecular mechanism is unclear. To identify key genes and pathways associated with the occurrence and development of enchondromas, we downloaded the gene expression dataset GSE22855 and obtained the differentially expressed genes (DEGs) by analyzing high-throughput gene expression in enchondromas. In total, 635 genes were identified as DEGs. Of these, 225 genes (35.43%) were up-regulated, and the remaining 410 genes (64.57%) were down-regulated. We identified the predominant gene ontology (GO) categories and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways that were significantly over-represented in the enchondromas samples compared with the control samples. Subsequently the top 10 core genes were identified from the protein-protein interaction (PPI) network. The enrichment analyses of the genes mainly involved in two significant modules showed that the DEGs were principally related to ribosomes, protein digestion and absorption, ECM-receptor interaction, focal adhesion, amoebiasis and the PI3K-Akt signaling pathway.Together, these data elucidate the molecular mechanisms underlying the occurrence and development of enchondromas and provide promising candidates for therapeutic intervention and prognostic evaluation. However, further experimental studies are needed to confirm these results.

  3. Microarray-based apoptosis gene screening technique in trichostatin A-induced drug-resisted lung cancer A549/CDDP cells

    Directory of Open Access Journals (Sweden)

    Ya-jun WANG

    2016-09-01

    Full Text Available Objective  To detect the expression profile changes of apoptosis-related genes in trichostatin A (TSA-induced drug-resisted lung cancer cells A549/CDDP by microarray, in order to screen the target genes in TSA treating cisplatin-resisted lung cancer. Methods  A549/CDDP cells were treated by TSA for 24 hours. Total RNA was extracted and reversely transcribed into cDNA. Gene expression levels were detected by the NimbleGen whole genome microarray. Differences of expression profiles between TSA-treated and control group were measured by NimbleScan 2.5 software and GO analysis. Apoptosis and proliferation related genes were screened from the expression changed genes. Results  Compared with the control group, 85 apoptosis-related genes were up-regulated and 43 growth or proliferation related genes were down-regulated in the TSA-treated group. GO analysis showed that the functions of these genes are mainly regulating apoptosis, cell resistance to chem ical stimuli protein, as well as regulating cell growth, proliferation and the biological process of maintaining the cell biological quality. TSA-activated not only the mitochondrial apoptotic pathways, but also the death receptor related apoptosis pathway, and down-regulated the drug resistance related genes BAG3 and ABCC2. Conclusion  TSA may cause the expression changes of apoptotic and proliferation genes in A549/CDDP cells, these genes may play a role in TSA treating cisplatin-resisted lung cancer. DOI: 10.11855/j.issn.0577-7402.2016.08.07

  4. Survey of Candidate Genes for Maize Resistance to Infection by Aspergillus flavus and/or Aflatoxin Contamination

    Science.gov (United States)

    Hawkins, Leigh K.; Tang, Juliet D.; Tomashek, John; Alves Oliveira, Dafne; Ogunola, Oluwaseun F.; Smith, J. Spencer; Williams, W. Paul

    2018-01-01

    Many projects have identified candidate genes for resistance to aflatoxin accumulation or Aspergillus flavus infection and growth in maize using genetic mapping, genomics, transcriptomics and/or proteomics studies. However, only a small percentage of these candidates have been validated in field conditions, and their relative contribution to resistance, if any, is unknown. This study presents a consolidated list of candidate genes identified in past studies or in-house studies, with descriptive data including genetic location, gene annotation, known protein identifiers, and associated pathway information, if known. A candidate gene pipeline to test the phenotypic effect of any maize DNA sequence on aflatoxin accumulation resistance was used in this study to determine any measurable effect on polymorphisms within or linked to the candidate gene sequences, and the results are published here. PMID:29385107

  5. Survey of Candidate Genes for Maize Resistance to Infection by Aspergillus flavus and/or Aflatoxin Contamination

    Directory of Open Access Journals (Sweden)

    Leigh K. Hawkins

    2018-01-01

    Full Text Available Many projects have identified candidate genes for resistance to aflatoxin accumulation or Aspergillus flavus infection and growth in maize using genetic mapping, genomics, transcriptomics and/or proteomics studies. However, only a small percentage of these candidates have been validated in field conditions, and their relative contribution to resistance, if any, is unknown. This study presents a consolidated list of candidate genes identified in past studies or in-house studies, with descriptive data including genetic location, gene annotation, known protein identifiers, and associated pathway information, if known. A candidate gene pipeline to test the phenotypic effect of any maize DNA sequence on aflatoxin accumulation resistance was used in this study to determine any measurable effect on polymorphisms within or linked to the candidate gene sequences, and the results are published here.

  6. Microarray Analysis in a Cell Death Resistant Glioma Cell Line to Identify Signaling Pathways and Novel Genes Controlling Resistance and Malignancy

    Energy Technology Data Exchange (ETDEWEB)

    Seznec, Janina; Naumann, Ulrike, E-mail: ulrike.naumann@uni-tuebingen.de [Laboratory of Molecular Neuro-Oncology, Department of General Neurology, Hertie-Institute for Clinical Brain Research and Center Neurology, University of Tuebingen, Otfried-Mueller-Str. 27, Tuebingen 72076 (Germany)

    2011-06-27

    Glioblastoma multiforme (GBM) is a lethal type of cancer mainly resistant to radio- and chemotherapy. Since the tumor suppressor p53 functions as a transcription factor regulating the expression of genes involved in growth inhibition, DNA repair and apoptosis, we previously assessed whether specific differences in the modulation of gene expression are responsible for the anti-tumor properties of a dominant positive p53, chimeric tumor suppressor (CTS)-1. CTS-1 is based on the sequence of p53 and designed to resist various mechanisms of inactivation which limit the activity of p53. To identify CTS-1-regulated cell death-inducing genes, we generated a CTS-1-resistant glioma cell line (229R). We used Affymetrix whole-genome microarray expression analysis to analyze alterations in gene expression and identified a variety of CTS-1 regulated genes involved in cancer-linked processes. 313 genes were differentially expressed in Adeno-CTS-1 (Ad-CTS-1)-infected and 700 genes in uninfected 229R cells compared to matching parental cells. Ingenuity Pathway Analysis (IPA) determined a variety of differentially expressed genes in Ad-CTS-1-infected cells that were members of the intracellular networks with central tumor-involved players such as nuclear factor kappa B (NF-κB), protein kinase B (PKB/AKT) or transforming growth factor beta (TGF-β). Differentially regulated genes include secreted factors as well as intracellular proteins and transcription factors regulating not only cell death, but also processes such as tumor cell motility and immunity. This work gives an overview of the pathways differentially regulated in the resistant versus parental glioma cells and might be helpful to identify candidate genes which could serve as targets to develop novel glioma specific therapy strategies.

  7. Microarray Analysis in a Cell Death Resistant Glioma Cell Line to Identify Signaling Pathways and Novel Genes Controlling Resistance and Malignancy

    International Nuclear Information System (INIS)

    Seznec, Janina; Naumann, Ulrike

    2011-01-01

    Glioblastoma multiforme (GBM) is a lethal type of cancer mainly resistant to radio- and chemotherapy. Since the tumor suppressor p53 functions as a transcription factor regulating the expression of genes involved in growth inhibition, DNA repair and apoptosis, we previously assessed whether specific differences in the modulation of gene expression are responsible for the anti-tumor properties of a dominant positive p53, chimeric tumor suppressor (CTS)-1. CTS-1 is based on the sequence of p53 and designed to resist various mechanisms of inactivation which limit the activity of p53. To identify CTS-1-regulated cell death-inducing genes, we generated a CTS-1-resistant glioma cell line (229R). We used Affymetrix whole-genome microarray expression analysis to analyze alterations in gene expression and identified a variety of CTS-1 regulated genes involved in cancer-linked processes. 313 genes were differentially expressed in Adeno-CTS-1 (Ad-CTS-1)-infected and 700 genes in uninfected 229R cells compared to matching parental cells. Ingenuity Pathway Analysis (IPA) determined a variety of differentially expressed genes in Ad-CTS-1-infected cells that were members of the intracellular networks with central tumor-involved players such as nuclear factor kappa B (NF-κB), protein kinase B (PKB/AKT) or transforming growth factor beta (TGF-β). Differentially regulated genes include secreted factors as well as intracellular proteins and transcription factors regulating not only cell death, but also processes such as tumor cell motility and immunity. This work gives an overview of the pathways differentially regulated in the resistant versus parental glioma cells and might be helpful to identify candidate genes which could serve as targets to develop novel glioma specific therapy strategies

  8. Design of an Enterobacteriaceae Pan-genome Microarray Chip

    DEFF Research Database (Denmark)

    Lukjancenko, Oksana; Ussery, David

    2010-01-01

    -density microarray chip has been designed, using 116 Enterobacteriaceae genome sequences, taking into account the enteric pan-genome. Probes for the microarray were checked in silico and performance of the chip, based on experimental strains from four different genera, demonstrate a relatively high ability...... to distinguish those strains on genus, species, and pathotype/serovar levels. Additionally, the microarray performed well when investigating which genes were found in a given strain of interest. The Enterobacteriaceae pan-genome microarray, based on 116 genomes, provides a valuable tool for determination...

  9. PSPHL as a candidate gene influencing racial disparities in endometrial cancer incidence and survival

    Directory of Open Access Journals (Sweden)

    Jay eAllard

    2012-07-01

    Full Text Available Endometrial cancer is the most commonly diagnosed gynecologic malignancy in the United States and is characterized by a well recognized racial disparity in both incidence and survival. Specifically Caucasians are about two times more likely to develop endometrial cancer than are African Americans. However, African American women are more likely to die from this disease than are Caucasians. The basis for this disparity remains unknown. Previous studies have identified differences in the types and frequencies of gene mutations among endometrial cancers from Caucasians and African Americans suggesting. We performed a gene expression microarray study in an effort to further examine differences between African American and Caucasian women’s endometrial cancers. This expression screen identified a list of potential biomarkers differentially expressed between these two groups of cancers. Of these we identified a poorly characterized transcript with a region of homology to phospho serine phospatase (PSPH and designated phospho serine phospatase like (PSPHL as the most differentially over-expressed gene in cancers from African Americans. We clarified the nature of expressed transcripts. Northern blot analysis confirmed PSPHL messages under 1 KB. Sequence analysis of transcripts confirmed two alternate open reading frame (ORF isoforms due to alternative splicing events. Splice specific primer sets confirmed both isoforms were differentially expressed in tissues from Caucasians and African Americans. We further examined the expression in other tissues from women to include normal endometrium, normal and malignant ovary. In all cases PSPHL expression was more often present in tissues from African-Americans than Caucasians. Our data confirm the African-American based expression of the PSPHL transcript several tissue types. PSPHL represents a candidate gene that might influence the observed racial disparity in endometrial and other cancers.

  10. DNA microarray data and contextual analysis of correlation graphs

    Directory of Open Access Journals (Sweden)

    Hingamp Pascal

    2003-04-01

    Full Text Available Abstract Background DNA microarrays are used to produce large sets of expression measurements from which specific biological information is sought. Their analysis requires efficient and reliable algorithms for dimensional reduction, classification and annotation. Results We study networks of co-expressed genes obtained from DNA microarray experiments. The mathematical concept of curvature on graphs is used to group genes or samples into clusters to which relevant gene or sample annotations are automatically assigned. Application to publicly available yeast and human lymphoma data demonstrates the reliability of the method in spite of its simplicity, especially with respect to the small number of parameters involved. Conclusions We provide a method for automatically determining relevant gene clusters among the many genes monitored with microarrays. The automatic annotations and the graphical interface improve the readability of the data. A C++ implementation, called Trixy, is available from http://tagc.univ-mrs.fr/bioinformatics/trixy.html.

  11. Identification of genes associated with cisplatin resistance in human oral squamous cell carcinoma cell line

    International Nuclear Information System (INIS)

    Zhang, Ping; Zhang, Zhiyuan; Zhou, Xiaojian; Qiu, Weiliu; Chen, Fangan; Chen, Wantao

    2006-01-01

    Cisplatin is widely used for chemotherapy of head and neck squamous cell carcinoma. However, details of the molecular mechanism responsible for cisplatin resistance are still unclear. The aim of this study was to identify the expression of genes related to cisplatin resistance in oral squamous cell carcinoma cells. A cisplatin-resistant cell line, Tca/cisplatin, was established from a cisplatin-sensitive cell line, Tca8113, which was derived from moderately-differentiated tongue squamous cell carcinoma. Global gene expression in this resistant cell line and its sensitive parent cell line was analyzed using Affymetrix HG-U95Av2 microarrays. Candidate genes involved in DNA repair, the MAP pathway and cell cycle regulation were chosen to validate the microarray analysis results. Cell cycle distribution and apoptosis following cisplatin exposure were also investigated. Cisplatin resistance in Tca/cisplatin cells was stable for two years in cisplatin-free culture medium. The IC50 for cisplatin in Tca/cisplatin was 6.5-fold higher than that in Tca8113. Microarray analysis identified 38 genes that were up-regulated and 25 that were down-regulated in this cell line. Some were novel candidates, while others are involved in well-characterized mechanisms that could be relevant to cisplatin resistance, such as RECQL for DNA repair and MAP2K6 in the MAP pathway; all the genes were further validated by Real-time PCR. The cell cycle-regulated genes CCND1 and CCND3 were involved in cisplatin resistance; 24-hour exposure to 10 μM cisplatin induced a marked S phase block in Tca/cisplatin cells but not in Tca8113 cells. The Tca8113 cell line and its stable drug-resistant variant Tca/cisplatin provided a useful model for identifying candidate genes responsible for the mechanism of cisplatin resistance in oral squamous cell carcinoma. Our data provide a useful basis for screening candidate targets for early diagnosis and further intervention in cisplatin resistance

  12. Identification of genes associated with cisplatin resistance in human oral squamous cell carcinoma cell line

    Directory of Open Access Journals (Sweden)

    Zhang Ping

    2006-09-01

    Full Text Available Abstract Background Cisplatin is widely used for chemotherapy of head and neck squamous cell carcinoma. However, details of the molecular mechanism responsible for cisplatin resistance are still unclear. The aim of this study was to identify the expression of genes related to cisplatin resistance in oral squamous cell carcinoma cells. Methods A cisplatin-resistant cell line, Tca/cisplatin, was established from a cisplatin-sensitive cell line, Tca8113, which was derived from moderately-differentiated tongue squamous cell carcinoma. Global gene expression in this resistant cell line and its sensitive parent cell line was analyzed using Affymetrix HG-U95Av2 microarrays. Candidate genes involved in DNA repair, the MAP pathway and cell cycle regulation were chosen to validate the microarray analysis results. Cell cycle distribution and apoptosis following cisplatin exposure were also investigated. Results Cisplatin resistance in Tca/cisplatin cells was stable for two years in cisplatin-free culture medium. The IC50 for cisplatin in Tca/cisplatin was 6.5-fold higher than that in Tca8113. Microarray analysis identified 38 genes that were up-regulated and 25 that were down-regulated in this cell line. Some were novel candidates, while others are involved in well-characterized mechanisms that could be relevant to cisplatin resistance, such as RECQL for DNA repair and MAP2K6 in the MAP pathway; all the genes were further validated by Real-time PCR. The cell cycle-regulated genes CCND1 and CCND3 were involved in cisplatin resistance; 24-hour exposure to 10 μM cisplatin induced a marked S phase block in Tca/cisplatin cells but not in Tca8113 cells. Conclusion The Tca8113 cell line and its stable drug-resistant variant Tca/cisplatin provided a useful model for identifying candidate genes responsible for the mechanism of cisplatin resistance in oral squamous cell carcinoma. Our data provide a useful basis for screening candidate targets for early diagnosis

  13. A Generally Applicable Translational Strategy Identifies S100A4 as a Candidate Gene in Allergy

    DEFF Research Database (Denmark)

    Bruhn, Sören; Fang, Yu; Barrenäs, Fredrik

    2014-01-01

    The identification of diagnostic markers and therapeutic candidate genes in common diseases is complicated by the involvement of thousands of genes. We hypothesized that genes co-regulated with a key gene in allergy, IL13, would form a module that could help to identify candidate genes. We identi...

  14. The Arabidopsis co-expression tool (act): a WWW-based tool and database for microarray-based gene expression analysis

    DEFF Research Database (Denmark)

    Jen, C. H.; Manfield, I. W.; Michalopoulos, D. W.

    2006-01-01

    be examined using the novel clique finder tool to determine the sets of genes most likely to be regulated in a similar manner. In combination, these tools offer three levels of analysis: creation of correlation lists of co-expressed genes, refinement of these lists using two-dimensional scatter plots......We present a new WWW-based tool for plant gene analysis, the Arabidopsis Co-Expression Tool (act) , based on a large Arabidopsis thaliana microarray data set obtained from the Nottingham Arabidopsis Stock Centre. The co-expression analysis tool allows users to identify genes whose expression...

  15. PAX3 gene deletion detected by microarray analysis in a girl with hearing loss.

    Science.gov (United States)

    Drozniewska, Malgorzata; Haus, Olga

    2014-01-01

    Deletions of the PAX3 gene have been rarely reported in the literature. Mutations of this gene are a common cause of Waardenburg syndrome type 1 and 3. We report a 16 year old female presenting hearing loss and normal intellectual development, without major features of Waardenburg syndrome type 1, and without family history of the syndrome. Her phenotype, however, overlaps with features of craniofacial-deafness-hand syndrome. Microarray analysis showed ~862 kb de novo deletion at 2q36.1 including PAX3. The above findings suggest that the rearrangement found in our patient appeared de novo and with high probability is a cause of her phenotype.

  16. Translating microarray data for diagnostic testing in childhood leukaemia

    International Nuclear Information System (INIS)

    Hoffmann, Katrin; Firth, Martin J; Beesley, Alex H; Klerk, Nicholas H de; Kees, Ursula R

    2006-01-01

    Recent findings from microarray studies have raised the prospect of a standardized diagnostic gene expression platform to enhance accurate diagnosis and risk stratification in paediatric acute lymphoblastic leukaemia (ALL). However, the robustness as well as the format for such a diagnostic test remains to be determined. As a step towards clinical application of these findings, we have systematically analyzed a published ALL microarray data set using Robust Multi-array Analysis (RMA) and Random Forest (RF). We examined published microarray data from 104 ALL patients specimens, that represent six different subgroups defined by cytogenetic features and immunophenotypes. Using the decision-tree based supervised learning algorithm Random Forest (RF), we determined a small set of genes for optimal subgroup distinction and subsequently validated their predictive power in an independent patient cohort. We achieved very high overall ALL subgroup prediction accuracies of about 98%, and were able to verify the robustness of these genes in an independent panel of 68 specimens obtained from a different institution and processed in a different laboratory. Our study established that the selection of discriminating genes is strongly dependent on the analysis method. This may have profound implications for clinical use, particularly when the classifier is reduced to a small set of genes. We have demonstrated that as few as 26 genes yield accurate class prediction and importantly, almost 70% of these genes have not been previously identified as essential for class distinction of the six ALL subgroups. Our finding supports the feasibility of qRT-PCR technology for standardized diagnostic testing in paediatric ALL and should, in conjunction with conventional cytogenetics lead to a more accurate classification of the disease. In addition, we have demonstrated that microarray findings from one study can be confirmed in an independent study, using an entirely independent patient cohort

  17. DNA-microarrays identification of Streptococcus mutans genes associated with biofilm thickness

    Directory of Open Access Journals (Sweden)

    Feldman Mark

    2008-12-01

    Full Text Available Abstract Background A biofilm is a complex community of microorganisms that develop on surfaces in diverse environments. The thickness of the biofilm plays a crucial role in the physiology of the immobilized bacteria. The most cariogenic bacteria, mutans streptococci, are common inhabitants of a dental biofilm community. In this study, DNA-microarray analysis was used to identify differentially expressed genes associated with the thickness of S. mutans biofilms. Results Comparative transcriptome analyses indicated that expression of 29 genes was differentially altered in 400- vs. 100-microns depth and 39 genes in 200- vs. 100-microns biofilms. Only 10 S. mutans genes showed differential expression in both 400- vs. 100-microns and 200- vs. 100-microns biofilms. All of these genes were upregulated. As sucrose is a predominant factor in oral biofilm development, its influence was evaluated on selected genes expression in the various depths of biofilms. The presence of sucrose did not noticeably change the regulation of these genes in 400- vs. 100-microns and/or 200- vs. 100-microns biofilms tested by real-time RT-PCR. Furthermore, we analyzed the expression profile of selected biofilm thickness associated genes in the luxS- mutant strain. The expression of those genes was not radically changed in the mutant strain compared to wild-type bacteria in planktonic condition. Only slight downregulation was recorded in SMU.2146c, SMU.574, SMU.609, and SMU.987 genes expression in luxS- bacteria in biofilm vs. planktonic environments. Conclusion These findings reveal genes associated with the thickness of biofilms of S. mutans. Expression of these genes is apparently not regulated directly by luxS and is not necessarily influenced by the presence of sucrose in the growth media.

  18. GeneAnalytics: An Integrative Gene Set Analysis Tool for Next Generation Sequencing, RNAseq and Microarray Data.

    Science.gov (United States)

    Ben-Ari Fuchs, Shani; Lieder, Iris; Stelzer, Gil; Mazor, Yaron; Buzhor, Ella; Kaplan, Sergey; Bogoch, Yoel; Plaschkes, Inbar; Shitrit, Alina; Rappaport, Noa; Kohn, Asher; Edgar, Ron; Shenhav, Liraz; Safran, Marilyn; Lancet, Doron; Guan-Golan, Yaron; Warshawsky, David; Shtrichman, Ronit

    2016-03-01

    Postgenomics data are produced in large volumes by life sciences and clinical applications of novel omics diagnostics and therapeutics for precision medicine. To move from "data-to-knowledge-to-innovation," a crucial missing step in the current era is, however, our limited understanding of biological and clinical contexts associated with data. Prominent among the emerging remedies to this challenge are the gene set enrichment tools. This study reports on GeneAnalytics™ ( geneanalytics.genecards.org ), a comprehensive and easy-to-apply gene set analysis tool for rapid contextualization of expression patterns and functional signatures embedded in the postgenomics Big Data domains, such as Next Generation Sequencing (NGS), RNAseq, and microarray experiments. GeneAnalytics' differentiating features include in-depth evidence-based scoring algorithms, an intuitive user interface and proprietary unified data. GeneAnalytics employs the LifeMap Science's GeneCards suite, including the GeneCards®--the human gene database; the MalaCards-the human diseases database; and the PathCards--the biological pathways database. Expression-based analysis in GeneAnalytics relies on the LifeMap Discovery®--the embryonic development and stem cells database, which includes manually curated expression data for normal and diseased tissues, enabling advanced matching algorithm for gene-tissue association. This assists in evaluating differentiation protocols and discovering biomarkers for tissues and cells. Results are directly linked to gene, disease, or cell "cards" in the GeneCards suite. Future developments aim to enhance the GeneAnalytics algorithm as well as visualizations, employing varied graphical display items. Such attributes make GeneAnalytics a broadly applicable postgenomics data analyses and interpretation tool for translation of data to knowledge-based innovation in various Big Data fields such as precision medicine, ecogenomics, nutrigenomics, pharmacogenomics, vaccinomics

  19. Comparison of RNA-seq and microarray-based models for clinical endpoint prediction.

    Science.gov (United States)

    Zhang, Wenqian; Yu, Ying; Hertwig, Falk; Thierry-Mieg, Jean; Zhang, Wenwei; Thierry-Mieg, Danielle; Wang, Jian; Furlanello, Cesare; Devanarayan, Viswanath; Cheng, Jie; Deng, Youping; Hero, Barbara; Hong, Huixiao; Jia, Meiwen; Li, Li; Lin, Simon M; Nikolsky, Yuri; Oberthuer, André; Qing, Tao; Su, Zhenqiang; Volland, Ruth; Wang, Charles; Wang, May D; Ai, Junmei; Albanese, Davide; Asgharzadeh, Shahab; Avigad, Smadar; Bao, Wenjun; Bessarabova, Marina; Brilliant, Murray H; Brors, Benedikt; Chierici, Marco; Chu, Tzu-Ming; Zhang, Jibin; Grundy, Richard G; He, Min Max; Hebbring, Scott; Kaufman, Howard L; Lababidi, Samir; Lancashire, Lee J; Li, Yan; Lu, Xin X; Luo, Heng; Ma, Xiwen; Ning, Baitang; Noguera, Rosa; Peifer, Martin; Phan, John H; Roels, Frederik; Rosswog, Carolina; Shao, Susan; Shen, Jie; Theissen, Jessica; Tonini, Gian Paolo; Vandesompele, Jo; Wu, Po-Yen; Xiao, Wenzhong; Xu, Joshua; Xu, Weihong; Xuan, Jiekun; Yang, Yong; Ye, Zhan; Dong, Zirui; Zhang, Ke K; Yin, Ye; Zhao, Chen; Zheng, Yuanting; Wolfinger, Russell D; Shi, Tieliu; Malkas, Linda H; Berthold, Frank; Wang, Jun; Tong, Weida; Shi, Leming; Peng, Zhiyu; Fischer, Matthias

    2015-06-25

    Gene expression profiling is being widely applied in cancer research to identify biomarkers for clinical endpoint prediction. Since RNA-seq provides a powerful tool for transcriptome-based applications beyond the limitations of microarrays, we sought to systematically evaluate the performance of RNA-seq-based and microarray-based classifiers in this MAQC-III/SEQC study for clinical endpoint prediction using neuroblastoma as a model. We generate gene expression profiles from 498 primary neuroblastomas using both RNA-seq and 44 k microarrays. Characterization of the neuroblastoma transcriptome by RNA-seq reveals that more than 48,000 genes and 200,000 transcripts are being expressed in this malignancy. We also find that RNA-seq provides much more detailed information on specific transcript expression patterns in clinico-genetic neuroblastoma subgroups than microarrays. To systematically compare the power of RNA-seq and microarray-based models in predicting clinical endpoints, we divide the cohort randomly into training and validation sets and develop 360 predictive models on six clinical endpoints of varying predictability. Evaluation of factors potentially affecting model performances reveals that prediction accuracies are most strongly influenced by the nature of the clinical endpoint, whereas technological platforms (RNA-seq vs. microarrays), RNA-seq data analysis pipelines, and feature levels (gene vs. transcript vs. exon-junction level) do not significantly affect performances of the models. We demonstrate that RNA-seq outperforms microarrays in determining the transcriptomic characteristics of cancer, while RNA-seq and microarray-based models perform similarly in clinical endpoint prediction. Our findings may be valuable to guide future studies on the development of gene expression-based predictive models and their implementation in clinical practice.

  20. Customized oligonucleotide microarray gene expression-based classification of neuroblastoma patients outperforms current clinical risk stratification.

    Science.gov (United States)

    Oberthuer, André; Berthold, Frank; Warnat, Patrick; Hero, Barbara; Kahlert, Yvonne; Spitz, Rüdiger; Ernestus, Karen; König, Rainer; Haas, Stefan; Eils, Roland; Schwab, Manfred; Brors, Benedikt; Westermann, Frank; Fischer, Matthias

    2006-11-01

    To develop a gene expression-based classifier for neuroblastoma patients that reliably predicts courses of the disease. Two hundred fifty-one neuroblastoma specimens were analyzed using a customized oligonucleotide microarray comprising 10,163 probes for transcripts with differential expression in clinical subgroups of the disease. Subsequently, the prediction analysis for microarrays (PAM) was applied to a first set of patients with maximally divergent clinical courses (n = 77). The classification accuracy was estimated by a complete 10-times-repeated 10-fold cross validation, and a 144-gene predictor was constructed from this set. This classifier's predictive power was evaluated in an independent second set (n = 174) by comparing results of the gene expression-based classification with those of risk stratification systems of current trials from Germany, Japan, and the United States. The first set of patients was accurately predicted by PAM (cross-validated accuracy, 99%). Within the second set, the PAM classifier significantly separated cohorts with distinct courses (3-year event-free survival [EFS] 0.86 +/- 0.03 [favorable; n = 115] v 0.52 +/- 0.07 [unfavorable; n = 59] and 3-year overall survival 0.99 +/- 0.01 v 0.84 +/- 0.05; both P model, the PAM predictor classified patients of the second set more accurately than risk stratification of current trials from Germany, Japan, and the United States (P < .001; hazard ratio, 4.756 [95% CI, 2.544 to 8.893]). Integration of gene expression-based class prediction of neuroblastoma patients may improve risk estimation of current neuroblastoma trials.

  1. Candidate genes for drought tolerance and improved productivity in ...

    Indian Academy of Sciences (India)

    Madhu

    Improving drought tolerance and productivity is one of the most difficult tasks for ... Keywords. Candidate gene; mapping population; polymerase chain reaction; single marker analysis. .... ple and the mean value computed. 2.4 Isolation of DNA.

  2. Identification of Differentially Expressed IGFBP5-Related Genes in Breast Cancer Tumor Tissues Using cDNA Microarray Experiments.

    Science.gov (United States)

    Akkiprik, Mustafa; Peker, İrem; Özmen, Tolga; Amuran, Gökçe Güllü; Güllüoğlu, Bahadır M; Kaya, Handan; Özer, Ayşe

    2015-11-10

    IGFBP5 is an important regulatory protein in breast cancer progression. We tried to identify differentially expressed genes (DEGs) between breast tumor tissues with IGFBP5 overexpression and their adjacent normal tissues. In this study, thirty-eight breast cancer and adjacent normal breast tissue samples were used to determine IGFBP5 expression by qPCR. cDNA microarrays were applied to the highest IGFBP5 overexpressed tumor samples compared to their adjacent normal breast tissue. Microarray analysis revealed that a total of 186 genes were differentially expressed in breast cancer compared with normal breast tissues. Of the 186 genes, 169 genes were downregulated and 17 genes were upregulated in the tumor samples. KEGG pathway analyses showed that protein digestion and absorption, focal adhesion, salivary secretion, drug metabolism-cytochrome P450, and phenylalanine metabolism pathways are involved. Among these DEGs, the prominent top two genes (MMP11 and COL1A1) which potentially correlated with IGFBP5 were selected for validation using real time RT-qPCR. Only COL1A1 expression showed a consistent upregulation with IGFBP5 expression and COL1A1 and MMP11 were significantly positively correlated. We concluded that the discovery of coordinately expressed genes related with IGFBP5 might contribute to understanding of the molecular mechanism of the function of IGFBP5 in breast cancer. Further functional studies on DEGs and association with IGFBP5 may identify novel biomarkers for clinical applications in breast cancer.

  3. Discovering time-lagged rules from microarray data using gene profile classifiers

    Directory of Open Access Journals (Sweden)

    Ponzoni Ignacio

    2011-04-01

    Full Text Available Abstract Background Gene regulatory networks have an essential role in every process of life. In this regard, the amount of genome-wide time series data is becoming increasingly available, providing the opportunity to discover the time-delayed gene regulatory networks that govern the majority of these molecular processes. Results This paper aims at reconstructing gene regulatory networks from multiple genome-wide microarray time series datasets. In this sense, a new model-free algorithm called GRNCOP2 (Gene Regulatory Network inference by Combinatorial OPtimization 2, which is a significant evolution of the GRNCOP algorithm, was developed using combinatorial optimization of gene profile classifiers. The method is capable of inferring potential time-delay relationships with any span of time between genes from various time series datasets given as input. The proposed algorithm was applied to time series data composed of twenty yeast genes that are highly relevant for the cell-cycle study, and the results were compared against several related approaches. The outcomes have shown that GRNCOP2 outperforms the contrasted methods in terms of the proposed metrics, and that the results are consistent with previous biological knowledge. Additionally, a genome-wide study on multiple publicly available time series data was performed. In this case, the experimentation has exhibited the soundness and scalability of the new method which inferred highly-related statistically-significant gene associations. Conclusions A novel method for inferring time-delayed gene regulatory networks from genome-wide time series datasets is proposed in this paper. The method was carefully validated with several publicly available data sets. The results have demonstrated that the algorithm constitutes a usable model-free approach capable of predicting meaningful relationships between genes, revealing the time-trends of gene regulation.

  4. The unique genomic properties of sex-biased genes: Insights from avian microarray data

    Directory of Open Access Journals (Sweden)

    Webster Matthew T

    2008-03-01

    Full Text Available Abstract Background In order to develop a framework for the analysis of sex-biased genes, we present a characterization of microarray data comparing male and female gene expression in 18 day chicken embryos for brain, gonad, and heart tissue. Results From the 15982 significantly expressed coding regions that have been assigned to either the autosomes or the Z chromosome (12979 in brain, 13301 in gonad, and 12372 in heart, roughly 18% were significantly sex-biased in any one tissue, though only 4 gene targets were biased in all tissues. The gonad was the most sex-biased tissue, followed by the brain. Sex-biased autosomal genes tended to be expressed at lower levels and in fewer tissues than unbiased gene targets, and autosomal somatic sex-biased genes had more expression noise than similar unbiased genes. Sex-biased genes linked to the Z-chromosome showed reduced expression in females, but not in males, when compared to unbiased Z-linked genes, and sex-biased Z-linked genes were also expressed in fewer tissues than unbiased Z coding regions. Third position GC content, and codon usage bias showed some sex-biased effects, primarily for autosomal genes expressed in the gonad. Finally, there were several over-represented Gene Ontology terms in the sex-biased gene sets. Conclusion On the whole, this analysis suggests that sex-biased genes have unique genomic and organismal properties that delineate them from genes that are expressed equally in males and females.

  5. Prioritization of candidate disease genes by topological similarity between disease and protein diffusion profiles.

    Science.gov (United States)

    Zhu, Jie; Qin, Yufang; Liu, Taigang; Wang, Jun; Zheng, Xiaoqi

    2013-01-01

    Identification of gene-phenotype relationships is a fundamental challenge in human health clinic. Based on the observation that genes causing the same or similar phenotypes tend to correlate with each other in the protein-protein interaction network, a lot of network-based approaches were proposed based on different underlying models. A recent comparative study showed that diffusion-based methods achieve the state-of-the-art predictive performance. In this paper, a new diffusion-based method was proposed to prioritize candidate disease genes. Diffusion profile of a disease was defined as the stationary distribution of candidate genes given a random walk with restart where similarities between phenotypes are incorporated. Then, candidate disease genes are prioritized by comparing their diffusion profiles with that of the disease. Finally, the effectiveness of our method was demonstrated through the leave-one-out cross-validation against control genes from artificial linkage intervals and randomly chosen genes. Comparative study showed that our method achieves improved performance compared to some classical diffusion-based methods. To further illustrate our method, we used our algorithm to predict new causing genes of 16 multifactorial diseases including Prostate cancer and Alzheimer's disease, and the top predictions were in good consistent with literature reports. Our study indicates that integration of multiple information sources, especially the phenotype similarity profile data, and introduction of global similarity measure between disease and gene diffusion profiles are helpful for prioritizing candidate disease genes. Programs and data are available upon request.

  6. Tumour auto-antibody screening: performance of protein microarrays using SEREX derived antigens

    International Nuclear Information System (INIS)

    Stempfer, René; Weinhäusel, Andreas; Syed, Parvez; Vierlinger, Klemens; Pichler, Rudolf; Meese, Eckart; Leidinger, Petra; Ludwig, Nicole; Kriegner, Albert; Nöhammer, Christa

    2010-01-01

    The simplicity and potential of minimal invasive testing using serum from patients make auto-antibody based biomarkers a very promising tool for use in diagnostics of cancer and auto-immune disease. Although several methods exist for elucidating candidate-protein markers, immobilizing these onto membranes and generating so called macroarrays is of limited use for marker validation. Especially when several hundred samples have to be analysed, microarrays could serve as a good alternative since processing macro membranes is cumbersome and reproducibility of results is moderate. Candidate markers identified by SEREX (serological identification of antigens by recombinant expression cloning) screenings of brain and lung tumour were used for macroarray and microarray production. For microarray production recombinant proteins were expressed in E. coli by autoinduction and purified His-tag (histidine-tagged) proteins were then used for the production of protein microarrays. Protein arrays were hybridized with the serum samples from brain and lung tumour patients. Methods for the generation of microarrays were successfully established when using antigens derived from membrane-based selection. Signal patterns obtained by microarrays analysis of brain and lung tumour patients' sera were highly reproducible (R = 0.92-0.96). This provides the technical foundation for diagnostic applications on the basis of auto-antibody patterns. In this limited test set, the assay provided high reproducibility and a broad dynamic range to classify all brain and lung samples correctly. Protein microarray is an efficient means for auto-antibody-based detection when using SEREX-derived clones expressing antigenic proteins. Protein microarrays are preferred to macroarrays due to the easier handling and the high reproducibility of auto-antibody testing. Especially when using only a few microliters of patient samples protein microarrays are ideally suited for validation of auto

  7. Microarray expression profiling of human dental pulp from single subject.

    Science.gov (United States)

    Tete, Stefano; Mastrangelo, Filiberto; Scioletti, Anna Paola; Tranasi, Michelangelo; Raicu, Florina; Paolantonio, Michele; Stuppia, Liborio; Vinci, Raffaele; Gherlone, Enrico; Ciampoli, Cristian; Sberna, Maria Teresa; Conti, Pio

    2008-01-01

    Microarray is a recently developed simultaneous analysis of expression patterns of thousand of genes. The aim of this research was to evaluate the expression profile of human healthy dental pulp in order to find the presence of genes activated and encoding for proteins involved in the physiological process of human dental pulp. We report data obtained by analyzing expression profiles of human tooth pulp from single subjects, using an approach based on the amplification of the total RNA. Experiments were performed on a high-density array able to analyse about 21,000 oligonucleotide sequences of about 70 bases in duplicate, using an approach based on the amplification of the total RNA from the pulp of a single tooth. Obtained data were analyzed using the S.A.M. system (Significance Analysis of Microarray) and genes were merged according to their molecular functions and biological process by the Onto-Express software. The microarray analysis revealed 362 genes with specific pulp expression. Genes showing significant high expression were classified in genes involved in tooth development, protoncogenes, genes of collagen, DNAse, Metallopeptidases and Growth factors. We report a microarray analysis, carried out by extraction of total RNA from specimens of healthy human dental pulp tissue. This approach represents a powerful tool in the study of human normal and pathological pulp, allowing minimization of the genetic variability due to the pooling of samples from different individuals.

  8. Virulence Characterization of Salmonella enterica by a New Microarray: Detection and Evaluation of the Cytolethal Distending Toxin Gene Activity in the Unusual Host S. Typhimurium.

    Directory of Open Access Journals (Sweden)

    Rui Figueiredo

    Full Text Available Salmonella enterica is a zoonotic foodborne pathogen that causes acute gastroenteritis in humans. We assessed the virulence potential of one-hundred and six Salmonella strains isolated from food animals and products. A high through-put virulence genes microarray demonstrated Salmonella Pathogenicity Islands (SPI and adherence genes were highly conserved, while prophages and virulence plasmid genes were variably present. Isolates were grouped by serotype, and virulence plasmids separated S. Typhimurium in two clusters. Atypical microarray results lead to whole genome sequencing (WGS of S. Infantis Sal147, which identified deletion of thirty-eight SPI-1 genes. Sal147 was unable to invade HeLa cells and showed reduced mortality in Galleria mellonella infection model, in comparison to a SPI-1 harbouring S. Infantis. Microarray and WGS of S. Typhimurium Sal199, established for the first time in S. Typhimurium presence of cdtB and other Typhi-related genes. Characterization of Sal199 showed cdtB genes were upstream of transposase IS911, and co-expressed with other Typhi-related genes. Cell cycle arrest, cytoplasmic distension, and nuclear enlargement were detected in HeLa cells infected by Sal199, but not with S. Typhimurium LT2. Increased mortality of Galleria was detected on infection with Sal199 compared to LT2. Thus, Salmonella isolates were rapidly characterized using a high through-put microarray; helping to identify unusual virulence features which were corroborated by further characterisation. This work demonstrates that the use of suitable screening methods for Salmonella virulence can help assess the potential risk associated with certain Salmonella to humans. Incorporation of such methodology into surveillance could help reduce the risk of emergence of epidemic Salmonella strains.

  9. Microarray analysis of genes affected by salt stress in tomato | Zhou ...

    African Journals Online (AJOL)

    This study has provided a set of candidate genes, especially those in the regulatory machinery that can be further investigated to define salt stress in tomato and other plant species. Keywords: Antioxidants, cellular metabolism, cell wall, chaperonine, ethylene, protein kinase, tomato, transcription regulator, translation ...

  10. A power law global error model for the identification of differentially expressed genes in microarray data

    Directory of Open Access Journals (Sweden)

    Granucci Francesca

    2004-12-01

    Full Text Available Abstract Background High-density oligonucleotide microarray technology enables the discovery of genes that are transcriptionally modulated in different biological samples due to physiology, disease or intervention. Methods for the identification of these so-called "differentially expressed genes" (DEG would largely benefit from a deeper knowledge of the intrinsic measurement variability. Though it is clear that variance of repeated measures is highly dependent on the average expression level of a given gene, there is still a lack of consensus on how signal reproducibility is linked to signal intensity. The aim of this study was to empirically model the variance versus mean dependence in microarray data to improve the performance of existing methods for identifying DEG. Results In the present work we used data generated by our lab as well as publicly available data sets to show that dispersion of repeated measures depends on location of the measures themselves following a power law. This enables us to construct a power law global error model (PLGEM that is applicable to various Affymetrix GeneChip data sets. A new DEG identification method is therefore proposed, consisting of a statistic designed to make explicit use of model-derived measurement spread estimates and a resampling-based hypothesis testing algorithm. Conclusions The new method provides a control of the false positive rate, a good sensitivity vs. specificity trade-off and consistent results with varying number of replicates and even using single samples.

  11. The Candidate Cancer Gene Database: a database of cancer driver genes from forward genetic screens in mice.

    Science.gov (United States)

    Abbott, Kenneth L; Nyre, Erik T; Abrahante, Juan; Ho, Yen-Yi; Isaksson Vogel, Rachel; Starr, Timothy K

    2015-01-01

    Identification of cancer driver gene mutations is crucial for advancing cancer therapeutics. Due to the overwhelming number of passenger mutations in the human tumor genome, it is difficult to pinpoint causative driver genes. Using transposon mutagenesis in mice many laboratories have conducted forward genetic screens and identified thousands of candidate driver genes that are highly relevant to human cancer. Unfortunately, this information is difficult to access and utilize because it is scattered across multiple publications using different mouse genome builds and strength metrics. To improve access to these findings and facilitate meta-analyses, we developed the Candidate Cancer Gene Database (CCGD, http://ccgd-starrlab.oit.umn.edu/). The CCGD is a manually curated database containing a unified description of all identified candidate driver genes and the genomic location of transposon common insertion sites (CISs) from all currently published transposon-based screens. To demonstrate relevance to human cancer, we performed a modified gene set enrichment analysis using KEGG pathways and show that human cancer pathways are highly enriched in the database. We also used hierarchical clustering to identify pathways enriched in blood cancers compared to solid cancers. The CCGD is a novel resource available to scientists interested in the identification of genetic drivers of cancer. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  12. Identification of Putative Ortholog Gene Blocks Involved in Gestant and Lactating Mammary Gland Development: A Rodent Cross-Species Microarray Transcriptomics Approach

    Science.gov (United States)

    Rodríguez-Cruz, Maricela; Coral-Vázquez, Ramón M.; Hernández-Stengele, Gabriel; Sánchez, Raúl; Salazar, Emmanuel; Sanchez-Muñoz, Fausto; Encarnación-Guevara, Sergio; Ramírez-Salcedo, Jorge

    2013-01-01

    The mammary gland (MG) undergoes functional and metabolic changes during the transition from pregnancy to lactation, possibly by regulation of conserved genes. The objective was to elucidate orthologous genes, chromosome clusters and putative conserved transcriptional modules during MG development. We analyzed expression of 22,000 transcripts using murine microarrays and RNA samples of MG from virgin, pregnant, and lactating rats by cross-species hybridization. We identified 521 transcripts differentially expressed; upregulated in early (78%) and midpregnancy (89%) and early lactation (64%), but downregulated in mid-lactation (61%). Putative orthologous genes were identified. We mapped the altered genes to orthologous chromosomal locations in human and mouse. Eighteen sets of conserved genes associated with key cellular functions were revealed and conserved transcription factor binding site search entailed possible coregulation among all eight block sets of genes. This study demonstrates that the use of heterologous array hybridization for screening of orthologous gene expression from rat revealed sets of conserved genes arranged in chromosomal order implicated in signaling pathways and functional ontology. Results demonstrate the utilization power of comparative genomics and prove the feasibility of using rodent microarrays to identification of putative coexpressed orthologous genes involved in the control of human mammary gland development. PMID:24288657

  13. Common Subcluster Mining in Microarray Data for Molecular Biomarker Discovery.

    Science.gov (United States)

    Sadhu, Arnab; Bhattacharyya, Balaram

    2017-10-11

    Molecular biomarkers can be potential facilitators for detection of cancer at early stage which is otherwise difficult through conventional biomarkers. Gene expression data from microarray experiments on both normal and diseased cell samples provide enormous scope to explore genetic relations of disease using computational techniques. Varied patterns of expressions of thousands of genes at different cell conditions along with inherent experimental error make the task of isolating disease related genes challenging. In this paper, we present a data mining method, common subcluster mining (CSM), to discover highly perturbed genes under diseased condition from differential expression patterns. The method builds heap through superposing near centroid clusters from gene expression data of normal samples and extracts its core part. It, thus, isolates genes exhibiting the most stable state across normal samples and constitute a reference set for each centroid. It performs the same operation on datasets from corresponding diseased samples and isolates the genes showing drastic changes in their expression patterns. The method thus finds the disease-sensitive genesets when applied to datasets of lung cancer, prostrate cancer, pancreatic cancer, breast cancer, leukemia and pulmonary arterial hypertension. In majority of the cases, few new genes are found over and above some previously reported ones. Genes with distinct deviations in diseased samples are prospective candidates for molecular biomarkers of the respective disease.

  14. Not proper ROC curves as new tool for the analysis of differentially expressed genes in microarray experiments

    Directory of Open Access Journals (Sweden)

    Pistoia Vito

    2008-10-01

    Full Text Available Abstract Background Most microarray experiments are carried out with the purpose of identifying genes whose expression varies in relation with specific conditions or in response to environmental stimuli. In such studies, genes showing similar mean expression values between two or more groups are considered as not differentially expressed, even if hidden subclasses with different expression values may exist. In this paper we propose a new method for identifying differentially expressed genes, based on the area between the ROC curve and the rising diagonal (ABCR. ABCR represents a more general approach than the standard area under the ROC curve (AUC, because it can identify both proper (i.e., concave and not proper ROC curves (NPRC. In particular, NPRC may correspond to those genes that tend to escape standard selection methods. Results We assessed the performance of our method using data from a publicly available database of 4026 genes, including 14 normal B cell samples (NBC and 20 heterogeneous lymphomas (namely: 9 follicular lymphomas and 11 chronic lymphocytic leukemias. Moreover, NBC also included two sub-classes, i.e., 6 heavily stimulated and 8 slightly or not stimulated samples. We identified 1607 differentially expressed genes with an estimated False Discovery Rate of 15%. Among them, 16 corresponded to NPRC and all escaped standard selection procedures based on AUC and t statistics. Moreover, a simple inspection to the shape of such plots allowed to identify the two subclasses in either one class in 13 cases (81%. Conclusion NPRC represent a new useful tool for the analysis of microarray data.

  15. Performance comparison of two microarray platforms to assess differential gene expression in human monocyte and macrophage cells

    Directory of Open Access Journals (Sweden)

    Montalescot Gilles

    2008-06-01

    Full Text Available Abstract Background In this study we assessed the respective ability of Affymetrix and Illumina microarray methodologies to answer a relevant biological question, namely the change in gene expression between resting monocytes and macrophages derived from these monocytes. Five RNA samples for each type of cell were hybridized to the two platforms in parallel. In addition, a reference list of differentially expressed genes (DEG was generated from a larger number of hybridizations (mRNA from 86 individuals using the RNG/MRC two-color platform. Results Our results show an important overlap of the Illumina and Affymetrix DEG lists. In addition, more than 70% of the genes in these lists were also present in the reference list. Overall the two platforms had very similar performance in terms of biological significance, evaluated by the presence in the DEG lists of an excess of genes belonging to Gene Ontology (GO categories relevant for the biology of monocytes and macrophages. Our results support the conclusion of the MicroArray Quality Control (MAQC project that the criteria used to constitute the DEG lists strongly influence the degree of concordance among platforms. However the importance of prioritizing genes by magnitude of effect (fold change rather than statistical significance (p-value to enhance cross-platform reproducibility recommended by the MAQC authors was not supported by our data. Conclusion Functional analysis based on GO enrichment demonstrates that the 2 compared technologies delivered very similar results and identified most of the relevant GO categories enriched in the reference list.

  16. Network Expansion and Pathway Enrichment Analysis towards Biologically Significant Findings from Microarrays

    Directory of Open Access Journals (Sweden)

    Wu Xiaogang

    2012-06-01

    Full Text Available In many cases, crucial genes show relatively slight changes between groups of samples (e.g. normal vs. disease, and many genes selected from microarray differential analysis by measuring the expression level statistically are also poorly annotated and lack of biological significance. In this paper, we present an innovative approach - network expansion and pathway enrichment analysis (NEPEA for integrative microarray analysis. We assume that organized knowledge will help microarray data analysis in significant ways, and the organized knowledge could be represented as molecular interaction networks or biological pathways. Based on this hypothesis, we develop the NEPEA framework based on network expansion from the human annotated and predicted protein interaction (HAPPI database, and pathway enrichment from the human pathway database (HPD. We use a recently-published microarray dataset (GSE24215 related to insulin resistance and type 2 diabetes (T2D as case study, since this study provided a thorough experimental validation for both genes and pathways identified computationally from classical microarray analysis and pathway analysis. We perform our NEPEA analysis for this dataset based on the results from the classical microarray analysis to identify biologically significant genes and pathways. Our findings are not only consistent with the original findings mostly, but also obtained more supports from other literatures.

  17. Next-generation sequencing to identify candidate genes and develop diagnostic markers for a novel Phytophthora resistance gene, RpsHC18, in soybean.

    Science.gov (United States)

    Zhong, Chao; Sun, Suli; Li, Yinping; Duan, Canxing; Zhu, Zhendong

    2018-03-01

    A novel Phytophthora sojae resistance gene RpsHC18 was identified and finely mapped on soybean chromosome 3. Two NBS-LRR candidate genes were identified and two diagnostic markers of RpsHC18 were developed. Phytophthora root rot caused by Phytophthora sojae is a destructive disease of soybean. The most effective disease-control strategy is to deploy resistant cultivars carrying Phytophthora-resistant Rps genes. The soybean cultivar Huachun 18 has a broad and distinct resistance spectrum to 12 P. sojae isolates. Quantitative trait loci sequencing (QTL-seq), based on the whole-genome resequencing (WGRS) of two extreme resistant and susceptible phenotype bulks from an F 2:3 population, was performed, and one 767-kb genomic region with ΔSNP-index ≥ 0.9 on chromosome 3 was identified as the RpsHC18 candidate region in Huachun 18. The candidate region was reduced to a 146-kb region by fine mapping. Nonsynonymous SNP and haplotype analyses were carried out in the 146-kb region among ten soybean genotypes using WGRS. Four specific nonsynonymous SNPs were identified in two nucleotide-binding sites-leucine-rich repeat (NBS-LRR) genes, RpsHC18-NBL1 and RpsHC18-NBL2, which were considered to be the candidate genes. Finally, one specific SNP marker in each candidate gene was successfully developed using a tetra-primer ARMS-PCR assay, and the two markers were verified to be specific for RpsHC18 and to effectively distinguish other known Rps genes. In this study, we applied an integrated genomic-based strategy combining WGRS with traditional genetic mapping to identify RpsHC18 candidate genes and develop diagnostic markers. These results suggest that next-generation sequencing is a precise, rapid and cost-effective way to identify candidate genes and develop diagnostic markers, and it can accelerate Rps gene cloning and marker-assisted selection for breeding of P. sojae-resistant soybean cultivars.

  18. The Local Maximum Clustering Method and Its Application in Microarray Gene Expression Data Analysis

    Directory of Open Access Journals (Sweden)

    Chen Yidong

    2004-01-01

    Full Text Available An unsupervised data clustering method, called the local maximum clustering (LMC method, is proposed for identifying clusters in experiment data sets based on research interest. A magnitude property is defined according to research purposes, and data sets are clustered around each local maximum of the magnitude property. By properly defining a magnitude property, this method can overcome many difficulties in microarray data clustering such as reduced projection in similarities, noises, and arbitrary gene distribution. To critically evaluate the performance of this clustering method in comparison with other methods, we designed three model data sets with known cluster distributions and applied the LMC method as well as the hierarchic clustering method, the -mean clustering method, and the self-organized map method to these model data sets. The results show that the LMC method produces the most accurate clustering results. As an example of application, we applied the method to cluster the leukemia samples reported in the microarray study of Golub et al. (1999.

  19. Analysis of PSPHL as a Candidate Gene Influencing the Racial Disparity in Endometrial Cancer

    Energy Technology Data Exchange (ETDEWEB)

    Allard, Jay E. [Walter Reed Army Medical Center, Washington, DC (United States); Chandramouli, Gadisetti V. R. [Department of Obstetrics, Gynecology and Reproductive Biology, Michigan State University College of Human Medicine, Grand Rapids, MI (United States); Stagliano, Katherine [Curtis and Elizabeth Anderson Cancer Institute at Memorial Health University Medical Center, Savannah, GA (United States); Hood, Brian L. [Women’s Health Integrated Research Center at Inova Health System, Annandale, VA (United States); Litzi, Tracy [Walter Reed Army Medical Center, Washington, DC (United States); Women’s Health Integrated Research Center at Inova Health System, Annandale, VA (United States); Shoji, Yutaka [Department of Obstetrics, Gynecology and Reproductive Biology, Michigan State University College of Human Medicine, Grand Rapids, MI (United States); Curtis and Elizabeth Anderson Cancer Institute at Memorial Health University Medical Center, Savannah, GA (United States); Boyd, Jeff [Curtis and Elizabeth Anderson Cancer Institute at Memorial Health University Medical Center, Savannah, GA (United States); Fox Chase Cancer Center, Philadelphia, PA (United States); Berchuck, Andrew [Division of Gynecologic Oncology, Duke University, Durham, NC (United States); Conrads, Thomas P. [Curtis and Elizabeth Anderson Cancer Institute at Memorial Health University Medical Center, Savannah, GA (United States); Maxwell, G. Larry [Walter Reed Army Medical Center, Washington, DC (United States); Women’s Health Integrated Research Center at Inova Health System, Annandale, VA (United States); Risinger, John I., E-mail: john.risinger@hc.msu.edu [Department of Obstetrics, Gynecology and Reproductive Biology, Michigan State University College of Human Medicine, Grand Rapids, MI (United States); Curtis and Elizabeth Anderson Cancer Institute at Memorial Health University Medical Center, Savannah, GA (United States)

    2012-07-04

    Endometrial cancer is the most commonly diagnosed gynecologic malignancy in the United States. A well recognized disparity by race in both incidence and survival outcome exists for this cancer. Specifically Caucasians are about two times more likely to develop endometrial cancer than are African-Americans. However, African-American women are more likely to die from this disease than are Caucasians. The basis for this disparity remains unknown. Previous studies have identified differences in the types and frequencies of gene mutations among endometrial cancers from Caucasians and African-Americans suggesting that the tumors from these two groups might have differing underlying genetic defects. We performed a gene expression microarray study in an effort to identify differentially expressed transcripts between African-American and Caucasian women’s endometrial cancers. Our gene expression screen identified a list of potential biomarkers that are differentially expressed between these two groups of cancers. Of these we identified a poorly characterized transcript with a region of homology to phospho serine phosphatase (PSPH) and designated phospho serine phosphatase like (PSPHL) as the most differentially over-expressed gene in cancers from African-Americans. We further clarified the nature of expressed transcripts. Northern blot analysis confirmed the message was limited to a transcript of under 1 kB. Sequence analysis of transcripts confirmed two alternate open reading frame (ORF) isoforms due to alternative splicing events. Splice specific primer sets confirmed both isoforms were differentially expressed in tissues from Caucasians and African-Americans. We further examined the expression in other tissues from women to include normal endometrium, normal and malignant ovary. In all cases PSPHL expression was more often present in tissues from African-Americans than Caucasians. Our data confirm the African-American based expression of the PSPHL transcript in

  20. Analysis of PSPHL as a Candidate Gene Influencing the Racial Disparity in Endometrial Cancer

    International Nuclear Information System (INIS)

    Allard, Jay E.; Chandramouli, Gadisetti V. R.; Stagliano, Katherine; Hood, Brian L.; Litzi, Tracy; Shoji, Yutaka; Boyd, Jeff; Berchuck, Andrew; Conrads, Thomas P.; Maxwell, G. Larry; Risinger, John I.

    2012-01-01

    Endometrial cancer is the most commonly diagnosed gynecologic malignancy in the United States. A well recognized disparity by race in both incidence and survival outcome exists for this cancer. Specifically Caucasians are about two times more likely to develop endometrial cancer than are African-Americans. However, African-American women are more likely to die from this disease than are Caucasians. The basis for this disparity remains unknown. Previous studies have identified differences in the types and frequencies of gene mutations among endometrial cancers from Caucasians and African-Americans suggesting that the tumors from these two groups might have differing underlying genetic defects. We performed a gene expression microarray study in an effort to identify differentially expressed transcripts between African-American and Caucasian women’s endometrial cancers. Our gene expression screen identified a list of potential biomarkers that are differentially expressed between these two groups of cancers. Of these we identified a poorly characterized transcript with a region of homology to phospho serine phosphatase (PSPH) and designated phospho serine phosphatase like (PSPHL) as the most differentially over-expressed gene in cancers from African-Americans. We further clarified the nature of expressed transcripts. Northern blot analysis confirmed the message was limited to a transcript of under 1 kB. Sequence analysis of transcripts confirmed two alternate open reading frame (ORF) isoforms due to alternative splicing events. Splice specific primer sets confirmed both isoforms were differentially expressed in tissues from Caucasians and African-Americans. We further examined the expression in other tissues from women to include normal endometrium, normal and malignant ovary. In all cases PSPHL expression was more often present in tissues from African-Americans than Caucasians. Our data confirm the African-American based expression of the PSPHL transcript in

  1. Unsupervised Bayesian linear unmixing of gene expression microarrays.

    Science.gov (United States)

    Bazot, Cécile; Dobigeon, Nicolas; Tourneret, Jean-Yves; Zaas, Aimee K; Ginsburg, Geoffrey S; Hero, Alfred O

    2013-03-19

    This paper introduces a new constrained model and the corresponding algorithm, called unsupervised Bayesian linear unmixing (uBLU), to identify biological signatures from high dimensional assays like gene expression microarrays. The basis for uBLU is a Bayesian model for the data samples which are represented as an additive mixture of random positive gene signatures, called factors, with random positive mixing coefficients, called factor scores, that specify the relative contribution of each signature to a specific sample. The particularity of the proposed method is that uBLU constrains the factor loadings to be non-negative and the factor scores to be probability distributions over the factors. Furthermore, it also provides estimates of the number of factors. A Gibbs sampling strategy is adopted here to generate random samples according to the posterior distribution of the factors, factor scores, and number of factors. These samples are then used to estimate all the unknown parameters. Firstly, the proposed uBLU method is applied to several simulated datasets with known ground truth and compared with previous factor decomposition methods, such as principal component analysis (PCA), non negative matrix factorization (NMF), Bayesian factor regression modeling (BFRM), and the gradient-based algorithm for general matrix factorization (GB-GMF). Secondly, we illustrate the application of uBLU on a real time-evolving gene expression dataset from a recent viral challenge study in which individuals have been inoculated with influenza A/H3N2/Wisconsin. We show that the uBLU method significantly outperforms the other methods on the simulated and real data sets considered here. The results obtained on synthetic and real data illustrate the accuracy of the proposed uBLU method when compared to other factor decomposition methods from the literature (PCA, NMF, BFRM, and GB-GMF). The uBLU method identifies an inflammatory component closely associated with clinical symptom scores

  2. Layered signaling regulatory networks analysis of gene expression involved in malignant tumorigenesis of non-resolving ulcerative colitis via integration of cross-study microarray profiles.

    Science.gov (United States)

    Fan, Shengjun; Pan, Zhenyu; Geng, Qiang; Li, Xin; Wang, Yefan; An, Yu; Xu, Yan; Tie, Lu; Pan, Yan; Li, Xuejun

    2013-01-01

    Ulcerative colitis (UC) was the most frequently diagnosed inflammatory bowel disease (IBD) and closely linked to colorectal carcinogenesis. By far, the underlying mechanisms associated with the disease are still unclear. With the increasing accumulation of microarray gene expression profiles, it is profitable to gain a systematic perspective based on gene regulatory networks to better elucidate the roles of genes associated with disorders. However, a major challenge for microarray data analysis is the integration of multiple-studies generated by different groups. In this study, firstly, we modeled a signaling regulatory network associated with colorectal cancer (CRC) initiation via integration of cross-study microarray expression data sets using Empirical Bayes (EB) algorithm. Secondly, a manually curated human cancer signaling map was established via comprehensive retrieval of the publicly available repositories. Finally, the co-differently-expressed genes were manually curated to portray the layered signaling regulatory networks. Overall, the remodeled signaling regulatory networks were separated into four major layers including extracellular, membrane, cytoplasm and nucleus, which led to the identification of five core biological processes and four signaling pathways associated with colorectal carcinogenesis. As a result, our biological interpretation highlighted the importance of EGF/EGFR signaling pathway, EPO signaling pathway, T cell signal transduction and members of the BCR signaling pathway, which were responsible for the malignant transition of CRC from the benign UC to the aggressive one. The present study illustrated a standardized normalization approach for cross-study microarray expression data sets. Our model for signaling networks construction was based on the experimentally-supported interaction and microarray co-expression modeling. Pathway-based signaling regulatory networks analysis sketched a directive insight into colorectal carcinogenesis

  3. Microarray analysis of expression of cell death-associated genes in rat spinal cord cells exposed to cyclic tensile stresses in vitro

    Directory of Open Access Journals (Sweden)

    Roberts Sally

    2010-07-01

    Full Text Available Abstract Background The application of mechanical insults to the spinal cord results in profound cellular and molecular changes, including the induction of neuronal cell death and altered gene expression profiles. Previous studies have described alterations in gene expression following spinal cord injury, but the specificity of this response to mechanical stimuli is difficult to investigate in vivo. Therefore, we have investigated the effect of cyclic tensile stresses on cultured spinal cord cells from E15 Sprague-Dawley rats, using the FX3000® Flexercell Strain Unit. We examined cell morphology and viability over a 72 hour time course. Microarray analysis of gene expression was performed using the Affymetrix GeneChip System®, where categorization of identified genes was performed using the Gene Ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG systems. Changes in expression of 12 genes were validated with quantitative real-time reverse transcription polymerase chain reaction (RT-PCR. Results The application of cyclic tensile stress reduced the viability of cultured spinal cord cells significantly in a dose- and time-dependent manner. Increasing either the strain or the strain rate independently was associated with significant decreases in spinal cord cell survival. There was no clear evidence of additive effects of strain level with strain rate. GO analysis identified 44 candidate genes which were significantly related to "apoptosis" and 17 genes related to "response to stimulus". KEGG analysis identified changes in the expression levels of 12 genes of the mitogen-activated protein kinase (MAPK signaling pathway, which were confirmed to be upregulated by RT-PCR analysis. Conclusions We have demonstrated that spinal cord cells undergo cell death in response to cyclic tensile stresses, which were dose- and time-dependent. In addition, we have identified the up regulation of various genes, in particular of the MAPK pathway, which

  4. Gene features selection for three-class disease classification via multiple orthogonal partial least square discriminant analysis and S-plot using microarray data.

    Science.gov (United States)

    Yang, Mingxing; Li, Xiumin; Li, Zhibin; Ou, Zhimin; Liu, Ming; Liu, Suhuan; Li, Xuejun; Yang, Shuyu

    2013-01-01

    DNA microarray analysis is characterized by obtaining a large number of gene variables from a small number of observations. Cluster analysis is widely used to analyze DNA microarray data to make classification and diagnosis of disease. Because there are so many irrelevant and insignificant genes in a dataset, a feature selection approach must be employed in data analysis. The performance of cluster analysis of this high-throughput data depends on whether the feature selection approach chooses the most relevant genes associated with disease classes. Here we proposed a new method using multiple Orthogonal Partial Least Squares-Discriminant Analysis (mOPLS-DA) models and S-plots to select the most relevant genes to conduct three-class disease classification and prediction. We tested our method using Golub's leukemia microarray data. For three classes with subtypes, we proposed hierarchical orthogonal partial least squares-discriminant analysis (OPLS-DA) models and S-plots to select features for two main classes and their subtypes. For three classes in parallel, we employed three OPLS-DA models and S-plots to choose marker genes for each class. The power of feature selection to classify and predict three-class disease was evaluated using cluster analysis. Further, the general performance of our method was tested using four public datasets and compared with those of four other feature selection methods. The results revealed that our method effectively selected the most relevant features for disease classification and prediction, and its performance was better than that of the other methods.

  5. Significance analysis of lexical bias in microarray data

    Directory of Open Access Journals (Sweden)

    Falkow Stanley

    2003-04-01

    Full Text Available Abstract Background Genes that are determined to be significantly differentially regulated in microarray analyses often appear to have functional commonalities, such as being components of the same biochemical pathway. This results in certain words being under- or overrepresented in the list of genes. Distinguishing between biologically meaningful trends and artifacts of annotation and analysis procedures is of the utmost importance, as only true biological trends are of interest for further experimentation. A number of sophisticated methods for identification of significant lexical trends are currently available, but these methods are generally too cumbersome for practical use by most microarray users. Results We have developed a tool, LACK, for calculating the statistical significance of apparent lexical bias in microarray datasets. The frequency of a user-specified list of search terms in a list of genes which are differentially regulated is assessed for statistical significance by comparison to randomly generated datasets. The simplicity of the input files and user interface targets the average microarray user who wishes to have a statistical measure of apparent lexical trends in analyzed datasets without the need for bioinformatics skills. The software is available as Perl source or a Windows executable. Conclusion We have used LACK in our laboratory to generate biological hypotheses based on our microarray data. We demonstrate the program's utility using an example in which we confirm significant upregulation of SPI-2 pathogenicity island of Salmonella enterica serovar Typhimurium by the cation chelator dipyridyl.

  6. Evaluation of HER2 Gene Amplification in Breast Cancer Using Nuclei Microarray in Situ Hybridization

    Directory of Open Access Journals (Sweden)

    Xuefeng Zhang

    2012-05-01

    Full Text Available Fluorescence in situ hybridization (FISH assay is considered the “gold standard” in evaluating HER2/neu (HER2 gene status. However, FISH detection is costly and time consuming. Thus, we established nuclei microarray with extracted intact nuclei from paraffin embedded breast cancer tissues for FISH detection. The nuclei microarray FISH (NMFISH technology serves as a useful platform for analyzing HER2 gene/chromosome 17 centromere ratio. We examined HER2 gene status in 152 cases of invasive ductal carcinomas of the breast that were resected surgically with FISH and NMFISH. HER2 gene amplification status was classified according to the guidelines of the American Society of Clinical Oncology and College of American Pathologists (ASCO/CAP. Comparison of the cut-off values for HER2/chromosome 17 centromere copy number ratio obtained by NMFISH and FISH showed that there was almost perfect agreement between the two methods (κ coefficient 0.920. The results of the two methods were almost consistent for the evaluation of HER2 gene counts. The present study proved that NMFISH is comparable with FISH for evaluating HER2 gene status. The use of nuclei microarray technology is highly efficient, time and reagent conserving and inexpensive.

  7. Identification of novel target genes involved in Indian Fanconi anemia patients using microarray.

    Science.gov (United States)

    Shyamsunder, Pavithra; Ganesh, Kripa S; Vidyasekar, Prasanna; Mohan, Sheila; Verma, Rama Shanker

    2013-12-01

    Fanconi anemia (FA) is a genetic disorder characterized by progressive bone marrow failure and a predisposition to cancers. Mutations have been documented in 15 FA genes that participate in the FA-BRCA DNA repair pathway, a fundamental pathway in the development of the disease and the presentation of its characteristic symptoms. Certain symptoms such as oxygen sensitivity, hematological abnormalities and impaired immunity suggest that FA proteins could participate in or independently control other pathways as well. In this study, we identified 9 DNA repair genes that were down regulated in a genome wide analysis of 6 Indian Fanconi anemia patients. Functional clustering of a total of 233 dysregulated genes identified key biological processes that included regulation of transcription, DNA repair, cell cycle and chromosomal organization. Microarray data revealed the down regulation of ATXN3, ARID4A and ETS-1, which were validated by RTPCR in a subsequent sample set of 9 Indian FA patients. Here we report for the first time a gene expression profile of Fanconi anemia patients from the Indian population and a pool of genes that might aid in the acquisition and progression of the FA phenotype. © 2013 Elsevier B.V. All rights reserved.

  8. Knowledge Discovery in Biological Databases for Revealing Candidate Genes Linked to Complex Phenotypes.

    Science.gov (United States)

    Hassani-Pak, Keywan; Rawlings, Christopher

    2017-06-13

    Genetics and "omics" studies designed to uncover genotype to phenotype relationships often identify large numbers of potential candidate genes, among which the causal genes are hidden. Scientists generally lack the time and technical expertise to review all relevant information available from the literature, from key model species and from a potentially wide range of related biological databases in a variety of data formats with variable quality and coverage. Computational tools are needed for the integration and evaluation of heterogeneous information in order to prioritise candidate genes and components of interaction networks that, if perturbed through potential interventions, have a positive impact on the biological outcome in the whole organism without producing negative side effects. Here we review several bioinformatics tools and databases that play an important role in biological knowledge discovery and candidate gene prioritization. We conclude with several key challenges that need to be addressed in order to facilitate biological knowledge discovery in the future.

  9. A quantitative comparison of cell-type-specific microarray gene expression profiling methods in the mouse brain.

    Directory of Open Access Journals (Sweden)

    Benjamin W Okaty

    Full Text Available Expression profiling of restricted neural populations using microarrays can facilitate neuronal classification and provide insight into the molecular bases of cellular phenotypes. Due to the formidable heterogeneity of intermixed cell types that make up the brain, isolating cell types prior to microarray processing poses steep technical challenges that have been met in various ways. These methodological differences have the potential to distort cell-type-specific gene expression profiles insofar as they may insufficiently filter out contaminating mRNAs or induce aberrant cellular responses not normally present in vivo. Thus we have compared the repeatability, susceptibility to contamination from off-target cell-types, and evidence for stress-responsive gene expression of five different purification methods--Laser Capture Microdissection (LCM, Translating Ribosome Affinity Purification (TRAP, Immunopanning (PAN, Fluorescence Activated Cell Sorting (FACS, and manual sorting of fluorescently labeled cells (Manual. We found that all methods obtained comparably high levels of repeatability, however, data from LCM and TRAP showed significantly higher levels of contamination than the other methods. While PAN samples showed higher activation of apoptosis-related, stress-related and immediate early genes, samples from FACS and Manual studies, which also require dissociated cells, did not. Given that TRAP targets actively translated mRNAs, whereas other methods target all transcribed mRNAs, observed differences may also reflect translational regulation.

  10. Dissection of the inflammatory bowel disease transcriptome using genome-wide cDNA microarrays.

    Directory of Open Access Journals (Sweden)

    Christine M Costello

    2005-08-01

    Full Text Available BACKGROUND: The differential pathophysiologic mechanisms that trigger and maintain the two forms of inflammatory bowel disease (IBD, Crohn disease (CD, and ulcerative colitis (UC are only partially understood. cDNA microarrays can be used to decipher gene regulation events at a genome-wide level and to identify novel unknown genes that might be involved in perpetuating inflammatory disease progression. METHODS AND FINDINGS: High-density cDNA microarrays representing 33,792 UniGene clusters were prepared. Biopsies were taken from the sigmoid colon of normal controls (n = 11, CD patients (n = 10 and UC patients (n = 10. 33P-radiolabeled cDNA from purified poly(A+ RNA extracted from biopsies (unpooled was hybridized to the arrays. We identified 500 and 272 transcripts differentially regulated in CD and UC, respectively. Interesting hits were independently verified by real-time PCR in a second sample of 100 individuals, and immunohistochemistry was used for exemplary localization. The main findings point to novel molecules important in abnormal immune regulation and the highly disturbed cell biology of colonic epithelial cells in IBD pathogenesis, e.g., CYLD (cylindromatosis, turban tumor syndrome and CDH11 (cadherin 11, type 2. By the nature of the array setup, many of the genes identified were to our knowledge previously uncharacterized, and prediction of the putative function of a subsection of these genes indicate that some could be involved in early events in disease pathophysiology. CONCLUSION: A comprehensive set of candidate genes not previously associated with IBD was revealed, which underlines the polygenic and complex nature of the disease. It points out substantial differences in pathophysiology between CD and UC. The multiple unknown genes identified may stimulate new research in the fields of barrier mechanisms and cell signalling in the context of IBD, and ultimately new therapeutic approaches.

  11. An Entropy-based gene selection method for cancer classification using microarray data

    Directory of Open Access Journals (Sweden)

    Krishnan Arun

    2005-03-01

    Full Text Available Abstract Background Accurate diagnosis of cancer subtypes remains a challenging problem. Building classifiers based on gene expression data is a promising approach; yet the selection of non-redundant but relevant genes is difficult. The selected gene set should be small enough to allow diagnosis even in regular clinical laboratories and ideally identify genes involved in cancer-specific regulatory pathways. Here an entropy-based method is proposed that selects genes related to the different cancer classes while at the same time reducing the redundancy among the genes. Results The present study identifies a subset of features by maximizing the relevance and minimizing the redundancy of the selected genes. A merit called normalized mutual information is employed to measure the relevance and the redundancy of the genes. In order to find a more representative subset of features, an iterative procedure is adopted that incorporates an initial clustering followed by data partitioning and the application of the algorithm to each of the partitions. A leave-one-out approach then selects the most commonly selected genes across all the different runs and the gene selection algorithm is applied again to pare down the list of selected genes until a minimal subset is obtained that gives a satisfactory accuracy of classification. The algorithm was applied to three different data sets and the results obtained were compared to work done by others using the same data sets Conclusion This study presents an entropy-based iterative algorithm for selecting genes from microarray data that are able to classify various cancer sub-types with high accuracy. In addition, the feature set obtained is very compact, that is, the redundancy between genes is reduced to a large extent. This implies that classifiers can be built with a smaller subset of genes.

  12. [Research on the relevance between the virulent genes differential expression and pathogenecity of Leptospira with microarray].

    Science.gov (United States)

    Yu, De-li; Bao, Lang

    2015-01-01

    To find the change of virulent gene expression and to analyze the relevance between the virulent change and the gene expression. Grouped guinea pigs were inoculated with 1 mL Leptospira cultured in vivo, Leptospira cultured in vitro and the Leptospira culture medium through abdominal subcutaneous respectively. The survival rate, body mass and temperature change of guinea pigs in different groups were measured within 15 d after the inoculation, then the survived guinea pigs were scarified, and the organ coefficient was also measured to know the virulence of Leptospira cultured in different environment. The amplified gene segments from Leptospira were used as probes and wrote the microarray. The total RNA was extracted from Leptospira standard strain cultured in culture medium and guinea pigs. After reverse transcription to cDNA, they were labeled with Cy3 and Cy5 respectively. Labeled cDNA was mixed and hybridized with the microarray. The hybridized mircroarray was scanned and analysed. The survival rate of inoculated guinea pig was different from group to group (in vivo group: 0%; in vitro group: 88.9%; culture medium group: 100%). The guinea pigs in vivo group had a higher temperature (PLeptospira: LA1027, LA1029, LA4004, LA3050, LA3540, LA0327, LA0378, LA1650, LA3937, LA2089, LA2144, LA3576, LA0011 and gene of Loa22 were up regulation after continuously cultured in guinea pigs. The pathogenic ability of Leptospira cultured in different environment is different and the gene expression of Leptospira is different between in vivo and in vitro as well. The understanding of the meaning of this change might help to know the pathogenecity of Leptospira.

  13. A probabilistic framework for microarray data analysis: fundamental probability models and statistical inference.

    Science.gov (United States)

    Ogunnaike, Babatunde A; Gelmi, Claudio A; Edwards, Jeremy S

    2010-05-21

    Gene expression studies generate large quantities of data with the defining characteristic that the number of genes (whose expression profiles are to be determined) exceed the number of available replicates by several orders of magnitude. Standard spot-by-spot analysis still seeks to extract useful information for each gene on the basis of the number of available replicates, and thus plays to the weakness of microarrays. On the other hand, because of the data volume, treating the entire data set as an ensemble, and developing theoretical distributions for these ensembles provides a framework that plays instead to the strength of microarrays. We present theoretical results that under reasonable assumptions, the distribution of microarray intensities follows the Gamma model, with the biological interpretations of the model parameters emerging naturally. We subsequently establish that for each microarray data set, the fractional intensities can be represented as a mixture of Beta densities, and develop a procedure for using these results to draw statistical inference regarding differential gene expression. We illustrate the results with experimental data from gene expression studies on Deinococcus radiodurans following DNA damage using cDNA microarrays. Copyright (c) 2010 Elsevier Ltd. All rights reserved.

  14. Microarray and cDNA sequence analysis of transcription during nerve-dependent limb regeneration

    Directory of Open Access Journals (Sweden)

    Bryant Susan V

    2009-01-01

    Full Text Available Abstract Background Microarray analysis and 454 cDNA sequencing were used to investigate a centuries-old problem in regenerative biology: the basis of nerve-dependent limb regeneration in salamanders. Innervated (NR and denervated (DL forelimbs of Mexican axolotls were amputated and transcripts were sampled after 0, 5, and 14 days of regeneration. Results Considerable similarity was observed between NR and DL transcriptional programs at 5 and 14 days post amputation (dpa. Genes with extracellular functions that are critical to wound healing were upregulated while muscle-specific genes were downregulated. Thus, many processes that are regulated during early limb regeneration do not depend upon nerve-derived factors. The majority of the transcriptional differences between NR and DL limbs were correlated with blastema formation; cell numbers increased in NR limbs after 5 dpa and this yielded distinct transcriptional signatures of cell proliferation in NR limbs at 14 dpa. These transcriptional signatures were not observed in DL limbs. Instead, gene expression changes within DL limbs suggest more diverse and protracted wound-healing responses. 454 cDNA sequencing complemented the microarray analysis by providing deeper sampling of transcriptional programs and associated biological processes. Assembly of new 454 cDNA sequences with existing expressed sequence tag (EST contigs from the Ambystoma EST database more than doubled (3935 to 9411 the number of non-redundant human-A. mexicanum orthologous sequences. Conclusion Many new candidate gene sequences were discovered for the first time and these will greatly enable future studies of wound healing, epigenetics, genome stability, and nerve-dependent blastema formation and outgrowth using the axolotl model.

  15. A random variance model for detection of differential gene expression in small microarray experiments.

    Science.gov (United States)

    Wright, George W; Simon, Richard M

    2003-12-12

    Microarray techniques provide a valuable way of characterizing the molecular nature of disease. Unfortunately expense and limited specimen availability often lead to studies with small sample sizes. This makes accurate estimation of variability difficult, since variance estimates made on a gene by gene basis will have few degrees of freedom, and the assumption that all genes share equal variance is unlikely to be true. We propose a model by which the within gene variances are drawn from an inverse gamma distribution, whose parameters are estimated across all genes. This results in a test statistic that is a minor variation of those used in standard linear models. We demonstrate that the model assumptions are valid on experimental data, and that the model has more power than standard tests to pick up large changes in expression, while not increasing the rate of false positives. This method is incorporated into BRB-ArrayTools version 3.0 (http://linus.nci.nih.gov/BRB-ArrayTools.html). ftp://linus.nci.nih.gov/pub/techreport/RVM_supplement.pdf

  16. Homogeneous versus heterogeneous probes for microbial ecological microarrays.

    Science.gov (United States)

    Bae, Jin-Woo; Park, Yong-Ha

    2006-07-01

    Microbial ecological microarrays have been developed for investigating the composition and functions of microorganism communities in environmental niches. These arrays include microbial identification microarrays, which use oligonucleotides, gene fragments or microbial genomes as probes. In this article, the advantages and disadvantages of each type of probe are reviewed. Oligonucleotide probes are currently useful for probing uncultivated bacteria that are not amenable to gene fragment probing, whereas the functional gene fragments amplified randomly from microbial genomes require phylogenetic and hierarchical categorization before use as microbial identification probes, despite their high resolution for both specificity and sensitivity. Until more bacteria are sequenced and gene fragment probes are thoroughly validated, heterogeneous bacterial genome probes will provide a simple, sensitive and quantitative tool for exploring the ecosystem structure.

  17. Grass cell wall feruloylation: distribution of bound ferulate and candidate gene expression in Brachypodium distachyon

    Directory of Open Access Journals (Sweden)

    Hugo Bruno Correa Molinari

    2013-03-01

    Full Text Available The cell walls of grasses such as wheat, maize, rice and sugar cane, contain large amounts of ferulate that is ester-linked to the cell wall polysaccharide glucuronoarabinoxylan (GAX. This ferulate is considered to limit the digestibility of polysaccharide in grass biomass as it forms covalent linkages between polysaccharide and lignin components. Candidate genes within a grass-specific clade of the BAHD acyl-coA transferase superfamily have been identified as being responsible for the ester linkage of ferulate to GAX. Manipulation of these BAHD genes may therefore be a biotechnological target for increasing efficiency of conversion of grass biomass into biofuel. Here, we describe the expression of these candidate genes and amounts of bound ferulate from various tissues and developmental stages of the model grass Brachypodium distachyon. BAHD candidate transcripts and significant amounts of bound ferulate were present in every tissue and developmental stage. We hypothesise that BAHD candidate genes similar to the recently described rice OsPMT gene (PMT sub-clade are principally responsible for the bound coumaric acid (pCA, and that other BAHD candidates (non-PMT sub-clade are responsible for bound ferulic acid (FA. There were some similarities with between the ratio of expression non-PMT / PMT genes and the ratio of bound FA / pCA between tissue types, compatible with this hypothesis. However, much further work to modify BAHD genes in grasses and to characterise the heterologously expressed proteins is required to demonstrate their function.

  18. Integrative missing value estimation for microarray data.

    Science.gov (United States)

    Hu, Jianjun; Li, Haifeng; Waterman, Michael S; Zhou, Xianghong Jasmine

    2006-10-12

    Missing value estimation is an important preprocessing step in microarray analysis. Although several methods have been developed to solve this problem, their performance is unsatisfactory for datasets with high rates of missing data, high measurement noise, or limited numbers of samples. In fact, more than 80% of the time-series datasets in Stanford Microarray Database contain less than eight samples. We present the integrative Missing Value Estimation method (iMISS) by incorporating information from multiple reference microarray datasets to improve missing value estimation. For each gene with missing data, we derive a consistent neighbor-gene list by taking reference data sets into consideration. To determine whether the given reference data sets are sufficiently informative for integration, we use a submatrix imputation approach. Our experiments showed that iMISS can significantly and consistently improve the accuracy of the state-of-the-art Local Least Square (LLS) imputation algorithm by up to 15% improvement in our benchmark tests. We demonstrated that the order-statistics-based integrative imputation algorithms can achieve significant improvements over the state-of-the-art missing value estimation approaches such as LLS and is especially good for imputing microarray datasets with a limited number of samples, high rates of missing data, or very noisy measurements. With the rapid accumulation of microarray datasets, the performance of our approach can be further improved by incorporating larger and more appropriate reference datasets.

  19. A microarray-based genotyping and genetic mapping approach for highly heterozygous outcrossing species enables localization of a large fraction of the unassembled Populus trichocarpa genome sequence.

    Science.gov (United States)

    Drost, Derek R; Novaes, Evandro; Boaventura-Novaes, Carolina; Benedict, Catherine I; Brown, Ryan S; Yin, Tongming; Tuskan, Gerald A; Kirst, Matias

    2009-06-01

    Microarrays have demonstrated significant power for genome-wide analyses of gene expression, and recently have also revolutionized the genetic analysis of segregating populations by genotyping thousands of loci in a single assay. Although microarray-based genotyping approaches have been successfully applied in yeast and several inbred plant species, their power has not been proven in an outcrossing species with extensive genetic diversity. Here we have developed methods for high-throughput microarray-based genotyping in such species using a pseudo-backcross progeny of 154 individuals of Populus trichocarpa and P. deltoides analyzed with long-oligonucleotide in situ-synthesized microarray probes. Our analysis resulted in high-confidence genotypes for 719 single-feature polymorphism (SFP) and 1014 gene expression marker (GEM) candidates. Using these genotypes and an established microsatellite (SSR) framework map, we produced a high-density genetic map comprising over 600 SFPs, GEMs and SSRs. The abundance of gene-based markers allowed us to localize over 35 million base pairs of previously unplaced whole-genome shotgun (WGS) scaffold sequence to putative locations in the genome of P. trichocarpa. A high proportion of sampled scaffolds could be verified for their placement with independently mapped SSRs, demonstrating the previously un-utilized power that high-density genotyping can provide in the context of map-based WGS sequence reassembly. Our results provide a substantial contribution to the continued improvement of the Populus genome assembly, while demonstrating the feasibility of microarray-based genotyping in a highly heterozygous population. The strategies presented are applicable to genetic mapping efforts in all plant species with similarly high levels of genetic diversity.

  20. Analysis of gene expression profile microarray data in complex regional pain syndrome.

    Science.gov (United States)

    Tan, Wulin; Song, Yiyan; Mo, Chengqiang; Jiang, Shuangjian; Wang, Zhongxing

    2017-09-01

    The aim of the present study was to predict key genes and proteins associated with complex regional pain syndrome (CRPS) using bioinformatics analysis. The gene expression profiling microarray data, GSE47603, which included peripheral blood samples from 4 patients with CRPS and 5 healthy controls, was obtained from the Gene Expression Omnibus (GEO) database. The differentially expressed genes (DEGs) in CRPS patients compared with healthy controls were identified using the GEO2R online tool. Functional enrichment analysis was then performed using The Database for Annotation Visualization and Integrated Discovery online tool. Protein‑protein interaction (PPI) network analysis was subsequently performed using Search Tool for the Retrieval of Interaction Genes database and analyzed with Cytoscape software. A total of 257 DEGs were identified, including 243 upregulated genes and 14 downregulated ones. Genes in the human leukocyte antigen (HLA) family were most significantly differentially expressed. Enrichment analysis demonstrated that signaling pathways, including immune response, cell motion, adhesion and angiogenesis were associated with CRPS. PPI network analysis revealed that key genes, including early region 1A binding protein p300 (EP300), CREB‑binding protein (CREBBP), signal transducer and activator of transcription (STAT)3, STAT5A and integrin α M were associated with CRPS. The results suggest that the immune response may therefore serve an important role in CRPS development. In addition, genes in the HLA family, such as HLA‑DQB1 and HLA‑DRB1, may present potential biomarkers for the diagnosis of CRPS. Furthermore, EP300, its paralog CREBBP, and the STAT family genes, STAT3 and STAT5 may be important in the development of CRPS.

  1. Replication of type 2 diabetes candidate genes variations in three geographically unrelated Indian population groups.

    Science.gov (United States)

    Ali, Shafat; Chopra, Rupali; Manvati, Siddharth; Singh, Yoginder Pal; Kaul, Nabodita; Behura, Anita; Mahajan, Ankit; Sehajpal, Prabodh; Gupta, Subash; Dhar, Manoj K; Chainy, Gagan B N; Bhanwer, Amarjit S; Sharma, Swarkar; Bamezai, Rameshwar N K

    2013-01-01

    Type 2 diabetes (T2D) is a syndrome of multiple metabolic disorders and is genetically heterogeneous. India comprises one of the largest global populations with highest number of reported type 2 diabetes cases. However, limited information about T2D associated loci is available for Indian populations. It is, therefore, pertinent to evaluate the previously associated candidates as well as identify novel genetic variations in Indian populations to understand the extent of genetic heterogeneity. We chose to do a cost effective high-throughput mass-array genotyping and studied the candidate gene variations associated with T2D in literature. In this case-control candidate genes association study, 91 SNPs from 55 candidate genes have been analyzed in three geographically independent population groups from India. We report the genetic variants in five candidate genes: TCF7L2, HHEX, ENPP1, IDE and FTO, are significantly associated (after Bonferroni correction, ppopulation. Interestingly, SNP rs7903146 of the TCF7L2 gene passed the genome wide significance threshold (combined P value = 2.05E-08) in the studied populations. We also observed the association of rs7903146 with blood glucose (fasting and postprandial) levels, supporting the role of TCF7L2 gene in blood glucose homeostasis. Further, we noted that the moderate risk provided by the independently associated loci in combined population with Odds Ratio (OR)<1.38 increased to OR = 2.44, (95%CI = 1.67-3.59) when the risk providing genotypes of TCF7L2, HHEX, ENPP1 and FTO genes were combined, suggesting the importance of gene-gene interactions evaluation in complex disorders like T2D.

  2. Microarray Data Processing Techniques for Genome-Scale Network Inference from Large Public Repositories.

    Science.gov (United States)

    Chockalingam, Sriram; Aluru, Maneesha; Aluru, Srinivas

    2016-09-19

    Pre-processing of microarray data is a well-studied problem. Furthermore, all popular platforms come with their own recommended best practices for differential analysis of genes. However, for genome-scale network inference using microarray data collected from large public repositories, these methods filter out a considerable number of genes. This is primarily due to the effects of aggregating a diverse array of experiments with different technical and biological scenarios. Here we introduce a pre-processing pipeline suitable for inferring genome-scale gene networks from large microarray datasets. We show that partitioning of the available microarray datasets according to biological relevance into tissue- and process-specific categories significantly extends the limits of downstream network construction. We demonstrate the effectiveness of our pre-processing pipeline by inferring genome-scale networks for the model plant Arabidopsis thaliana using two different construction methods and a collection of 11,760 Affymetrix ATH1 microarray chips. Our pre-processing pipeline and the datasets used in this paper are made available at http://alurulab.cc.gatech.edu/microarray-pp.

  3. SSHscreen and SSHdb, generic software for microarray based gene discovery: application to the stress response in cowpea

    Directory of Open Access Journals (Sweden)

    Oelofse Dean

    2010-04-01

    Full Text Available Abstract Background Suppression subtractive hybridization is a popular technique for gene discovery from non-model organisms without an annotated genome sequence, such as cowpea (Vigna unguiculata (L. Walp. We aimed to use this method to enrich for genes expressed during drought stress in a drought tolerant cowpea line. However, current methods were inefficient in screening libraries and management of the sequence data, and thus there was a need to develop software tools to facilitate the process. Results Forward and reverse cDNA libraries enriched for cowpea drought response genes were screened on microarrays, and the R software package SSHscreen 2.0.1 was developed (i to normalize the data effectively using spike-in control spot normalization, and (ii to select clones for sequencing based on the calculation of enrichment ratios with associated statistics. Enrichment ratio 3 values for each clone showed that 62% of the forward library and 34% of the reverse library clones were significantly differentially expressed by drought stress (adjusted p value 88% of the clones in both libraries were derived from rare transcripts in the original tester samples, thus supporting the notion that suppression subtractive hybridization enriches for rare transcripts. A set of 118 clones were chosen for sequencing, and drought-induced cowpea genes were identified, the most interesting encoding a late embryogenesis abundant Lea5 protein, a glutathione S-transferase, a thaumatin, a universal stress protein, and a wound induced protein. A lipid transfer protein and several components of photosynthesis were down-regulated by the drought stress. Reverse transcriptase quantitative PCR confirmed the enrichment ratio values for the selected cowpea genes. SSHdb, a web-accessible database, was developed to manage the clone sequences and combine the SSHscreen data with sequence annotations derived from BLAST and Blast2GO. The self-BLAST function within SSHdb grouped

  4. Looking into flowering time in almond (Prunus dulcis (Mill) D. A. Webb): the candidate gene approach.

    Science.gov (United States)

    Silva, C; Garcia-Mas, J; Sánchez, A M; Arús, P; Oliveira, M M

    2005-03-01

    Blooming time is one of the most important agronomic traits in almond. Biochemical and molecular events underlying flowering regulation must be understood before methods to stimulate late flowering can be developed. Attempts to elucidate the genetic control of this process have led to the identification of a major gene (Lb) and quantitative trait loci (QTLs) linked to observed phenotypic differences, but although this gene and these QTLs have been placed on the Prunus reference genetic map, their sequences and specific functions remain unknown. The aim of our investigation was to associate these loci with known genes using a candidate gene approach. Two almond cDNAs and eight Prunus expressed sequence tags were selected as candidate genes (CGs) since their sequences were highly identical to those of flowering regulatory genes characterized in other species. The CGs were amplified from both parental lines of the mapping population using specific primers. Sequence comparison revealed DNA polymorphisms between the parental lines, mainly of the single nucleotide type. Polymorphisms were used to develop co-dominant cleaved amplified polymorphic sequence markers or length polymorphisms based on insertion/deletion events for mapping the candidate genes on the Prunus reference map. Ten candidate genes were assigned to six linkage groups in the Prunus genome. The positions of two of these were compatible with the regions where two QTLs for blooming time were detected. One additional candidate was localized close to the position of the Evergrowing gene, which determines a non-deciduous behaviour in peach.

  5. A cDNA microarray, UniShrimpChip, for identification of genes relevant to testicular development in the black tiger shrimp (Penaeus monodon

    Directory of Open Access Journals (Sweden)

    Klinbunga Sirawut

    2011-04-01

    Full Text Available Abstract Background Poor reproductive maturation in captive male broodstock of the black tiger shrimp (Penaeus monodon is one of the serious problems to the farming industries. Without genome sequence, EST libraries of P. monodon were previously constructed to identify transcripts with important biological functions. In this study, a new version of cDNA microarray, UniShrimpChip, was constructed from the Peneaus monodon EST libraries of 12 tissues, containing 5,568 non-redundant cDNA clones from 10,536 unique cDNA in the P. monodon EST database. UniShrimpChip was used to study testicular development by comparing gene expression levels of wild brooders from the West and East coasts of Thailand and domesticated brooders with different ages (10-, 14-, 18-month-old. Results The overall gene expression patterns from the microarray experiments revealed distinct transcriptomic patterns between the wild and domesticated groups. Moreover, differentially expressed genes from the microarray comparisons were identified, and the expression patterns of eight selected transcripts were subsequently confirmed by reverse-transcriptase quantitative PCR (RT-qPCR. Among these, expression levels of six subunits (CSN2, 4, 5, 6, 7a, and 8 of the COP9 signalosome (CSN gene family in wild and different ages of domesticated brooders were examined by RT-qPCR. Among the six subunits, CSN5 and CSN6 were most highly expressed in wild brooders and least expressed in the 18-month-old domesticated group; therefore, their full-length cDNA sequences were characterized. Conclusions This study is the first report to employ cDNA microarray to study testicular development in the black tiger shrimp. We show that there are obvious differences between the wild and domesticated shrimp at the transcriptomic level. Furthermore, our study is the first to investigate the feasibility that the CSN gene family might have involved in reproduction and development of this economically important

  6. The application of nonsense-mediated mRNA decay inhibition to the identification of breast cancer susceptibility genes

    International Nuclear Information System (INIS)

    Johnson, Julie K; Waddell, Nic; Chenevix-Trench, Georgia

    2012-01-01

    Identification of novel, highly penetrant, breast cancer susceptibility genes will require the application of additional strategies beyond that of traditional linkage and candidate gene approaches. Approximately one-third of inherited genetic diseases, including breast cancer susceptibility, are caused by frameshift or nonsense mutations that truncate the protein product [1]. Transcripts harbouring premature termination codons are selectively and rapidly degraded by the nonsense-mediated mRNA decay (NMD) pathway. Blocking the NMD pathway in any given cell will stabilise these mutant transcripts, which can then be detected using gene expression microarrays. This technique, known as gene identification by nonsense-mediated mRNA decay inhibition (GINI), has proved successful in identifying sporadic nonsense mutations involved in many different cancer types. However, the approach has not yet been applied to identify germline mutations involved in breast cancer. We therefore attempted to use GINI on lymphoblastoid cell lines (LCLs) from multiple-case, non- BRCA1/2 breast cancer families in order to identify additional high-risk breast cancer susceptibility genes. We applied GINI to a total of 24 LCLs, established from breast-cancer affected and unaffected women from three multiple-case non-BRCA1/2 breast cancer families. We then used Illumina gene expression microarrays to identify transcripts stabilised by the NMD inhibition. The expression profiling identified a total of eight candidate genes from these three families. One gene, PPARGC1A, was a candidate in two separate families. We performed semi-quantitative real-time reverse transcriptase PCR of all candidate genes but only PPARGC1A showed successful validation by being stabilised in individuals with breast cancer but not in many unaffected members of the same family. Sanger sequencing of all coding and splice site regions of PPARGC1A did not reveal any protein truncating mutations. Haplotype analysis using short

  7. Extended analysis of benchmark datasets for Agilent two-color microarrays

    Directory of Open Access Journals (Sweden)

    Kerr Kathleen F

    2007-10-01

    Full Text Available Abstract Background As part of its broad and ambitious mission, the MicroArray Quality Control (MAQC project reported the results of experiments using External RNA Controls (ERCs on five microarray platforms. For most platforms, several different methods of data processing were considered. However, there was no similar consideration of different methods for processing the data from the Agilent two-color platform. While this omission is understandable given the scale of the project, it can create the false impression that there is consensus about the best way to process Agilent two-color data. It is also important to consider whether ERCs are representative of all the probes on a microarray. Results A comparison of different methods of processing Agilent two-color data shows substantial differences among methods for low-intensity genes. The sensitivity and specificity for detecting differentially expressed genes varies substantially for different methods. Analysis also reveals that the ERCs in the MAQC data only span the upper half of the intensity range, and therefore cannot be representative of all genes on the microarray. Conclusion Although ERCs demonstrate good agreement between observed and expected log-ratios on the Agilent two-color platform, such an analysis is incomplete. Simple loess normalization outperformed data processing with Agilent's Feature Extraction software for accurate identification of differentially expressed genes. Results from studies using ERCs should not be over-generalized when ERCs are not representative of all probes on a microarray.

  8. Improved microarray-based decision support with graph encoded interactome data.

    Directory of Open Access Journals (Sweden)

    Anneleen Daemen

    Full Text Available In the past, microarray studies have been criticized due to noise and the limited overlap between gene signatures. Prior biological knowledge should therefore be incorporated as side information in models based on gene expression data to improve the accuracy of diagnosis and prognosis in cancer. As prior knowledge, we investigated interaction and pathway information from the human interactome on different aspects of biological systems. By exploiting the properties of kernel methods, relations between genes with similar functions but active in alternative pathways could be incorporated in a support vector machine classifier based on spectral graph theory. Using 10 microarray data sets, we first reduced the number of data sources relevant for multiple cancer types and outcomes. Three sources on metabolic pathway information (KEGG, protein-protein interactions (OPHID and miRNA-gene targeting (microRNA.org outperformed the other sources with regard to the considered class of models. Both fixed and adaptive approaches were subsequently considered to combine the three corresponding classifiers. Averaging the predictions of these classifiers performed best and was significantly better than the model based on microarray data only. These results were confirmed on 6 validation microarray sets, with a significantly improved performance in 4 of them. Integrating interactome data thus improves classification of cancer outcome for the investigated microarray technologies and cancer types. Moreover, this strategy can be incorporated in any kernel method or non-linear version of a non-kernel method.

  9. Defining the Sequence Elements and Candidate Genes for the Coloboma Mutation.

    Directory of Open Access Journals (Sweden)

    Elizabeth A. Robb

    Full Text Available The chicken coloboma mutation exhibits features similar to human congenital developmental malformations such as ocular coloboma, cleft-palate, dwarfism, and polydactyly. The coloboma-associated region and encoded genes were investigated using advanced genomic, genetic, and gene expression technologies. Initially, the mutation was linked to a 990 kb region encoding 11 genes; the application of the genetic and genomic tools led to a reduction of the linked region to 176 kb and the elimination of 7 genes. Furthermore, bioinformatics analyses of capture array-next generation sequence data identified genetic elements including SNPs, insertions, deletions, gaps, chromosomal rearrangements, and miRNA binding sites within the introgressed causative region relative to the reference genome sequence. Coloboma-specific variants within exons, UTRs, and splice sites were studied for their contribution to the mutant phenotype. Our compiled results suggest three genes for future studies. The three candidate genes, SLC30A5 (a zinc transporter, CENPH (a centromere protein, and CDK7 (a cyclin-dependent kinase, are differentially expressed (compared to normal embryos at stages and in tissues affected by the coloboma mutation. Of these genes, two (SLC30A5 and CENPH are considered high-priority candidate based upon studies in other vertebrate model systems.

  10. Gene Expression Analysis Using Agilent DNA Microarrays

    DEFF Research Database (Denmark)

    Stangegaard, Michael

    2009-01-01

    Hybridization of labeled cDNA to microarrays is an intuitively simple and a vastly underestimated process. If it is not performed, optimized, and standardized with the same attention to detail as e.g., RNA amplification, information may be overlooked or even lost. Careful balancing of the amount ...

  11. DNA Microarray Technologies: A Novel Approach to Geonomic Research

    Energy Technology Data Exchange (ETDEWEB)

    Hinman, R.; Thrall, B.; Wong, K,

    2002-01-01

    A cDNA microarray allows biologists to examine the expression of thousands of genes simultaneously. Researchers may analyze the complete transcriptional program of an organism in response to specific physiological or developmental conditions. By design, a cDNA microarray is an experiment with many variables and few controls. One question that inevitably arises when working with a cDNA microarray is data reproducibility. How easy is it to confirm mRNA expression patterns? In this paper, a case study involving the treatment of a murine macrophage RAW 264.7 cell line with tumor necrosis factor alpha (TNF) was used to obtain a rough estimate of data reproducibility. Two trials were examined and a list of genes displaying either a > 2-fold or > 4-fold increase in gene expression was compiled. Variations in signal mean ratios between the two slides were observed. We can assume that erring in reproducibility may be compensated by greater inductive levels of similar genes. Steps taken to obtain results included serum starvation of cells before treatment, tests of mRNA for quality/consistency, and data normalization.

  12. Polymorphisms of candidate genes associated with meat quality and ...

    African Journals Online (AJOL)

    Hung Nguyen

    Abstract. The objectives of this study were to analyse genotype distribution and sequence variations of candidate genes putatively associated with meat quality and disease resistance in exotic and indigenous. Vietnamese pig breeds. For this purpose, 340 pigs from four indigenous and two exotic breeds were included.

  13. Integrating Biological Perspectives:. a Quantum Leap for Microarray Expression Analysis

    Science.gov (United States)

    Wanke, Dierk; Kilian, Joachim; Bloss, Ulrich; Mangelsen, Elke; Supper, Jochen; Harter, Klaus; Berendzen, Kenneth W.

    2009-02-01

    Biologists and bioinformatic scientists cope with the analysis of transcript abundance and the extraction of meaningful information from microarray expression data. By exploiting biological information accessible in public databases, we try to extend our current knowledge over the plant model organism Arabidopsis thaliana. Here, we give two examples of increasing the quality of information gained from large scale expression experiments by the integration of microarray-unrelated biological information: First, we utilize Arabidopsis microarray data to demonstrate that expression profiles are usually conserved between orthologous genes of different organisms. In an initial step of the analysis, orthology has to be inferred unambiguously, which then allows comparison of expression profiles between orthologs. We make use of the publicly available microarray expression data of Arabidopsis and barley, Hordeum vulgare. We found a generally positive correlation in expression trajectories between true orthologs although both organisms are only distantly related in evolutionary time scale. Second, extracting clusters of co-regulated genes implies similarities in transcriptional regulation via similar cis-regulatory elements (CREs). Vice versa approaches, where co-regulated gene clusters are found by investigating on CREs were not successful in general. Nonetheless, in some cases the presence of CREs in a defined position, orientation or CRE-combinations is positively correlated with co-regulated gene clusters. Here, we make use of genes involved in the phenylpropanoid biosynthetic pathway, to give one positive example for this approach.

  14. Replication of type 2 diabetes candidate genes variations in three geographically unrelated Indian population groups.

    Directory of Open Access Journals (Sweden)

    Shafat Ali

    Full Text Available Type 2 diabetes (T2D is a syndrome of multiple metabolic disorders and is genetically heterogeneous. India comprises one of the largest global populations with highest number of reported type 2 diabetes cases. However, limited information about T2D associated loci is available for Indian populations. It is, therefore, pertinent to evaluate the previously associated candidates as well as identify novel genetic variations in Indian populations to understand the extent of genetic heterogeneity. We chose to do a cost effective high-throughput mass-array genotyping and studied the candidate gene variations associated with T2D in literature. In this case-control candidate genes association study, 91 SNPs from 55 candidate genes have been analyzed in three geographically independent population groups from India. We report the genetic variants in five candidate genes: TCF7L2, HHEX, ENPP1, IDE and FTO, are significantly associated (after Bonferroni correction, p<5.5E-04 with T2D susceptibility in combined population. Interestingly, SNP rs7903146 of the TCF7L2 gene passed the genome wide significance threshold (combined P value = 2.05E-08 in the studied populations. We also observed the association of rs7903146 with blood glucose (fasting and postprandial levels, supporting the role of TCF7L2 gene in blood glucose homeostasis. Further, we noted that the moderate risk provided by the independently associated loci in combined population with Odds Ratio (OR<1.38 increased to OR = 2.44, (95%CI = 1.67-3.59 when the risk providing genotypes of TCF7L2, HHEX, ENPP1 and FTO genes were combined, suggesting the importance of gene-gene interactions evaluation in complex disorders like T2D.

  15. Positional RNA-Seq identifies candidate genes for phenotypic engineering of sexual traits

    NARCIS (Netherlands)

    Arbore, Roberto; Sekii, Kiyono; Beisel, Christian; Ladurner, Peter; Berezikov, Eugene; Schaerer, Lukas

    2015-01-01

    Introduction: RNA interference (RNAi) of trait-specific genes permits the manipulation of specific phenotypic traits ("phenotypic engineering") and thus represents a powerful tool to test trait function in evolutionary studies. The identification of suitable candidate genes, however, often relies on

  16. Candidate genes for drought tolerance and improved productivity in ...

    Indian Academy of Sciences (India)

    Madhu

    tropics. Improving drought tolerance and productivity is one of the most difficult tasks for cereal breeders. The diffi- culty arises from the diverse strategies adopted by plants themselves to combat drought stress depending on the timing,. Candidate genes for drought tolerance and improved productivity in rice (Oryza sativa L.).

  17. Polymorphisms of candidate genes associated with meat quality and ...

    African Journals Online (AJOL)

    The objectives of this study were to analyse genotype distribution and sequence variations of candidate genes putatively associated with meat quality and disease resistance in exotic and indigenous Vietnamese pig breeds. For this purpose, 340 pigs from four indigenous and two exotic breeds were included in the analysis ...

  18. Supervised group Lasso with applications to microarray data analysis

    Directory of Open Access Journals (Sweden)

    Huang Jian

    2007-02-01

    Full Text Available Abstract Background A tremendous amount of efforts have been devoted to identifying genes for diagnosis and prognosis of diseases using microarray gene expression data. It has been demonstrated that gene expression data have cluster structure, where the clusters consist of co-regulated genes which tend to have coordinated functions. However, most available statistical methods for gene selection do not take into consideration the cluster structure. Results We propose a supervised group Lasso approach that takes into account the cluster structure in gene expression data for gene selection and predictive model building. For gene expression data without biological cluster information, we first divide genes into clusters using the K-means approach and determine the optimal number of clusters using the Gap method. The supervised group Lasso consists of two steps. In the first step, we identify important genes within each cluster using the Lasso method. In the second step, we select important clusters using the group Lasso. Tuning parameters are determined using V-fold cross validation at both steps to allow for further flexibility. Prediction performance is evaluated using leave-one-out cross validation. We apply the proposed method to disease classification and survival analysis with microarray data. Conclusion We analyze four microarray data sets using the proposed approach: two cancer data sets with binary cancer occurrence as outcomes and two lymphoma data sets with survival outcomes. The results show that the proposed approach is capable of identifying a small number of influential gene clusters and important genes within those clusters, and has better prediction performance than existing methods.

  19. Bayesian meta-analysis models for microarray data: a comparative study

    Directory of Open Access Journals (Sweden)

    Song Joon J

    2007-03-01

    Full Text Available Abstract Background With the growing abundance of microarray data, statistical methods are increasingly needed to integrate results across studies. Two common approaches for meta-analysis of microarrays include either combining gene expression measures across studies or combining summaries such as p-values, probabilities or ranks. Here, we compare two Bayesian meta-analysis models that are analogous to these methods. Results Two Bayesian meta-analysis models for microarray data have recently been introduced. The first model combines standardized gene expression measures across studies into an overall mean, accounting for inter-study variability, while the second combines probabilities of differential expression without combining expression values. Both models produce the gene-specific posterior probability of differential expression, which is the basis for inference. Since the standardized expression integration model includes inter-study variability, it may improve accuracy of results versus the probability integration model. However, due to the small number of studies typical in microarray meta-analyses, the variability between studies is challenging to estimate. The probability integration model eliminates the need to model variability between studies, and thus its implementation is more straightforward. We found in simulations of two and five studies that combining probabilities outperformed combining standardized gene expression measures for three comparison values: the percent of true discovered genes in meta-analysis versus individual studies; the percent of true genes omitted in meta-analysis versus separate studies, and the number of true discovered genes for fixed levels of Bayesian false discovery. We identified similar results when pooling two independent studies of Bacillus subtilis. We assumed that each study was produced from the same microarray platform with only two conditions: a treatment and control, and that the data sets

  20. Gametogenesis in the Pacific oyster Crassostrea gigas: a microarrays-based analysis identifies sex and stage specific genes.

    Directory of Open Access Journals (Sweden)

    Nolwenn M Dheilly

    Full Text Available BACKGROUND: The Pacific oyster Crassostrea gigas (Mollusca, Lophotrochozoa is an alternative and irregular protandrous hermaphrodite: most individuals mature first as males and then change sex several times. Little is known about genetic and phenotypic basis of sex differentiation in oysters, and little more about the molecular pathways regulating reproduction. We have recently developed and validated a microarray containing 31,918 oligomers (Dheilly et al., 2011 representing the oyster transcriptome. The application of this microarray to the study of mollusk gametogenesis should provide a better understanding of the key factors involved in sex differentiation and the regulation of oyster reproduction. METHODOLOGY/PRINCIPAL FINDINGS: Gene expression was studied in gonads of oysters cultured over a yearly reproductive cycle. Principal component analysis and hierarchical clustering showed a significant divergence in gene expression patterns of males and females coinciding with the start of gonial mitosis. ANOVA analysis of the data revealed 2,482 genes differentially expressed during the course of males and/or females gametogenesis. The expression of 434 genes could be localized in either germ cells or somatic cells of the gonad by comparing the transcriptome of female gonads to the transcriptome of stripped oocytes and somatic tissues. Analysis of the annotated genes revealed conserved molecular mechanisms between mollusks and mammals: genes involved in chromatin condensation, DNA replication and repair, mitosis and meiosis regulation, transcription, translation and apoptosis were expressed in both male and female gonads. Most interestingly, early expressed male-specific genes included bindin and a dpy-30 homolog and female-specific genes included foxL2, nanos homolog 3, a pancreatic lipase related protein, cd63 and vitellogenin. Further functional analyses are now required in order to investigate their role in sex differentiation in oysters

  1. Microarray-based mutation analysis of the ABCA4 (ABCR) gene in autosomal recessive cone-rod dystrophy and retinitis pigmentosa.

    NARCIS (Netherlands)

    Klevering, B.J.; Ijzer, S.; Rohrschneider, K.; Zonneveld-Vrieling, M.N.; Allikmets, R.; Born, L.I. van den; Maugeri, A.; Hoyng, C.B.; Cremers, F.P.M.

    2004-01-01

    Mutations in the ABCA4 gene have been associated with autosomal recessive Stargardt disease (STGD1), cone-rod dystrophy (CRD), and retinitis pigmentosa (RP). We employed a recently developed genotyping microarray, the ABCR400-chip, to search for known ABCA4 mutations in patients with isolated or

  2. GeneChip microarrays-signal intensities, RNA concentrations and probe sequences

    International Nuclear Information System (INIS)

    Binder, Hans; Preibisch, Stephan

    2006-01-01

    GeneChip microarrays consist of hundreds of thousands of oligonucleotide probes. The transformation of their signal intensities into RNA transcript concentrations requires the knowledge of the response function of the measuring device. We analysed the 'apparatus' function of perfect match (PM) and mismatched (MM) oligonucleotide probes of GeneChip microarrays after changes of the target concentration using the results of a spiked-in experiment. In agreement with previous studies we found that a competitive two-species Langmuir-adsorption model describes the probe intensities well. Each PM and MM probe is characterized by two hybridization constants which specify the propensity of the probe to bind specific and non-specific transcripts. The affinity for non-specific hybridization is on average equal for PM and MM. The purine-pyrimidine asymmetry of base pair interaction strengths, however, causes a characteristic PM-MM intensity difference, the sign of which depends on the middle base of the probe. The affinity for specific hybridization of the PM exceeds that of the MM on average by nearly one order of magnitude because the central mismatched base only weakly contributes to the stability of the probe/target duplexes. For the first time we differentiate between the free energy parameters related to the 64 possible middle-triples of DNA/RNA oligomer duplexes with a central Watson-Crick pairing and a central mismatched pairing. Both the PM and MM probes respond to the concentration of specific transcripts, which can be estimated from the PM and MM probe intensities using the Langmuir-model. The analysis of the PM-MM intensity difference provides at least no loss of accuracy and precision of the estimated concentration compared with the PM-only estimates which in turn outperform the MM-only estimates. The results show that the processing of the PM-MM intensity difference requires the consideration of a background term due to non-specific hybridization, which is

  3. Integrative missing value estimation for microarray data

    Directory of Open Access Journals (Sweden)

    Zhou Xianghong

    2006-10-01

    Full Text Available Abstract Background Missing value estimation is an important preprocessing step in microarray analysis. Although several methods have been developed to solve this problem, their performance is unsatisfactory for datasets with high rates of missing data, high measurement noise, or limited numbers of samples. In fact, more than 80% of the time-series datasets in Stanford Microarray Database contain less than eight samples. Results We present the integrative Missing Value Estimation method (iMISS by incorporating information from multiple reference microarray datasets to improve missing value estimation. For each gene with missing data, we derive a consistent neighbor-gene list by taking reference data sets into consideration. To determine whether the given reference data sets are sufficiently informative for integration, we use a submatrix imputation approach. Our experiments showed that iMISS can significantly and consistently improve the accuracy of the state-of-the-art Local Least Square (LLS imputation algorithm by up to 15% improvement in our benchmark tests. Conclusion We demonstrated that the order-statistics-based integrative imputation algorithms can achieve significant improvements over the state-of-the-art missing value estimation approaches such as LLS and is especially good for imputing microarray datasets with a limited number of samples, high rates of missing data, or very noisy measurements. With the rapid accumulation of microarray datasets, the performance of our approach can be further improved by incorporating larger and more appropriate reference datasets.

  4. Novel candidate genes important for asthma and hypertension comorbidity revealed from associative gene networks.

    Science.gov (United States)

    Saik, Olga V; Demenkov, Pavel S; Ivanisenko, Timofey V; Bragina, Elena Yu; Freidin, Maxim B; Goncharova, Irina A; Dosenko, Victor E; Zolotareva, Olga I; Hofestaedt, Ralf; Lavrik, Inna N; Rogaev, Evgeny I; Ivanisenko, Vladimir A

    2018-02-13

    Hypertension and bronchial asthma are a major issue for people's health. As of 2014, approximately one billion adults, or ~ 22% of the world population, have had hypertension. As of 2011, 235-330 million people globally have been affected by asthma and approximately 250,000-345,000 people have died each year from the disease. The development of the effective treatment therapies against these diseases is complicated by their comorbidity features. This is often a major problem in diagnosis and their treatment. Hence, in this study the bioinformatical methodology for the analysis of the comorbidity of these two diseases have been developed. As such, the search for candidate genes related to the comorbid conditions of asthma and hypertension can help in elucidating the molecular mechanisms underlying the comorbid condition of these two diseases, and can also be useful for genotyping and identifying new drug targets. Using ANDSystem, the reconstruction and analysis of gene networks associated with asthma and hypertension was carried out. The gene network of asthma included 755 genes/proteins and 62,603 interactions, while the gene network of hypertension - 713 genes/proteins and 45,479 interactions. Two hundred and five genes/proteins and 9638 interactions were shared between asthma and hypertension. An approach for ranking genes implicated in the comorbid condition of two diseases was proposed. The approach is based on nine criteria for ranking genes by their importance, including standard methods of gene prioritization (Endeavor, ToppGene) as well as original criteria that take into account the characteristics of an associative gene network and the presence of known polymorphisms in the analysed genes. According to the proposed approach, the genes IL10, TLR4, and CAT had the highest priority in the development of comorbidity of these two diseases. Additionally, it was revealed that the list of top genes is enriched with apoptotic genes and genes involved in

  5. Comprehensive microarray-based analysis for stage-specific larval camouflage pattern-associated genes in the swallowtail butterfly, Papilio xuthus

    Directory of Open Access Journals (Sweden)

    Futahashi Ryo

    2012-05-01

    Full Text Available Abstract Background Body coloration is an ecologically important trait that is often involved in prey-predator interactions through mimicry and crypsis. Although this subject has attracted the interest of biologists and the general public, our scientific knowledge on the subject remains fragmentary. In the caterpillar of the swallowtail butterfly Papilio xuthus, spectacular changes in the color pattern are observed; the insect mimics bird droppings (mimetic pattern as a young larva, and switches to a green camouflage coloration (cryptic pattern in the final instar. Despite the wide variety and significance of larval color patterns, few studies have been conducted at a molecular level compared with the number of studies on adult butterfly wing patterns. Results To obtain a catalog of genes involved in larval mimetic and cryptic pattern formation, we constructed expressed sequence tag (EST libraries of larval epidermis for P. xuthus, and P. polytes that contained 20,736 and 5,376 clones, respectively, representing one of the largest collections available in butterflies. A comparison with silkworm epidermal EST information revealed the high expression of putative blue and yellow pigment-binding proteins in Papilio species. We also designed a microarray from the EST dataset information, analyzed more than five stages each for six markings, and confirmed spatial expression patterns by whole-mount in situ hybridization. Hence, we succeeded in elucidating many novel marking-specific genes for mimetic and cryptic pattern formation, including pigment-binding protein genes, the melanin-associated gene yellow-h3, the ecdysteroid synthesis enzyme gene 3-dehydroecdysone 3b-reductase, and Papilio-specific genes. We also found many cuticular protein genes with marking specificity that may be associated with the unique surface nanostructure of the markings. Furthermore, we identified two transcription factors, spalt and ecdysteroid signal-related E75, as genes

  6. Microarray Analysis on Gene Regulation by Estrogen, Progesterone and Tamoxifen in Human Endometrial Stromal Cells

    Directory of Open Access Journals (Sweden)

    Chun-E Ren

    2015-03-01

    Full Text Available Epithelial stromal cells represent a major cellular component of human uterine endometrium that is subject to tight hormonal regulation. Through cell-cell contacts and/or paracrine mechanisms, stromal cells play a significant role in the malignant transformation of epithelial cells. We isolated stromal cells from normal human endometrium and investigated the morphological and transcriptional changes induced by estrogen, progesterone and tamoxifen. We demonstrated that stromal cells express appreciable levels of estrogen and progesterone receptors and undergo different morphological changes upon hormonal stimulation. Microarray analysis indicated that both estrogen and progesterone induced dramatic alterations in a variety of genes associated with cell structure, transcription, cell cycle, and signaling. However, divergent patterns of changes, and in some genes opposite effects, were observed for the two hormones. A large number of genes are identified as novel targets for hormonal regulation. These hormone-responsive genes may be involved in normal uterine function and the development of endometrial malignancies.

  7. No Evidence That Schizophrenia Candidate Genes Are More Associated With Schizophrenia Than Noncandidate Genes

    NARCIS (Netherlands)

    Johnson, Emma C; Border, Richard; Melroy-Greif, Whitney E; de Leeuw, Christiaan A; Ehringer, Marissa A; Keller, Matthew C

    2017-01-01

    BACKGROUND: A recent analysis of 25 historical candidate gene polymorphisms for schizophrenia in the largest genome-wide association study conducted to date suggested that these commonly studied variants were no more associated with the disorder than would be expected by chance. However, the same

  8. Genome-Wide Association Study Identifying Candidate Genes Influencing Important Agronomic Traits of Flax (Linum usitatissimum L.) Using SLAF-seq.

    Science.gov (United States)

    Xie, Dongwei; Dai, Zhigang; Yang, Zemao; Sun, Jian; Zhao, Debao; Yang, Xue; Zhang, Liguo; Tang, Qing; Su, Jianguang

    2017-01-01

    Flax ( Linum usitatissimum L.) is an important cash crop, and its agronomic traits directly affect yield and quality. Molecular studies on flax remain inadequate because relatively few flax genes have been associated with agronomic traits or have been identified as having potential applications. To identify markers and candidate genes that can potentially be used for genetic improvement of crucial agronomic traits, we examined 224 specimens of core flax germplasm; specifically, phenotypic data for key traits, including plant height, technical length, number of branches, number of fruits, and 1000-grain weight were investigated under three environmental conditions before specific-locus amplified fragment sequencing (SLAF-seq) was employed to perform a genome-wide association study (GWAS) for these five agronomic traits. Subsequently, the results were used to screen single nucleotide polymorphism (SNP) loci and candidate genes that exhibited a significant correlation with the important agronomic traits. Our analyses identified a total of 42 SNP loci that showed significant correlations with the five important agronomic flax traits. Next, candidate genes were screened in the 10 kb zone of each of the 42 SNP loci. These SNP loci were then analyzed by a more stringent screening via co-identification using both a general linear model (GLM) and a mixed linear model (MLM) as well as co-occurrences in at least two of the three environments, whereby 15 final candidate genes were obtained. Based on these results, we determined that UGT and PL are candidate genes for plant height, GRAS and XTH are candidate genes for the number of branches, Contig1437 and LU0019C12 are candidate genes for the number of fruits, and PHO1 is a candidate gene for the 1000-seed weight. We propose that the identified SNP loci and corresponding candidate genes might serve as a biological basis for improving crucial agronomic flax traits.

  9. Genome-Wide Association Study Identifying Candidate Genes Influencing Important Agronomic Traits of Flax (Linum usitatissimum L. Using SLAF-seq

    Directory of Open Access Journals (Sweden)

    Dongwei Xie

    2018-01-01

    Full Text Available Flax (Linum usitatissimum L. is an important cash crop, and its agronomic traits directly affect yield and quality. Molecular studies on flax remain inadequate because relatively few flax genes have been associated with agronomic traits or have been identified as having potential applications. To identify markers and candidate genes that can potentially be used for genetic improvement of crucial agronomic traits, we examined 224 specimens of core flax germplasm; specifically, phenotypic data for key traits, including plant height, technical length, number of branches, number of fruits, and 1000-grain weight were investigated under three environmental conditions before specific-locus amplified fragment sequencing (SLAF-seq was employed to perform a genome-wide association study (GWAS for these five agronomic traits. Subsequently, the results were used to screen single nucleotide polymorphism (SNP loci and candidate genes that exhibited a significant correlation with the important agronomic traits. Our analyses identified a total of 42 SNP loci that showed significant correlations with the five important agronomic flax traits. Next, candidate genes were screened in the 10 kb zone of each of the 42 SNP loci. These SNP loci were then analyzed by a more stringent screening via co-identification using both a general linear model (GLM and a mixed linear model (MLM as well as co-occurrences in at least two of the three environments, whereby 15 final candidate genes were obtained. Based on these results, we determined that UGT and PL are candidate genes for plant height, GRAS and XTH are candidate genes for the number of branches, Contig1437 and LU0019C12 are candidate genes for the number of fruits, and PHO1 is a candidate gene for the 1000-seed weight. We propose that the identified SNP loci and corresponding candidate genes might serve as a biological basis for improving crucial agronomic flax traits.

  10. Monitoring expression profiles of rice (Oryza sativa L.) genes under abiotic stresses using cDNA Microarray Analysis (abstract)

    International Nuclear Information System (INIS)

    Rabbani, M.A.

    2005-01-01

    Transcript regulation in response to cold, drought, high salinity and ABA application was investigated in rice (Oryza sativa L., Nipponbare) with microarray analysis including approx. 1700 independent DNA elements derived from three cDNA libraries constructed from 15-day old rice seedlings stressed with drought, cold and high salinity. A total of 141 non-redundant genes were identified, whose expression ratios were more than three-fold compared with the control genes for at least one of stress treatments in microarray analysis. However, after RNA gel blot analysis, a total of 73 genes were identified, among them the transcripts of 36, 62, 57 and 43 genes were found increased after cold, drought, high salinity and ABA application, respectively. Sixteen of these identified genes have been reported previously to be stress inducible in rice, while 57 of which are novel that have not been reported earlier as stress responsive in rice. We observed a strong association in the expression patterns of stress responsive genes and found 15 stress inducible genes that responded to all four treatments. Based on Venn diagram analysis, 56 genes were induced by both drought and high salinity, whereas 22 genes were upregulated by both cold and high salinity stress. Similarly 43 genes were induced by both drought stress and ABA application, while only 17 genes were identified as cold and ABA inducible genes. These results indicated the existence of greater cross talk between drought, ABA and high salinity stress signaling processes than those between cold and ABA, and cold and high salinity stress signaling pathways. The cold, drought, high salinity and ABA inducible genes were classified into four gene groups from their expression profiles. Analysis of data enabled us to identify a number of promoters and possible cis-acting DNA elements of several genes induced by a variety of abiotic stresses by combining expression data with genomic sequence data of rice. Comparative analysis of

  11. Selection in the dopamine receptor 2 gene: a candidate SNP study

    Directory of Open Access Journals (Sweden)

    Tobias Göllner

    2015-08-01

    Full Text Available Dopamine is a major neurotransmitter in the human brain and is associated with various diseases. Schizophrenia, for example, is treated by blocking the dopamine receptors type 2. Shaner, Miller & Mintz (2004 stated that schizophrenia was the low fitness variant of a highly variable mental trait. We therefore explore whether the dopamine receptor 2 gene (DRD2 underwent any selection processes. We acquired genotype data of the 1,000 Genomes project (phase I, which contains 1,093 individuals from 14 populations. We included single nucleotide polymorphisms (SNPs with two minor allele frequencies (MAFs in the analysis: MAF over 0.05 and over 0.01. This is equivalent to 151 SNPs (MAF > 0.05 and 246 SNPs (MAF > 0.01 for DRD2. We used two different approaches (an outlier approach and a Bayesian approach to detect loci under selection. The combined results of both approaches yielded nine (MAF > 0.05 and two candidate SNPs (MAF > 0.01, under balancing selection. We also found weak signs for directional selection on DRD2, but in our opinion these were too weak to draw any final conclusions on directional selection in DRD2. All candidates for balancing selection are in the intronic region of the gene and only one (rs12574471 has been mentioned in the literature. Two of our candidate SNPs are located in specific regions of the gene: rs80215768 lies within a promoter flanking region and rs74751335 lies within a transcription factor binding site. We strongly encourage research on our candidate SNPs and their possible effects.

  12. Mapping a candidate gene (MdMYB10 for red flesh and foliage colour in apple

    Directory of Open Access Journals (Sweden)

    Allan Andrew C

    2007-07-01

    Full Text Available Abstract Background Integrating plant genomics and classical breeding is a challenge for both plant breeders and molecular biologists. Marker-assisted selection (MAS is a tool that can be used to accelerate the development of novel apple varieties such as cultivars that have fruit with anthocyanin through to the core. In addition, determining the inheritance of novel alleles, such as the one responsible for red flesh, adds to our understanding of allelic variation. Our goal was to map candidate anthocyanin biosynthetic and regulatory genes in a population segregating for the red flesh phenotypes. Results We have identified the Rni locus, a major genetic determinant of the red foliage and red colour in the core of apple fruit. In a population segregating for the red flesh and foliage phenotype we have determined the inheritance of the Rni locus and DNA polymorphisms of candidate anthocyanin biosynthetic and regulatory genes. Simple Sequence Repeats (SSRs and Single Nucleotide Polymorphisms (SNPs in the candidate genes were also located on an apple genetic map. We have shown that the MdMYB10 gene co-segregates with the Rni locus and is on Linkage Group (LG 09 of the apple genome. Conclusion We have performed candidate gene mapping in a fruit tree crop and have provided genetic evidence that red colouration in the fruit core as well as red foliage are both controlled by a single locus named Rni. We have shown that the transcription factor MdMYB10 may be the gene underlying Rni as there were no recombinants between the marker for this gene and the red phenotype in a population of 516 individuals. Associating markers derived from candidate genes with a desirable phenotypic trait has demonstrated the application of genomic tools in a breeding programme of a horticultural crop species.

  13. Candidate gene analysis using imputed genotypes: cell cycle single-nucleotide polymorphisms and ovarian cancer risk

    DEFF Research Database (Denmark)

    Goode, Ellen L; Fridley, Brooke L; Vierkant, Robert A

    2009-01-01

    Polymorphisms in genes critical to cell cycle control are outstanding candidates for association with ovarian cancer risk; numerous genes have been interrogated by multiple research groups using differing tagging single-nucleotide polymorphism (SNP) sets. To maximize information gleaned from......, and rs3212891; CDK2 rs2069391, rs2069414, and rs17528736; and CCNE1 rs3218036. These results exemplify the utility of imputation in candidate gene studies and lend evidence to a role of cell cycle genes in ovarian cancer etiology, suggest a reduced set of SNPs to target in additional cases and controls....

  14. Analysis of breast cancer metastasis candidate genes from next generation-sequencing via systematic functional genomics

    DEFF Research Database (Denmark)

    Blomstrøm, Monica Marie

    2016-01-01

    several growth modulators and invasion modulators were identified and independently validated. These candidates revealed a group of genes with metastasis-related functions in vitro that are involved in RNA-related processes, such as RNA-processing. Moreover, a general feature was that proliferation......) and non-CSCs. The main goal of this project was to functionally characterize a set of candidate genes recovered from next-generation sequencing analysis for their role in breast cancer metastasis formation. The starting gene set comprised 104 gene variants; i.e. 57 wildtype and 47 mutated variants. During...

  15. Vaccine-induced modulation of gene expression in turbot peritoneal cells. A microarray approach.

    Science.gov (United States)

    Fontenla, Francisco; Blanco-Abad, Verónica; Pardo, Belén G; Folgueira, Iria; Noia, Manuel; Gómez-Tato, Antonio; Martínez, Paulino; Leiro, José M; Lamas, Jesús

    2016-07-01

    We used a microarray approach to examine changes in gene expression in turbot peritoneal cells after injection of the fish with vaccines containing the ciliate parasite Philasterides dicentrarchi as antigen and one of the following adjuvants: chitosan-PVMMA microspheres, Freund́s complete adjuvant, aluminium hydroxide gel or Matrix-Q (Isconova, Sweden). We identified 374 genes that were differentially expressed in all groups of fish. Forty-two genes related to tight junctions and focal adhesions and/or actin cytoskeleton were differentially expressed in free peritoneal cells. The profound changes in gene expression related to cell adherence and cytoskeleton may be associated with cell migration and also with the formation of cell-vaccine masses and their attachment to the peritoneal wall. Thirty-five genes related to apoptosis were differentially expressed. Although most of the proteins coded by these genes have a proapoptotic effect, others are antiapoptotic, indicating that both types of signals occur in peritoneal leukocytes of vaccinated fish. Interestingly, many of the genes related to lymphocytes and lymphocyte activity were downregulated in the groups injected with vaccine. We also observed decreased expression of genes related to antigen presentation, suggesting that macrophages (which were abundant in the peritoneal cavity after vaccination) did not express these during the early inflammatory response in the peritoneal cavity. Finally, several genes that participate in the inflammatory response were differentially expressed, and most participated in resolution of inflammation, indicating that an M2 macrophage response is generated in the peritoneal cavity of fish one day post vaccination. Copyright © 2016 Elsevier Ltd. All rights reserved.

  16. Prosecutor: parameter-free inference of gene function for prokaryotes using DNA microarray data, genomic context and multiple gene annotation sources

    Directory of Open Access Journals (Sweden)

    van Hijum Sacha AFT

    2008-10-01

    Full Text Available Abstract Background Despite a plethora of functional genomic efforts, the function of many genes in sequenced genomes remains unknown. The increasing amount of microarray data for many species allows employing the guilt-by-association principle to predict function on a large scale: genes exhibiting similar expression patterns are more likely to participate in shared biological processes. Results We developed Prosecutor, an application that enables researchers to rapidly infer gene function based on available gene expression data and functional annotations. Our parameter-free functional prediction method uses a sensitive algorithm to achieve a high association rate of linking genes with unknown function to annotated genes. Furthermore, Prosecutor utilizes additional biological information such as genomic context and known regulatory mechanisms that are specific for prokaryotes. We analyzed publicly available transcriptome data sets and used literature sources to validate putative functions suggested by Prosecutor. We supply the complete results of our analysis for 11 prokaryotic organisms on a dedicated website. Conclusion The Prosecutor software and supplementary datasets available at http://www.prosecutor.nl allow researchers working on any of the analyzed organisms to quickly identify the putative functions of their genes of interest. A de novo analysis allows new organisms to be studied.

  17. Identifying Novel Candidate Genes Related to Apoptosis from a Protein-Protein Interaction Network

    Directory of Open Access Journals (Sweden)

    Baoman Wang

    2015-01-01

    Full Text Available Apoptosis is the process of programmed cell death (PCD that occurs in multicellular organisms. This process of normal cell death is required to maintain the balance of homeostasis. In addition, some diseases, such as obesity, cancer, and neurodegenerative diseases, can be cured through apoptosis, which produces few side effects. An effective comprehension of the mechanisms underlying apoptosis will be helpful to prevent and treat some diseases. The identification of genes related to apoptosis is essential to uncover its underlying mechanisms. In this study, a computational method was proposed to identify novel candidate genes related to apoptosis. First, protein-protein interaction information was used to construct a weighted graph. Second, a shortest path algorithm was applied to the graph to search for new candidate genes. Finally, the obtained genes were filtered by a permutation test. As a result, 26 genes were obtained, and we discuss their likelihood of being novel apoptosis-related genes by collecting evidence from published literature.

  18. Transcriptional profiling of endocrine cerebro-osteodysplasia using microarray and next-generation sequencing.

    Directory of Open Access Journals (Sweden)

    Piya Lahiry

    Full Text Available BACKGROUND: Transcriptome profiling of patterns of RNA expression is a powerful approach to identify networks of genes that play a role in disease. To date, most mRNA profiling of tissues has been accomplished using microarrays, but next-generation sequencing can offer a richer and more comprehensive picture. METHODOLOGY/PRINCIPAL FINDINGS: ECO is a rare multi-system developmental disorder caused by a homozygous mutation in ICK encoding intestinal cell kinase. We performed gene expression profiling using both cDNA microarrays and next-generation mRNA sequencing (mRNA-seq of skin fibroblasts from ECO-affected subjects. We then validated a subset of differentially expressed transcripts identified by each method using quantitative reverse transcription-polymerase chain reaction (qRT-PCR. Finally, we used gene ontology (GO to identify critical pathways and processes that were abnormal according to each technical platform. Methodologically, mRNA-seq identifies a much larger number of differentially expressed genes with much better correlation to qRT-PCR results than the microarray (r² = 0.794 and 0.137, respectively. Biologically, cDNA microarray identified functional pathways focused on anatomical structure and development, while the mRNA-seq platform identified a higher proportion of genes involved in cell division and DNA replication pathways. CONCLUSIONS/SIGNIFICANCE: Transcriptome profiling with mRNA-seq had greater sensitivity, range and accuracy than the microarray. The two platforms generated different but complementary hypotheses for further evaluation.

  19. Protective Effect of Gwakhyangjeonggisan Herbal Acupuncture Solution in Glioblastoma Cells: Microarray Analysis of Gene Expression

    Directory of Open Access Journals (Sweden)

    Hong-Seok Lee

    2005-12-01

    Full Text Available Objectives : Neurological disorders have been one of main therapeutic targets of acupuncture. The present study investigated the protective effects of Gwakhyangjeonggisan herbal acupuncture solution (GHAS. Methods : We performed 3-(4,5-dimethylthiazol-2-yl-2,5-diphenyltetrazolium bromide (MTT assay in glioblastoma cells, and did microarray analysis with cells exposed to reactive oxigen species (ROS of hydrogen peroxide by 8.0 k Human cDNA, with cut-off level of 2-fold changes in gene expression. Results : MTT assay showed protective effect of GHAS on the glioblastoma cells exposed to hydrogen peroxide. When glioblastoma cells were exposed to hydrogen peroxide, 24 genes were downregulated. When the cells were pretreated with GHAS before exposure to hydrogen peroxide, 46 genes were downregulated. Many of the genes downregulated by hydrogen peroxide stimulation were decreased in the amount of downregulation or reversed to upregulation. Conclusions : The gene expression changes observed in the present study are supposed to be related to the protective molecular mechanism of GHAS in the glioblastoma cells exposed to ROS stress.

  20. 'Omics' approaches in tomato aimed at identifying candidate genes ...

    African Journals Online (AJOL)

    adriana

    2013-12-04

    Dec 4, 2013 ... approaches could be combined in order to identify candidate genes for the genetic control of ascorbic ..... applied to other traits under the complex control of many ... Engineering increased vitamin C levels in ... Chem. Biol. 13:532–538. Giovannucci E, Rimm EB, Liu Y, Stampfer MJ, Willett WC (2002). A.

  1. Meta-Analysis of Multiple Sclerosis Microarray Data Reveals Dysregulation in RNA Splicing Regulatory Genes

    Directory of Open Access Journals (Sweden)

    Elvezia Maria Paraboschi

    2015-09-01

    Full Text Available Abnormalities in RNA metabolism and alternative splicing (AS are emerging as important players in complex disease phenotypes. In particular, accumulating evidence suggests the existence of pathogenic links between multiple sclerosis (MS and altered AS, including functional studies showing that an imbalance in alternatively-spliced isoforms may contribute to disease etiology. Here, we tested whether the altered expression of AS-related genes represents a MS-specific signature. A comprehensive comparative analysis of gene expression profiles of publicly-available microarray datasets (190 MS cases, 182 controls, followed by gene-ontology enrichment analysis, highlighted a significant enrichment for differentially-expressed genes involved in RNA metabolism/AS. In detail, a total of 17 genes were found to be differentially expressed in MS in multiple datasets, with CELF1 being dysregulated in five out of seven studies. We confirmed CELF1 downregulation in MS (p = 0.0015 by real-time RT-PCRs on RNA extracted from blood cells of 30 cases and 30 controls. As a proof of concept, we experimentally verified the unbalance in alternatively-spliced isoforms in MS of the NFAT5 gene, a putative CELF1 target. In conclusion, for the first time we provide evidence of a consistent dysregulation of splicing-related genes in MS and we discuss its possible implications in modulating specific AS events in MS susceptibility genes.

  2. Employing image processing techniques for cancer detection using microarray images.

    Science.gov (United States)

    Dehghan Khalilabad, Nastaran; Hassanpour, Hamid

    2017-02-01

    Microarray technology is a powerful genomic tool for simultaneously studying and analyzing the behavior of thousands of genes. The analysis of images obtained from this technology plays a critical role in the detection and treatment of diseases. The aim of the current study is to develop an automated system for analyzing data from microarray images in order to detect cancerous cases. The proposed system consists of three main phases, namely image processing, data mining, and the detection of the disease. The image processing phase performs operations such as refining image rotation, gridding (locating genes) and extracting raw data from images the data mining includes normalizing the extracted data and selecting the more effective genes. Finally, via the extracted data, cancerous cell is recognized. To evaluate the performance of the proposed system, microarray database is employed which includes Breast cancer, Myeloid Leukemia and Lymphomas from the Stanford Microarray Database. The results indicate that the proposed system is able to identify the type of cancer from the data set with an accuracy of 95.45%, 94.11%, and 100%, respectively. Copyright © 2017 Elsevier Ltd. All rights reserved.

  3. Validation of MIMGO: a method to identify differentially expressed GO terms in a microarray dataset

    Directory of Open Access Journals (Sweden)

    Yamada Yoichi

    2012-12-01

    Full Text Available Abstract Background We previously proposed an algorithm for the identification of GO terms that commonly annotate genes whose expression is upregulated or downregulated in some microarray data compared with in other microarray data. We call these “differentially expressed GO terms” and have named the algorithm “matrix-assisted identification method of differentially expressed GO terms” (MIMGO. MIMGO can also identify microarray data in which genes annotated with a differentially expressed GO term are upregulated or downregulated. However, MIMGO has not yet been validated on a real microarray dataset using all available GO terms. Findings We combined Gene Set Enrichment Analysis (GSEA with MIMGO to identify differentially expressed GO terms in a yeast cell cycle microarray dataset. GSEA followed by MIMGO (GSEA + MIMGO correctly identified (p Conclusions MIMGO is a reliable method to identify differentially expressed GO terms comprehensively.

  4. cDNA microarrays as a tool for identification of biomineralization proteins in the coccolithophorid Emiliania huxleyi (Haptophyta).

    Science.gov (United States)

    Quinn, Patrick; Bowers, Robert M; Zhang, Xiaoyu; Wahlund, Thomas M; Fanelli, Michael A; Olszova, Daniela; Read, Betsy A

    2006-08-01

    Marine unicellular coccolithophore algae produce species-specific calcite scales otherwise known as coccoliths. While the coccoliths and their elaborate architecture have attracted the attention of investigators from various scientific disciplines, our knowledge of the underpinnings of the process of biomineralization in this alga is still in its infancy. The processes of calcification and coccolithogenesis are highly regulated and likely to be complex, requiring coordinated expression of many genes and pathways. In this study, we have employed cDNA microarrays to investigate changes in gene expression associated with biomineralization in the most abundant coccolithophorid, Emiliania huxleyi. Expression profiling of cultures grown under calcifying and noncalcifying conditions has been carried out using cDNA microarrays corresponding to approximately 2,300 expressed sequence tags. A total of 127 significantly up- or down-regulated transcripts were identified using a P value of 0.01 and a change of >2.0-fold. Real-time reverse transcriptase PCR was used to test the overall validity of the microarray data, as well as the relevance of many of the proteins predicted to be associated with biomineralization, including a novel gamma-class carbonic anhydrase (A. R. Soto, H. Zheng, D. Shoemaker, J. Rodriguez, B. A. Read, and T. M. Wahlund, Appl. Environ. Microbiol. 72:5500-5511, 2006). Differentially regulated genes include those related to cellular metabolism, ion channels, transport proteins, vesicular trafficking, and cell signaling. The putative function of the vast majority of candidate transcripts could not be defined. Nonetheless, the data described herein represent profiles of the transcription changes associated with biomineralization-related pathways in E. huxleyi and have identified novel and potentially useful targets for more detailed analysis.

  5. Design issues in toxicogenomics using DNA microarray experiment

    International Nuclear Information System (INIS)

    Lee, Kyoung-Mu; Kim, Ju-Han; Kang, Daehee

    2005-01-01

    The methods of toxicogenomics might be classified into omics study (e.g., genomics, proteomics, and metabolomics) and population study focusing on risk assessment and gene-environment interaction. In omics study, microarray is the most popular approach. Genes falling into several categories (e.g., xenobiotics metabolism, cell cycle control, DNA repair etc.) can be selected up to 20,000 according to a priori hypothesis. The appropriate type of samples and species should be selected in advance. Multiple doses and varied exposure durations are suggested to identify those genes clearly linked to toxic response. Microarray experiments can be affected by numerous nuisance variables including experimental designs, sample extraction, type of scanners, etc. The number of slides might be determined from the magnitude and variance of expression change, false-positive rate, and desired power. Instead, pooling samples is an alternative. Online databases on chemicals with known exposure-disease outcomes and genetic information can aid the interpretation of the normalized results. Gene function can be inferred from microarray data analyzed by bioinformatics methods such as cluster analysis. The population study often adopts hospital-based or nested case-control design. Biases in subject selection and exposure assessment should be minimized, and confounding bias should also be controlled for in stratified or multiple regression analysis. Optimal sample sizes are dependent on the statistical test for gene-to-environment or gene-to-gene interaction. The design issues addressed in this mini-review are crucial in conducting toxicogenomics study. In addition, integrative approach of exposure assessment, epidemiology, and clinical trial is required

  6. Identification of Candidate B-Lymphoma Genes by Cross-Species Gene Expression Profiling

    Science.gov (United States)

    Tompkins, Van S.; Han, Seong-Su; Olivier, Alicia; Syrbu, Sergei; Bair, Thomas; Button, Anna; Jacobus, Laura; Wang, Zebin; Lifton, Samuel; Raychaudhuri, Pradip; Morse, Herbert C.; Weiner, George; Link, Brian; Smith, Brian J.; Janz, Siegfried

    2013-01-01

    Comparative genome-wide expression profiling of malignant tumor counterparts across the human-mouse species barrier has a successful track record as a gene discovery tool in liver, breast, lung, prostate and other cancers, but has been largely neglected in studies on neoplasms of mature B-lymphocytes such as diffuse large B cell lymphoma (DLBCL) and Burkitt lymphoma (BL). We used global gene expression profiles of DLBCL-like tumors that arose spontaneously in Myc-transgenic C57BL/6 mice as a phylogenetically conserved filter for analyzing the human DLBCL transcriptome. The human and mouse lymphomas were found to have 60 concordantly deregulated genes in common, including 8 genes that Cox hazard regression analysis associated with overall survival in a published landmark dataset of DLBCL. Genetic network analysis of the 60 genes followed by biological validation studies indicate FOXM1 as a candidate DLBCL and BL gene, supporting a number of studies contending that FOXM1 is a therapeutic target in mature B cell tumors. Our findings demonstrate the value of the “mouse filter” for genomic studies of human B-lineage neoplasms for which a vast knowledge base already exists. PMID:24130802

  7. Multi-platform whole-genome microarray analyses refine the epigenetic signature of breast cancer metastasis with gene expression and copy number.

    Directory of Open Access Journals (Sweden)

    Joseph Andrews

    2010-01-01

    Full Text Available We have previously identified genome-wide DNA methylation changes in a cell line model of breast cancer metastasis. These complex epigenetic changes that we observed, along with concurrent karyotype analyses, have led us to hypothesize that complex genomic alterations in cancer cells (deletions, translocations and ploidy are superimposed over promoter-specific methylation events that are responsible for gene-specific expression changes observed in breast cancer metastasis.We undertook simultaneous high-resolution, whole-genome analyses of MDA-MB-468GFP and MDA-MB-468GFP-LN human breast cancer cell lines (an isogenic, paired lymphatic metastasis cell line model using Affymetrix gene expression (U133, promoter (1.0R, and SNP/CNV (SNP 6.0 microarray platforms to correlate data from gene expression, epigenetic (DNA methylation, and combination copy number variant/single nucleotide polymorphism microarrays. Using Partek Software and Ingenuity Pathway Analysis we integrated datasets from these three platforms and detected multiple hypomethylation and hypermethylation events. Many of these epigenetic alterations correlated with gene expression changes. In addition, gene dosage events correlated with the karyotypic differences observed between the cell lines and were reflected in specific promoter methylation patterns. Gene subsets were identified that correlated hyper (and hypo methylation with the loss (or gain of gene expression and in parallel, with gene dosage losses and gains, respectively. Individual gene targets from these subsets were also validated for their methylation, expression and copy number status, and susceptible gene pathways were identified that may indicate how selective advantage drives the processes of tumourigenesis and metastasis.Our approach allows more precisely profiling of functionally relevant epigenetic signatures that are associated with cancer progression and metastasis.

  8. Candidate Genes Detected in Transcriptome Studies are Strongly Dependent on Genetic Background

    DEFF Research Database (Denmark)

    Sarup, Pernille Merete; Sørensen, Jesper Givskov; Kristensen, Torsten Nygård

    2011-01-01

    identified from studies of gene expression in Drosophila melanogaster using similar technical platforms. We found little overlap across studies between putative candidate genes for the same traits in the same sex. Instead there was a high degree of overlap between different traits and sexes within the same...

  9. Prediction of regulatory gene pairs using dynamic time warping and gene ontology.

    Science.gov (United States)

    Yang, Andy C; Hsu, Hui-Huang; Lu, Ming-Da; Tseng, Vincent S; Shih, Timothy K

    2014-01-01

    Selecting informative genes is the most important task for data analysis on microarray gene expression data. In this work, we aim at identifying regulatory gene pairs from microarray gene expression data. However, microarray data often contain multiple missing expression values. Missing value imputation is thus needed before further processing for regulatory gene pairs becomes possible. We develop a novel approach to first impute missing values in microarray time series data by combining k-Nearest Neighbour (KNN), Dynamic Time Warping (DTW) and Gene Ontology (GO). After missing values are imputed, we then perform gene regulation prediction based on our proposed DTW-GO distance measurement of gene pairs. Experimental results show that our approach is more accurate when compared with existing missing value imputation methods on real microarray data sets. Furthermore, our approach can also discover more regulatory gene pairs that are known in the literature than other methods.

  10. Extended -Regular Sequence for Automated Analysis of Microarray Images

    Directory of Open Access Journals (Sweden)

    Jin Hee-Jeong

    2006-01-01

    Full Text Available Microarray study enables us to obtain hundreds of thousands of expressions of genes or genotypes at once, and it is an indispensable technology for genome research. The first step is the analysis of scanned microarray images. This is the most important procedure for obtaining biologically reliable data. Currently most microarray image processing systems require burdensome manual block/spot indexing work. Since the amount of experimental data is increasing very quickly, automated microarray image analysis software becomes important. In this paper, we propose two automated methods for analyzing microarray images. First, we propose the extended -regular sequence to index blocks and spots, which enables a novel automatic gridding procedure. Second, we provide a methodology, hierarchical metagrid alignment, to allow reliable and efficient batch processing for a set of microarray images. Experimental results show that the proposed methods are more reliable and convenient than the commercial tools.

  11. Mining microarray datasets in nutrition: expression of the GPR120 (n-3 fatty acid receptor/sensor) gene is down-regulated in human adipocytes by macrophage secretions.

    Science.gov (United States)

    Trayhurn, Paul; Denyer, Gareth

    2012-01-01

    Microarray datasets are a rich source of information in nutritional investigation. Targeted mining of microarray data following initial, non-biased bioinformatic analysis can provide key insight into specific genes and metabolic processes of interest. Microarrays from human adipocytes were examined to explore the effects of macrophage secretions on the expression of the G-protein-coupled receptor (GPR) genes that encode fatty acid receptors/sensors. Exposure of the adipocytes to macrophage-conditioned medium for 4 or 24 h had no effect on GPR40 and GPR43 expression, but there was a marked stimulation of GPR84 expression (receptor for medium-chain fatty acids), the mRNA level increasing 13·5-fold at 24 h relative to unconditioned medium. Importantly, expression of GPR120, which encodes an n-3 PUFA receptor/sensor, was strongly inhibited by the conditioned medium (15-fold decrease in mRNA at 24 h). Macrophage secretions have major effects on the expression of fatty acid receptor/sensor genes in human adipocytes, which may lead to an augmentation of the inflammatory response in adipose tissue in obesity.

  12. EBF factors drive expression of multiple classes of target genes governing neuronal development.

    Science.gov (United States)

    Green, Yangsook S; Vetter, Monica L

    2011-04-30

    Early B cell factor (EBF) family members are transcription factors known to have important roles in several aspects of vertebrate neurogenesis, including commitment, migration and differentiation. Knowledge of how EBF family members contribute to neurogenesis is limited by a lack of detailed understanding of genes that are transcriptionally regulated by these factors. We performed a microarray screen in Xenopus animal caps to search for targets of EBF transcriptional activity, and identified candidate targets with multiple roles, including transcription factors of several classes. We determined that, among the most upregulated candidate genes with expected neuronal functions, most require EBF activity for some or all of their expression, and most have overlapping expression with ebf genes. We also found that the candidate target genes that had the most strongly overlapping expression patterns with ebf genes were predicted to be direct transcriptional targets of EBF transcriptional activity. The identification of candidate targets that are transcription factor genes, including nscl-1, emx1 and aml1, improves our understanding of how EBF proteins participate in the hierarchy of transcription control during neuronal development, and suggests novel mechanisms by which EBF activity promotes migration and differentiation. Other candidate targets, including pcdh8 and kcnk5, expand our knowledge of the types of terminal differentiated neuronal functions that EBF proteins regulate.

  13. Identification of candidate genes for dissecting complex branch number trait in chickpea.

    Science.gov (United States)

    Bajaj, Deepak; Upadhyaya, Hari D; Das, Shouvik; Kumar, Vinod; Gowda, C L L; Sharma, Shivali; Tyagi, Akhilesh K; Parida, Swarup K

    2016-04-01

    The present study exploited integrated genomics-assisted breeding strategy for genetic dissection of complex branch number quantitative trait in chickpea. Candidate gene-based association analysis in a branch number association panel was performed by utilizing the genotyping data of 401 SNP allelic variants mined from 27 known cloned branch number gene orthologs of chickpea. The genome-wide association study (GWAS) integrating both genome-wide GBS- (4556 SNPs) and candidate gene-based genotyping information of 4957 SNPs in a structured population of 60 sequenced desi and kabuli accessions (with 350-400 kb LD decay), detected 11 significant genomic loci (genes) associated (41% combined PVE) with branch number in chickpea. Of these, seven branch number-associated genes were further validated successfully in two inter (ICC 4958 × ICC 17160)- and intra (ICC 12299 × ICC 8261)-specific mapping populations. The axillary meristem and shoot apical meristem-specific expression, including differential up- and down-regulation (4-5 fold) of the validated seven branch number-associated genes especially in high branch number as compared to the low branch number-containing parental accessions and homozygous individuals of two aforesaid mapping populations was apparent. Collectively, this combinatorial genomic approach delineated diverse naturally occurring novel functional SNP allelic variants in seven potential known/candidate genes [PIN1 (PIN-FORMED protein 1), TB1 (teosinte branched 1), BA1/LAX1 (BARREN STALK1/LIKE AUXIN1), GRAS8 (gibberellic acid insensitive/GAI, Repressor of ga13/RGA and Scarecrow8/SCR8), ERF (ethylene-responsive element-binding factor), MAX2 (more axillary growth 2) and lipase] governing chickpea branch number. The useful information generated from this study have potential to expedite marker-assisted genetic enhancement by developing high-yielding cultivars with more number of productive (pods and seeds) branches in chickpea. Copyright © 2016 Elsevier

  14. Biological data warehousing system for identifying transcriptional regulatory sites from gene expressions of microarray data.

    Science.gov (United States)

    Tsou, Ann-Ping; Sun, Yi-Ming; Liu, Chia-Lin; Huang, Hsien-Da; Horng, Jorng-Tzong; Tsai, Meng-Feng; Liu, Baw-Juine

    2006-07-01

    Identification of transcriptional regulatory sites plays an important role in the investigation of gene regulation. For this propose, we designed and implemented a data warehouse to integrate multiple heterogeneous biological data sources with data types such as text-file, XML, image, MySQL database model, and Oracle database model. The utility of the biological data warehouse in predicting transcriptional regulatory sites of coregulated genes was explored using a synexpression group derived from a microarray study. Both of the binding sites of known transcription factors and predicted over-represented (OR) oligonucleotides were demonstrated for the gene group. The potential biological roles of both known nucleotides and one OR nucleotide were demonstrated using bioassays. Therefore, the results from the wet-lab experiments reinforce the power and utility of the data warehouse as an approach to the genome-wide search for important transcription regulatory elements that are the key to many complex biological systems.

  15. Rapid and reliable detection and identification of GM events using multiplex PCR coupled with oligonucleotide microarray.

    Science.gov (United States)

    Xu, Xiaodan; Li, Yingcong; Zhao, Heng; Wen, Si-yuan; Wang, Sheng-qi; Huang, Jian; Huang, Kun-lun; Luo, Yun-bo

    2005-05-18

    To devise a rapid and reliable method for the detection and identification of genetically modified (GM) events, we developed a multiplex polymerase chain reaction (PCR) coupled with a DNA microarray system simultaneously aiming at many targets in a single reaction. The system included probes for screening gene, species reference gene, specific gene, construct-specific gene, event-specific gene, and internal and negative control genes. 18S rRNA was combined with species reference genes as internal controls to assess the efficiency of all reactions and to eliminate false negatives. Two sets of the multiplex PCR system were used to amplify four and five targets, respectively. Eight different structure genes could be detected and identified simultaneously for Roundup Ready soybean in a single microarray. The microarray specificity was validated by its ability to discriminate two GM maizes Bt176 and Bt11. The advantages of this method are its high specificity and greatly reduced false-positives and -negatives. The multiplex PCR coupled with microarray technology presented here is a rapid and reliable tool for the simultaneous detection of GM organism ingredients.

  16. A GMM-IG framework for selecting genes as expression panel biomarkers.

    Science.gov (United States)

    Wang, Mingyi; Chen, Jake Y

    2010-01-01

    The limitation of small sample size of functional genomics experiments has made it necessary to integrate DNA microarray experimental data from different sources. However, experimentation noises and biases of different microarray platforms have made integrated data analysis challenging. In this work, we propose an integrative computational framework to identify candidate biomarker genes from publicly available functional genomics studies. We developed a new framework, Gaussian Mixture Modeling-Coupled Information Gain (GMM-IG). In this framework, we first apply a two-component Gaussian mixture model (GMM) to estimate the conditional probability distributions of gene expression data between two different types of samples, for example, normal versus cancer. An expectation-maximization algorithm is then used to estimate the maximum likelihood parameters of a mixture of two Gaussian models in the feature space and determine the underlying expression levels of genes. Gene expression results from different studies are discretized, based on GMM estimations and then unified. Significantly differentially-expressed genes are filtered and assessed with information gain (IG) measures. DNA microarray experimental data for lung cancers from three different prior studies was processed using the new GMM-IG method. Target gene markers from a gene expression panel were selected and compared with several conventional computational biomarker data analysis methods. GMM-IG showed consistently high accuracy for several classification assessments. A high reproducibility of gene selection results was also determined from statistical validations. Our study shows that the GMM-IG framework can overcome poor reliability issues from single-study DNA microarray experiment while maintaining high accuracies by combining true signals from multiple studies. We present a conceptually simple framework that enables reliable integration of true differential gene expression signals from multiple

  17. Quantitative inference of dynamic regulatory pathways via microarray data

    Directory of Open Access Journals (Sweden)

    Chen Bor-Sen

    2005-03-01

    Full Text Available Abstract Background The cellular signaling pathway (network is one of the main topics of organismic investigations. The intracellular interactions between genes in a signaling pathway are considered as the foundation of functional genomics. Thus, what genes and how much they influence each other through transcriptional binding or physical interactions are essential problems. Under the synchronous measures of gene expression via a microarray chip, an amount of dynamic information is embedded and remains to be discovered. Using a systematically dynamic modeling approach, we explore the causal relationship among genes in cellular signaling pathways from the system biology approach. Results In this study, a second-order dynamic model is developed to describe the regulatory mechanism of a target gene from the upstream causality point of view. From the expression profile and dynamic model of a target gene, we can estimate its upstream regulatory function. According to this upstream regulatory function, we would deduce the upstream regulatory genes with their regulatory abilities and activation delays, and then link up a regulatory pathway. Iteratively, these regulatory genes are considered as target genes to trace back their upstream regulatory genes. Then we could construct the regulatory pathway (or network to the genome wide. In short, we can infer the genetic regulatory pathways from gene-expression profiles quantitatively, which can confirm some doubted paths or seek some unknown paths in a regulatory pathway (network. Finally, the proposed approach is validated by randomly reshuffling the time order of microarray data. Conclusion We focus our algorithm on the inference of regulatory abilities of the identified causal genes, and how much delay before they regulate the downstream genes. With this information, a regulatory pathway would be built up using microarray data. In the present study, two signaling pathways, i.e. circadian regulatory

  18. Towards the integration, annotation and association of historical microarray experiments with RNA-seq.

    Science.gov (United States)

    Chavan, Shweta S; Bauer, Michael A; Peterson, Erich A; Heuck, Christoph J; Johann, Donald J

    2013-01-01

    Transcriptome analysis by microarrays has produced important advances in biomedicine. For instance in multiple myeloma (MM), microarray approaches led to the development of an effective disease subtyping via cluster assignment, and a 70 gene risk score. Both enabled an improved molecular understanding of MM, and have provided prognostic information for the purposes of clinical management. Many researchers are now transitioning to Next Generation Sequencing (NGS) approaches and RNA-seq in particular, due to its discovery-based nature, improved sensitivity, and dynamic range. Additionally, RNA-seq allows for the analysis of gene isoforms, splice variants, and novel gene fusions. Given the voluminous amounts of historical microarray data, there is now a need to associate and integrate microarray and RNA-seq data via advanced bioinformatic approaches. Custom software was developed following a model-view-controller (MVC) approach to integrate Affymetrix probe set-IDs, and gene annotation information from a variety of sources. The tool/approach employs an assortment of strategies to integrate, cross reference, and associate microarray and RNA-seq datasets. Output from a variety of transcriptome reconstruction and quantitation tools (e.g., Cufflinks) can be directly integrated, and/or associated with Affymetrix probe set data, as well as necessary gene identifiers and/or symbols from a diversity of sources. Strategies are employed to maximize the annotation and cross referencing process. Custom gene sets (e.g., MM 70 risk score (GEP-70)) can be specified, and the tool can be directly assimilated into an RNA-seq pipeline. A novel bioinformatic approach to aid in the facilitation of both annotation and association of historic microarray data, in conjunction with richer RNA-seq data, is now assisting with the study of MM cancer biology.

  19. Analysis of a positional candidate gene for inflammatory bowel disease: NRAMP2

    NARCIS (Netherlands)

    Stokkers, P. C.; Huibregtse, K.; Leegwater, A. C.; Reitsma, P. H.; Tytgat, G. N.; van Deventer, S. J.

    2000-01-01

    Genome scans have identified a region spanning 40 cM on the long arm of chromosome 12 as a susceptibility locus for inflammatory bowel disease (IBD). This locus contains several candidate genes for IBD, one of which is the gene for the natural resistance associated macrophage protein 2 (NRAMP2).

  20. Profiling trait anxiety: transcriptome analysis reveals cathepsin B (Ctsb as a novel candidate gene for emotionality in mice.

    Directory of Open Access Journals (Sweden)

    Ludwig Czibere

    Full Text Available Behavioral endophenotypes are determined by a multitude of counteracting but precisely balanced molecular and physiological mechanisms. In this study, we aim to identify potential novel molecular targets that contribute to the multigenic trait "anxiety". We used microarrays to investigate the gene expression profiles of different brain regions within the limbic system of mice which were selectively bred for either high (HAB or low (LAB anxiety-related behavior, and also show signs of comorbid depression-like behavior. We identified and confirmed sex-independent differences in the basal expression of 13 candidate genes, using tissue from the entire brain, including coronin 7 (Coro7, cathepsin B (Ctsb, muscleblind-like 1 (Mbnl1, metallothionein 1 (Mt1, solute carrier family 25 member 17 (Slc25a17, tribbles homolog 2 (Trib2, zinc finger protein 672 (Zfp672, syntaxin 3 (Stx3, ATP-binding cassette, sub-family A member 2 (Abca2, ectonucleotide pyrophosphatase/phosphodiesterase 5 (Enpp5, high mobility group nucleosomal binding domain 3 (Hmgn3 and pyruvate dehydrogenase beta (Pdhb. Additionally, we confirmed brain region-specific differences in the expression of synaptotagmin 4 (Syt4.Our identification of about 90 polymorphisms in Ctsb suggested that this gene might play a critical role in shaping our mouse model's behavioral endophenotypes. Indeed, the assessment of anxiety-related and depression-like behaviors of Ctsb knock-out mice revealed an increase in depression-like behavior in females. Altogether, our results suggest that Ctsb has significant effects on emotionality, irrespective of the tested mouse strain, making it a promising target for future pharmacotherapy.

  1. Identification of Candidate Genes Responsible for Stem Pith Production Using Expression Analysis in Solid-Stemmed Wheat.

    Science.gov (United States)

    Oiestad, A J; Martin, J M; Cook, J; Varella, A C; Giroux, M J

    2017-07-01

    The wheat stem sawfly (WSS) is an economically important pest of wheat in the Northern Great Plains. The primary means of WSS control is resistance associated with the single quantitative trait locus (QTL) , which controls most stem solidness variation. The goal of this study was to identify stem solidness candidate genes via RNA-seq. This study made use of 28 single nucleotide polymorphism (SNP) makers derived from expressed sequence tags (ESTs) linked to contained within a 5.13 cM region. Allele specific expression of EST markers was examined in stem tissue for solid and hollow-stemmed pairs of two spring wheat near isogenic lines (NILs) differing for the QTL. Of the 28 ESTs, 13 were located within annotated genes and 10 had detectable stem expression. Annotated genes corresponding to four of the ESTs were differentially expressed between solid and hollow-stemmed NILs and represent possible stem solidness gene candidates. Further examination of the 5.13 cM region containing the 28 EST markers identified 260 annotated genes. Twenty of the 260 linked genes were up-regulated in hollow NIL stems, while only seven genes were up-regulated in solid NIL stems. An -methyltransferase within the region of interest was identified as a candidate based on differential expression between solid and hollow-stemmed NILs and putative function. Further study of these candidate genes may lead to the identification of the gene(s) controlling stem solidness and an increased ability to select for wheat stem solidness and manage WSS. Copyright © 2017 Crop Science Society of America.

  2. Short-term arginine deprivation results in large-scale modulation of hepatic gene expression in both normal and tumor cells: microarray bioinformatic analysis

    Directory of Open Access Journals (Sweden)

    Sabo Edmond

    2006-09-01

    Full Text Available Abstract Background We have reported arginine-sensitive regulation of LAT1 amino acid transporter (SLC 7A5 in normal rodent hepatic cells with loss of arginine sensitivity and high level constitutive expression in tumor cells. We hypothesized that liver cell gene expression is highly sensitive to alterations in the amino acid microenvironment and that tumor cells may differ substantially in gene sets sensitive to amino acid availability. To assess the potential number and classes of hepatic genes sensitive to arginine availability at the RNA level and compare these between normal and tumor cells, we used an Affymetrix microarray approach, a paired in vitro model of normal rat hepatic cells and a tumorigenic derivative with triplicate independent replicates. Cells were exposed to arginine-deficient or control conditions for 18 hours in medium formulated to maintain differentiated function. Results Initial two-way analysis with a p-value of 0.05 identified 1419 genes in normal cells versus 2175 in tumor cells whose expression was altered in arginine-deficient conditions relative to controls, representing 9–14% of the rat genome. More stringent bioinformatic analysis with 9-way comparisons and a minimum of 2-fold variation narrowed this set to 56 arginine-responsive genes in normal liver cells and 162 in tumor cells. Approximately half the arginine-responsive genes in normal cells overlap with those in tumor cells. Of these, the majority was increased in expression and included multiple growth, survival, and stress-related genes. GADD45, TA1/LAT1, and caspases 11 and 12 were among this group. Previously known amino acid regulated genes were among the pool in both cell types. Available cDNA probes allowed independent validation of microarray data for multiple genes. Among genes downregulated under arginine-deficient conditions were multiple genes involved in cholesterol and fatty acid metabolism. Expression of low-density lipoprotein receptor was

  3. Computational biology of genome expression and regulation--a review of microarray bioinformatics.

    Science.gov (United States)

    Wang, Junbai

    2008-01-01

    Microarray technology is being used widely in various biomedical research areas; the corresponding microarray data analysis is an essential step toward the best utilizing of array technologies. Here we review two components of the microarray data analysis: a low level of microarray data analysis that emphasizes the designing, the quality control, and the preprocessing of microarray experiments, then a high level of microarray data analysis that focuses on the domain-specific microarray applications such as tumor classification, biomarker prediction, analyzing array CGH experiments, and reverse engineering of gene expression networks. Additionally, we will review the recent development of building a predictive model in genome expression and regulation studies. This review may help biologists grasp a basic knowledge of microarray bioinformatics as well as its potential impact on the future evolvement of biomedical research fields.

  4. Advanced spot quality analysis in two-colour microarray experiments

    Directory of Open Access Journals (Sweden)

    Vetter Guillaume

    2008-09-01

    Full Text Available Abstract Background Image analysis of microarrays and, in particular, spot quantification and spot quality control, is one of the most important steps in statistical analysis of microarray data. Recent methods of spot quality control are still in early age of development, often leading to underestimation of true positive microarray features and, consequently, to loss of important biological information. Therefore, improving and standardizing the statistical approaches of spot quality control are essential to facilitate the overall analysis of microarray data and subsequent extraction of biological information. Findings We evaluated the performance of two image analysis packages MAIA and GenePix (GP using two complementary experimental approaches with a focus on the statistical analysis of spot quality factors. First, we developed control microarrays with a priori known fluorescence ratios to verify the accuracy and precision of the ratio estimation of signal intensities. Next, we developed advanced semi-automatic protocols of spot quality evaluation in MAIA and GP and compared their performance with available facilities of spot quantitative filtering in GP. We evaluated these algorithms for standardised spot quality analysis in a whole-genome microarray experiment assessing well-characterised transcriptional modifications induced by the transcription regulator SNAI1. Using a set of RT-PCR or qRT-PCR validated microarray data, we found that the semi-automatic protocol of spot quality control we developed with MAIA allowed recovering approximately 13% more spots and 38% more differentially expressed genes (at FDR = 5% than GP with default spot filtering conditions. Conclusion Careful control of spot quality characteristics with advanced spot quality evaluation can significantly increase the amount of confident and accurate data resulting in more meaningful biological conclusions.

  5. Candidate genes for performance in horses, including monocarboxylate transporters

    Directory of Open Access Journals (Sweden)

    Inaê Cristina Regatieri

    Full Text Available ABSTRACT: Some horse breeds are highly selected for athletic activities. The athletic potential of each animal can be measured by its performance in sports. High athletic performance depends on the animal capacity to produce energy through aerobic and anaerobic metabolic pathways, among other factors. Transmembrane proteins called monocarboxylate transporters, mainly the isoform 1 (MCT1 and its ancillary protein CD147, can help the organism to adapt to physiological stress caused by physical exercise, transporting lactate and H+ ions. Horse breeds are selected for different purposes so we might expect differences in the amount of those proteins and in the genotypic frequencies for genes that play a significant role in the performance of the animals. The study of MCT1 and CD147 gene polymorphisms, which can affect the formation of the proteins and transport of lactate and H+, can provide enough information to be used for selection of athletic horses increasingly resistant to intense exercise. Two other candidate genes, the PDK4 and DMRT3, have been associated with athletic potential and indicated as possible markers for performance in horses. The oxidation of fatty acids is highly effective in generating ATP and is controlled by the expression of PDK4 (pyruvate dehydrogenase kinase, isozyme 4 in skeletal muscle during and after exercise. The doublesex and mab-3 related transcription factor 3 (DMRT3 gene encodes an important transcription factor in the setting of spinal cord circuits controlling movement in vertebrates and may be associated with gait performance in horses. This review describes how the monocarboxylate transporters work during physical exercise in athletic horses and the influence of polymorphisms in candidate genes for athletic performance in horses.

  6. Validation of MIMGO: a method to identify differentially expressed GO terms in a microarray dataset

    OpenAIRE

    Yamada, Yoichi; Sawada, Hiroki; Hirotani, Ken-ichi; Oshima, Masanobu; Satou, Kenji

    2012-01-01

    Abstract Background We previously proposed an algorithm for the identification of GO terms that commonly annotate genes whose expression is upregulated or downregulated in some microarray data compared with in other microarray data. We call these “differentially expressed GO terms” and have named the algorithm “matrix-assisted identification method of differentially expressed GO terms” (MIMGO). MIMGO can also identify microarray data in which genes annotated with a differentially expressed GO...

  7. Microarray-based screening of differentially expressed genes in glucocorticoid-induced avascular necrosis

    Science.gov (United States)

    Huang, Gangyong; Wei, Yibing; Zhao, Guanglei; Xia, Jun; Wang, Siqun; Wu, Jianguo; Chen, Feiyan; Chen, Jie; Shi, Jingshen

    2017-01-01

    The underlying mechanisms of glucocorticoid (GC)-induced avascular necrosis of the femoral head (ANFH) have yet to be fully understood, in particular the mechanisms associated with the change of gene expression pattern. The present study aimed to identify key genes with a differential expression pattern in GC-induced ANFH. E-MEXP-2751 microarray data were downloaded from the ArrayExpress database. Differentially expressed genes (DEGs) were identified in 5 femoral head samples of steroid-induced ANFH rats compared with 5 placebo-treated rat samples. Gene Ontology (GO) and pathway enrichment analyses were performed upon these DEGs. A total 93 DEGs (46 upregulated and 47 downregulated genes) were identified in GC-induced ANFH samples. These DEGs were enriched in different GO terms and pathways, including chondrocyte differentiation and detection of chemical stimuli. The enrichment map revealed that skeletal system development was interconnected with several other GO terms by gene overlap. The literature mined network analysis revealed that 5 upregulated genes were associated with femoral necrosis, including parathyroid hormone receptor 1 (PTHR1), vitamin D (1,25-Dihydroxyvitamin D3) receptor (VDR), collagen, type II, α1, proprotein convertase subtilisin/kexin type 6 and zinc finger protein 354C (ZFP354C). In addition, ZFP354C and VDR were identified to transcription factors. Furthermore, PTHR1 was revealed to interact with VDR, and α-2-macroglobulin (A2M) interacted with fibronectin 1 (FN1) in the PPI network. PTHR1 may be involved in GC-induced ANFH via interacting with VDR. A2M may also be involved in the development of GC-induced ANFH through interacting with FN1. An improved understanding of the molecular mechanisms underlying GC-induced ANFH may provide novel targets for diagnostics and therapeutic treatment. PMID:28393228

  8. Carbohydrate microarrays

    DEFF Research Database (Denmark)

    Park, Sungjin; Gildersleeve, Jeffrey C; Blixt, Klas Ola

    2012-01-01

    In the last decade, carbohydrate microarrays have been core technologies for analyzing carbohydrate-mediated recognition events in a high-throughput fashion. A number of methods have been exploited for immobilizing glycans on the solid surface in a microarray format. This microarray...... of substrate specificities of glycosyltransferases. This review covers the construction of carbohydrate microarrays, detection methods of carbohydrate microarrays and their applications in biological and biomedical research....

  9. No Association between Personality and Candidate Gene Polymorphisms in a Wild Bird Population.

    Directory of Open Access Journals (Sweden)

    Hannah A Edwards

    Full Text Available Consistency of between-individual differences in behaviour or personality is a phenomenon in populations that can have ecological consequences and evolutionary potential. One way that behaviour can evolve is to have a genetic basis. Identifying the molecular genetic basis of personality could therefore provide insight into how and why such variation is maintained, particularly in natural populations. Previously identified candidate genes for personality in birds include the dopamine receptor D4 (DRD4, and serotonin transporter (SERT. Studies of wild bird populations have shown that exploratory and bold behaviours are associated with polymorphisms in both DRD4 and SERT. Here we tested for polymorphisms in DRD4 and SERT in the Seychelles warbler (Acrocephalus sechellensis population on Cousin Island, Seychelles, and then investigated correlations between personality and polymorphisms in these genes. We found no genetic variation in DRD4, but identified four polymorphisms in SERT that clustered into five haplotypes. There was no correlation between bold or exploratory behaviours and SERT polymorphisms/haplotypes. The null result was not due to lack of power, and indicates that there was no association between these behaviours and variation in the candidate genes tested in this population. These null findings provide important data to facilitate representative future meta-analyses on candidate personality genes.

  10. Association analysis of 94 candidate genes and schizophrenia-related endophenotypes.

    Directory of Open Access Journals (Sweden)

    Tiffany A Greenwood

    Full Text Available While it is clear that schizophrenia is highly heritable, the genetic basis of this heritability is complex. Human genetic, brain imaging, and model organism studies have met with only modest gains. A complementary research tactic is to evaluate the genetic substrates of quantitative endophenotypes with demonstrated deficits in schizophrenia patients. We used an Illumina custom 1,536-SNP array to interrogate 94 functionally relevant candidate genes for schizophrenia and evaluate association with both the qualitative diagnosis of schizophrenia and quantitative endophenotypes for schizophrenia. Subjects included 219 schizophrenia patients and normal comparison subjects of European ancestry and 76 schizophrenia patients and normal comparison subjects of African ancestry, all ascertained by the UCSD Schizophrenia Research Program. Six neurophysiological and neurocognitive endophenotype test paradigms were assessed: prepulse inhibition (PPI, P50 suppression, the antisaccade oculomotor task, the Letter-Number Span Test, the California Verbal Learning Test-II, and the Wisconsin Card Sorting Test-64 Card Version. These endophenotype test paradigms yielded six primary endophenotypes with prior evidence of heritability and demonstrated schizophrenia-related impairments, as well as eight secondary measures investigated as candidate endophenotypes. Schizophrenia patients showed significant deficits on ten of the endophenotypic measures, replicating prior studies and facilitating genetic analyses of these phenotypes. A total of 38 genes were found to be associated with at least one endophenotypic measure or schizophrenia with an empirical p-value<0.01. Many of these genes have been shown to interact on a molecular level, and eleven genes displayed evidence for pleiotropy, revealing associations with three or more endophenotypic measures. Among these genes were ERBB4 and NRG1, providing further support for a role of these genes in schizophrenia susceptibility

  11. Cartilage-selective genes identified in genome-scale analysis of non-cartilage and cartilage gene expression

    Directory of Open Access Journals (Sweden)

    Cohn Zachary A

    2007-06-01

    Full Text Available Abstract Background Cartilage plays a fundamental role in the development of the human skeleton. Early in embryogenesis, mesenchymal cells condense and differentiate into chondrocytes to shape the early skeleton. Subsequently, the cartilage anlagen differentiate to form the growth plates, which are responsible for linear bone growth, and the articular chondrocytes, which facilitate joint function. However, despite the multiplicity of roles of cartilage during human fetal life, surprisingly little is known about its transcriptome. To address this, a whole genome microarray expression profile was generated using RNA isolated from 18–22 week human distal femur fetal cartilage and compared with a database of control normal human tissues aggregated at UCLA, termed Celsius. Results 161 cartilage-selective genes were identified, defined as genes significantly expressed in cartilage with low expression and little variation across a panel of 34 non-cartilage tissues. Among these 161 genes were cartilage-specific genes such as cartilage collagen genes and 25 genes which have been associated with skeletal phenotypes in humans and/or mice. Many of the other cartilage-selective genes do not have established roles in cartilage or are novel, unannotated genes. Quantitative RT-PCR confirmed the unique pattern of gene expression observed by microarray analysis. Conclusion Defining the gene expression pattern for cartilage has identified new genes that may contribute to human skeletogenesis as well as provided further candidate genes for skeletal dysplasias. The data suggest that fetal cartilage is a complex and transcriptionally active tissue and demonstrate that the set of genes selectively expressed in the tissue has been greatly underestimated.

  12. Gene expression analysis of the biocontrol fungus Trichoderma harzianum in the presence of tomato plants, chitin, or glucose using a high-density oligonucleotide microarray.

    Science.gov (United States)

    Samolski, Ilanit; de Luis, Alberto; Vizcaíno, Juan Antonio; Monte, Enrique; Suárez, M Belén

    2009-10-13

    It has recently been shown that the Trichoderma fungal species used for biocontrol of plant diseases are capable of interacting with plant roots directly, behaving as symbiotic microorganisms. With a view to providing further information at transcriptomic level about the early response of Trichoderma to a host plant, we developed a high-density oligonucleotide (HDO) microarray encompassing 14,081 Expressed Sequence Tag (EST)-based transcripts from eight Trichoderma spp. and 9,121 genome-derived transcripts of T. reesei, and we have used this microarray to examine the gene expression of T. harzianum either alone or in the presence of tomato plants, chitin, or glucose. Global microarray analysis revealed 1,617 probe sets showing differential expression in T. harzianum mycelia under at least one of the culture conditions tested as compared with one another. Hierarchical clustering and heat map representation showed that the expression patterns obtained in glucose medium clustered separately from the expression patterns observed in the presence of tomato plants and chitin. Annotations using the Blast2GO suite identified 85 of the 257 transcripts whose probe sets afforded up-regulated expression in response to tomato plants. Some of these transcripts were predicted to encode proteins related to Trichoderma-host (fungus or plant) associations, such as Sm1/Elp1 protein, proteases P6281 and PRA1, enchochitinase CHIT42, or QID74 protein, although previously uncharacterized genes were also identified, including those responsible for the possible biosynthesis of nitric oxide, xenobiotic detoxification, mycelium development, or those related to the formation of infection structures in plant tissues. The effectiveness of the Trichoderma HDO microarray to detect different gene responses under different growth conditions in the fungus T. harzianum strongly indicates that this tool should be useful for further assays that include different stages of plant colonization, as well as

  13. Use of homologous and heterologous gene expression profiling tools to characterize transcription dynamics during apple fruit maturation and ripening

    Directory of Open Access Journals (Sweden)

    Sansavini Silviero

    2010-10-01

    Full Text Available Abstract Background Fruit development, maturation and ripening consists of a complex series of biochemical and physiological changes that in climacteric fruits, including apple and tomato, are coordinated by the gaseous hormone ethylene. These changes lead to final fruit quality and understanding of the functional machinery underlying these processes is of both biological and practical importance. To date many reports have been made on the analysis of gene expression in apple. In this study we focused our investigation on the role of ethylene during apple maturation, specifically comparing transcriptomics of normal ripening with changes resulting from application of the hormone receptor competitor 1-Methylcyclopropene. Results To gain insight into the molecular process regulating ripening in apple, and to compare to tomato (model species for ripening studies, we utilized both homologous and heterologous (tomato microarray to profile transcriptome dynamics of genes involved in fruit development and ripening, emphasizing those which are ethylene regulated. The use of both types of microarrays facilitated transcriptome comparison between apple and tomato (for the later using data previously published and available at the TED: tomato expression database and highlighted genes conserved during ripening of both species, which in turn represent a foundation for further comparative genomic studies. The cross-species analysis had the secondary aim of examining the efficiency of heterologous (specifically tomato microarray hybridization for candidate gene identification as related to the ripening process. The resulting transcriptomics data revealed coordinated gene expression during fruit ripening of a subset of ripening-related and ethylene responsive genes, further facilitating the analysis of ethylene response during fruit maturation and ripening. Conclusion Our combined strategy based on microarray hybridization enabled transcriptome characterization

  14. Use of homologous and heterologous gene expression profiling tools to characterize transcription dynamics during apple fruit maturation and ripening.

    Science.gov (United States)

    Costa, Fabrizio; Alba, Rob; Schouten, Henk; Soglio, Valeria; Gianfranceschi, Luca; Serra, Sara; Musacchi, Stefano; Sansavini, Silviero; Costa, Guglielmo; Fei, Zhangjun; Giovannoni, James

    2010-10-25

    Fruit development, maturation and ripening consists of a complex series of biochemical and physiological changes that in climacteric fruits, including apple and tomato, are coordinated by the gaseous hormone ethylene. These changes lead to final fruit quality and understanding of the functional machinery underlying these processes is of both biological and practical importance. To date many reports have been made on the analysis of gene expression in apple. In this study we focused our investigation on the role of ethylene during apple maturation, specifically comparing transcriptomics of normal ripening with changes resulting from application of the hormone receptor competitor 1-methylcyclopropene. To gain insight into the molecular process regulating ripening in apple, and to compare to tomato (model species for ripening studies), we utilized both homologous and heterologous (tomato) microarray to profile transcriptome dynamics of genes involved in fruit development and ripening, emphasizing those which are ethylene regulated.The use of both types of microarrays facilitated transcriptome comparison between apple and tomato (for the later using data previously published and available at the TED: tomato expression database) and highlighted genes conserved during ripening of both species, which in turn represent a foundation for further comparative genomic studies. The cross-species analysis had the secondary aim of examining the efficiency of heterologous (specifically tomato) microarray hybridization for candidate gene identification as related to the ripening process. The resulting transcriptomics data revealed coordinated gene expression during fruit ripening of a subset of ripening-related and ethylene responsive genes, further facilitating the analysis of ethylene response during fruit maturation and ripening. Our combined strategy based on microarray hybridization enabled transcriptome characterization during normal climacteric apple ripening, as well as

  15. Detection of genomic deletions in rice using oligonucleotide microarrays

    Directory of Open Access Journals (Sweden)

    Bordeos Alicia

    2009-03-01

    Full Text Available Abstract Background The induction of genomic deletions by physical- or chemical- agents is an easy and inexpensive means to generate a genome-saturating collection of mutations. Different mutagens can be selected to ensure a mutant collection with a range of deletion sizes. This would allow identification of mutations in single genes or, alternatively, a deleted group of genes that might collectively govern a trait (e.g., quantitative trait loci, QTL. However, deletion mutants have not been widely used in functional genomics, because the mutated genes are not tagged and therefore, difficult to identify. Here, we present a microarray-based approach to identify deleted genomic regions in rice mutants selected from a large collection generated by gamma ray or fast neutron treatment. Our study focuses not only on the utility of this method for forward genetics, but also its potential as a reverse genetics tool through accumulation of hybridization data for a collection of deletion mutants harboring multiple genetic lesions. Results We demonstrate that hybridization of labeled genomic DNA directly onto the Affymetrix Rice GeneChip® allows rapid localization of deleted regions in rice mutants. Deletions ranged in size from one gene model to ~500 kb and were predicted on all 12 rice chromosomes. The utility of the technique as a tool in forward genetics was demonstrated in combination with an allelic series of mutants to rapidly narrow the genomic region, and eventually identify a candidate gene responsible for a lesion mimic phenotype. Finally, the positions of mutations in 14 mutants were aligned onto the rice pseudomolecules in a user-friendly genome browser to allow for rapid identification of untagged mutations http://irfgc.irri.org/cgi-bin/gbrowse/IR64_deletion_mutants/. Conclusion We demonstrate the utility of oligonucleotide arrays to discover deleted genes in rice. The density and distribution of deletions suggests the feasibility of a

  16. Genome-scale cluster analysis of replicated microarrays using shrinkage correlation coefficient.

    Science.gov (United States)

    Yao, Jianchao; Chang, Chunqi; Salmi, Mari L; Hung, Yeung Sam; Loraine, Ann; Roux, Stanley J

    2008-06-18

    Currently, clustering with some form of correlation coefficient as the gene similarity metric has become a popular method for profiling genomic data. The Pearson correlation coefficient and the standard deviation (SD)-weighted correlation coefficient are the two most widely-used correlations as the similarity metrics in clustering microarray data. However, these two correlations are not optimal for analyzing replicated microarray data generated by most laboratories. An effective correlation coefficient is needed to provide statistically sufficient analysis of replicated microarray data. In this study, we describe a novel correlation coefficient, shrinkage correlation coefficient (SCC), that fully exploits the similarity between the replicated microarray experimental samples. The methodology considers both the number of replicates and the variance within each experimental group in clustering expression data, and provides a robust statistical estimation of the error of replicated microarray data. The value of SCC is revealed by its comparison with two other correlation coefficients that are currently the most widely-used (Pearson correlation coefficient and SD-weighted correlation coefficient) using statistical measures on both synthetic expression data as well as real gene expression data from Saccharomyces cerevisiae. Two leading clustering methods, hierarchical and k-means clustering were applied for the comparison. The comparison indicated that using SCC achieves better clustering performance. Applying SCC-based hierarchical clustering to the replicated microarray data obtained from germinating spores of the fern Ceratopteris richardii, we discovered two clusters of genes with shared expression patterns during spore germination. Functional analysis suggested that some of the genetic mechanisms that control germination in such diverse plant lineages as mosses and angiosperms are also conserved among ferns. This study shows that SCC is an alternative to the Pearson

  17. Development of a cell-defined siRNA microarray for analysis of gene function in human bone marrow stromal cells

    Directory of Open Access Journals (Sweden)

    Hi Chul Kim

    2016-03-01

    The efficiency of this CDSM was verified using three siRNAs (targeting p65, Slug, and N-cadherin, with persistent gene silencing for 5 days. We obtained the significant and reliable data with effective knock-down in our condition, and suggested our method as the qualitatively improved siRNA microarray screening method for hBMSCs.

  18. Microarrays in ecological research: A case study of a cDNA microarray for plant-herbivore interactions

    Directory of Open Access Journals (Sweden)

    Gase Klaus

    2004-09-01

    Full Text Available Abstract Background Microarray technology allows researchers to simultaneously monitor changes in the expression ratios (ERs of hundreds of genes and has thereby revolutionized most of biology. Although this technique has the potential of elucidating early stages in an organism's phenotypic response to complex ecological interactions, to date, it has not been fully incorporated into ecological research. This is partially due to a lack of simple procedures of handling and analyzing the expression ratio (ER data produced from microarrays. Results We describe an analysis of the sources of variation in ERs from 73 hybridized cDNA microarrays, each with 234 herbivory-elicited genes from the model ecological expression system, Nicotiana attenuata, using procedures that are commonly used in ecologic research. Each gene is represented by two independently labeled PCR products and each product was arrayed in quadruplicate. We present a robust method of normalizing and analyzing ERs based on arbitrary thresholds and statistical criteria, and characterize a "norm of reaction" of ERs for 6 genes (4 of known function, 2 of unknown with different ERs as determined across all analyzed arrays to provide a biologically-informed alternative to the use of arbitrary expression ratios in determining significance of expression. These gene-specific ERs and their variance (gene CV were used to calculate array-based variances (array CV, which, in turn, were used to study the effects of array age, probe cDNA quantity and quality, and quality of spotted PCR products as estimates of technical variation. Cluster analysis and a Principal Component Analysis (PCA were used to reveal associations among the transcriptional "imprints" of arrays hybridized with cDNA probes derived from mRNA from N. attenuata plants variously elicited and attacked by different herbivore species and from three congeners: N. quadrivalis, N. longiflora and N. clevelandii. Additionally, the PCA

  19. Integration of gene-based markers in a pearl millet genetic map for identification of candidate genes underlying drought tolerance quantitative trait loci

    Directory of Open Access Journals (Sweden)

    Sehgal Deepmala

    2012-01-01

    Full Text Available Abstract Background Identification of genes underlying drought tolerance (DT quantitative trait loci (QTLs will facilitate understanding of molecular mechanisms of drought tolerance, and also will accelerate genetic improvement of pearl millet through marker-assisted selection. We report a map based on genes with assigned functional roles in plant adaptation to drought and other abiotic stresses and demonstrate its use in identifying candidate genes underlying a major DT-QTL. Results Seventy five single nucleotide polymorphism (SNP and conserved intron spanning primer (CISP markers were developed from available expressed sequence tags (ESTs using four genotypes, H 77/833-2, PRLT 2/89-33, ICMR 01029 and ICMR 01004, representing parents of two mapping populations. A total of 228 SNPs were obtained from 30.5 kb sequenced region resulting in a SNP frequency of 1/134 bp. The positions of major pearl millet linkage group (LG 2 DT-QTLs (reported from crosses H 77/833-2 × PRLT 2/89-33 and 841B × 863B were added to the present consensus function map which identified 18 genes, coding for PSI reaction center subunit III, PHYC, actin, alanine glyoxylate aminotransferase, uridylate kinase, acyl-CoA oxidase, dipeptidyl peptidase IV, MADS-box, serine/threonine protein kinase, ubiquitin conjugating enzyme, zinc finger C- × 8-C × 5-C × 3-H type, Hd3, acetyl CoA carboxylase, chlorophyll a/b binding protein, photolyase, protein phosphatase1 regulatory subunit SDS22 and two hypothetical proteins, co-mapping in this DT-QTL interval. Many of these candidate genes were found to have significant association with QTLs of grain yield, flowering time and leaf rolling under drought stress conditions. Conclusions We have exploited available pearl millet EST sequences to generate a mapped resource of seventy five new gene-based markers for pearl millet and demonstrated its use in identifying candidate genes underlying a major DT-QTL in this species. The reported gene

  20. Prioritization of candidate disease genes by combining topological similarity and semantic similarity.

    Science.gov (United States)

    Liu, Bin; Jin, Min; Zeng, Pan

    2015-10-01

    The identification of gene-phenotype relationships is very important for the treatment of human diseases. Studies have shown that genes causing the same or similar phenotypes tend to interact with each other in a protein-protein interaction (PPI) network. Thus, many identification methods based on the PPI network model have achieved good results. However, in the PPI network, some interactions between the proteins encoded by candidate gene and the proteins encoded by known disease genes are very weak. Therefore, some studies have combined the PPI network with other genomic information and reported good predictive performances. However, we believe that the results could be further improved. In this paper, we propose a new method that uses the semantic similarity between the candidate gene and known disease genes to set the initial probability vector of a random walk with a restart algorithm in a human PPI network. The effectiveness of our method was demonstrated by leave-one-out cross-validation, and the experimental results indicated that our method outperformed other methods. Additionally, our method can predict new causative genes of multifactor diseases, including Parkinson's disease, breast cancer and obesity. The top predictions were good and consistent with the findings in the literature, which further illustrates the effectiveness of our method. Copyright © 2015 Elsevier Inc. All rights reserved.

  1. Using microarrays to identify positional candidate genes for QTL: the case study of ACTH response in pigs

    DEFF Research Database (Denmark)

    Jouffe, Vincent; Rowe, Suzanne; Liaubet, Laurence

    2009-01-01

    this with information on published QTL. The starting point is a set of 237 differentially expressed cDNA clones in adrenal tissue from two pig breeds, before and after treatment with adrenocorticotropic hormone (ACTH) Results: Different approaches to localize the differentially expressed (DE) genes to the pig genome....... Different approaches to localize the differentially expressed (DE) genes to the pig genome showed different levels of success and a clear lack of concordance for some genes between the various approaches. For a focused analysis on 12 genes, overlapping QTL from the public domain were presented. Also...

  2. An Overview of DNA Microarray Grid Alignment and Foreground Separation Approaches

    Directory of Open Access Journals (Sweden)

    Bajcsy Peter

    2006-01-01

    Full Text Available This paper overviews DNA microarray grid alignment and foreground separation approaches. Microarray grid alignment and foreground separation are the basic processing steps of DNA microarray images that affect the quality of gene expression information, and hence impact our confidence in any data-derived biological conclusions. Thus, understanding microarray data processing steps becomes critical for performing optimal microarray data analysis. In the past, the grid alignment and foreground separation steps have not been covered extensively in the survey literature. We present several classifications of existing algorithms, and describe the fundamental principles of these algorithms. Challenges related to automation and reliability of processed image data are outlined at the end of this overview paper.

  3. Candidate genes revealed by a genome scan for mosquito resistance to a bacterial insecticide: sequence and gene expression variations

    Directory of Open Access Journals (Sweden)

    David Jean-Philippe

    2009-11-01

    Full Text Available Abstract Background Genome scans are becoming an increasingly popular approach to study the genetic basis of adaptation and speciation, but on their own, they are often helpless at identifying the specific gene(s or mutation(s targeted by selection. This shortcoming is hopefully bound to disappear in the near future, thanks to the wealth of new genomic resources that are currently being developed for many species. In this article, we provide a foretaste of this exciting new era by conducting a genome scan in the mosquito Aedes aegypti with the aim to look for candidate genes involved in resistance to Bacillus thuringiensis subsp. israelensis (Bti insecticidal toxins. Results The genome of a Bti-resistant and a Bti-susceptible strains was surveyed using about 500 MITE-based molecular markers, and the loci showing the highest inter-strain genetic differentiation were sequenced and mapped on the Aedes aegypti genome sequence. Several good candidate genes for Bti-resistance were identified in the vicinity of these highly differentiated markers. Two of them, coding for a cadherin and a leucine aminopeptidase, were further examined at the sequence and gene expression levels. In the resistant strain, the cadherin gene displayed patterns of nucleotide polymorphisms consistent with the action of positive selection (e.g. an excess of high compared to intermediate frequency mutations, as well as a significant under-expression compared to the susceptible strain. Conclusion Both sequence and gene expression analyses agree to suggest a role for positive selection in the evolution of this cadherin gene in the resistant strain. However, it is unlikely that resistance to Bti is conferred by this gene alone, and further investigation will be needed to characterize other genes significantly associated with Bti resistance in Ae. aegypti. Beyond these results, this article illustrates how genome scans can build on the body of new genomic information (here, full

  4. An Improved Fuzzy Based Missing Value Estimation in DNA Microarray Validated by Gene Ranking

    Directory of Open Access Journals (Sweden)

    Sujay Saha

    2016-01-01

    Full Text Available Most of the gene expression data analysis algorithms require the entire gene expression matrix without any missing values. Hence, it is necessary to devise methods which would impute missing data values accurately. There exist a number of imputation algorithms to estimate those missing values. This work starts with a microarray dataset containing multiple missing values. We first apply the modified version of the fuzzy theory based existing method LRFDVImpute to impute multiple missing values of time series gene expression data and then validate the result of imputation by genetic algorithm (GA based gene ranking methodology along with some regular statistical validation techniques, like RMSE method. Gene ranking, as far as our knowledge, has not been used yet to validate the result of missing value estimation. Firstly, the proposed method has been tested on the very popular Spellman dataset and results show that error margins have been drastically reduced compared to some previous works, which indirectly validates the statistical significance of the proposed method. Then it has been applied on four other 2-class benchmark datasets, like Colorectal Cancer tumours dataset (GDS4382, Breast Cancer dataset (GSE349-350, Prostate Cancer dataset, and DLBCL-FL (Leukaemia for both missing value estimation and ranking the genes, and the results show that the proposed method can reach 100% classification accuracy with very few dominant genes, which indirectly validates the biological significance of the proposed method.

  5. Genomic dissection and prioritizing of candidate genes of QTL for ...

    Indian Academy of Sciences (India)

    Genomic dissection and prioritizing of candidate genes of QTL for regulating spontaneous arthritis on chromosome 1 in mice deficient for interleukin-1 receptor antagonist. Yanhong Cao, Jifei Zhang, Yan Jiao, Jian Yan, Feng Jiao, XiaoYun Liu, Robert W. Williams, Karen A. Hasty,. John M. Stuart and Weikuan Gu. J. Genet.

  6. Extensive innate immune gene activation accompanies brain aging, increasing vulnerability to cognitive decline and neurodegeneration: a microarray study

    Science.gov (United States)

    2012-01-01

    Background This study undertakes a systematic and comprehensive analysis of brain gene expression profiles of immune/inflammation-related genes in aging and Alzheimer’s disease (AD). Methods In a well-powered microarray study of young (20 to 59 years), aged (60 to 99 years), and AD (74 to 95 years) cases, gene responses were assessed in the hippocampus, entorhinal cortex, superior frontal gyrus, and post-central gyrus. Results Several novel concepts emerge. First, immune/inflammation-related genes showed major changes in gene expression over the course of cognitively normal aging, with the extent of gene response far greater in aging than in AD. Of the 759 immune-related probesets interrogated on the microarray, approximately 40% were significantly altered in the SFG, PCG and HC with increasing age, with the majority upregulated (64 to 86%). In contrast, far fewer immune/inflammation genes were significantly changed in the transition to AD (approximately 6% of immune-related probesets), with gene responses primarily restricted to the SFG and HC. Second, relatively few significant changes in immune/inflammation genes were detected in the EC either in aging or AD, although many genes in the EC showed similar trends in responses as in the other brain regions. Third, immune/inflammation genes undergo gender-specific patterns of response in aging and AD, with the most pronounced differences emerging in aging. Finally, there was widespread upregulation of genes reflecting activation of microglia and perivascular macrophages in the aging brain, coupled with a downregulation of select factors (TOLLIP, fractalkine) that when present curtail microglial/macrophage activation. Notably, essentially all pathways of the innate immune system were upregulated in aging, including numerous complement components, genes involved in toll-like receptor signaling and inflammasome signaling, as well as genes coding for immunoglobulin (Fc) receptors and human leukocyte antigens I

  7. Extensive innate immune gene activation accompanies brain aging, increasing vulnerability to cognitive decline and neurodegeneration: a microarray study

    Directory of Open Access Journals (Sweden)

    Cribbs David H

    2012-07-01

    Full Text Available Abstract Background This study undertakes a systematic and comprehensive analysis of brain gene expression profiles of immune/inflammation-related genes in aging and Alzheimer’s disease (AD. Methods In a well-powered microarray study of young (20 to 59 years, aged (60 to 99 years, and AD (74 to 95 years cases, gene responses were assessed in the hippocampus, entorhinal cortex, superior frontal gyrus, and post-central gyrus. Results Several novel concepts emerge. First, immune/inflammation-related genes showed major changes in gene expression over the course of cognitively normal aging, with the extent of gene response far greater in aging than in AD. Of the 759 immune-related probesets interrogated on the microarray, approximately 40% were significantly altered in the SFG, PCG and HC with increasing age, with the majority upregulated (64 to 86%. In contrast, far fewer immune/inflammation genes were significantly changed in the transition to AD (approximately 6% of immune-related probesets, with gene responses primarily restricted to the SFG and HC. Second, relatively few significant changes in immune/inflammation genes were detected in the EC either in aging or AD, although many genes in the EC showed similar trends in responses as in the other brain regions. Third, immune/inflammation genes undergo gender-specific patterns of response in aging and AD, with the most pronounced differences emerging in aging. Finally, there was widespread upregulation of genes reflecting activation of microglia and perivascular macrophages in the aging brain, coupled with a downregulation of select factors (TOLLIP, fractalkine that when present curtail microglial/macrophage activation. Notably, essentially all pathways of the innate immune system were upregulated in aging, including numerous complement components, genes involved in toll-like receptor signaling and inflammasome signaling, as well as genes coding for immunoglobulin (Fc receptors and human

  8. Analyzing Multiple-Probe Microarray: Estimation and Application of Gene Expression Indexes

    KAUST Repository

    Maadooliat, Mehdi

    2012-07-26

    Gene expression index estimation is an essential step in analyzing multiple probe microarray data. Various modeling methods have been proposed in this area. Amidst all, a popular method proposed in Li and Wong (2001) is based on a multiplicative model, which is similar to the additive model discussed in Irizarry et al. (2003a) at the logarithm scale. Along this line, Hu et al. (2006) proposed data transformation to improve expression index estimation based on an ad hoc entropy criteria and naive grid search approach. In this work, we re-examined this problem using a new profile likelihood-based transformation estimation approach that is more statistically elegant and computationally efficient. We demonstrate the applicability of the proposed method using a benchmark Affymetrix U95A spiked-in experiment. Moreover, We introduced a new multivariate expression index and used the empirical study to shows its promise in terms of improving model fitting and power of detecting differential expression over the commonly used univariate expression index. As the other important content of the work, we discussed two generally encountered practical issues in application of gene expression index: normalization and summary statistic used for detecting differential expression. Our empirical study shows somewhat different findings from the MAQC project (MAQC, 2006).

  9. Genome-wide association links candidate genes to resistance to Plum Pox Virus in apricot (Prunus armeniaca).

    Science.gov (United States)

    Mariette, Stéphanie; Wong Jun Tai, Fabienne; Roch, Guillaume; Barre, Aurélien; Chague, Aurélie; Decroocq, Stéphane; Groppi, Alexis; Laizet, Yec'han; Lambert, Patrick; Tricon, David; Nikolski, Macha; Audergon, Jean-Marc; Abbott, Albert G; Decroocq, Véronique

    2016-01-01

    In fruit tree species, many important traits have been characterized genetically by using single-family descent mapping in progenies segregating for the traits. However, most mapped loci have not been sufficiently resolved to the individual genes due to insufficient progeny sizes for high resolution mapping and the previous lack of whole-genome sequence resources of the study species. To address this problem for Plum Pox Virus (PPV) candidate resistance gene identification in Prunus species, we implemented a genome-wide association (GWA) approach in apricot. This study exploited the broad genetic diversity of the apricot (Prunus armeniaca) germplasm containing resistance to PPV, next-generation sequence-based genotyping, and the high-quality peach (Prunus persica) genome reference sequence for single nucleotide polymorphism (SNP) identification. The results of this GWA study validated previously reported PPV resistance quantitative trait loci (QTL) intervals, highlighted other potential resistance loci, and resolved each to a limited set of candidate genes for further study. This work substantiates the association genetics approach for resolution of QTL to candidate genes in apricot and suggests that this approach could simplify identification of other candidate genes for other marked trait intervals in this germplasm. © 2015 INRA, UMR 1332 BFP New Phytologist © 2015 New Phytologist Trust.

  10. DNA microarray genotyping and virulence and antimicrobial resistance gene profiling of methicillin-resistant Staphylococcus aureus bloodstream isolates from renal patients.

    LENUS (Irish Health Repository)

    McNicholas, Sinead

    2012-02-01

    Thirty-six methicillin-resistant Staphylococcus aureus (MRSA) bloodstream isolates from renal patients were genetically characterized by DNA microarray analysis and spa typing. The isolates were highly clonal, belonging mainly to ST22-MRSA-IV. The immune evasion and enterotoxin gene clusters were found in 29\\/36 (80%) and 33\\/36 (92%) isolates, respectively.

  11. DNA microarray genotyping and virulence and antimicrobial resistance gene profiling of methicillin-resistant Staphylococcus aureus bloodstream isolates from renal patients.

    LENUS (Irish Health Repository)

    McNicholas, Sinead

    2011-12-01

    Thirty-six methicillin-resistant Staphylococcus aureus (MRSA) bloodstream isolates from renal patients were genetically characterized by DNA microarray analysis and spa typing. The isolates were highly clonal, belonging mainly to ST22-MRSA-IV. The immune evasion and enterotoxin gene clusters were found in 29\\/36 (80%) and 33\\/36 (92%) isolates, respectively.

  12. Mining meiosis and gametogenesis with DNA microarrays.

    Science.gov (United States)

    Schlecht, Ulrich; Primig, Michael

    2003-04-01

    Gametogenesis is a key developmental process that involves complex transcriptional regulation of numerous genes including many that are conserved between unicellular eukaryotes and mammals. Recent expression-profiling experiments using microarrays have provided insight into the co-ordinated transcription of several hundred genes during mitotic growth and meiotic development in budding and fission yeast. Furthermore, microarray-based studies have identified numerous loci that are regulated during the cell cycle or expressed in a germ-cell specific manner in eukaryotic model systems like Caenorhabditis elegans, Mus musculus as well as Homo sapiens. The unprecedented amount of information produced by post-genome biology has spawned novel approaches to organizing biological knowledge using currently available information technology. This review outlines experiments that contribute to an emerging comprehensive picture of the molecular machinery governing sexual reproduction in eukaryotes.

  13. Microarray analysis of pancreatic gene expression during biotin repletion in biotin-deficient rats.

    Science.gov (United States)

    Dakshinamurti, Krishnamurti; Bagchi, Rushita A; Abrenica, Bernard; Czubryt, Michael P

    2015-12-01

    Biotin is a B vitamin involved in multiple metabolic pathways. In humans, biotin deficiency is relatively rare but can cause dermatitis, alopecia, and perosis. Low biotin levels occur in individuals with type-2 diabetes, and supplementation with biotin plus chromium may improve blood sugar control. The acute effect on pancreatic gene expression of biotin repletion following chronic deficiency is unclear, therefore we induced biotin deficiency in adult male rats by feeding them a 20% raw egg white diet for 6 weeks. Animals were then randomized into 2 groups: one group received a single biotin supplement and returned to normal chow lacking egg white, while the second group remained on the depletion diet. After 1 week, pancreata were removed from biotin-deficient (BD) and biotin-repleted (BR) animals and RNA was isolated for microarray analysis. Biotin depletion altered gene expression in a manner indicative of inflammation, fibrosis, and defective pancreatic function. Conversely, biotin repletion activated numerous repair and anti-inflammatory pathways, reduced fibrotic gene expression, and induced multiple genes involved in pancreatic endocrine and exocrine function. A subset of the results was confirmed by quantitative real-time PCR analysis, as well as by treatment of pancreatic AR42J cells with biotin. The results indicate that biotin repletion, even after lengthy deficiency, results in the rapid induction of repair processes in the pancreas.

  14. Gene Expression Profile in the Early Stage of Angiotensin II-induced Cardiac Remodeling: a Time Series Microarray Study in a Mouse Model

    Directory of Open Access Journals (Sweden)

    Meng-Qiu Dang

    2015-01-01

    Full Text Available Background/Aims: Angiotensin II (Ang II plays a critical role in the cardiac remodeling contributing to heart failure. However, the gene expression profiles induced by Ang II in the early stage of cardiac remodeling remain unknown. Methods: Wild-type male mice (C57BL/6 background, 10-weeek-old were infused with Ang II (1500 ng/kg/min for 7 days. Blood pressure was measured. Cardiac function and remodeling were examined by echocardiography, H&E and Masson staining. The time series microarrays were then conducted to detected gene expression profiles. Results: Microarray results identified that 1,489 genes were differentially expressed in the hearts at day 1, 3 and 7 of Ang II injection. These genes were further classified into 26 profiles by hierarchical cluster analysis. Of them, 4 profiles were significant (No. 19, 8, 21 and 22 and contained 904 genes. Gene Ontology showed that these genes mainly participate in metabolic process, oxidation-reduction process, extracellular matrix organization, apoptotic process, immune response, and others. Significant pathways included focal adhesion, ECM-receptor interaction, cytokine-cytokine receptor interaction, MAPK and insulin signaling pathways, which were known to play important roles in Ang II-induced cardiac remodeling. Moreover, gene co-expression networks analysis suggested that serine/cysteine peptidase inhibitor, member 1 (Serpine1, also known as PAI-1 localized in the core of the network. Conclusions: Our results indicate that many genes are mainly involved in metabolism, inflammation, cardiac fibrosis and hypertrophy. Serpine1 may play a central role in the development of Ang II-induced cardiac remodeling at the early stage.

  15. Microarray analysis of DNA damage repair gene expression profiles in cervical cancer cells radioresistant to 252Cf neutron and X-rays

    International Nuclear Information System (INIS)

    Qing, Yi; Wang, Ge; Wang, Dong; Yang, Xue-Qin; Zhong, Zhao-Yang; Lei, Xin; Xie, Jia-Yin; Li, Meng-Xia; Xiang, De-Bing; Li, Zeng-Peng; Yang, Zhen-Zhou

    2010-01-01

    The aim of the study was to obtain stable radioresistant sub-lines from the human cervical cancer cell line HeLa by prolonged exposure to 252 Cf neutron and X-rays. Radioresistance mechanisms were investigated in the resulting cells using microarray analysis of DNA damage repair genes. HeLa cells were treated with fractionated 252 Cf neutron and X-rays, with a cumulative dose of 75 Gy each, over 8 months, yielding the sub-lines HeLaNR and HeLaXR. Radioresistant characteristics were detected by clone formation assay, ultrastructural observations, cell doubling time, cell cycle distribution, and apoptosis assay. Gene expression patterns of the radioresistant sub-lines were studied through microarray analysis and verified by Western blotting and real-time PCR. The radioresistant sub-lines HeLaNR and HeLaXR were more radioresisitant to 252 Cf neutron and X-rays than parental HeLa cells by detecting their radioresistant characteristics, respectively. Compared to HeLa cells, the expression of 24 genes was significantly altered by at least 2-fold in HeLaNR cells. Of these, 19 genes were up-regulated and 5 down-regulated. In HeLaXR cells, 41 genes were significantly altered by at least 2-fold; 38 genes were up-regulated and 3 down-regulated. Chronic exposure of cells to ionizing radiation induces adaptive responses that enhance tolerance of ionizing radiation and allow investigations of cellular radioresistance mechanisms. The insights gained into the molecular mechanisms activated by these 'radioresistance' genes will lead to new therapeutic targets for cervical cancer

  16. Genome-Wide Association Study with Sequence Variants Identifies Candidate Genes for Mastitis Resistance in Dairy Cattle

    DEFF Research Database (Denmark)

    Sahana, Goutam; Guldbrandtsen, Bernt; Bendixen, Christian

    Six genomic regions affecting clinical mastitis were identified through a GWAS study with imputed BovineHD chip genotype data in the Nordic Holstein cattle population. The association analyses were carried out using a SNP-by-SNP analysis by fitting the regression of allele dosage and a polygenic...... Effect Predictor (VEP) vers. 2.6 using ENSEMBL vers. 67 databases. Candidate polymorphisms affecting clinical mastitis were selected based on their association with the traits and functional annotations. A strong positional candidate gene for mastitis resistance on chromosome-6 is the NPFFR2 which...... Factor Receptor Alpha (LIFR) emerged as a strong candidate gene for mastitis resistance. The LIFR gene is involved in acute phase response and is expressed in saliva and mammary gland....

  17. Utilization of gene mapping and candidate gene mutation screening for diagnosing clinically equivocal conditions: a Norrie disease case study.

    Science.gov (United States)

    Chini, Vasiliki; Stambouli, Danai; Nedelea, Florina Mihaela; Filipescu, George Alexandru; Mina, Diana; Kambouris, Marios; El-Shantil, Hatem

    2014-06-01

    Prenatal diagnosis was requested for an undiagnosed eye disease showing X-linked inheritance in a family. No medical records existed for the affected family members. Mapping of the X chromosome and candidate gene mutation screening identified a c.C267A[p.F89L] mutation in NPD previously described as possibly causing Norrie disease. The detection of the c.C267A[p.F89L] variant in another unrelated family confirms the pathogenic nature of the mutation for the Norrie disease phenotype. Gene mapping, haplotype analysis, and candidate gene screening have been previously utilized in research applications but were applied here in a diagnostic setting due to the scarcity of available clinical information. The clinical diagnosis and mutation identification were critical for providing proper genetic counseling and prenatal diagnosis for this family.

  18. Validation of candidate genes associated with cardiovascular risk factors in psychiatric patients

    Science.gov (United States)

    Windemuth, Andreas; de Leon, Jose; Goethe, John W.; Schwartz, Harold I.; Woolley, Stephen; Susce, Margaret; Kocherla, Mohan; Bogaard, Kali; Holford, Theodore R.; Seip, Richard L.; Ruaño, Gualberto

    2016-01-01

    The purpose of this study was to identify genetic variants predictive of cardiovascular risk factors in a psychiatric population treated with second generation antipsychotics (SGA). 924 patients undergoing treatment for severe mental illness at four US hospitals were genotyped at 1.2 million single nucleotide polymorphisms. Patients were assessed for fasting serum lipid (low density lipoprotein cholesterol [LDLc], high density lipoprotein cholesterol [HDLc], and triglycerides) and obesity phenotypes (body mass index, BMI). Thirteen candidate genes from previous studies of the same phenotypes in non-psychiatric populations were tested for association. We confirmed 8 of the 13 candidate genes at the 95% confidence level. An increased genetic effect size was observed for triglycerides in the psychiatric population compared to that in the cardiovascular population. PMID:21851846

  19. Correction of technical bias in clinical microarray data improves concordance with known biological information

    DEFF Research Database (Denmark)

    Eklund, Aron Charles; Szallasi, Zoltan Imre

    2008-01-01

    The performance of gene expression microarrays has been well characterized using controlled reference samples, but the performance on clinical samples remains less clear. We identified sources of technical bias affecting many genes in concert, thus causing spurious correlations in clinical data...... sets and false associations between genes and clinical variables. We developed a method to correct for technical bias in clinical microarray data, which increased concordance with known biological relationships in multiple data sets....

  20. Selection and Validation of Reference Genes for qRT-PCR Expression Analysis of Candidate Genes Involved in Olfactory Communication in the Butterfly Bicyclus anynana

    OpenAIRE

    Arun, Alok; Bauml?, V?ronique; Amelot, Ga?l; Nieberding, Caroline M.

    2015-01-01

    Real-time quantitative reverse transcription PCR (qRT-PCR) is a technique widely used to quantify the transcriptional expression level of candidate genes. qRT-PCR requires the selection of one or several suitable reference genes, whose expression profiles remain stable across conditions, to normalize the qRT-PCR expression profiles of candidate genes. Although several butterfly species (Lepidoptera) have become important models in molecular evolutionary ecology, so far no study aimed at ident...

  1. EBF factors drive expression of multiple classes of target genes governing neuronal development

    Directory of Open Access Journals (Sweden)

    Vetter Monica L

    2011-04-01

    Full Text Available Abstract Background Early B cell factor (EBF family members are transcription factors known to have important roles in several aspects of vertebrate neurogenesis, including commitment, migration and differentiation. Knowledge of how EBF family members contribute to neurogenesis is limited by a lack of detailed understanding of genes that are transcriptionally regulated by these factors. Results We performed a microarray screen in Xenopus animal caps to search for targets of EBF transcriptional activity, and identified candidate targets with multiple roles, including transcription factors of several classes. We determined that, among the most upregulated candidate genes with expected neuronal functions, most require EBF activity for some or all of their expression, and most have overlapping expression with ebf genes. We also found that the candidate target genes that had the most strongly overlapping expression patterns with ebf genes were predicted to be direct transcriptional targets of EBF transcriptional activity. Conclusions The identification of candidate targets that are transcription factor genes, including nscl-1, emx1 and aml1, improves our understanding of how EBF proteins participate in the hierarchy of transcription control during neuronal development, and suggests novel mechanisms by which EBF activity promotes migration and differentiation. Other candidate targets, including pcdh8 and kcnk5, expand our knowledge of the types of terminal differentiated neuronal functions that EBF proteins regulate.

  2. A study of metaheuristic algorithms for high dimensional feature selection on microarray data

    Science.gov (United States)

    Dankolo, Muhammad Nasiru; Radzi, Nor Haizan Mohamed; Sallehuddin, Roselina; Mustaffa, Noorfa Haszlinna

    2017-11-01

    Microarray systems enable experts to examine gene profile at molecular level using machine learning algorithms. It increases the potentials of classification and diagnosis of many diseases at gene expression level. Though, numerous difficulties may affect the efficiency of machine learning algorithms which includes vast number of genes features comprised in the original data. Many of these features may be unrelated to the intended analysis. Therefore, feature selection is necessary to be performed in the data pre-processing. Many feature selection algorithms are developed and applied on microarray which including the metaheuristic optimization algorithms. This paper discusses the application of the metaheuristics algorithms for feature selection in microarray dataset. This study reveals that, the algorithms have yield an interesting result with limited resources thereby saving computational expenses of machine learning algorithms.

  3. Comparison of two DNA microarrays for detection of plasmid-mediated antimicrobial resistance and virulence factor genes in clinical isolates of Enterobacteriaceae and non-Enterobacteriaceae.

    LENUS (Irish Health Repository)

    Walsh, Fiona

    2010-06-01

    A DNA microarray was developed to detect plasmid-mediated antimicrobial resistance (AR) and virulence factor (VF) genes in clinical isolates of Enterobacteriaceae and non-Enterobacteriaceae. The array was validated with the following bacterial species: Escherichiacoli (n=17); Klebsiellapneumoniae (n=3); Enterobacter spp. (n=6); Acinetobacter genospecies 3 (n=1); Acinetobacterbaumannii (n=1); Pseudomonasaeruginosa (n=2); and Stenotrophomonasmaltophilia (n=2). The AR gene profiles of these isolates were identified by polymerase chain reaction (PCR). The DNA microarray consisted of 155 and 133 AR and VF gene probes, respectively. Results were compared with the commercially available Identibac AMR-ve Array Tube. Hybridisation results indicated that there was excellent correlation between PCR and array results for AR and VF genes. Genes conferring resistance to each antibiotic class were identified by the DNA array. Unusual resistance genes were also identified, such as bla(SHV-5) in a bla(OXA-23)-positive carbapenem-resistant A. baumannii. The phylogenetic group of each E. coli isolate was verified by the array. These data demonstrate that it is possible to screen simultaneously for all important classes of mobile AR and VF genes in Enterobacteriaceae and non-Enterobacteriaceae whilst also assigning a correct phylogenetic group to E. coli isolates. Therefore, it is feasible to test clinical Gram-negative bacteria for all known AR genes and to provide important information regarding pathogenicity simultaneously.

  4. Removing Batch Effects from Longitudinal Gene Expression - Quantile Normalization Plus ComBat as Best Approach for Microarray Transcriptome Data.

    Directory of Open Access Journals (Sweden)

    Christian Müller

    Full Text Available Technical variation plays an important role in microarray-based gene expression studies, and batch effects explain a large proportion of this noise. It is therefore mandatory to eliminate technical variation while maintaining biological variability. Several strategies have been proposed for the removal of batch effects, although they have not been evaluated in large-scale longitudinal gene expression data. In this study, we aimed at identifying a suitable method for batch effect removal in a large study of microarray-based longitudinal gene expression. Monocytic gene expression was measured in 1092 participants of the Gutenberg Health Study at baseline and 5-year follow up. Replicates of selected samples were measured at both time points to identify technical variability. Deming regression, Passing-Bablok regression, linear mixed models, non-linear models as well as ReplicateRUV and ComBat were applied to eliminate batch effects between replicates. In a second step, quantile normalization prior to batch effect correction was performed for each method. Technical variation between batches was evaluated by principal component analysis. Associations between body mass index and transcriptomes were calculated before and after batch removal. Results from association analyses were compared to evaluate maintenance of biological variability. Quantile normalization, separately performed in each batch, combined with ComBat successfully reduced batch effects and maintained biological variability. ReplicateRUV performed perfectly in the replicate data subset of the study, but failed when applied to all samples. All other methods did not substantially reduce batch effects in the replicate data subset. Quantile normalization plus ComBat appears to be a valuable approach for batch correction in longitudinal gene expression data.

  5. A cross-species genetic analysis identifies candidate genes for mouse anxiety and human bipolar disorder

    Directory of Open Access Journals (Sweden)

    David G Ashbrook

    2015-07-01

    Full Text Available Bipolar disorder (BD is a significant neuropsychiatric disorder with a lifetime prevalence of ~1%. To identify genetic variants underlying BD genome-wide association studies (GWAS have been carried out. While many variants of small effect associated with BD have been identified few have yet been confirmed, partly because of the low power of GWAS due to multiple comparisons being made. Complementary mapping studies using murine models have identified genetic variants for behavioral traits linked to BD, often with high power, but these identified regions often contain too many genes for clear identification of candidate genes. In the current study we have aligned human BD GWAS results and mouse linkage studies to help define and evaluate candidate genes linked to BD, seeking to use the power of the mouse mapping with the precision of GWAS. We use quantitative trait mapping for open field test and elevated zero maze data in the largest mammalian model system, the BXD recombinant inbred mouse population, to identify genomic regions associated with these BD-like phenotypes. We then investigate these regions in whole genome data from the Psychiatric Genomics Consortium’s bipolar disorder GWAS to identify candidate genes associated with BD. Finally we establish the biological relevance and pathways of these genes in a comprehensive systems genetics analysis.We identify four genes associated with both mouse anxiety and human BD. While TNR is a novel candidate for BD, we can confirm previously suggested associations with CMYA5, MCTP1 and RXRG. A cross-species, systems genetics analysis shows that MCTP1, RXRG and TNR coexpress with genes linked to psychiatric disorders and identify the striatum as a potential site of action. CMYA5, MCTP1, RXRG and TNR are associated with mouse anxiety and human BD. We hypothesize that MCTP1, RXRG and TNR influence intercellular signaling in the striatum.

  6. Detecting variants with Metabolic Design, a new software tool to design probes for explorative functional DNA microarray development

    Directory of Open Access Journals (Sweden)

    Gravelat Fabrice

    2010-09-01

    Full Text Available Abstract Background Microorganisms display vast diversity, and each one has its own set of genes, cell components and metabolic reactions. To assess their huge unexploited metabolic potential in different ecosystems, we need high throughput tools, such as functional microarrays, that allow the simultaneous analysis of thousands of genes. However, most classical functional microarrays use specific probes that monitor only known sequences, and so fail to cover the full microbial gene diversity present in complex environments. We have thus developed an algorithm, implemented in the user-friendly program Metabolic Design, to design efficient explorative probes. Results First we have validated our approach by studying eight enzymes involved in the degradation of polycyclic aromatic hydrocarbons from the model strain Sphingomonas paucimobilis sp. EPA505 using a designed microarray of 8,048 probes. As expected, microarray assays identified the targeted set of genes induced during biodegradation kinetics experiments with various pollutants. We have then confirmed the identity of these new genes by sequencing, and corroborated the quantitative discrimination of our microarray by quantitative real-time PCR. Finally, we have assessed metabolic capacities of microbial communities in soil contaminated with aromatic hydrocarbons. Results show that our probe design (sensitivity and explorative quality can be used to study a complex environment efficiently. Conclusions We successfully use our microarray to detect gene expression encoding enzymes involved in polycyclic aromatic hydrocarbon degradation for the model strain. In addition, DNA microarray experiments performed on soil polluted by organic pollutants without prior sequence assumptions demonstrate high specificity and sensitivity for gene detection. Metabolic Design is thus a powerful, efficient tool that can be used to design explorative probes and monitor metabolic pathways in complex environments

  7. Spot detection and image segmentation in DNA microarray data.

    Science.gov (United States)

    Qin, Li; Rueda, Luis; Ali, Adnan; Ngom, Alioune

    2005-01-01

    Following the invention of microarrays in 1994, the development and applications of this technology have grown exponentially. The numerous applications of microarray technology include clinical diagnosis and treatment, drug design and discovery, tumour detection, and environmental health research. One of the key issues in the experimental approaches utilising microarrays is to extract quantitative information from the spots, which represent genes in a given experiment. For this process, the initial stages are important and they influence future steps in the analysis. Identifying the spots and separating the background from the foreground is a fundamental problem in DNA microarray data analysis. In this review, we present an overview of state-of-the-art methods for microarray image segmentation. We discuss the foundations of the circle-shaped approach, adaptive shape segmentation, histogram-based methods and the recently introduced clustering-based techniques. We analytically show that clustering-based techniques are equivalent to the one-dimensional, standard k-means clustering algorithm that utilises the Euclidean distance.

  8. Natural Genetic Variation and Candidate Genes for Morphological Traits in Drosophila melanogaster

    Science.gov (United States)

    Carreira, Valeria Paula; Mensch, Julián; Hasson, Esteban; Fanara, Juan José

    2016-01-01

    Body size is a complex character associated to several fitness related traits that vary within and between species as a consequence of environmental and genetic factors. Latitudinal and altitudinal clines for different morphological traits have been described in several species of Drosophila and previous work identified genomic regions associated with such variation in D. melanogaster. However, the genetic factors that orchestrate morphological variation have been barely studied. Here, our main objective was to investigate genetic variation for different morphological traits associated to the second chromosome in natural populations of D. melanogaster along latitudinal and altitudinal gradients in Argentina. Our results revealed weak clinal signals and a strong population effect on morphological variation. Moreover, most pairwise comparisons between populations were significant. Our study also showed important within-population genetic variation, which must be associated to the second chromosome, as the lines are otherwise genetically identical. Next, we examined the contribution of different candidate genes to natural variation for these traits. We performed quantitative complementation tests using a battery of lines bearing mutated alleles at candidate genes located in the second chromosome and six second chromosome substitution lines derived from natural populations which exhibited divergent phenotypes. Results of complementation tests revealed that natural variation at all candidate genes studied, invected, Fasciclin 3, toucan, Reticulon-like1, jing and CG14478, affects the studied characters, suggesting that they are Quantitative Trait Genes for morphological traits. Finally, the phenotypic patterns observed suggest that different alleles of each gene might contribute to natural variation for morphological traits. However, non-additive effects cannot be ruled out, as wild-derived strains differ at myriads of second chromosome loci that may interact

  9. [Differential gene expression in incompatible interaction between Lilium regale Wilson and Fusarium oxysporum f. sp. lilii revealed by combined SSH and microarray analysis].

    Science.gov (United States)

    Rao, J; Liu, D; Zhang, N; He, H; Ge, F; Chen, C

    2014-01-01

    Fusarium wilt, caused by a soilborne pathogen Fusarium oxysporum f. sp. lilii, is the major disease of lily (Lilium L.). In order to isolate the genes differentially expressed in a resistant reaction to F. oxysporum in L. regale Wilson, a cDNA library was constructed with L. regale root during F. oxysporum infection using the suppression subtractive hybridization (SSH), and a total of 585 unique expressed sequence tags (ESTs) were obtained. Furthermore, the gene expression profiles in the incompatible interaction between L. regale and F. oxysporum were revealed by oligonucleotide microarray analysis of 585 unique ESTs comparison to the compatible interaction between a susceptible Lilium Oriental Hybrid 'Siberia' and F. oxysporum. The result of expression profile analysis indicated that the genes encoding pathogenesis-related proteins (PRs), antioxidative stress enzymes, secondary metabolism enzymes, transcription factors, signal transduction proteins as well as a large number of unknown genes were involved in early defense response of L. regale to F. oxysporum infection. Moreover, the following quantitative reverse transcription PCR (QRT-PCR) analysis confirmed reliability of the oligonucleotide microarray data. In the present study, isolation of differentially expressed genes in L. regale during response to F. oxysporum helped to uncover the molecular mechanism associated with the resistance of L. regale against F. oxysporum.

  10. Discovery of distinctive gene expression profiles in rheumatoid synovium using cDNA microarray technology: evidence for the existence of multiple pathways of tissue destruction and repair.

    NARCIS (Netherlands)

    Kraan, TC van der Pouw; Gaalen, van FA; Huizinga, T.W.; Pieterman, E; Breedveld, F.C.; Verweij, C.L.

    2003-01-01

    Rheumatoid arthritis (RA) is a heterogeneous disease. We used cDNA microarray technology to subclassify RA patients and disclose disease pathways in rheumatoid synovium. Hierarchical clustering of gene expression data identified two main groups of tissues (RA-I and RA-II). A total of 121 genes were

  11. Genetics of human longevity with emphasis on the relevance of HSP70 as candidate genes

    DEFF Research Database (Denmark)

    Singh, Ripudaman; Kølvrå, Steen; Rattan, Suresh I S

    2007-01-01

    Human longevity is determined to a certain extent by genetic factors. Several candidate genes have been studied for their association with human longevity, but the data collected so far are inconclusive. One of the reasons is the choice of the candidate genes in addition to the choice...... of an appropriate study design and methodology. Since aging is characterized by a progressive accumulation of molecular damage and an attenuation of the cellular defense mechanisms, the focus of studies on human longevity association with genes has now shifted to the pathways of cellular maintenance and repair...... mechanisms. One such pathway includes the battery of stress response genes, especially the heat shock protein HSP70 genes. Three such genes, HSPA1A, HSPA1B and HSPA1L, are present within the MHC-III region on the short arm of chromosome 6. We and others have found alleles, genotypes and haplotypes which have...

  12. DNA microarray analyses reveal a post-irradiation differential time-dependent gene expression profile in yeast cells exposed to X-rays and gamma-rays.

    Science.gov (United States)

    Kimura, Shinzo; Ishidou, Emi; Kurita, Sakiko; Suzuki, Yoshiteru; Shibato, Junko; Rakwal, Randeep; Iwahashi, Hitoshi

    2006-07-21

    Ionizing radiation (IR) is the most enigmatic of genotoxic stress inducers in our environment that has been around from the eons of time. IR is generally considered harmful, and has been the subject of numerous studies, mostly looking at the DNA damaging effects in cells and the repair mechanisms therein. Moreover, few studies have focused on large-scale identification of cellular responses to IR, and to this end, we describe here an initial study on the transcriptional responses of the unicellular genome model, yeast (Saccharomyces cerevisiae strain S288C), by cDNA microarray. The effect of two different IR, X-rays, and gamma (gamma)-rays, was investigated by irradiating the yeast cells cultured in YPD medium with 50 Gy doses of X- and gamma-rays, followed by resuspension of the cells in YPD for time-course experiments. The samples were collected for microarray analysis at 20, 40, and 80 min after irradiation. Microarray analysis revealed a time-course transcriptional profile of changed gene expressions. Up-regulated genes belonged to the functional categories mainly related to cell cycle and DNA processing, cell rescue defense and virulence, protein and cell fate, and metabolism (X- and gamma-rays). Similarly, for X- and gamma-rays, the down-regulated genes belonged to mostly transcription and protein synthesis, cell cycle and DNA processing, control of cellular organization, cell fate, and C-compound and carbohydrate metabolism categories, respectively. This study provides for the first time a snapshot of the genome-wide mRNA expression profiles in X- and gamma-ray post-irradiated yeast cells and comparatively interprets/discusses the changed gene functional categories as effects of these two radiations vis-à-vis their energy levels.

  13. Feasibility of subcutaneously implanted magnetic microarrays for site specific drug and gene targeting

    Directory of Open Access Journals (Sweden)

    M. Babincová

    2010-01-01

    Full Text Available The magnetic nanoparticles play a crucial role as a drug carriers in the human body. The wedge like magnetic arrays creatinga strongly non-homogeneous magnetic field are considered as a useful way to focus magnetic nanoparticles functionalizedwith various drugs or genes to desired sites. The goal of this study is to develop a numerical model of drug targetingusing subcutaneously implanted magnetic microarrays. The Finite Element Method is applied to solve partial differentialequations describing electromagnetic field (Maxwell equations and motion of these particles in a given magnetic field isobtained solving set of ordinary differential equations expressed by Newton law of motion. The results are encouragingshowing the potential to target drug to the tumour cell locally, without unwanted side effects.

  14. Candidate genes and pathogenesis investigation for sepsis-related acute respiratory distress syndrome based on gene expression profile.

    Science.gov (United States)

    Wang, Min; Yan, Jingjun; He, Xingxing; Zhong, Qiang; Zhan, Chengye; Li, Shusheng

    2016-04-18

    Acute respiratory distress syndrome (ARDS) is a potentially devastating form of acute inflammatory lung injury as well as a major cause of acute respiratory failure. Although researchers have made significant progresses in elucidating the pathophysiology of this complex syndrome over the years, the absence of a universal detail disease mechanism up until now has led to a series of practical problems for a definitive treatment. This study aimed to predict some genes or pathways associated with sepsis-related ARDS based on a public microarray dataset and to further explore the molecular mechanism of ARDS. A total of 122 up-regulated DEGs and 91 down-regulated differentially expressed genes (DEGs) were obtained. The up- and down-regulated DEGs were mainly involved in functions like mitotic cell cycle and pathway like cell cycle. Protein-protein interaction network of ARDS analysis revealed 20 hub genes including cyclin B1 (CCNB1), cyclin B2 (CCNB2) and topoisomerase II alpha (TOP2A). A total of seven transcription factors including forkhead box protein M1 (FOXM1) and 30 target genes were revealed in the transcription factor-target gene regulation network. Furthermore, co-cited genes including CCNB2-CCNB1 were revealed in literature mining for the relations ARDS related genes. Pathways like mitotic cell cycle were closed related with the development of ARDS. Genes including CCNB1, CCNB2 and TOP2A, as well as transcription factors like FOXM1 might be used as the novel gene therapy targets for sepsis related ARDS.

  15. Candidate genes expressed in human islets and their role in the pathogenesis of type 1 diabetes

    DEFF Research Database (Denmark)

    Storling, Joachim; Brorsson, Caroline Anna

    2013-01-01

    In type 1 diabetes (T1D), the insulin-producing β cells are destroyed by an immune-mediated process leading to complete insulin deficiency. There is a strong genetic component in T1D. Genes located in the human leukocyte antigen (HLA) region are the most important genetic determinants of disease......, but more than 40 additional loci are known to significantly affect T1D risk. Since most of the currently known genetic candidates have annotated immune cell functions, it is generally considered that most of the genetic susceptibility in T1D is caused by variation in genes affecting immune cell function....... Recent studies, however, indicate that most T1D candidate genes are expressed in human islets suggesting that the functions of the genes are not restricted to immune cells, but also play roles in the islets and possibly the β cells. Several candidates change expression levels within the islets following...

  16. Photoreceptor dysplasia (pd) in miniature schnauzer dogs: evaluation of candidate genes by molecular genetic analysis.

    Science.gov (United States)

    Zhang, Q; Baldwin, V J; Acland, G M; Parshall, C J; Haskel, J; Aguirre, G D; Ray, K

    1999-01-01

    Photoreceptor dysplasia (pd) is one of a group of at least six distinct autosomal and one X-linked retinal disorders identified in dogs which are collectively known as progressive retinal atrophy (PRA). It is an early onset retinal disease identified in miniature schnauzer dogs, and pedigree analysis and breeding studies have established autosomal recessive inheritance of the disease. Using a gene-based approach, a number of retina-expressed genes, including some members of the phototransduction pathway, have been causally implicated in retinal diseases of humans and other animals. Here we examined seven such potential candidate genes (opsin, RDS/peripherin, ROM1, rod cGMP-gated cation channel alpha-subunit, and three subunits of transducin) for their causal association with the pd locus by testing segregation of intragenic markers with the disease locus, or, in the absence of informative polymorphisms, sequencing of the coding regions of the genes. Based on these results, we have conclusively excluded four photoreceptor-specific genes as candidates for pd by linkage analysis. For three other photoreceptor-specific genes, we did not find any mutation in the coding sequences of the genes and have excluded them provisionally. Formal exclusion would require investigation of the levels of expression of the candidate genes in pd-affected dogs relative to age-matched controls. At present we are building suitable informative pedigrees for the disease locus with a sufficient number of meiosis to be useful for genomewide screening. This should identify markers linked to the disease locus and eventually permit progress toward the identification of the photoreceptor dysplasia gene and the disease-causing mutation.

  17. How the RNA isolation method can affect microRNA microarray results

    DEFF Research Database (Denmark)

    Podolska, Agnieszka; Kaczkowski, Bogumil; Litman, Thomas

    2011-01-01

    RNA microarray analysis on porcine brain tissue. One method is a phenol-guanidine isothiocyanate-based procedure that permits isolation of total RNA. The second method, miRVana™ microRNA isolation, is column based and recovers the small RNA fraction alone. We found that microarray analyses give different results...... that depend on the RNA fraction used, in particular because some microRNAs appear very sensitive to the RNA isolation method. We conclude that precautions need to be taken when comparing microarray studies based on RNA isolated with different methods.......The quality of RNA is crucial in gene expression experiments. RNA degradation interferes in the measurement of gene expression, and in this context, microRNA quantification can lead to an incorrect estimation. In the present study, two different RNA isolation methods were used to perform micro...

  18. Characterization of adjacent breast tumors using oligonucleotide microarrays

    International Nuclear Information System (INIS)

    Unger, Meredith A; Rishi, Mazhar; Clemmer, Virginia B; Hartman, Jennifer L; Keiper, Elizabeth A; Greshock, Joel D; Chodosh, Lewis A; Liebman, Michael N; Weber, Barbara L

    2001-01-01

    Current methodology often cannot distinguish second primary breast cancers from multifocal disease, a potentially important distinction for clinical management. In the present study we evaluated the use of oligonucleotide-based microarray analysis in determining the clonality of tumors by comparing gene expression profiles. Total RNA was extracted from two tumors with no apparent physical connection that were located in the right breast of an 87-year-old woman diagnosed with invasive ductal carcinoma (IDC). The RNA was hybridized to the Affymetrix Human Genome U95A Gene Chip ® (12,500 known human genes) and analyzed using the Gene Chip Analysis Suite ® 3.3 (Affymetrix, Inc, Santa Clara, CA, USA) and JMPIN ® 3.2.6 (SAS Institute, Inc, Cary, NC, USA). Gene expression profiles of tumors from five additional patients were compared in order to evaluate the heterogeneity in gene expression between tumors with similar clinical characteristics. The adjacent breast tumors had a pairwise correlation coefficient of 0.987, and were essentially indistinguishable by microarray analysis. Analysis of gene expression profiles from different individuals, however, generated a pairwise correlation coefficient of 0.710. Transcriptional profiling may be a useful diagnostic tool for determining tumor clonality and heterogeneity, and may ultimately impact on therapeutic decision making

  19. Fuzzy C-means method for clustering microarray data.

    Science.gov (United States)

    Dembélé, Doulaye; Kastner, Philippe

    2003-05-22

    Clustering analysis of data from DNA microarray hybridization studies is essential for identifying biologically relevant groups of genes. Partitional clustering methods such as K-means or self-organizing maps assign each gene to a single cluster. However, these methods do not provide information about the influence of a given gene for the overall shape of clusters. Here we apply a fuzzy partitioning method, Fuzzy C-means (FCM), to attribute cluster membership values to genes. A major problem in applying the FCM method for clustering microarray data is the choice of the fuzziness parameter m. We show that the commonly used value m = 2 is not appropriate for some data sets, and that optimal values for m vary widely from one data set to another. We propose an empirical method, based on the distribution of distances between genes in a given data set, to determine an adequate value for m. By setting threshold levels for the membership values, genes which are tigthly associated to a given cluster can be selected. Using a yeast cell cycle data set as an example, we show that this selection increases the overall biological significance of the genes within the cluster. Supplementary text and Matlab functions are available at http://www-igbmc.u-strasbg.fr/fcm/

  20. Modulation of gene expression in a human cell line caused by poliovirus, vaccinia virus and interferon

    Directory of Open Access Journals (Sweden)

    Hoddevik Gunnar

    2007-03-01

    Full Text Available Abstract Background The project was initiated to describe the response of a human embryonic fibroblast cell line to the replication of two different viruses, and, more specifically, to look for candidate genes involved in viral defense. For this purpose, the cells were synchronously infected with poliovirus in the absence or presence of interferon-alpha, or with vaccinia virus, a virus that is not inhibited by interferon. By comparing the changes in transcriptosome due to these different challenges, it should be possible to suggest genes that might be involved in defense. Results The viral titers were sufficient to yield productive infection in a majority of the cells. The cells were harvested in triplicate at various time-points, and the transcriptosome compared with mock infected cells using oligo-based, global 35 k microarrays. While there was very limited similarities in the response to the different viruses, a large proportion of the genes up-regulated by interferon-alpha were also up-regulated by poliovirus. Interferon-alpha inhibited poliovirus replication, but there were no signs of any interferons being induced by poliovirus. The observations suggest that the cells do launch an antiviral response to poliovirus in the absence of interferon. Analyses of the data led to a list of candidate antiviral genes. Functional information was limited, or absent, for most of the candidate genes. Conclusion The data are relevant for our understanding of how the cells respond to poliovirus and vaccinia virus infection. More annotations, and more microarray studies with related viruses, are required in order to narrow the list of putative defence-related genes.

  1. Gene expression patterns during the larval development of European sea bass (dicentrarchus labrax) by microarray analysis.

    Science.gov (United States)

    Darias, M J; Zambonino-Infante, J L; Hugot, K; Cahu, C L; Mazurais, D

    2008-01-01

    During the larval period, marine teleosts undergo very fast growth and dramatic changes in morphology, metabolism, and behavior to accomplish their metamorphosis into juvenile fish. Regulation of gene expression is widely thought to be a key mechanism underlying the management of the biological processes required for harmonious development over this phase of life. To provide an overall analysis of gene expression in the whole body during sea bass larval development, we monitored the expression of 6,626 distinct genes at 10 different points in time between 7 and 43 days post-hatching (dph) by using heterologous hybridization of a rainbow trout cDNA microarray. The differentially expressed genes (n = 485) could be grouped into two categories: genes that were generally up-expressed early, between 7 and 23 dph, and genes up-expressed between 25 and 43 dph. Interestingly, among the genes regulated during the larval period, those related to organogenesis, energy pathways, biosynthesis, and digestion were over-represented compared with total set of analyzed genes. We discuss the quantitative regulation of whole-body contents of these specific transcripts with regard to the ontogenesis and maturation of essential functions that take place over larval development. Our study is the first utilization of a transcriptomic approach in sea bass and reveals dynamic changes in gene expression patterns in relation to marine finfish larval development.

  2. Differential gene expression of wheat progeny with contrasting levels of transpiration efficiency.

    Science.gov (United States)

    Xue, Gang-Ping; McIntyre, C Lynne; Chapman, Scott; Bower, Neil I; Way, Heather; Reverter, Antonio; Clarke, Bryan; Shorter, Ray

    2006-08-01

    High water use efficiency or transpiration efficiency (TE) in wheat is a desirable physiological trait for increasing grain yield under water-limited environments. The identification of genes associated with this trait would facilitate the selection for genotypes with higher TE using molecular markers. We performed an expression profiling (microarray) analysis of approximately 16,000 unique wheat ESTs to identify genes that were differentially expressed between wheat progeny lines with contrasting TE levels from a cross between Quarrion (high TE) and Genaro 81 (low TE). We also conducted a second microarray analysis to identify genes responsive to drought stress in wheat leaves. Ninety-three genes that were differentially expressed between high and low TE progeny lines were identified. One fifth of these genes were markedly responsive to drought stress. Several potential growth-related regulatory genes, which were down-regulated by drought, were expressed at a higher level in the high TE lines than the low TE lines and are potentially associated with a biomass production component of the Quarrion-derived high TE trait. Eighteen of the TE differentially expressed genes were further analysed using quantitative RT-PCR on a separate set of plant samples from those used for microarray analysis. The expression levels of 11 of the 18 genes were positively correlated with the high TE trait, measured as carbon isotope discrimination (Delta(13)C). These data indicate that some of these TE differentially expressed genes are candidates for investigating processes that underlie the high TE trait or for use as expression quantitative trait loci (eQTLs) for TE.

  3. In vivo corrosion, tumor outcome, and microarray gene expression for two types of muscle-implanted tungsten alloys

    Energy Technology Data Exchange (ETDEWEB)

    Schuster, B.E. [U.S. Army Research Laboratory, Weapons and Materials Research Directorate, B434 Mulberry Road, Aberdeen Proving Ground, MD 21005-5609 (United States); Roszell, L.E. [U.S. Army Institute of Public Health, 5158 Blackhawk Road, Aberdeen Proving Ground, MD 21010‐5403 (United States); Murr, L.E.; Ramirez, D.A. [Department of Metallurgical and Materials Engineering, University of Texas, El Paso, TX 79968 (United States); Demaree, J.D. [U.S. Army Research Laboratory, Weapons and Materials Research Directorate, B434 Mulberry Road, Aberdeen Proving Ground, MD 21005-5609 (United States); Klotz, B.R. [Dynamic Science Inc., Aberdeen Proving Ground, MD 21005‐5609 (United States); Rosencrance, A.B.; Dennis, W.E. [U.S. Army Center for Environmental Health Research, Department of Chemistry, Ft. Detrick, MD 21702‐5010 (United States); Bao, W. [SAS Institute, Inc. SAS Campus Drive, Cary, NC 27513 (United States); Perkins, E.J. [U.S. Army Engineer Research and Development Center, 3909 Hall Ferry Road, Vicksburg MS 39180 (United States); Dillman, J.F. [U.S. Army Medical Research Institute of Chemical Defense, 3100 Ricketts Point Road, Aberdeen Proving Ground, MD 21010‐5400 (United States); Bannon, D.I., E-mail: desmond.bannon@us.army.mil [U.S. Army Institute of Public Health, 5158 Blackhawk Road, Aberdeen Proving Ground, MD 21010‐5403 (United States)

    2012-11-15

    Tungsten alloys are composed of tungsten microparticles embedded in a solid matrix of transition metals such as nickel, cobalt, or iron. To understand the toxicology of these alloys, male F344 rats were intramuscularly implanted with pellets of tungsten/nickel/cobalt, tungsten/nickel/iron, or pure tungsten, with tantalum pellets as a negative control. Between 6 and 12 months, aggressive rhabdomyosarcomas formed around tungsten/nickel/cobalt pellets, while those of tungsten/nickel/iron or pure tungsten did not cause cancers. Electron microscopy showed a progressive corrosion of the matrix phase of tungsten/nickel/cobalt pellets over 6 months, accompanied by high urinary concentrations of nickel and cobalt. In contrast, non-carcinogenic tungsten/nickel/iron pellets were minimally corroded and urinary metals were low; these pellets having developed a surface oxide layer in vivo that may have restricted the mobilization of carcinogenic nickel. Microarray analysis of tumors revealed large changes in gene expression compared with normal muscle, with biological processes involving the cell cycle significantly up‐regulated and those involved with muscle development and differentiation significantly down‐regulated. Top KEGG pathways disrupted were adherens junction, p53 signaling, and the cell cycle. Chromosomal enrichment analysis of genes showed a highly significant impact at cytoband 7q22 (chromosome 7) which included mouse double minute (MDM2) and cyclin‐dependant kinase (CDK4) as well as other genes associated with human sarcomas. In conclusion, the tumorigenic potential of implanted tungsten alloys is related to mobilization of carcinogenic metals nickel and cobalt from corroding pellets, while gene expression changes in the consequent tumors are similar to radiation induced animal sarcomas as well as sporadic human sarcomas. -- Highlights: ► Tungsten/nickel/cobalt, tungsten/nickel/iron, and pure tungsten were studied. ► Male Fischer rats implanted with

  4. In vivo corrosion, tumor outcome, and microarray gene expression for two types of muscle-implanted tungsten alloys

    International Nuclear Information System (INIS)

    Schuster, B.E.; Roszell, L.E.; Murr, L.E.; Ramirez, D.A.; Demaree, J.D.; Klotz, B.R.; Rosencrance, A.B.; Dennis, W.E.; Bao, W.; Perkins, E.J.; Dillman, J.F.; Bannon, D.I.

    2012-01-01

    Tungsten alloys are composed of tungsten microparticles embedded in a solid matrix of transition metals such as nickel, cobalt, or iron. To understand the toxicology of these alloys, male F344 rats were intramuscularly implanted with pellets of tungsten/nickel/cobalt, tungsten/nickel/iron, or pure tungsten, with tantalum pellets as a negative control. Between 6 and 12 months, aggressive rhabdomyosarcomas formed around tungsten/nickel/cobalt pellets, while those of tungsten/nickel/iron or pure tungsten did not cause cancers. Electron microscopy showed a progressive corrosion of the matrix phase of tungsten/nickel/cobalt pellets over 6 months, accompanied by high urinary concentrations of nickel and cobalt. In contrast, non-carcinogenic tungsten/nickel/iron pellets were minimally corroded and urinary metals were low; these pellets having developed a surface oxide layer in vivo that may have restricted the mobilization of carcinogenic nickel. Microarray analysis of tumors revealed large changes in gene expression compared with normal muscle, with biological processes involving the cell cycle significantly up‐regulated and those involved with muscle development and differentiation significantly down‐regulated. Top KEGG pathways disrupted were adherens junction, p53 signaling, and the cell cycle. Chromosomal enrichment analysis of genes showed a highly significant impact at cytoband 7q22 (chromosome 7) which included mouse double minute (MDM2) and cyclin‐dependant kinase (CDK4) as well as other genes associated with human sarcomas. In conclusion, the tumorigenic potential of implanted tungsten alloys is related to mobilization of carcinogenic metals nickel and cobalt from corroding pellets, while gene expression changes in the consequent tumors are similar to radiation induced animal sarcomas as well as sporadic human sarcomas. -- Highlights: ► Tungsten/nickel/cobalt, tungsten/nickel/iron, and pure tungsten were studied. ► Male Fischer rats implanted with

  5. Application of microarray and functional-based screening methods for the detection of antimicrobial resistance genes in the microbiomes of healthy humans.

    Directory of Open Access Journals (Sweden)

    Roderick M Card

    Full Text Available The aim of this study was to screen for the presence of antimicrobial resistance genes within the saliva and faecal microbiomes of healthy adult human volunteers from five European countries. Two non-culture based approaches were employed to obviate potential bias associated with difficult to culture members of the microbiota. In a gene target-based approach, a microarray was employed to screen for the presence of over 70 clinically important resistance genes in the saliva and faecal microbiomes. A total of 14 different resistance genes were detected encoding resistances to six antibiotic classes (aminoglycosides, β-lactams, macrolides, sulphonamides, tetracyclines and trimethoprim. The most commonly detected genes were erm(B, blaTEM, and sul2. In a functional-based approach, DNA prepared from pooled saliva samples was cloned into Escherichia coli and screened for expression of resistance to ampicillin or sulphonamide, two of the most common resistances found by array. The functional ampicillin resistance screen recovered genes encoding components of a predicted AcrRAB efflux pump. In the functional sulphonamide resistance screen, folP genes were recovered encoding mutant dihydropteroate synthase, the target of sulphonamide action. The genes recovered from the functional screens were from the chromosomes of commensal species that are opportunistically pathogenic and capable of exchanging DNA with related pathogenic species. Genes identified by microarray were not recovered in the activity-based screen, indicating that these two methods can be complementary in facilitating the identification of a range of resistance mechanisms present within the human microbiome. It also provides further evidence of the diverse reservoir of resistance mechanisms present in bacterial populations in the human gut and saliva. In future the methods described in this study can be used to monitor changes in the resistome in response to antibiotic therapy.

  6. MicroArray Facility: a laboratory information management system with extended support for Nylon based technologies

    Directory of Open Access Journals (Sweden)

    Beaudoing Emmanuel

    2006-09-01

    Full Text Available Abstract Background High throughput gene expression profiling (GEP is becoming a routine technique in life science laboratories. With experimental designs that repeatedly span thousands of genes and hundreds of samples, relying on a dedicated database infrastructure is no longer an option. GEP technology is a fast moving target, with new approaches constantly broadening the field diversity. This technology heterogeneity, compounded by the informatics complexity of GEP databases, means that software developments have so far focused on mainstream techniques, leaving less typical yet established techniques such as Nylon microarrays at best partially supported. Results MAF (MicroArray Facility is the laboratory database system we have developed for managing the design, production and hybridization of spotted microarrays. Although it can support the widely used glass microarrays and oligo-chips, MAF was designed with the specific idiosyncrasies of Nylon based microarrays in mind. Notably single channel radioactive probes, microarray stripping and reuse, vector control hybridizations and spike-in controls are all natively supported by the software suite. MicroArray Facility is MIAME supportive and dynamically provides feedback on missing annotations to help users estimate effective MIAME compliance. Genomic data such as clone identifiers and gene symbols are also directly annotated by MAF software using standard public resources. The MAGE-ML data format is implemented for full data export. Journalized database operations (audit tracking, data anonymization, material traceability and user/project level confidentiality policies are also managed by MAF. Conclusion MicroArray Facility is a complete data management system for microarray producers and end-users. Particular care has been devoted to adequately model Nylon based microarrays. The MAF system, developed and implemented in both private and academic environments, has proved a robust solution for

  7. Expression studies of the obesity candidate gene FTO in pig

    DEFF Research Database (Denmark)

    Madsen, Majbritt Busk; Birck, Malene Muusfeldt; Fredholm, Merete

    2010-01-01

    Obesity is an increasing problem worldwide and research on candidate genes in good animal models is highly needed. The pig is an excellent model as its metabolism, organ size, and eating habits resemble that of humans. The present study is focused on the characterization of the fat mass and obesity...... associated gene (FTO) in pig. This gene has recently been associated with increased body mass index in several human populations. To establish information on the expression profile of FTO in the pig we performed quantitative PCR in a panel of adult pig tissues and in tissues sampled at different...... and cerebellum). Additionally, in order to see the involvement of the FTO gene in obesity, the changes in expression level were investigated in a nutritional study in brain of Gottingen minipigs under a high cholesterol diet. Significantly higher (P

  8. Candidate gene association analyses for ketosis resistance in Holsteins.

    Science.gov (United States)

    Kroezen, V; Schenkel, F S; Miglior, F; Baes, C F; Squires, E J

    2018-06-01

    High-yielding dairy cattle are susceptible to ketosis, a metabolic disease that negatively affects the health, fertility, and milk production of the cow. Interest in breeding for more robust dairy cattle with improved resistance to disease is global; however, genetic evaluations for ketosis would benefit from the additional information provided by genetic markers. Candidate genes that are proposed to have a biological role in the pathogenesis of ketosis were investigated in silico and a custom panel of 998 putative single nucleotide polymorphism (SNP) markers was developed. The objective of this study was to test the associations of these new markers with deregressed estimated breeding values (EBV) for ketosis. A sample of 653 Canadian Holstein cows that had been previously genotyped with a medium-density SNP chip were regenotyped with the custom panel. The EBV for ketosis in first and later lactations were obtained for each animal and deregressed for use as pseudo-phenotypes for association analyses. Results of the mixed inheritance model for single SNP association analyses suggested 15 markers in 6 unique candidate genes were associated with the studied trait. Genes encoding proteins involved in metabolic processes, including the synthesis and degradation of fatty acids and ketone bodies, gluconeogenesis, lipid mobilization, and the citric acid cycle, were identified to contain SNP associated with ketosis resistance. This work confirmed the presence of previously described quantitative trait loci for dairy cattle, suggested novel markers for ketosis-resistance, and provided insight into the underlying biology of this disease. Copyright © 2018 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  9. Evaluation of candidate reference genes for gene expression normalization in Brassica juncea using real time quantitative RT-PCR.

    Directory of Open Access Journals (Sweden)

    Ruby Chandna

    Full Text Available The real time quantitative reverse transcription PCR (qRT-PCR is becoming increasingly important to gain insight into function of genes. Given the increased sensitivity, ease and reproducibility of qRT-PCR, the requirement of suitable reference genes for normalization has become important and stringent. It is now known that the expression of internal control genes in living organism vary considerably during developmental stages and under different experimental conditions. For economically important Brassica crops, only a couple of reference genes are reported till date. In this study, expression stability of 12 candidate reference genes including ACT2, ELFA, GAPDH, TUA, UBQ9 (traditional housekeeping genes, ACP, CAC, SNF, TIPS-41, TMD, TSB and ZNF (new candidate reference genes, in a diverse set of 49 tissue samples representing different developmental stages, stress and hormone treated conditions and cultivars of Brassica juncea has been validated. For the normalization of vegetative stages the ELFA, ACT2, CAC and TIPS-41 combination would be appropriate whereas TIPS-41 along with CAC would be suitable for normalization of reproductive stages. A combination of GAPDH, TUA, TIPS-41 and CAC were identified as the most suitable reference genes for total developmental stages. In various stress and hormone treated samples, UBQ9 and TIPS-41 had the most stable expression. Across five cultivars of B. juncea, the expression of CAC and TIPS-41 did not vary significantly and were identified as the most stably expressed reference genes. This study provides comprehensive information that the new reference genes selected herein performed better than the traditional housekeeping genes. The selection of most suitable reference genes depends on the experimental conditions, and is tissue and cultivar-specific. Further, to attain accuracy in the results more than one reference genes are necessary for normalization.

  10. Characterization of Gene Candidates for Vacuolar Sodium Transport from Hordeum Vulgare

    KAUST Repository

    Scheu, Arne Hagen August

    2017-01-01

    Various potential causes are discussed, including inaccuracies in the genome resource used as reference for primer design and issues inherent to the model system. Finally, I make suggestions on how to proceed to further characterize the candidate genes and hopefully identify novel sodium transporters from barley.

  11. Epidermal growth factor gene is a newly identified candidate gene for gout.

    Science.gov (United States)

    Han, Lin; Cao, Chunwei; Jia, Zhaotong; Liu, Shiguo; Liu, Zhen; Xin, Ruosai; Wang, Can; Li, Xinde; Ren, Wei; Wang, Xuefeng; Li, Changgui

    2016-08-10

    Chromosome 4q25 has been identified as a genomic region associated with gout. However, the associations of gout with the genes in this region have not yet been confirmed. Here, we performed two-stage analysis to determine whether variations in candidate genes in the 4q25 region are associated with gout in a male Chinese Han population. We first evaluated 96 tag single nucleotide polymorphisms (SNPs) in eight inflammatory/immune pathway- or glucose/lipid metabolism-related genes in the 4q25 region in 480 male gout patients and 480 controls. The SNP rs12504538, located in the elongation of very-long-chain-fatty-acid-like family member 6 gene (Elovl6), was found to be associated with gout susceptibility (Padjusted = 0.00595). In the second stage of analysis, we performed fine mapping analysis of 93 tag SNPs in Elovl6 and in the epidermal growth factor gene (EGF) and its flanking regions in 1017 male patients gout and 1897 healthy male controls. We observed a significant association between the T allele of EGF rs2298999 and gout (odds ratio = 0.77, 95% confidence interval = 0.67-0.88, Padjusted = 6.42 × 10(-3)). These results provide the first evidence for an association between the EGF rs2298999 C/T polymorphism and gout. Our findings should be validated in additional populations.

  12. Transcriptome Analysis of Early Responsive Genes in Rice during Magnaporthe oryzae Infection

    Directory of Open Access Journals (Sweden)

    Yiming Wang

    2014-12-01

    Full Text Available Rice blast disease caused by Magnaporthe oryzae is one of the most serious diseases of cultivated rice (Oryza sativa L. in most rice-growing regions of the world. In order to investigate early response genes in rice, we utilized the transcriptome analysis approach using a 300 K tilling microarray to rice leaves infected with compatible and incompatible M. oryzae strains. Prior to the microarray experiment, total RNA was validated by measuring the differential expression of rice defense-related marker genes (chitinase 2, barwin, PBZ1, and PR-10 by RT-PCR, and phytoalexins (sakuranetin and momilactone A with HPLC. Microarray analysis revealed that 231 genes were up-regulated (>2 fold change, p < 0.05 in the incompatible interaction compared to the compatible one. Highly expressed genes were functionally characterized into metabolic processes and oxidation-reduction categories. The oxidative stress response was induced in both early and later infection stages. Biotic stress overview from MapMan analysis revealed that the phytohormone ethylene as well as signaling molecules jasmonic acid and salicylic acid is important for defense gene regulation. WRKY and Myb transcription factors were also involved in signal transduction processes. Additionally, receptor-like kinases were more likely associated with the defense response, and their expression patterns were validated by RT-PCR. Our results suggest that candidate genes, including receptor-like protein kinases, may play a key role in disease resistance against M. oryzae attack.

  13. Cross-species hybridization of woodchuck hepatitis virus-induced hepatocellular carcinoma using human oligonucleotide microarrays

    Institute of Scientific and Technical Information of China (English)

    Paul W Anderson; Bud C Tennant; Zhenghong Lee

    2006-01-01

    AIM: To demonstrate the feasibility of using woodchuck samples on human microarrays, to provide insight into pathways involving positron emission tomography (PET) imaging tracers and to identify genes that could be potential molecular imaging targets for woodchuck hepatocellular carcinoma.METHODS: Labeled cRNA from woodchuck tissue samples were hybridized to Affymetrix U133 plus 2.0 GeneChips(R). Ten genes were selected for validation using quantitative RT-PCR and literature review was made.RESULTS: Testis enhanced gene transcript (BAX Inhibitor 1), alpha-fetoprotein, isocitrate dehydrogenase 3 (NAD+) beta, acetyl-CoA synthetase 2, carnitine palmitoyltransferase 2, and N-myc2 were up-regulated and spermidine/spermine N1-acetyltransferase was down-regulated in the woodchuck HCC. We also found previously published results supporting 8 of the 10 most up-regulated genes and all 10 of the 10 most downregulated genes.CONCLUSION: Many of our microarray results were validated using RT-PCR or literature search. Hence, we believe that woodchuck HCC and non-cancerous liver samples can be used on human microarrays to yield meaningful results.

  14. Factorial microarray analysis of zebra mussel (Dreissena polymorpha: Dreissenidae, Bivalvia adhesion

    Directory of Open Access Journals (Sweden)

    Faisal Mohamed

    2010-05-01

    Full Text Available Abstract Background The zebra mussel (Dreissena polymorpha has been well known for its expertise in attaching to substances under the water. Studies in past decades on this underwater adhesion focused on the adhesive protein isolated from the byssogenesis apparatus of the zebra mussel. However, the mechanism of the initiation, maintenance, and determination of the attachment process remains largely unknown. Results In this study, we used a zebra mussel cDNA microarray previously developed in our lab and a factorial analysis to identify the genes that were involved in response to the changes of four factors: temperature (Factor A, current velocity (Factor B, dissolved oxygen (Factor C, and byssogenesis status (Factor D. Twenty probes in the microarray were found to be modified by one of the factors. The transcription products of four selected genes, DPFP-BG20_A01, EGP-BG97/192_B06, EGP-BG13_G05, and NH-BG17_C09 were unique to the zebra mussel foot based on the results of quantitative reverse transcription PCR (qRT-PCR. The expression profiles of these four genes under the attachment and non-attachment were also confirmed by qRT-PCR and the result is accordant to that from microarray assay. The in situ hybridization with the RNA probes of two identified genes DPFP-BG20_A01 and EGP-BG97/192_B06 indicated that both of them were expressed by a type of exocrine gland cell located in the middle part of the zebra mussel foot. Conclusions The results of this study suggested that the changes of D. polymorpha byssogenesis status and the environmental factors can dramatically affect the expression profiles of the genes unique to the foot. It turns out that the factorial design and analysis of the microarray experiment is a reliable method to identify the influence of multiple factors on the expression profiles of the probesets in the microarray; therein it provides a powerful tool to reveal the mechanism of zebra mussel underwater attachment.

  15. Factorial microarray analysis of zebra mussel (Dreissena polymorpha: Dreissenidae, Bivalvia) adhesion.

    Science.gov (United States)

    Xu, Wei; Faisal, Mohamed

    2010-05-28

    The zebra mussel (Dreissena polymorpha) has been well known for its expertise in attaching to substances under the water. Studies in past decades on this underwater adhesion focused on the adhesive protein isolated from the byssogenesis apparatus of the zebra mussel. However, the mechanism of the initiation, maintenance, and determination of the attachment process remains largely unknown. In this study, we used a zebra mussel cDNA microarray previously developed in our lab and a factorial analysis to identify the genes that were involved in response to the changes of four factors: temperature (Factor A), current velocity (Factor B), dissolved oxygen (Factor C), and byssogenesis status (Factor D). Twenty probes in the microarray were found to be modified by one of the factors. The transcription products of four selected genes, DPFP-BG20_A01, EGP-BG97/192_B06, EGP-BG13_G05, and NH-BG17_C09 were unique to the zebra mussel foot based on the results of quantitative reverse transcription PCR (qRT-PCR). The expression profiles of these four genes under the attachment and non-attachment were also confirmed by qRT-PCR and the result is accordant to that from microarray assay. The in situ hybridization with the RNA probes of two identified genes DPFP-BG20_A01 and EGP-BG97/192_B06 indicated that both of them were expressed by a type of exocrine gland cell located in the middle part of the zebra mussel foot. The results of this study suggested that the changes of D. polymorpha byssogenesis status and the environmental factors can dramatically affect the expression profiles of the genes unique to the foot. It turns out that the factorial design and analysis of the microarray experiment is a reliable method to identify the influence of multiple factors on the expression profiles of the probesets in the microarray; therein it provides a powerful tool to reveal the mechanism of zebra mussel underwater attachment.

  16. Comparing transformation methods for DNA microarray data

    NARCIS (Netherlands)

    Thygesen, Helene H.; Zwinderman, Aeilko H.

    2004-01-01

    Background: When DNA microarray data are used for gene clustering, genotype/phenotype correlation studies, or tissue classification the signal intensities are usually transformed and normalized in several steps in order to improve comparability and signal/noise ratio. These steps may include

  17. Rapid identification of carbapenemase genes in gram-negative bacteria with an oligonucleotide microarray-based assay.

    Directory of Open Access Journals (Sweden)

    Sascha D Braun

    Full Text Available Rapid molecular identification of carbapenemase genes in Gram-negative bacteria is crucial for infection control and prevention, surveillance and for epidemiological purposes. Furthermore, it may have a significant impact upon determining the appropriate initial treatment and greatly benefit for critically ill patients. A novel oligonucleotide microarray-based assay was developed to simultaneously detect genes encoding clinically important carbapenemases as well as selected extended (ESBL and narrow spectrum (NSBL beta-lactamases directly from clonal culture material within few hours. Additionally, a panel of species specific markers was included to identify Escherichia coli, Pseudomonas aeruginosa, Citrobacter freundii/braakii, Klebsiella pneumoniae and Acinetobacter baumannii. The assay was tested using a panel of 117 isolates collected from urinary, blood and stool samples. For these isolates, phenotypic identifications and susceptibility tests were available. An independent detection of carbapenemase, ESBL and NSBL genes was carried out by various external reference laboratories using PCR methods. In direct comparison, the microarray correctly identified 98.2% of the covered carbapenemase genes. This included blaVIM (13 out of 13, blaGIM (2/2, blaKPC (27/27, blaNDM (5/5, blaIMP-2/4/7/8/13/14/15/16/31 (10/10, blaOXA-23 (12/13, blaOXA-40-group (7/7, blaOXA-48-group (32/33, blaOXA-51 (1/1 and blaOXA-58 (1/1. Furthermore, the test correctly identified additional beta-lactamases [blaOXA-1 (16/16, blaOXA-2 (4/4, blaOXA-9 (33/33, OXA-10 (3/3, blaOXA-51 (25/25, blaOXA-58 (2/2, CTX-M1/M15 (17/17 and blaVIM (1/1]. In direct comparison to phenotypical identification obtained by VITEK or MALDI-TOF systems, 114 of 117 (97.4% isolates, including Acinetobacter baumannii (28/28, Enterobacter spec. (5/5, Escherichia coli (4/4, Klebsiella pneumoniae (62/63, Klebsiella oxytoca (0/2, Pseudomonas aeruginosa (12/12, Citrobacter freundii (1/1 and Citrobacter

  18. Classification of Non-Small Cell Lung Cancer Using Significance Analysis of Microarray-Gene Set Reduction Algorithm

    Directory of Open Access Journals (Sweden)

    Lei Zhang

    2016-01-01

    Full Text Available Among non-small cell lung cancer (NSCLC, adenocarcinoma (AC, and squamous cell carcinoma (SCC are two major histology subtypes, accounting for roughly 40% and 30% of all lung cancer cases, respectively. Since AC and SCC differ in their cell of origin, location within the lung, and growth pattern, they are considered as distinct diseases. Gene expression signatures have been demonstrated to be an effective tool for distinguishing AC and SCC. Gene set analysis is regarded as irrelevant to the identification of gene expression signatures. Nevertheless, we found that one specific gene set analysis method, significance analysis of microarray-gene set reduction (SAMGSR, can be adopted directly to select relevant features and to construct gene expression signatures. In this study, we applied SAMGSR to a NSCLC gene expression dataset. When compared with several novel feature selection algorithms, for example, LASSO, SAMGSR has equivalent or better performance in terms of predictive ability and model parsimony. Therefore, SAMGSR is a feature selection algorithm, indeed. Additionally, we applied SAMGSR to AC and SCC subtypes separately to discriminate their respective stages, that is, stage II versus stage I. Few overlaps between these two resulting gene signatures illustrate that AC and SCC are technically distinct diseases. Therefore, stratified analyses on subtypes are recommended when diagnostic or prognostic signatures of these two NSCLC subtypes are constructed.

  19. Microarrays for global expression constructed with a low redundancy set of 27,500 sequenced cDNAs representing an array of developmental stages and physiological conditions of the soybean plant

    Directory of Open Access Journals (Sweden)

    Retzel Ernest

    2004-09-01

    stages and physiological conditions of the soybean plant. We also demonstrate that the quality of the data from the soybean cDNA microarrays is sufficiently reliable to examine isogenic lines that differ with respect to a mutant phenotype and thereby to define a small list of candidate genes potentially encoding or modulated by the mutant phenotype.

  20. Microarray analysis of androgen-regulated gene expression in testis: the use of the androgen-binding protein (ABP-transgenic mouse as a model

    Directory of Open Access Journals (Sweden)

    Grossman Gail

    2005-12-01

    Full Text Available Abstract Background Spermatogenesis is an androgen-dependent process, yet the molecular mechanisms of androgens' actions in testis are poorly understood. Transgenic mice overexpressing rat androgen-binding protein (ABP in their testes have reduced levels of intratesticular androgens and, as a result, show a progressive impairment of spermatogenesis. We used this model to characterize changes in global gene expression in testis in response to reduced bioavailability of androgens. Methods Total RNA was extracted from testes of 30-day old transgenic and wild-type control mice, converted to cRNA, labeled with biotin, and hybridized to oligonucleotide microarrays. Microarray results were confirmed by real-time reverse transcription polymerase chain reaction. Results Three-hundred-eighty-one genes (3.05% of all transcripts represented on the chips were up-regulated and 198 genes (1.59% were down-regulated by at least a factor of 2 in the androgen-deficient animals compared to controls. Genes encoding membrane proteins, intracellular signaling molecules, enzymes, proteins participating in the immune response, and those involved in cytoskeleton organization were significantly overrepresented in the up-regulated group. Among the down-regulated transcripts, those coding for extracellular proteins were overrepresented most dramatically, followed by those related to proteolysis, cell adhesion, immune response, and growth factor, cytokine, and ion channel activities. Transcripts with the greatest potential impact on cellular activities included several transcription factors, intracellular signal transducers, secreted signaling molecules and enzymes, and various cell surface molecules. Major nodes in the up-regulated network were IL-6, AGT, MYC, and A2M, those in the down-regulated network were IL-2, -4, and -10, MAPK8, SOCS1, and CREB1. Conclusion Microarray analysis followed by gene ontology profiling and connectivity analysis identified several functional