WorldWideScience

Sample records for microarray based expression

  1. A Fisheye Viewer for microarray-based gene expression data.

    Science.gov (United States)

    Wu, Min; Thao, Cheng; Mu, Xiangming; Munson, Ethan V

    2006-10-13

    Microarray has been widely used to measure the relative amounts of every mRNA transcript from the genome in a single scan. Biologists have been accustomed to reading their experimental data directly from tables. However, microarray data are quite large and are stored in a series of files in a machine-readable format, so direct reading of the full data set is not feasible. The challenge is to design a user interface that allows biologists to usefully view large tables of raw microarray-based gene expression data. This paper presents one such interface--an electronic table (E-table) that uses fisheye distortion technology. The Fisheye Viewer for microarray-based gene expression data has been successfully developed to view MIAME data stored in the MAGE-ML format. The viewer can be downloaded from the project web site http://polaris.imt.uwm.edu:7777/fisheye/. The fisheye viewer was implemented in Java so that it could run on multiple platforms. We implemented the E-table by adapting JTable, a default table implementation in the Java Swing user interface library. Fisheye views use variable magnification to balance magnification for easy viewing and compression for maximizing the amount of data on the screen. This Fisheye Viewer is a lightweight but useful tool for biologists to quickly overview the raw microarray-based gene expression data in an E-table.

  2. A fisheye viewer for microarray-based gene expression data

    Directory of Open Access Journals (Sweden)

    Munson Ethan V

    2006-10-01

    Full Text Available Abstract Background Microarray has been widely used to measure the relative amounts of every mRNA transcript from the genome in a single scan. Biologists have been accustomed to reading their experimental data directly from tables. However, microarray data are quite large and are stored in a series of files in a machine-readable format, so direct reading of the full data set is not feasible. The challenge is to design a user interface that allows biologists to usefully view large tables of raw microarray-based gene expression data. This paper presents one such interface – an electronic table (E-table that uses fisheye distortion technology. Results The Fisheye Viewer for microarray-based gene expression data has been successfully developed to view MIAME data stored in the MAGE-ML format. The viewer can be downloaded from the project web site http://polaris.imt.uwm.edu:7777/fisheye/. The fisheye viewer was implemented in Java so that it could run on multiple platforms. We implemented the E-table by adapting JTable, a default table implementation in the Java Swing user interface library. Fisheye views use variable magnification to balance magnification for easy viewing and compression for maximizing the amount of data on the screen. Conclusion This Fisheye Viewer is a lightweight but useful tool for biologists to quickly overview the raw microarray-based gene expression data in an E-table.

  3. Density based pruning for identification of differentially expressed genes from microarray data

    Directory of Open Access Journals (Sweden)

    Xu Jia

    2010-11-01

    Full Text Available Abstract Motivation Identification of differentially expressed genes from microarray datasets is one of the most important analyses for microarray data mining. Popular algorithms such as statistical t-test rank genes based on a single statistics. The false positive rate of these methods can be improved by considering other features of differentially expressed genes. Results We proposed a pattern recognition strategy for identifying differentially expressed genes. Genes are mapped to a two dimension feature space composed of average difference of gene expression and average expression levels. A density based pruning algorithm (DB Pruning is developed to screen out potential differentially expressed genes usually located in the sparse boundary region. Biases of popular algorithms for identifying differentially expressed genes are visually characterized. Experiments on 17 datasets from Gene Omnibus Database (GEO with experimentally verified differentially expressed genes showed that DB pruning can significantly improve the prediction accuracy of popular identification algorithms such as t-test, rank product, and fold change. Conclusions Density based pruning of non-differentially expressed genes is an effective method for enhancing statistical testing based algorithms for identifying differentially expressed genes. It improves t-test, rank product, and fold change by 11% to 50% in the numbers of identified true differentially expressed genes. The source code of DB pruning is freely available on our website http://mleg.cse.sc.edu/degprune

  4. The Arabidopsis co-expression tool (act): a WWW-based tool and database for microarray-based gene expression analysis

    DEFF Research Database (Denmark)

    Jen, C. H.; Manfield, I. W.; Michalopoulos, D. W.

    2006-01-01

    be examined using the novel clique finder tool to determine the sets of genes most likely to be regulated in a similar manner. In combination, these tools offer three levels of analysis: creation of correlation lists of co-expressed genes, refinement of these lists using two-dimensional scatter plots......We present a new WWW-based tool for plant gene analysis, the Arabidopsis Co-Expression Tool (act) , based on a large Arabidopsis thaliana microarray data set obtained from the Nottingham Arabidopsis Stock Centre. The co-expression analysis tool allows users to identify genes whose expression...

  5. A regression-based differential expression detection algorithm for microarray studies with ultra-low sample size.

    Directory of Open Access Journals (Sweden)

    Daniel Vasiliu

    Full Text Available Global gene expression analysis using microarrays and, more recently, RNA-seq, has allowed investigators to understand biological processes at a system level. However, the identification of differentially expressed genes in experiments with small sample size, high dimensionality, and high variance remains challenging, limiting the usability of these tens of thousands of publicly available, and possibly many more unpublished, gene expression datasets. We propose a novel variable selection algorithm for ultra-low-n microarray studies using generalized linear model-based variable selection with a penalized binomial regression algorithm called penalized Euclidean distance (PED. Our method uses PED to build a classifier on the experimental data to rank genes by importance. In place of cross-validation, which is required by most similar methods but not reliable for experiments with small sample size, we use a simulation-based approach to additively build a list of differentially expressed genes from the rank-ordered list. Our simulation-based approach maintains a low false discovery rate while maximizing the number of differentially expressed genes identified, a feature critical for downstream pathway analysis. We apply our method to microarray data from an experiment perturbing the Notch signaling pathway in Xenopus laevis embryos. This dataset was chosen because it showed very little differential expression according to limma, a powerful and widely-used method for microarray analysis. Our method was able to detect a significant number of differentially expressed genes in this dataset and suggest future directions for investigation. Our method is easily adaptable for analysis of data from RNA-seq and other global expression experiments with low sample size and high dimensionality.

  6. Cross-platform analysis of cancer microarray data improves gene expression based classification of phenotypes

    Directory of Open Access Journals (Sweden)

    Eils Roland

    2005-11-01

    Full Text Available Abstract Background The extensive use of DNA microarray technology in the characterization of the cell transcriptome is leading to an ever increasing amount of microarray data from cancer studies. Although similar questions for the same type of cancer are addressed in these different studies, a comparative analysis of their results is hampered by the use of heterogeneous microarray platforms and analysis methods. Results In contrast to a meta-analysis approach where results of different studies are combined on an interpretative level, we investigate here how to directly integrate raw microarray data from different studies for the purpose of supervised classification analysis. We use median rank scores and quantile discretization to derive numerically comparable measures of gene expression from different platforms. These transformed data are then used for training of classifiers based on support vector machines. We apply this approach to six publicly available cancer microarray gene expression data sets, which consist of three pairs of studies, each examining the same type of cancer, i.e. breast cancer, prostate cancer or acute myeloid leukemia. For each pair, one study was performed by means of cDNA microarrays and the other by means of oligonucleotide microarrays. In each pair, high classification accuracies (> 85% were achieved with training and testing on data instances randomly chosen from both data sets in a cross-validation analysis. To exemplify the potential of this cross-platform classification analysis, we use two leukemia microarray data sets to show that important genes with regard to the biology of leukemia are selected in an integrated analysis, which are missed in either single-set analysis. Conclusion Cross-platform classification of multiple cancer microarray data sets yields discriminative gene expression signatures that are found and validated on a large number of microarray samples, generated by different laboratories and

  7. Emerging use of gene expression microarrays in plant physiology.

    Science.gov (United States)

    Wullschleger, Stan D; Difazio, Stephen P

    2003-01-01

    Microarrays have become an important technology for the global analysis of gene expression in humans, animals, plants, and microbes. Implemented in the context of a well-designed experiment, cDNA and oligonucleotide arrays can provide highthroughput, simultaneous analysis of transcript abundance for hundreds, if not thousands, of genes. However, despite widespread acceptance, the use of microarrays as a tool to better understand processes of interest to the plant physiologist is still being explored. To help illustrate current uses of microarrays in the plant sciences, several case studies that we believe demonstrate the emerging application of gene expression arrays in plant physiology were selected from among the many posters and presentations at the 2003 Plant and Animal Genome XI Conference. Based on this survey, microarrays are being used to assess gene expression in plants exposed to the experimental manipulation of air temperature, soil water content and aluminium concentration in the root zone. Analysis often includes characterizing transcript profiles for multiple post-treatment sampling periods and categorizing genes with common patterns of response using hierarchical clustering techniques. In addition, microarrays are also providing insights into developmental changes in gene expression associated with fibre and root elongation in cotton and maize, respectively. Technical and analytical limitations of microarrays are discussed and projects attempting to advance areas of microarray design and data analysis are highlighted. Finally, although much work remains, we conclude that microarrays are a valuable tool for the plant physiologist interested in the characterization and identification of individual genes and gene families with potential application in the fields of agriculture, horticulture and forestry.

  8. Microarray expression profiling of human dental pulp from single subject.

    Science.gov (United States)

    Tete, Stefano; Mastrangelo, Filiberto; Scioletti, Anna Paola; Tranasi, Michelangelo; Raicu, Florina; Paolantonio, Michele; Stuppia, Liborio; Vinci, Raffaele; Gherlone, Enrico; Ciampoli, Cristian; Sberna, Maria Teresa; Conti, Pio

    2008-01-01

    Microarray is a recently developed simultaneous analysis of expression patterns of thousand of genes. The aim of this research was to evaluate the expression profile of human healthy dental pulp in order to find the presence of genes activated and encoding for proteins involved in the physiological process of human dental pulp. We report data obtained by analyzing expression profiles of human tooth pulp from single subjects, using an approach based on the amplification of the total RNA. Experiments were performed on a high-density array able to analyse about 21,000 oligonucleotide sequences of about 70 bases in duplicate, using an approach based on the amplification of the total RNA from the pulp of a single tooth. Obtained data were analyzed using the S.A.M. system (Significance Analysis of Microarray) and genes were merged according to their molecular functions and biological process by the Onto-Express software. The microarray analysis revealed 362 genes with specific pulp expression. Genes showing significant high expression were classified in genes involved in tooth development, protoncogenes, genes of collagen, DNAse, Metallopeptidases and Growth factors. We report a microarray analysis, carried out by extraction of total RNA from specimens of healthy human dental pulp tissue. This approach represents a powerful tool in the study of human normal and pathological pulp, allowing minimization of the genetic variability due to the pooling of samples from different individuals.

  9. Emerging Use of Gene Expression Microarrays in Plant Physiology

    Directory of Open Access Journals (Sweden)

    Stephen P. Difazio

    2006-04-01

    Full Text Available Microarrays have become an important technology for the global analysis of gene expression in humans, animals, plants, and microbes. Implemented in the context of a well-designed experiment, cDNA and oligonucleotide arrays can provide highthroughput, simultaneous analysis of transcript abundance for hundreds, if not thousands, of genes. However, despite widespread acceptance, the use of microarrays as a tool to better understand processes of interest to the plant physiologist is still being explored. To help illustrate current uses of microarrays in the plant sciences, several case studies that we believe demonstrate the emerging application of gene expression arrays in plant physiology were selected from among the many posters and presentations at the 2003 Plant and Animal Genome XI Conference. Based on this survey, microarrays are being used to assess gene expression in plants exposed to the experimental manipulation of air temperature, soil water content and aluminium concentration in the root zone. Analysis often includes characterizing transcript profiles for multiple post-treatment sampling periods and categorizing genes with common patterns of response using hierarchical clustering techniques. In addition, microarrays are also providing insights into developmental changes in gene expression associated with fibre and root elongation in cotton and maize, respectively. Technical and analytical limitations of microarrays are discussed and projects attempting to advance areas of microarray design and data analysis are highlighted. Finally, although much work remains, we conclude that microarrays are a valuable tool for the plant physiologist interested in the characterization and identification of individual genes and gene families with potential application in the fields of agriculture, horticulture and forestry.

  10. Training ANFIS structure using genetic algorithm for liver cancer classification based on microarray gene expression data

    Directory of Open Access Journals (Sweden)

    Bülent Haznedar

    2017-02-01

    Full Text Available Classification is an important data mining technique, which is used in many fields mostly exemplified as medicine, genetics and biomedical engineering. The number of studies about classification of the datum on DNA microarray gene expression is specifically increased in recent years. However, because of the reasons as the abundance of gene numbers in the datum as microarray gene expressions and the nonlinear relations mostly across those datum, the success of conventional classification algorithms can be limited. Because of these reasons, the interest on classification methods which are based on artificial intelligence to solve the problem on classification has been gradually increased in recent times. In this study, a hybrid approach which is based on Adaptive Neuro-Fuzzy Inference System (ANFIS and Genetic Algorithm (GA are suggested in order to classify liver microarray cancer data set. Simulation results are compared with the results of other methods. According to the results obtained, it is seen that the recommended method is better than the other methods.

  11. Quantitative miRNA expression analysis: comparing microarrays with next-generation sequencing

    DEFF Research Database (Denmark)

    Willenbrock, Hanni; Salomon, Jesper; Søkilde, Rolf

    2009-01-01

    Recently, next-generation sequencing has been introduced as a promising, new platform for assessing the copy number of transcripts, while the existing microarray technology is considered less reliable for absolute, quantitative expression measurements. Nonetheless, so far, results from the two...... technologies have only been compared based on biological data, leading to the conclusion that, although they are somewhat correlated, expression values differ significantly. Here, we use synthetic RNA samples, resembling human microRNA samples, to find that microarray expression measures actually correlate...... better with sample RNA content than expression measures obtained from sequencing data. In addition, microarrays appear highly sensitive and perform equivalently to next-generation sequencing in terms of reproducibility and relative ratio quantification....

  12. Classification across gene expression microarray studies

    Directory of Open Access Journals (Sweden)

    Kuner Ruprecht

    2009-12-01

    Full Text Available Abstract Background The increasing number of gene expression microarray studies represents an important resource in biomedical research. As a result, gene expression based diagnosis has entered clinical practice for patient stratification in breast cancer. However, the integration and combined analysis of microarray studies remains still a challenge. We assessed the potential benefit of data integration on the classification accuracy and systematically evaluated the generalization performance of selected methods on four breast cancer studies comprising almost 1000 independent samples. To this end, we introduced an evaluation framework which aims to establish good statistical practice and a graphical way to monitor differences. The classification goal was to correctly predict estrogen receptor status (negative/positive and histological grade (low/high of each tumor sample in an independent study which was not used for the training. For the classification we chose support vector machines (SVM, predictive analysis of microarrays (PAM, random forest (RF and k-top scoring pairs (kTSP. Guided by considerations relevant for classification across studies we developed a generalization of kTSP which we evaluated in addition. Our derived version (DV aims to improve the robustness of the intrinsic invariance of kTSP with respect to technologies and preprocessing. Results For each individual study the generalization error was benchmarked via complete cross-validation and was found to be similar for all classification methods. The misclassification rates were substantially higher in classification across studies, when each single study was used as an independent test set while all remaining studies were combined for the training of the classifier. However, with increasing number of independent microarray studies used in the training, the overall classification performance improved. DV performed better than the average and showed slightly less variance. In

  13. Comparison of RNA-seq and microarray-based models for clinical endpoint prediction.

    Science.gov (United States)

    Zhang, Wenqian; Yu, Ying; Hertwig, Falk; Thierry-Mieg, Jean; Zhang, Wenwei; Thierry-Mieg, Danielle; Wang, Jian; Furlanello, Cesare; Devanarayan, Viswanath; Cheng, Jie; Deng, Youping; Hero, Barbara; Hong, Huixiao; Jia, Meiwen; Li, Li; Lin, Simon M; Nikolsky, Yuri; Oberthuer, André; Qing, Tao; Su, Zhenqiang; Volland, Ruth; Wang, Charles; Wang, May D; Ai, Junmei; Albanese, Davide; Asgharzadeh, Shahab; Avigad, Smadar; Bao, Wenjun; Bessarabova, Marina; Brilliant, Murray H; Brors, Benedikt; Chierici, Marco; Chu, Tzu-Ming; Zhang, Jibin; Grundy, Richard G; He, Min Max; Hebbring, Scott; Kaufman, Howard L; Lababidi, Samir; Lancashire, Lee J; Li, Yan; Lu, Xin X; Luo, Heng; Ma, Xiwen; Ning, Baitang; Noguera, Rosa; Peifer, Martin; Phan, John H; Roels, Frederik; Rosswog, Carolina; Shao, Susan; Shen, Jie; Theissen, Jessica; Tonini, Gian Paolo; Vandesompele, Jo; Wu, Po-Yen; Xiao, Wenzhong; Xu, Joshua; Xu, Weihong; Xuan, Jiekun; Yang, Yong; Ye, Zhan; Dong, Zirui; Zhang, Ke K; Yin, Ye; Zhao, Chen; Zheng, Yuanting; Wolfinger, Russell D; Shi, Tieliu; Malkas, Linda H; Berthold, Frank; Wang, Jun; Tong, Weida; Shi, Leming; Peng, Zhiyu; Fischer, Matthias

    2015-06-25

    Gene expression profiling is being widely applied in cancer research to identify biomarkers for clinical endpoint prediction. Since RNA-seq provides a powerful tool for transcriptome-based applications beyond the limitations of microarrays, we sought to systematically evaluate the performance of RNA-seq-based and microarray-based classifiers in this MAQC-III/SEQC study for clinical endpoint prediction using neuroblastoma as a model. We generate gene expression profiles from 498 primary neuroblastomas using both RNA-seq and 44 k microarrays. Characterization of the neuroblastoma transcriptome by RNA-seq reveals that more than 48,000 genes and 200,000 transcripts are being expressed in this malignancy. We also find that RNA-seq provides much more detailed information on specific transcript expression patterns in clinico-genetic neuroblastoma subgroups than microarrays. To systematically compare the power of RNA-seq and microarray-based models in predicting clinical endpoints, we divide the cohort randomly into training and validation sets and develop 360 predictive models on six clinical endpoints of varying predictability. Evaluation of factors potentially affecting model performances reveals that prediction accuracies are most strongly influenced by the nature of the clinical endpoint, whereas technological platforms (RNA-seq vs. microarrays), RNA-seq data analysis pipelines, and feature levels (gene vs. transcript vs. exon-junction level) do not significantly affect performances of the models. We demonstrate that RNA-seq outperforms microarrays in determining the transcriptomic characteristics of cancer, while RNA-seq and microarray-based models perform similarly in clinical endpoint prediction. Our findings may be valuable to guide future studies on the development of gene expression-based predictive models and their implementation in clinical practice.

  14. Integrated olfactory receptor and microarray gene expression databases

    Directory of Open Access Journals (Sweden)

    Crasto Chiquito J

    2007-06-01

    Full Text Available Abstract Background Gene expression patterns of olfactory receptors (ORs are an important component of the signal encoding mechanism in the olfactory system since they determine the interactions between odorant ligands and sensory neurons. We have developed the Olfactory Receptor Microarray Database (ORMD to house OR gene expression data. ORMD is integrated with the Olfactory Receptor Database (ORDB, which is a key repository of OR gene information. Both databases aim to aid experimental research related to olfaction. Description ORMD is a Web-accessible database that provides a secure data repository for OR microarray experiments. It contains both publicly available and private data; accessing the latter requires authenticated login. The ORMD is designed to allow users to not only deposit gene expression data but also manage their projects/experiments. For example, contributors can choose whether to make their datasets public. For each experiment, users can download the raw data files and view and export the gene expression data. For each OR gene being probed in a microarray experiment, a hyperlink to that gene in ORDB provides access to genomic and proteomic information related to the corresponding olfactory receptor. Individual ORs archived in ORDB are also linked to ORMD, allowing users access to the related microarray gene expression data. Conclusion ORMD serves as a data repository and project management system. It facilitates the study of microarray experiments of gene expression in the olfactory system. In conjunction with ORDB, ORMD integrates gene expression data with the genomic and functional data of ORs, and is thus a useful resource for both olfactory researchers and the public.

  15. Assessing Bacterial Interactions Using Carbohydrate-Based Microarrays

    Directory of Open Access Journals (Sweden)

    Andrea Flannery

    2015-12-01

    Full Text Available Carbohydrates play a crucial role in host-microorganism interactions and many host glycoconjugates are receptors or co-receptors for microbial binding. Host glycosylation varies with species and location in the body, and this contributes to species specificity and tropism of commensal and pathogenic bacteria. Additionally, bacterial glycosylation is often the first bacterial molecular species encountered and responded to by the host system. Accordingly, characterising and identifying the exact structures involved in these critical interactions is an important priority in deciphering microbial pathogenesis. Carbohydrate-based microarray platforms have been an underused tool for screening bacterial interactions with specific carbohydrate structures, but they are growing in popularity in recent years. In this review, we discuss carbohydrate-based microarrays that have been profiled with whole bacteria, recombinantly expressed adhesins or serum antibodies. Three main types of carbohydrate-based microarray platform are considered; (i conventional carbohydrate or glycan microarrays; (ii whole mucin microarrays; and (iii microarrays constructed from bacterial polysaccharides or their components. Determining the nature of the interactions between bacteria and host can help clarify the molecular mechanisms of carbohydrate-mediated interactions in microbial pathogenesis, infectious disease and host immune response and may lead to new strategies to boost therapeutic treatments.

  16. Single-cell multiple gene expression analysis based on single-molecule-detection microarray assay for multi-DNA determination

    Energy Technology Data Exchange (ETDEWEB)

    Li, Lu [School of Chemistry and Chemical Engineering, Shandong University, Jinan 250100 (China); Wang, Xianwei [School of Life Sciences, Shandong University, Jinan 250100 (China); Zhang, Xiaoli [School of Chemistry and Chemical Engineering, Shandong University, Jinan 250100 (China); Wang, Jinxing [School of Life Sciences, Shandong University, Jinan 250100 (China); Jin, Wenrui, E-mail: jwr@sdu.edu.cn [School of Chemistry and Chemical Engineering, Shandong University, Jinan 250100 (China)

    2015-01-07

    Highlights: • A single-molecule-detection (SMD) microarray for 10 samples is fabricated. • The based-SMD microarray assay (SMA) can determine 8 DNAs for each sample. • The limit of detection of SMA is as low as 1.3 × 10{sup −16} mol L{sup −1}. • The SMA can be applied in single-cell multiple gene expression analysis. - Abstract: We report a novel ultra-sensitive and high-selective single-molecule-detection microarray assay (SMA) for multiple DNA determination. In the SMA, a capture DNA (DNAc) microarray consisting of 10 subarrays with 9 spots for each subarray is fabricated on a silanized glass coverslip as the substrate. On the subarrays, the spot-to-spot spacing is 500 μm and each spot has a diameter of ∼300 μm. The sequence of the DNAcs on the 9 spots of a subarray is different, to determine 8 types of target DNAs (DNAts). Thus, 8 types of DNAts are captured to their complementary DNAcs at 8 spots of a subarray, respectively, and then labeled with quantum dots (QDs) attached to 8 types of detection DNAs (DNAds) with different sequences. The ninth spot is used to detect the blank value. In order to determine the same 8 types of DNAts in 10 samples, the 10 DNAc-modified subarrays on the microarray are identical. Fluorescence single-molecule images of the QD-labeled DNAts on each spot of the subarray are acquired using a home-made single-molecule microarray reader. The amounts of the DNAts are quantified by counting the bright dots from the QDs. For a microarray, 8 types of DNAts in 10 samples can be quantified in parallel. The limit of detection of the SMA for DNA determination is as low as 1.3 × 10{sup −16} mol L{sup −1}. The SMA for multi-DNA determination can also be applied in single-cell multiple gene expression analysis through quantification of complementary DNAs (cDNAs) corresponding to multiple messenger RNAs (mRNAs) in single cells. To do so, total RNA in single cells is extracted and reversely transcribed into their cDNAs. Three

  17. Microarray-based analysis of differential gene expression between infective and noninfective larvae of Strongyloides stercoralis.

    Directory of Open Access Journals (Sweden)

    Roshan Ramanathan

    2011-05-01

    Full Text Available Differences between noninfective first-stage (L1 and infective third-stage (L3i larvae of parasitic nematode Strongyloides stercoralis at the molecular level are relatively uncharacterized. DNA microarrays were developed and utilized for this purpose.Oligonucleotide hybridization probes for the array were designed to bind 3,571 putative mRNA transcripts predicted by analysis of 11,335 expressed sequence tags (ESTs obtained as part of the Nematode EST project. RNA obtained from S. stercoralis L3i and L1 was co-hybridized to each array after labeling the individual samples with different fluorescent tags. Bioinformatic predictions of gene function were developed using a novel cDNA Annotation System software. We identified 935 differentially expressed genes (469 L3i-biased; 466 L1-biased having two-fold expression differences or greater and microarray signals with a p value<0.01. Based on a functional analysis, L1 larvae have a larger number of genes putatively involved in transcription (p = 0.004, and L3i larvae have biased expression of putative heat shock proteins (such as hsp-90. Genes with products known to be immunoreactive in S. stercoralis-infected humans (such as SsIR and NIE had L3i biased expression. Abundantly expressed L3i contigs of interest included S. stercoralis orthologs of cytochrome oxidase ucr 2.1 and hsp-90, which may be potential chemotherapeutic targets. The S. stercoralis ortholog of fatty acid and retinol binding protein-1, successfully used in a vaccine against Ancylostoma ceylanicum, was identified among the 25 most highly expressed L3i genes. The sperm-containing glycoprotein domain, utilized in a vaccine against the nematode Cooperia punctata, was exclusively found in L3i biased genes and may be a valuable S. stercoralis target of interest.A new DNA microarray tool for the examination of S. stercoralis biology has been developed and provides new and valuable insights regarding differences between infective and

  18. Customized oligonucleotide microarray gene expression-based classification of neuroblastoma patients outperforms current clinical risk stratification.

    Science.gov (United States)

    Oberthuer, André; Berthold, Frank; Warnat, Patrick; Hero, Barbara; Kahlert, Yvonne; Spitz, Rüdiger; Ernestus, Karen; König, Rainer; Haas, Stefan; Eils, Roland; Schwab, Manfred; Brors, Benedikt; Westermann, Frank; Fischer, Matthias

    2006-11-01

    To develop a gene expression-based classifier for neuroblastoma patients that reliably predicts courses of the disease. Two hundred fifty-one neuroblastoma specimens were analyzed using a customized oligonucleotide microarray comprising 10,163 probes for transcripts with differential expression in clinical subgroups of the disease. Subsequently, the prediction analysis for microarrays (PAM) was applied to a first set of patients with maximally divergent clinical courses (n = 77). The classification accuracy was estimated by a complete 10-times-repeated 10-fold cross validation, and a 144-gene predictor was constructed from this set. This classifier's predictive power was evaluated in an independent second set (n = 174) by comparing results of the gene expression-based classification with those of risk stratification systems of current trials from Germany, Japan, and the United States. The first set of patients was accurately predicted by PAM (cross-validated accuracy, 99%). Within the second set, the PAM classifier significantly separated cohorts with distinct courses (3-year event-free survival [EFS] 0.86 +/- 0.03 [favorable; n = 115] v 0.52 +/- 0.07 [unfavorable; n = 59] and 3-year overall survival 0.99 +/- 0.01 v 0.84 +/- 0.05; both P model, the PAM predictor classified patients of the second set more accurately than risk stratification of current trials from Germany, Japan, and the United States (P < .001; hazard ratio, 4.756 [95% CI, 2.544 to 8.893]). Integration of gene expression-based class prediction of neuroblastoma patients may improve risk estimation of current neuroblastoma trials.

  19. Cell-Based Microarrays for In Vitro Toxicology

    Science.gov (United States)

    Wegener, Joachim

    2015-07-01

    DNA/RNA and protein microarrays have proven their outstanding bioanalytical performance throughout the past decades, given the unprecedented level of parallelization by which molecular recognition assays can be performed and analyzed. Cell microarrays (CMAs) make use of similar construction principles. They are applied to profile a given cell population with respect to the expression of specific molecular markers and also to measure functional cell responses to drugs and chemicals. This review focuses on the use of cell-based microarrays for assessing the cytotoxicity of drugs, toxins, or chemicals in general. It also summarizes CMA construction principles with respect to the cell types that are used for such microarrays, the readout parameters to assess toxicity, and the various formats that have been established and applied. The review ends with a critical comparison of CMAs and well-established microtiter plate (MTP) approaches.

  20. Computational biology of genome expression and regulation--a review of microarray bioinformatics.

    Science.gov (United States)

    Wang, Junbai

    2008-01-01

    Microarray technology is being used widely in various biomedical research areas; the corresponding microarray data analysis is an essential step toward the best utilizing of array technologies. Here we review two components of the microarray data analysis: a low level of microarray data analysis that emphasizes the designing, the quality control, and the preprocessing of microarray experiments, then a high level of microarray data analysis that focuses on the domain-specific microarray applications such as tumor classification, biomarker prediction, analyzing array CGH experiments, and reverse engineering of gene expression networks. Additionally, we will review the recent development of building a predictive model in genome expression and regulation studies. This review may help biologists grasp a basic knowledge of microarray bioinformatics as well as its potential impact on the future evolvement of biomedical research fields.

  1. Array2BIO: from microarray expression data to functional annotation of co-regulated genes

    Directory of Open Access Journals (Sweden)

    Rasley Amy

    2006-06-01

    Full Text Available Abstract Background There are several isolated tools for partial analysis of microarray expression data. To provide an integrative, easy-to-use and automated toolkit for the analysis of Affymetrix microarray expression data we have developed Array2BIO, an application that couples several analytical methods into a single web based utility. Results Array2BIO converts raw intensities into probe expression values, automatically maps those to genes, and subsequently identifies groups of co-expressed genes using two complementary approaches: (1 comparative analysis of signal versus control and (2 clustering analysis of gene expression across different conditions. The identified genes are assigned to functional categories based on Gene Ontology classification and KEGG protein interaction pathways. Array2BIO reliably handles low-expressor genes and provides a set of statistical methods for quantifying expression levels, including Benjamini-Hochberg and Bonferroni multiple testing corrections. An automated interface with the ECR Browser provides evolutionary conservation analysis for the identified gene loci while the interconnection with Crème allows prediction of gene regulatory elements that underlie observed expression patterns. Conclusion We have developed Array2BIO – a web based tool for rapid comprehensive analysis of Affymetrix microarray expression data, which also allows users to link expression data to Dcode.org comparative genomics tools and integrates a system for translating co-expression data into mechanisms of gene co-regulation. Array2BIO is publicly available at http://array2bio.dcode.org.

  2. Microarray-based method for the parallel analysis of genotypes and expression profiles of wood-forming tissues in Eucalyptus grandis

    CSIR Research Space (South Africa)

    Barros, E

    2009-05-01

    Full Text Available of Eucalyptus grandis planting stock that exhibit preferred wood qualities is thus a priority of the South African forestry industry. The researchers used microarray-based DNA-amplified fragment length polymorphism (AFLP) analysis in combination with expression...

  3. The application of DNA microarrays in gene expression analysis

    NARCIS (Netherlands)

    Hal, van N.L.W.; Vorst, O.; Houwelingen, van A.M.M.L.; Kok, E.J.; Peijnenburg, A.A.C.M.; Aharoni, A.; Tunen, van A.J.; Keijer, J.

    2000-01-01

    DNA microarray technology is a new and powerful technology that will substantially increase the speed of molecular biological research. This paper gives a survey of DNA microarray technology and its use in gene expression studies. The technical aspects and their potential improvements are discussed.

  4. Quantitative multiplex quantum dot in-situ hybridisation based gene expression profiling in tissue microarrays identifies prognostic genes in acute myeloid leukaemia

    Energy Technology Data Exchange (ETDEWEB)

    Tholouli, Eleni [Department of Haematology, Manchester Royal Infirmary, Oxford Road, Manchester, M13 9WL (United Kingdom); MacDermott, Sarah [The Medical School, The University of Manchester, Oxford Road, M13 9PT Manchester (United Kingdom); Hoyland, Judith [School of Biomedicine, Faculty of Medical and Human Sciences, The University of Manchester, Oxford Road, M13 9PT Manchester (United Kingdom); Yin, John Liu [Department of Haematology, Manchester Royal Infirmary, Oxford Road, Manchester, M13 9WL (United Kingdom); Byers, Richard, E-mail: richard.byers@cmft.nhs.uk [School of Cancer and Enabling Sciences, Faculty of Medical and Human Sciences, The University of Manchester, Stopford Building, Oxford Road, M13 9PT Manchester (United Kingdom)

    2012-08-24

    Highlights: Black-Right-Pointing-Pointer Development of a quantitative high throughput in situ expression profiling method. Black-Right-Pointing-Pointer Application to a tissue microarray of 242 AML bone marrow samples. Black-Right-Pointing-Pointer Identification of HOXA4, HOXA9, Meis1 and DNMT3A as prognostic markers in AML. -- Abstract: Measurement and validation of microarray gene signatures in routine clinical samples is problematic and a rate limiting step in translational research. In order to facilitate measurement of microarray identified gene signatures in routine clinical tissue a novel method combining quantum dot based oligonucleotide in situ hybridisation (QD-ISH) and post-hybridisation spectral image analysis was used for multiplex in-situ transcript detection in archival bone marrow trephine samples from patients with acute myeloid leukaemia (AML). Tissue-microarrays were prepared into which white cell pellets were spiked as a standard. Tissue microarrays were made using routinely processed bone marrow trephines from 242 patients with AML. QD-ISH was performed for six candidate prognostic genes using triplex QD-ISH for DNMT1, DNMT3A, DNMT3B, and for HOXA4, HOXA9, Meis1. Scrambled oligonucleotides were used to correct for background staining followed by normalisation of expression against the expression values for the white cell pellet standard. Survival analysis demonstrated that low expression of HOXA4 was associated with poorer overall survival (p = 0.009), whilst high expression of HOXA9 (p < 0.0001), Meis1 (p = 0.005) and DNMT3A (p = 0.04) were associated with early treatment failure. These results demonstrate application of a standardised, quantitative multiplex QD-ISH method for identification of prognostic markers in formalin-fixed paraffin-embedded clinical samples, facilitating measurement of gene expression signatures in routine clinical samples.

  5. Washing scaling of GeneChip microarray expression

    Directory of Open Access Journals (Sweden)

    Krohn Knut

    2010-05-01

    Full Text Available Abstract Background Post-hybridization washing is an essential part of microarray experiments. Both the quality of the experimental washing protocol and adequate consideration of washing in intensity calibration ultimately affect the quality of the expression estimates extracted from the microarray intensities. Results We conducted experiments on GeneChip microarrays with altered protocols for washing, scanning and staining to study the probe-level intensity changes as a function of the number of washing cycles. For calibration and analysis of the intensity data we make use of the 'hook' method which allows intensity contributions due to non-specific and specific hybridization of perfect match (PM and mismatch (MM probes to be disentangled in a sequence specific manner. On average, washing according to the standard protocol removes about 90% of the non-specific background and about 30-50% and less than 10% of the specific targets from the MM and PM, respectively. Analysis of the washing kinetics shows that the signal-to-noise ratio doubles roughly every ten stringent washing cycles. Washing can be characterized by time-dependent rate constants which reflect the heterogeneous character of target binding to microarray probes. We propose an empirical washing function which estimates the survival of probe bound targets. It depends on the intensity contribution due to specific and non-specific hybridization per probe which can be estimated for each probe using existing methods. The washing function allows probe intensities to be calibrated for the effect of washing. On a relative scale, proper calibration for washing markedly increases expression measures, especially in the limit of small and large values. Conclusions Washing is among the factors which potentially distort expression measures. The proposed first-order correction method allows direct implementation in existing calibration algorithms for microarray data. We provide an experimental

  6. Microarray analysis of gene expression profiles in ripening pineapple fruits.

    Science.gov (United States)

    Koia, Jonni H; Moyle, Richard L; Botella, Jose R

    2012-12-18

    Pineapple (Ananas comosus) is a tropical fruit crop of significant commercial importance. Although the physiological changes that occur during pineapple fruit development have been well characterized, little is known about the molecular events that occur during the fruit ripening process. Understanding the molecular basis of pineapple fruit ripening will aid the development of new varieties via molecular breeding or genetic modification. In this study we developed a 9277 element pineapple microarray and used it to profile gene expression changes that occur during pineapple fruit ripening. Microarray analyses identified 271 unique cDNAs differentially expressed at least 1.5-fold between the mature green and mature yellow stages of pineapple fruit ripening. Among these 271 sequences, 184 share significant homology with genes encoding proteins of known function, 53 share homology with genes encoding proteins of unknown function and 34 share no significant homology with any database accession. Of the 237 pineapple sequences with homologs, 160 were up-regulated and 77 were down-regulated during pineapple fruit ripening. DAVID Functional Annotation Cluster (FAC) analysis of all 237 sequences with homologs revealed confident enrichment scores for redox activity, organic acid metabolism, metalloenzyme activity, glycolysis, vitamin C biosynthesis, antioxidant activity and cysteine peptidase activity, indicating the functional significance and importance of these processes and pathways during pineapple fruit development. Quantitative real-time PCR analysis validated the microarray expression results for nine out of ten genes tested. This is the first report of a microarray based gene expression study undertaken in pineapple. Our bioinformatic analyses of the transcript profiles have identified a number of genes, processes and pathways with putative involvement in the pineapple fruit ripening process. This study extends our knowledge of the molecular basis of pineapple fruit

  7. Sensitivity and fidelity of DNA microarray improved with integration of Amplified Differential Gene Expression (ADGE

    Directory of Open Access Journals (Sweden)

    Ile Kristina E

    2003-07-01

    Full Text Available Abstract Background The ADGE technique is a method designed to magnify the ratios of gene expression before detection. It improves the detection sensitivity to small change of gene expression and requires small amount of starting material. However, the throughput of ADGE is low. We integrated ADGE with DNA microarray (ADGE microarray and compared it with regular microarray. Results When ADGE was integrated with DNA microarray, a quantitative relationship of a power function between detected and input ratios was found. Because of ratio magnification, ADGE microarray was better able to detect small changes in gene expression in a drug resistant model cell line system. The PCR amplification of templates and efficient labeling reduced the requirement of starting material to as little as 125 ng of total RNA for one slide hybridization and enhanced the signal intensity. Integration of ratio magnification, template amplification and efficient labeling in ADGE microarray reduced artifacts in microarray data and improved detection fidelity. The results of ADGE microarray were less variable and more reproducible than those of regular microarray. A gene expression profile generated with ADGE microarray characterized the drug resistant phenotype, particularly with reference to glutathione, proliferation and kinase pathways. Conclusion ADGE microarray magnified the ratios of differential gene expression in a power function, improved the detection sensitivity and fidelity and reduced the requirement for starting material while maintaining high throughput. ADGE microarray generated a more informative expression pattern than regular microarray.

  8. Integrating Biological Perspectives:. a Quantum Leap for Microarray Expression Analysis

    Science.gov (United States)

    Wanke, Dierk; Kilian, Joachim; Bloss, Ulrich; Mangelsen, Elke; Supper, Jochen; Harter, Klaus; Berendzen, Kenneth W.

    2009-02-01

    Biologists and bioinformatic scientists cope with the analysis of transcript abundance and the extraction of meaningful information from microarray expression data. By exploiting biological information accessible in public databases, we try to extend our current knowledge over the plant model organism Arabidopsis thaliana. Here, we give two examples of increasing the quality of information gained from large scale expression experiments by the integration of microarray-unrelated biological information: First, we utilize Arabidopsis microarray data to demonstrate that expression profiles are usually conserved between orthologous genes of different organisms. In an initial step of the analysis, orthology has to be inferred unambiguously, which then allows comparison of expression profiles between orthologs. We make use of the publicly available microarray expression data of Arabidopsis and barley, Hordeum vulgare. We found a generally positive correlation in expression trajectories between true orthologs although both organisms are only distantly related in evolutionary time scale. Second, extracting clusters of co-regulated genes implies similarities in transcriptional regulation via similar cis-regulatory elements (CREs). Vice versa approaches, where co-regulated gene clusters are found by investigating on CREs were not successful in general. Nonetheless, in some cases the presence of CREs in a defined position, orientation or CRE-combinations is positively correlated with co-regulated gene clusters. Here, we make use of genes involved in the phenylpropanoid biosynthetic pathway, to give one positive example for this approach.

  9. Xylella fastidiosa gene expression analysis by DNA microarrays

    OpenAIRE

    Travensolo,Regiane F.; Carareto-Alves,Lucia M.; Costa,Maria V.C.G.; Lopes,Tiago J.S.; Carrilho,Emanuel; Lemos,Eliana G.M.

    2009-01-01

    Xylella fastidiosa genome sequencing has generated valuable data by identifying genes acting either on metabolic pathways or in associated pathogenicity and virulence. Based on available information on these genes, new strategies for studying their expression patterns, such as microarray technology, were employed. A total of 2,600 primer pairs were synthesized and then used to generate fragments using the PCR technique. The arrays were hybridized against cDNAs labeled during reverse transcrip...

  10. A Java-based tool for the design of classification microarrays.

    Science.gov (United States)

    Meng, Da; Broschat, Shira L; Call, Douglas R

    2008-08-04

    Classification microarrays are used for purposes such as identifying strains of bacteria and determining genetic relationships to understand the epidemiology of an infectious disease. For these cases, mixed microarrays, which are composed of DNA from more than one organism, are more effective than conventional microarrays composed of DNA from a single organism. Selection of probes is a key factor in designing successful mixed microarrays because redundant sequences are inefficient and limited representation of diversity can restrict application of the microarray. We have developed a Java-based software tool, called PLASMID, for use in selecting the minimum set of probe sequences needed to classify different groups of plasmids or bacteria. The software program was successfully applied to several different sets of data. The utility of PLASMID was illustrated using existing mixed-plasmid microarray data as well as data from a virtual mixed-genome microarray constructed from different strains of Streptococcus. Moreover, use of data from expression microarray experiments demonstrated the generality of PLASMID. In this paper we describe a new software tool for selecting a set of probes for a classification microarray. While the tool was developed for the design of mixed microarrays-and mixed-plasmid microarrays in particular-it can also be used to design expression arrays. The user can choose from several clustering methods (including hierarchical, non-hierarchical, and a model-based genetic algorithm), several probe ranking methods, and several different display methods. A novel approach is used for probe redundancy reduction, and probe selection is accomplished via stepwise discriminant analysis. Data can be entered in different formats (including Excel and comma-delimited text), and dendrogram, heat map, and scatter plot images can be saved in several different formats (including jpeg and tiff). Weights generated using stepwise discriminant analysis can be stored for

  11. Gene Expression and Microarray Investigation of Dendrobium ...

    African Journals Online (AJOL)

    blood glucose > 16.7 mmol/L were used as the model group and treated with Dendrobium mixture. (DEN ... Keywords: Diabetes, Gene expression, Dendrobium mixture, Microarray testing ..... homeostasis in airway smooth muscle. Am J.

  12. Development, characterization and experimental validation of a cultivated sunflower (Helianthus annuus L. gene expression oligonucleotide microarray.

    Directory of Open Access Journals (Sweden)

    Paula Fernandez

    Full Text Available Oligonucleotide-based microarrays with accurate gene coverage represent a key strategy for transcriptional studies in orphan species such as sunflower, H. annuus L., which lacks full genome sequences. The goal of this study was the development and functional annotation of a comprehensive sunflower unigene collection and the design and validation of a custom sunflower oligonucleotide-based microarray. A large scale EST (>130,000 ESTs curation, assembly and sequence annotation was performed using Blast2GO (www.blast2go.de. The EST assembly comprises 41,013 putative transcripts (12,924 contigs and 28,089 singletons. The resulting Sunflower Unigen Resource (SUR version 1.0 was used to design an oligonucleotide-based Agilent microarray for cultivated sunflower. This microarray includes a total of 42,326 features: 1,417 Agilent controls, 74 control probes for sunflower replicated 10 times (740 controls and 40,169 different non-control probes. Microarray performance was validated using a model experiment examining the induction of senescence by water deficit. Pre-processing and differential expression analysis of Agilent microarrays was performed using the Bioconductor limma package. The analyses based on p-values calculated by eBayes (p<0.01 allowed the detection of 558 differentially expressed genes between water stress and control conditions; from these, ten genes were further validated by qPCR. Over-represented ontologies were identified using FatiScan in the Babelomics suite. This work generated a curated and trustable sunflower unigene collection, and a custom, validated sunflower oligonucleotide-based microarray using Agilent technology. Both the curated unigene collection and the validated oligonucleotide microarray provide key resources for sunflower genome analysis, transcriptional studies, and molecular breeding for crop improvement.

  13. The application of DNA microarrays in gene expression analysis.

    Science.gov (United States)

    van Hal, N L; Vorst, O; van Houwelingen, A M; Kok, E J; Peijnenburg, A; Aharoni, A; van Tunen, A J; Keijer, J

    2000-03-31

    DNA microarray technology is a new and powerful technology that will substantially increase the speed of molecular biological research. This paper gives a survey of DNA microarray technology and its use in gene expression studies. The technical aspects and their potential improvements are discussed. These comprise array manufacturing and design, array hybridisation, scanning, and data handling. Furthermore, it is discussed how DNA microarrays can be applied in the working fields of: safety, functionality and health of food and gene discovery and pathway engineering in plants.

  14. A Java-based tool for the design of classification microarrays

    Directory of Open Access Journals (Sweden)

    Broschat Shira L

    2008-08-01

    Full Text Available Abstract Background Classification microarrays are used for purposes such as identifying strains of bacteria and determining genetic relationships to understand the epidemiology of an infectious disease. For these cases, mixed microarrays, which are composed of DNA from more than one organism, are more effective than conventional microarrays composed of DNA from a single organism. Selection of probes is a key factor in designing successful mixed microarrays because redundant sequences are inefficient and limited representation of diversity can restrict application of the microarray. We have developed a Java-based software tool, called PLASMID, for use in selecting the minimum set of probe sequences needed to classify different groups of plasmids or bacteria. Results The software program was successfully applied to several different sets of data. The utility of PLASMID was illustrated using existing mixed-plasmid microarray data as well as data from a virtual mixed-genome microarray constructed from different strains of Streptococcus. Moreover, use of data from expression microarray experiments demonstrated the generality of PLASMID. Conclusion In this paper we describe a new software tool for selecting a set of probes for a classification microarray. While the tool was developed for the design of mixed microarrays–and mixed-plasmid microarrays in particular–it can also be used to design expression arrays. The user can choose from several clustering methods (including hierarchical, non-hierarchical, and a model-based genetic algorithm, several probe ranking methods, and several different display methods. A novel approach is used for probe redundancy reduction, and probe selection is accomplished via stepwise discriminant analysis. Data can be entered in different formats (including Excel and comma-delimited text, and dendrogram, heat map, and scatter plot images can be saved in several different formats (including jpeg and tiff. Weights

  15. Clustering approaches to identifying gene expression patterns from DNA microarray data.

    Science.gov (United States)

    Do, Jin Hwan; Choi, Dong-Kug

    2008-04-30

    The analysis of microarray data is essential for large amounts of gene expression data. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weakness and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.

  16. Validation of MIMGO: a method to identify differentially expressed GO terms in a microarray dataset

    Directory of Open Access Journals (Sweden)

    Yamada Yoichi

    2012-12-01

    Full Text Available Abstract Background We previously proposed an algorithm for the identification of GO terms that commonly annotate genes whose expression is upregulated or downregulated in some microarray data compared with in other microarray data. We call these “differentially expressed GO terms” and have named the algorithm “matrix-assisted identification method of differentially expressed GO terms” (MIMGO. MIMGO can also identify microarray data in which genes annotated with a differentially expressed GO term are upregulated or downregulated. However, MIMGO has not yet been validated on a real microarray dataset using all available GO terms. Findings We combined Gene Set Enrichment Analysis (GSEA with MIMGO to identify differentially expressed GO terms in a yeast cell cycle microarray dataset. GSEA followed by MIMGO (GSEA + MIMGO correctly identified (p Conclusions MIMGO is a reliable method to identify differentially expressed GO terms comprehensively.

  17. A Critical Perspective On Microarray Breast Cancer Gene Expression Profiling

    NARCIS (Netherlands)

    Sontrop, H.M.J.

    2015-01-01

    Microarrays offer biologists an exciting tool that allows the simultaneous assessment of gene expression levels for thousands of genes at once. At the time of their inception, microarrays were hailed as the new dawn in cancer biology and oncology practice with the hope that within a decade diseases

  18. Development, characterization and experimental validation of a cultivated sunflower (Helianthus annuus L.) gene expression oligonucleotide microarray.

    Science.gov (United States)

    Fernandez, Paula; Soria, Marcelo; Blesa, David; DiRienzo, Julio; Moschen, Sebastian; Rivarola, Maximo; Clavijo, Bernardo Jose; Gonzalez, Sergio; Peluffo, Lucila; Príncipi, Dario; Dosio, Guillermo; Aguirrezabal, Luis; García-García, Francisco; Conesa, Ana; Hopp, Esteban; Dopazo, Joaquín; Heinz, Ruth Amelia; Paniego, Norma

    2012-01-01

    Oligonucleotide-based microarrays with accurate gene coverage represent a key strategy for transcriptional studies in orphan species such as sunflower, H. annuus L., which lacks full genome sequences. The goal of this study was the development and functional annotation of a comprehensive sunflower unigene collection and the design and validation of a custom sunflower oligonucleotide-based microarray. A large scale EST (>130,000 ESTs) curation, assembly and sequence annotation was performed using Blast2GO (www.blast2go.de). The EST assembly comprises 41,013 putative transcripts (12,924 contigs and 28,089 singletons). The resulting Sunflower Unigen Resource (SUR version 1.0) was used to design an oligonucleotide-based Agilent microarray for cultivated sunflower. This microarray includes a total of 42,326 features: 1,417 Agilent controls, 74 control probes for sunflower replicated 10 times (740 controls) and 40,169 different non-control probes. Microarray performance was validated using a model experiment examining the induction of senescence by water deficit. Pre-processing and differential expression analysis of Agilent microarrays was performed using the Bioconductor limma package. The analyses based on p-values calculated by eBayes (psunflower unigene collection, and a custom, validated sunflower oligonucleotide-based microarray using Agilent technology. Both the curated unigene collection and the validated oligonucleotide microarray provide key resources for sunflower genome analysis, transcriptional studies, and molecular breeding for crop improvement.

  19. Validation of MIMGO: a method to identify differentially expressed GO terms in a microarray dataset

    OpenAIRE

    Yamada, Yoichi; Sawada, Hiroki; Hirotani, Ken-ichi; Oshima, Masanobu; Satou, Kenji

    2012-01-01

    Abstract Background We previously proposed an algorithm for the identification of GO terms that commonly annotate genes whose expression is upregulated or downregulated in some microarray data compared with in other microarray data. We call these “differentially expressed GO terms” and have named the algorithm “matrix-assisted identification method of differentially expressed GO terms” (MIMGO). MIMGO can also identify microarray data in which genes annotated with a differentially expressed GO...

  20. Gene Expression Browser: Large-Scale and Cross-Experiment Microarray Data Management, Search & Visualization

    Science.gov (United States)

    The amount of microarray gene expression data in public repositories has been increasing exponentially for the last couple of decades. High-throughput microarray data integration and analysis has become a critical step in exploring the large amount of expression data for biological discovery. Howeve...

  1. Can subtle changes in gene expression be consistently detected with different microarray platforms?

    Directory of Open Access Journals (Sweden)

    Kuiper Rowan

    2008-03-01

    Full Text Available Abstract Background The comparability of gene expression data generated with different microarray platforms is still a matter of concern. Here we address the performance and the overlap in the detection of differentially expressed genes for five different microarray platforms in a challenging biological context where differences in gene expression are few and subtle. Results Gene expression profiles in the hippocampus of five wild-type and five transgenic δC-doublecortin-like kinase mice were evaluated with five microarray platforms: Applied Biosystems, Affymetrix, Agilent, Illumina, LGTC home-spotted arrays. Using a fixed false discovery rate of 10% we detected surprising differences between the number of differentially expressed genes per platform. Four genes were selected by ABI, 130 by Affymetrix, 3,051 by Agilent, 54 by Illumina, and 13 by LGTC. Two genes were found significantly differentially expressed by all platforms and the four genes identified by the ABI platform were found by at least three other platforms. Quantitative RT-PCR analysis confirmed 20 out of 28 of the genes detected by two or more platforms and 8 out of 15 of the genes detected by Agilent only. We observed improved correlations between platforms when ranking the genes based on the significance level than with a fixed statistical cut-off. We demonstrate significant overlap in the affected gene sets identified by the different platforms, although biological processes were represented by only partially overlapping sets of genes. Aberrances in GABA-ergic signalling in the transgenic mice were consistently found by all platforms. Conclusion The different microarray platforms give partially complementary views on biological processes affected. Our data indicate that when analyzing samples with only subtle differences in gene expression the use of two different platforms might be more attractive than increasing the number of replicates. Commercial two-color platforms seem to

  2. Microarray analysis of the gene expression profile in triethylene ...

    African Journals Online (AJOL)

    Microarray analysis of the gene expression profile in triethylene glycol dimethacrylate-treated human dental pulp cells. ... Conclusions: Our results suggest that TEGDMA can change the many functions of hDPCs through large changes in gene expression levels and complex interactions with different signaling pathways.

  3. Kernel Based Nonlinear Dimensionality Reduction and Classification for Genomic Microarray

    Directory of Open Access Journals (Sweden)

    Lan Shu

    2008-07-01

    Full Text Available Genomic microarrays are powerful research tools in bioinformatics and modern medicinal research because they enable massively-parallel assays and simultaneous monitoring of thousands of gene expression of biological samples. However, a simple microarray experiment often leads to very high-dimensional data and a huge amount of information, the vast amount of data challenges researchers into extracting the important features and reducing the high dimensionality. In this paper, a nonlinear dimensionality reduction kernel method based locally linear embedding(LLE is proposed, and fuzzy K-nearest neighbors algorithm which denoises datasets will be introduced as a replacement to the classical LLE’s KNN algorithm. In addition, kernel method based support vector machine (SVM will be used to classify genomic microarray data sets in this paper. We demonstrate the application of the techniques to two published DNA microarray data sets. The experimental results confirm the superiority and high success rates of the presented method.

  4. Microarray-Based Gene Expression Analysis for Veterinary Pathologists: A Review.

    Science.gov (United States)

    Raddatz, Barbara B; Spitzbarth, Ingo; Matheis, Katja A; Kalkuhl, Arno; Deschl, Ulrich; Baumgärtner, Wolfgang; Ulrich, Reiner

    2017-09-01

    High-throughput, genome-wide transcriptome analysis is now commonly used in all fields of life science research and is on the cusp of medical and veterinary diagnostic application. Transcriptomic methods such as microarrays and next-generation sequencing generate enormous amounts of data. The pathogenetic expertise acquired from understanding of general pathology provides veterinary pathologists with a profound background, which is essential in translating transcriptomic data into meaningful biological knowledge, thereby leading to a better understanding of underlying disease mechanisms. The scientific literature concerning high-throughput data-mining techniques usually addresses mathematicians or computer scientists as the target audience. In contrast, the present review provides the reader with a clear and systematic basis from a veterinary pathologist's perspective. Therefore, the aims are (1) to introduce the reader to the necessary methodological background; (2) to introduce the sequential steps commonly performed in a microarray analysis including quality control, annotation, normalization, selection of differentially expressed genes, clustering, gene ontology and pathway analysis, analysis of manually selected genes, and biomarker discovery; and (3) to provide references to publically available and user-friendly software suites. In summary, the data analysis methods presented within this review will enable veterinary pathologists to analyze high-throughput transcriptome data obtained from their own experiments, supplemental data that accompany scientific publications, or public repositories in order to obtain a more in-depth insight into underlying disease mechanisms.

  5. A permutation-based multiple testing method for time-course microarray experiments

    Directory of Open Access Journals (Sweden)

    George Stephen L

    2009-10-01

    Full Text Available Abstract Background Time-course microarray experiments are widely used to study the temporal profiles of gene expression. Storey et al. (2005 developed a method for analyzing time-course microarray studies that can be applied to discovering genes whose expression trajectories change over time within a single biological group, or those that follow different time trajectories among multiple groups. They estimated the expression trajectories of each gene using natural cubic splines under the null (no time-course and alternative (time-course hypotheses, and used a goodness of fit test statistic to quantify the discrepancy. The null distribution of the statistic was approximated through a bootstrap method. Gene expression levels in microarray data are often complicatedly correlated. An accurate type I error control adjusting for multiple testing requires the joint null distribution of test statistics for a large number of genes. For this purpose, permutation methods have been widely used because of computational ease and their intuitive interpretation. Results In this paper, we propose a permutation-based multiple testing procedure based on the test statistic used by Storey et al. (2005. We also propose an efficient computation algorithm. Extensive simulations are conducted to investigate the performance of the permutation-based multiple testing procedure. The application of the proposed method is illustrated using the Caenorhabditis elegans dauer developmental data. Conclusion Our method is computationally efficient and applicable for identifying genes whose expression levels are time-dependent in a single biological group and for identifying the genes for which the time-profile depends on the group in a multi-group setting.

  6. Microarray Gene Expression Analysis to Evaluate Cell Type Specific Expression of Targets Relevant for Immunotherapy of Hematological Malignancies.

    Directory of Open Access Journals (Sweden)

    M J Pont

    Full Text Available Cellular immunotherapy has proven to be effective in the treatment of hematological cancers by donor lymphocyte infusion after allogeneic hematopoietic stem cell transplantation and more recently by targeted therapy with chimeric antigen or T-cell receptor-engineered T cells. However, dependent on the tissue distribution of the antigens that are targeted, anti-tumor responses can be accompanied by undesired side effects. Therefore, detailed tissue distribution analysis is essential to estimate potential efficacy and toxicity of candidate targets for immunotherapy of hematological malignancies. We performed microarray gene expression analysis of hematological malignancies of different origins, healthy hematopoietic cells and various non-hematopoietic cell types from organs that are often targeted in detrimental immune responses after allogeneic stem cell transplantation leading to graft-versus-host disease. Non-hematopoietic cells were also cultured in the presence of IFN-γ to analyze gene expression under inflammatory circumstances. Gene expression was investigated by Illumina HT12.0 microarrays and quality control analysis was performed to confirm the cell-type origin and exclude contamination of non-hematopoietic cell samples with peripheral blood cells. Microarray data were validated by quantitative RT-PCR showing strong correlations between both platforms. Detailed gene expression profiles were generated for various minor histocompatibility antigens and B-cell surface antigens to illustrate the value of the microarray dataset to estimate efficacy and toxicity of candidate targets for immunotherapy. In conclusion, our microarray database provides a relevant platform to analyze and select candidate antigens with hematopoietic (lineage-restricted expression as potential targets for immunotherapy of hematological cancers.

  7. Observation of intermittency in gene expression on cDNA microarrays

    CERN Document Server

    Peterson, L E

    2002-01-01

    We used scaled factorial moments to search for intermittency in the log expression ratios (LERs) for thousands of genes spotted on cDNA microarrays (gene chips). Results indicate varying levels of intermittency in gene expression. The observation of intermittency in the data analyzed provides a complimentary handle on moderately expressed genes, generally not tackled by conventional techniques.

  8. Importance of the efficiency of double-stranded DNA formation in cDNA synthesis for the imprecision of microarray expression analysis.

    Science.gov (United States)

    Thormar, Hans G; Gudmundsson, Bjarki; Eiriksdottir, Freyja; Kil, Siyoen; Gunnarsson, Gudmundur H; Magnusson, Magnus Karl; Hsu, Jason C; Jonsson, Jon J

    2013-04-01

    The causes of imprecision in microarray expression analysis are poorly understood, limiting the use of this technology in molecular diagnostics. Two-dimensional strandness-dependent electrophoresis (2D-SDE) separates nucleic acid molecules on the basis of length and strandness, i.e., double-stranded DNA (dsDNA), single-stranded DNA (ssDNA), and RNA·DNA hybrids. We used 2D-SDE to measure the efficiency of cDNA synthesis and its importance for the imprecision of an in vitro transcription-based microarray expression analysis. The relative amount of double-stranded cDNA formed in replicate experiments that used the same RNA sample template was highly variable, ranging between 0% and 72% of the total DNA. Microarray experiments showed an inverse relationship between the difference between sample pairs in probe variance and the relative amount of dsDNA. Approximately 15% of probes showed between-sample variation (P cDNA synthesized can be an important component of the imprecision in T7 RNA polymerase-based microarray expression analysis. © 2013 American Association for Clinical Chemistry

  9. Incorporation of gene-specific variability improves expression analysis using high-density DNA microarrays

    Directory of Open Access Journals (Sweden)

    Spitznagel Edward

    2003-11-01

    Full Text Available Abstract Background The assessment of data reproducibility is essential for application of microarray technology to exploration of biological pathways and disease states. Technical variability in data analysis largely depends on signal intensity. Within that context, the reproducibility of individual probe sets has not been hitherto addressed. Results We used an extraordinarily large replicate data set derived from human placental trophoblast to analyze probe-specific contribution to variability of gene expression. We found that signal variability, in addition to being signal-intensity dependant, is probe set-specific. Importantly, we developed a novel method to quantify the contribution of this probe set-specific variability. Furthermore, we devised a formula that incorporates a priori-computed, replicate-based information on probe set- and intensity-specific variability in determination of expression changes even without technical replicates. Conclusion The strategy of incorporating probe set-specific variability is superior to analysis based on arbitrary fold-change thresholds. We recommend its incorporation to any computation of gene expression changes using high-density DNA microarrays. A Java application implementing our T-score is available at http://www.sadovsky.wustl.edu/tscore.html.

  10. Reproducibility of gene expression across generations of Affymetrix microarrays

    Directory of Open Access Journals (Sweden)

    Haslett Judith N

    2003-06-01

    Full Text Available Abstract Background The development of large-scale gene expression profiling technologies is rapidly changing the norms of biological investigation. But the rapid pace of change itself presents challenges. Commercial microarrays are regularly modified to incorporate new genes and improved target sequences. Although the ability to compare datasets across generations is crucial for any long-term research project, to date no means to allow such comparisons have been developed. In this study the reproducibility of gene expression levels across two generations of Affymetrix GeneChips® (HuGeneFL and HG-U95A was measured. Results Correlation coefficients were computed for gene expression values across chip generations based on different measures of similarity. Comparing the absolute calls assigned to the individual probe sets across the generations found them to be largely unchanged. Conclusion We show that experimental replicates are highly reproducible, but that reproducibility across generations depends on the degree of similarity of the probe sets and the expression level of the corresponding transcript.

  11. Identification of differentially expressed genes in cutaneous squamous cell carcinoma by microarray expression profiling

    Directory of Open Access Journals (Sweden)

    Sterry Wolfram

    2006-08-01

    Full Text Available Abstract Background Carcinogenesis is a multi-step process indicated by several genes up- or down-regulated during tumor progression. This study examined and identified differentially expressed genes in cutaneous squamous cell carcinoma (SCC. Results Three different biopsies of 5 immunosuppressed organ-transplanted recipients each normal skin (all were pooled, actinic keratosis (AK (two were pooled, and invasive SCC and additionally 5 normal skin tissues from immunocompetent patients were analyzed. Thus, total RNA of 15 specimens were used for hybridization with Affymetrix HG-U133A microarray technology containing 22,283 genes. Data analyses were performed by prediction analysis of microarrays using nearest shrunken centroids with the threshold 3.5 and ANOVA analysis was independently performed in order to identify differentially expressed genes (p vs. AK and SCC were observed for 118 genes. Conclusion The majority of identified differentially expressed genes in cutaneous SCC were previously not described.

  12. AffyMiner: mining differentially expressed genes and biological knowledge in GeneChip microarray data

    Directory of Open Access Journals (Sweden)

    Xia Yuannan

    2006-12-01

    Full Text Available Abstract Background DNA microarrays are a powerful tool for monitoring the expression of tens of thousands of genes simultaneously. With the advance of microarray technology, the challenge issue becomes how to analyze a large amount of microarray data and make biological sense of them. Affymetrix GeneChips are widely used microarrays, where a variety of statistical algorithms have been explored and used for detecting significant genes in the experiment. These methods rely solely on the quantitative data, i.e., signal intensity; however, qualitative data are also important parameters in detecting differentially expressed genes. Results AffyMiner is a tool developed for detecting differentially expressed genes in Affymetrix GeneChip microarray data and for associating gene annotation and gene ontology information with the genes detected. AffyMiner consists of the functional modules, GeneFinder for detecting significant genes in a treatment versus control experiment and GOTree for mapping genes of interest onto the Gene Ontology (GO space; and interfaces to run Cluster, a program for clustering analysis, and GenMAPP, a program for pathway analysis. AffyMiner has been used for analyzing the GeneChip data and the results were presented in several publications. Conclusion AffyMiner fills an important gap in finding differentially expressed genes in Affymetrix GeneChip microarray data. AffyMiner effectively deals with multiple replicates in the experiment and takes into account both quantitative and qualitative data in identifying significant genes. AffyMiner reduces the time and effort needed to compare data from multiple arrays and to interpret the possible biological implications associated with significant changes in a gene's expression.

  13. Improved microarray-based decision support with graph encoded interactome data.

    Directory of Open Access Journals (Sweden)

    Anneleen Daemen

    Full Text Available In the past, microarray studies have been criticized due to noise and the limited overlap between gene signatures. Prior biological knowledge should therefore be incorporated as side information in models based on gene expression data to improve the accuracy of diagnosis and prognosis in cancer. As prior knowledge, we investigated interaction and pathway information from the human interactome on different aspects of biological systems. By exploiting the properties of kernel methods, relations between genes with similar functions but active in alternative pathways could be incorporated in a support vector machine classifier based on spectral graph theory. Using 10 microarray data sets, we first reduced the number of data sources relevant for multiple cancer types and outcomes. Three sources on metabolic pathway information (KEGG, protein-protein interactions (OPHID and miRNA-gene targeting (microRNA.org outperformed the other sources with regard to the considered class of models. Both fixed and adaptive approaches were subsequently considered to combine the three corresponding classifiers. Averaging the predictions of these classifiers performed best and was significantly better than the model based on microarray data only. These results were confirmed on 6 validation microarray sets, with a significantly improved performance in 4 of them. Integrating interactome data thus improves classification of cancer outcome for the investigated microarray technologies and cancer types. Moreover, this strategy can be incorporated in any kernel method or non-linear version of a non-kernel method.

  14. Microarray BASICA: Background Adjustment, Segmentation, Image Compression and Analysis of Microarray Images

    Directory of Open Access Journals (Sweden)

    Jianping Hua

    2004-01-01

    Full Text Available This paper presents microarray BASICA: an integrated image processing tool for background adjustment, segmentation, image compression, and analysis of cDNA microarray images. BASICA uses a fast Mann-Whitney test-based algorithm to segment cDNA microarray images, and performs postprocessing to eliminate the segmentation irregularities. The segmentation results, along with the foreground and background intensities obtained with the background adjustment, are then used for independent compression of the foreground and background. We introduce a new distortion measurement for cDNA microarray image compression and devise a coding scheme by modifying the embedded block coding with optimized truncation (EBCOT algorithm (Taubman, 2000 to achieve optimal rate-distortion performance in lossy coding while still maintaining outstanding lossless compression performance. Experimental results show that the bit rate required to ensure sufficiently accurate gene expression measurement varies and depends on the quality of cDNA microarray images. For homogeneously hybridized cDNA microarray images, BASICA is able to provide from a bit rate as low as 5 bpp the gene expression data that are 99% in agreement with those of the original 32 bpp images.

  15. Detecting imbalanced expression of SNP alleles by minisequencing on microarrays

    Directory of Open Access Journals (Sweden)

    Dahlgren Andreas

    2004-10-01

    Full Text Available Abstract Background Each of the human genes or transcriptional units is likely to contain single nucleotide polymorphisms that may give rise to sequence variation between individuals and tissues on the level of RNA. Based on recent studies, differential expression of the two alleles of heterozygous coding single nucleotide polymorphisms (SNPs may be frequent for human genes. Methods with high accuracy to be used in a high throughput setting are needed for systematic surveys of expressed sequence variation. In this study we evaluated two formats of multiplexed, microarray based minisequencing for quantitative detection of imbalanced expression of SNP alleles. We used a panel of ten SNPs located in five genes known to be expressed in two endothelial cell lines as our model system. Results The accuracy and sensitivity of quantitative detection of allelic imbalance was assessed for each SNP by constructing regression lines using a dilution series of mixed samples from individuals of different genotype. Accurate quantification of SNP alleles by both assay formats was evidenced for by R2 values > 0.95 for the majority of the regression lines. According to a two sample t-test, we were able to distinguish 1–9% of a minority SNP allele from a homozygous genotype, with larger variation between SNPs than between assay formats. Six of the SNPs, heterozygous in either of the two cell lines, were genotyped in RNA extracted from the endothelial cells. The coefficient of variation between the fluorescent signals from five parallel reactions was similar for cDNA and genomic DNA. The fluorescence signal intensity ratios measured in the cDNA samples were compared to those in genomic DNA to determine the relative expression levels of the two alleles of each SNP. Four of the six SNPs tested displayed a higher than 1.4-fold difference in allelic ratios between cDNA and genomic DNA. The results were verified by allele-specific oligonucleotide hybridisation and

  16. Xylella fastidiosa gene expression analysis by DNA microarrays

    Directory of Open Access Journals (Sweden)

    Regiane F. Travensolo

    2009-01-01

    Full Text Available Xylella fastidiosa genome sequencing has generated valuable data by identifying genes acting either on metabolic pathways or in associated pathogenicity and virulence. Based on available information on these genes, new strategies for studying their expression patterns, such as microarray technology, were employed. A total of 2,600 primer pairs were synthesized and then used to generate fragments using the PCR technique. The arrays were hybridized against cDNAs labeled during reverse transcription reactions and which were obtained from bacteria grown under two different conditions (liquid XDM2 and liquid BCYE. All data were statistically analyzed to verify which genes were differentially expressed. In addition to exploring conditions for X. fastidiosa genome-wide transcriptome analysis, the present work observed the differential expression of several classes of genes (energy, protein, amino acid and nucleotide metabolism, transport, degradation of substances, toxins and hypothetical proteins, among others. The understanding of expressed genes in these two different media will be useful in comprehending the metabolic characteristics of X. fastidiosa, and in evaluating how important certain genes are for the functioning and survival of these bacteria in plants.

  17. Evaluation of gene importance in microarray data based upon probability of selection

    Directory of Open Access Journals (Sweden)

    Fu Li M

    2005-03-01

    Full Text Available Abstract Background Microarray devices permit a genome-scale evaluation of gene function. This technology has catalyzed biomedical research and development in recent years. As many important diseases can be traced down to the gene level, a long-standing research problem is to identify specific gene expression patterns linking to metabolic characteristics that contribute to disease development and progression. The microarray approach offers an expedited solution to this problem. However, it has posed a challenging issue to recognize disease-related genes expression patterns embedded in the microarray data. In selecting a small set of biologically significant genes for classifier design, the nature of high data dimensionality inherent in this problem creates substantial amount of uncertainty. Results Here we present a model for probability analysis of selected genes in order to determine their importance. Our contribution is that we show how to derive the P value of each selected gene in multiple gene selection trials based on different combinations of data samples and how to conduct a reliability analysis accordingly. The importance of a gene is indicated by its associated P value in that a smaller value implies higher information content from information theory. On the microarray data concerning the subtype classification of small round blue cell tumors, we demonstrate that the method is capable of finding the smallest set of genes (19 genes with optimal classification performance, compared with results reported in the literature. Conclusion In classifier design based on microarray data, the probability value derived from gene selection based on multiple combinations of data samples enables an effective mechanism for reducing the tendency of fitting local data particularities.

  18. A tiling microarray for global analysis of chloroplast genome expression in cucumber and other plants

    Directory of Open Access Journals (Sweden)

    Pląder Wojciech

    2011-09-01

    Full Text Available Abstract Plastids are small organelles equipped with their own genomes (plastomes. Although these organelles are involved in numerous plant metabolic pathways, current knowledge about the transcriptional activity of plastomes is limited. To solve this problem, we constructed a plastid tiling microarray (PlasTi-microarray consisting of 1629 oligonucleotide probes. The oligonucleotides were designed based on the cucumber chloroplast genomic sequence and targeted both strands of the plastome in a non-contiguous arrangement. Up to 4 specific probes were designed for each gene/exon, and the intergenic regions were covered regularly, with 70-nt intervals. We also developed a protocol for direct chemical labeling and hybridization of as little as 2 micrograms of chloroplast RNA. We used this protocol for profiling the expression of the cucumber chloroplast plastome on the PlasTi-microarray. Owing to the high sequence similarity of plant plastomes, the newly constructed microarray can be used to study plants other than cucumber. Comparative hybridization of chloroplast transcriptomes from cucumber, Arabidopsis, tomato and spinach showed that the PlasTi-microarray is highly versatile.

  19. Hierarchical information representation and efficient classification of gene expression microarray data

    OpenAIRE

    Bosio, Mattia

    2014-01-01

    In the field of computational biology, microarryas are used to measure the activity of thousands of genes at once and create a global picture of cellular function. Microarrays allow scientists to analyze expression of many genes in a single experiment quickly and eficiently. Even if microarrays are a consolidated research technology nowadays and the trends in high-throughput data analysis are shifting towards new technologies like Next Generation Sequencing (NGS), an optimum method for sample...

  20. Normal uniform mixture differential gene expression detection for cDNA microarrays

    Directory of Open Access Journals (Sweden)

    Raftery Adrian E

    2005-07-01

    Full Text Available Abstract Background One of the primary tasks in analysing gene expression data is finding genes that are differentially expressed in different samples. Multiple testing issues due to the thousands of tests run make some of the more popular methods for doing this problematic. Results We propose a simple method, Normal Uniform Differential Gene Expression (NUDGE detection for finding differentially expressed genes in cDNA microarrays. The method uses a simple univariate normal-uniform mixture model, in combination with new normalization methods for spread as well as mean that extend the lowess normalization of Dudoit, Yang, Callow and Speed (2002 1. It takes account of multiple testing, and gives probabilities of differential expression as part of its output. It can be applied to either single-slide or replicated experiments, and it is very fast. Three datasets are analyzed using NUDGE, and the results are compared to those given by other popular methods: unadjusted and Bonferroni-adjusted t tests, Significance Analysis of Microarrays (SAM, and Empirical Bayes for microarrays (EBarrays with both Gamma-Gamma and Lognormal-Normal models. Conclusion The method gives a high probability of differential expression to genes known/suspected a priori to be differentially expressed and a low probability to the others. In terms of known false positives and false negatives, the method outperforms all multiple-replicate methods except for the Gamma-Gamma EBarrays method to which it offers comparable results with the added advantages of greater simplicity, speed, fewer assumptions and applicability to the single replicate case. An R package called nudge to implement the methods in this paper will be made available soon at http://www.bioconductor.org.

  1. CDNA Microarray Based Comparative Gene Expression Analysis of Primary Breast Tumors Versus In Vitro Transformed Neoplastic Breast Epithelium

    National Research Council Canada - National Science Library

    Szallasi, Zoltan

    2001-01-01

    .... The first group of clones is being sorted by their ability to form tumors. We are currently performing cDNA microarray analysis quantifying the expression level of about 15,000 genes in these cell lines...

  2. Cross-platform comparison of SYBR® Green real-time PCR with TaqMan PCR, microarrays and other gene expression measurement technologies evaluated in the MicroArray Quality Control (MAQC study

    Directory of Open Access Journals (Sweden)

    Dial Stacey L

    2008-07-01

    Full Text Available Abstract Background The MicroArray Quality Control (MAQC project evaluated the inter- and intra-platform reproducibility of seven microarray platforms and three quantitative gene expression assays in profiling the expression of two commercially available Reference RNA samples (Nat Biotechnol 24:1115-22, 2006. The tested microarrays were the platforms from Affymetrix, Agilent Technologies, Applied Biosystems, GE Healthcare, Illumina, Eppendorf and the National Cancer Institute, and quantitative gene expression assays included TaqMan® Gene Expression PCR Assay, Standardized (Sta RT-PCR™ and QuantiGene®. The data showed great consistency in gene expression measurements across different microarray platforms, different technologies and test sites. However, SYBR® Green real-time PCR, another common technique utilized by half of all real-time PCR users for gene expression measurement, was not addressed in the MAQC study. In the present study, we compared the performance of SYBR Green PCR with TaqMan PCR, microarrays and other quantitative technologies using the same two Reference RNA samples as the MAQC project. We assessed SYBR Green real-time PCR using commercially available RT2 Profiler™ PCR Arrays from SuperArray, containing primer pairs that have been experimentally validated to ensure gene-specificity and high amplification efficiency. Results The SYBR Green PCR Arrays exhibit good reproducibility among different users, PCR instruments and test sites. In addition, the SYBR Green PCR Arrays have the highest concordance with TaqMan PCR, and a high level of concordance with other quantitative methods and microarrays that were evaluated in this study in terms of fold-change correlation and overlap of lists of differentially expressed genes. Conclusion These data demonstrate that SYBR Green real-time PCR delivers highly comparable results in gene expression measurement with TaqMan PCR and other high-density microarrays.

  3. Gene Expression Signature in Endemic Osteoarthritis by Microarray Analysis

    Directory of Open Access Journals (Sweden)

    Xi Wang

    2015-05-01

    Full Text Available Kashin-Beck Disease (KBD is an endemic osteochondropathy with an unknown pathogenesis. Diagnosis of KBD is effective only in advanced cases, which eliminates the possibility of early treatment and leads to an inevitable exacerbation of symptoms. Therefore, we aim to identify an accurate blood-based gene signature for the detection of KBD. Previously published gene expression profile data on cartilage and peripheral blood mononuclear cells (PBMCs from adults with KBD were compared to select potential target genes. Microarray analysis was conducted to evaluate the expression of the target genes in a cohort of 100 KBD patients and 100 healthy controls. A gene expression signature was identified using a training set, which was subsequently validated using an independent test set with a minimum redundancy maximum relevance (mRMR algorithm and support vector machine (SVM algorithm. Fifty unique genes were differentially expressed between KBD patients and healthy controls. A 20-gene signature was identified that distinguished between KBD patients and controls with 90% accuracy, 85% sensitivity, and 95% specificity. This study identified a 20-gene signature that accurately distinguishes between patients with KBD and controls using peripheral blood samples. These results promote the further development of blood-based genetic biomarkers for detection of KBD.

  4. GEPAS, a web-based tool for microarray data analysis and interpretation

    Science.gov (United States)

    Tárraga, Joaquín; Medina, Ignacio; Carbonell, José; Huerta-Cepas, Jaime; Minguez, Pablo; Alloza, Eva; Al-Shahrour, Fátima; Vegas-Azcárate, Susana; Goetz, Stefan; Escobar, Pablo; Garcia-Garcia, Francisco; Conesa, Ana; Montaner, David; Dopazo, Joaquín

    2008-01-01

    Gene Expression Profile Analysis Suite (GEPAS) is one of the most complete and extensively used web-based packages for microarray data analysis. During its more than 5 years of activity it has continuously been updated to keep pace with the state-of-the-art in the changing microarray data analysis arena. GEPAS offers diverse analysis options that include well established as well as novel algorithms for normalization, gene selection, class prediction, clustering and functional profiling of the experiment. New options for time-course (or dose-response) experiments, microarray-based class prediction, new clustering methods and new tests for differential expression have been included. The new pipeliner module allows automating the execution of sequential analysis steps by means of a simple but powerful graphic interface. An extensive re-engineering of GEPAS has been carried out which includes the use of web services and Web 2.0 technology features, a new user interface with persistent sessions and a new extended database of gene identifiers. GEPAS is nowadays the most quoted web tool in its field and it is extensively used by researchers of many countries and its records indicate an average usage rate of 500 experiments per day. GEPAS, is available at http://www.gepas.org. PMID:18508806

  5. Development and validation of a flax (Linum usitatissimum L.) gene expression oligo microarray.

    Science.gov (United States)

    Fenart, Stéphane; Ndong, Yves-Placide Assoumou; Duarte, Jorge; Rivière, Nathalie; Wilmer, Jeroen; van Wuytswinkel, Olivier; Lucau, Anca; Cariou, Emmanuelle; Neutelings, Godfrey; Gutierrez, Laurent; Chabbert, Brigitte; Guillot, Xavier; Tavernier, Reynald; Hawkins, Simon; Thomasset, Brigitte

    2010-10-21

    Flax (Linum usitatissimum L.) has been cultivated for around 9,000 years and is therefore one of the oldest cultivated species. Today, flax is still grown for its oil (oil-flax or linseed cultivars) and its cellulose-rich fibres (fibre-flax cultivars) used for high-value linen garments and composite materials. Despite the wide industrial use of flax-derived products, and our actual understanding of the regulation of both wood fibre production and oil biosynthesis more information must be acquired in both domains. Recent advances in genomics are now providing opportunities to improve our fundamental knowledge of these complex processes. In this paper we report the development and validation of a high-density oligo microarray platform dedicated to gene expression analyses in flax. Nine different RNA samples obtained from flax inner- and outer-stems, seeds, leaves and roots were used to generate a collection of 1,066,481 ESTs by massive parallel pyrosequencing. Sequences were assembled into 59,626 unigenes and 48,021 sequences were selected for oligo design and high-density microarray (Nimblegen 385K) fabrication with eight, non-overlapping 25-mers oligos per unigene. 18 independent experiments were used to evaluate the hybridization quality, precision, specificity and accuracy and all results confirmed the high technical quality of our microarray platform. Cross-validation of microarray data was carried out using quantitative qRT-PCR. Nine target genes were selected on the basis of microarray results and reflected the whole range of fold change (both up-regulated and down-regulated genes in different samples). A statistically significant positive correlation was obtained comparing expression levels for each target gene across all biological replicates both in qRT-PCR and microarray results. Further experiments illustrated the capacity of our arrays to detect differential gene expression in a variety of flax tissues as well as between two contrasted flax varieties

  6. Development and validation of a flax (Linum usitatissimum L. gene expression oligo microarray

    Directory of Open Access Journals (Sweden)

    Gutierrez Laurent

    2010-10-01

    Full Text Available Abstract Background Flax (Linum usitatissimum L. has been cultivated for around 9,000 years and is therefore one of the oldest cultivated species. Today, flax is still grown for its oil (oil-flax or linseed cultivars and its cellulose-rich fibres (fibre-flax cultivars used for high-value linen garments and composite materials. Despite the wide industrial use of flax-derived products, and our actual understanding of the regulation of both wood fibre production and oil biosynthesis more information must be acquired in both domains. Recent advances in genomics are now providing opportunities to improve our fundamental knowledge of these complex processes. In this paper we report the development and validation of a high-density oligo microarray platform dedicated to gene expression analyses in flax. Results Nine different RNA samples obtained from flax inner- and outer-stems, seeds, leaves and roots were used to generate a collection of 1,066,481 ESTs by massive parallel pyrosequencing. Sequences were assembled into 59,626 unigenes and 48,021 sequences were selected for oligo design and high-density microarray (Nimblegen 385K fabrication with eight, non-overlapping 25-mers oligos per unigene. 18 independent experiments were used to evaluate the hybridization quality, precision, specificity and accuracy and all results confirmed the high technical quality of our microarray platform. Cross-validation of microarray data was carried out using quantitative qRT-PCR. Nine target genes were selected on the basis of microarray results and reflected the whole range of fold change (both up-regulated and down-regulated genes in different samples. A statistically significant positive correlation was obtained comparing expression levels for each target gene across all biological replicates both in qRT-PCR and microarray results. Further experiments illustrated the capacity of our arrays to detect differential gene expression in a variety of flax tissues as well

  7. Analyzing Multiple-Probe Microarray: Estimation and Application of Gene Expression Indexes

    KAUST Repository

    Maadooliat, Mehdi

    2012-07-26

    Gene expression index estimation is an essential step in analyzing multiple probe microarray data. Various modeling methods have been proposed in this area. Amidst all, a popular method proposed in Li and Wong (2001) is based on a multiplicative model, which is similar to the additive model discussed in Irizarry et al. (2003a) at the logarithm scale. Along this line, Hu et al. (2006) proposed data transformation to improve expression index estimation based on an ad hoc entropy criteria and naive grid search approach. In this work, we re-examined this problem using a new profile likelihood-based transformation estimation approach that is more statistically elegant and computationally efficient. We demonstrate the applicability of the proposed method using a benchmark Affymetrix U95A spiked-in experiment. Moreover, We introduced a new multivariate expression index and used the empirical study to shows its promise in terms of improving model fitting and power of detecting differential expression over the commonly used univariate expression index. As the other important content of the work, we discussed two generally encountered practical issues in application of gene expression index: normalization and summary statistic used for detecting differential expression. Our empirical study shows somewhat different findings from the MAQC project (MAQC, 2006).

  8. Knowledge-based analysis of microarrays for the discovery of transcriptional regulation relationships.

    Science.gov (United States)

    Seok, Junhee; Kaushal, Amit; Davis, Ronald W; Xiao, Wenzhong

    2010-01-18

    The large amount of high-throughput genomic data has facilitated the discovery of the regulatory relationships between transcription factors and their target genes. While early methods for discovery of transcriptional regulation relationships from microarray data often focused on the high-throughput experimental data alone, more recent approaches have explored the integration of external knowledge bases of gene interactions. In this work, we develop an algorithm that provides improved performance in the prediction of transcriptional regulatory relationships by supplementing the analysis of microarray data with a new method of integrating information from an existing knowledge base. Using a well-known dataset of yeast microarrays and the Yeast Proteome Database, a comprehensive collection of known information of yeast genes, we show that knowledge-based predictions demonstrate better sensitivity and specificity in inferring new transcriptional interactions than predictions from microarray data alone. We also show that comprehensive, direct and high-quality knowledge bases provide better prediction performance. Comparison of our results with ChIP-chip data and growth fitness data suggests that our predicted genome-wide regulatory pairs in yeast are reasonable candidates for follow-up biological verification. High quality, comprehensive, and direct knowledge bases, when combined with appropriate bioinformatic algorithms, can significantly improve the discovery of gene regulatory relationships from high throughput gene expression data.

  9. Exploring matrix factorization techniques for significant genes identification of Alzheimer’s disease microarray gene expression data

    Directory of Open Access Journals (Sweden)

    Hu Xiaohua

    2011-07-01

    Full Text Available Abstract Background The wide use of high-throughput DNA microarray technology provide an increasingly detailed view of human transcriptome from hundreds to thousands of genes. Although biomedical researchers typically design microarray experiments to explore specific biological contexts, the relationships between genes are hard to identified because they are complex and noisy high-dimensional data and are often hindered by low statistical power. The main challenge now is to extract valuable biological information from the colossal amount of data to gain insight into biological processes and the mechanisms of human disease. To overcome the challenge requires mathematical and computational methods that are versatile enough to capture the underlying biological features and simple enough to be applied efficiently to large datasets. Methods Unsupervised machine learning approaches provide new and efficient analysis of gene expression profiles. In our study, two unsupervised knowledge-based matrix factorization methods, independent component analysis (ICA and nonnegative matrix factorization (NMF are integrated to identify significant genes and related pathways in microarray gene expression dataset of Alzheimer’s disease. The advantage of these two approaches is they can be performed as a biclustering method by which genes and conditions can be clustered simultaneously. Furthermore, they can group genes into different categories for identifying related diagnostic pathways and regulatory networks. The difference between these two method lies in ICA assume statistical independence of the expression modes, while NMF need positivity constrains to generate localized gene expression profiles. Results In our work, we performed FastICA and non-smooth NMF methods on DNA microarray gene expression data of Alzheimer’s disease respectively. The simulation results shows that both of the methods can clearly classify severe AD samples from control samples, and

  10. Microarray-Based Gene Expression Profiling to Elucidate Effectiveness of Fermented Codonopsis lanceolata in Mice

    Directory of Open Access Journals (Sweden)

    Woon Yong Choi

    2014-04-01

    Full Text Available In this study, the effect of Codonopsis lanceolata fermented by lactic acid on controlling gene expression levels related to obesity was observed in an oligonucleotide chip microarray. Among 8170 genes, 393 genes were up regulated and 760 genes were down regulated in feeding the fermented C. lanceolata (FCL. Another 374 genes were up regulated and 527 genes down regulated without feeding the sample. The genes were not affected by the FCL sample. It was interesting that among those genes, Chytochrome P450, Dmbt1, LOC76487, and thyroid hormones, etc., were mostly up or down regulated. These genes are more related to lipid synthesis. We could conclude that the FCL possibly controlled the gene expression levels related to lipid synthesis, which resulted in reducing obesity. However, more detailed protein expression experiments should be carried out.

  11. Examination of gene expression in mice exposed to low dose radiation using affymetrix cDNA microarrays

    Energy Technology Data Exchange (ETDEWEB)

    Morris, D.; Knox, D.; Lavoie, J.; Lemon, J.; Boreham, D. [McMaster Univ., Hamilton, Ontario (Canada)

    2005-07-01

    'Full text:' Gamma radiation acts via the indirect effect to damage cells by producing reactive oxygen species (ROS). These ROS are capable damaging macromolecules and, altering signal pathways and gene transcription. Cells have evolved enzymes and mechanisms to scavenge ROS and repair oxidative damage. Microarrays allow the survey of the gene transcription activity of thousands of genes simultaneously. Messenger RNA is extracted from cells, hybridized with the complementary DNA (cDNA) of a microarray chip, and examined with a chip reader. Affymetrix microarray chips have been produced by the CSCHAH in Winnipeg containing 26000 murine genes. Groups of female mice have been exposed to low dose whole body chronic gamma radiation exposures of 0,50,100, and 120 mGy, corresponding to 15,30,60, and 75 weeks, respectively. MRNA from mice brain tissue has been extracted, isolated, converted to cDNA and labeled. Gene expression in each irradiated mouse was compared to the pooled expression of the control mice. Analysis of gene expression levels are performed with microarray analytical software, Array Pro by Media Cybernetics, and powerful statistical software, BRB microarray tools. Differences in gene expressions, focusing on genes for cytokines, DNA repair mechanisms, immuno-modulators, apoptosis pathways, and enzymatic anti-oxidant systems, are being examined and will be reported. (author)

  12. MARS: Microarray analysis, retrieval, and storage system

    Directory of Open Access Journals (Sweden)

    Scheideler Marcel

    2005-04-01

    Full Text Available Abstract Background Microarray analysis has become a widely used technique for the study of gene-expression patterns on a genomic scale. As more and more laboratories are adopting microarray technology, there is a need for powerful and easy to use microarray databases facilitating array fabrication, labeling, hybridization, and data analysis. The wealth of data generated by this high throughput approach renders adequate database and analysis tools crucial for the pursuit of insights into the transcriptomic behavior of cells. Results MARS (Microarray Analysis and Retrieval System provides a comprehensive MIAME supportive suite for storing, retrieving, and analyzing multi color microarray data. The system comprises a laboratory information management system (LIMS, a quality control management, as well as a sophisticated user management system. MARS is fully integrated into an analytical pipeline of microarray image analysis, normalization, gene expression clustering, and mapping of gene expression data onto biological pathways. The incorporation of ontologies and the use of MAGE-ML enables an export of studies stored in MARS to public repositories and other databases accepting these documents. Conclusion We have developed an integrated system tailored to serve the specific needs of microarray based research projects using a unique fusion of Web based and standalone applications connected to the latest J2EE application server technology. The presented system is freely available for academic and non-profit institutions. More information can be found at http://genome.tugraz.at.

  13. RDFBuilder: a tool to automatically build RDF-based interfaces for MAGE-OM microarray data sources.

    Science.gov (United States)

    Anguita, Alberto; Martin, Luis; Garcia-Remesal, Miguel; Maojo, Victor

    2013-07-01

    This paper presents RDFBuilder, a tool that enables RDF-based access to MAGE-ML-compliant microarray databases. We have developed a system that automatically transforms the MAGE-OM model and microarray data stored in the ArrayExpress database into RDF format. Additionally, the system automatically enables a SPARQL endpoint. This allows users to execute SPARQL queries for retrieving microarray data, either from specific experiments or from more than one experiment at a time. Our system optimizes response times by caching and reusing information from previous queries. In this paper, we describe our methods for achieving this transformation. We show that our approach is complementary to other existing initiatives, such as Bio2RDF, for accessing and retrieving data from the ArrayExpress database. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  14. ZODET: software for the identification, analysis and visualisation of outlier genes in microarray expression data.

    Directory of Open Access Journals (Sweden)

    Daniel L Roden

    Full Text Available Complex human diseases can show significant heterogeneity between patients with the same phenotypic disorder. An outlier detection strategy was developed to identify variants at the level of gene transcription that are of potential biological and phenotypic importance. Here we describe a graphical software package (z-score outlier detection (ZODET that enables identification and visualisation of gross abnormalities in gene expression (outliers in individuals, using whole genome microarray data. Mean and standard deviation of expression in a healthy control cohort is used to detect both over and under-expressed probes in individual test subjects. We compared the potential of ZODET to detect outlier genes in gene expression datasets with a previously described statistical method, gene tissue index (GTI, using a simulated expression dataset and a publicly available monocyte-derived macrophage microarray dataset. Taken together, these results support ZODET as a novel approach to identify outlier genes of potential pathogenic relevance in complex human diseases. The algorithm is implemented using R packages and Java.The software is freely available from http://www.ucl.ac.uk/medicine/molecular-medicine/publications/microarray-outlier-analysis.

  15. Strategies for comparing gene expression profiles from different microarray platforms: application to a case-control experiment.

    Science.gov (United States)

    Severgnini, Marco; Bicciato, Silvio; Mangano, Eleonora; Scarlatti, Francesca; Mezzelani, Alessandra; Mattioli, Michela; Ghidoni, Riccardo; Peano, Clelia; Bonnal, Raoul; Viti, Federica; Milanesi, Luciano; De Bellis, Gianluca; Battaglia, Cristina

    2006-06-01

    Meta-analysis of microarray data is increasingly important, considering both the availability of multiple platforms using disparate technologies and the accumulation in public repositories of data sets from different laboratories. We addressed the issue of comparing gene expression profiles from two microarray platforms by devising a standardized investigative strategy. We tested this procedure by studying MDA-MB-231 cells, which undergo apoptosis on treatment with resveratrol. Gene expression profiles were obtained using high-density, short-oligonucleotide, single-color microarray platforms: GeneChip (Affymetrix) and CodeLink (Amersham). Interplatform analyses were carried out on 8414 common transcripts represented on both platforms, as identified by LocusLink ID, representing 70.8% and 88.6% of annotated GeneChip and CodeLink features, respectively. We identified 105 differentially expressed genes (DEGs) on CodeLink and 42 DEGs on GeneChip. Among them, only 9 DEGs were commonly identified by both platforms. Multiple analyses (BLAST alignment of probes with target sequences, gene ontology, literature mining, and quantitative real-time PCR) permitted us to investigate the factors contributing to the generation of platform-dependent results in single-color microarray experiments. An effective approach to cross-platform comparison involves microarrays of similar technologies, samples prepared by identical methods, and a standardized battery of bioinformatic and statistical analyses.

  16. Consistent Differential Expression Pattern (CDEP) on microarray to identify genes related to metastatic behavior.

    Science.gov (United States)

    Tsoi, Lam C; Qin, Tingting; Slate, Elizabeth H; Zheng, W Jim

    2011-11-11

    To utilize the large volume of gene expression information generated from different microarray experiments, several meta-analysis techniques have been developed. Despite these efforts, there remain significant challenges to effectively increasing the statistical power and decreasing the Type I error rate while pooling the heterogeneous datasets from public resources. The objective of this study is to develop a novel meta-analysis approach, Consistent Differential Expression Pattern (CDEP), to identify genes with common differential expression patterns across different datasets. We combined False Discovery Rate (FDR) estimation and the non-parametric RankProd approach to estimate the Type I error rate in each microarray dataset of the meta-analysis. These Type I error rates from all datasets were then used to identify genes with common differential expression patterns. Our simulation study showed that CDEP achieved higher statistical power and maintained low Type I error rate when compared with two recently proposed meta-analysis approaches. We applied CDEP to analyze microarray data from different laboratories that compared transcription profiles between metastatic and primary cancer of different types. Many genes identified as differentially expressed consistently across different cancer types are in pathways related to metastatic behavior, such as ECM-receptor interaction, focal adhesion, and blood vessel development. We also identified novel genes such as AMIGO2, Gem, and CXCL11 that have not been shown to associate with, but may play roles in, metastasis. CDEP is a flexible approach that borrows information from each dataset in a meta-analysis in order to identify genes being differentially expressed consistently. We have shown that CDEP can gain higher statistical power than other existing approaches under a variety of settings considered in the simulation study, suggesting its robustness and insensitivity to data variation commonly associated with microarray

  17. Analysis of Temporal-spatial Co-variation within Gene Expression Microarray Data in an Organogenesis Model

    Science.gov (United States)

    Ehler, Martin; Rajapakse, Vinodh; Zeeberg, Barry; Brooks, Brian; Brown, Jacob; Czaja, Wojciech; Bonner, Robert F.

    The gene networks underlying closure of the optic fissure during vertebrate eye development are poorly understood. We used a novel clustering method based on Laplacian Eigenmaps, a nonlinear dimension reduction method, to analyze microarray data from laser capture microdissected (LCM) cells at the site and developmental stages (days 10.5 to 12.5) of optic fissure closure. Our new method provided greater biological specificity than classical clustering algorithms in terms of identifying more biological processes and functions related to eye development as defined by Gene Ontology at lower false discovery rates. This new methodology builds on the advantages of LCM to isolate pure phenotypic populations within complex tissues and allows improved ability to identify critical gene products expressed at lower copy number. The combination of LCM of embryonic organs, gene expression microarrays, and extracting spatial and temporal co-variations appear to be a powerful approach to understanding the gene regulatory networks that specify mammalian organogenesis.

  18. Multivariate analysis of microarray data: differential expression and differential connection.

    Science.gov (United States)

    Kiiveri, Harri T

    2011-02-01

    Typical analysis of microarray data ignores the correlation between gene expression values. In this paper we present a model for microarray data which specifically allows for correlation between genes. As a result we combine gene network ideas with linear models and differential expression. We use sparse inverse covariance matrices and their associated graphical representation to capture the notion of gene networks. An important issue in using these models is the identification of the pattern of zeroes in the inverse covariance matrix. The limitations of existing methods for doing this are discussed and we provide a workable solution for determining the zero pattern. We then consider a method for estimating the parameters in the inverse covariance matrix which is suitable for very high dimensional matrices. We also show how to construct multivariate tests of hypotheses. These overall multivariate tests can be broken down into two components, the first one being similar to tests for differential expression and the second involving the connections between genes. The methods in this paper enable the extraction of a wealth of information concerning the relationships between genes which can be conveniently represented in graphical form. Differentially expressed genes can be placed in the context of the gene network and places in the gene network where unusual or interesting patterns have emerged can be identified, leading to the formulation of hypotheses for future experimentation.

  19. Development and application of an antibody-based protein microarray to assess physiological stress in grizzly bears (Ursus arctos).

    Science.gov (United States)

    Carlson, Ruth I; Cattet, Marc R L; Sarauer, Bryan L; Nielsen, Scott E; Boulanger, John; Stenhouse, Gordon B; Janz, David M

    2016-01-01

    A novel antibody-based protein microarray was developed that simultaneously determines expression of 31 stress-associated proteins in skin samples collected from free-ranging grizzly bears (Ursus arctos) in Alberta, Canada. The microarray determines proteins belonging to four broad functional categories associated with stress physiology: hypothalamic-pituitary-adrenal axis proteins, apoptosis/cell cycle proteins, cellular stress/proteotoxicity proteins and oxidative stress/inflammation proteins. Small skin samples (50-100 mg) were collected from captured bears using biopsy punches. Proteins were isolated and labelled with fluorescent dyes, with labelled protein homogenates loaded onto microarrays to hybridize with antibodies. Relative protein expression was determined by comparison with a pooled standard skin sample. The assay was sensitive, requiring 80 µg of protein per sample to be run in triplicate on the microarray. Intra-array and inter-array coefficients of variation for individual proteins were generally bears. This suggests that remotely delivered biopsy darts could be used in future sampling. Using generalized linear mixed models, certain proteins within each functional category demonstrated altered expression with respect to differences in year, season, geographical sampling location within Alberta and bear biological parameters, suggesting that these general variables may influence expression of specific proteins in the microarray. Our goal is to apply the protein microarray as a conservation physiology tool that can detect, evaluate and monitor physiological stress in grizzly bears and other species at risk over time in response to environmental change.

  20. "Hook"-calibration of GeneChip-microarrays: Chip characteristics and expression measures

    Directory of Open Access Journals (Sweden)

    Krohn Knut

    2008-08-01

    Full Text Available Abstract Background Microarray experiments rely on several critical steps that may introduce biases and uncertainty in downstream analyses. These steps include mRNA sample extraction, amplification and labelling, hybridization, and scanning causing chip-specific systematic variations on the raw intensity level. Also the chosen array-type and the up-to-dateness of the genomic information probed on the chip affect the quality of the expression measures. In the accompanying publication we presented theory and algorithm of the so-called hook method which aims at correcting expression data for systematic biases using a series of new chip characteristics. Results In this publication we summarize the essential chip characteristics provided by this method, analyze special benchmark experiments to estimate transcript related expression measures and illustrate the potency of the method to detect and to quantify the quality of a particular hybridization. It is shown that our single-chip approach provides expression measures responding linearly on changes of the transcript concentration over three orders of magnitude. In addition, the method calculates a detection call judging the relation between the signal and the detection limit of the particular measurement. The performance of the method in the context of different chip generations and probe set assignments is illustrated. The hook method characterizes the RNA-quality in terms of the 3'/5'-amplification bias and the sample-specific calling rate. We show that the proper judgement of these effects requires the disentanglement of non-specific and specific hybridization which, otherwise, can lead to misinterpretations of expression changes. The consequences of modifying probe/target interactions by either changing the labelling protocol or by substituting RNA by DNA targets are demonstrated. Conclusion The single-chip based hook-method provides accurate expression estimates and chip-summary characteristics

  1. OpWise: Operons aid the identification of differentially expressed genes in bacterial microarray experiments

    Directory of Open Access Journals (Sweden)

    Arkin Adam P

    2006-01-01

    Full Text Available Abstract Background Differentially expressed genes are typically identified by analyzing the variation between replicate measurements. These procedures implicitly assume that there are no systematic errors in the data even though several sources of systematic error are known. Results OpWise estimates the amount of systematic error in bacterial microarray data by assuming that genes in the same operon have matching expression patterns. OpWise then performs a Bayesian analysis of a linear model to estimate significance. In simulations, OpWise corrects for systematic error and is robust to deviations from its assumptions. In several bacterial data sets, significant amounts of systematic error are present, and replicate-based approaches overstate the confidence of the changers dramatically, while OpWise does not. Finally, OpWise can identify additional changers by assigning genes higher confidence if they are consistent with other genes in the same operon. Conclusion Although microarray data can contain large amounts of systematic error, operons provide an external standard and allow for reasonable estimates of significance. OpWise is available at http://microbesonline.org/OpWise.

  2. Layered signaling regulatory networks analysis of gene expression involved in malignant tumorigenesis of non-resolving ulcerative colitis via integration of cross-study microarray profiles.

    Science.gov (United States)

    Fan, Shengjun; Pan, Zhenyu; Geng, Qiang; Li, Xin; Wang, Yefan; An, Yu; Xu, Yan; Tie, Lu; Pan, Yan; Li, Xuejun

    2013-01-01

    Ulcerative colitis (UC) was the most frequently diagnosed inflammatory bowel disease (IBD) and closely linked to colorectal carcinogenesis. By far, the underlying mechanisms associated with the disease are still unclear. With the increasing accumulation of microarray gene expression profiles, it is profitable to gain a systematic perspective based on gene regulatory networks to better elucidate the roles of genes associated with disorders. However, a major challenge for microarray data analysis is the integration of multiple-studies generated by different groups. In this study, firstly, we modeled a signaling regulatory network associated with colorectal cancer (CRC) initiation via integration of cross-study microarray expression data sets using Empirical Bayes (EB) algorithm. Secondly, a manually curated human cancer signaling map was established via comprehensive retrieval of the publicly available repositories. Finally, the co-differently-expressed genes were manually curated to portray the layered signaling regulatory networks. Overall, the remodeled signaling regulatory networks were separated into four major layers including extracellular, membrane, cytoplasm and nucleus, which led to the identification of five core biological processes and four signaling pathways associated with colorectal carcinogenesis. As a result, our biological interpretation highlighted the importance of EGF/EGFR signaling pathway, EPO signaling pathway, T cell signal transduction and members of the BCR signaling pathway, which were responsible for the malignant transition of CRC from the benign UC to the aggressive one. The present study illustrated a standardized normalization approach for cross-study microarray expression data sets. Our model for signaling networks construction was based on the experimentally-supported interaction and microarray co-expression modeling. Pathway-based signaling regulatory networks analysis sketched a directive insight into colorectal carcinogenesis

  3. Improving the scaling normalization for high-density oligonucleotide GeneChip expression microarrays

    Directory of Open Access Journals (Sweden)

    Lu Chao

    2004-07-01

    Full Text Available Abstract Background Normalization is an important step for microarray data analysis to minimize biological and technical variations. Choosing a suitable approach can be critical. The default method in GeneChip expression microarray uses a constant factor, the scaling factor (SF, for every gene on an array. The SF is obtained from a trimmed average signal of the array after excluding the 2% of the probe sets with the highest and the lowest values. Results Among the 76 U34A GeneChip experiments, the total signals on each array showed 25.8% variations in terms of the coefficient of variation, although all microarrays were hybridized with the same amount of biotin-labeled cRNA. The 2% of the probe sets with the highest signals that were normally excluded from SF calculation accounted for 34% to 54% of the total signals (40.7% ± 4.4%, mean ± sd. In comparison with normalization factors obtained from the median signal or from the mean of the log transformed signal, SF showed the greatest variation. The normalization factors obtained from log transformed signals showed least variation. Conclusions Eliminating 40% of the signal data during SF calculation failed to show any benefit. Normalization factors obtained with log transformed signals performed the best. Thus, it is suggested to use the mean of the logarithm transformed data for normalization, rather than the arithmetic mean of signals in GeneChip gene expression microarrays.

  4. Implementation of plaid model biclustering method on microarray of carcinoma and adenoma tumor gene expression data

    Science.gov (United States)

    Ardaneswari, Gianinna; Bustamam, Alhadi; Sarwinda, Devvi

    2017-10-01

    A Tumor is an abnormal growth of cells that serves no purpose. Carcinoma is a tumor that grows from the top of the cell membrane and the organ adenoma is a benign tumor of the gland-like cells or epithelial tissue. In the field of molecular biology, the development of microarray technology is used in the data store of disease genetic expression. For each of microarray gene, an amount of information is stored for each trait or condition. In gene expression data clustering can be done with a bicluster algorithm, thats clustering method which not only the objects to be clustered, but also the properties or condition of the object. This research proposed Plaid Model Biclustering as one of biclustering method. In this study, we discuss the implementation of Plaid Model Biclustering Method on microarray of Carcinoma and Adenoma tumor gene expression data. From the experimental results, we found three biclusters are formed by Carcinoma gene expression data and four biclusters are formed by Adenoma gene expression data.

  5. Complete gene expression profiling of Saccharopolyspora erythraea using GeneChip DNA microarrays

    Directory of Open Access Journals (Sweden)

    Bordoni Roberta

    2007-11-01

    Full Text Available Abstract Background The Saccharopolyspora erythraea genome sequence, recently published, presents considerable divergence from those of streptomycetes in gene organization and function, confirming the remarkable potential of S. erythraea for producing many other secondary metabolites in addition to erythromycin. In order to investigate, at whole transcriptome level, how S. erythraea genes are modulated, a DNA microarray was specifically designed and constructed on the S. erythraea strain NRRL 2338 genome sequence, and the expression profiles of 6494 ORFs were monitored during growth in complex liquid medium. Results The transcriptional analysis identified a set of 404 genes, whose transcriptional signals vary during growth and characterize three distinct phases: a rapid growth until 32 h (Phase A; a growth slowdown until 52 h (Phase B; and another rapid growth phase from 56 h to 72 h (Phase C before the cells enter the stationary phase. A non-parametric statistical method, that identifies chromosomal regions with transcriptional imbalances, determined regional organization of transcription along the chromosome, highlighting differences between core and non-core regions, and strand specific patterns of expression. Microarray data were used to characterize the temporal behaviour of major functional classes and of all the gene clusters for secondary metabolism. The results confirmed that the ery cluster is up-regulated during Phase A and identified six additional clusters (for terpenes and non-ribosomal peptides that are clearly regulated in later phases. Conclusion The use of a S. erythraea DNA microarray improved specificity and sensitivity of gene expression analysis, allowing a global and at the same time detailed picture of how S. erythraea genes are modulated. This work underlines the importance of using DNA microarrays, coupled with an exhaustive statistical and bioinformatic analysis of the results, to understand the transcriptional

  6. Comparison of small n statistical tests of differential expression applied to microarrays

    Directory of Open Access Journals (Sweden)

    Lee Anna Y

    2009-02-01

    Full Text Available Abstract Background DNA microarrays provide data for genome wide patterns of expression between observation classes. Microarray studies often have small samples sizes, however, due to cost constraints or specimen availability. This can lead to poor random error estimates and inaccurate statistical tests of differential expression. We compare the performance of the standard t-test, fold change, and four small n statistical test methods designed to circumvent these problems. We report results of various normalization methods for empirical microarray data and of various random error models for simulated data. Results Three Empirical Bayes methods (CyberT, BRB, and limma t-statistics were the most effective statistical tests across simulated and both 2-colour cDNA and Affymetrix experimental data. The CyberT regularized t-statistic in particular was able to maintain expected false positive rates with simulated data showing high variances at low gene intensities, although at the cost of low true positive rates. The Local Pooled Error (LPE test introduced a bias that lowered false positive rates below theoretically expected values and had lower power relative to the top performers. The standard two-sample t-test and fold change were also found to be sub-optimal for detecting differentially expressed genes. The generalized log transformation was shown to be beneficial in improving results with certain data sets, in particular high variance cDNA data. Conclusion Pre-processing of data influences performance and the proper combination of pre-processing and statistical testing is necessary for obtaining the best results. All three Empirical Bayes methods assessed in our study are good choices for statistical tests for small n microarray studies for both Affymetrix and cDNA data. Choice of method for a particular study will depend on software and normalization preferences.

  7. Comparison of microarray platforms for measuring differential microRNA expression in paired normal/cancer colon tissues.

    Directory of Open Access Journals (Sweden)

    Maurizio Callari

    Full Text Available BACKGROUND: Microarray technology applied to microRNA (miRNA profiling is a promising tool in many research fields; nevertheless, independent studies characterizing the same pathology have often reported poorly overlapping results. miRNA analysis methods have only recently been systematically compared but only in few cases using clinical samples. METHODOLOGY/PRINCIPAL FINDINGS: We investigated the inter-platform reproducibility of four miRNA microarray platforms (Agilent, Exiqon, Illumina, and Miltenyi, comparing nine paired tumor/normal colon tissues. The most concordant and selected discordant miRNAs were further studied by quantitative RT-PCR. Globally, a poor overlap among differentially expressed miRNAs identified by each platform was found. Nevertheless, for eight miRNAs high agreement in differential expression among the four platforms and comparability to qRT-PCR was observed. Furthermore, most of the miRNA sets identified by each platform are coherently enriched in data from the other platforms and the great majority of colon cancer associated miRNA sets derived from the literature were validated in our data, independently from the platform. Computational integration of miRNA and gene expression profiles suggested that anti-correlated predicted target genes of differentially expressed miRNAs are commonly enriched in cancer-related pathways and in genes involved in glycolysis and nutrient transport. CONCLUSIONS: Technical and analytical challenges in measuring miRNAs still remain and further research is required in order to increase consistency between different microarray-based methodologies. However, a better inter-platform agreement was found by looking at miRNA sets instead of single miRNAs and through a miRNAs - gene expression integration approach.

  8. Radioactive cDNA microarray (II): Gene expression profiling of antidepressant treatment by human cDNA microarray

    Energy Technology Data Exchange (ETDEWEB)

    Lee, Ji Hye; Kang, Rhee Hun; Ham, Byung Joo; Lee, Min Su; Shin, Kyung Ho; Choe, Jae Gol; Kim, Meyoung Kon [College of Medicine, Univ. of Korea, Seoul (Korea, Republic of)

    2003-07-01

    Major depressive disorder is a prevalent psychiatric disorder in primary care, associated with impaired patient functioning and well-being. Fluoxetine is a selective serotonin-reuptake inhibitors (SSRIs) and is a commonly prescribed antidepressant compound. Its action is primarily attributed to selective inhibition of the reuptake of serotonin (5-hydroxytryptamine) in the central nervous system. Objectives ; the aims of this study were two-fold: (1) to determine the usefulness for investigation of the transcription profiles in depression patients, and (2) to assess the differences in gene expression profiles between positive response group and negative response groups by fluoxetine treatment. This study included 53 patients with major depression (26 in positive response group with antidepressant treatment, 27 in negative response group with antidepressant treatment), and 53 healthy controls. To examine the difference of gene expression profile in depression patients, radioactive complementary DNA microarrays were used to evaluate changes in the expression of 1,152 genes in total. Using 33p-labeled probes, this method provided highly sensitive gene expression profiles including brain receptors, drug metabolism, and cellular signaling. Gene transcription profiles were classified into several categories in accordance with the antidepressant gene-regulation. The gene profiles were significantly up-(22 genes) and down-(16 genes) regulated in the positive response group when compared to the control group. Also, in the negative response group, 35 genes were up-regulated and 8 genes were down-regulated when compared to the control group. Consequently, we demonstrated that radioactive human cDNA microarray is highly likely to be an efficient technology for evaluating the gene regulation of antidepressants, such as selective serotonin-reuptake inhibitors (SSRIs), by using high-throughput biotechnology.

  9. Radioactive cDNA microarray (II): Gene expression profiling of antidepressant treatment by human cDNA microarray

    International Nuclear Information System (INIS)

    Lee, Ji Hye; Kang, Rhee Hun; Ham, Byung Joo; Lee, Min Su; Shin, Kyung Ho; Choe, Jae Gol; Kim, Meyoung Kon

    2003-01-01

    Major depressive disorder is a prevalent psychiatric disorder in primary care, associated with impaired patient functioning and well-being. Fluoxetine is a selective serotonin-reuptake inhibitors (SSRIs) and is a commonly prescribed antidepressant compound. Its action is primarily attributed to selective inhibition of the reuptake of serotonin (5-hydroxytryptamine) in the central nervous system. Objectives ; the aims of this study were two-fold: (1) to determine the usefulness for investigation of the transcription profiles in depression patients, and (2) to assess the differences in gene expression profiles between positive response group and negative response groups by fluoxetine treatment. This study included 53 patients with major depression (26 in positive response group with antidepressant treatment, 27 in negative response group with antidepressant treatment), and 53 healthy controls. To examine the difference of gene expression profile in depression patients, radioactive complementary DNA microarrays were used to evaluate changes in the expression of 1,152 genes in total. Using 33p-labeled probes, this method provided highly sensitive gene expression profiles including brain receptors, drug metabolism, and cellular signaling. Gene transcription profiles were classified into several categories in accordance with the antidepressant gene-regulation. The gene profiles were significantly up-(22 genes) and down-(16 genes) regulated in the positive response group when compared to the control group. Also, in the negative response group, 35 genes were up-regulated and 8 genes were down-regulated when compared to the control group. Consequently, we demonstrated that radioactive human cDNA microarray is highly likely to be an efficient technology for evaluating the gene regulation of antidepressants, such as selective serotonin-reuptake inhibitors (SSRIs), by using high-throughput biotechnology

  10. Evaluation of Different Normalization and Analysis Procedures for Illumina Gene Expression Microarray Data Involving Small Changes

    Science.gov (United States)

    Johnstone, Daniel M.; Riveros, Carlos; Heidari, Moones; Graham, Ross M.; Trinder, Debbie; Berretta, Regina; Olynyk, John K.; Scott, Rodney J.; Moscato, Pablo; Milward, Elizabeth A.

    2013-01-01

    While Illumina microarrays can be used successfully for detecting small gene expression changes due to their high degree of technical replicability, there is little information on how different normalization and differential expression analysis strategies affect outcomes. To evaluate this, we assessed concordance across gene lists generated by applying different combinations of normalization strategy and analytical approach to two Illumina datasets with modest expression changes. In addition to using traditional statistical approaches, we also tested an approach based on combinatorial optimization. We found that the choice of both normalization strategy and analytical approach considerably affected outcomes, in some cases leading to substantial differences in gene lists and subsequent pathway analysis results. Our findings suggest that important biological phenomena may be overlooked when there is a routine practice of using only one approach to investigate all microarray datasets. Analytical artefacts of this kind are likely to be especially relevant for datasets involving small fold changes, where inherent technical variation—if not adequately minimized by effective normalization—may overshadow true biological variation. This report provides some basic guidelines for optimizing outcomes when working with Illumina datasets involving small expression changes. PMID:27605185

  11. Microarray-based screening of heat shock protein inhibitors.

    Science.gov (United States)

    Schax, Emilia; Walter, Johanna-Gabriela; Märzhäuser, Helene; Stahl, Frank; Scheper, Thomas; Agard, David A; Eichner, Simone; Kirschning, Andreas; Zeilinger, Carsten

    2014-06-20

    Based on the importance of heat shock proteins (HSPs) in diseases such as cancer, Alzheimer's disease or malaria, inhibitors of these chaperons are needed. Today's state-of-the-art techniques to identify HSP inhibitors are performed in microplate format, requiring large amounts of proteins and potential inhibitors. In contrast, we have developed a miniaturized protein microarray-based assay to identify novel inhibitors, allowing analysis with 300 pmol of protein. The assay is based on competitive binding of fluorescence-labeled ATP and potential inhibitors to the ATP-binding site of HSP. Therefore, the developed microarray enables the parallel analysis of different ATP-binding proteins on a single microarray. We have demonstrated the possibility of multiplexing by immobilizing full-length human HSP90α and HtpG of Helicobacter pylori on microarrays. Fluorescence-labeled ATP was competed by novel geldanamycin/reblastatin derivatives with IC50 values in the range of 0.5 nM to 4 μM and Z(*)-factors between 0.60 and 0.96. Our results demonstrate the potential of a target-oriented multiplexed protein microarray to identify novel inhibitors for different members of the HSP90 family. Copyright © 2014 Elsevier B.V. All rights reserved.

  12. Microarray analysis of the gene expression profile in triethylene glycol dimethacrylate-treated human dental pulp cells.

    Science.gov (United States)

    Torun, D; Torun, Z Ö; Demirkaya, K; Sarper, M; Elçi, M P; Avcu, F

    2017-11-01

    Triethylene glycol dimethacrylate (TEGDMA) is an important resin monomer commonly used in the structure of dental restorative materials. Recent studies have shown that unpolymerized resin monomers may be released into the oral environment and cause harmful biological effects. We investigated changes in the gene expression profiles of TEGDMA-treated human dental pulp cells (hDPCs) following short- (1-day) and long-term (7-days) exposure. HDPCs were exposed to a noncytotoxic concentration of TEGDMA, and gene expression profiles were evaluated by microarray analysis. The results were confirmed by quantitative reverse-transcriptase PCR (qRT PCR). In total, 1282 and 1319 genes (up- or down-regulated) were differentially expressed compared with control group after the 1- and 7-day incubation periods, respectively. Biological ontology-based analyses revealed that metabolic, cellular, and developmental processes constituted the largest groups of biological functional processes. qRT-PCR analysis on bone morphogenetic protein-2 (BMP-2), BMP-4, secreted protein, acidic, cysteine-rich, collagen type I alpha 1, oxidative stress-induced growth inhibitor 1, MMP3, interleukin-6, and heme oxygenase-1 genes confirmed the changes in expression observed in the microarray analysis. Our results suggest that TEGDMA can change the many functions of hDPCs through large changes in gene expression levels and complex interactions with different signaling pathways.

  13. Multivariate analysis of microarray data: differential expression and differential connection

    Directory of Open Access Journals (Sweden)

    Kiiveri Harri T

    2011-02-01

    Full Text Available Abstract Background Typical analysis of microarray data ignores the correlation between gene expression values. In this paper we present a model for microarray data which specifically allows for correlation between genes. As a result we combine gene network ideas with linear models and differential expression. Results We use sparse inverse covariance matrices and their associated graphical representation to capture the notion of gene networks. An important issue in using these models is the identification of the pattern of zeroes in the inverse covariance matrix. The limitations of existing methods for doing this are discussed and we provide a workable solution for determining the zero pattern. We then consider a method for estimating the parameters in the inverse covariance matrix which is suitable for very high dimensional matrices. We also show how to construct multivariate tests of hypotheses. These overall multivariate tests can be broken down into two components, the first one being similar to tests for differential expression and the second involving the connections between genes. Conclusion The methods in this paper enable the extraction of a wealth of information concerning the relationships between genes which can be conveniently represented in graphical form. Differentially expressed genes can be placed in the context of the gene network and places in the gene network where unusual or interesting patterns have emerged can be identified, leading to the formulation of hypotheses for future experimentation.

  14. Kernel-imbedded Gaussian processes for disease classification using microarray gene expression data

    Directory of Open Access Journals (Sweden)

    Cheung Leo

    2007-02-01

    Full Text Available Abstract Background Designing appropriate machine learning methods for identifying genes that have a significant discriminating power for disease outcomes has become more and more important for our understanding of diseases at genomic level. Although many machine learning methods have been developed and applied to the area of microarray gene expression data analysis, the majority of them are based on linear models, which however are not necessarily appropriate for the underlying connection between the target disease and its associated explanatory genes. Linear model based methods usually also bring in false positive significant features more easily. Furthermore, linear model based algorithms often involve calculating the inverse of a matrix that is possibly singular when the number of potentially important genes is relatively large. This leads to problems of numerical instability. To overcome these limitations, a few non-linear methods have recently been introduced to the area. Many of the existing non-linear methods have a couple of critical problems, the model selection problem and the model parameter tuning problem, that remain unsolved or even untouched. In general, a unified framework that allows model parameters of both linear and non-linear models to be easily tuned is always preferred in real-world applications. Kernel-induced learning methods form a class of approaches that show promising potentials to achieve this goal. Results A hierarchical statistical model named kernel-imbedded Gaussian process (KIGP is developed under a unified Bayesian framework for binary disease classification problems using microarray gene expression data. In particular, based on a probit regression setting, an adaptive algorithm with a cascading structure is designed to find the appropriate kernel, to discover the potentially significant genes, and to make the optimal class prediction accordingly. A Gibbs sampler is built as the core of the algorithm to make

  15. Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes

    Directory of Open Access Journals (Sweden)

    Paules Richard S

    2007-11-01

    Full Text Available Abstract Background A common observation in the analysis of gene expression data is that many genes display similarity in their expression patterns and therefore appear to be co-regulated. However, the variation associated with microarray data and the complexity of the experimental designs make the acquisition of co-expressed genes a challenge. We developed a novel method for Extracting microarray gene expression Patterns and Identifying co-expressed Genes, designated as EPIG. The approach utilizes the underlying structure of gene expression data to extract patterns and identify co-expressed genes that are responsive to experimental conditions. Results Through evaluation of the correlations among profiles, the magnitude of variation in gene expression profiles, and profile signal-to-noise ratio's, EPIG extracts a set of patterns representing co-expressed genes. The method is shown to work well with a simulated data set and microarray data obtained from time-series studies of dauer recovery and L1 starvation in C. elegans and after ultraviolet (UV or ionizing radiation (IR-induced DNA damage in diploid human fibroblasts. With the simulated data set, EPIG extracted the appropriate number of patterns which were more stable and homogeneous than the set of patterns that were determined using the CLICK or CAST clustering algorithms. However, CLICK performed better than EPIG and CAST with respect to the average correlation between clusters/patterns of the simulated data. With real biological data, EPIG extracted more dauer-specific patterns than CLICK. Furthermore, analysis of the IR/UV data revealed 18 unique patterns and 2661 genes out of approximately 17,000 that were identified as significantly expressed and categorized to the patterns by EPIG. The time-dependent patterns displayed similar and dissimilar responses between IR and UV treatments. Gene Ontology analysis applied to each pattern-related subset of co-expressed genes revealed underlying

  16. Evaluation of gene expression data generated from expired Affymetrix GeneChip® microarrays using MAQC reference RNA samples

    Directory of Open Access Journals (Sweden)

    Tong Weida

    2010-10-01

    Full Text Available Abstract Background The Affymetrix GeneChip® system is a commonly used platform for microarray analysis but the technology is inherently expensive. Unfortunately, changes in experimental planning and execution, such as the unavailability of previously anticipated samples or a shift in research focus, may render significant numbers of pre-purchased GeneChip® microarrays unprocessed before their manufacturer’s expiration dates. Researchers and microarray core facilities wonder whether expired microarrays are still useful for gene expression analysis. In addition, it was not clear whether the two human reference RNA samples established by the MAQC project in 2005 still maintained their transcriptome integrity over a period of four years. Experiments were conducted to answer these questions. Results Microarray data were generated in 2009 in three replicates for each of the two MAQC samples with either expired Affymetrix U133A or unexpired U133Plus2 microarrays. These results were compared with data obtained in 2005 on the U133Plus2 microarray. The percentage of overlap between the lists of differentially expressed genes (DEGs from U133Plus2 microarray data generated in 2009 and in 2005 was 97.44%. While there was some degree of fold change compression in the expired U133A microarrays, the percentage of overlap between the lists of DEGs from the expired and unexpired microarrays was as high as 96.99%. Moreover, the microarray data generated using the expired U133A microarrays in 2009 were highly concordant with microarray and TaqMan® data generated by the MAQC project in 2005. Conclusions Our results demonstrated that microarray data generated using U133A microarrays, which were more than four years past the manufacturer’s expiration date, were highly specific and consistent with those from unexpired microarrays in identifying DEGs despite some appreciable fold change compression and decrease in sensitivity. Our data also suggested that the

  17. Annotating breast cancer microarray samples using ontologies

    Science.gov (United States)

    Liu, Hongfang; Li, Xin; Yoon, Victoria; Clarke, Robert

    2008-01-01

    As the most common cancer among women, breast cancer results from the accumulation of mutations in essential genes. Recent advance in high-throughput gene expression microarray technology has inspired researchers to use the technology to assist breast cancer diagnosis, prognosis, and treatment prediction. However, the high dimensionality of microarray experiments and public access of data from many experiments have caused inconsistencies which initiated the development of controlled terminologies and ontologies for annotating microarray experiments, such as the standard microarray Gene Expression Data (MGED) ontology (MO). In this paper, we developed BCM-CO, an ontology tailored specifically for indexing clinical annotations of breast cancer microarray samples from the NCI Thesaurus. Our research showed that the coverage of NCI Thesaurus is very limited with respect to i) terms used by researchers to describe breast cancer histology (covering 22 out of 48 histology terms); ii) breast cancer cell lines (covering one out of 12 cell lines); and iii) classes corresponding to the breast cancer grading and staging. By incorporating a wider range of those terms into BCM-CO, we were able to indexed breast cancer microarray samples from GEO using BCM-CO and MGED ontology and developed a prototype system with web interface that allows the retrieval of microarray data based on the ontology annotations. PMID:18999108

  18. MicroArray Facility: a laboratory information management system with extended support for Nylon based technologies

    Directory of Open Access Journals (Sweden)

    Beaudoing Emmanuel

    2006-09-01

    Full Text Available Abstract Background High throughput gene expression profiling (GEP is becoming a routine technique in life science laboratories. With experimental designs that repeatedly span thousands of genes and hundreds of samples, relying on a dedicated database infrastructure is no longer an option. GEP technology is a fast moving target, with new approaches constantly broadening the field diversity. This technology heterogeneity, compounded by the informatics complexity of GEP databases, means that software developments have so far focused on mainstream techniques, leaving less typical yet established techniques such as Nylon microarrays at best partially supported. Results MAF (MicroArray Facility is the laboratory database system we have developed for managing the design, production and hybridization of spotted microarrays. Although it can support the widely used glass microarrays and oligo-chips, MAF was designed with the specific idiosyncrasies of Nylon based microarrays in mind. Notably single channel radioactive probes, microarray stripping and reuse, vector control hybridizations and spike-in controls are all natively supported by the software suite. MicroArray Facility is MIAME supportive and dynamically provides feedback on missing annotations to help users estimate effective MIAME compliance. Genomic data such as clone identifiers and gene symbols are also directly annotated by MAF software using standard public resources. The MAGE-ML data format is implemented for full data export. Journalized database operations (audit tracking, data anonymization, material traceability and user/project level confidentiality policies are also managed by MAF. Conclusion MicroArray Facility is a complete data management system for microarray producers and end-users. Particular care has been devoted to adequately model Nylon based microarrays. The MAF system, developed and implemented in both private and academic environments, has proved a robust solution for

  19. Production of DNA microarray and expression analysis of genes from Xylella fastidiosa in different culture media

    Directory of Open Access Journals (Sweden)

    Regiane de Fátima Travensolo

    2009-06-01

    Full Text Available DNA Microarray was developed to monitor the expression of many genes from Xylella fastidiosa, allowing the side by-side comparison of two situations in a single experiment. The experiments were performed using X. fastidiosa cells grown in two culture media: BCYE and XDM2. The primers were synthesized, spotted onto glass slides and the array was hybridized against fluorescently labeled cDNAs. The emitted signals were quantified, normalized and the data were statistically analyzed to verify the differentially expressed genes. According to the data, 104 genes were differentially expressed in XDM2 and 30 genes in BCYE media. The present study showed that DNA microarray technique efficiently differentiate the expressed genes under different conditions.DNA Microarray foi desenvolvida para monitorar a expressão de muitos genes de Xylella fastidiosa, permitindo a comparação de duas situações distintas em um único experimento. Os experimentos foram feitos utilizando células de X. fastidiosa cultivada em dois meios de cultura: BCYE e XDM2. Pares de oligonucleotídeos iniciadores foram sintetizados, depositados em lâminas de vidro e o arranjo foi hibridizado contra cDNAs marcados fluorescentemente. Os sinais emitidos foram quantificados, normalizados e os dados foram estatisticamente analisados para verificar os genes diferencialmente expressos. De acordo com nossos dados, 104 genes foram diferencialmente expressos para o meio de cultura XDM2 e 30 genes para o BCYE. No presente estudo, nós demonstramos que a técnica de DNA microarrays eficientemente diferencia genes expressos sob diferentes condições de cultivo.

  20. Expression profiling of cell cycle regulatory proteins in oropharyngeal carcinomas using tissue microarrays.

    Science.gov (United States)

    Ribeiro, Daniel A; Nascimento, Fabio D; Fracalossi, Ana Carolina C; Gomes, Thiago S; Oshima, Celina T F; Franco, Marcello F

    2010-01-01

    The aim of this study was to investigate the expressions of cell cycle regulatory proteins such as p53, p16, p21, and Rb in squamous cell carcinoma of the oropharynx and their relation to histological differentiation, staging of disease, and prognosis. Paraffin blocks from 21 primary tumors were obtained from archives of the Department of Pathology, Paulista Medical School, Federal University of Sao Paulo, UNIFESP/EPM. Immunohistochemistry was used to detect the expression of p53, p16, p21, and Rb by means of tissue microarrays. Expression of p53, p21, p16 and Rb was not correlated with the stage of disease, histopathological grading or recurrence in squamous cell carcinoma of the oropharynx. Taken together, our results suggest that p53, p16, p21 and Rb are not reliable biomarkers for prognosis of the tumor severity or recurrence in squamous cell carcinoma of the oropharynx as depicted by tissue microarrays and immunohistochemistry.

  1. Radioactive cDNA microarray in neurospsychiatry

    International Nuclear Information System (INIS)

    Choe, Jae Gol; Shin, Kyung Ho; Lee, Min Soo; Kim, Meyoung Kon

    2003-01-01

    Microarray technology allows the simultaneous analysis of gene expression patterns of thousands of genes, in a systematic fashion, under a similar set of experimental conditions, thus making the data highly comparable. In some cases arrays are used simply as a primary screen leading to downstream molecular characterization of individual gene candidates. In other cases, the goal of expression profiling is to begin to identify complex regulatory networks underlying developmental processes and disease states. Microarrays were originally used with cell lines or other simple model systems. More recently, microarrays have been used in the analysis of more complex biological tissues including neural systems and the brain. The application of cDNA arrays in neuropsychiatry has lagged behind other fields for a number of reasons. These include a requirement for a large amount of input probe RNA in fluorescent-glass based array systems and the cellular complexity introduced by multicellular brain and neural tissues. An additional factor that impacts the general use of microarrays in neuropsychiatry is the lack of availability of sequenced clone sets from model systems. While human cDNA clones have been widely available, high quality rat, mouse, and drosophilae, among others are just becoming widely available. A final factor in the application of cDNA microarrays in neuropsychiatry is cost of commercial arrays. As academic microarray facilitates become more commonplace custom made arrays will become more widely available at a lower cost allowing more widespread applications. In summary, microarray technology is rapidly having an impact on many areas of biomedical research. Radioisotope-nylon based microarrays offer alternatives that may in some cases be more sensitive, flexible, inexpensive, and universal as compared to other array formats, such as fluorescent-glass arrays. In some situations of limited RNA or exotic species, radioactive membrane microarrays may be the most

  2. Radioactive cDNA microarray in neurospsychiatry

    Energy Technology Data Exchange (ETDEWEB)

    Choe, Jae Gol; Shin, Kyung Ho; Lee, Min Soo; Kim, Meyoung Kon [Korea University Medical School, Seoul (Korea, Republic of)

    2003-02-01

    Microarray technology allows the simultaneous analysis of gene expression patterns of thousands of genes, in a systematic fashion, under a similar set of experimental conditions, thus making the data highly comparable. In some cases arrays are used simply as a primary screen leading to downstream molecular characterization of individual gene candidates. In other cases, the goal of expression profiling is to begin to identify complex regulatory networks underlying developmental processes and disease states. Microarrays were originally used with cell lines or other simple model systems. More recently, microarrays have been used in the analysis of more complex biological tissues including neural systems and the brain. The application of cDNA arrays in neuropsychiatry has lagged behind other fields for a number of reasons. These include a requirement for a large amount of input probe RNA in fluorescent-glass based array systems and the cellular complexity introduced by multicellular brain and neural tissues. An additional factor that impacts the general use of microarrays in neuropsychiatry is the lack of availability of sequenced clone sets from model systems. While human cDNA clones have been widely available, high quality rat, mouse, and drosophilae, among others are just becoming widely available. A final factor in the application of cDNA microarrays in neuropsychiatry is cost of commercial arrays. As academic microarray facilitates become more commonplace custom made arrays will become more widely available at a lower cost allowing more widespread applications. In summary, microarray technology is rapidly having an impact on many areas of biomedical research. Radioisotope-nylon based microarrays offer alternatives that may in some cases be more sensitive, flexible, inexpensive, and universal as compared to other array formats, such as fluorescent-glass arrays. In some situations of limited RNA or exotic species, radioactive membrane microarrays may be the most

  3. A microarray analysis of sex- and gonad-biased gene expression in the zebrafish: Evidence for masculinization of the transcriptome

    Directory of Open Access Journals (Sweden)

    Mo Qianxing

    2009-12-01

    Full Text Available Abstract Background In many taxa, males and females are very distinct phenotypically, and these differences often reflect divergent selective pressures acting on the sexes. Phenotypic sexual dimorphism almost certainly reflects differing patterns of gene expression between the sexes, and microarray studies have documented widespread sexually dimorphic gene expression. Although the evolutionary significance of sexual dimorphism in gene expression remains unresolved, these studies have led to the formulation of a hypothesis that male-driven evolution has resulted in the masculinization of animal transcriptomes. Here we use a microarray assessment of sex- and gonad-biased gene expression to test this hypothesis in zebrafish. Results By using zebrafish Affymetrix microarrays to compare gene expression patterns in male and female somatic and gonadal tissues, we identified a large number of genes (5899 demonstrating differences in transcript abundance between male and female Danio rerio. Under conservative statistical significance criteria, all sex-biases in gene expression were due to differences between testes and ovaries. Male-enriched genes were more abundant than female-enriched genes, and expression bias for male-enriched genes was greater in magnitude than that for female-enriched genes. We also identified a large number of genes demonstrating elevated transcript abundance in testes and ovaries relative to male body and female body, respectively. Conclusion Overall our results support the hypothesis that male-biased evolutionary pressures have resulted in male-biased patterns of gene expression. Interestingly, our results seem to be at odds with a handful of other microarray-based studies of sex-specific gene expression patterns in zebrafish. However, ours was the only study designed to address this specific hypothesis, and major methodological differences among studies could explain the discrepancies. Regardless, all of these studies agree

  4. Microarray Я US: a user-friendly graphical interface to Bioconductor tools that enables accurate microarray data analysis and expedites comprehensive functional analysis of microarray results.

    Science.gov (United States)

    Dai, Yilin; Guo, Ling; Li, Meng; Chen, Yi-Bu

    2012-06-08

    Microarray data analysis presents a significant challenge to researchers who are unable to use the powerful Bioconductor and its numerous tools due to their lack of knowledge of R language. Among the few existing software programs that offer a graphic user interface to Bioconductor packages, none have implemented a comprehensive strategy to address the accuracy and reliability issue of microarray data analysis due to the well known probe design problems associated with many widely used microarray chips. There is also a lack of tools that would expedite the functional analysis of microarray results. We present Microarray Я US, an R-based graphical user interface that implements over a dozen popular Bioconductor packages to offer researchers a streamlined workflow for routine differential microarray expression data analysis without the need to learn R language. In order to enable a more accurate analysis and interpretation of microarray data, we incorporated the latest custom probe re-definition and re-annotation for Affymetrix and Illumina chips. A versatile microarray results output utility tool was also implemented for easy and fast generation of input files for over 20 of the most widely used functional analysis software programs. Coupled with a well-designed user interface, Microarray Я US leverages cutting edge Bioconductor packages for researchers with no knowledge in R language. It also enables a more reliable and accurate microarray data analysis and expedites downstream functional analysis of microarray results.

  5. mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling

    Directory of Open Access Journals (Sweden)

    Hala Alshamlan

    2015-01-01

    Full Text Available An artificial bee colony (ABC is a relatively recent swarm intelligence optimization approach. In this paper, we propose the first attempt at applying ABC algorithm in analyzing a microarray gene expression profile. In addition, we propose an innovative feature selection algorithm, minimum redundancy maximum relevance (mRMR, and combine it with an ABC algorithm, mRMR-ABC, to select informative genes from microarray profile. The new approach is based on a support vector machine (SVM algorithm to measure the classification accuracy for selected genes. We evaluate the performance of the proposed mRMR-ABC algorithm by conducting extensive experiments on six binary and multiclass gene expression microarray datasets. Furthermore, we compare our proposed mRMR-ABC algorithm with previously known techniques. We reimplemented two of these techniques for the sake of a fair comparison using the same parameters. These two techniques are mRMR when combined with a genetic algorithm (mRMR-GA and mRMR when combined with a particle swarm optimization algorithm (mRMR-PSO. The experimental results prove that the proposed mRMR-ABC algorithm achieves accurate classification performance using small number of predictive genes when tested using both datasets and compared to previously suggested methods. This shows that mRMR-ABC is a promising approach for solving gene selection and cancer classification problems.

  6. mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling.

    Science.gov (United States)

    Alshamlan, Hala; Badr, Ghada; Alohali, Yousef

    2015-01-01

    An artificial bee colony (ABC) is a relatively recent swarm intelligence optimization approach. In this paper, we propose the first attempt at applying ABC algorithm in analyzing a microarray gene expression profile. In addition, we propose an innovative feature selection algorithm, minimum redundancy maximum relevance (mRMR), and combine it with an ABC algorithm, mRMR-ABC, to select informative genes from microarray profile. The new approach is based on a support vector machine (SVM) algorithm to measure the classification accuracy for selected genes. We evaluate the performance of the proposed mRMR-ABC algorithm by conducting extensive experiments on six binary and multiclass gene expression microarray datasets. Furthermore, we compare our proposed mRMR-ABC algorithm with previously known techniques. We reimplemented two of these techniques for the sake of a fair comparison using the same parameters. These two techniques are mRMR when combined with a genetic algorithm (mRMR-GA) and mRMR when combined with a particle swarm optimization algorithm (mRMR-PSO). The experimental results prove that the proposed mRMR-ABC algorithm achieves accurate classification performance using small number of predictive genes when tested using both datasets and compared to previously suggested methods. This shows that mRMR-ABC is a promising approach for solving gene selection and cancer classification problems.

  7. Microarray-based screening of differentially expressed genes in glucocorticoid-induced avascular necrosis

    Science.gov (United States)

    Huang, Gangyong; Wei, Yibing; Zhao, Guanglei; Xia, Jun; Wang, Siqun; Wu, Jianguo; Chen, Feiyan; Chen, Jie; Shi, Jingshen

    2017-01-01

    The underlying mechanisms of glucocorticoid (GC)-induced avascular necrosis of the femoral head (ANFH) have yet to be fully understood, in particular the mechanisms associated with the change of gene expression pattern. The present study aimed to identify key genes with a differential expression pattern in GC-induced ANFH. E-MEXP-2751 microarray data were downloaded from the ArrayExpress database. Differentially expressed genes (DEGs) were identified in 5 femoral head samples of steroid-induced ANFH rats compared with 5 placebo-treated rat samples. Gene Ontology (GO) and pathway enrichment analyses were performed upon these DEGs. A total 93 DEGs (46 upregulated and 47 downregulated genes) were identified in GC-induced ANFH samples. These DEGs were enriched in different GO terms and pathways, including chondrocyte differentiation and detection of chemical stimuli. The enrichment map revealed that skeletal system development was interconnected with several other GO terms by gene overlap. The literature mined network analysis revealed that 5 upregulated genes were associated with femoral necrosis, including parathyroid hormone receptor 1 (PTHR1), vitamin D (1,25-Dihydroxyvitamin D3) receptor (VDR), collagen, type II, α1, proprotein convertase subtilisin/kexin type 6 and zinc finger protein 354C (ZFP354C). In addition, ZFP354C and VDR were identified to transcription factors. Furthermore, PTHR1 was revealed to interact with VDR, and α-2-macroglobulin (A2M) interacted with fibronectin 1 (FN1) in the PPI network. PTHR1 may be involved in GC-induced ANFH via interacting with VDR. A2M may also be involved in the development of GC-induced ANFH through interacting with FN1. An improved understanding of the molecular mechanisms underlying GC-induced ANFH may provide novel targets for diagnostics and therapeutic treatment. PMID:28393228

  8. Development of a porcine skeletal muscle cDNA microarray: analysis of differential transcript expression in phenotypically distinct muscles

    Directory of Open Access Journals (Sweden)

    Stear Michael

    2003-03-01

    Full Text Available Abstract Background Microarray profiling has the potential to illuminate the molecular processes that govern the phenotypic characteristics of porcine skeletal muscles, such as hypertrophy or atrophy, and the expression of specific fibre types. This information is not only important for understanding basic muscle biology but also provides underpinning knowledge for enhancing the efficiency of livestock production. Results We report on the de novo development of a composite skeletal muscle cDNA microarray, comprising 5500 clones from two developmentally distinct cDNA libraries (longissimus dorsi of a 50-day porcine foetus and the gastrocnemius of a 3-day-old pig. Clones selected for the microarray assembly were of low to moderate abundance, as indicated by colony hybridisation. We profiled the differential expression of genes between the psoas (red muscle and the longissimus dorsi (white muscle, by co-hybridisation of Cy3 and Cy5 labelled cDNA derived from these two muscles. Results from seven microarray slides (replicates correctly identified genes that were expected to be differentially expressed, as well as a number of novel candidate regulatory genes. Quantitative real-time RT-PCR on selected genes was used to confirm the results from the microarray. Conclusion We have developed a porcine skeletal muscle cDNA microarray and have identified a number of candidate genes that could be involved in muscle phenotype determination, including several members of the casein kinase 2 signalling pathway.

  9. Not proper ROC curves as new tool for the analysis of differentially expressed genes in microarray experiments

    Directory of Open Access Journals (Sweden)

    Pistoia Vito

    2008-10-01

    Full Text Available Abstract Background Most microarray experiments are carried out with the purpose of identifying genes whose expression varies in relation with specific conditions or in response to environmental stimuli. In such studies, genes showing similar mean expression values between two or more groups are considered as not differentially expressed, even if hidden subclasses with different expression values may exist. In this paper we propose a new method for identifying differentially expressed genes, based on the area between the ROC curve and the rising diagonal (ABCR. ABCR represents a more general approach than the standard area under the ROC curve (AUC, because it can identify both proper (i.e., concave and not proper ROC curves (NPRC. In particular, NPRC may correspond to those genes that tend to escape standard selection methods. Results We assessed the performance of our method using data from a publicly available database of 4026 genes, including 14 normal B cell samples (NBC and 20 heterogeneous lymphomas (namely: 9 follicular lymphomas and 11 chronic lymphocytic leukemias. Moreover, NBC also included two sub-classes, i.e., 6 heavily stimulated and 8 slightly or not stimulated samples. We identified 1607 differentially expressed genes with an estimated False Discovery Rate of 15%. Among them, 16 corresponded to NPRC and all escaped standard selection procedures based on AUC and t statistics. Moreover, a simple inspection to the shape of such plots allowed to identify the two subclasses in either one class in 13 cases (81%. Conclusion NPRC represent a new useful tool for the analysis of microarray data.

  10. ESTs, cDNA microarrays, and gene expression profiling: tools for dissecting plant physiology and development.

    Science.gov (United States)

    Alba, Rob; Fei, Zhangjun; Payton, Paxton; Liu, Yang; Moore, Shanna L; Debbie, Paul; Cohn, Jonathan; D'Ascenzo, Mark; Gordon, Jeffrey S; Rose, Jocelyn K C; Martin, Gregory; Tanksley, Steven D; Bouzayen, Mondher; Jahn, Molly M; Giovannoni, Jim

    2004-09-01

    Gene expression profiling holds tremendous promise for dissecting the regulatory mechanisms and transcriptional networks that underlie biological processes. Here we provide details of approaches used by others and ourselves for gene expression profiling in plants with emphasis on cDNA microarrays and discussion of both experimental design and downstream analysis. We focus on methods and techniques emphasizing fabrication of cDNA microarrays, fluorescent labeling, cDNA hybridization, experimental design, and data processing. We include specific examples that demonstrate how this technology can be used to further our understanding of plant physiology and development (specifically fruit development and ripening) and for comparative genomics by comparing transcriptome activity in tomato and pepper fruit.

  11. Generalization of DNA microarray dispersion properties: microarray equivalent of t-distribution

    DEFF Research Database (Denmark)

    Novak, Jaroslav P; Kim, Seon-Young; Xu, Jun

    2006-01-01

    BACKGROUND: DNA microarrays are a powerful technology that can provide a wealth of gene expression data for disease studies, drug development, and a wide scope of other investigations. Because of the large volume and inherent variability of DNA microarray data, many new statistical methods have...

  12. Previously unidentified changes in renal cell carcinoma gene expression identified by parametric analysis of microarray data

    International Nuclear Information System (INIS)

    Lenburg, Marc E; Liou, Louis S; Gerry, Norman P; Frampton, Garrett M; Cohen, Herbert T; Christman, Michael F

    2003-01-01

    Renal cell carcinoma is a common malignancy that often presents as a metastatic-disease for which there are no effective treatments. To gain insights into the mechanism of renal cell carcinogenesis, a number of genome-wide expression profiling studies have been performed. Surprisingly, there is very poor agreement among these studies as to which genes are differentially regulated. To better understand this lack of agreement we profiled renal cell tumor gene expression using genome-wide microarrays (45,000 probe sets) and compare our analysis to previous microarray studies. We hybridized total RNA isolated from renal cell tumors and adjacent normal tissue to Affymetrix U133A and U133B arrays. We removed samples with technical defects and removed probesets that failed to exhibit sequence-specific hybridization in any of the samples. We detected differential gene expression in the resulting dataset with parametric methods and identified keywords that are overrepresented in the differentially expressed genes with the Fisher-exact test. We identify 1,234 genes that are more than three-fold changed in renal tumors by t-test, 800 of which have not been previously reported to be altered in renal cell tumors. Of the only 37 genes that have been identified as being differentially expressed in three or more of five previous microarray studies of renal tumor gene expression, our analysis finds 33 of these genes (89%). A key to the sensitivity and power of our analysis is filtering out defective samples and genes that are not reliably detected. The widespread use of sample-wise voting schemes for detecting differential expression that do not control for false positives likely account for the poor overlap among previous studies. Among the many genes we identified using parametric methods that were not previously reported as being differentially expressed in renal cell tumors are several oncogenes and tumor suppressor genes that likely play important roles in renal cell

  13. Microarray Expression Profile and Functional Analysis of Circular RNAs in Osteosarcoma

    Directory of Open Access Journals (Sweden)

    Weihai Liu

    2017-09-01

    Full Text Available Background/Aims: Osteosarcoma (OS is the most common primary malignant bone tumor in children and adolescents. However, the molecular mechanisms regulating osteosarcoma tumorigenesis and progression are still poorly understood. Circular RNAs (circRNAs have been identified as microRNA sponges and are involved in many important biological processes. This study aims to investigate the global changes in the expression pattern of circRNAs in osteosarcoma and provide a comprehensive understanding of differentially expressed circRNAs. Methods: Microarray based circRNA expression was determined in osteosarcoma cell lines and compared with hFOB1.19, which was used as the normal control. We confirmed the microarray data by real time-qPCR in both osteosarcoma cell lines and tissues. The circRNA/microRNA/mRNA interaction network was predicted using bioinformatics. Gene Ontology analysis and 4 annotation tools for pathway analysis (KEGG, Biocarta, PANTHER and Reactome were used to predict the functions of differentially expressed circRNAs. Results: We revealed a number of differentially expressed circRNAs and 12 of them were confirmed, which suggests a potential role of circRNAs in OS. Among these differentially expressed circRNAs, hsa_circRNA_103801 was up-regulated in both osteosarcoma cell lines and tissues, while hsa_circRNA_104980 was down-regulated. The most likely potential target miRNAs for hsa_circRNA_103801 include hsa-miR-370-3p, hsa-miR-338-3p and hsa-miR-877-3p, while the most potential target miRNAs of hsa_circRNA_104980 consist of hsa-miR-1298-3p and hsa-miR-660-3p. Functional analysis found that hsa_circRNA_103801 was involved in pathways in cancer, such as the HIF-1, VEGF and angiogenesis pathway, the Rap1 signaling pathway and the PI3K-Akt signaling pathway, while hsa_circRNA_104980 was related to some pathways such as the tight junction pathway. Conclusions: This study has identified the comprehensive expression profile of circRNAs in

  14. Analysis of gene expression in resynthesized Brassica napus Allopolyploids using arabidopsis 70mer oligo microarrays.

    Directory of Open Access Journals (Sweden)

    Robert T Gaeta

    Full Text Available BACKGROUND: Studies in resynthesized Brassica napus allopolyploids indicate that homoeologous chromosome exchanges in advanced generations (S(5ratio6 alter gene expression through the loss and doubling of homoeologous genes within the rearrangements. Rearrangements may also indirectly affect global gene expression if homoeologous copies of gene regulators within rearrangements have differential affects on the transcription of genes in networks. METHODOLOGY/PRINCIPAL FINDINGS: We utilized Arabidopsis 70mer oligonucleotide microarrays for exploring gene expression in three resynthesized B. napus lineages at the S(0ratio1 and S(5ratio6 generations as well as their diploid progenitors B. rapa and B. oleracea. Differential gene expression between the progenitors and additive (midparent expression in the allopolyploids were tested. The S(5ratio6 lines differed in the number of genetic rearrangements, allowing us to test if the number of genes displaying nonadditive expression was related to the number of rearrangements. Estimates using per-gene and common variance ANOVA models indicated that 6-15% of 26,107 genes were differentially expressed between the progenitors. Individual allopolyploids showed nonadditive expression for 1.6-32% of all genes. Less than 0.3% of genes displayed nonadditive expression in all S(0ratio1 lines and 0.1-0.2% were nonadditive among all S(5ratio6 lines. Differentially expressed genes in the polyploids were over-represented by genes differential between the progenitors. The total number of differentially expressed genes was correlated with the number of genetic changes in S(5ratio6 lines under the common variance model; however, there was no relationship using a per-gene variance model, and many genes showed nonadditive expression in S(0ratio1 lines. CONCLUSIONS/SIGNIFICANCE: Few genes reproducibly demonstrated nonadditive expression among lineages, suggesting few changes resulted from a general response to polyploidization

  15. The MGED Ontology: a resource for semantics-based description of microarray experiments.

    Science.gov (United States)

    Whetzel, Patricia L; Parkinson, Helen; Causton, Helen C; Fan, Liju; Fostel, Jennifer; Fragoso, Gilberto; Game, Laurence; Heiskanen, Mervi; Morrison, Norman; Rocca-Serra, Philippe; Sansone, Susanna-Assunta; Taylor, Chris; White, Joseph; Stoeckert, Christian J

    2006-04-01

    The generation of large amounts of microarray data and the need to share these data bring challenges for both data management and annotation and highlights the need for standards. MIAME specifies the minimum information needed to describe a microarray experiment and the Microarray Gene Expression Object Model (MAGE-OM) and resulting MAGE-ML provide a mechanism to standardize data representation for data exchange, however a common terminology for data annotation is needed to support these standards. Here we describe the MGED Ontology (MO) developed by the Ontology Working Group of the Microarray Gene Expression Data (MGED) Society. The MO provides terms for annotating all aspects of a microarray experiment from the design of the experiment and array layout, through to the preparation of the biological sample and the protocols used to hybridize the RNA and analyze the data. The MO was developed to provide terms for annotating experiments in line with the MIAME guidelines, i.e. to provide the semantics to describe a microarray experiment according to the concepts specified in MIAME. The MO does not attempt to incorporate terms from existing ontologies, e.g. those that deal with anatomical parts or developmental stages terms, but provides a framework to reference terms in other ontologies and therefore facilitates the use of ontologies in microarray data annotation. The MGED Ontology version.1.2.0 is available as a file in both DAML and OWL formats at http://mged.sourceforge.net/ontologies/index.php. Release notes and annotation examples are provided. The MO is also provided via the NCICB's Enterprise Vocabulary System (http://nciterms.nci.nih.gov/NCIBrowser/Dictionary.do). Stoeckrt@pcbi.upenn.edu Supplementary data are available at Bioinformatics online.

  16. Age-Specific Gene Expression Profiles of Rhesus Monkey Ovaries Detected by Microarray Analysis

    Directory of Open Access Journals (Sweden)

    Hengxi Wei

    2015-01-01

    Full Text Available The biological function of human ovaries declines with age. To identify the potential molecular changes in ovarian aging, we performed genome-wide gene expression analysis by microarray of ovaries from young, middle-aged, and old rhesus monkeys. Microarray data was validated by quantitative real-time PCR. Results showed that a total of 503 (60 upregulated, 443 downregulated and 84 (downregulated genes were differentially expressed in old ovaries compared to young and middle-aged groups, respectively. No difference in gene expression was found between middle-aged and young groups. Differentially expressed genes were mainly enriched in cell and organelle, cellular and physiological process, binding, and catalytic activity. These genes were primarily associated with KEGG pathways of cell cycle, DNA replication and repair, oocyte meiosis and maturation, MAPK, TGF-beta, and p53 signaling pathway. Genes upregulated were involved in aging, defense response, oxidation reduction, and negative regulation of cellular process; genes downregulated have functions in reproduction, cell cycle, DNA and RNA process, macromolecular complex assembly, and positive regulation of macromolecule metabolic process. These findings show that monkey ovary undergoes substantial change in global transcription with age. Gene expression profiles are useful in understanding the mechanisms underlying ovarian aging and age-associated infertility in primates.

  17. A comprehensive comparison of RNA-Seq-based transcriptome analysis from reads to differential gene expression and cross-comparison with microarrays: a case study in Saccharomyces cerevisiae

    DEFF Research Database (Denmark)

    Nookaew, Intawat; Papini, Marta; Pornputtapong, Natapol

    2012-01-01

    RNA-seq, has recently become an attractive method of choice in the studies of transcriptomes, promising several advantages compared with microarrays. In this study, we sought to assess the contribution of the different analytical steps involved in the analysis of RNA-seq data generated with the I......RNA-seq, has recently become an attractive method of choice in the studies of transcriptomes, promising several advantages compared with microarrays. In this study, we sought to assess the contribution of the different analytical steps involved in the analysis of RNA-seq data generated...... gene expression identification derived from the different statistical methods, as well as their integrated analysis results based on gene ontology annotation are in good agreement. Overall, our study provides a useful and comprehensive comparison between the two platforms (RNA-seq and microrrays...

  18. DNA microarrays of baculovirus genomes: differential expression of viral genes in two susceptible insect cell lines.

    Science.gov (United States)

    Yamagishi, J; Isobe, R; Takebuchi, T; Bando, H

    2003-03-01

    We describe, for the first time, the generation of a viral DNA chip for simultaneous expression measurements of nearly all known open reading frames (ORFs) in the best-studied members of the family Baculoviridae, Autographa californica multiple nucleopolyhedrovirus (AcMNPV) and Bombyx mori nucleopolyhedrovirus (BmNPV). In this study, a viral DNA chip (Ac-BmNPV chip) was fabricated and used to characterize the viral gene expression profile for AcMNPV in different cell types. The viral chip is composed of microarrays of viral DNA prepared by robotic deposition of PCR-amplified viral DNA fragments on glass for ORFs in the NPV genome. Viral gene expression was monitored by hybridization to the DNA fragment microarrays with fluorescently labeled cDNAs prepared from infected Spodoptera frugiperda, Sf9 cells and Trichoplusia ni, TnHigh-Five cells, the latter a major producer of baculovirus and recombinant proteins. A comparison of expression profiles of known ORFs in AcMNPV elucidated six genes (ORF150, p10, pk2, and three late gene expression factor genes lef-3, p35 and lef- 6) the expression of each of which was regulated differently in the two cell lines. Most of these genes are known to be closely involved in the viral life cycle such as in DNA replication, late gene expression and the release of polyhedra from infected cells. These results imply that the differential expression of these viral genes accounts for the differences in viral replication between these two cell lines. Thus, these fabricated microarrays of NPV DNA which allow a rapid analysis of gene expression at the viral genome level should greatly speed the functional analysis of large genomes of NPV.

  19. [Preparation of the cDNA microarray on the differential expressed cDNA of senescence-accelerated mouse's hippocampus].

    Science.gov (United States)

    Cheng, Xiao-Rui; Zhou, Wen-Xia; Zhang, Yong-Xiang

    2006-05-01

    Alzheimer' s disease (AD) is the most common form of dementia in the elderly. AD is an invariably fatal neurodegenerative disorder with no effective treatment. Senescence-accelerated mouse prone 8 (SAMP8) is a model for studying age-related cognitive impairments and also is a good model to study brain aging and one of mouse model of AD. The technique of cDNA microarray can monitor the expression levels of thousands of genes simultaneously and can be used to study AD with the character of multi-mechanism, multi-targets and multi-pathway. In order to disclose the mechanism of AD and find the drug targets of AD, cDNA microarray containing 3136 cDNAs amplified from the suppression subtracted cDNA library of hippocampus of SAMP8 and SAMR1 was prepared with 16 blocks and 14 x 14 pins, the housekeeping gene beta-actin and G3PDH as inner conference. The background of this microarray was low and unanimous, and dots divided evenly. The conditions of hybridization and washing were optimized during the hybridization of probe and target molecule. After the data of hybridization analysis, the differential expressed cDNAs were sequenced and analyzed by the bioinformatics, and some of genes were quantified by the real time RT-PCR and the reliability of this cDNA microarray were validated. This cDNA microarray may be the good means to select the differential expressed genes and disclose the molecular mechanism of SAMP8's brain aging and AD.

  20. Tumour auto-antibody screening: performance of protein microarrays using SEREX derived antigens

    International Nuclear Information System (INIS)

    Stempfer, René; Weinhäusel, Andreas; Syed, Parvez; Vierlinger, Klemens; Pichler, Rudolf; Meese, Eckart; Leidinger, Petra; Ludwig, Nicole; Kriegner, Albert; Nöhammer, Christa

    2010-01-01

    The simplicity and potential of minimal invasive testing using serum from patients make auto-antibody based biomarkers a very promising tool for use in diagnostics of cancer and auto-immune disease. Although several methods exist for elucidating candidate-protein markers, immobilizing these onto membranes and generating so called macroarrays is of limited use for marker validation. Especially when several hundred samples have to be analysed, microarrays could serve as a good alternative since processing macro membranes is cumbersome and reproducibility of results is moderate. Candidate markers identified by SEREX (serological identification of antigens by recombinant expression cloning) screenings of brain and lung tumour were used for macroarray and microarray production. For microarray production recombinant proteins were expressed in E. coli by autoinduction and purified His-tag (histidine-tagged) proteins were then used for the production of protein microarrays. Protein arrays were hybridized with the serum samples from brain and lung tumour patients. Methods for the generation of microarrays were successfully established when using antigens derived from membrane-based selection. Signal patterns obtained by microarrays analysis of brain and lung tumour patients' sera were highly reproducible (R = 0.92-0.96). This provides the technical foundation for diagnostic applications on the basis of auto-antibody patterns. In this limited test set, the assay provided high reproducibility and a broad dynamic range to classify all brain and lung samples correctly. Protein microarray is an efficient means for auto-antibody-based detection when using SEREX-derived clones expressing antigenic proteins. Protein microarrays are preferred to macroarrays due to the easier handling and the high reproducibility of auto-antibody testing. Especially when using only a few microliters of patient samples protein microarrays are ideally suited for validation of auto

  1. How the RNA isolation method can affect microRNA microarray results

    DEFF Research Database (Denmark)

    Podolska, Agnieszka; Kaczkowski, Bogumil; Litman, Thomas

    2011-01-01

    RNA microarray analysis on porcine brain tissue. One method is a phenol-guanidine isothiocyanate-based procedure that permits isolation of total RNA. The second method, miRVana™ microRNA isolation, is column based and recovers the small RNA fraction alone. We found that microarray analyses give different results...... that depend on the RNA fraction used, in particular because some microRNAs appear very sensitive to the RNA isolation method. We conclude that precautions need to be taken when comparing microarray studies based on RNA isolated with different methods.......The quality of RNA is crucial in gene expression experiments. RNA degradation interferes in the measurement of gene expression, and in this context, microRNA quantification can lead to an incorrect estimation. In the present study, two different RNA isolation methods were used to perform micro...

  2. Gene expression profiling in gill tissues of White spot syndrome virus infected black tiger shrimp Penaeus monodon by DNA microarray.

    Science.gov (United States)

    Shekhar, M S; Gomathi, A; Gopikrishna, G; Ponniah, A G

    2015-06-01

    White spot syndrome virus (WSSV) continues to be the most devastating viral pathogen infecting penaeid shrimp the world over. The genome of WSSV has been deciphered and characterized from three geographical isolates and significant progress has been made in developing various molecular diagnostic methods to detect the virus. However, the information on host immune gene response to WSSV pathogenesis is limited. Microarray analysis was carried out as an approach to analyse the gene expression in black tiger shrimp Penaeus monodon in response to WSSV infection. Gill tissues collected from the WSSV infected shrimp at 6, 24, 48 h and moribund stage were analysed for differential gene expression. Shrimp cDNAs of 40,059 unique sequences were considered for designing the microarray chip. The Cy3-labeled cRNA derived from healthy and WSSV-infected shrimp was subjected to hybridization with all the DNA spots in the microarray which revealed 8,633 and 11,147 as up- and down-regulated genes respectively at different time intervals post infection. The altered expression of these numerous genes represented diverse functions such as immune response, osmoregulation, apoptosis, nucleic acid binding, energy and metabolism, signal transduction, stress response and molting. The changes in gene expression profiles observed by microarray analysis provides molecular insights and framework of genes which are up- and down-regulated at different time intervals during WSSV infection in shrimp. The microarray data was validated by Real Time analysis of four differentially expressed genes involved in apoptosis (translationally controlled tumor protein, inhibitor of apoptosis protein, ubiquitin conjugated enzyme E2 and caspase) for gene expression levels. The role of apoptosis related genes in WSSV infected shrimp is discussed herein.

  3. Microarray-based analysis of the differential expression of melanin synthesis genes in dark and light-muzzle Korean cattle.

    Science.gov (United States)

    Kim, Sang Hwan; Hwang, Sue Yun; Yoon, Jong Taek

    2014-01-01

    The coat color of mammals is determined by the melanogenesis pathway, which is responsible for maintaining the balance between black-brown eumelanin and yellow-reddish pheomelanin. It is also believed that the color of the bovine muzzle is regulated in a similar manner; however, the molecular mechanism underlying pigment deposition in the dark-muzzle has yet to be elucidated. The aim of the present study was to identify melanogenesis-associated genes that are differentially expressed in the dark vs. light muzzle of native Korean cows. Using microarray clustering and real-time polymerase chain reaction techniques, we observed that the expression of genes involved in the mitogen-activated protein kinase (MAPK) and Wnt signaling pathways is distinctively regulated in the dark and light muzzle tissues. Differential expression of tyrosinase was also noticed, although the difference was not as distinct as those of MAPK and Wnt. We hypothesize that emphasis on the MAPK pathway in the dark-muzzle induces eumelanin synthesis through the activation of cAMP response element-binding protein and tyrosinase, while activation of Wnt signaling counteracts this process and raises the amount of pheomelanin in the light-muzzle. We also found 2 novel genes (GenBank No. NM-001076026 and XM-588439) with increase expression in the black nose, which may provide additional information about the mechanism of nose pigmentation. Regarding the increasing interest in the genetic diversity of cattle stocks, genes we identified for differential expression in the dark vs. light muzzle may serve as novel markers for genetic diversity among cows based on the muzzle color phenotype.

  4. Microarray-based analysis of the differential expression of melanin synthesis genes in dark and light-muzzle Korean cattle.

    Directory of Open Access Journals (Sweden)

    Sang Hwan Kim

    Full Text Available The coat color of mammals is determined by the melanogenesis pathway, which is responsible for maintaining the balance between black-brown eumelanin and yellow-reddish pheomelanin. It is also believed that the color of the bovine muzzle is regulated in a similar manner; however, the molecular mechanism underlying pigment deposition in the dark-muzzle has yet to be elucidated. The aim of the present study was to identify melanogenesis-associated genes that are differentially expressed in the dark vs. light muzzle of native Korean cows. Using microarray clustering and real-time polymerase chain reaction techniques, we observed that the expression of genes involved in the mitogen-activated protein kinase (MAPK and Wnt signaling pathways is distinctively regulated in the dark and light muzzle tissues. Differential expression of tyrosinase was also noticed, although the difference was not as distinct as those of MAPK and Wnt. We hypothesize that emphasis on the MAPK pathway in the dark-muzzle induces eumelanin synthesis through the activation of cAMP response element-binding protein and tyrosinase, while activation of Wnt signaling counteracts this process and raises the amount of pheomelanin in the light-muzzle. We also found 2 novel genes (GenBank No. NM-001076026 and XM-588439 with increase expression in the black nose, which may provide additional information about the mechanism of nose pigmentation. Regarding the increasing interest in the genetic diversity of cattle stocks, genes we identified for differential expression in the dark vs. light muzzle may serve as novel markers for genetic diversity among cows based on the muzzle color phenotype.

  5. Microintaglio Printing for Soft Lithography-Based in Situ Microarrays

    Directory of Open Access Journals (Sweden)

    Manish Biyani

    2015-07-01

    Full Text Available Advances in lithographic approaches to fabricating bio-microarrays have been extensively explored over the last two decades. However, the need for pattern flexibility, a high density, a high resolution, affordability and on-demand fabrication is promoting the development of unconventional routes for microarray fabrication. This review highlights the development and uses of a new molecular lithography approach, called “microintaglio printing technology”, for large-scale bio-microarray fabrication using a microreactor array (µRA-based chip consisting of uniformly-arranged, femtoliter-size µRA molds. In this method, a single-molecule-amplified DNA microarray pattern is self-assembled onto a µRA mold and subsequently converted into a messenger RNA or protein microarray pattern by simultaneously producing and transferring (immobilizing a messenger RNA or a protein from a µRA mold to a glass surface. Microintaglio printing allows the self-assembly and patterning of in situ-synthesized biomolecules into high-density (kilo-giga-density, ordered arrays on a chip surface with µm-order precision. This holistic aim, which is difficult to achieve using conventional printing and microarray approaches, is expected to revolutionize and reshape proteomics. This review is not written comprehensively, but rather substantively, highlighting the versatility of microintaglio printing for developing a prerequisite platform for microarray technology for the postgenomic era.

  6. A general framework for optimization of probes for gene expression microarray and its application to the fungus Podospora anserina.

    Science.gov (United States)

    Bidard, Frédérique; Imbeaud, Sandrine; Reymond, Nancie; Lespinet, Olivier; Silar, Philippe; Clavé, Corinne; Delacroix, Hervé; Berteaux-Lecellier, Véronique; Debuchy, Robert

    2010-06-18

    The development of new microarray technologies makes custom long oligonucleotide arrays affordable for many experimental applications, notably gene expression analyses. Reliable results depend on probe design quality and selection. Probe design strategy should cope with the limited accuracy of de novo gene prediction programs, and annotation up-dating. We present a novel in silico procedure which addresses these issues and includes experimental screening, as an empirical approach is the best strategy to identify optimal probes in the in silico outcome. We used four criteria for in silico probe selection: cross-hybridization, hairpin stability, probe location relative to coding sequence end and intron position. This latter criterion is critical when exon-intron gene structure predictions for intron-rich genes are inaccurate. For each coding sequence (CDS), we selected a sub-set of four probes. These probes were included in a test microarray, which was used to evaluate the hybridization behavior of each probe. The best probe for each CDS was selected according to three experimental criteria: signal-to-noise ratio, signal reproducibility, and representative signal intensities. This procedure was applied for the development of a gene expression Agilent platform for the filamentous fungus Podospora anserina and the selection of a single 60-mer probe for each of the 10,556 P. anserina CDS. A reliable gene expression microarray version based on the Agilent 44K platform was developed with four spot replicates of each probe to increase statistical significance of analysis.

  7. A general framework for optimization of probes for gene expression microarray and its application to the fungus Podospora anserina

    Directory of Open Access Journals (Sweden)

    Bidard Frédérique

    2010-06-01

    Full Text Available Abstract Background The development of new microarray technologies makes custom long oligonucleotide arrays affordable for many experimental applications, notably gene expression analyses. Reliable results depend on probe design quality and selection. Probe design strategy should cope with the limited accuracy of de novo gene prediction programs, and annotation up-dating. We present a novel in silico procedure which addresses these issues and includes experimental screening, as an empirical approach is the best strategy to identify optimal probes in the in silico outcome. Findings We used four criteria for in silico probe selection: cross-hybridization, hairpin stability, probe location relative to coding sequence end and intron position. This latter criterion is critical when exon-intron gene structure predictions for intron-rich genes are inaccurate. For each coding sequence (CDS, we selected a sub-set of four probes. These probes were included in a test microarray, which was used to evaluate the hybridization behavior of each probe. The best probe for each CDS was selected according to three experimental criteria: signal-to-noise ratio, signal reproducibility, and representative signal intensities. This procedure was applied for the development of a gene expression Agilent platform for the filamentous fungus Podospora anserina and the selection of a single 60-mer probe for each of the 10,556 P. anserina CDS. Conclusions A reliable gene expression microarray version based on the Agilent 44K platform was developed with four spot replicates of each probe to increase statistical significance of analysis.

  8. BASE - 2nd generation software for microarray data management and analysis

    Directory of Open Access Journals (Sweden)

    Nordborg Nicklas

    2009-10-01

    Full Text Available Abstract Background Microarray experiments are increasing in size and samples are collected asynchronously over long time. Available data are re-analysed as more samples are hybridized. Systematic use of collected data requires tracking of biomaterials, array information, raw data, and assembly of annotations. To meet the information tracking and data analysis challenges in microarray experiments we reimplemented and improved BASE version 1.2. Results The new BASE presented in this report is a comprehensive annotable local microarray data repository and analysis application providing researchers with an efficient information management and analysis tool. The information management system tracks all material from biosource, via sample and through extraction and labelling to raw data and analysis. All items in BASE can be annotated and the annotations can be used as experimental factors in downstream analysis. BASE stores all microarray experiment related data regardless if analysis tools for specific techniques or data formats are readily available. The BASE team is committed to continue improving and extending BASE to make it usable for even more experimental setups and techniques, and we encourage other groups to target their specific needs leveraging on the infrastructure provided by BASE. Conclusion BASE is a comprehensive management application for information, data, and analysis of microarray experiments, available as free open source software at http://base.thep.lu.se under the terms of the GPLv3 license.

  9. BASE--2nd generation software for microarray data management and analysis.

    Science.gov (United States)

    Vallon-Christersson, Johan; Nordborg, Nicklas; Svensson, Martin; Häkkinen, Jari

    2009-10-12

    Microarray experiments are increasing in size and samples are collected asynchronously over long time. Available data are re-analysed as more samples are hybridized. Systematic use of collected data requires tracking of biomaterials, array information, raw data, and assembly of annotations. To meet the information tracking and data analysis challenges in microarray experiments we reimplemented and improved BASE version 1.2. The new BASE presented in this report is a comprehensive annotable local microarray data repository and analysis application providing researchers with an efficient information management and analysis tool. The information management system tracks all material from biosource, via sample and through extraction and labelling to raw data and analysis. All items in BASE can be annotated and the annotations can be used as experimental factors in downstream analysis. BASE stores all microarray experiment related data regardless if analysis tools for specific techniques or data formats are readily available. The BASE team is committed to continue improving and extending BASE to make it usable for even more experimental setups and techniques, and we encourage other groups to target their specific needs leveraging on the infrastructure provided by BASE. BASE is a comprehensive management application for information, data, and analysis of microarray experiments, available as free open source software at http://base.thep.lu.se under the terms of the GPLv3 license.

  10. In Silico Analysis of Microarray-Based Gene Expression Profiles Predicts Tumor Cell Response to Withanolides

    Directory of Open Access Journals (Sweden)

    Thomas Efferth

    2012-05-01

    Full Text Available Withania somnifera (L. Dunal (Indian ginseng, winter cherry, Solanaceae is widely used in traditional medicine. Roots are either chewed or used to prepare beverages (aqueous decocts. The major secondary metabolites of Withania somnifera are the withanolides, which are C-28-steroidal lactone triterpenoids. Withania somnifera extracts exert chemopreventive and anticancer activities in vitro and in vivo. The aims of the present in silico study were, firstly, to investigate whether tumor cells develop cross-resistance between standard anticancer drugs and withanolides and, secondly, to elucidate the molecular determinants of sensitivity and resistance of tumor cells towards withanolides. Using IC50 concentrations of eight different withanolides (withaferin A, withaferin A diacetate, 3-azerininylwithaferin A, withafastuosin D diacetate, 4-B-hydroxy-withanolide E, isowithanololide E, withafastuosin E, and withaperuvin and 19 established anticancer drugs, we analyzed the cross-resistance profile of 60 tumor cell lines. The cell lines revealed cross-resistance between the eight withanolides. Consistent cross-resistance between withanolides and nitrosoureas (carmustin, lomustin, and semimustin was also observed. Then, we performed transcriptomic microarray-based COMPARE and hierarchical cluster analyses of mRNA expression to identify mRNA expression profiles predicting sensitivity or resistance towards withanolides. Genes from diverse functional groups were significantly associated with response of tumor cells to withaferin A diacetate, e.g. genes functioning in DNA damage and repair, stress response, cell growth regulation, extracellular matrix components, cell adhesion and cell migration, constituents of the ribosome, cytoskeletal organization and regulation, signal transduction, transcription factors, and others.

  11. A non-parametric meta-analysis approach for combining independent microarray datasets: application using two microarray datasets pertaining to chronic allograft nephropathy

    Directory of Open Access Journals (Sweden)

    Archer Kellie J

    2008-02-01

    Full Text Available Abstract Background With the popularity of DNA microarray technology, multiple groups of researchers have studied the gene expression of similar biological conditions. Different methods have been developed to integrate the results from various microarray studies, though most of them rely on distributional assumptions, such as the t-statistic based, mixed-effects model, or Bayesian model methods. However, often the sample size for each individual microarray experiment is small. Therefore, in this paper we present a non-parametric meta-analysis approach for combining data from independent microarray studies, and illustrate its application on two independent Affymetrix GeneChip studies that compared the gene expression of biopsies from kidney transplant recipients with chronic allograft nephropathy (CAN to those with normal functioning allograft. Results The simulation study comparing the non-parametric meta-analysis approach to a commonly used t-statistic based approach shows that the non-parametric approach has better sensitivity and specificity. For the application on the two CAN studies, we identified 309 distinct genes that expressed differently in CAN. By applying Fisher's exact test to identify enriched KEGG pathways among those genes called differentially expressed, we found 6 KEGG pathways to be over-represented among the identified genes. We used the expression measurements of the identified genes as predictors to predict the class labels for 6 additional biopsy samples, and the predicted results all conformed to their pathologist diagnosed class labels. Conclusion We present a new approach for combining data from multiple independent microarray studies. This approach is non-parametric and does not rely on any distributional assumptions. The rationale behind the approach is logically intuitive and can be easily understood by researchers not having advanced training in statistics. Some of the identified genes and pathways have been

  12. A quantitative comparison of cell-type-specific microarray gene expression profiling methods in the mouse brain.

    Directory of Open Access Journals (Sweden)

    Benjamin W Okaty

    Full Text Available Expression profiling of restricted neural populations using microarrays can facilitate neuronal classification and provide insight into the molecular bases of cellular phenotypes. Due to the formidable heterogeneity of intermixed cell types that make up the brain, isolating cell types prior to microarray processing poses steep technical challenges that have been met in various ways. These methodological differences have the potential to distort cell-type-specific gene expression profiles insofar as they may insufficiently filter out contaminating mRNAs or induce aberrant cellular responses not normally present in vivo. Thus we have compared the repeatability, susceptibility to contamination from off-target cell-types, and evidence for stress-responsive gene expression of five different purification methods--Laser Capture Microdissection (LCM, Translating Ribosome Affinity Purification (TRAP, Immunopanning (PAN, Fluorescence Activated Cell Sorting (FACS, and manual sorting of fluorescently labeled cells (Manual. We found that all methods obtained comparably high levels of repeatability, however, data from LCM and TRAP showed significantly higher levels of contamination than the other methods. While PAN samples showed higher activation of apoptosis-related, stress-related and immediate early genes, samples from FACS and Manual studies, which also require dissociated cells, did not. Given that TRAP targets actively translated mRNAs, whereas other methods target all transcribed mRNAs, observed differences may also reflect translational regulation.

  13. BioCichlid: central dogma-based 3D visualization system of time-course microarray data on a hierarchical biological network.

    Science.gov (United States)

    Ishiwata, Ryosuke R; Morioka, Masaki S; Ogishima, Soichi; Tanaka, Hiroshi

    2009-02-15

    BioCichlid is a 3D visualization system of time-course microarray data on molecular networks, aiming at interpretation of gene expression data by transcriptional relationships based on the central dogma with physical and genetic interactions. BioCichlid visualizes both physical (protein) and genetic (regulatory) network layers, and provides animation of time-course gene expression data on the genetic network layer. Transcriptional regulations are represented to bridge the physical network (transcription factors) and genetic network (regulated genes) layers, thus integrating promoter analysis into the pathway mapping. BioCichlid enhances the interpretation of microarray data and allows for revealing the underlying mechanisms causing differential gene expressions. BioCichlid is freely available and can be accessed at http://newton.tmd.ac.jp/. Source codes for both biocichlid server and client are also available.

  14. High quality protein microarray using in situ protein purification

    Directory of Open Access Journals (Sweden)

    Fleischmann Robert D

    2009-08-01

    Full Text Available Abstract Background In the postgenomic era, high throughput protein expression and protein microarray technologies have progressed markedly permitting screening of therapeutic reagents and discovery of novel protein functions. Hexa-histidine is one of the most commonly used fusion tags for protein expression due to its small size and convenient purification via immobilized metal ion affinity chromatography (IMAC. This purification process has been adapted to the protein microarray format, but the quality of in situ His-tagged protein purification on slides has not been systematically evaluated. We established methods to determine the level of purification of such proteins on metal chelate-modified slide surfaces. Optimized in situ purification of His-tagged recombinant proteins has the potential to become the new gold standard for cost-effective generation of high-quality and high-density protein microarrays. Results Two slide surfaces were examined, chelated Cu2+ slides suspended on a polyethylene glycol (PEG coating and chelated Ni2+ slides immobilized on a support without PEG coating. Using PEG-coated chelated Cu2+ slides, consistently higher purities of recombinant proteins were measured. An optimized wash buffer (PBST composed of 10 mM phosphate buffer, 2.7 mM KCl, 140 mM NaCl and 0.05% Tween 20, pH 7.4, further improved protein purity levels. Using Escherichia coli cell lysates expressing 90 recombinant Streptococcus pneumoniae proteins, 73 proteins were successfully immobilized, and 66 proteins were in situ purified with greater than 90% purity. We identified several antigens among the in situ-purified proteins via assays with anti-S. pneumoniae rabbit antibodies and a human patient antiserum, as a demonstration project of large scale microarray-based immunoproteomics profiling. The methodology is compatible with higher throughput formats of in vivo protein expression, eliminates the need for resin-based purification and circumvents

  15. EzArray: A web-based highly automated Affymetrix expression array data management and analysis system

    Directory of Open Access Journals (Sweden)

    Zhu Yuelin

    2008-01-01

    Full Text Available Abstract Background Though microarray experiments are very popular in life science research, managing and analyzing microarray data are still challenging tasks for many biologists. Most microarray programs require users to have sophisticated knowledge of mathematics, statistics and computer skills for usage. With accumulating microarray data deposited in public databases, easy-to-use programs to re-analyze previously published microarray data are in high demand. Results EzArray is a web-based Affymetrix expression array data management and analysis system for researchers who need to organize microarray data efficiently and get data analyzed instantly. EzArray organizes microarray data into projects that can be analyzed online with predefined or custom procedures. EzArray performs data preprocessing and detection of differentially expressed genes with statistical methods. All analysis procedures are optimized and highly automated so that even novice users with limited pre-knowledge of microarray data analysis can complete initial analysis quickly. Since all input files, analysis parameters, and executed scripts can be downloaded, EzArray provides maximum reproducibility for each analysis. In addition, EzArray integrates with Gene Expression Omnibus (GEO and allows instantaneous re-analysis of published array data. Conclusion EzArray is a novel Affymetrix expression array data analysis and sharing system. EzArray provides easy-to-use tools for re-analyzing published microarray data and will help both novice and experienced users perform initial analysis of their microarray data from the location of data storage. We believe EzArray will be a useful system for facilities with microarray services and laboratories with multiple members involved in microarray data analysis. EzArray is freely available from http://www.ezarray.com/.

  16. A mixture model-based approach to the clustering of microarray expression data.

    Science.gov (United States)

    McLachlan, G J; Bean, R W; Peel, D

    2002-03-01

    This paper introduces the software EMMIX-GENE that has been developed for the specific purpose of a model-based approach to the clustering of microarray expression data, in particular, of tissue samples on a very large number of genes. The latter is a nonstandard problem in parametric cluster analysis because the dimension of the feature space (the number of genes) is typically much greater than the number of tissues. A feasible approach is provided by first selecting a subset of the genes relevant for the clustering of the tissue samples by fitting mixtures of t distributions to rank the genes in order of increasing size of the likelihood ratio statistic for the test of one versus two components in the mixture model. The imposition of a threshold on the likelihood ratio statistic used in conjunction with a threshold on the size of a cluster allows the selection of a relevant set of genes. However, even this reduced set of genes will usually be too large for a normal mixture model to be fitted directly to the tissues, and so the use of mixtures of factor analyzers is exploited to reduce effectively the dimension of the feature space of genes. The usefulness of the EMMIX-GENE approach for the clustering of tissue samples is demonstrated on two well-known data sets on colon and leukaemia tissues. For both data sets, relevant subsets of the genes are able to be selected that reveal interesting clusterings of the tissues that are either consistent with the external classification of the tissues or with background and biological knowledge of these sets. EMMIX-GENE is available at http://www.maths.uq.edu.au/~gjm/emmix-gene/

  17. DNA microarray-based PCR ribotyping of Clostridium difficile.

    Science.gov (United States)

    Schneeberg, Alexander; Ehricht, Ralf; Slickers, Peter; Baier, Vico; Neubauer, Heinrich; Zimmermann, Stefan; Rabold, Denise; Lübke-Becker, Antina; Seyboldt, Christian

    2015-02-01

    This study presents a DNA microarray-based assay for fast and simple PCR ribotyping of Clostridium difficile strains. Hybridization probes were designed to query the modularly structured intergenic spacer region (ISR), which is also the template for conventional and PCR ribotyping with subsequent capillary gel electrophoresis (seq-PCR) ribotyping. The probes were derived from sequences available in GenBank as well as from theoretical ISR module combinations. A database of reference hybridization patterns was set up from a collection of 142 well-characterized C. difficile isolates representing 48 seq-PCR ribotypes. The reference hybridization patterns calculated by the arithmetic mean were compared using a similarity matrix analysis. The 48 investigated seq-PCR ribotypes revealed 27 array profiles that were clearly distinguishable. The most frequent human-pathogenic ribotypes 001, 014/020, 027, and 078/126 were discriminated by the microarray. C. difficile strains related to 078/126 (033, 045/FLI01, 078, 126, 126/FLI01, 413, 413/FLI01, 598, 620, 652, and 660) and 014/020 (014, 020, and 449) showed similar hybridization patterns, confirming their genetic relatedness, which was previously reported. A panel of 50 C. difficile field isolates was tested by seq-PCR ribotyping and the DNA microarray-based assay in parallel. Taking into account that the current version of the microarray does not discriminate some closely related seq-PCR ribotypes, all isolates were typed correctly. Moreover, seq-PCR ribotypes without reference profiles available in the database (ribotype 009 and 5 new types) were correctly recognized as new ribotypes, confirming the performance and expansion potential of the microarray. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  18. Gene expression analysis of the biocontrol fungus Trichoderma harzianum in the presence of tomato plants, chitin, or glucose using a high-density oligonucleotide microarray.

    Science.gov (United States)

    Samolski, Ilanit; de Luis, Alberto; Vizcaíno, Juan Antonio; Monte, Enrique; Suárez, M Belén

    2009-10-13

    It has recently been shown that the Trichoderma fungal species used for biocontrol of plant diseases are capable of interacting with plant roots directly, behaving as symbiotic microorganisms. With a view to providing further information at transcriptomic level about the early response of Trichoderma to a host plant, we developed a high-density oligonucleotide (HDO) microarray encompassing 14,081 Expressed Sequence Tag (EST)-based transcripts from eight Trichoderma spp. and 9,121 genome-derived transcripts of T. reesei, and we have used this microarray to examine the gene expression of T. harzianum either alone or in the presence of tomato plants, chitin, or glucose. Global microarray analysis revealed 1,617 probe sets showing differential expression in T. harzianum mycelia under at least one of the culture conditions tested as compared with one another. Hierarchical clustering and heat map representation showed that the expression patterns obtained in glucose medium clustered separately from the expression patterns observed in the presence of tomato plants and chitin. Annotations using the Blast2GO suite identified 85 of the 257 transcripts whose probe sets afforded up-regulated expression in response to tomato plants. Some of these transcripts were predicted to encode proteins related to Trichoderma-host (fungus or plant) associations, such as Sm1/Elp1 protein, proteases P6281 and PRA1, enchochitinase CHIT42, or QID74 protein, although previously uncharacterized genes were also identified, including those responsible for the possible biosynthesis of nitric oxide, xenobiotic detoxification, mycelium development, or those related to the formation of infection structures in plant tissues. The effectiveness of the Trichoderma HDO microarray to detect different gene responses under different growth conditions in the fungus T. harzianum strongly indicates that this tool should be useful for further assays that include different stages of plant colonization, as well as

  19. Comparative RNA-Seq and microarray analysis of gene expression changes in B-cell lymphomas of Canis familiaris.

    Directory of Open Access Journals (Sweden)

    Marie Mooney

    Full Text Available Comparative oncology is a developing research discipline that is being used to assist our understanding of human neoplastic diseases. Companion canines are a preferred animal oncology model due to spontaneous tumor development and similarity to human disease at the pathophysiological level. We use a paired RNA sequencing (RNA-Seq/microarray analysis of a set of four normal canine lymph nodes and ten canine lymphoma fine needle aspirates to identify technical biases and variation between the technologies and convergence on biological disease pathways. Surrogate Variable Analysis (SVA provides a formal multivariate analysis of the combined RNA-Seq/microarray data set. Applying SVA to the data allows us to decompose variation into contributions associated with transcript abundance, differences between the technology, and latent variation within each technology. A substantial and highly statistically significant component of the variation reflects transcript abundance, and RNA-Seq appeared more sensitive for detection of transcripts expressed at low levels. Latent random variation among RNA-Seq samples is also distinct in character from that impacting microarray samples. In particular, we observed variation between RNA-Seq samples that reflects transcript GC content. Platform-independent variable decomposition without a priori knowledge of the sources of variation using SVA represents a generalizable method for accomplishing cross-platform data analysis. We identified genes differentially expressed between normal lymph nodes of disease free dogs and a subset of the diseased dogs diagnosed with B-cell lymphoma using each technology. There is statistically significant overlap between the RNA-Seq and microarray sets of differentially expressed genes. Analysis of overlapping genes in the context of biological systems suggests elevated expression and activity of PI3K signaling in B-cell lymphoma biopsies compared with normal biopsies, consistent with

  20. eSensor: an electrochemical detection-based DNA microarray technology enabling sample-to-answer molecular diagnostics

    Science.gov (United States)

    Liu, Robin H.; Longiaru, Mathew

    2009-05-01

    DNA microarrays are becoming a widespread tool used in life science and drug screening due to its many benefits of miniaturization and integration. Microarrays permit a highly multiplexed DNA analysis. Recently, the development of new detection methods and simplified methodologies has rapidly expanded the use of microarray technologies from predominantly gene expression analysis into the arena of diagnostics. Osmetech's eSensor® is an electrochemical detection platform based on a low-to- medium density DNA hybridization array on a cost-effective printed circuit board substrate. eSensor® has been cleared by FDA for Warfarin sensitivity test and Cystic Fibrosis Carrier Detection. Other genetic-based diagnostic and infectious disease detection tests are under development. The eSensor® platform eliminates the need for an expensive laser-based optical system and fluorescent reagents. It allows one to perform hybridization and detection in a single and small instrument without any fluidic processing and handling. Furthermore, the eSensor® platform is readily adaptable to on-chip sample-to-answer genetic analyses using microfluidics technology. The eSensor® platform provides a cost-effective solution to direct sample-to-answer genetic analysis, and thus have a potential impact in the fields of point-of-care genetic analysis, environmental testing, and biological warfare agent detection.

  1. Improving cluster-based missing value estimation of DNA microarray data.

    Science.gov (United States)

    Brás, Lígia P; Menezes, José C

    2007-06-01

    We present a modification of the weighted K-nearest neighbours imputation method (KNNimpute) for missing values (MVs) estimation in microarray data based on the reuse of estimated data. The method was called iterative KNN imputation (IKNNimpute) as the estimation is performed iteratively using the recently estimated values. The estimation efficiency of IKNNimpute was assessed under different conditions (data type, fraction and structure of missing data) by the normalized root mean squared error (NRMSE) and the correlation coefficients between estimated and true values, and compared with that of other cluster-based estimation methods (KNNimpute and sequential KNN). We further investigated the influence of imputation on the detection of differentially expressed genes using SAM by examining the differentially expressed genes that are lost after MV estimation. The performance measures give consistent results, indicating that the iterative procedure of IKNNimpute can enhance the prediction ability of cluster-based methods in the presence of high missing rates, in non-time series experiments and in data sets comprising both time series and non-time series data, because the information of the genes having MVs is used more efficiently and the iterative procedure allows refining the MV estimates. More importantly, IKNN has a smaller detrimental effect on the detection of differentially expressed genes.

  2. ArraySolver: An Algorithm for Colour-Coded Graphical Display and Wilcoxon Signed-Rank Statistics for Comparing Microarray Gene Expression Data

    OpenAIRE

    Khan, Haseeb Ahmad

    2004-01-01

    The massive surge in the production of microarray data poses a great challenge for proper analysis and interpretation. In recent years numerous computational tools have been developed to extract meaningful interpretation of microarray gene expression data. However, a convenient tool for two-groups comparison of microarray data is still lacking and users have to rely on commercial statistical packages that might be costly and require special skills, in addition to extra time and effort for tra...

  3. Broad spectrum microarray for fingerprint-based bacterial species identification

    Directory of Open Access Journals (Sweden)

    Frey Jürg E

    2010-02-01

    Full Text Available Abstract Background Microarrays are powerful tools for DNA-based molecular diagnostics and identification of pathogens. Most target a limited range of organisms and are based on only one or a very few genes for specific identification. Such microarrays are limited to organisms for which specific probes are available, and often have difficulty discriminating closely related taxa. We have developed an alternative broad-spectrum microarray that employs hybridisation fingerprints generated by high-density anonymous markers distributed over the entire genome for identification based on comparison to a reference database. Results A high-density microarray carrying 95,000 unique 13-mer probes was designed. Optimized methods were developed to deliver reproducible hybridisation patterns that enabled confident discrimination of bacteria at the species, subspecies, and strain levels. High correlation coefficients were achieved between replicates. A sub-selection of 12,071 probes, determined by ANOVA and class prediction analysis, enabled the discrimination of all samples in our panel. Mismatch probe hybridisation was observed but was found to have no effect on the discriminatory capacity of our system. Conclusions These results indicate the potential of our genome chip for reliable identification of a wide range of bacterial taxa at the subspecies level without laborious prior sequencing and probe design. With its high resolution capacity, our proof-of-principle chip demonstrates great potential as a tool for molecular diagnostics of broad taxonomic groups.

  4. Comparative analysis of gene expression by microarray analysis of male and female flowers of Asparagus officinalis.

    Science.gov (United States)

    Gao, Wu-Jun; Li, Shu-Fen; Zhang, Guo-Jun; Wang, Ning-Na; Deng, Chuan-Liang; Lu, Long-Dou

    2013-01-01

    To identify rapidly a number of genes probably involved in sex determination and differentiation of the dioecious plant Asparagus officinalis, gene expression profiles in early flower development for male and female plants were investigated by microarray assay with 8,665 probes. In total, 638 male-biased and 543 female-biased genes were identified. These genes with biased-expression for male and female were involved in a variety of processes associated with molecular functions, cellular components, and biological processes, suggesting that a complex mechanism underlies the sex development of asparagus. Among the differentially expressed genes involved in the reproductive process, a number of genes associated with floral development were identified. Reverse transcription-PCR was performed for validation, and the results were largely consistent with those obtained by microarray analysis. The findings of this study might contribute to understanding of the molecular mechanisms of sex determination and differentiation in dioecious asparagus and provide a foundation for further studies of this plant.

  5. ArraySolver: an algorithm for colour-coded graphical display and Wilcoxon signed-rank statistics for comparing microarray gene expression data.

    Science.gov (United States)

    Khan, Haseeb Ahmad

    2004-01-01

    The massive surge in the production of microarray data poses a great challenge for proper analysis and interpretation. In recent years numerous computational tools have been developed to extract meaningful interpretation of microarray gene expression data. However, a convenient tool for two-groups comparison of microarray data is still lacking and users have to rely on commercial statistical packages that might be costly and require special skills, in addition to extra time and effort for transferring data from one platform to other. Various statistical methods, including the t-test, analysis of variance, Pearson test and Mann-Whitney U test, have been reported for comparing microarray data, whereas the utilization of the Wilcoxon signed-rank test, which is an appropriate test for two-groups comparison of gene expression data, has largely been neglected in microarray studies. The aim of this investigation was to build an integrated tool, ArraySolver, for colour-coded graphical display and comparison of gene expression data using the Wilcoxon signed-rank test. The results of software validation showed similar outputs with ArraySolver and SPSS for large datasets. Whereas the former program appeared to be more accurate for 25 or fewer pairs (n < or = 25), suggesting its potential application in analysing molecular signatures that usually contain small numbers of genes. The main advantages of ArraySolver are easy data selection, convenient report format, accurate statistics and the familiar Excel platform.

  6. Microarray meta-analysis to explore abiotic stress-specific gene expression patterns in Arabidopsis.

    Science.gov (United States)

    Shen, Po-Chih; Hour, Ai-Ling; Liu, Li-Yu Daisy

    2017-12-01

    Abiotic stresses are the major limiting factors that affect plant growth, development, yield and final quality. Deciphering the underlying mechanisms of plants' adaptations to stresses using few datasets might overlook the different aspects of stress tolerance in plants, which might be simultaneously and consequently operated in the system. Fortunately, the accumulated microarray expression data offer an opportunity to infer abiotic stress-specific gene expression patterns through meta-analysis. In this study, we propose to combine microarray gene expression data under control, cold, drought, heat, and salt conditions and determined modules (gene sets) of genes highly associated with each other according to the observed expression data. By analyzing the expression variations of the Eigen genes from different conditions, we had identified two, three, and five gene modules as cold-, heat-, and salt-specific modules, respectively. Most of the cold- or heat-specific modules were differentially expressed to a particular degree in shoot samples, while most of the salt-specific modules were differentially expressed to a particular degree in root samples. A gene ontology (GO) analysis on the stress-specific modules suggested that the gene modules exclusively enriched stress-related GO terms and that different genes under the same GO terms may be alternatively disturbed in different conditions. The gene regulatory events for two genes, DREB1A and DEAR1, in the cold-specific gene module had also been validated, as evidenced through the literature search. Our protocols study the specificity of the gene modules that were specifically activated under a particular type of abiotic stress. The biplot can also assist to visualize the stress-specific gene modules. In conclusion, our approach has the potential to further elucidate mechanisms in plants and beneficial for future experiments design under different abiotic stresses.

  7. Prognostic value of matrix metalloproteinase 9 expression in patients with juvenile nasopharyngeal angiofibroma: tissue microarray analysis.

    Science.gov (United States)

    Sun, Xicai; Guo, Limin; Wang, Jingjing; Wang, Huan; Liu, Zhuofu; Liu, Juan; Yu, Huapeng; Hu, Li; Li, Han; Wang, Dehui

    2014-08-01

    Although JNA is a benign neoplasm histopathologically, it has a propensity for locally destructive growth and remains a higher postoperative recurrence rate. The aim of this study was to analyze the expression and localization of MMP-9 in JNA using tissue microarray to elucidate its correlation with clinicopathological features and recurrence. The expression of MMP-9 was assessed by immunohistochemistry in a tissue microarray from 70 patients with JNA and 10 control subjects. Correlation between the levels of MMP-9 expression and clinicopathologic variables, as well as tumor recurrence, were analyzed. MMP-9 was detected in perivascular and extravascular less differentiated cells and stromal cells of patients with JNA but not in the matured vascular endothelial cells of these patients. The presence of MMP-9 expression in JNA was correlated with patient's age (p=0.001). Spearman correlation analysis suggested that high expression of MMP-9 in JNA had negative correlation with patient's age (r=-0.412, p<0.001). The recurrence rate in JNA patients with high MMP-9 expression was significantly higher than those with low MMP-9 expression (p=0.002). In multivariate and ROC curve analysis, MMP-9 was a good prognostic factor for tumor recurrence of JNA. Higher MMP-9 expression is a poor prognostic factor for patients with JNA who have been surgically treated. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  8. A Reliable and Distributed LIMS for Efficient Management of the Microarray Experiment Environment

    Directory of Open Access Journals (Sweden)

    Jin Hee-Jeong

    2007-03-01

    Full Text Available A microarray is a principal technology in molecular biology. It generates thousands of expressions of genotypes at once. Typically, a microarray experiment contains many kinds of information, such as gene names, sequences, expression profiles, scanned images, and annotation. So, the organization and analysis of vast amounts of data are required. Microarray LIMS (Laboratory Information Management System provides data management, search, and basic analysis. Recently, microarray joint researches, such as the skeletal system disease and anti-cancer medicine have been widely conducted. This research requires data sharing among laboratories within the joint research group. In this paper, we introduce a web based microarray LIMS, SMILE (Small and solid MIcroarray Lims for Experimenters, especially for shared data management. The data sharing function of SMILE is based on Friend-to-Friend (F2F, which is based on anonymous P2P (Peer-to-Peer, in which people connect directly with their “friends”. It only allows its friends to exchange data directly using IP addresses or digital signatures you trust. In SMILE, there are two types of friends: “service provider”, which provides data, and “client”, which is provided with data. So, the service provider provides shared data only to its clients. SMILE provides useful functions for microarray experiments, such as variant data management, image analysis, normalization, system management, project schedule management, and shared data management. Moreover, it connections with two systems: ArrayMall for analyzing microarray images and GENAW for constructing a genetic network. SMILE is available on http://neobio.cs.pusan.ac.kr:8080/smile.

  9. Removing Batch Effects from Longitudinal Gene Expression - Quantile Normalization Plus ComBat as Best Approach for Microarray Transcriptome Data.

    Directory of Open Access Journals (Sweden)

    Christian Müller

    Full Text Available Technical variation plays an important role in microarray-based gene expression studies, and batch effects explain a large proportion of this noise. It is therefore mandatory to eliminate technical variation while maintaining biological variability. Several strategies have been proposed for the removal of batch effects, although they have not been evaluated in large-scale longitudinal gene expression data. In this study, we aimed at identifying a suitable method for batch effect removal in a large study of microarray-based longitudinal gene expression. Monocytic gene expression was measured in 1092 participants of the Gutenberg Health Study at baseline and 5-year follow up. Replicates of selected samples were measured at both time points to identify technical variability. Deming regression, Passing-Bablok regression, linear mixed models, non-linear models as well as ReplicateRUV and ComBat were applied to eliminate batch effects between replicates. In a second step, quantile normalization prior to batch effect correction was performed for each method. Technical variation between batches was evaluated by principal component analysis. Associations between body mass index and transcriptomes were calculated before and after batch removal. Results from association analyses were compared to evaluate maintenance of biological variability. Quantile normalization, separately performed in each batch, combined with ComBat successfully reduced batch effects and maintained biological variability. ReplicateRUV performed perfectly in the replicate data subset of the study, but failed when applied to all samples. All other methods did not substantially reduce batch effects in the replicate data subset. Quantile normalization plus ComBat appears to be a valuable approach for batch correction in longitudinal gene expression data.

  10. Evaluation of toxicity of the mycotoxin citrinin using yeast ORF DNA microarray and Oligo DNA microarray

    Directory of Open Access Journals (Sweden)

    Nobumasa Hitoshi

    2007-04-01

    Full Text Available Abstract Background Mycotoxins are fungal secondary metabolites commonly present in feed and food, and are widely regarded as hazardous contaminants. Citrinin, one of the very well known mycotoxins that was first isolated from Penicillium citrinum, is produced by more than 10 kinds of fungi, and is possibly spread all over the world. However, the information on the action mechanism of the toxin is limited. Thus, we investigated the citrinin-induced genomic response for evaluating its toxicity. Results Citrinin inhibited growth of yeast cells at a concentration higher than 100 ppm. We monitored the citrinin-induced mRNA expression profiles in yeast using the ORF DNA microarray and Oligo DNA microarray, and the expression profiles were compared with those of the other stress-inducing agents. Results obtained from both microarray experiments clustered together, but were different from those of the mycotoxin patulin. The oxidative stress response genes – AADs, FLR1, OYE3, GRE2, and MET17 – were significantly induced. In the functional category, expression of genes involved in "metabolism", "cell rescue, defense and virulence", and "energy" were significantly activated. In the category of "metabolism", genes involved in the glutathione synthesis pathway were activated, and in the category of "cell rescue, defense and virulence", the ABC transporter genes were induced. To alleviate the induced stress, these cells might pump out the citrinin after modification with glutathione. While, the citrinin treatment did not induce the genes involved in the DNA repair. Conclusion Results from both microarray studies suggest that citrinin treatment induced oxidative stress in yeast cells. The genotoxicity was less severe than the patulin, suggesting that citrinin is less toxic than patulin. The reproducibility of the expression profiles was much better with the Oligo DNA microarray. However, the Oligo DNA microarray did not completely overcome cross

  11. Principles of gene microarray data analysis.

    Science.gov (United States)

    Mocellin, Simone; Rossi, Carlo Riccardo

    2007-01-01

    The development of several gene expression profiling methods, such as comparative genomic hybridization (CGH), differential display, serial analysis of gene expression (SAGE), and gene microarray, together with the sequencing of the human genome, has provided an opportunity to monitor and investigate the complex cascade of molecular events leading to tumor development and progression. The availability of such large amounts of information has shifted the attention of scientists towards a nonreductionist approach to biological phenomena. High throughput technologies can be used to follow changing patterns of gene expression over time. Among them, gene microarray has become prominent because it is easier to use, does not require large-scale DNA sequencing, and allows for the parallel quantification of thousands of genes from multiple samples. Gene microarray technology is rapidly spreading worldwide and has the potential to drastically change the therapeutic approach to patients affected with tumor. Therefore, it is of paramount importance for both researchers and clinicians to know the principles underlying the analysis of the huge amount of data generated with microarray technology.

  12.  DNA microarray-based gene expression profiling in diagnosis, assessing prognosis and predicting response to therapy in colorectal cancer

    Directory of Open Access Journals (Sweden)

    Przemysław Kwiatkowski

    2012-06-01

    Full Text Available  Colorectal cancer is the most common cancer of the gastrointestinal tract. It is considered as a biological model of a certain type of cancerogenesis process in which progression from an early to late stage adenoma and cancer is accompanied by distinct genetic alterations.Clinical and pathological parameters commonly used in clinical practice are often insufficient to determine groups of patients suitable for personalized treatment. Moreover, reliable molecular markers with high prognostic value have not yet been determined. Molecular studies using DNA-based microarrays have identified numerous genes involved in cell proliferation and differentiation during the process of cancerogenesis. Assessment of the genetic profile of colorectal cancer using the microarray technique might be a useful tool in determining the groups of patients with different clinical outcomes who would benefit from additional personalized treatment.The main objective of this study was to present the current state of knowledge on the practical application of gene profiling techniques using microarrays for determining diagnosis, prognosis and response to treatment in colorectal cancer.

  13. Expression microarray identifies the unliganded glucocorticoid receptor as a regulator of gene expression in mammary epithelial cells

    International Nuclear Information System (INIS)

    Ritter, Heather D; Mueller, Christopher R

    2014-01-01

    While glucocorticoids and the liganded glucocorticoid receptor (GR) have a well-established role in the maintenance of differentiation and suppression of apoptosis in breast tissue, the involvement of unliganded GR in cellular processes is less clear. Our previous studies implicated unliganded GR as a positive regulator of the BRCA1 tumour suppressor gene in the absence of glucocorticoid hormone, which suggested it could play a similar role in the regulation of other genes. An shRNA vector directed against GR was used to create mouse mammary cell lines with depleted endogenous levels of this receptor in order to further characterize the role of GR in breast cells. An expression microarray screen for targets of unliganded GR was performed using our GR-depleted cell lines maintained in the absence of glucocorticoids. Candidate genes positively regulated by unliganded GR were identified, classified by Gene Ontology and Ingenuity Pathway Analysis, and validated using quantitative real-time reverse transcriptase PCR. Chromatin immunoprecipitation and dual luciferase expression assays were conducted to further investigate the mechanism through which unliganded GR regulates these genes. Expression microarray analysis revealed 260 targets negatively regulated and 343 targets positively regulated by unliganded GR. A number of the positively regulated targets were involved in pro-apoptotic networks, possibly opposing the activity of liganded GR targets. Validation and further analysis of five candidates from the microarray indicated that two of these, Hsd11b1 and Ch25h, were regulated by unliganded GR in a manner similar to Brca1 during glucocorticoid treatment. Furthermore, GR was shown to interact directly with and upregulate the Ch25h promoter in the absence, but not the presence, of hydrocortisone (HC), confirming our previously described model of gene regulation by unliganded GR. This work presents the first identification of targets of unliganded GR. We propose that

  14. Gene Expression Profiling and Identification of Resistance Genes to Aspergillus flavus Infection in Peanut through EST and Microarray Strategies

    Directory of Open Access Journals (Sweden)

    Baozhu Guo

    2011-06-01

    Full Text Available Aspergillus flavus and A. parasiticus infect peanut seeds and produce aflatoxins, which are associated with various diseases in domestic animals and humans throughout the world. The most cost-effective strategy to minimize aflatoxin contamination involves the development of peanut cultivars that are resistant to fungal infection and/or aflatoxin production. To identify peanut Aspergillus-interactive and peanut Aspergillus-resistance genes, we carried out a large scale peanut Expressed Sequence Tag (EST project which we used to construct a peanut glass slide oligonucleotide microarray. The fabricated microarray represents over 40% of the protein coding genes in the peanut genome. For expression profiling, resistant and susceptible peanut cultivars were infected with a mixture of Aspergillus flavus and parasiticus spores. The subsequent microarray analysis identified 62 genes in resistant cultivars that were up-expressed in response to Aspergillus infection. In addition, we identified 22 putative Aspergillus-resistance genes that were constitutively up-expressed in the resistant cultivar in comparison to the susceptible cultivar. Some of these genes were homologous to peanut, corn, and soybean genes that were previously shown to confer resistance to fungal infection. This study is a first step towards a comprehensive genome-scale platform for developing Aspergillus-resistant peanut cultivars through targeted marker-assisted breeding and genetic engineering.

  15. Genome-scale cluster analysis of replicated microarrays using shrinkage correlation coefficient.

    Science.gov (United States)

    Yao, Jianchao; Chang, Chunqi; Salmi, Mari L; Hung, Yeung Sam; Loraine, Ann; Roux, Stanley J

    2008-06-18

    Currently, clustering with some form of correlation coefficient as the gene similarity metric has become a popular method for profiling genomic data. The Pearson correlation coefficient and the standard deviation (SD)-weighted correlation coefficient are the two most widely-used correlations as the similarity metrics in clustering microarray data. However, these two correlations are not optimal for analyzing replicated microarray data generated by most laboratories. An effective correlation coefficient is needed to provide statistically sufficient analysis of replicated microarray data. In this study, we describe a novel correlation coefficient, shrinkage correlation coefficient (SCC), that fully exploits the similarity between the replicated microarray experimental samples. The methodology considers both the number of replicates and the variance within each experimental group in clustering expression data, and provides a robust statistical estimation of the error of replicated microarray data. The value of SCC is revealed by its comparison with two other correlation coefficients that are currently the most widely-used (Pearson correlation coefficient and SD-weighted correlation coefficient) using statistical measures on both synthetic expression data as well as real gene expression data from Saccharomyces cerevisiae. Two leading clustering methods, hierarchical and k-means clustering were applied for the comparison. The comparison indicated that using SCC achieves better clustering performance. Applying SCC-based hierarchical clustering to the replicated microarray data obtained from germinating spores of the fern Ceratopteris richardii, we discovered two clusters of genes with shared expression patterns during spore germination. Functional analysis suggested that some of the genetic mechanisms that control germination in such diverse plant lineages as mosses and angiosperms are also conserved among ferns. This study shows that SCC is an alternative to the Pearson

  16. Mining meiosis and gametogenesis with DNA microarrays.

    Science.gov (United States)

    Schlecht, Ulrich; Primig, Michael

    2003-04-01

    Gametogenesis is a key developmental process that involves complex transcriptional regulation of numerous genes including many that are conserved between unicellular eukaryotes and mammals. Recent expression-profiling experiments using microarrays have provided insight into the co-ordinated transcription of several hundred genes during mitotic growth and meiotic development in budding and fission yeast. Furthermore, microarray-based studies have identified numerous loci that are regulated during the cell cycle or expressed in a germ-cell specific manner in eukaryotic model systems like Caenorhabditis elegans, Mus musculus as well as Homo sapiens. The unprecedented amount of information produced by post-genome biology has spawned novel approaches to organizing biological knowledge using currently available information technology. This review outlines experiments that contribute to an emerging comprehensive picture of the molecular machinery governing sexual reproduction in eukaryotes.

  17. Application of four dyes in gene expression analyses by microarrays

    Directory of Open Access Journals (Sweden)

    van Schooten Frederik J

    2005-07-01

    Full Text Available Abstract Background DNA microarrays are widely used in gene expression analyses. To increase throughput and minimize costs without reducing gene expression data obtained, we investigated whether four mRNA samples can be analyzed simultaneously by applying four different fluorescent dyes. Results Following tests for cross-talk of fluorescence signals, Alexa 488, Alexa 594, Cyanine 3 and Cyanine 5 were selected for hybridizations. For self-hybridizations, a single RNA sample was labelled with all dyes and hybridized on commercial cDNA arrays or on in-house spotted oligonucleotide arrays. Correlation coefficients for all combinations of dyes were above 0.9 on the cDNA array. On the oligonucleotide array they were above 0.8, except combinations with Alexa 488, which were approximately 0.5. Standard deviation of expression differences for replicate spots were similar on the cDNA array for all dye combinations, but on the oligonucleotide array combinations with Alexa 488 showed a higher variation. Conclusion In conclusion, the four dyes can be used simultaneously for gene expression experiments on the tested cDNA array, but only three dyes can be used on the tested oligonucleotide array. This was confirmed by hybridizations of control with test samples, as all combinations returned similar numbers of differentially expressed genes with comparable effects on gene expression.

  18. Early Gene Expression in Wounded Human Keratinocytes Revealed by DNA Microarray Analysis

    Directory of Open Access Journals (Sweden)

    Pascal Barbry

    2006-04-01

    Full Text Available Wound healing involves several steps: spreading of the cells, migration and proliferation. We have profiled gene expression during the early events of wound healing in normal human keratinocytes with a home-made DNA microarray containing about 1000 relevant human probes. An original wounding machine was used, that allows the wounding of up to 40% of the surface of a confluent monolayer of cultured cells grown on a Petri dish (compared with 5% with a classical ‘scratch’ method. The two aims of the present study were: (a to validate a limited number of genes by comparing the expression levels obtained with this technique with those found in the literature; (b to combine the use of the wounding machine with DNA microarray analysis for large-scale detection of the molecular events triggered during the early stages of the wound-healing process. The time-courses of RNA expression observed at 0.5, 1.5, 3, 6 and 15 h after wounding for genes such as c-Fos, c-Jun, Egr1, the plasminogen activator PLAU (uPA and the signal transducer and transcription activator STAT3, were consistent with previously published data. This suggests that our methodologies are able to perform quantitative measurement of gene expression. Transcripts encoding two zinc finger proteins, ZFP36 and ZNF161, and the tumour necrosis factor α-induced protein TNFAIP3, were also overexpressed after wounding. The role of the p38 mitogen-activated protein kinase (p38MAPK in wound healing was shown after the inhibition of p38 by SB203580, but our results also suggest the existence of surrogate activating pathways.

  19. Clustering gene expression data based on predicted differential effects of GV interaction.

    Science.gov (United States)

    Pan, Hai-Yan; Zhu, Jun; Han, Dan-Fu

    2005-02-01

    Microarray has become a popular biotechnology in biological and medical research. However, systematic and stochastic variabilities in microarray data are expected and unavoidable, resulting in the problem that the raw measurements have inherent "noise" within microarray experiments. Currently, logarithmic ratios are usually analyzed by various clustering methods directly, which may introduce bias interpretation in identifying groups of genes or samples. In this paper, a statistical method based on mixed model approaches was proposed for microarray data cluster analysis. The underlying rationale of this method is to partition the observed total gene expression level into various variations caused by different factors using an ANOVA model, and to predict the differential effects of GV (gene by variety) interaction using the adjusted unbiased prediction (AUP) method. The predicted GV interaction effects can then be used as the inputs of cluster analysis. We illustrated the application of our method with a gene expression dataset and elucidated the utility of our approach using an external validation.

  20. A microarray-based genotyping and genetic mapping approach for highly heterozygous outcrossing species enables localization of a large fraction of the unassembled Populus trichocarpa genome sequence.

    Science.gov (United States)

    Drost, Derek R; Novaes, Evandro; Boaventura-Novaes, Carolina; Benedict, Catherine I; Brown, Ryan S; Yin, Tongming; Tuskan, Gerald A; Kirst, Matias

    2009-06-01

    Microarrays have demonstrated significant power for genome-wide analyses of gene expression, and recently have also revolutionized the genetic analysis of segregating populations by genotyping thousands of loci in a single assay. Although microarray-based genotyping approaches have been successfully applied in yeast and several inbred plant species, their power has not been proven in an outcrossing species with extensive genetic diversity. Here we have developed methods for high-throughput microarray-based genotyping in such species using a pseudo-backcross progeny of 154 individuals of Populus trichocarpa and P. deltoides analyzed with long-oligonucleotide in situ-synthesized microarray probes. Our analysis resulted in high-confidence genotypes for 719 single-feature polymorphism (SFP) and 1014 gene expression marker (GEM) candidates. Using these genotypes and an established microsatellite (SSR) framework map, we produced a high-density genetic map comprising over 600 SFPs, GEMs and SSRs. The abundance of gene-based markers allowed us to localize over 35 million base pairs of previously unplaced whole-genome shotgun (WGS) scaffold sequence to putative locations in the genome of P. trichocarpa. A high proportion of sampled scaffolds could be verified for their placement with independently mapped SSRs, demonstrating the previously un-utilized power that high-density genotyping can provide in the context of map-based WGS sequence reassembly. Our results provide a substantial contribution to the continued improvement of the Populus genome assembly, while demonstrating the feasibility of microarray-based genotyping in a highly heterozygous population. The strategies presented are applicable to genetic mapping efforts in all plant species with similarly high levels of genetic diversity.

  1. Analysis of baseline and cisplatin-inducible gene expression in Fanconi anemia cells using oligonucleotide-based microarrays

    Directory of Open Access Journals (Sweden)

    Liu Johnson M

    2002-11-01

    Full Text Available Abstract Background Patients with Fanconi anemia (FA suffer from multiple defects, most notably of the hematological compartment (bone marrow failure, and susceptibility to cancer. Cells from FA patients show increased spontaneous chromosomal damage, which is aggravated by exposure to low concentrations of DNA cross-linking agents such as mitomycin C or cisplatin. Five of the identified FA proteins form a nuclear core complex. However, the molecular function of these proteins remains obscure. Methods Oligonucleotide microarrays were used to compare the expression of approximately 12,000 genes from FA cells with matched controls. Expression profiles were studied in lymphoblastoid cell lines derived from three different FA patients, one from the FA-A and two from the FA-C complementation groups. The isogenic control cell lines were obtained by either transfecting the cells with vectors expressing the complementing cDNAs or by using a spontaneous revertant cell line derived from the same patient. In addition, we analyzed expression profiles from two cell line couples at several time points after a 1-hour pulse treatment with a discriminating dose of cisplatin. Results Analysis of the expression profiles showed differences in expression of a number of genes, many of which have unknown function or are difficult to relate to the FA defect. However, from a selected number of proteins involved in cell cycle regulation, DNA repair and chromatin structure, Western blot analysis showed that p21waf1/Cip1 was significantly upregulated after low dose cisplatin treatment in FA cells specifically (as well as being expressed at elevated levels in untreated FA cells. Conclusions The observed increase in expression of p21waf1/Cip1 after treatment of FA cells with crosslinkers suggests that the sustained elevated levels of p21waf1/Cip1 in untreated FA cells detected by Western blot analysis likely reflect increased spontaneous damage in these cells.

  2. Plant-pathogen interactions: what microarray tells about it?

    Science.gov (United States)

    Lodha, T D; Basak, J

    2012-01-01

    Plant defense responses are mediated by elementary regulatory proteins that affect expression of thousands of genes. Over the last decade, microarray technology has played a key role in deciphering the underlying networks of gene regulation in plants that lead to a wide variety of defence responses. Microarray is an important tool to quantify and profile the expression of thousands of genes simultaneously, with two main aims: (1) gene discovery and (2) global expression profiling. Several microarray technologies are currently in use; most include a glass slide platform with spotted cDNA or oligonucleotides. Till date, microarray technology has been used in the identification of regulatory genes, end-point defence genes, to understand the signal transduction processes underlying disease resistance and its intimate links to other physiological pathways. Microarray technology can be used for in-depth, simultaneous profiling of host/pathogen genes as the disease progresses from infection to resistance/susceptibility at different developmental stages of the host, which can be done in different environments, for clearer understanding of the processes involved. A thorough knowledge of plant disease resistance using successful combination of microarray and other high throughput techniques, as well as biochemical, genetic, and cell biological experiments is needed for practical application to secure and stabilize yield of many crop plants. This review starts with a brief introduction to microarray technology, followed by the basics of plant-pathogen interaction, the use of DNA microarrays over the last decade to unravel the mysteries of plant-pathogen interaction, and ends with the future prospects of this technology.

  3. Fast gene ontology based clustering for microarray experiments.

    Science.gov (United States)

    Ovaska, Kristian; Laakso, Marko; Hautaniemi, Sampsa

    2008-11-21

    Analysis of a microarray experiment often results in a list of hundreds of disease-associated genes. In order to suggest common biological processes and functions for these genes, Gene Ontology annotations with statistical testing are widely used. However, these analyses can produce a very large number of significantly altered biological processes. Thus, it is often challenging to interpret GO results and identify novel testable biological hypotheses. We present fast software for advanced gene annotation using semantic similarity for Gene Ontology terms combined with clustering and heat map visualisation. The methodology allows rapid identification of genes sharing the same Gene Ontology cluster. Our R based semantic similarity open-source package has a speed advantage of over 2000-fold compared to existing implementations. From the resulting hierarchical clustering dendrogram genes sharing a GO term can be identified, and their differences in the gene expression patterns can be seen from the heat map. These methods facilitate advanced annotation of genes resulting from data analysis.

  4. FiGS: a filter-based gene selection workbench for microarray data

    Directory of Open Access Journals (Sweden)

    Yun Taegyun

    2010-01-01

    Full Text Available Abstract Background The selection of genes that discriminate disease classes from microarray data is widely used for the identification of diagnostic biomarkers. Although various gene selection methods are currently available and some of them have shown excellent performance, no single method can retain the best performance for all types of microarray datasets. It is desirable to use a comparative approach to find the best gene selection result after rigorous test of different methodological strategies for a given microarray dataset. Results FiGS is a web-based workbench that automatically compares various gene selection procedures and provides the optimal gene selection result for an input microarray dataset. FiGS builds up diverse gene selection procedures by aligning different feature selection techniques and classifiers. In addition to the highly reputed techniques, FiGS diversifies the gene selection procedures by incorporating gene clustering options in the feature selection step and different data pre-processing options in classifier training step. All candidate gene selection procedures are evaluated by the .632+ bootstrap errors and listed with their classification accuracies and selected gene sets. FiGS runs on parallelized computing nodes that capacitate heavy computations. FiGS is freely accessible at http://gexp.kaist.ac.kr/figs. Conclusion FiGS is an web-based application that automates an extensive search for the optimized gene selection analysis for a microarray dataset in a parallel computing environment. FiGS will provide both an efficient and comprehensive means of acquiring optimal gene sets that discriminate disease states from microarray datasets.

  5. Difference-based clustering of short time-course microarray data with replicates

    Directory of Open Access Journals (Sweden)

    Kim Jihoon

    2007-07-01

    Full Text Available Abstract Background There are some limitations associated with conventional clustering methods for short time-course gene expression data. The current algorithms require prior domain knowledge and do not incorporate information from replicates. Moreover, the results are not always easy to interpret biologically. Results We propose a novel algorithm for identifying a subset of genes sharing a significant temporal expression pattern when replicates are used. Our algorithm requires no prior knowledge, instead relying on an observed statistic which is based on the first and second order differences between adjacent time-points. Here, a pattern is predefined as the sequence of symbols indicating direction and the rate of change between time-points, and each gene is assigned to a cluster whose members share a similar pattern. We evaluated the performance of our algorithm to those of K-means, Self-Organizing Map and the Short Time-series Expression Miner methods. Conclusions Assessments using simulated and real data show that our method outperformed aforementioned algorithms. Our approach is an appropriate solution for clustering short time-course microarray data with replicates.

  6. A power law global error model for the identification of differentially expressed genes in microarray data

    Directory of Open Access Journals (Sweden)

    Granucci Francesca

    2004-12-01

    Full Text Available Abstract Background High-density oligonucleotide microarray technology enables the discovery of genes that are transcriptionally modulated in different biological samples due to physiology, disease or intervention. Methods for the identification of these so-called "differentially expressed genes" (DEG would largely benefit from a deeper knowledge of the intrinsic measurement variability. Though it is clear that variance of repeated measures is highly dependent on the average expression level of a given gene, there is still a lack of consensus on how signal reproducibility is linked to signal intensity. The aim of this study was to empirically model the variance versus mean dependence in microarray data to improve the performance of existing methods for identifying DEG. Results In the present work we used data generated by our lab as well as publicly available data sets to show that dispersion of repeated measures depends on location of the measures themselves following a power law. This enables us to construct a power law global error model (PLGEM that is applicable to various Affymetrix GeneChip data sets. A new DEG identification method is therefore proposed, consisting of a statistic designed to make explicit use of model-derived measurement spread estimates and a resampling-based hypothesis testing algorithm. Conclusions The new method provides a control of the false positive rate, a good sensitivity vs. specificity trade-off and consistent results with varying number of replicates and even using single samples.

  7. Microarrays in ecological research: A case study of a cDNA microarray for plant-herbivore interactions

    Directory of Open Access Journals (Sweden)

    Gase Klaus

    2004-09-01

    Full Text Available Abstract Background Microarray technology allows researchers to simultaneously monitor changes in the expression ratios (ERs of hundreds of genes and has thereby revolutionized most of biology. Although this technique has the potential of elucidating early stages in an organism's phenotypic response to complex ecological interactions, to date, it has not been fully incorporated into ecological research. This is partially due to a lack of simple procedures of handling and analyzing the expression ratio (ER data produced from microarrays. Results We describe an analysis of the sources of variation in ERs from 73 hybridized cDNA microarrays, each with 234 herbivory-elicited genes from the model ecological expression system, Nicotiana attenuata, using procedures that are commonly used in ecologic research. Each gene is represented by two independently labeled PCR products and each product was arrayed in quadruplicate. We present a robust method of normalizing and analyzing ERs based on arbitrary thresholds and statistical criteria, and characterize a "norm of reaction" of ERs for 6 genes (4 of known function, 2 of unknown with different ERs as determined across all analyzed arrays to provide a biologically-informed alternative to the use of arbitrary expression ratios in determining significance of expression. These gene-specific ERs and their variance (gene CV were used to calculate array-based variances (array CV, which, in turn, were used to study the effects of array age, probe cDNA quantity and quality, and quality of spotted PCR products as estimates of technical variation. Cluster analysis and a Principal Component Analysis (PCA were used to reveal associations among the transcriptional "imprints" of arrays hybridized with cDNA probes derived from mRNA from N. attenuata plants variously elicited and attacked by different herbivore species and from three congeners: N. quadrivalis, N. longiflora and N. clevelandii. Additionally, the PCA

  8. GENE EXPRESSION IN THE TESTES OF NORMOSPERMIC VERSUS TERATOSPERMIC DOMESTIC CATS USING HUMAN CDNA MICROARRAY ANALYSES

    Science.gov (United States)

    GENE EXPRESSION IN THE TESTES OF NORMOSPERMIC VERSUS TERATOSPERMIC DOMESTIC CATS USING HUMAN cDNA MICROARRAY ANALYSESB.S. Pukazhenthi1, J. C. Rockett2, M. Ouyang3, D.J. Dix2, J.G. Howard1, P. Georgopoulos4, W.J. J. Welsh3 and D. E. Wildt11Department of Reproductiv...

  9. GOBO: gene expression-based outcome for breast cancer online.

    Directory of Open Access Journals (Sweden)

    Markus Ringnér

    Full Text Available Microarray-based gene expression analysis holds promise of improving prognostication and treatment decisions for breast cancer patients. However, the heterogeneity of breast cancer emphasizes the need for validation of prognostic gene signatures in larger sample sets stratified into relevant subgroups. Here, we describe a multifunctional user-friendly online tool, GOBO (http://co.bmc.lu.se/gobo, allowing a range of different analyses to be performed in an 1881-sample breast tumor data set, and a 51-sample breast cancer cell line set, both generated on Affymetrix U133A microarrays. GOBO supports a wide range of applications including: 1 rapid assessment of gene expression levels in subgroups of breast tumors and cell lines, 2 identification of co-expressed genes for creation of potential metagenes, 3 association with outcome for gene expression levels of single genes, sets of genes, or gene signatures in multiple subgroups of the 1881-sample breast cancer data set. The design and implementation of GOBO facilitate easy incorporation of additional query functions and applications, as well as additional data sets irrespective of tumor type and array platform.

  10. SoFoCles: feature filtering for microarray classification based on gene ontology.

    Science.gov (United States)

    Papachristoudis, Georgios; Diplaris, Sotiris; Mitkas, Pericles A

    2010-02-01

    Marker gene selection has been an important research topic in the classification analysis of gene expression data. Current methods try to reduce the "curse of dimensionality" by using statistical intra-feature set calculations, or classifiers that are based on the given dataset. In this paper, we present SoFoCles, an interactive tool that enables semantic feature filtering in microarray classification problems with the use of external, well-defined knowledge retrieved from the Gene Ontology. The notion of semantic similarity is used to derive genes that are involved in the same biological path during the microarray experiment, by enriching a feature set that has been initially produced with legacy methods. Among its other functionalities, SoFoCles offers a large repository of semantic similarity methods that are used in order to derive feature sets and marker genes. The structure and functionality of the tool are discussed in detail, as well as its ability to improve classification accuracy. Through experimental evaluation, SoFoCles is shown to outperform other classification schemes in terms of classification accuracy in two real datasets using different semantic similarity computation approaches.

  11. The construction and use of bacterial DNA microarrays based on an optimized two-stage PCR strategy

    Directory of Open Access Journals (Sweden)

    Pesta David

    2003-06-01

    Full Text Available Abstract Background DNA microarrays are a powerful tool with important applications such as global gene expression profiling. Construction of bacterial DNA microarrays from genomic sequence data using a two-stage PCR amplification approach for the production of arrayed DNA is attractive because it allows, in principal, the continued re-amplification of DNA fragments and facilitates further utilization of the DNA fragments for additional uses (e.g. over-expression of protein. We describe the successful construction and use of DNA microarrays by the two-stage amplification approach and discuss the technical challenges that were met and resolved during the project. Results Chimeric primers that contained both gene-specific and shared, universal sequence allowed the two-stage amplification of the 3,168 genes identified on the genome of Synechocystis sp. PCC6803, an important prokaryotic model organism for the study of oxygenic photosynthesis. The gene-specific component of the primer was of variable length to maintain uniform annealing temperatures during the 1st round of PCR synthesis, and situated to preserve full-length ORFs. Genes were truncated at 2 kb for efficient amplification, so that about 92% of the PCR fragments were full-length genes. The two-stage amplification had the additional advantage of normalizing the yield of PCR products and this improved the uniformity of DNA features robotically deposited onto the microarray surface. We also describe the techniques utilized to optimize hybridization conditions and signal-to-noise ratio of the transcription profile. The inter-lab transportability was demonstrated by the virtual error-free amplification of the entire genome complement of 3,168 genes using the universal primers in partner labs. The printed slides have been successfully used to identify differentially expressed genes in response to a number of environmental conditions, including salt stress. Conclusions The technique detailed

  12. Network Expansion and Pathway Enrichment Analysis towards Biologically Significant Findings from Microarrays

    Directory of Open Access Journals (Sweden)

    Wu Xiaogang

    2012-06-01

    Full Text Available In many cases, crucial genes show relatively slight changes between groups of samples (e.g. normal vs. disease, and many genes selected from microarray differential analysis by measuring the expression level statistically are also poorly annotated and lack of biological significance. In this paper, we present an innovative approach - network expansion and pathway enrichment analysis (NEPEA for integrative microarray analysis. We assume that organized knowledge will help microarray data analysis in significant ways, and the organized knowledge could be represented as molecular interaction networks or biological pathways. Based on this hypothesis, we develop the NEPEA framework based on network expansion from the human annotated and predicted protein interaction (HAPPI database, and pathway enrichment from the human pathway database (HPD. We use a recently-published microarray dataset (GSE24215 related to insulin resistance and type 2 diabetes (T2D as case study, since this study provided a thorough experimental validation for both genes and pathways identified computationally from classical microarray analysis and pathway analysis. We perform our NEPEA analysis for this dataset based on the results from the classical microarray analysis to identify biologically significant genes and pathways. Our findings are not only consistent with the original findings mostly, but also obtained more supports from other literatures.

  13. A Discrete Wavelet Based Feature Extraction and Hybrid Classification Technique for Microarray Data Analysis

    Directory of Open Access Journals (Sweden)

    Jaison Bennet

    2014-01-01

    Full Text Available Cancer classification by doctors and radiologists was based on morphological and clinical features and had limited diagnostic ability in olden days. The recent arrival of DNA microarray technology has led to the concurrent monitoring of thousands of gene expressions in a single chip which stimulates the progress in cancer classification. In this paper, we have proposed a hybrid approach for microarray data classification based on nearest neighbor (KNN, naive Bayes, and support vector machine (SVM. Feature selection prior to classification plays a vital role and a feature selection technique which combines discrete wavelet transform (DWT and moving window technique (MWT is used. The performance of the proposed method is compared with the conventional classifiers like support vector machine, nearest neighbor, and naive Bayes. Experiments have been conducted on both real and benchmark datasets and the results indicate that the ensemble approach produces higher classification accuracy than conventional classifiers. This paper serves as an automated system for the classification of cancer and can be applied by doctors in real cases which serve as a boon to the medical community. This work further reduces the misclassification of cancers which is highly not allowed in cancer detection.

  14. Uso de microarrays na busca de perfis de expressão gênica: aplicação no estudo de fenótipos complexos Use of microarrays in the search of gene expression patterns: application to the study of complex phenotypes

    Directory of Open Access Journals (Sweden)

    Camila Guindalini

    2007-12-01

    Full Text Available Com o advento do seqüenciamento de genoma humano, novas tecnologias foram desenvolvidas e despontaram como promissoras ferramentas metodológicas e científicas para o avanço na compreensão dos mecanismos envolvidos em várias doenças complexas. Dentre elas, a técnica de análise em larga escala (conhecida como microarrays ou chips de DNA é particularmente eficaz em permitir uma visão global na busca de padrões de expressão gênica em amostras biológicas. Por meio da determinação da expressão de milhares de genes simultaneamente, a promissora tecnologia permite que pesquisadores comparem o comportamento molecular de diversos tipos de linhagens celulares e tecidos diferentes, quando expostos a uma determinada condição patológica ou experimental. A aplicação do método pode trazer novas perspectivas de análise de processos fisiológicos e facilitar a identificação de marcadores moleculares para o diagnóstico, prognóstico e para o tratamento farmacológico atual. Nesse artigo, apresentaremos conceitos teóricos e metodológicos que permeiam a tecnologia de microarrays, assim como suas vantagens, perspectivas e direcionamentos futuros. Com o intuito de exemplificar sua aplicabilidade e eficiência no estudo de fenômenos complexos, serão apresentados e também discutidos resultados iniciais sobre padrões de expressão gênica em amostra de cérebros post-mortem de pacientes psiquiátricos e sobre as conseqüências moleculares e funcionais de perturbações no sono, comumente associadas a transtornos psiquiátricos.Sequencing the human genome has prompted the development of new technologies, which have emerged as promising methodological and scientific tools for advancing the current knowledge about the causes and mechanisms involved in various complex disorders. Among those, the high-throughput technique known as microarray is particularly powerful in providing a global view of gene expression patterns in biological samples

  15. Microarray-based ultra-high resolution discovery of genomic deletion mutations

    Science.gov (United States)

    2014-01-01

    Background Oligonucleotide microarray-based comparative genomic hybridization (CGH) offers an attractive possible route for the rapid and cost-effective genome-wide discovery of deletion mutations. CGH typically involves comparison of the hybridization intensities of genomic DNA samples with microarray chip representations of entire genomes, and has widespread potential application in experimental research and medical diagnostics. However, the power to detect small deletions is low. Results Here we use a graduated series of Arabidopsis thaliana genomic deletion mutations (of sizes ranging from 4 bp to ~5 kb) to optimize CGH-based genomic deletion detection. We show that the power to detect smaller deletions (4, 28 and 104 bp) depends upon oligonucleotide density (essentially the number of genome-representative oligonucleotides on the microarray chip), and determine the oligonucleotide spacings necessary to guarantee detection of deletions of specified size. Conclusions Our findings will enhance a wide range of research and clinical applications, and in particular will aid in the discovery of genomic deletions in the absence of a priori knowledge of their existence. PMID:24655320

  16. Microarray Analysis of Gene Expression Alteration in Human Middle Ear Epithelial Cells Induced by Asian Sand Dust.

    Science.gov (United States)

    Go, Yoon Young; Park, Moo Kyun; Kwon, Jee Young; Seo, Young Rok; Chae, Sung-Won; Song, Jae-Jun

    2015-12-01

    The primary aim of this study is to evaluate the gene expression profile of Asian sand dust (ASD)-treated human middle ear epithelial cell (HMEEC) using microarray analysis. The HMEEC was treated with ASD (400 µg/mL) and total RNA was extracted for microarray analysis. Molecular pathways among differentially expressed genes were further analyzed. For selected genes, the changes in gene expression were confirmed by real-time polymerase chain reaction. A total of 1,274 genes were differentially expressed by ASD. Among them, 1,138 genes were 2 folds up-regulated, whereas 136 genes were 2 folds down-regulated. Up-regulated genes were mainly involved in cellular processes, including apoptosis, cell differentiation, and cell proliferation. Down-regulated genes affected cellular processes, including apoptosis, cell cycle, cell differentiation, and cell proliferation. The 10 genes including ADM, CCL5, EDN1, EGR1, FOS, GHRL, JUN, SOCS3, TNF, and TNFSF10 were identified as main modulators in up-regulated genes. A total of 11 genes including CSF3, DKK1, FOSL1, FST, TERT, MMP13, PTHLH, SPRY2, TGFBR2, THBS1, and TIMP1 acted as main components of pathway associated with 2-fold down regulated genes. We identified the differentially expressed genes in ASD-treated HMEEC. Our work indicates that air pollutant like ASD, may play an important role in the pathogenesis of otitis media.

  17. "Harshlighting" small blemishes on microarrays

    Directory of Open Access Journals (Sweden)

    Wittkowski Knut M

    2005-03-01

    Full Text Available Abstract Background Microscopists are familiar with many blemishes that fluorescence images can have due to dust and debris, glass flaws, uneven distribution of fluids or surface coatings, etc. Microarray scans show similar artefacts, which affect the analysis, particularly when one tries to detect subtle changes. However, most blemishes are hard to find by the unaided eye, particularly in high-density oligonucleotide arrays (HDONAs. Results We present a method that harnesses the statistical power provided by having several HDONAs available, which are obtained under similar conditions except for the experimental factor. This method "harshlights" blemishes and renders them evident. We find empirically that about 25% of our chips are blemished, and we analyze the impact of masking them on screening for differentially expressed genes. Conclusion Experiments attempting to assess subtle expression changes should be carefully screened for blemishes on the chips. The proposed method provides investigators with a novel robust approach to improve the sensitivity of microarray analyses. By utilizing topological information to identify and mask blemishes prior to model based analyses, the method prevents artefacts from confounding the process of background correction, normalization, and summarization.

  18. Unsupervised Bayesian linear unmixing of gene expression microarrays.

    Science.gov (United States)

    Bazot, Cécile; Dobigeon, Nicolas; Tourneret, Jean-Yves; Zaas, Aimee K; Ginsburg, Geoffrey S; Hero, Alfred O

    2013-03-19

    This paper introduces a new constrained model and the corresponding algorithm, called unsupervised Bayesian linear unmixing (uBLU), to identify biological signatures from high dimensional assays like gene expression microarrays. The basis for uBLU is a Bayesian model for the data samples which are represented as an additive mixture of random positive gene signatures, called factors, with random positive mixing coefficients, called factor scores, that specify the relative contribution of each signature to a specific sample. The particularity of the proposed method is that uBLU constrains the factor loadings to be non-negative and the factor scores to be probability distributions over the factors. Furthermore, it also provides estimates of the number of factors. A Gibbs sampling strategy is adopted here to generate random samples according to the posterior distribution of the factors, factor scores, and number of factors. These samples are then used to estimate all the unknown parameters. Firstly, the proposed uBLU method is applied to several simulated datasets with known ground truth and compared with previous factor decomposition methods, such as principal component analysis (PCA), non negative matrix factorization (NMF), Bayesian factor regression modeling (BFRM), and the gradient-based algorithm for general matrix factorization (GB-GMF). Secondly, we illustrate the application of uBLU on a real time-evolving gene expression dataset from a recent viral challenge study in which individuals have been inoculated with influenza A/H3N2/Wisconsin. We show that the uBLU method significantly outperforms the other methods on the simulated and real data sets considered here. The results obtained on synthetic and real data illustrate the accuracy of the proposed uBLU method when compared to other factor decomposition methods from the literature (PCA, NMF, BFRM, and GB-GMF). The uBLU method identifies an inflammatory component closely associated with clinical symptom scores

  19. Gene Expression Analysis Using Agilent DNA Microarrays

    DEFF Research Database (Denmark)

    Stangegaard, Michael

    2009-01-01

    Hybridization of labeled cDNA to microarrays is an intuitively simple and a vastly underestimated process. If it is not performed, optimized, and standardized with the same attention to detail as e.g., RNA amplification, information may be overlooked or even lost. Careful balancing of the amount ...

  20. Systematic gene microarray analysis of the lncRNA expression profiles in human uterine cervix carcinoma.

    Science.gov (United States)

    Chen, Jie; Fu, Ziyi; Ji, Chenbo; Gu, Pingqing; Xu, Pengfei; Yu, Ningzhu; Kan, Yansheng; Wu, Xiaowei; Shen, Rong; Shen, Yan

    2015-05-01

    The human uterine cervix carcinoma is one of the most well-known malignancy reproductive system cancers, which threatens women health globally. However, the mechanisms of the oncogenesis and development process of cervix carcinoma are not yet fully understood. Long non-coding RNAs (lncRNAs) have been proved to play key roles in various biological processes, especially development of cancer. The function and mechanism of lncRNAs on cervix carcinoma is still rarely reported. We selected 3 cervix cancer and normal cervix tissues separately, then performed lncRNA microarray to detect the differentially expressed lncRNAs. Subsequently, we explored the potential function of these dysregulated lncRNAs through online bioinformatics databases. Finally, quantity real-time PCR was carried out to confirm the expression levels of these dysregulated lncRNAs in cervix cancer and normal tissues. We uncovered the profiles of differentially expressed lncRNAs between normal and cervix carcinoma tissues by using the microarray techniques, and found 1622 upregulated and 3026 downregulated lncRNAs (fold-change>2.0) in cervix carcinoma compared to the normal cervical tissue. Furthermore, we found HOXA11-AS might participate in cervix carcinogenesis by regulating HOXA11, which is involved in regulating biological processes of cervix cancer. This study afforded expression profiles of lncRNAs between cervix carcinoma tissue and normal cervical tissue, which could provide database for further research about the function and mechanism of key-lncRNAs in cervix carcinoma, and might be helpful to explore potential diagnosis factors and therapeutic targets for cervix carcinoma. Copyright © 2015 Elsevier Masson SAS. All rights reserved.

  1. Fast Gene Ontology based clustering for microarray experiments

    Directory of Open Access Journals (Sweden)

    Ovaska Kristian

    2008-11-01

    Full Text Available Abstract Background Analysis of a microarray experiment often results in a list of hundreds of disease-associated genes. In order to suggest common biological processes and functions for these genes, Gene Ontology annotations with statistical testing are widely used. However, these analyses can produce a very large number of significantly altered biological processes. Thus, it is often challenging to interpret GO results and identify novel testable biological hypotheses. Results We present fast software for advanced gene annotation using semantic similarity for Gene Ontology terms combined with clustering and heat map visualisation. The methodology allows rapid identification of genes sharing the same Gene Ontology cluster. Conclusion Our R based semantic similarity open-source package has a speed advantage of over 2000-fold compared to existing implementations. From the resulting hierarchical clustering dendrogram genes sharing a GO term can be identified, and their differences in the gene expression patterns can be seen from the heat map. These methods facilitate advanced annotation of genes resulting from data analysis.

  2. Microarray expression analysis of genes involved in innate immune memory in peritoneal macrophages

    Directory of Open Access Journals (Sweden)

    Keisuke Yoshida

    2016-03-01

    Full Text Available Immunological memory has been believed to be a feature of the adaptive immune system for long period, but recent reports suggest that the innate immune system also exhibits memory-like reaction. Although evidence of innate immune memory is accumulating, no in vivo experimental data has clearly implicated a molecular mechanism, or even a cell-type, for this phenomenon. In this study of data deposited into Gene Expression Omnibus (GEO under GSE71111, we analyzed the expression profile of peritoneal macrophages isolated from mice pre-administrated with toll-like receptor (TLR ligands, mimicking pathogen infection. In these macrophages, increased expression of a group of innate immunity-related genes was sustained over a long period of time, and these genes overlapped with ATF7-regulated genes. We conclude that ATF7 plays an important role in innate immune memory in macrophages. Keywords: Macrophage, ATF7, Innate immune memory, Microarray

  3. A high-density transcript linkage map with 1,845 expressed genes positioned by microarray-based Single Feature Polymorphisms (SFP) in Eucalyptus

    Science.gov (United States)

    2011-01-01

    Background Technological advances are progressively increasing the application of genomics to a wider array of economically and ecologically important species. High-density maps enriched for transcribed genes facilitate the discovery of connections between genes and phenotypes. We report the construction of a high-density linkage map of expressed genes for the heterozygous genome of Eucalyptus using Single Feature Polymorphism (SFP) markers. Results SFP discovery and mapping was achieved using pseudo-testcross screening and selective mapping to simultaneously optimize linkage mapping and microarray costs. SFP genotyping was carried out by hybridizing complementary RNA prepared from 4.5 year-old trees xylem to an SFP array containing 103,000 25-mer oligonucleotide probes representing 20,726 unigenes derived from a modest size expressed sequence tags collection. An SFP-mapping microarray with 43,777 selected candidate SFP probes representing 15,698 genes was subsequently designed and used to genotype SFPs in a larger subset of the segregating population drawn by selective mapping. A total of 1,845 genes were mapped, with 884 of them ordered with high likelihood support on a framework map anchored to 180 microsatellites with average density of 1.2 cM. Using more probes per unigene increased by two-fold the likelihood of detecting segregating SFPs eventually resulting in more genes mapped. In silico validation showed that 87% of the SFPs map to the expected location on the 4.5X draft sequence of the Eucalyptus grandis genome. Conclusions The Eucalyptus 1,845 gene map is the most highly enriched map for transcriptional information for any forest tree species to date. It represents a major improvement on the number of genes previously positioned on Eucalyptus maps and provides an initial glimpse at the gene space for this global tree genome. A general protocol is proposed to build high-density transcript linkage maps in less characterized plant species by SFP genotyping

  4. Protein-protein interactions: an application of Tus-Ter mediated protein microarray system.

    Science.gov (United States)

    Sitaraman, Kalavathy; Chatterjee, Deb K

    2011-01-01

    In this chapter, we present a novel, cost-effective microarray strategy that utilizes expression-ready plasmid DNAs to generate protein arrays on-demand and its use to validate protein-protein interactions. These expression plasmids were constructed in such a way so as to serve a dual purpose of synthesizing the protein of interest as well as capturing the synthesized protein. The microarray system is based on the high affinity binding of Escherichia coli "Tus" protein to "Ter," a 20 bp DNA sequence involved in the regulation of DNA replication. The protein expression is carried out in a cell-free protein synthesis system, with rabbit reticulocyte lysates, and the target proteins are detected either by labeled incorporated tag specific or by gene-specific antibodies. This microarray system has been successfully used for the detection of protein-protein interaction because both the target protein and the query protein can be transcribed and translated simultaneously in the microarray slides. The utility of this system for detecting protein-protein interaction is demonstrated by a few well-known examples: Jun/Fos, FRB/FKBP12, p53/MDM2, and CDK4/p16. In all these cases, the presence of protein complexes resulted in the localization of fluorophores at the specific sites of the immobilized target plasmids. Interestingly, during our interactions studies we also detected a previously unknown interaction between CDK2 and p16. Thus, this Tus-Ter based system of protein microarray can be used for the validation of known protein interactions as well as for identifying new protein-protein interactions. In addition, it can be used to examine and identify targets of nucleic acid-protein, ligand-receptor, enzyme-substrate, and drug-protein interactions.

  5. Endoglin (CD105) expression on microvessel endothelial cells in juvenile nasopharyngeal angiofibroma: tissue microarray analysis and association with prognostic significance.

    Science.gov (United States)

    Wang, Jing-Jing; Sun, Xi-Cai; Hu, Li; Liu, Zhuo-Fu; Yu, Hua-Peng; Li, Han; Wang, Shu-Yi; Wang, De-Hui

    2013-12-01

    The purpose of this study was to examine endoglin (CD105) expression on microvessel endothelial cells (ECs) in juvenile nasopharyngeal angiofibroma (JNA) and its relationship with recurrence. Immunohistochemistry was performed to detect CD105 expression in a tissue microarray from 70 patients with JNA. Correlation between CD105 expression on microvessel ECs and clinicopathological features, as well as tumor recurrence, were analyzed. Immunohistochemistry revealed CD105 expression on ECs but not in stroma of patients with JNA. Chi-square analysis indicated CD105-based microvessel density (MVD) was correlated with JNA recurrence (p = .013). Univariate and multivariate analyses determined that MVD was a significant predictor of time to recurrence (p = .009). The CD105-based MVD was better for predicting disease recurrence (AUROC: 0.673; p = .036) than other clinicopathological features. MVD is a useful predictor for poor prognosis of patients with JNA after curative resection. Angiogenesis, which may play an important role in the occurrence and development of JNA, is therefore a potential therapeutic target for JNA. Copyright © 2013 Wiley Periodicals, Inc., A Wiley Company.

  6. In silico design and performance of peptide microarrays for breast cancer tumour-auto-antibody testing

    Directory of Open Access Journals (Sweden)

    Andreas Weinhäusel

    2012-06-01

    Full Text Available The simplicity and potential of minimally invasive testing using sera from patients makes auto-antibody based biomarkers a very promising tool for use in cancer diagnostics. Protein microarrays have been used for the identification of such auto-antibody signatures. Because high throughput protein expression and purification is laborious, synthetic peptides might be a good alternative for microarray generation and multiplexed analyses. In this study, we designed 1185 antigenic peptides, deduced from proteins expressed by 642 cDNA expression clones found to be sero-reactive in both breast tumour patients and controls. The sero-reactive proteins and the corresponding peptides were used for the production of protein and peptide microarrays. Serum samples from females with benign and malignant breast tumours and healthy control sera (n=16 per group were then analysed. Correct classification of the serum samples on peptide microarrays were 78% for discrimination of ‘malignant versus healthy controls’, 72% for ‘benign versus malignant’ and 94% for ‘benign versus controls’. On protein arrays, correct classification for these contrasts was 69%, 59% and 59%, respectively. The over-representation analysis of the classifiers derived from class prediction showed enrichment of genes associated with ribosomes, spliceosomes, endocytosis and the pentose phosphate pathway. Sequence analyses of the peptides with the highest sero-reactivity demonstrated enrichment of the zinc-finger domain. Peptides’ sero-reactivities were found negatively correlated with hydrophobicity and positively correlated with positive charge, high inter-residue protein contact energies and a secondary structure propensity bias. This study hints at the possibility of using in silico designed antigenic peptide microarrays as an alternative to protein microarrays for the improvement of tumour auto-antibody based diagnostics.

  7. Mining microarray datasets in nutrition: expression of the GPR120 (n-3 fatty acid receptor/sensor) gene is down-regulated in human adipocytes by macrophage secretions.

    Science.gov (United States)

    Trayhurn, Paul; Denyer, Gareth

    2012-01-01

    Microarray datasets are a rich source of information in nutritional investigation. Targeted mining of microarray data following initial, non-biased bioinformatic analysis can provide key insight into specific genes and metabolic processes of interest. Microarrays from human adipocytes were examined to explore the effects of macrophage secretions on the expression of the G-protein-coupled receptor (GPR) genes that encode fatty acid receptors/sensors. Exposure of the adipocytes to macrophage-conditioned medium for 4 or 24 h had no effect on GPR40 and GPR43 expression, but there was a marked stimulation of GPR84 expression (receptor for medium-chain fatty acids), the mRNA level increasing 13·5-fold at 24 h relative to unconditioned medium. Importantly, expression of GPR120, which encodes an n-3 PUFA receptor/sensor, was strongly inhibited by the conditioned medium (15-fold decrease in mRNA at 24 h). Macrophage secretions have major effects on the expression of fatty acid receptor/sensor genes in human adipocytes, which may lead to an augmentation of the inflammatory response in adipose tissue in obesity.

  8. A Combinatory Approach for Selecting Prognostic Genes in Microarray Studies of Tumour Survivals

    Directory of Open Access Journals (Sweden)

    Qihua Tan

    2009-01-01

    Full Text Available Different from significant gene expression analysis which looks for genes that are differentially regulated, feature selection in the microarray-based prognostic gene expression analysis aims at finding a subset of marker genes that are not only differentially expressed but also informative for prediction. Unfortunately feature selection in literature of microarray study is predominated by the simple heuristic univariate gene filter paradigm that selects differentially expressed genes according to their statistical significances. We introduce a combinatory feature selection strategy that integrates differential gene expression analysis with the Gram-Schmidt process to identify prognostic genes that are both statistically significant and highly informative for predicting tumour survival outcomes. Empirical application to leukemia and ovarian cancer survival data through-within- and cross-study validations shows that the feature space can be largely reduced while achieving improved testing performances.

  9. The IronChip evaluation package: a package of perl modules for robust analysis of custom microarrays

    Directory of Open Access Journals (Sweden)

    Brazma Alvis

    2010-03-01

    Full Text Available Abstract Background Gene expression studies greatly contribute to our understanding of complex relationships in gene regulatory networks. However, the complexity of array design, production and manipulations are limiting factors, affecting data quality. The use of customized DNA microarrays improves overall data quality in many situations, however, only if for these specifically designed microarrays analysis tools are available. Results The IronChip Evaluation Package (ICEP is a collection of Perl utilities and an easy to use data evaluation pipeline for the analysis of microarray data with a focus on data quality of custom-designed microarrays. The package has been developed for the statistical and bioinformatical analysis of the custom cDNA microarray IronChip but can be easily adapted for other cDNA or oligonucleotide-based designed microarray platforms. ICEP uses decision tree-based algorithms to assign quality flags and performs robust analysis based on chip design properties regarding multiple repetitions, ratio cut-off, background and negative controls. Conclusions ICEP is a stand-alone Windows application to obtain optimal data quality from custom-designed microarrays and is freely available here (see "Additional Files" section and at: http://www.alice-dsl.net/evgeniy.vainshtein/ICEP/

  10. Serious limitations of the QTL/Microarray approach for QTL gene discovery

    Directory of Open Access Journals (Sweden)

    Warden Craig H

    2010-07-01

    Full Text Available Abstract Background It has been proposed that the use of gene expression microarrays in nonrecombinant parental or congenic strains can accelerate the process of isolating individual genes underlying quantitative trait loci (QTL. However, the effectiveness of this approach has not been assessed. Results Thirty-seven studies that have implemented the QTL/microarray approach in rodents were reviewed. About 30% of studies showed enrichment for QTL candidates, mostly in comparisons between congenic and background strains. Three studies led to the identification of an underlying QTL gene. To complement the literature results, a microarray experiment was performed using three mouse congenic strains isolating the effects of at least 25 biometric QTL. Results show that genes in the congenic donor regions were preferentially selected. However, within donor regions, the distribution of differentially expressed genes was homogeneous once gene density was accounted for. Genes within identical-by-descent (IBD regions were less likely to be differentially expressed in chromosome 2, but not in chromosomes 11 and 17. Furthermore, expression of QTL regulated in cis (cis eQTL showed higher expression in the background genotype, which was partially explained by the presence of single nucleotide polymorphisms (SNP. Conclusions The literature shows limited successes from the QTL/microarray approach to identify QTL genes. Our own results from microarray profiling of three congenic strains revealed a strong tendency to select cis-eQTL over trans-eQTL. IBD regions had little effect on rate of differential expression, and we provide several reasons why IBD should not be used to discard eQTL candidates. In addition, mismatch probes produced false cis-eQTL that could not be completely removed with the current strains genotypes and low probe density microarrays. The reviewed studies did not account for lack of coverage from the platforms used and therefore removed genes

  11. Gene expression of panaxydol-treated human melanoma cells using radioactive cDNA microarrays

    International Nuclear Information System (INIS)

    Cho, Joong Youn; Yu, Su Jin; Soh, Jeong Won; Kim, Meyoung Kon

    2001-01-01

    Polyacetylenic alcohols derived from Panax ginseng have been studied to be an anticancer reagent previously. One of the Panax ginseng polyacetylenic alcohols, i.e., panaxydol, has been studied to possess an antiproliferative effect on human melanoma cell line (SK-MEL-1). In ths study, radioactive cDNA microarrays enabled an efficient approach to analyze the pattern of gene expression (3.194 genes in a total) simultaneously. The bioinformatics selection of human cDNAs, which is specifically designed for immunology, apoptosis and signal transduction, were arrayed on nylon membranes. Using with 33 P labeled probes, this method provided highly sensitive gene expression profiles of our interest including apoptosis, cell proliferation, cell cycle, and signal transduction. Gene expression profiles were also classified into several categories in accordance with the duration of panaxydol treatment. Consequently, the gene profiles of our interest were significantly up (199 genes, > 2.0 of Z-ratio) or down-(196 genes, < 2.0 of Z-ratio) regulated in panaxydol-treated human melanoma cells

  12. Gene expression of panaxydol-treated human melanoma cells using radioactive cDNA microarrays

    Energy Technology Data Exchange (ETDEWEB)

    Cho, Joong Youn; Yu, Su Jin; Soh, Jeong Won; Kim, Meyoung Kon [College of Medicine, Korea Univ., Seoul (Korea, Republic of)

    2001-07-01

    Polyacetylenic alcohols derived from Panax ginseng have been studied to be an anticancer reagent previously. One of the Panax ginseng polyacetylenic alcohols, i.e., panaxydol, has been studied to possess an antiproliferative effect on human melanoma cell line (SK-MEL-1). In ths study, radioactive cDNA microarrays enabled an efficient approach to analyze the pattern of gene expression (3.194 genes in a total) simultaneously. The bioinformatics selection of human cDNAs, which is specifically designed for immunology, apoptosis and signal transduction, were arrayed on nylon membranes. Using with {sup 33}P labeled probes, this method provided highly sensitive gene expression profiles of our interest including apoptosis, cell proliferation, cell cycle, and signal transduction. Gene expression profiles were also classified into several categories in accordance with the duration of panaxydol treatment. Consequently, the gene profiles of our interest were significantly up (199 genes, > 2.0 of Z-ratio) or down-(196 genes, < 2.0 of Z-ratio) regulated in panaxydol-treated human melanoma cells.

  13. MicroRNA expression in melanocytic nevi: the usefulness of formalin-fixed, paraffin-embedded material for miRNA microarray profiling.

    Science.gov (United States)

    Glud, Martin; Klausen, Mikkel; Gniadecki, Robert; Rossing, Maria; Hastrup, Nina; Nielsen, Finn C; Drzewiecki, Krzysztof T

    2009-05-01

    MicroRNAs (miRNAs) are small, noncoding RNA molecules that regulate cellular differentiation, proliferation, and apoptosis. MiRNAs are expressed in a developmentally regulated and tissue-specific manner. Aberrant expression may contribute to pathological processes such as cancer, and miRNA may therefore serve as biomarkers that may be useful in a clinical environment for diagnosis of various diseases. Most miRNA profiling studies have used fresh tissue samples. However, in some types of cancer, including malignant melanoma, fresh material is difficult to obtain from primary tumors, and most surgical specimens are formalin fixed and paraffin embedded (FFPE). To explore whether FFPE material would be suitable for miRNA profiling in melanocytic lesions, we compared miRNA expression patterns in FFPE versus fresh frozen samples, obtained from 15 human melanocytic nevi. Out of microarray data, we identified 84 miRNAs that were expressed in both types of samples and represented an miRNA profile of melanocytic nevi. Our results showed a high correlation in miRNA expression (Spearman r-value of 0.80) between paired FFPE and fresh frozen material. The data were further validated by quantitative RT-PCR. In conclusion, FFPE specimens of melanocytic lesions are suitable as a source for miRNA microarray profiling.

  14. Identification of Differentially Expressed IGFBP5-Related Genes in Breast Cancer Tumor Tissues Using cDNA Microarray Experiments.

    Science.gov (United States)

    Akkiprik, Mustafa; Peker, İrem; Özmen, Tolga; Amuran, Gökçe Güllü; Güllüoğlu, Bahadır M; Kaya, Handan; Özer, Ayşe

    2015-11-10

    IGFBP5 is an important regulatory protein in breast cancer progression. We tried to identify differentially expressed genes (DEGs) between breast tumor tissues with IGFBP5 overexpression and their adjacent normal tissues. In this study, thirty-eight breast cancer and adjacent normal breast tissue samples were used to determine IGFBP5 expression by qPCR. cDNA microarrays were applied to the highest IGFBP5 overexpressed tumor samples compared to their adjacent normal breast tissue. Microarray analysis revealed that a total of 186 genes were differentially expressed in breast cancer compared with normal breast tissues. Of the 186 genes, 169 genes were downregulated and 17 genes were upregulated in the tumor samples. KEGG pathway analyses showed that protein digestion and absorption, focal adhesion, salivary secretion, drug metabolism-cytochrome P450, and phenylalanine metabolism pathways are involved. Among these DEGs, the prominent top two genes (MMP11 and COL1A1) which potentially correlated with IGFBP5 were selected for validation using real time RT-qPCR. Only COL1A1 expression showed a consistent upregulation with IGFBP5 expression and COL1A1 and MMP11 were significantly positively correlated. We concluded that the discovery of coordinately expressed genes related with IGFBP5 might contribute to understanding of the molecular mechanism of the function of IGFBP5 in breast cancer. Further functional studies on DEGs and association with IGFBP5 may identify novel biomarkers for clinical applications in breast cancer.

  15. Parallel scan hyperspectral fluorescence imaging system and biomedical application for microarrays

    International Nuclear Information System (INIS)

    Liu Zhiyi; Ma Suihua; Liu Le; Guo Jihua; He Yonghong; Ji Yanhong

    2011-01-01

    Microarray research offers great potential for analysis of gene expression profile and leads to greatly improved experimental throughput. A number of instruments have been reported for microarray detection, such as chemiluminescence, surface plasmon resonance, and fluorescence markers. Fluorescence imaging is popular for the readout of microarrays. In this paper we develop a quasi-confocal, multichannel parallel scan hyperspectral fluorescence imaging system for microarray research. Hyperspectral imaging records the entire emission spectrum for every voxel within the imaged area in contrast to recording only fluorescence intensities of filter-based scanners. Coupled with data analysis, the recorded spectral information allows for quantitative identification of the contributions of multiple, spectrally overlapping fluorescent dyes and elimination of unwanted artifacts. The mechanism of quasi-confocal imaging provides a high signal-to-noise ratio, and parallel scan makes this approach a high throughput technique for microarray analysis. This system is improved with a specifically designed spectrometer which can offer a spectral resolution of 0.2 nm, and operates with spatial resolutions ranging from 2 to 30 μm . Finally, the application of the system is demonstrated by reading out microarrays for identification of bacteria.

  16. A microarray analysis of the rice transcriptome and its comparison to Arabidopsis

    DEFF Research Database (Denmark)

    Ma, Ligeng; Chen, Chen; Liu, Xigang

    2005-01-01

    Arabidopsis and rice are the only two model plants whose finished phase genome sequence has been completed. Here we report the construction of an oligomer microarray based on the presently known and predicted gene models in the rice genome. This microarray was used to analyze the transcriptional...... with similar genome-wide surveys of the Arabidopsis transcriptome, our results indicate that similar proportions of the two genomes are expressed in their corresponding organ types. A large percentage of the rice gene models that lack significant Arabidopsis homologs are expressed. Furthermore, the expression...... patterns of rice and Arabidopsis best-matched homologous genes in distinct functional groups indicate dramatic differences in their degree of conservation between the two species. Thus, this initial comparative analysis reveals some basic similarities and differences between the Arabidopsis and rice...

  17. Performance comparison of two microarray platforms to assess differential gene expression in human monocyte and macrophage cells

    Directory of Open Access Journals (Sweden)

    Montalescot Gilles

    2008-06-01

    Full Text Available Abstract Background In this study we assessed the respective ability of Affymetrix and Illumina microarray methodologies to answer a relevant biological question, namely the change in gene expression between resting monocytes and macrophages derived from these monocytes. Five RNA samples for each type of cell were hybridized to the two platforms in parallel. In addition, a reference list of differentially expressed genes (DEG was generated from a larger number of hybridizations (mRNA from 86 individuals using the RNG/MRC two-color platform. Results Our results show an important overlap of the Illumina and Affymetrix DEG lists. In addition, more than 70% of the genes in these lists were also present in the reference list. Overall the two platforms had very similar performance in terms of biological significance, evaluated by the presence in the DEG lists of an excess of genes belonging to Gene Ontology (GO categories relevant for the biology of monocytes and macrophages. Our results support the conclusion of the MicroArray Quality Control (MAQC project that the criteria used to constitute the DEG lists strongly influence the degree of concordance among platforms. However the importance of prioritizing genes by magnitude of effect (fold change rather than statistical significance (p-value to enhance cross-platform reproducibility recommended by the MAQC authors was not supported by our data. Conclusion Functional analysis based on GO enrichment demonstrates that the 2 compared technologies delivered very similar results and identified most of the relevant GO categories enriched in the reference list.

  18. Preparation of oligonucleotide microarray for radiation-associated gene expression detection and its application in lung cancer cell lines

    International Nuclear Information System (INIS)

    Guo Wanfeng; Lin Ruxian; Huang Jian; Guo Guozhen; Wang Shengqi

    2005-01-01

    Objective: The response of tumor cell to radiation is accompanied by complex change in patterns of gene expression. It is highly probable that a better understanding of molecular and genetic changes can help to sensitize the radioresistant tumor cells. Methods: Oligonucleotide microarray provides a powerful tool for high-throughput identifying a wider range of genes involved in the radioresistance. Therefore, the authors designed one oligonucleotide microarray according to the biological effect of IR. By using different radiosensitive lung cancer cell lines, the authors identified genes showing altered expression in lung cancer cell lines. To provide independent confirmation of microarray data, semi-quantitative RT-PCR was performed on a selection of genes. Results: In radioresistant A549 cell lines, a total of 18 genes were selected as having significant fold-changes compared to NCI-H446, 8 genes were up-regulated and 10 genes were down-regulated. Subsequently, A549 and NCI-H446 cells were delivered by ionizing radiation. In A549 cell line, we found 22 (19 up-regulated and 3 down-regulated) and 26 (8 up-regulated and 18 down-regulated) differentially expressed genes at 6h and 24h after ionizing radiation. In NCI-H446 cell line, we identified 17 (9 up-regulated and 8 down-regulated) and 18 (6 up-regulated and 12 down-regulated) differentially expressed genes at 6 h and 24 h after ionizing radiation. The authors tested seven genes (MDM2, p53, XRCC5, Bcl-2, PIM2, NFKBIA and Cyclin B1) for RT-PCR, and found that the results were in good agreement with those from the microarray data except for NFKBIA gene, even though the value for each mRNA level might be different between the two measurements. In present study, the authors identified some genes with cell proliferation and anti-apoptosis, such as MdM2, BCL-2, PKCz and PIM2 expression levels increased in A549 cells and decreased in NCI-H446 cells after radiation, and other genes with DNA repair, such as XRCC5, ERCC5

  19. Microarray profiling and co-expression network analysis of circulating lncRNAs and mRNAs associated with major depressive disorder.

    Directory of Open Access Journals (Sweden)

    Zhifen Liu

    Full Text Available LncRNAs, which represent one of the most highly expressed classes of ncRNAs in the brain, are becoming increasingly interesting with regard to brain functions and disorders. However, changes in the expression of regulatory lncRNAs in Major Depressive Disorder (MDD have not yet been reported. Using microarrays, we profiled the expression of 34834 lncRNAs and 39224 mRNAs in peripheral blood sampled from MDD patients as well as demographically-matched controls. Among these, we found that 2007 lncRNAs and 1667 mRNAs were differentially expressed, 17 of which were documented as depression-related gene in previous studies. Gene Ontology (GO and pathway analyses indicated that the biological functions of differentially expressed mRNAs were related to fundamental metabolic processes and neurodevelopment diseases. To investigate the potential regulatory roles of the differentially expressed lncRNAs on the mRNAs, we also constructed co-expression networks composed of the lncRNAs and mRNAs, which shows significant correlated patterns of expression. In the MDD-derived network, there were a greater number of nodes and connections than that in the control-derived network. The lncRNAs located at chr10:874695-874794, chr10:75873456-75873642, and chr3:47048304-47048512 may be important factors regulating the expression of mRNAs as they have previously been reported associations with MDD. This study is the first to explore genome-wide lncRNA expression and co-expression with mRNA patterns in MDD using microarray technology. We identified circulating lncRNAs that are aberrantly expressed in MDD and the results suggest that lncRNAs may contribute to the molecular pathogenesis of MDD.

  20. GeneRank: Using search engine technology for the analysis of microarray experiments

    Directory of Open Access Journals (Sweden)

    Breitling Rainer

    2005-09-01

    Full Text Available Abstract Background Interpretation of simple microarray experiments is usually based on the fold-change of gene expression between a reference and a "treated" sample where the treatment can be of many types from drug exposure to genetic variation. Interpretation of the results usually combines lists of differentially expressed genes with previous knowledge about their biological function. Here we evaluate a method – based on the PageRank algorithm employed by the popular search engine Google – that tries to automate some of this procedure to generate prioritized gene lists by exploiting biological background information. Results GeneRank is an intuitive modification of PageRank that maintains many of its mathematical properties. It combines gene expression information with a network structure derived from gene annotations (gene ontologies or expression profile correlations. Using both simulated and real data we find that the algorithm offers an improved ranking of genes compared to pure expression change rankings. Conclusion Our modification of the PageRank algorithm provides an alternative method of evaluating microarray experimental results which combines prior knowledge about the underlying network. GeneRank offers an improvement compared to assessing the importance of a gene based on its experimentally observed fold-change alone and may be used as a basis for further analytical developments.

  1. GeneRank: using search engine technology for the analysis of microarray experiments.

    Science.gov (United States)

    Morrison, Julie L; Breitling, Rainer; Higham, Desmond J; Gilbert, David R

    2005-09-21

    Interpretation of simple microarray experiments is usually based on the fold-change of gene expression between a reference and a "treated" sample where the treatment can be of many types from drug exposure to genetic variation. Interpretation of the results usually combines lists of differentially expressed genes with previous knowledge about their biological function. Here we evaluate a method--based on the PageRank algorithm employed by the popular search engine Google--that tries to automate some of this procedure to generate prioritized gene lists by exploiting biological background information. GeneRank is an intuitive modification of PageRank that maintains many of its mathematical properties. It combines gene expression information with a network structure derived from gene annotations (gene ontologies) or expression profile correlations. Using both simulated and real data we find that the algorithm offers an improved ranking of genes compared to pure expression change rankings. Our modification of the PageRank algorithm provides an alternative method of evaluating microarray experimental results which combines prior knowledge about the underlying network. GeneRank offers an improvement compared to assessing the importance of a gene based on its experimentally observed fold-change alone and may be used as a basis for further analytical developments.

  2. A Model-Based Joint Identification of Differentially Expressed Genes and Phenotype-Associated Genes

    Science.gov (United States)

    Seo, Minseok; Shin, Su-kyung; Kwon, Eun-Young; Kim, Sung-Eun; Bae, Yun-Jung; Lee, Seungyeoun; Sung, Mi-Kyung; Choi, Myung-Sook; Park, Taesung

    2016-01-01

    Over the last decade, many analytical methods and tools have been developed for microarray data. The detection of differentially expressed genes (DEGs) among different treatment groups is often a primary purpose of microarray data analysis. In addition, association studies investigating the relationship between genes and a phenotype of interest such as survival time are also popular in microarray data analysis. Phenotype association analysis provides a list of phenotype-associated genes (PAGs). However, it is sometimes necessary to identify genes that are both DEGs and PAGs. We consider the joint identification of DEGs and PAGs in microarray data analyses. The first approach we used was a naïve approach that detects DEGs and PAGs separately and then identifies the genes in an intersection of the list of PAGs and DEGs. The second approach we considered was a hierarchical approach that detects DEGs first and then chooses PAGs from among the DEGs or vice versa. In this study, we propose a new model-based approach for the joint identification of DEGs and PAGs. Unlike the previous two-step approaches, the proposed method identifies genes simultaneously that are DEGs and PAGs. This method uses standard regression models but adopts different null hypothesis from ordinary regression models, which allows us to perform joint identification in one-step. The proposed model-based methods were evaluated using experimental data and simulation studies. The proposed methods were used to analyze a microarray experiment in which the main interest lies in detecting genes that are both DEGs and PAGs, where DEGs are identified between two diet groups and PAGs are associated with four phenotypes reflecting the expression of leptin, adiponectin, insulin-like growth factor 1, and insulin. Model-based approaches provided a larger number of genes, which are both DEGs and PAGs, than other methods. Simulation studies showed that they have more power than other methods. Through analysis of

  3. A Model-Based Joint Identification of Differentially Expressed Genes and Phenotype-Associated Genes.

    Directory of Open Access Journals (Sweden)

    Samuel Sunghwan Cho

    Full Text Available Over the last decade, many analytical methods and tools have been developed for microarray data. The detection of differentially expressed genes (DEGs among different treatment groups is often a primary purpose of microarray data analysis. In addition, association studies investigating the relationship between genes and a phenotype of interest such as survival time are also popular in microarray data analysis. Phenotype association analysis provides a list of phenotype-associated genes (PAGs. However, it is sometimes necessary to identify genes that are both DEGs and PAGs. We consider the joint identification of DEGs and PAGs in microarray data analyses. The first approach we used was a naïve approach that detects DEGs and PAGs separately and then identifies the genes in an intersection of the list of PAGs and DEGs. The second approach we considered was a hierarchical approach that detects DEGs first and then chooses PAGs from among the DEGs or vice versa. In this study, we propose a new model-based approach for the joint identification of DEGs and PAGs. Unlike the previous two-step approaches, the proposed method identifies genes simultaneously that are DEGs and PAGs. This method uses standard regression models but adopts different null hypothesis from ordinary regression models, which allows us to perform joint identification in one-step. The proposed model-based methods were evaluated using experimental data and simulation studies. The proposed methods were used to analyze a microarray experiment in which the main interest lies in detecting genes that are both DEGs and PAGs, where DEGs are identified between two diet groups and PAGs are associated with four phenotypes reflecting the expression of leptin, adiponectin, insulin-like growth factor 1, and insulin. Model-based approaches provided a larger number of genes, which are both DEGs and PAGs, than other methods. Simulation studies showed that they have more power than other methods

  4. Transcriptomic identification of candidate genes involved in sunflower responses to chilling and salt stresses based on cDNA microarray analysis

    Directory of Open Access Journals (Sweden)

    Paniego Norma

    2008-01-01

    Full Text Available Abstract Background Considering that sunflower production is expanding to arid regions, tolerance to abiotic stresses as drought, low temperatures and salinity arises as one of the main constrains nowadays. Differential organ-specific sunflower ESTs (expressed sequence tags were previously generated by a subtractive hybridization method that included a considerable number of putative abiotic stress associated sequences. The objective of this work is to analyze concerted gene expression profiles of organ-specific ESTs by fluorescence microarray assay, in response to high sodium chloride concentration and chilling treatments with the aim to identify and follow up candidate genes for early responses to abiotic stress in sunflower. Results Abiotic-related expressed genes were the target of this characterization through a gene expression analysis using an organ-specific cDNA fluorescence microarray approach in response to high salinity and low temperatures. The experiment included three independent replicates from leaf samples. We analyzed 317 unigenes previously isolated from differential organ-specific cDNA libraries from leaf, stem and flower at R1 and R4 developmental stage. A statistical analysis based on mean comparison by ANOVA and ordination by Principal Component Analysis allowed the detection of 80 candidate genes for either salinity and/or chilling stresses. Out of them, 50 genes were up or down regulated under both stresses, supporting common regulatory mechanisms and general responses to chilling and salinity. Interestingly 15 and 12 sequences were up regulated or down regulated specifically in one stress but not in the other, respectively. These genes are potentially involved in different regulatory mechanisms including transcription/translation/protein degradation/protein folding/ROS production or ROS-scavenging. Differential gene expression patterns were confirmed by qRT-PCR for 12.5% of the microarray candidate sequences. Conclusion

  5. Frequency-based time-series gene expression recomposition using PRIISM

    Directory of Open Access Journals (Sweden)

    Rosa Bruce A

    2012-06-01

    Full Text Available Abstract Background Circadian rhythm pathways influence the expression patterns of as much as 31% of the Arabidopsis genome through complicated interaction pathways, and have been found to be significantly disrupted by biotic and abiotic stress treatments, complicating treatment-response gene discovery methods due to clock pattern mismatches in the fold change-based statistics. The PRIISM (Pattern Recomposition for the Isolation of Independent Signals in Microarray data algorithm outlined in this paper is designed to separate pattern changes induced by different forces, including treatment-response pathways and circadian clock rhythm disruptions. Results Using the Fourier transform, high-resolution time-series microarray data is projected to the frequency domain. By identifying the clock frequency range from the core circadian clock genes, we separate the frequency spectrum to different sections containing treatment-frequency (representing up- or down-regulation by an adaptive treatment response, clock-frequency (representing the circadian clock-disruption response and noise-frequency components. Then, we project the components’ spectra back to the expression domain to reconstruct isolated, independent gene expression patterns representing the effects of the different influences. By applying PRIISM on a high-resolution time-series Arabidopsis microarray dataset under a cold treatment, we systematically evaluated our method using maximum fold change and principal component analyses. The results of this study showed that the ranked treatment-frequency fold change results produce fewer false positives than the original methodology, and the 26-hour timepoint in our dataset was the best statistic for distinguishing the most known cold-response genes. In addition, six novel cold-response genes were discovered. PRIISM also provides gene expression data which represents only circadian clock influences, and may be useful for circadian clock studies

  6. Characterization of adjacent breast tumors using oligonucleotide microarrays

    International Nuclear Information System (INIS)

    Unger, Meredith A; Rishi, Mazhar; Clemmer, Virginia B; Hartman, Jennifer L; Keiper, Elizabeth A; Greshock, Joel D; Chodosh, Lewis A; Liebman, Michael N; Weber, Barbara L

    2001-01-01

    Current methodology often cannot distinguish second primary breast cancers from multifocal disease, a potentially important distinction for clinical management. In the present study we evaluated the use of oligonucleotide-based microarray analysis in determining the clonality of tumors by comparing gene expression profiles. Total RNA was extracted from two tumors with no apparent physical connection that were located in the right breast of an 87-year-old woman diagnosed with invasive ductal carcinoma (IDC). The RNA was hybridized to the Affymetrix Human Genome U95A Gene Chip ® (12,500 known human genes) and analyzed using the Gene Chip Analysis Suite ® 3.3 (Affymetrix, Inc, Santa Clara, CA, USA) and JMPIN ® 3.2.6 (SAS Institute, Inc, Cary, NC, USA). Gene expression profiles of tumors from five additional patients were compared in order to evaluate the heterogeneity in gene expression between tumors with similar clinical characteristics. The adjacent breast tumors had a pairwise correlation coefficient of 0.987, and were essentially indistinguishable by microarray analysis. Analysis of gene expression profiles from different individuals, however, generated a pairwise correlation coefficient of 0.710. Transcriptional profiling may be a useful diagnostic tool for determining tumor clonality and heterogeneity, and may ultimately impact on therapeutic decision making

  7. Current Knowledge on Microarray Technology - An Overview

    African Journals Online (AJOL)

    Erah

    This paper reviews basics and updates of each microarray technology and serves to .... through protein microarrays. Protein microarrays also known as protein chips are nothing but grids that ... conditioned media, patient sera, plasma and urine. Clontech ... based antibody arrays) is similar to membrane-based antibody ...

  8. A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability

    Directory of Open Access Journals (Sweden)

    Reinders Marcel JT

    2009-11-01

    Full Text Available Abstract Background Large discrepancies in signature composition and outcome concordance have been observed between different microarray breast cancer expression profiling studies. This is often ascribed to differences in array platform as well as biological variability. We conjecture that other reasons for the observed discrepancies are the measurement error associated with each feature and the choice of preprocessing method. Microarray data are known to be subject to technical variation and the confidence intervals around individual point estimates of expression levels can be wide. Furthermore, the estimated expression values also vary depending on the selected preprocessing scheme. In microarray breast cancer classification studies, however, these two forms of feature variability are almost always ignored and hence their exact role is unclear. Results We have performed a comprehensive sensitivity analysis of microarray breast cancer classification under the two types of feature variability mentioned above. We used data from six state of the art preprocessing methods, using a compendium consisting of eight diferent datasets, involving 1131 hybridizations, containing data from both one and two-color array technology. For a wide range of classifiers, we performed a joint study on performance, concordance and stability. In the stability analysis we explicitly tested classifiers for their noise tolerance by using perturbed expression profiles that are based on uncertainty information directly related to the preprocessing methods. Our results indicate that signature composition is strongly influenced by feature variability, even if the array platform and the stratification of patient samples are identical. In addition, we show that there is often a high level of discordance between individual class assignments for signatures constructed on data coming from different preprocessing schemes, even if the actual signature composition is identical

  9. Development and evaluation of a high-throughput, low-cost genotyping platform based on oligonucleotide microarrays in rice

    Directory of Open Access Journals (Sweden)

    Liu Bin

    2008-05-01

    Full Text Available Abstract Background We report the development of a microarray platform for rapid and cost-effective genetic mapping, and its evaluation using rice as a model. In contrast to methods employing whole-genome tiling microarrays for genotyping, our method is based on low-cost spotted microarray production, focusing only on known polymorphic features. Results We have produced a genotyping microarray for rice, comprising 880 single feature polymorphism (SFP elements derived from insertions/deletions identified by aligning genomic sequences of the japonica cultivar Nipponbare and the indica cultivar 93-11. The SFPs were experimentally verified by hybridization with labeled genomic DNA prepared from the two cultivars. Using the genotyping microarrays, we found high levels of polymorphism across diverse rice accessions, and were able to classify all five subpopulations of rice with high bootstrap support. The microarrays were used for mapping of a gene conferring resistance to Magnaporthe grisea, the causative organism of rice blast disease, by quantitative genotyping of samples from a recombinant inbred line population pooled by phenotype. Conclusion We anticipate this microarray-based genotyping platform, based on its low cost-per-sample, to be particularly useful in applications requiring whole-genome molecular marker coverage across large numbers of individuals.

  10. Gene Expression Commons: an open platform for absolute gene expression profiling.

    Directory of Open Access Journals (Sweden)

    Jun Seita

    Full Text Available Gene expression profiling using microarrays has been limited to comparisons of gene expression between small numbers of samples within individual experiments. However, the unknown and variable sensitivities of each probeset have rendered the absolute expression of any given gene nearly impossible to estimate. We have overcome this limitation by using a very large number (>10,000 of varied microarray data as a common reference, so that statistical attributes of each probeset, such as the dynamic range and threshold between low and high expression, can be reliably discovered through meta-analysis. This strategy is implemented in a web-based platform named "Gene Expression Commons" (https://gexc.stanford.edu/ which contains data of 39 distinct highly purified mouse hematopoietic stem/progenitor/differentiated cell populations covering almost the entire hematopoietic system. Since the Gene Expression Commons is designed as an open platform, investigators can explore the expression level of any gene, search by expression patterns of interest, submit their own microarray data, and design their own working models representing biological relationship among samples.

  11. Factorial microarray analysis of zebra mussel (Dreissena polymorpha: Dreissenidae, Bivalvia adhesion

    Directory of Open Access Journals (Sweden)

    Faisal Mohamed

    2010-05-01

    Full Text Available Abstract Background The zebra mussel (Dreissena polymorpha has been well known for its expertise in attaching to substances under the water. Studies in past decades on this underwater adhesion focused on the adhesive protein isolated from the byssogenesis apparatus of the zebra mussel. However, the mechanism of the initiation, maintenance, and determination of the attachment process remains largely unknown. Results In this study, we used a zebra mussel cDNA microarray previously developed in our lab and a factorial analysis to identify the genes that were involved in response to the changes of four factors: temperature (Factor A, current velocity (Factor B, dissolved oxygen (Factor C, and byssogenesis status (Factor D. Twenty probes in the microarray were found to be modified by one of the factors. The transcription products of four selected genes, DPFP-BG20_A01, EGP-BG97/192_B06, EGP-BG13_G05, and NH-BG17_C09 were unique to the zebra mussel foot based on the results of quantitative reverse transcription PCR (qRT-PCR. The expression profiles of these four genes under the attachment and non-attachment were also confirmed by qRT-PCR and the result is accordant to that from microarray assay. The in situ hybridization with the RNA probes of two identified genes DPFP-BG20_A01 and EGP-BG97/192_B06 indicated that both of them were expressed by a type of exocrine gland cell located in the middle part of the zebra mussel foot. Conclusions The results of this study suggested that the changes of D. polymorpha byssogenesis status and the environmental factors can dramatically affect the expression profiles of the genes unique to the foot. It turns out that the factorial design and analysis of the microarray experiment is a reliable method to identify the influence of multiple factors on the expression profiles of the probesets in the microarray; therein it provides a powerful tool to reveal the mechanism of zebra mussel underwater attachment.

  12. Factorial microarray analysis of zebra mussel (Dreissena polymorpha: Dreissenidae, Bivalvia) adhesion.

    Science.gov (United States)

    Xu, Wei; Faisal, Mohamed

    2010-05-28

    The zebra mussel (Dreissena polymorpha) has been well known for its expertise in attaching to substances under the water. Studies in past decades on this underwater adhesion focused on the adhesive protein isolated from the byssogenesis apparatus of the zebra mussel. However, the mechanism of the initiation, maintenance, and determination of the attachment process remains largely unknown. In this study, we used a zebra mussel cDNA microarray previously developed in our lab and a factorial analysis to identify the genes that were involved in response to the changes of four factors: temperature (Factor A), current velocity (Factor B), dissolved oxygen (Factor C), and byssogenesis status (Factor D). Twenty probes in the microarray were found to be modified by one of the factors. The transcription products of four selected genes, DPFP-BG20_A01, EGP-BG97/192_B06, EGP-BG13_G05, and NH-BG17_C09 were unique to the zebra mussel foot based on the results of quantitative reverse transcription PCR (qRT-PCR). The expression profiles of these four genes under the attachment and non-attachment were also confirmed by qRT-PCR and the result is accordant to that from microarray assay. The in situ hybridization with the RNA probes of two identified genes DPFP-BG20_A01 and EGP-BG97/192_B06 indicated that both of them were expressed by a type of exocrine gland cell located in the middle part of the zebra mussel foot. The results of this study suggested that the changes of D. polymorpha byssogenesis status and the environmental factors can dramatically affect the expression profiles of the genes unique to the foot. It turns out that the factorial design and analysis of the microarray experiment is a reliable method to identify the influence of multiple factors on the expression profiles of the probesets in the microarray; therein it provides a powerful tool to reveal the mechanism of zebra mussel underwater attachment.

  13. Multiplex cDNA quantification method that facilitates the standardization of gene expression data

    Science.gov (United States)

    Gotoh, Osamu; Murakami, Yasufumi; Suyama, Akira

    2011-01-01

    Microarray-based gene expression measurement is one of the major methods for transcriptome analysis. However, current microarray data are substantially affected by microarray platforms and RNA references because of the microarray method can provide merely the relative amounts of gene expression levels. Therefore, valid comparisons of the microarray data require standardized platforms, internal and/or external controls and complicated normalizations. These requirements impose limitations on the extensive comparison of gene expression data. Here, we report an effective approach to removing the unfavorable limitations by measuring the absolute amounts of gene expression levels on common DNA microarrays. We have developed a multiplex cDNA quantification method called GEP-DEAN (Gene expression profiling by DCN-encoding-based analysis). The method was validated by using chemically synthesized DNA strands of known quantities and cDNA samples prepared from mouse liver, demonstrating that the absolute amounts of cDNA strands were successfully measured with a sensitivity of 18 zmol in a highly multiplexed manner in 7 h. PMID:21415008

  14. MicroRNA expression in melanocytic nevi: the usefulness of formalin-fixed, paraffin-embedded material for miRNA microarray profiling

    DEFF Research Database (Denmark)

    Glud, M.; Klausen, M.; Gniadecki, R.

    2009-01-01

    surgical specimens are formalin fixed and paraffin embedded (FFPE). To explore whether FFPE material would be suitable for miRNA profiling in melanocytic lesions, we compared miRNA expression patterns in FFPE versus fresh frozen samples, obtained from 15 human melanocytic nevi. Out of microarray data, we...

  15. Meta-analysis of Drosophila circadian microarray studies identifies a novel set of rhythmically expressed genes.

    Directory of Open Access Journals (Sweden)

    Kevin P Keegan

    2007-11-01

    Full Text Available Five independent groups have reported microarray studies that identify dozens of rhythmically expressed genes in the fruit fly Drosophila melanogaster. Limited overlap among the lists of discovered genes makes it difficult to determine which, if any, exhibit truly rhythmic patterns of expression. We reanalyzed data from all five reports and found two sources for the observed discrepancies, the use of different expression pattern detection algorithms and underlying variation among the datasets. To improve upon the methods originally employed, we developed a new analysis that involves compilation of all existing data, application of identical transformation and standardization procedures followed by ANOVA-based statistical prescreening, and three separate classes of post hoc analysis: cross-correlation to various cycling waveforms, autocorrelation, and a previously described fast Fourier transform-based technique. Permutation-based statistical tests were used to derive significance measures for all post hoc tests. We find application of our method, most significantly the ANOVA prescreening procedure, significantly reduces the false discovery rate relative to that observed among the results of the original five reports while maintaining desirable statistical power. We identify a set of 81 cycling transcripts previously found in one or more of the original reports as well as a novel set of 133 transcripts not found in any of the original studies. We introduce a novel analysis method that compensates for variability observed among the original five Drosophila circadian array reports. Based on the statistical fidelity of our meta-analysis results, and the results of our initial validation experiments (quantitative RT-PCR, we predict many of our newly found genes to be bona fide cyclers, and suggest that they may lead to new insights into the pathways through which clock mechanisms regulate behavioral rhythms.

  16. The Importance of Normalization on Large and Heterogeneous Microarray Datasets

    Science.gov (United States)

    DNA microarray technology is a powerful functional genomics tool increasingly used for investigating global gene expression in environmental studies. Microarrays can also be used in identifying biological networks, as they give insight on the complex gene-to-gene interactions, ne...

  17. Hybrid Feature Selection Approach Based on GRASP for Cancer Microarray Data

    Directory of Open Access Journals (Sweden)

    Arpita Nagpal

    2017-01-01

    Full Text Available Microarray data usually contain a large number of genes, but a small number of samples. Feature subset selection for microarray data aims at reducing the number of genes so that useful information can be extracted from the samples. Reducing the dimension of data sets further helps in improving the computational efficiency of the learning model. In this paper, we propose a modified algorithm based on the tabu search as local search procedures to a Greedy Randomized Adaptive Search Procedure (GRASP for high dimensional microarray data sets. The proposed Tabu based Greedy Randomized Adaptive Search Procedure algorithm is named as TGRASP. In TGRASP, a new parameter has been introduced named as Tabu Tenure and the existing parameters, NumIter and size have been modified. We observed that different parameter settings affect the quality of the optimum. The second proposed algorithm known as FFGRASP (Firefly Greedy Randomized Adaptive Search Procedure uses a firefly optimization algorithm in the local search optimzation phase of the greedy randomized adaptive search procedure (GRASP. Firefly algorithm is one of the powerful algorithms for optimization of multimodal applications. Experimental results show that the proposed TGRASP and FFGRASP algorithms are much better than existing algorithm with respect to three performance parameters viz. accuracy, run time, number of a selected subset of features. We have also compared both the approaches with a unified metric (Extended Adjusted Ratio of Ratios which has shown that TGRASP approach outperforms existing approach for six out of nine cancer microarray datasets and FFGRASP performs better on seven out of nine datasets.

  18. Carbohydrate microarrays

    DEFF Research Database (Denmark)

    Park, Sungjin; Gildersleeve, Jeffrey C; Blixt, Klas Ola

    2012-01-01

    In the last decade, carbohydrate microarrays have been core technologies for analyzing carbohydrate-mediated recognition events in a high-throughput fashion. A number of methods have been exploited for immobilizing glycans on the solid surface in a microarray format. This microarray...... of substrate specificities of glycosyltransferases. This review covers the construction of carbohydrate microarrays, detection methods of carbohydrate microarrays and their applications in biological and biomedical research....

  19. Autoregressive-model-based missing value estimation for DNA microarray time series data.

    Science.gov (United States)

    Choong, Miew Keen; Charbit, Maurice; Yan, Hong

    2009-01-01

    Missing value estimation is important in DNA microarray data analysis. A number of algorithms have been developed to solve this problem, but they have several limitations. Most existing algorithms are not able to deal with the situation where a particular time point (column) of the data is missing entirely. In this paper, we present an autoregressive-model-based missing value estimation method (ARLSimpute) that takes into account the dynamic property of microarray temporal data and the local similarity structures in the data. ARLSimpute is especially effective for the situation where a particular time point contains many missing values or where the entire time point is missing. Experiment results suggest that our proposed algorithm is an accurate missing value estimator in comparison with other imputation methods on simulated as well as real microarray time series datasets.

  20. A novel multifunctional oligonucleotide microarray for Toxoplasma gondii

    Directory of Open Access Journals (Sweden)

    Chen Feng

    2010-10-01

    Full Text Available Abstract Background Microarrays are invaluable tools for genome interrogation, SNP detection, and expression analysis, among other applications. Such broad capabilities would be of value to many pathogen research communities, although the development and use of genome-scale microarrays is often a costly undertaking. Therefore, effective methods for reducing unnecessary probes while maintaining or expanding functionality would be relevant to many investigators. Results Taking advantage of available genome sequences and annotation for Toxoplasma gondii (a pathogenic parasite responsible for illness in immunocompromised individuals and Plasmodium falciparum (a related parasite responsible for severe human malaria, we designed a single oligonucleotide microarray capable of supporting a wide range of applications at relatively low cost, including genome-wide expression profiling for Toxoplasma, and single-nucleotide polymorphism (SNP-based genotyping of both T. gondii and P. falciparum. Expression profiling of the three clonotypic lineages dominating T. gondii populations in North America and Europe provides a first comprehensive view of the parasite transcriptome, revealing that ~49% of all annotated genes are expressed in parasite tachyzoites (the acutely lytic stage responsible for pathogenesis and 26% of genes are differentially expressed among strains. A novel design utilizing few probes provided high confidence genotyping, used here to resolve recombination points in the clonal progeny of sexual crosses. Recent sequencing of additional T. gondii isolates identifies >620 K new SNPs, including ~11 K that intersect with expression profiling probes, yielding additional markers for genotyping studies, and further validating the utility of a combined expression profiling/genotyping array design. Additional applications facilitating SNP and transcript discovery, alternative statistical methods for quantifying gene expression, etc. are also pursued at

  1. Ontology-based, Tissue MicroArray oriented, image centered tissue bank

    Directory of Open Access Journals (Sweden)

    Viti Federica

    2008-04-01

    Full Text Available Abstract Background Tissue MicroArray technique is becoming increasingly important in pathology for the validation of experimental data from transcriptomic analysis. This approach produces many images which need to be properly managed, if possible with an infrastructure able to support tissue sharing between institutes. Moreover, the available frameworks oriented to Tissue MicroArray provide good storage for clinical patient, sample treatment and block construction information, but their utility is limited by the lack of data integration with biomolecular information. Results In this work we propose a Tissue MicroArray web oriented system to support researchers in managing bio-samples and, through the use of ontologies, enables tissue sharing aimed at the design of Tissue MicroArray experiments and results evaluation. Indeed, our system provides ontological description both for pre-analysis tissue images and for post-process analysis image results, which is crucial for information exchange. Moreover, working on well-defined terms it is then possible to query web resources for literature articles to integrate both pathology and bioinformatics data. Conclusions Using this system, users associate an ontology-based description to each image uploaded into the database and also integrate results with the ontological description of biosequences identified in every tissue. Moreover, it is possible to integrate the ontological description provided by the user with a full compliant gene ontology definition, enabling statistical studies about correlation between the analyzed pathology and the most commonly related biological processes.

  2. Microarray analysis of gene expression profiles of Schistosoma japonicum derived from less-susceptible host water buffalo and susceptible host goat.

    Directory of Open Access Journals (Sweden)

    Jianmei Yang

    Full Text Available BACKGROUND: Water buffalo and goats are natural hosts for S. japonicum in endemic areas of China. The susceptibility of these two hosts to schistosome infection is different, as water buffalo are less conducive to S. japonicum growth and development. To identify genes that may affect schistosome development and survival, we compared gene expression profiles of schistosomes derived from these two natural hosts using high-throughput microarray technology. RESULTS: The worm recovery rate was lower and the length and width of worms from water buffalo were smaller compared to those from goats following S. japonicum infection for 7 weeks. Besides obvious morphological difference between the schistosomes derived from the two hosts, differences were also observed by scanning and transmission electron microscopy. Microarray analysis showed differentially expressed gene patterns for parasites from the two hosts, which revealed that genes related to lipid and nucleotide metabolism, as well as protein folding, sorting, and degradation were upregulated, while others associated with signal transduction, endocrine function, development, immune function, endocytosis, and amino acid/carbohydrate/glycan metabolism were downregulated in schistosomes from water buffalo. KEGG pathway analysis deduced that the differentially expressed genes mainly involved lipid metabolism, the MAPK and ErbB signaling pathways, progesterone-mediated oocyte maturation, dorso-ventral axis formation, reproduction, and endocytosis, etc. CONCLUSION: The microarray gene analysis in schistosomes derived from water buffalo and goats provide a useful platform to disclose differences determining S. japonicum host compatibility to better understand the interplay between natural hosts and parasites, and identify schistosome target genes associated with susceptibility to screen vaccine candidates.

  3. Feature selection and classification of MAQC-II breast cancer and multiple myeloma microarray gene expression data.

    Directory of Open Access Journals (Sweden)

    Qingzhong Liu

    Full Text Available Microarray data has a high dimension of variables but available datasets usually have only a small number of samples, thereby making the study of such datasets interesting and challenging. In the task of analyzing microarray data for the purpose of, e.g., predicting gene-disease association, feature selection is very important because it provides a way to handle the high dimensionality by exploiting information redundancy induced by associations among genetic markers. Judicious feature selection in microarray data analysis can result in significant reduction of cost while maintaining or improving the classification or prediction accuracy of learning machines that are employed to sort out the datasets. In this paper, we propose a gene selection method called Recursive Feature Addition (RFA, which combines supervised learning and statistical similarity measures. We compare our method with the following gene selection methods: Support Vector Machine Recursive Feature Elimination (SVMRFE, Leave-One-Out Calculation Sequential Forward Selection (LOOCSFS, Gradient based Leave-one-out Gene Selection (GLGS. To evaluate the performance of these gene selection methods, we employ several popular learning classifiers on the MicroArray Quality Control phase II on predictive modeling (MAQC-II breast cancer dataset and the MAQC-II multiple myeloma dataset. Experimental results show that gene selection is strictly paired with learning classifier. Overall, our approach outperforms other compared methods. The biological functional analysis based on the MAQC-II breast cancer dataset convinced us to apply our method for phenotype prediction. Additionally, learning classifiers also play important roles in the classification of microarray data and our experimental results indicate that the Nearest Mean Scale Classifier (NMSC is a good choice due to its prediction reliability and its stability across the three performance measurements: Testing accuracy, MCC values, and

  4. Direct calibration of PICKY-designed microarrays

    Directory of Open Access Journals (Sweden)

    Ronald Pamela C

    2009-10-01

    Full Text Available Abstract Background Few microarrays have been quantitatively calibrated to identify optimal hybridization conditions because it is difficult to precisely determine the hybridization characteristics of a microarray using biologically variable cDNA samples. Results Using synthesized samples with known concentrations of specific oligonucleotides, a series of microarray experiments was conducted to evaluate microarrays designed by PICKY, an oligo microarray design software tool, and to test a direct microarray calibration method based on the PICKY-predicted, thermodynamically closest nontarget information. The complete set of microarray experiment results is archived in the GEO database with series accession number GSE14717. Additional data files and Perl programs described in this paper can be obtained from the website http://www.complex.iastate.edu under the PICKY Download area. Conclusion PICKY-designed microarray probes are highly reliable over a wide range of hybridization temperatures and sample concentrations. The microarray calibration method reported here allows researchers to experimentally optimize their hybridization conditions. Because this method is straightforward, uses existing microarrays and relatively inexpensive synthesized samples, it can be used by any lab that uses microarrays designed by PICKY. In addition, other microarrays can be reanalyzed by PICKY to obtain the thermodynamically closest nontarget information for calibration.

  5. A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification

    Directory of Open Access Journals (Sweden)

    Wang Lily

    2008-07-01

    Full Text Available Abstract Background Cancer diagnosis and clinical outcome prediction are among the most important emerging applications of gene expression microarray technology with several molecular signatures on their way toward clinical deployment. Use of the most accurate classification algorithms available for microarray gene expression data is a critical ingredient in order to develop the best possible molecular signatures for patient care. As suggested by a large body of literature to date, support vector machines can be considered "best of class" algorithms for classification of such data. Recent work, however, suggests that random forest classifiers may outperform support vector machines in this domain. Results In the present paper we identify methodological biases of prior work comparing random forests and support vector machines and conduct a new rigorous evaluation of the two algorithms that corrects these limitations. Our experiments use 22 diagnostic and prognostic datasets and show that support vector machines outperform random forests, often by a large margin. Our data also underlines the importance of sound research design in benchmarking and comparison of bioinformatics algorithms. Conclusion We found that both on average and in the majority of microarray datasets, random forests are outperformed by support vector machines both in the settings when no gene selection is performed and when several popular gene selection methods are used.

  6. A probabilistic framework for microarray data analysis: fundamental probability models and statistical inference.

    Science.gov (United States)

    Ogunnaike, Babatunde A; Gelmi, Claudio A; Edwards, Jeremy S

    2010-05-21

    Gene expression studies generate large quantities of data with the defining characteristic that the number of genes (whose expression profiles are to be determined) exceed the number of available replicates by several orders of magnitude. Standard spot-by-spot analysis still seeks to extract useful information for each gene on the basis of the number of available replicates, and thus plays to the weakness of microarrays. On the other hand, because of the data volume, treating the entire data set as an ensemble, and developing theoretical distributions for these ensembles provides a framework that plays instead to the strength of microarrays. We present theoretical results that under reasonable assumptions, the distribution of microarray intensities follows the Gamma model, with the biological interpretations of the model parameters emerging naturally. We subsequently establish that for each microarray data set, the fractional intensities can be represented as a mixture of Beta densities, and develop a procedure for using these results to draw statistical inference regarding differential gene expression. We illustrate the results with experimental data from gene expression studies on Deinococcus radiodurans following DNA damage using cDNA microarrays. Copyright (c) 2010 Elsevier Ltd. All rights reserved.

  7. Extended -Regular Sequence for Automated Analysis of Microarray Images

    Directory of Open Access Journals (Sweden)

    Jin Hee-Jeong

    2006-01-01

    Full Text Available Microarray study enables us to obtain hundreds of thousands of expressions of genes or genotypes at once, and it is an indispensable technology for genome research. The first step is the analysis of scanned microarray images. This is the most important procedure for obtaining biologically reliable data. Currently most microarray image processing systems require burdensome manual block/spot indexing work. Since the amount of experimental data is increasing very quickly, automated microarray image analysis software becomes important. In this paper, we propose two automated methods for analyzing microarray images. First, we propose the extended -regular sequence to index blocks and spots, which enables a novel automatic gridding procedure. Second, we provide a methodology, hierarchical metagrid alignment, to allow reliable and efficient batch processing for a set of microarray images. Experimental results show that the proposed methods are more reliable and convenient than the commercial tools.

  8. Bayesian meta-analysis models for microarray data: a comparative study

    Directory of Open Access Journals (Sweden)

    Song Joon J

    2007-03-01

    Full Text Available Abstract Background With the growing abundance of microarray data, statistical methods are increasingly needed to integrate results across studies. Two common approaches for meta-analysis of microarrays include either combining gene expression measures across studies or combining summaries such as p-values, probabilities or ranks. Here, we compare two Bayesian meta-analysis models that are analogous to these methods. Results Two Bayesian meta-analysis models for microarray data have recently been introduced. The first model combines standardized gene expression measures across studies into an overall mean, accounting for inter-study variability, while the second combines probabilities of differential expression without combining expression values. Both models produce the gene-specific posterior probability of differential expression, which is the basis for inference. Since the standardized expression integration model includes inter-study variability, it may improve accuracy of results versus the probability integration model. However, due to the small number of studies typical in microarray meta-analyses, the variability between studies is challenging to estimate. The probability integration model eliminates the need to model variability between studies, and thus its implementation is more straightforward. We found in simulations of two and five studies that combining probabilities outperformed combining standardized gene expression measures for three comparison values: the percent of true discovered genes in meta-analysis versus individual studies; the percent of true genes omitted in meta-analysis versus separate studies, and the number of true discovered genes for fixed levels of Bayesian false discovery. We identified similar results when pooling two independent studies of Bacillus subtilis. We assumed that each study was produced from the same microarray platform with only two conditions: a treatment and control, and that the data sets

  9. Microarray Analysis of microRNA Expression during Axolotl Limb Regeneration

    Science.gov (United States)

    Holman, Edna C.; Campbell, Leah J.; Hines, John; Crews, Craig M.

    2012-01-01

    Among vertebrates, salamanders stand out for their remarkable capacity to quickly regrow a myriad of tissues and organs after injury or amputation. The limb regeneration process in axolotls (Ambystoma mexicanum) has been well studied for decades at the cell-tissue level. While several developmental genes are known to be reactivated during this epimorphic process, less is known about the role of microRNAs in urodele amphibian limb regeneration. Given the compelling evidence that many microRNAs tightly regulate cell fate and morphogenetic processes through development and adulthood by modulating the expression (or re-expression) of developmental genes, we investigated the possibility that microRNA levels change during limb regeneration. Using two different microarray platforms to compare the axolotl microRNA expression between mid-bud limb regenerating blastemas and non-regenerating stump tissues, we found that miR-21 was overexpressed in mid-bud blastemas compared to stump tissue. Mature A. mexicanum (“Amex”) miR-21 was detected in axolotl RNA by Northern blot and differential expression of Amex-miR-21 in blastema versus stump was confirmed by quantitative RT-PCR. We identified the Amex Jagged1 as a putative target gene for miR-21 during salamander limb regeneration. We cloned the full length 3′UTR of Amex-Jag1, and our in vitro assays demonstrated that its single miR-21 target recognition site is functional and essential for the response of the Jagged1 gene to miR-21 levels. Our findings pave the road for advanced in vivo functional assays aimed to clarify how microRNAs such as miR-21, often linked to pathogenic cell growth, might be modulating the redeployment of developmental genes such as Jagged1 during regenerative processes. PMID:23028429

  10. Microarray analysis of microRNA expression during axolotl limb regeneration.

    Directory of Open Access Journals (Sweden)

    Edna C Holman

    Full Text Available Among vertebrates, salamanders stand out for their remarkable capacity to quickly regrow a myriad of tissues and organs after injury or amputation. The limb regeneration process in axolotls (Ambystoma mexicanum has been well studied for decades at the cell-tissue level. While several developmental genes are known to be reactivated during this epimorphic process, less is known about the role of microRNAs in urodele amphibian limb regeneration. Given the compelling evidence that many microRNAs tightly regulate cell fate and morphogenetic processes through development and adulthood by modulating the expression (or re-expression of developmental genes, we investigated the possibility that microRNA levels change during limb regeneration. Using two different microarray platforms to compare the axolotl microRNA expression between mid-bud limb regenerating blastemas and non-regenerating stump tissues, we found that miR-21 was overexpressed in mid-bud blastemas compared to stump tissue. Mature A. mexicanum ("Amex" miR-21 was detected in axolotl RNA by Northern blot and differential expression of Amex-miR-21 in blastema versus stump was confirmed by quantitative RT-PCR. We identified the Amex Jagged1 as a putative target gene for miR-21 during salamander limb regeneration. We cloned the full length 3'UTR of Amex-Jag1, and our in vitro assays demonstrated that its single miR-21 target recognition site is functional and essential for the response of the Jagged1 gene to miR-21 levels. Our findings pave the road for advanced in vivo functional assays aimed to clarify how microRNAs such as miR-21, often linked to pathogenic cell growth, might be modulating the redeployment of developmental genes such as Jagged1 during regenerative processes.

  11. Bone health nutraceuticals alter microarray mRNA gene expression: A randomized, parallel, open-label clinical study.

    Science.gov (United States)

    Lin, Yumei; Kazlova, Valentina; Ramakrishnan, Shyam; Murray, Mary A; Fast, David; Chandra, Amitabh; Gellenbeck, Kevin W

    2016-01-15

    Dietary intake of fruits and vegetables has been suggested to have a role in promoting bone health. More specifically, the polyphenols they contain have been linked to physiological effects related to bone mineral density and bone metabolism. In this research, we use standard microarray analyses of peripheral whole blood from post-menopausal women treated with two fixed combinations of plant extracts standardized to polyphenol content to identify differentially expressed genes relevant to bone health. In this 28-day open-label study, healthy post-menopausal women were randomized into three groups, each receiving one of three investigational fixed combinations of plant extracts: an anti-resorptive (AR) combination of pomegranate fruit (Punica granatum L.) and grape seed (Vitis vinifera L.) extracts; a bone formation (BF) combination of quercetin (Dimorphandra mollis Benth) and licorice (Glycyrrhiza glabra L.) extracts; and a fixed combination of all four plant extracts (AR plus BF). Standard microarray analysis was performed on peripheral whole blood samples taken before and after each treatment. Annotated genes were analyzed for their association to bone health by comparison to a gene library. The AR combination down-regulated a number of genes involved in reduction of bone resorption including cathepsin G (CTSG) and tachykinin receptor 1 (TACR1). The AR combination also up-regulated genes associated with formation of extracellular matrix including heparan sulfate proteoglycan 2 (HSPG2) and hyaluronoglucosaminidase 1 (HYAL1). In contrast, treatment with the BF combination resulted in up-regulation of bone morphogenetic protein 2 (BMP-2) and COL1A1 (collagen type I α1) genes which are linked to bone and collagen formation while down-regulating genes linked to osteoclastogenesis. Treatment with a combination of all four plant extracts had a distinctly different effect on gene expression than the results of the AR and BF combinations individually. These results could

  12. A DNA microarray-based methylation-sensitive (MS)-AFLP hybridization method for genetic and epigenetic analyses.

    Science.gov (United States)

    Yamamoto, F; Yamamoto, M

    2004-07-01

    We previously developed a PCR-based DNA fingerprinting technique named the Methylation Sensitive (MS)-AFLP method, which permits comparative genome-wide scanning of methylation status with a manageable number of fingerprinting experiments. The technique uses the methylation sensitive restriction enzyme NotI in the context of the existing Amplified Fragment Length Polymorphism (AFLP) method. Here we report the successful conversion of this gel electrophoresis-based DNA fingerprinting technique into a DNA microarray hybridization technique (DNA Microarray MS-AFLP). By performing a total of 30 (15 x 2 reciprocal labeling) DNA Microarray MS-AFLP hybridization experiments on genomic DNA from two breast and three prostate cancer cell lines in all pairwise combinations, and Southern hybridization experiments using more than 100 different probes, we have demonstrated that the DNA Microarray MS-AFLP is a reliable method for genetic and epigenetic analyses. No statistically significant differences were observed in the number of differences between the breast-prostate hybridization experiments and the breast-breast or prostate-prostate comparisons.

  13. Microarray Study of Pathway Analysis Expression Profile Associated with MicroRNA-29a with Regard to Murine Cholestatic Liver Injuries

    Directory of Open Access Journals (Sweden)

    Sung-Chou Li

    2016-03-01

    Full Text Available Accumulating evidence demonstrates that microRNA-29 (miR-29 expression is prominently decreased in patients with hepatic fibrosis, which consequently stimulates hepatic stellate cells’ (HSCs activation. We used a cDNA microarray study to gain a more comprehensive understanding of genome-wide gene expressions by adjusting miR-29a expression in a bile duct-ligation (BDL animal model. Methods: Using miR-29a transgenic mice and wild-type littermates and applying the BDL mouse model, we characterized the function of miR-29a with regard to cholestatic liver fibrosis. Pathway enrichment analysis and/or specific validation were performed for differentially expressed genes found within the comparisons. Results: Analysis of the microarray data identified a number of differentially expressed genes due to the miR-29a transgene, BDL, or both. Additional pathway enrichment analysis revealed that TGF-β signaling had a significantly differential activated pathway depending on the occurrence of miR-29a overexpression or the lack thereof. Furthermore, overexpression was found to elicit changes in Wnt/β-catenin after BDL. Conclusion: This study verified that an elevated miR-29a level could alleviate liver fibrosis caused by cholestasis. Furthermore, the protective effects of miR-29a correlate with the downregulation of TGF-β and associated with Wnt/β-catenin signal pathway following BDL.

  14. ELISA-BASE: an integrated bioinformatics tool for analyzing and tracking ELISA microarray data

    OpenAIRE

    White, Amanda M.; Collett, James R.; Seurynck-Servoss, Shannon L.; Daly, Don S.; Zangar, Richard C.

    2009-01-01

    Summary:ELISA-BASE is an open source database for capturing, organizing and analyzing enzyme-linked immunosorbent assay (ELISA) microarray data. ELISA-BASE is an extension of the BioArray Software Environment (BASE) database system.

  15. Vaccine-induced modulation of gene expression in turbot peritoneal cells. A microarray approach.

    Science.gov (United States)

    Fontenla, Francisco; Blanco-Abad, Verónica; Pardo, Belén G; Folgueira, Iria; Noia, Manuel; Gómez-Tato, Antonio; Martínez, Paulino; Leiro, José M; Lamas, Jesús

    2016-07-01

    We used a microarray approach to examine changes in gene expression in turbot peritoneal cells after injection of the fish with vaccines containing the ciliate parasite Philasterides dicentrarchi as antigen and one of the following adjuvants: chitosan-PVMMA microspheres, Freund́s complete adjuvant, aluminium hydroxide gel or Matrix-Q (Isconova, Sweden). We identified 374 genes that were differentially expressed in all groups of fish. Forty-two genes related to tight junctions and focal adhesions and/or actin cytoskeleton were differentially expressed in free peritoneal cells. The profound changes in gene expression related to cell adherence and cytoskeleton may be associated with cell migration and also with the formation of cell-vaccine masses and their attachment to the peritoneal wall. Thirty-five genes related to apoptosis were differentially expressed. Although most of the proteins coded by these genes have a proapoptotic effect, others are antiapoptotic, indicating that both types of signals occur in peritoneal leukocytes of vaccinated fish. Interestingly, many of the genes related to lymphocytes and lymphocyte activity were downregulated in the groups injected with vaccine. We also observed decreased expression of genes related to antigen presentation, suggesting that macrophages (which were abundant in the peritoneal cavity after vaccination) did not express these during the early inflammatory response in the peritoneal cavity. Finally, several genes that participate in the inflammatory response were differentially expressed, and most participated in resolution of inflammation, indicating that an M2 macrophage response is generated in the peritoneal cavity of fish one day post vaccination. Copyright © 2016 Elsevier Ltd. All rights reserved.

  16. Evaluation of chronic lymphocytic leukemia by BAC-based microarray analysis

    Directory of Open Access Journals (Sweden)

    McDaniel Lisa D

    2011-02-01

    Full Text Available Abstract Background Chronic lymphocytic leukemia (CLL is a highly variable disease with life expectancies ranging from months to decades. Cytogenetic findings play an integral role in defining the prognostic significance and treatment for individual patients. Results We have evaluated 25 clinical cases from a tertiary cancer center that have an established diagnosis of CLL and for which there was prior cytogenetic and/or fluorescence in situ hybridization (FISH data. We performed microarray-based comparative genomic hybridization (aCGH using a bacterial artificial chromosome (BAC-based microarray designed for the detection of known constitutional genetic syndromes. In 15 of the 25 cases, aCGH detected all copy number imbalances identified by prior cytogenetic and/or FISH studies. For the majority of those not detected, the aberrations were present at low levels of mosaicism. Furthermore, for 15 of the 25 cases, additional abnormalities were detected. Four of those cases had deletions that mapped to intervals implicated in inherited predisposition to CLL. For most cases, aCGH was able to detect abnormalities present in as few as 10% of cells. Although changes in ploidy are not easily discernable by aCGH, results for two cases illustrate the detection of additional copy gains and losses present within a mosaic tetraploid cell population. Conclusions Our results illustrate the successful evaluation of CLL using a microarray optimized for the interrogation of inherited disorders and the identification of alterations with possible relevance to CLL susceptibility.

  17. Universal Reference RNA as a standard for microarray experiments

    Directory of Open Access Journals (Sweden)

    Fero Michael

    2004-03-01

    Full Text Available Abstract Background Obtaining reliable and reproducible two-color microarray gene expression data is critically important for understanding the biological significance of perturbations made on a cellular system. Microarray design, RNA preparation and labeling, hybridization conditions and data acquisition and analysis are variables difficult to simultaneously control. A useful tool for monitoring and controlling intra- and inter-experimental variation is Universal Reference RNA (URR, developed with the goal of providing hybridization signal at each microarray probe location (spot. Measuring signal at each spot as the ratio of experimental RNA to reference RNA targets, rather than relying on absolute signal intensity, decreases variability by normalizing signal output in any two-color hybridization experiment. Results Human, mouse and rat URR (UHRR, UMRR and URRR, respectively were prepared from pools of RNA derived from individual cell lines representing different tissues. A variety of microarrays were used to determine percentage of spots hybridizing with URR and producing signal above a user defined threshold (microarray coverage. Microarray coverage was consistently greater than 80% for all arrays tested. We confirmed that individual cell lines contribute their own unique set of genes to URR, arguing for a pool of RNA from several cell lines as a better configuration for URR as opposed to a single cell line source for URR. Microarray coverage comparing two separately prepared batches each of UHRR, UMRR and URRR were highly correlated (Pearson's correlation coefficients of 0.97. Conclusion Results of this study demonstrate that large quantities of pooled RNA from individual cell lines are reproducibly prepared and possess diverse gene representation. This type of reference provides a standard for reducing variation in microarray experiments and allows more reliable comparison of gene expression data within and between experiments and

  18. Microarray-based analysis of plasma cirDNA epigenetic modification profiling in xenografted mice exposed to intermittent hypoxia

    Directory of Open Access Journals (Sweden)

    Rene Cortese

    2015-09-01

    Full Text Available Intermittent hypoxia (IH during sleep is one of the major abnormalities occurring in patients suffering from obstructive sleep apnea (OSA, a highly prevalent disorder affecting 6–15% of the general population, particularly among obese people. IH has been proposed as a major determinant of oncogenetically-related processes such as tumor growth, invasion and metastasis. During the growth and expansion of tumors, fragmented DNA is released into the bloodstream and enters the circulation. Circulating tumor DNA (cirDNA conserves the genetic and epigenetic profiles from the tumor of origin and can be isolated from the plasma fraction. Here we report a microarray-based epigenetic profiling of cirDNA isolated from blood samples of mice engrafted with TC1 epithelial lung cancer cells and controls, which were exposed to IH during sleep (XenoIH group, n = 3 or control conditions, (i.e., room air (RA; XenoRA group, n = 3 conditions. To prepare the targets for microarray hybridization, we applied a previously developed method that enriches the modified fraction of the cirDNA without amplification of genomic DNA. Regions of differential cirDNA modification between the two groups were identified by hybridizing the enriched fractions for each sample to Affymetrix GeneChip Human Promoter Arrays 1.0R. Microarray raw and processed data were deposited in NCBI's Gene Expression Omnibus (GEO database (accession number: GSE61070.

  19. Identification of self-consistent modulons from bacterial microarray expression data with the help of structured regulon gene sets

    KAUST Repository

    Permina, Elizaveta A.

    2013-01-01

    Identification of bacterial modulons from series of gene expression measurements on microarrays is a principal problem, especially relevant for inadequately studied but practically important species. Usage of a priori information on regulatory interactions helps to evaluate parameters for regulatory subnetwork inference. We suggest a procedure for modulon construction where a seed regulon is iteratively updated with genes having expression patterns similar to those for regulon member genes. A set of genes essential for a regulon is used to control modulon updating. Essential genes for a regulon were selected as a subset of regulon genes highly related by different measures to each other. Using Escherichia coli as a model, we studied how modulon identification depends on the data, including the microarray experiments set, the adopted relevance measure and the regulon itself. We have found that results of modulon identification are highly dependent on all parameters studied and thus the resulting modulon varies substantially depending on the identification procedure. Yet, modulons that were identified correctly displayed higher stability during iterations, which allows developing a procedure for reliable modulon identification in the case of less studied species where the known regulatory interactions are sparse. Copyright © 2013 Taylor & Francis.

  20. Microarray Meta-Analysis Identifies Acute Lung Injury Biomarkers in Donor Lungs That Predict Development of Primary Graft Failure in Recipients

    Science.gov (United States)

    Haitsma, Jack J.; Furmli, Suleiman; Masoom, Hussain; Liu, Mingyao; Imai, Yumiko; Slutsky, Arthur S.; Beyene, Joseph; Greenwood, Celia M. T.; dos Santos, Claudia

    2012-01-01

    Objectives To perform a meta-analysis of gene expression microarray data from animal studies of lung injury, and to identify an injury-specific gene expression signature capable of predicting the development of lung injury in humans. Methods We performed a microarray meta-analysis using 77 microarray chips across six platforms, two species and different animal lung injury models exposed to lung injury with or/and without mechanical ventilation. Individual gene chips were classified and grouped based on the strategy used to induce lung injury. Effect size (change in gene expression) was calculated between non-injurious and injurious conditions comparing two main strategies to pool chips: (1) one-hit and (2) two-hit lung injury models. A random effects model was used to integrate individual effect sizes calculated from each experiment. Classification models were built using the gene expression signatures generated by the meta-analysis to predict the development of lung injury in human lung transplant recipients. Results Two injury-specific lists of differentially expressed genes generated from our meta-analysis of lung injury models were validated using external data sets and prospective data from animal models of ventilator-induced lung injury (VILI). Pathway analysis of gene sets revealed that both new and previously implicated VILI-related pathways are enriched with differentially regulated genes. Classification model based on gene expression signatures identified in animal models of lung injury predicted development of primary graft failure (PGF) in lung transplant recipients with larger than 80% accuracy based upon injury profiles from transplant donors. We also found that better classifier performance can be achieved by using meta-analysis to identify differentially-expressed genes than using single study-based differential analysis. Conclusion Taken together, our data suggests that microarray analysis of gene expression data allows for the detection of

  1. DNA Microarray Technologies: A Novel Approach to Geonomic Research

    Energy Technology Data Exchange (ETDEWEB)

    Hinman, R.; Thrall, B.; Wong, K,

    2002-01-01

    A cDNA microarray allows biologists to examine the expression of thousands of genes simultaneously. Researchers may analyze the complete transcriptional program of an organism in response to specific physiological or developmental conditions. By design, a cDNA microarray is an experiment with many variables and few controls. One question that inevitably arises when working with a cDNA microarray is data reproducibility. How easy is it to confirm mRNA expression patterns? In this paper, a case study involving the treatment of a murine macrophage RAW 264.7 cell line with tumor necrosis factor alpha (TNF) was used to obtain a rough estimate of data reproducibility. Two trials were examined and a list of genes displaying either a > 2-fold or > 4-fold increase in gene expression was compiled. Variations in signal mean ratios between the two slides were observed. We can assume that erring in reproducibility may be compensated by greater inductive levels of similar genes. Steps taken to obtain results included serum starvation of cells before treatment, tests of mRNA for quality/consistency, and data normalization.

  2. Identification of listeria species isolated in Tunisia by Microarray based assay : results of a preliminary study

    International Nuclear Information System (INIS)

    Hmaied, Fatma; Helel, Salma; Barkallah, Insaf; Leberre, V.; Francois, J.M.; Kechrid, A.

    2008-01-01

    Microarray-based assay is a new molecular approach for genetic screening and identification of microorganisms. We have developed a rapid microarray-based assay for the reliable detection and discrimination of Listeria spp. in food and clinical isolates from Tunisia. The method used in the present study is based on the PCR amplification of a virulence factor gene (iap gene). the PCR mixture contained cyanine Cy5labeled dCTP. Therefore, The PCR products were fluorescently labeled. The presence of multiple species-specific sequences within the iap gene enabled us to design different oligoprobes per species. The species-specific sequences of the iap gene used in this study were obtained from genBank and then aligned for phylogenetic analysis in order to identify and retrieve the sequences of homologues of the amplified iap gene analysed. 20 probes were used for detection and identification of 22 food isolates and clinical isolates of Listeria spp (L. monocytogenes, L. ivanovi), L. welshimeri, L. seeligeri, and L. grayi). Each bacterial gene was identified by hybridization to oligoprobes specific for each Listeria species and immobilized on a glass surface. The microarray analysis showed that 5 clinical isolates and 2 food isolates were identified listeria monocytogenes. Concerning the remaining 15 food isolates; 13 were identified listeria innocua and 2 isolates could not be identified by microarray based assay. Further phylogenetic and molecular analysis are required to design more species-specific probes for the identification of Listeria spp. Microarray-based assay is a simple and rapid method used for Listeria species discrimination

  3. Genome-wide prediction and analysis of human tissue-selective genes using microarray expression data

    Directory of Open Access Journals (Sweden)

    Teng Shaolei

    2013-01-01

    Full Text Available Abstract Background Understanding how genes are expressed specifically in particular tissues is a fundamental question in developmental biology. Many tissue-specific genes are involved in the pathogenesis of complex human diseases. However, experimental identification of tissue-specific genes is time consuming and difficult. The accurate predictions of tissue-specific gene targets could provide useful information for biomarker development and drug target identification. Results In this study, we have developed a machine learning approach for predicting the human tissue-specific genes using microarray expression data. The lists of known tissue-specific genes for different tissues were collected from UniProt database, and the expression data retrieved from the previously compiled dataset according to the lists were used for input vector encoding. Random Forests (RFs and Support Vector Machines (SVMs were used to construct accurate classifiers. The RF classifiers were found to outperform SVM models for tissue-specific gene prediction. The results suggest that the candidate genes for brain or liver specific expression can provide valuable information for further experimental studies. Our approach was also applied for identifying tissue-selective gene targets for different types of tissues. Conclusions A machine learning approach has been developed for accurately identifying the candidate genes for tissue specific/selective expression. The approach provides an efficient way to select some interesting genes for developing new biomedical markers and improve our knowledge of tissue-specific expression.

  4. Expression microarray reproducibility is improved by optimising purification steps in RNA amplification and labelling

    Directory of Open Access Journals (Sweden)

    Brenton James D

    2004-01-01

    Full Text Available Abstract Background Expression microarrays have evolved into a powerful tool with great potential for clinical application and therefore reliability of data is essential. RNA amplification is used when the amount of starting material is scarce, as is frequently the case with clinical samples. Purification steps are critical in RNA amplification and labelling protocols, and there is a lack of sufficient data to validate and optimise the process. Results Here the purification steps involved in the protocol for indirect labelling of amplified RNA are evaluated and the experimentally determined best method for each step with respect to yield, purity, size distribution of the transcripts, and dye coupling is used to generate targets tested in replicate hybridisations. DNase treatment of diluted total RNA samples followed by phenol extraction is the optimal way to remove genomic DNA contamination. Purification of double-stranded cDNA is best achieved by phenol extraction followed by isopropanol precipitation at room temperature. Extraction with guanidinium-phenol and Lithium Chloride precipitation are the optimal methods for purification of amplified RNA and labelled aRNA respectively. Conclusion This protocol provides targets that generate highly reproducible microarray data with good representation of transcripts across the size spectrum and a coefficient of repeatability significantly better than that reported previously.

  5. A microarray-based analysis of gametogenesis in two Portuguese populations of the European clam Ruditapes decussatus.

    Directory of Open Access Journals (Sweden)

    Joana Teixeira de Sousa

    Full Text Available The European clam, Ruditapes decussatus is a species with a high commercial importance in Portugal and other Southern European countries. Its production is almost exclusively based on natural recruitment, which is subject to high annual fluctuations. Increased knowledge of the natural reproductive cycle of R. decussatus and its molecular mechanisms would be particularly important in providing new highly valuable genomic information for better understanding the regulation of reproduction in this economically important aquaculture species. In this study, the transcriptomic bases of R. decussatus reproduction have been analysed using a custom oligonucleotide microarray representing 51,678 assembled contigs. Microarray analyses were performed in four gonadal maturation stages from two different Portuguese wild populations, characterized by different responses to spawning induction when used as progenitors in hatchery. A comparison between the two populations elucidated a specific pathway involved in the recognition signals and binding between the oocyte and components of the sperm plasma membrane. We suggest that this pathway can explain part of the differences in terms of spawning induction success between the two populations. In addition, sexes and reproductive stages were compared and a correlation between mRNA levels and gonadal area was investigated. The lists of differentially expressed genes revealed that sex explains most of the variance in gonadal gene expression. Additionally, genes like Foxl2, vitellogenin, condensing 2, mitotic apparatus protein p62, Cep57, sperm associated antigens 6, 16 and 17, motile sperm domain containing protein 2, sperm surface protein Sp17, sperm flagellar proteins 1 and 2 and dpy-30, were identified as being correlated with the gonad area and therefore supposedly with the number and/or the size of the gametes produced.

  6. [Research on the relevance between the virulent genes differential expression and pathogenecity of Leptospira with microarray].

    Science.gov (United States)

    Yu, De-li; Bao, Lang

    2015-01-01

    To find the change of virulent gene expression and to analyze the relevance between the virulent change and the gene expression. Grouped guinea pigs were inoculated with 1 mL Leptospira cultured in vivo, Leptospira cultured in vitro and the Leptospira culture medium through abdominal subcutaneous respectively. The survival rate, body mass and temperature change of guinea pigs in different groups were measured within 15 d after the inoculation, then the survived guinea pigs were scarified, and the organ coefficient was also measured to know the virulence of Leptospira cultured in different environment. The amplified gene segments from Leptospira were used as probes and wrote the microarray. The total RNA was extracted from Leptospira standard strain cultured in culture medium and guinea pigs. After reverse transcription to cDNA, they were labeled with Cy3 and Cy5 respectively. Labeled cDNA was mixed and hybridized with the microarray. The hybridized mircroarray was scanned and analysed. The survival rate of inoculated guinea pig was different from group to group (in vivo group: 0%; in vitro group: 88.9%; culture medium group: 100%). The guinea pigs in vivo group had a higher temperature (PLeptospira: LA1027, LA1029, LA4004, LA3050, LA3540, LA0327, LA0378, LA1650, LA3937, LA2089, LA2144, LA3576, LA0011 and gene of Loa22 were up regulation after continuously cultured in guinea pigs. The pathogenic ability of Leptospira cultured in different environment is different and the gene expression of Leptospira is different between in vivo and in vitro as well. The understanding of the meaning of this change might help to know the pathogenecity of Leptospira.

  7. Dimension reduction methods for microarray data: a review

    Directory of Open Access Journals (Sweden)

    Rabia Aziz

    2017-03-01

    Full Text Available Dimension reduction has become inevitable for pre-processing of high dimensional data. “Gene expression microarray data” is an instance of such high dimensional data. Gene expression microarray data displays the maximum number of genes (features simultaneously at a molecular level with a very small number of samples. The copious numbers of genes are usually provided to a learning algorithm for producing a complete characterization of the classification task. However, most of the times the majority of the genes are irrelevant or redundant to the learning task. It will deteriorate the learning accuracy and training speed as well as lead to the problem of overfitting. Thus, dimension reduction of microarray data is a crucial preprocessing step for prediction and classification of disease. Various feature selection and feature extraction techniques have been proposed in the literature to identify the genes, that have direct impact on the various machine learning algorithms for classification and eliminate the remaining ones. This paper describes the taxonomy of dimension reduction methods with their characteristics, evaluation criteria, advantages and disadvantages. It also presents a review of numerous dimension reduction approaches for microarray data, mainly those methods that have been proposed over the past few years.

  8. Design of an Enterobacteriaceae Pan-genome Microarray Chip

    DEFF Research Database (Denmark)

    Lukjancenko, Oksana; Ussery, David

    2010-01-01

    -density microarray chip has been designed, using 116 Enterobacteriaceae genome sequences, taking into account the enteric pan-genome. Probes for the microarray were checked in silico and performance of the chip, based on experimental strains from four different genera, demonstrate a relatively high ability...... to distinguish those strains on genus, species, and pathotype/serovar levels. Additionally, the microarray performed well when investigating which genes were found in a given strain of interest. The Enterobacteriaceae pan-genome microarray, based on 116 genomes, provides a valuable tool for determination...

  9. Microarray labeling extension values: laboratory signatures for Affymetrix GeneChips

    Science.gov (United States)

    Lee, Yun-Shien; Chen, Chun-Houh; Tsai, Chi-Neu; Tsai, Chia-Lung; Chao, Angel; Wang, Tzu-Hao

    2009-01-01

    Interlaboratory comparison of microarray data, even when using the same platform, imposes several challenges to scientists. RNA quality, RNA labeling efficiency, hybridization procedures and data-mining tools can all contribute variations in each laboratory. In Affymetrix GeneChips, about 11–20 different 25-mer oligonucleotides are used to measure the level of each transcript. Here, we report that ‘labeling extension values (LEVs)’, which are correlation coefficients between probe intensities and probe positions, are highly correlated with the gene expression levels (GEVs) on eukayotic Affymetrix microarray data. By analyzing LEVs and GEVs in the publicly available 2414 cel files of 20 Affymetrix microarray types covering 13 species, we found that correlations between LEVs and GEVs only exist in eukaryotic RNAs, but not in prokaryotic ones. Surprisingly, Affymetrix results of the same specimens that were analyzed in different laboratories could be clearly differentiated only by LEVs, leading to the identification of ‘laboratory signatures’. In the examined dataset, GSE10797, filtering out high-LEV genes did not compromise the discovery of biological processes that are constructed by differentially expressed genes. In conclusion, LEVs provide a new filtering parameter for microarray analysis of gene expression and it may improve the inter- and intralaboratory comparability of Affymetrix GeneChips data. PMID:19295132

  10. Tissue microarray immunohistochemical detection of brachyury is not a prognostic indicator in chordoma.

    Science.gov (United States)

    Zhang, Linlin; Guo, Shang; Schwab, Joseph H; Nielsen, G Petur; Choy, Edwin; Ye, Shunan; Zhang, Zhan; Mankin, Henry; Hornicek, Francis J; Duan, Zhenfeng

    2013-01-01

    Brachyury is a marker for notochord-derived tissues and neoplasms, such as chordoma. However, the prognostic relevance of brachyury expression in chordoma is still unknown. The improvement of tissue microarray technology has provided the opportunity to perform analyses of tumor tissues on a large scale in a uniform and consistent manner. This study was designed with the use of tissue microarray to determine the expression of brachyury. Brachyury expression in chordoma tissues from 78 chordoma patients was analyzed by immunohistochemical staining of tissue microarray. The clinicopathologic parameters, including gender, age, location of tumor and metastatic status were evaluated. Fifty-nine of 78 (75.64%) tumors showed nuclear staining for brachyury, and among them, 29 tumors (49.15%) showed 1+ (mobile spine. However, there was no significant relationship between brachyury expression and other clinical variables. By Kaplan-Meier analysis, brachyury expression failed to produce any significant relationship with the overall survival rate. In conclusion, brachyury expression is not a prognostic indicator in chordoma.

  11. An Overview of DNA Microarray Grid Alignment and Foreground Separation Approaches

    Directory of Open Access Journals (Sweden)

    Bajcsy Peter

    2006-01-01

    Full Text Available This paper overviews DNA microarray grid alignment and foreground separation approaches. Microarray grid alignment and foreground separation are the basic processing steps of DNA microarray images that affect the quality of gene expression information, and hence impact our confidence in any data-derived biological conclusions. Thus, understanding microarray data processing steps becomes critical for performing optimal microarray data analysis. In the past, the grid alignment and foreground separation steps have not been covered extensively in the survey literature. We present several classifications of existing algorithms, and describe the fundamental principles of these algorithms. Challenges related to automation and reliability of processed image data are outlined at the end of this overview paper.

  12. Missing value imputation for microarray gene expression data using histone acetylation information

    Directory of Open Access Journals (Sweden)

    Feng Jihua

    2008-05-01

    Full Text Available Abstract Background It is an important pre-processing step to accurately estimate missing values in microarray data, because complete datasets are required in numerous expression profile analysis in bioinformatics. Although several methods have been suggested, their performances are not satisfactory for datasets with high missing percentages. Results The paper explores the feasibility of doing missing value imputation with the help of gene regulatory mechanism. An imputation framework called histone acetylation information aided imputation method (HAIimpute method is presented. It incorporates the histone acetylation information into the conventional KNN(k-nearest neighbor and LLS(local least square imputation algorithms for final prediction of the missing values. The experimental results indicated that the use of acetylation information can provide significant improvements in microarray imputation accuracy. The HAIimpute methods consistently improve the widely used methods such as KNN and LLS in terms of normalized root mean squared error (NRMSE. Meanwhile, the genes imputed by HAIimpute methods are more correlated with the original complete genes in terms of Pearson correlation coefficients. Furthermore, the proposed methods also outperform GOimpute, which is one of the existing related methods that use the functional similarity as the external information. Conclusion We demonstrated that the using of histone acetylation information could greatly improve the performance of the imputation especially at high missing percentages. This idea can be generalized to various imputation methods to facilitate the performance. Moreover, with more knowledge accumulated on gene regulatory mechanism in addition to histone acetylation, the performance of our approach can be further improved and verified.

  13. AMDA: an R package for the automated microarray data analysis

    Directory of Open Access Journals (Sweden)

    Foti Maria

    2006-07-01

    Full Text Available Abstract Background Microarrays are routinely used to assess mRNA transcript levels on a genome-wide scale. Large amount of microarray datasets are now available in several databases, and new experiments are constantly being performed. In spite of this fact, few and limited tools exist for quickly and easily analyzing the results. Microarray analysis can be challenging for researchers without the necessary training and it can be time-consuming for service providers with many users. Results To address these problems we have developed an automated microarray data analysis (AMDA software, which provides scientists with an easy and integrated system for the analysis of Affymetrix microarray experiments. AMDA is free and it is available as an R package. It is based on the Bioconductor project that provides a number of powerful bioinformatics and microarray analysis tools. This automated pipeline integrates different functions available in the R and Bioconductor projects with newly developed functions. AMDA covers all of the steps, performing a full data analysis, including image analysis, quality controls, normalization, selection of differentially expressed genes, clustering, correspondence analysis and functional evaluation. Finally a LaTEX document is dynamically generated depending on the performed analysis steps. The generated report contains comments and analysis results as well as the references to several files for a deeper investigation. Conclusion AMDA is freely available as an R package under the GPL license. The package as well as an example analysis report can be downloaded in the Services/Bioinformatics section of the Genopolis http://www.genopolis.it/

  14. Monitoring expression profiles of rice (Oryza sativa L.) genes under abiotic stresses using cDNA Microarray Analysis (abstract)

    International Nuclear Information System (INIS)

    Rabbani, M.A.

    2005-01-01

    Transcript regulation in response to cold, drought, high salinity and ABA application was investigated in rice (Oryza sativa L., Nipponbare) with microarray analysis including approx. 1700 independent DNA elements derived from three cDNA libraries constructed from 15-day old rice seedlings stressed with drought, cold and high salinity. A total of 141 non-redundant genes were identified, whose expression ratios were more than three-fold compared with the control genes for at least one of stress treatments in microarray analysis. However, after RNA gel blot analysis, a total of 73 genes were identified, among them the transcripts of 36, 62, 57 and 43 genes were found increased after cold, drought, high salinity and ABA application, respectively. Sixteen of these identified genes have been reported previously to be stress inducible in rice, while 57 of which are novel that have not been reported earlier as stress responsive in rice. We observed a strong association in the expression patterns of stress responsive genes and found 15 stress inducible genes that responded to all four treatments. Based on Venn diagram analysis, 56 genes were induced by both drought and high salinity, whereas 22 genes were upregulated by both cold and high salinity stress. Similarly 43 genes were induced by both drought stress and ABA application, while only 17 genes were identified as cold and ABA inducible genes. These results indicated the existence of greater cross talk between drought, ABA and high salinity stress signaling processes than those between cold and ABA, and cold and high salinity stress signaling pathways. The cold, drought, high salinity and ABA inducible genes were classified into four gene groups from their expression profiles. Analysis of data enabled us to identify a number of promoters and possible cis-acting DNA elements of several genes induced by a variety of abiotic stresses by combining expression data with genomic sequence data of rice. Comparative analysis of

  15. A new method for class prediction based on signed-rank algorithms applied to Affymetrix® microarray experiments

    Directory of Open Access Journals (Sweden)

    Vassal Aurélien

    2008-01-01

    Full Text Available Abstract Background The huge amount of data generated by DNA chips is a powerful basis to classify various pathologies. However, constant evolution of microarray technology makes it difficult to mix data from different chip types for class prediction of limited sample populations. Affymetrix® technology provides both a quantitative fluorescence signal and a decision (detection call: absent or present based on signed-rank algorithms applied to several hybridization repeats of each gene, with a per-chip normalization. We developed a new prediction method for class belonging based on the detection call only from recent Affymetrix chip type. Biological data were obtained by hybridization on U133A, U133B and U133Plus 2.0 microarrays of purified normal B cells and cells from three independent groups of multiple myeloma (MM patients. Results After a call-based data reduction step to filter out non class-discriminative probe sets, the gene list obtained was reduced to a predictor with correction for multiple testing by iterative deletion of probe sets that sequentially improve inter-class comparisons and their significance. The error rate of the method was determined using leave-one-out and 5-fold cross-validation. It was successfully applied to (i determine a sex predictor with the normal donor group classifying gender with no error in all patient groups except for male MM samples with a Y chromosome deletion, (ii predict the immunoglobulin light and heavy chains expressed by the malignant myeloma clones of the validation group and (iii predict sex, light and heavy chain nature for every new patient. Finally, this method was shown powerful when compared to the popular classification method Prediction Analysis of Microarray (PAM. Conclusion This normalization-free method is routinely used for quality control and correction of collection errors in patient reports to clinicians. It can be easily extended to multiple class prediction suitable with

  16. A resampling-based meta-analysis for detection of differential gene expression in breast cancer

    International Nuclear Information System (INIS)

    Gur-Dedeoglu, Bala; Konu, Ozlen; Kir, Serkan; Ozturk, Ahmet Rasit; Bozkurt, Betul; Ergul, Gulusan; Yulug, Isik G

    2008-01-01

    Accuracy in the diagnosis of breast cancer and classification of cancer subtypes has improved over the years with the development of well-established immunohistopathological criteria. More recently, diagnostic gene-sets at the mRNA expression level have been tested as better predictors of disease state. However, breast cancer is heterogeneous in nature; thus extraction of differentially expressed gene-sets that stably distinguish normal tissue from various pathologies poses challenges. Meta-analysis of high-throughput expression data using a collection of statistical methodologies leads to the identification of robust tumor gene expression signatures. A resampling-based meta-analysis strategy, which involves the use of resampling and application of distribution statistics in combination to assess the degree of significance in differential expression between sample classes, was developed. Two independent microarray datasets that contain normal breast, invasive ductal carcinoma (IDC), and invasive lobular carcinoma (ILC) samples were used for the meta-analysis. Expression of the genes, selected from the gene list for classification of normal breast samples and breast tumors encompassing both the ILC and IDC subtypes were tested on 10 independent primary IDC samples and matched non-tumor controls by real-time qRT-PCR. Other existing breast cancer microarray datasets were used in support of the resampling-based meta-analysis. The two independent microarray studies were found to be comparable, although differing in their experimental methodologies (Pearson correlation coefficient, R = 0.9389 and R = 0.8465 for ductal and lobular samples, respectively). The resampling-based meta-analysis has led to the identification of a highly stable set of genes for classification of normal breast samples and breast tumors encompassing both the ILC and IDC subtypes. The expression results of the selected genes obtained through real-time qRT-PCR supported the meta-analysis results. The

  17. A resampling-based meta-analysis for detection of differential gene expression in breast cancer

    Directory of Open Access Journals (Sweden)

    Ergul Gulusan

    2008-12-01

    Full Text Available Abstract Background Accuracy in the diagnosis of breast cancer and classification of cancer subtypes has improved over the years with the development of well-established immunohistopathological criteria. More recently, diagnostic gene-sets at the mRNA expression level have been tested as better predictors of disease state. However, breast cancer is heterogeneous in nature; thus extraction of differentially expressed gene-sets that stably distinguish normal tissue from various pathologies poses challenges. Meta-analysis of high-throughput expression data using a collection of statistical methodologies leads to the identification of robust tumor gene expression signatures. Methods A resampling-based meta-analysis strategy, which involves the use of resampling and application of distribution statistics in combination to assess the degree of significance in differential expression between sample classes, was developed. Two independent microarray datasets that contain normal breast, invasive ductal carcinoma (IDC, and invasive lobular carcinoma (ILC samples were used for the meta-analysis. Expression of the genes, selected from the gene list for classification of normal breast samples and breast tumors encompassing both the ILC and IDC subtypes were tested on 10 independent primary IDC samples and matched non-tumor controls by real-time qRT-PCR. Other existing breast cancer microarray datasets were used in support of the resampling-based meta-analysis. Results The two independent microarray studies were found to be comparable, although differing in their experimental methodologies (Pearson correlation coefficient, R = 0.9389 and R = 0.8465 for ductal and lobular samples, respectively. The resampling-based meta-analysis has led to the identification of a highly stable set of genes for classification of normal breast samples and breast tumors encompassing both the ILC and IDC subtypes. The expression results of the selected genes obtained through real

  18. Transcriptional profiling of endocrine cerebro-osteodysplasia using microarray and next-generation sequencing.

    Directory of Open Access Journals (Sweden)

    Piya Lahiry

    Full Text Available BACKGROUND: Transcriptome profiling of patterns of RNA expression is a powerful approach to identify networks of genes that play a role in disease. To date, most mRNA profiling of tissues has been accomplished using microarrays, but next-generation sequencing can offer a richer and more comprehensive picture. METHODOLOGY/PRINCIPAL FINDINGS: ECO is a rare multi-system developmental disorder caused by a homozygous mutation in ICK encoding intestinal cell kinase. We performed gene expression profiling using both cDNA microarrays and next-generation mRNA sequencing (mRNA-seq of skin fibroblasts from ECO-affected subjects. We then validated a subset of differentially expressed transcripts identified by each method using quantitative reverse transcription-polymerase chain reaction (qRT-PCR. Finally, we used gene ontology (GO to identify critical pathways and processes that were abnormal according to each technical platform. Methodologically, mRNA-seq identifies a much larger number of differentially expressed genes with much better correlation to qRT-PCR results than the microarray (r² = 0.794 and 0.137, respectively. Biologically, cDNA microarray identified functional pathways focused on anatomical structure and development, while the mRNA-seq platform identified a higher proportion of genes involved in cell division and DNA replication pathways. CONCLUSIONS/SIGNIFICANCE: Transcriptome profiling with mRNA-seq had greater sensitivity, range and accuracy than the microarray. The two platforms generated different but complementary hypotheses for further evaluation.

  19. Construction of a cDNA microarray derived from the ascidian Ciona intestinalis.

    Science.gov (United States)

    Azumi, Kaoru; Takahashi, Hiroki; Miki, Yasufumi; Fujie, Manabu; Usami, Takeshi; Ishikawa, Hisayoshi; Kitayama, Atsusi; Satou, Yutaka; Ueno, Naoto; Satoh, Nori

    2003-10-01

    A cDNA microarray was constructed from a basal chordate, the ascidian Ciona intestinalis. The draft genome of Ciona has been read and inferred to contain approximately 16,000 protein-coding genes, and cDNAs for transcripts of 13,464 genes have been characterized and compiled as the "Ciona intestinalis Gene Collection Release I". In the present study, we constructed a cDNA microarray of these 13,464 Ciona genes. A preliminary experiment with Cy3- and Cy5-labeled probes showed extensive differential gene expression between fertilized eggs and larvae. In addition, there was a good correlation between results obtained by the present microarray analysis and those from previous EST analyses. This first microarray of a large collection of Ciona intestinalis cDNA clones should facilitate the analysis of global gene expression and gene networks during the embryogenesis of basal chordates.

  20. Microfluidic extraction and microarray detection of biomarkers from cancer tissue slides

    Science.gov (United States)

    Nguyen, H. T.; Dupont, L. N.; Jean, A. M.; Géhin, T.; Chevolot, Y.; Laurenceau, E.; Gijs, M. A. M.

    2018-03-01

    We report here a new microfluidic method allowing for the quantification of human epidermal growth factor receptor 2 (HER2) expression levels from formalin-fixed breast cancer tissues. After partial extraction of proteins from the tissue slide, the extract is routed to an antibody (Ab) microarray for HER2 titration by fluorescence. Then the HER2-expressing cell area is evaluated by immunofluorescence (IF) staining of the tissue slide and used to normalize the fluorescent HER2 signal measured from the Ab microarray. The number of HER2 gene copies measured by fluorescence in situ hybridization (FISH) on an adjacent tissue slide is concordant with the normalized HER2 expression signal. This work is the first study implementing biomarker extraction and detection from cancer tissue slides using microfluidics in combination with a microarray system, paving the way for further developments towards multiplex and precise quantification of cancer biomarkers.

  1. Microarray-based transcriptomic analysis of differences between long-term gregarious and solitarious desert locusts.

    Directory of Open Access Journals (Sweden)

    Liesbeth Badisco

    Full Text Available Desert locusts (Schistocerca gregaria show an extreme form of phenotypic plasticity and can transform between a cryptic solitarious phase and a swarming gregarious phase. The two phases differ extensively in behavior, morphology and physiology but very little is known about the molecular basis of these differences. We used our recently generated Expressed Sequence Tag (EST database derived from S. gregaria central nervous system (CNS to design oligonucleotide microarrays and compare the expression of thousands of genes in the CNS of long-term gregarious and solitarious adult desert locusts. This identified 214 differentially expressed genes, of which 40% have been annotated to date. These include genes encoding proteins that are associated with CNS development and modeling, sensory perception, stress response and resistance, and fundamental cellular processes. Our microarray analysis has identified genes whose altered expression may enable locusts of either phase to deal with the different challenges they face. Genes for heat shock proteins and proteins which confer protection from infection were upregulated in gregarious locusts, which may allow them to respond to acute physiological challenges. By contrast the longer-lived solitarious locusts appear to be more strongly protected from the slowly accumulating effects of ageing by an upregulation of genes related to anti-oxidant systems, detoxification and anabolic renewal. Gregarious locusts also had a greater abundance of transcripts for proteins involved in sensory processing and in nervous system development and plasticity. Gregarious locusts live in a more complex sensory environment than solitarious locusts and may require a greater turnover of proteins involved in sensory transduction, and possibly greater neuronal plasticity.

  2. Detecting Outlier Microarray Arrays by Correlation and Percentage of Outliers Spots

    Directory of Open Access Journals (Sweden)

    Song Yang

    2006-01-01

    Full Text Available We developed a quality assurance (QA tool, namely microarray outlier filter (MOF, and have applied it to our microarray datasets for the identification of problematic arrays. Our approach is based on the comparison of the arrays using the correlation coefficient and the number of outlier spots generated on each array to reveal outlier arrays. For a human universal reference (HUR dataset, which is used as a technical control in our standard hybridization procedure, 3 outlier arrays were identified out of 35 experiments. For a human blood dataset, 12 outlier arrays were identified from 185 experiments. In general, arrays from human blood samples displayed greater variation in their gene expression profiles than arrays from HUR samples. As a result, MOF identified two distinct patterns in the occurrence of outlier arrays. These results demonstrate that this methodology is a valuable QA practice to identify questionable microarray data prior to downstream analysis.

  3. Evaluation of an expanded microarray for detecting antibiotic resistance genes in a broad range of gram-negative bacterial pathogens.

    Science.gov (United States)

    Card, Roderick; Zhang, Jiancheng; Das, Priya; Cook, Charlotte; Woodford, Neil; Anjum, Muna F

    2013-01-01

    A microarray capable of detecting genes for resistance to 75 clinically relevant antibiotics encompassing 19 different antimicrobial classes was tested on 132 Gram-negative bacteria. Microarray-positive results correlated >91% with antimicrobial resistance phenotypes, assessed using British Society for Antimicrobial Chemotherapy clinical breakpoints; the overall test specificity was >83%. Microarray-positive results without a corresponding resistance phenotype matched 94% with PCR results, indicating accurate detection of genes present in the respective bacteria by microarray when expression was low or absent and, hence, undetectable by susceptibility testing. The low sensitivity and negative predictive values of the microarray results for identifying resistance to some antimicrobial resistance classes are likely due to the limited number of resistance genes present on the current microarray for those antimicrobial agents or to mutation-based resistance mechanisms. With regular updates, this microarray can be used for clinical diagnostics to help accurate therapeutic options to be taken following infection with multiple-antibiotic-resistant Gram-negative bacteria and prevent treatment failure.

  4. The Plasmodium falciparum Sexual Development Transcriptome: A Microarray Analysis using Ontology-Based Pattern Identification

    National Research Council Canada - National Science Library

    Young, Jason A; Fivelman, Quinton L; Blair, Peter L; de la Vega, Patricia; Le Roch, Karine G; Zhou, Yingyao; Carucci, Daniel J; Baker, David A; Winzeler, Elizabeth A

    2005-01-01

    ... a full-genome high-density oligonucleotide microarray. The interpretation of this transcriptional data was aided by applying a novel knowledge-based data-mining algorithm termed ontology-based pattern identification (OPI...

  5. Microarrays for global expression constructed with a low redundancy set of 27,500 sequenced cDNAs representing an array of developmental stages and physiological conditions of the soybean plant

    Directory of Open Access Journals (Sweden)

    Retzel Ernest

    2004-09-01

    Full Text Available Abstract Background Microarrays are an important tool with which to examine coordinated gene expression. Soybean (Glycine max is one of the most economically valuable crop species in the world food supply. In order to accelerate both gene discovery as well as hypothesis-driven research in soybean, global expression resources needed to be developed. The applications of microarray for determining patterns of expression in different tissues or during conditional treatments by dual labeling of the mRNAs are unlimited. In addition, discovery of the molecular basis of traits through examination of naturally occurring variation in hundreds of mutant lines could be enhanced by the construction and use of soybean cDNA microarrays. Results We report the construction and analysis of a low redundancy 'unigene' set of 27,513 clones that represent a variety of soybean cDNA libraries made from a wide array of source tissue and organ systems, developmental stages, and stress or pathogen-challenged plants. The set was assembled from the 5' sequence data of the cDNA clones using cluster analysis programs. The selected clones were then physically reracked and sequenced at the 3' end. In order to increase gene discovery from immature cotyledon libraries that contain abundant mRNAs representing storage protein gene families, we utilized a high density filter normalization approach to preferentially select more weakly expressed cDNAs. All 27,513 cDNA inserts were amplified by polymerase chain reaction. The amplified products, along with some repetitively spotted control or 'choice' clones, were used to produce three 9,728-element microarrays that have been used to examine tissue specific gene expression and global expression in mutant isolines. Conclusions Global expression studies will be greatly aided by the availability of the sequence-validated and low redundancy cDNA sets described in this report. These cDNAs and ESTs represent a wide array of developmental

  6. Small Molecule Microarrays Enable the Identification of a Selective, Quadruplex-Binding Inhibitor of MYC Expression.

    Science.gov (United States)

    Felsenstein, Kenneth M; Saunders, Lindsey B; Simmons, John K; Leon, Elena; Calabrese, David R; Zhang, Shuling; Michalowski, Aleksandra; Gareiss, Peter; Mock, Beverly A; Schneekloth, John S

    2016-01-15

    The transcription factor MYC plays a pivotal role in cancer initiation, progression, and maintenance. However, it has proven difficult to develop small molecule inhibitors of MYC. One attractive route to pharmacological inhibition of MYC has been the prevention of its expression through small molecule-mediated stabilization of the G-quadruplex (G4) present in its promoter. Although molecules that bind globally to quadruplex DNA and influence gene expression are well-known, the identification of new chemical scaffolds that selectively modulate G4-driven genes remains a challenge. Here, we report an approach for the identification of G4-binding small molecules using small molecule microarrays (SMMs). We use the SMM screening platform to identify a novel G4-binding small molecule that inhibits MYC expression in cell models, with minimal impact on the expression of other G4-associated genes. Surface plasmon resonance (SPR) and thermal melt assays demonstrated that this molecule binds reversibly to the MYC G4 with single digit micromolar affinity, and with weaker or no measurable binding to other G4s. Biochemical and cell-based assays demonstrated that the compound effectively silenced MYC transcription and translation via a G4-dependent mechanism of action. The compound induced G1 arrest and was selectively toxic to MYC-driven cancer cell lines containing the G4 in the promoter but had minimal effects in peripheral blood mononucleocytes or a cell line lacking the G4 in its MYC promoter. As a measure of selectivity, gene expression analysis and qPCR experiments demonstrated that MYC and several MYC target genes were downregulated upon treatment with this compound, while the expression of several other G4-driven genes was not affected. In addition to providing a novel chemical scaffold that modulates MYC expression through G4 binding, this work suggests that the SMM screening approach may be broadly useful as an approach for the identification of new G4-binding small

  7. Microarray Analysis of Iris Gene Expression in Mice with Mutations Influencing Pigmentation

    Science.gov (United States)

    Trantow, Colleen M.; Cuffy, Tryphena L.; Fingert, John H.; Kuehn, Markus H.

    2011-01-01

    Purpose. Several ocular diseases involve the iris, notably including oculocutaneous albinism, pigment dispersion syndrome, and exfoliation syndrome. To screen for candidate genes that may contribute to the pathogenesis of these diseases, genome-wide iris gene expression patterns were comparatively analyzed from mouse models of these conditions. Methods. Iris samples from albino mice with a Tyr mutation, pigment dispersion–prone mice with Tyrp1 and Gpnmb mutations, and mice resembling exfoliation syndrome with a Lyst mutation were compared with samples from wild-type mice. All mice were strain (C57BL/6J), age (60 days old), and sex (female) matched. Microarrays were used to compare transcriptional profiles, and differentially expressed transcripts were described by functional annotation clustering using DAVID Bioinformatics Resources. Quantitative real-time PCR was performed to validate a subset of identified changes. Results. Compared with wild-type C57BL/6J mice, each disease context exhibited a large number of statistically significant changes in gene expression, including 685 transcripts differentially expressed in albino irides, 403 in pigment dispersion–prone irides, and 460 in exfoliative-like irides. Conclusions. Functional annotation clusterings were particularly striking among the overrepresented genes, with albino and pigment dispersion–prone irides both exhibiting overall evidence of crystallin-mediated stress responses. Exfoliative-like irides from mice with a Lyst mutation showed overall evidence of involvement of genes that influence immune system processes, lytic vacuoles, and lysosomes. These findings have several biologically relevant implications, particularly with respect to secondary forms of glaucoma, and represent a useful resource as a hypothesis-generating dataset. PMID:20739468

  8. A DNA Microarray-Based Assay to Detect Dual Infection with Two Dengue Virus Serotypes

    Directory of Open Access Journals (Sweden)

    Alvaro Díaz-Badillo

    2014-04-01

    Full Text Available Here; we have described and tested a microarray based-method for the screening of dengue virus (DENV serotypes. This DNA microarray assay is specific and sensitive and can detect dual infections with two dengue virus serotypes and single-serotype infections. Other methodologies may underestimate samples containing more than one serotype. This technology can be used to discriminate between the four DENV serotypes. Single-stranded DNA targets were covalently attached to glass slides and hybridised with specific labelled probes. DENV isolates and dengue samples were used to evaluate microarray performance. Our results demonstrate that the probes hybridized specifically to DENV serotypes; with no detection of unspecific signals. This finding provides evidence that specific probes can effectively identify single and double infections in DENV samples.

  9. DNA microarray data and contextual analysis of correlation graphs

    Directory of Open Access Journals (Sweden)

    Hingamp Pascal

    2003-04-01

    Full Text Available Abstract Background DNA microarrays are used to produce large sets of expression measurements from which specific biological information is sought. Their analysis requires efficient and reliable algorithms for dimensional reduction, classification and annotation. Results We study networks of co-expressed genes obtained from DNA microarray experiments. The mathematical concept of curvature on graphs is used to group genes or samples into clusters to which relevant gene or sample annotations are automatically assigned. Application to publicly available yeast and human lymphoma data demonstrates the reliability of the method in spite of its simplicity, especially with respect to the small number of parameters involved. Conclusions We provide a method for automatically determining relevant gene clusters among the many genes monitored with microarrays. The automatic annotations and the graphical interface improve the readability of the data. A C++ implementation, called Trixy, is available from http://tagc.univ-mrs.fr/bioinformatics/trixy.html.

  10. Seeded Bayesian Networks: Constructing genetic networks from microarray data

    Directory of Open Access Journals (Sweden)

    Quackenbush John

    2008-07-01

    Full Text Available Abstract Background DNA microarrays and other genomics-inspired technologies provide large datasets that often include hidden patterns of correlation between genes reflecting the complex processes that underlie cellular metabolism and physiology. The challenge in analyzing large-scale expression data has been to extract biologically meaningful inferences regarding these processes – often represented as networks – in an environment where the datasets are often imperfect and biological noise can obscure the actual signal. Although many techniques have been developed in an attempt to address these issues, to date their ability to extract meaningful and predictive network relationships has been limited. Here we describe a method that draws on prior information about gene-gene interactions to infer biologically relevant pathways from microarray data. Our approach consists of using preliminary networks derived from the literature and/or protein-protein interaction data as seeds for a Bayesian network analysis of microarray results. Results Through a bootstrap analysis of gene expression data derived from a number of leukemia studies, we demonstrate that seeded Bayesian Networks have the ability to identify high-confidence gene-gene interactions which can then be validated by comparison to other sources of pathway data. Conclusion The use of network seeds greatly improves the ability of Bayesian Network analysis to learn gene interaction networks from gene expression data. We demonstrate that the use of seeds derived from the biomedical literature or high-throughput protein-protein interaction data, or the combination, provides improvement over a standard Bayesian Network analysis, allowing networks involving dynamic processes to be deduced from the static snapshots of biological systems that represent the most common source of microarray data. Software implementing these methods has been included in the widely used TM4 microarray analysis package.

  11. [Differential gene expression in incompatible interaction between Lilium regale Wilson and Fusarium oxysporum f. sp. lilii revealed by combined SSH and microarray analysis].

    Science.gov (United States)

    Rao, J; Liu, D; Zhang, N; He, H; Ge, F; Chen, C

    2014-01-01

    Fusarium wilt, caused by a soilborne pathogen Fusarium oxysporum f. sp. lilii, is the major disease of lily (Lilium L.). In order to isolate the genes differentially expressed in a resistant reaction to F. oxysporum in L. regale Wilson, a cDNA library was constructed with L. regale root during F. oxysporum infection using the suppression subtractive hybridization (SSH), and a total of 585 unique expressed sequence tags (ESTs) were obtained. Furthermore, the gene expression profiles in the incompatible interaction between L. regale and F. oxysporum were revealed by oligonucleotide microarray analysis of 585 unique ESTs comparison to the compatible interaction between a susceptible Lilium Oriental Hybrid 'Siberia' and F. oxysporum. The result of expression profile analysis indicated that the genes encoding pathogenesis-related proteins (PRs), antioxidative stress enzymes, secondary metabolism enzymes, transcription factors, signal transduction proteins as well as a large number of unknown genes were involved in early defense response of L. regale to F. oxysporum infection. Moreover, the following quantitative reverse transcription PCR (QRT-PCR) analysis confirmed reliability of the oligonucleotide microarray data. In the present study, isolation of differentially expressed genes in L. regale during response to F. oxysporum helped to uncover the molecular mechanism associated with the resistance of L. regale against F. oxysporum.

  12. 16S rRNA gene-based phylogenetic microarray for simultaneous identification of members of the genus Burkholderia.

    Science.gov (United States)

    Schönmann, Susan; Loy, Alexander; Wimmersberger, Céline; Sobek, Jens; Aquino, Catharine; Vandamme, Peter; Frey, Beat; Rehrauer, Hubert; Eberl, Leo

    2009-04-01

    For cultivation-independent and highly parallel analysis of members of the genus Burkholderia, an oligonucleotide microarray (phylochip) consisting of 131 hierarchically nested 16S rRNA gene-targeted oligonucleotide probes was developed. A novel primer pair was designed for selective amplification of a 1.3 kb 16S rRNA gene fragment of Burkholderia species prior to microarray analysis. The diagnostic performance of the microarray for identification and differentiation of Burkholderia species was tested with 44 reference strains of the genera Burkholderia, Pandoraea, Ralstonia and Limnobacter. Hybridization patterns based on presence/absence of probe signals were interpreted semi-automatically using the novel likelihood-based strategy of the web-tool Phylo- Detect. Eighty-eight per cent of the reference strains were correctly identified at the species level. The evaluated microarray was applied to investigate shifts in the Burkholderia community structure in acidic forest soil upon addition of cadmium, a condition that selected for Burkholderia species. The microarray results were in agreement with those obtained from phylogenetic analysis of Burkholderia 16S rRNA gene sequences recovered from the same cadmiumcontaminated soil, demonstrating the value of the Burkholderia phylochip for determinative and environmental studies.

  13. Multi-platform whole-genome microarray analyses refine the epigenetic signature of breast cancer metastasis with gene expression and copy number.

    Directory of Open Access Journals (Sweden)

    Joseph Andrews

    2010-01-01

    Full Text Available We have previously identified genome-wide DNA methylation changes in a cell line model of breast cancer metastasis. These complex epigenetic changes that we observed, along with concurrent karyotype analyses, have led us to hypothesize that complex genomic alterations in cancer cells (deletions, translocations and ploidy are superimposed over promoter-specific methylation events that are responsible for gene-specific expression changes observed in breast cancer metastasis.We undertook simultaneous high-resolution, whole-genome analyses of MDA-MB-468GFP and MDA-MB-468GFP-LN human breast cancer cell lines (an isogenic, paired lymphatic metastasis cell line model using Affymetrix gene expression (U133, promoter (1.0R, and SNP/CNV (SNP 6.0 microarray platforms to correlate data from gene expression, epigenetic (DNA methylation, and combination copy number variant/single nucleotide polymorphism microarrays. Using Partek Software and Ingenuity Pathway Analysis we integrated datasets from these three platforms and detected multiple hypomethylation and hypermethylation events. Many of these epigenetic alterations correlated with gene expression changes. In addition, gene dosage events correlated with the karyotypic differences observed between the cell lines and were reflected in specific promoter methylation patterns. Gene subsets were identified that correlated hyper (and hypo methylation with the loss (or gain of gene expression and in parallel, with gene dosage losses and gains, respectively. Individual gene targets from these subsets were also validated for their methylation, expression and copy number status, and susceptible gene pathways were identified that may indicate how selective advantage drives the processes of tumourigenesis and metastasis.Our approach allows more precisely profiling of functionally relevant epigenetic signatures that are associated with cancer progression and metastasis.

  14. Candidate Genes for Testicular Cancer Evaluated by In Situ Protein Expression Analyses on Tissue Microarrays

    Directory of Open Access Journals (Sweden)

    Rolf I. Skotheim

    2003-09-01

    Full Text Available By the use of high-throughput molecular technologies, the number of genes and proteins potentially relevant to testicular germ cell tumor (TGCT and other diseases will increase rapidly. In a recent transcriptional profiling, we demonstrated the overexpression of GRB7 and JUP in TGCTs, confirmed the reported overexpression of CCND2. We also have recent evidences for frequent genetic alterations of FHIT and epigenetic alterations of MGMT. To evaluate whether the expression of these genes is related to any clinicopathological variables, we constructed a tissue microarray with 510 testicular tissue cores from 279 patients diagnosed with TGCT, covering various histological subgroups and clinical stages. By immunohistochemistry, we found that JUP, GRB7, CCND2 proteins were rarely present in normal testis, but frequently expressed at high levels in TGCT. Additionally, all premalignant intratubular germ cell neoplasias were JUP-immunopositive. MGMT and FHIT were expressed by normal testicular tissues, but at significantly lower frequencies in TGCT. Except for CCND2, the expressions of all markers were significantly associated with various TGCT subtypes. In summary, we have developed a high-throughput tool for the evaluation of TGCT markers, utilized this to validate five candidate genes whose protein expressions were indeed deregulated in TGCT.

  15. MAGMA: analysis of two-channel microarrays made easy.

    Science.gov (United States)

    Rehrauer, Hubert; Zoller, Stefan; Schlapbach, Ralph

    2007-07-01

    The web application MAGMA provides a simple and intuitive interface to identify differentially expressed genes from two-channel microarray data. While the underlying algorithms are not superior to those of similar web applications, MAGMA is particularly user friendly and can be used without prior training. The user interface guides the novice user through the most typical microarray analysis workflow consisting of data upload, annotation, normalization and statistical analysis. It automatically generates R-scripts that document MAGMA's entire data processing steps, thereby allowing the user to regenerate all results in his local R installation. The implementation of MAGMA follows the model-view-controller design pattern that strictly separates the R-based statistical data processing, the web-representation and the application logic. This modular design makes the application flexible and easily extendible by experts in one of the fields: statistical microarray analysis, web design or software development. State-of-the-art Java Server Faces technology was used to generate the web interface and to perform user input processing. MAGMA's object-oriented modular framework makes it easily extendible and applicable to other fields and demonstrates that modern Java technology is also suitable for rather small and concise academic projects. MAGMA is freely available at www.magma-fgcz.uzh.ch.

  16. Microarray analysis in the archaeon Halobacterium salinarum strain R1.

    Directory of Open Access Journals (Sweden)

    Jens Twellmeyer

    Full Text Available BACKGROUND: Phototrophy of the extremely halophilic archaeon Halobacterium salinarum was explored for decades. The research was mainly focused on the expression of bacteriorhodopsin and its functional properties. In contrast, less is known about genome wide transcriptional changes and their impact on the physiological adaptation to phototrophy. The tool of choice to record transcriptional profiles is the DNA microarray technique. However, the technique is still rarely used for transcriptome analysis in archaea. METHODOLOGY/PRINCIPAL FINDINGS: We developed a whole-genome DNA microarray based on our sequence data of the Hbt. salinarum strain R1 genome. The potential of our tool is exemplified by the comparison of cells growing under aerobic and phototrophic conditions, respectively. We processed the raw fluorescence data by several stringent filtering steps and a subsequent MAANOVA analysis. The study revealed a lot of transcriptional differences between the two cell states. We found that the transcriptional changes were relatively weak, though significant. Finally, the DNA microarray data were independently verified by a real-time PCR analysis. CONCLUSION/SIGNIFICANCE: This is the first DNA microarray analysis of Hbt. salinarum cells that were actually grown under phototrophic conditions. By comparing the transcriptomics data with current knowledge we could show that our DNA microarray tool is well applicable for transcriptome analysis in the extremely halophilic archaeon Hbt. salinarum. The reliability of our tool is based on both the high-quality array of DNA probes and the stringent data handling including MAANOVA analysis. Among the regulated genes more than 50% had unknown functions. This underlines the fact that haloarchaeal phototrophy is still far away from being completely understood. Hence, the data recorded in this study will be subject to future systems biology analysis.

  17. Design and evaluation of Actichip, a thematic microarray for the study of the actin cytoskeleton

    Science.gov (United States)

    Muller, Jean; Mehlen, André; Vetter, Guillaume; Yatskou, Mikalai; Muller, Arnaud; Chalmel, Frédéric; Poch, Olivier; Friederich, Evelyne; Vallar, Laurent

    2007-01-01

    Background The actin cytoskeleton plays a crucial role in supporting and regulating numerous cellular processes. Mutations or alterations in the expression levels affecting the actin cytoskeleton system or related regulatory mechanisms are often associated with complex diseases such as cancer. Understanding how qualitative or quantitative changes in expression of the set of actin cytoskeleton genes are integrated to control actin dynamics and organisation is currently a challenge and should provide insights in identifying potential targets for drug discovery. Here we report the development of a dedicated microarray, the Actichip, containing 60-mer oligonucleotide probes for 327 genes selected for transcriptome analysis of the human actin cytoskeleton. Results Genomic data and sequence analysis features were retrieved from GenBank and stored in an integrative database called Actinome. From these data, probes were designed using a home-made program (CADO4MI) allowing sequence refinement and improved probe specificity by combining the complementary information recovered from the UniGene and RefSeq databases. Actichip performance was analysed by hybridisation with RNAs extracted from epithelial MCF-7 cells and human skeletal muscle. Using thoroughly standardised procedures, we obtained microarray images with excellent quality resulting in high data reproducibility. Actichip displayed a large dynamic range extending over three logs with a limit of sensitivity between one and ten copies of transcript per cell. The array allowed accurate detection of small changes in gene expression and reliable classification of samples based on the expression profiles of tissue-specific genes. When compared to two other oligonucleotide microarray platforms, Actichip showed similar sensitivity and concordant expression ratios. Moreover, Actichip was able to discriminate the highly similar actin isoforms whereas the two other platforms did not. Conclusion Our data demonstrate that

  18. Comparative analysis of human conjunctival and corneal epithelial gene expression with oligonucleotide microarrays.

    Science.gov (United States)

    Turner, Helen C; Budak, Murat T; Akinci, M A Murat; Wolosin, J Mario

    2007-05-01

    To determine global mRNA expression levels in corneal and conjunctival epithelia and identify transcripts that exhibit preferential tissue expression. cDNA samples derived from human conjunctival and corneal epithelia were hybridized in three independent experiments to a commercial oligonucleotide array representing more than 22,000 transcripts. The resultant signal intensities and microarray software transcript present/absent calls were used in conjunction with the local pooled error (LPE) statistical method to identify transcripts that are preferentially or exclusively expressed in one of the two tissues at significant levels (expression >1% of the beta-actin level). EASE (Expression Analysis Systematic Explorer software) was used to identify biological systems comparatively overrepresented in either epithelium. Immuno-, and cytohistochemistry was performed to validate or expand on selected results of interest. The analysis identified 332 preferential and 93 exclusive significant corneal epithelial transcripts. The corresponding numbers of conjunctival epithelium transcripts were 592 and 211, respectively. The overrepresented biological processes in the cornea were related to cell adhesion and oxiredox equilibria and cytoprotection activities. In the conjunctiva, the biological processes that were most prominent were related to innate immunity and melanogenesis. Immunohistochemistry for antigen-presenting cells and melanocytes was consistent with these gene signatures. The transcript comparison identified a substantial number of genes that have either not been identified previously or are not known to be highly expressed in these two epithelia, including testican-1, ECM1, formin, CRTAC1, and NQO1 in the cornea and, in the conjunctiva, sPLA(2)-IIA, lipocalin 2, IGFBP3, multiple MCH class II proteins, and the Na-Pi cotransporter type IIb. Comparative gene expression profiling leads to the identification of many biological processes and previously unknown genes that

  19. Fuzzy support vector machine for microarray imbalanced data classification

    Science.gov (United States)

    Ladayya, Faroh; Purnami, Santi Wulan; Irhamah

    2017-11-01

    DNA microarrays are data containing gene expression with small sample sizes and high number of features. Furthermore, imbalanced classes is a common problem in microarray data. This occurs when a dataset is dominated by a class which have significantly more instances than the other minority classes. Therefore, it is needed a classification method that solve the problem of high dimensional and imbalanced data. Support Vector Machine (SVM) is one of the classification methods that is capable of handling large or small samples, nonlinear, high dimensional, over learning and local minimum issues. SVM has been widely applied to DNA microarray data classification and it has been shown that SVM provides the best performance among other machine learning methods. However, imbalanced data will be a problem because SVM treats all samples in the same importance thus the results is bias for minority class. To overcome the imbalanced data, Fuzzy SVM (FSVM) is proposed. This method apply a fuzzy membership to each input point and reformulate the SVM such that different input points provide different contributions to the classifier. The minority classes have large fuzzy membership so FSVM can pay more attention to the samples with larger fuzzy membership. Given DNA microarray data is a high dimensional data with a very large number of features, it is necessary to do feature selection first using Fast Correlation based Filter (FCBF). In this study will be analyzed by SVM, FSVM and both methods by applying FCBF and get the classification performance of them. Based on the overall results, FSVM on selected features has the best classification performance compared to SVM.

  20. cluML: A markup language for clustering and cluster validity assessment of microarray data.

    Science.gov (United States)

    Bolshakova, Nadia; Cunningham, Pádraig

    2005-01-01

    cluML is a new markup language for microarray data clustering and cluster validity assessment. The XML-based format has been designed to address some of the limitations observed in traditional formats, such as inability to store multiple clustering (including biclustering) and validation results within a dataset. cluML is an effective tool to support biomedical knowledge representation in gene expression data analysis. Although cluML was developed for DNA microarray analysis applications, it can be effectively used for the representation of clustering and for the validation of other biomedical and physical data that has no limitations.

  1. Amygdala-enriched genes identified by microarray technology are restricted to specific amygdaloid subnuclei

    OpenAIRE

    Zirlinger, M.; Kreiman, Gabriel; Anderson, D. J.

    2001-01-01

    Microarray technology represents a potentially powerful method for identifying cell type- and regionally restricted genes expressed in the brain. Here we have combined a microarray analysis of differential gene expression among five selected brain regions, including the amygdala, cerebellum, hippocampus, olfactory bulb, and periaqueductal gray, with in situ hybridization. On average, 0.3% of the 34,000 genes interrogated were highly enriched in each of the five regions...

  2. Analysis of gene expression profile microarray data in complex regional pain syndrome.

    Science.gov (United States)

    Tan, Wulin; Song, Yiyan; Mo, Chengqiang; Jiang, Shuangjian; Wang, Zhongxing

    2017-09-01

    The aim of the present study was to predict key genes and proteins associated with complex regional pain syndrome (CRPS) using bioinformatics analysis. The gene expression profiling microarray data, GSE47603, which included peripheral blood samples from 4 patients with CRPS and 5 healthy controls, was obtained from the Gene Expression Omnibus (GEO) database. The differentially expressed genes (DEGs) in CRPS patients compared with healthy controls were identified using the GEO2R online tool. Functional enrichment analysis was then performed using The Database for Annotation Visualization and Integrated Discovery online tool. Protein‑protein interaction (PPI) network analysis was subsequently performed using Search Tool for the Retrieval of Interaction Genes database and analyzed with Cytoscape software. A total of 257 DEGs were identified, including 243 upregulated genes and 14 downregulated ones. Genes in the human leukocyte antigen (HLA) family were most significantly differentially expressed. Enrichment analysis demonstrated that signaling pathways, including immune response, cell motion, adhesion and angiogenesis were associated with CRPS. PPI network analysis revealed that key genes, including early region 1A binding protein p300 (EP300), CREB‑binding protein (CREBBP), signal transducer and activator of transcription (STAT)3, STAT5A and integrin α M were associated with CRPS. The results suggest that the immune response may therefore serve an important role in CRPS development. In addition, genes in the HLA family, such as HLA‑DQB1 and HLA‑DRB1, may present potential biomarkers for the diagnosis of CRPS. Furthermore, EP300, its paralog CREBBP, and the STAT family genes, STAT3 and STAT5 may be important in the development of CRPS.

  3. Robust Feature Selection from Microarray Data Based on Cooperative Game Theory and Qualitative Mutual Information

    Directory of Open Access Journals (Sweden)

    Atiyeh Mortazavi

    2016-01-01

    Full Text Available High dimensionality of microarray data sets may lead to low efficiency and overfitting. In this paper, a multiphase cooperative game theoretic feature selection approach is proposed for microarray data classification. In the first phase, due to high dimension of microarray data sets, the features are reduced using one of the two filter-based feature selection methods, namely, mutual information and Fisher ratio. In the second phase, Shapley index is used to evaluate the power of each feature. The main innovation of the proposed approach is to employ Qualitative Mutual Information (QMI for this purpose. The idea of Qualitative Mutual Information causes the selected features to have more stability and this stability helps to deal with the problem of data imbalance and scarcity. In the third phase, a forward selection scheme is applied which uses a scoring function to weight each feature. The performance of the proposed method is compared with other popular feature selection algorithms such as Fisher ratio, minimum redundancy maximum relevance, and previous works on cooperative game based feature selection. The average classification accuracy on eleven microarray data sets shows that the proposed method improves both average accuracy and average stability compared to other approaches.

  4. Study of hepatitis B virus gene mutations with enzymatic colorimetry-based DNA microarray.

    Science.gov (United States)

    Mao, Hailei; Wang, Huimin; Zhang, Donglei; Mao, Hongju; Zhao, Jianlong; Shi, Jian; Cui, Zhichu

    2006-01-01

    To establish a modified microarray method for detecting HBV gene mutations in the clinic. Site-specific oligonucleotide probes were immobilized to microarray slides and hybridized to biotin-labeled HBV gene fragments amplified from two-step PCR. Hybridized targets were transferred to nitrocellulose membranes, followed by intensity measurement using BCIP/NBT colorimetry. HBV genes from 99 Hepatitis B patients and 40 healthy blood donors were analyzed. Mutation frequencies of HBV pre-core/core and basic core promoter (BCP) regions were found to be significantly higher in the patient group (42%, 40% versus 2.5%, 5%, P colorimetry method exhibited the same level of sensitivity and reproducibility. An enzymatic colorimetry-based DNA microarray assay was successfully established to monitor HBV mutations. Pre-core/core and BCP mutations of HBV genes could be major causes of HBV infection in HBeAg-negative patients and could also be relevant to chronicity and aggravation of hepatitis B.

  5. The Local Maximum Clustering Method and Its Application in Microarray Gene Expression Data Analysis

    Directory of Open Access Journals (Sweden)

    Chen Yidong

    2004-01-01

    Full Text Available An unsupervised data clustering method, called the local maximum clustering (LMC method, is proposed for identifying clusters in experiment data sets based on research interest. A magnitude property is defined according to research purposes, and data sets are clustered around each local maximum of the magnitude property. By properly defining a magnitude property, this method can overcome many difficulties in microarray data clustering such as reduced projection in similarities, noises, and arbitrary gene distribution. To critically evaluate the performance of this clustering method in comparison with other methods, we designed three model data sets with known cluster distributions and applied the LMC method as well as the hierarchic clustering method, the -mean clustering method, and the self-organized map method to these model data sets. The results show that the LMC method produces the most accurate clustering results. As an example of application, we applied the method to cluster the leukemia samples reported in the microarray study of Golub et al. (1999.

  6. Microarray MAPH: accurate array-based detection of relative copy number in genomic DNA

    Directory of Open Access Journals (Sweden)

    Chan Alan

    2006-06-01

    Full Text Available Abstract Background Current methods for measurement of copy number do not combine all the desirable qualities of convenience, throughput, economy, accuracy and resolution. In this study, to improve the throughput associated with Multiplex Amplifiable Probe Hybridisation (MAPH we aimed to develop a modification based on the 3-Dimensional, Flow-Through Microarray Platform from PamGene International. In this new method, electrophoretic analysis of amplified products is replaced with photometric analysis of a probed oligonucleotide array. Copy number analysis of hybridised probes is based on a dual-label approach by comparing the intensity of Cy3-labelled MAPH probes amplified from test samples co-hybridised with similarly amplified Cy5-labelled reference MAPH probes. The key feature of using a hybridisation-based end point with MAPH is that discrimination of amplified probes is based on sequence and not fragment length. Results In this study we showed that microarray MAPH measurement of PMP22 gene dosage correlates well with PMP22 gene dosage determined by capillary MAPH and that copy number was accurately reported in analyses of DNA from 38 individuals, 12 of which were known to have Charcot-Marie-Tooth disease type 1A (CMT1A. Conclusion Measurement of microarray-based endpoints for MAPH appears to be of comparable accuracy to electrophoretic methods, and holds the prospect of fully exploiting the potential multiplicity of MAPH. The technology has the potential to simplify copy number assays for genes with a large number of exons, or of expanded sets of probes from dispersed genomic locations.

  7. Microarray MAPH: accurate array-based detection of relative copy number in genomic DNA.

    Science.gov (United States)

    Gibbons, Brian; Datta, Parikkhit; Wu, Ying; Chan, Alan; Al Armour, John

    2006-06-30

    Current methods for measurement of copy number do not combine all the desirable qualities of convenience, throughput, economy, accuracy and resolution. In this study, to improve the throughput associated with Multiplex Amplifiable Probe Hybridisation (MAPH) we aimed to develop a modification based on the 3-Dimensional, Flow-Through Microarray Platform from PamGene International. In this new method, electrophoretic analysis of amplified products is replaced with photometric analysis of a probed oligonucleotide array. Copy number analysis of hybridised probes is based on a dual-label approach by comparing the intensity of Cy3-labelled MAPH probes amplified from test samples co-hybridised with similarly amplified Cy5-labelled reference MAPH probes. The key feature of using a hybridisation-based end point with MAPH is that discrimination of amplified probes is based on sequence and not fragment length. In this study we showed that microarray MAPH measurement of PMP22 gene dosage correlates well with PMP22 gene dosage determined by capillary MAPH and that copy number was accurately reported in analyses of DNA from 38 individuals, 12 of which were known to have Charcot-Marie-Tooth disease type 1A (CMT1A). Measurement of microarray-based endpoints for MAPH appears to be of comparable accuracy to electrophoretic methods, and holds the prospect of fully exploiting the potential multiplicity of MAPH. The technology has the potential to simplify copy number assays for genes with a large number of exons, or of expanded sets of probes from dispersed genomic locations.

  8. Microarray evaluation of age-related changes in human dental pulp.

    Science.gov (United States)

    Tranasi, Michelangelo; Sberna, Maria Teresa; Zizzari, Vincenzo; D'Apolito, Giuseppe; Mastrangelo, Filiberto; Salini, Luisa; Stuppia, Liborio; Tetè, Stefano

    2009-09-01

    The dental pulp undergoes age-related changes that could be ascribed to physiological, defensive, or pathological irritant-induced changes. These changes are regulated by pulp cell activity and by a variety of extracellular matrix (ECM) macromolecules, playing important roles in growth regulation, tissue differentiation and organization, formation of calcified tissue, and defense mechanisms and reactions to inflammatory stimuli. The aim of this research was to better understand the genetic changes that underlie the histological modification of the dental pulp in aging. The gene expression profile of the human dental pulp in young and older subjects was compared by RNA microarray analysis that allowed to simultaneously analyze the expression levels of thousands of genes. Data were statistically analyzed by Significance Analysis of Microarrays (SAM) Ingenuity Pathway Analysis (IPA) software. Semiquantitative and real-time reverse-transcriptase polymerase chain reaction analyses were performed to confirm the results. Microarray analysis revealed several differentially expressed genes that were categorized in growth factors, transcription regulators, apoptosis regulators, and genes of the ECM. The comparison analysis showed a high expression level of the biological functions of cell and tissue differentiation, development, and proliferation and of the immune, lymphatic, and hematologic system in young dental pulp, whereas the pathway of apoptosis was highly expressed in older dental pulp. Expression profile analyses of human dental pulp represent a sensible and useful tool for the study of mechanisms involved in differentiation, growth and aging of human dental pulp in physiological and pathological conditions.

  9. An Improved Fuzzy Based Missing Value Estimation in DNA Microarray Validated by Gene Ranking

    Directory of Open Access Journals (Sweden)

    Sujay Saha

    2016-01-01

    Full Text Available Most of the gene expression data analysis algorithms require the entire gene expression matrix without any missing values. Hence, it is necessary to devise methods which would impute missing data values accurately. There exist a number of imputation algorithms to estimate those missing values. This work starts with a microarray dataset containing multiple missing values. We first apply the modified version of the fuzzy theory based existing method LRFDVImpute to impute multiple missing values of time series gene expression data and then validate the result of imputation by genetic algorithm (GA based gene ranking methodology along with some regular statistical validation techniques, like RMSE method. Gene ranking, as far as our knowledge, has not been used yet to validate the result of missing value estimation. Firstly, the proposed method has been tested on the very popular Spellman dataset and results show that error margins have been drastically reduced compared to some previous works, which indirectly validates the statistical significance of the proposed method. Then it has been applied on four other 2-class benchmark datasets, like Colorectal Cancer tumours dataset (GDS4382, Breast Cancer dataset (GSE349-350, Prostate Cancer dataset, and DLBCL-FL (Leukaemia for both missing value estimation and ranking the genes, and the results show that the proposed method can reach 100% classification accuracy with very few dominant genes, which indirectly validates the biological significance of the proposed method.

  10. Support vector machine classification and validation of cancer tissue samples using microarray expression data.

    Science.gov (United States)

    Furey, T S; Cristianini, N; Duffy, N; Bednarski, D W; Schummer, M; Haussler, D

    2000-10-01

    DNA microarray experiments generating thousands of gene expression measurements, are being used to gather information from tissue and cell samples regarding gene expression differences that will be useful in diagnosing disease. We have developed a new method to analyse this kind of data using support vector machines (SVMs). This analysis consists of both classification of the tissue samples, and an exploration of the data for mis-labeled or questionable tissue results. We demonstrate the method in detail on samples consisting of ovarian cancer tissues, normal ovarian tissues, and other normal tissues. The dataset consists of expression experiment results for 97,802 cDNAs for each tissue. As a result of computational analysis, a tissue sample is discovered and confirmed to be wrongly labeled. Upon correction of this mistake and the removal of an outlier, perfect classification of tissues is achieved, but not with high confidence. We identify and analyse a subset of genes from the ovarian dataset whose expression is highly differentiated between the types of tissues. To show robustness of the SVM method, two previously published datasets from other types of tissues or cells are analysed. The results are comparable to those previously obtained. We show that other machine learning methods also perform comparably to the SVM on many of those datasets. The SVM software is available at http://www.cs. columbia.edu/ approximately bgrundy/svm.

  11. Microarray analysis of pancreatic gene expression during biotin repletion in biotin-deficient rats.

    Science.gov (United States)

    Dakshinamurti, Krishnamurti; Bagchi, Rushita A; Abrenica, Bernard; Czubryt, Michael P

    2015-12-01

    Biotin is a B vitamin involved in multiple metabolic pathways. In humans, biotin deficiency is relatively rare but can cause dermatitis, alopecia, and perosis. Low biotin levels occur in individuals with type-2 diabetes, and supplementation with biotin plus chromium may improve blood sugar control. The acute effect on pancreatic gene expression of biotin repletion following chronic deficiency is unclear, therefore we induced biotin deficiency in adult male rats by feeding them a 20% raw egg white diet for 6 weeks. Animals were then randomized into 2 groups: one group received a single biotin supplement and returned to normal chow lacking egg white, while the second group remained on the depletion diet. After 1 week, pancreata were removed from biotin-deficient (BD) and biotin-repleted (BR) animals and RNA was isolated for microarray analysis. Biotin depletion altered gene expression in a manner indicative of inflammation, fibrosis, and defective pancreatic function. Conversely, biotin repletion activated numerous repair and anti-inflammatory pathways, reduced fibrotic gene expression, and induced multiple genes involved in pancreatic endocrine and exocrine function. A subset of the results was confirmed by quantitative real-time PCR analysis, as well as by treatment of pancreatic AR42J cells with biotin. The results indicate that biotin repletion, even after lengthy deficiency, results in the rapid induction of repair processes in the pancreas.

  12. Temporal Gene Expression Profiling of the Wheat Leaf Rust Pathosystem Using cDNA Microarray Reveals Differences in Compatible and Incompatible Defence Pathways

    OpenAIRE

    Fofana, Bourlaye; Banks, Travis W.; McCallum, Brent; Strelkov, Stephen E.; Cloutier, Sylvie

    2007-01-01

    In this study, we detail the construction of a custom cDNA spotted microarray containing 7728 wheat ESTs and the use of the array to identify host genes that are differentially expressed upon challenges with leaf rust fungal pathogens. Wheat cultivar RL6003 (Thatcher Lr1) was inoculated with Puccinia triticina virulence phenotypes BBB (incompatible) or TJB (7-2) (compatible) and sampled at four different time points (3, 6, 12, and 24 hours) after inoculation. Transcript expression levels rela...

  13. On the classification techniques in data mining for microarray data classification

    Science.gov (United States)

    Aydadenta, Husna; Adiwijaya

    2018-03-01

    Cancer is one of the deadly diseases, according to data from WHO by 2015 there are 8.8 million more deaths caused by cancer, and this will increase every year if not resolved earlier. Microarray data has become one of the most popular cancer-identification studies in the field of health, since microarray data can be used to look at levels of gene expression in certain cell samples that serve to analyze thousands of genes simultaneously. By using data mining technique, we can classify the sample of microarray data thus it can be identified with cancer or not. In this paper we will discuss some research using some data mining techniques using microarray data, such as Support Vector Machine (SVM), Artificial Neural Network (ANN), Naive Bayes, k-Nearest Neighbor (kNN), and C4.5, and simulation of Random Forest algorithm with technique of reduction dimension using Relief. The result of this paper show performance measure (accuracy) from classification algorithm (SVM, ANN, Naive Bayes, kNN, C4.5, and Random Forets).The results in this paper show the accuracy of Random Forest algorithm higher than other classification algorithms (Support Vector Machine (SVM), Artificial Neural Network (ANN), Naive Bayes, k-Nearest Neighbor (kNN), and C4.5). It is hoped that this paper can provide some information about the speed, accuracy, performance and computational cost generated from each Data Mining Classification Technique based on microarray data.

  14. Development of a cDNA microarray for the measurement of gene expression in the sheep scab mite Psoroptes ovis

    Directory of Open Access Journals (Sweden)

    Burgess Stewart TG

    2012-02-01

    Full Text Available Abstract Background Sheep scab is caused by the ectoparasitic mite Psoroptes ovis which initiates a profound cutaneous inflammatory response, leading to the development of the skin lesions which are characteristic of the disease. Existing control strategies rely upon injectable endectocides and acaricidal dips but concerns over residues, eco-toxicity and the development of acaricide resistance limit the sustainability of this approach. In order to identify alternative means of disease control, a deeper understanding of both the parasite and its interaction with the host are required. Methods Herein we describe the development and utilisation of an annotated P. ovis cDNA microarray containing 3,456 elements for the measurement of gene expression in this economically important ectoparasite. The array consists of 981 P. ovis EST sequences printed in triplicate along with 513 control elements. Array performance was validated through the analysis of gene expression differences between fed and starved P. ovis mites. Results Sequences represented on the array include homologues of major house dust mite allergens and tick salivary proteins, along with factors potentially involved in mite reproduction and xenobiotic metabolism. In order to validate the performance of this unique resource under biological conditions we used the array to analyse gene expression differences between fed and starved P. ovis mites. These analyses identified a number of house dust mite allergen homologues up-regulated in fed mites and P. ovis transcripts involved in stress responses, autophagy and chemosensory perception up-regulated in starved mites. Conclusion The P. ovis cDNA microarray described here has been shown to be both robust and reproducible and will enable future studies to analyse gene expression in this important ectoparasite.

  15. Discovering biological progression underlying microarray samples.

    Directory of Open Access Journals (Sweden)

    Peng Qiu

    2011-04-01

    Full Text Available In biological systems that undergo processes such as differentiation, a clear concept of progression exists. We present a novel computational approach, called Sample Progression Discovery (SPD, to discover patterns of biological progression underlying microarray gene expression data. SPD assumes that individual samples of a microarray dataset are related by an unknown biological process (i.e., differentiation, development, cell cycle, disease progression, and that each sample represents one unknown point along the progression of that process. SPD aims to organize the samples in a manner that reveals the underlying progression and to simultaneously identify subsets of genes that are responsible for that progression. We demonstrate the performance of SPD on a variety of microarray datasets that were generated by sampling a biological process at different points along its progression, without providing SPD any information of the underlying process. When applied to a cell cycle time series microarray dataset, SPD was not provided any prior knowledge of samples' time order or of which genes are cell-cycle regulated, yet SPD recovered the correct time order and identified many genes that have been associated with the cell cycle. When applied to B-cell differentiation data, SPD recovered the correct order of stages of normal B-cell differentiation and the linkage between preB-ALL tumor cells with their cell origin preB. When applied to mouse embryonic stem cell differentiation data, SPD uncovered a landscape of ESC differentiation into various lineages and genes that represent both generic and lineage specific processes. When applied to a prostate cancer microarray dataset, SPD identified gene modules that reflect a progression consistent with disease stages. SPD may be best viewed as a novel tool for synthesizing biological hypotheses because it provides a likely biological progression underlying a microarray dataset and, perhaps more importantly, the

  16. Gene expression patterns during the larval development of European sea bass (dicentrarchus labrax) by microarray analysis.

    Science.gov (United States)

    Darias, M J; Zambonino-Infante, J L; Hugot, K; Cahu, C L; Mazurais, D

    2008-01-01

    During the larval period, marine teleosts undergo very fast growth and dramatic changes in morphology, metabolism, and behavior to accomplish their metamorphosis into juvenile fish. Regulation of gene expression is widely thought to be a key mechanism underlying the management of the biological processes required for harmonious development over this phase of life. To provide an overall analysis of gene expression in the whole body during sea bass larval development, we monitored the expression of 6,626 distinct genes at 10 different points in time between 7 and 43 days post-hatching (dph) by using heterologous hybridization of a rainbow trout cDNA microarray. The differentially expressed genes (n = 485) could be grouped into two categories: genes that were generally up-expressed early, between 7 and 23 dph, and genes up-expressed between 25 and 43 dph. Interestingly, among the genes regulated during the larval period, those related to organogenesis, energy pathways, biosynthesis, and digestion were over-represented compared with total set of analyzed genes. We discuss the quantitative regulation of whole-body contents of these specific transcripts with regard to the ontogenesis and maturation of essential functions that take place over larval development. Our study is the first utilization of a transcriptomic approach in sea bass and reveals dynamic changes in gene expression patterns in relation to marine finfish larval development.

  17. Molecular sub-classification of renal epithelial tumors using meta-analysis of gene expression microarrays.

    Directory of Open Access Journals (Sweden)

    Thomas Sanford

    Full Text Available To evaluate the accuracy of the sub-classification of renal cortical neoplasms using molecular signatures.A search of publicly available databases was performed to identify microarray datasets with multiple histologic sub-types of renal cortical neoplasms. Meta-analytic techniques were utilized to identify differentially expressed genes for each histologic subtype. The lists of genes obtained from the meta-analysis were used to create predictive signatures through the use of a pair-based method. These signatures were organized into an algorithm to sub-classify renal neoplasms. The use of these signatures according to our algorithm was validated on several independent datasets.We identified three Gene Expression Omnibus datasets that fit our criteria to develop a training set. All of the datasets in our study utilized the Affymetrix platform. The final training dataset included 149 samples represented by the four most common histologic subtypes of renal cortical neoplasms: 69 clear cell, 41 papillary, 16 chromophobe, and 23 oncocytomas. When validation of our signatures was performed on external datasets, we were able to correctly classify 68 of the 72 samples (94%. The correct classification by subtype was 19/20 (95% for clear cell, 14/14 (100% for papillary, 17/19 (89% for chromophobe, 18/19 (95% for oncocytomas.Through the use of meta-analytic techniques, we were able to create an algorithm that sub-classified renal neoplasms on a molecular level with 94% accuracy across multiple independent datasets. This algorithm may aid in selecting molecular therapies and may improve the accuracy of subtyping of renal cortical tumors.

  18. Extended analysis of benchmark datasets for Agilent two-color microarrays

    Directory of Open Access Journals (Sweden)

    Kerr Kathleen F

    2007-10-01

    Full Text Available Abstract Background As part of its broad and ambitious mission, the MicroArray Quality Control (MAQC project reported the results of experiments using External RNA Controls (ERCs on five microarray platforms. For most platforms, several different methods of data processing were considered. However, there was no similar consideration of different methods for processing the data from the Agilent two-color platform. While this omission is understandable given the scale of the project, it can create the false impression that there is consensus about the best way to process Agilent two-color data. It is also important to consider whether ERCs are representative of all the probes on a microarray. Results A comparison of different methods of processing Agilent two-color data shows substantial differences among methods for low-intensity genes. The sensitivity and specificity for detecting differentially expressed genes varies substantially for different methods. Analysis also reveals that the ERCs in the MAQC data only span the upper half of the intensity range, and therefore cannot be representative of all genes on the microarray. Conclusion Although ERCs demonstrate good agreement between observed and expected log-ratios on the Agilent two-color platform, such an analysis is incomplete. Simple loess normalization outperformed data processing with Agilent's Feature Extraction software for accurate identification of differentially expressed genes. Results from studies using ERCs should not be over-generalized when ERCs are not representative of all probes on a microarray.

  19. GeneTrailExpress: a web-based pipeline for the statistical evaluation of microarray experiments

    Directory of Open Access Journals (Sweden)

    Kohlbacher Oliver

    2008-12-01

    Full Text Available Abstract Background High-throughput methods that allow for measuring the expression of thousands of genes or proteins simultaneously have opened new avenues for studying biochemical processes. While the noisiness of the data necessitates an extensive pre-processing of the raw data, the high dimensionality requires effective statistical analysis methods that facilitate the identification of crucial biological features and relations. For these reasons, the evaluation and interpretation of expression data is a complex, labor-intensive multi-step process. While a variety of tools for normalizing, analysing, or visualizing expression profiles has been developed in the last years, most of these tools offer only functionality for accomplishing certain steps of the evaluation pipeline. Results Here, we present a web-based toolbox that provides rich functionality for all steps of the evaluation pipeline. Our tool GeneTrailExpress offers besides standard normalization procedures powerful statistical analysis methods for studying a large variety of biological categories and pathways. Furthermore, an integrated graph visualization tool, BiNA, enables the user to draw the relevant biological pathways applying cutting-edge graph-layout algorithms. Conclusion Our gene expression toolbox with its interactive visualization of the pathways and the expression values projected onto the nodes will simplify the analysis and interpretation of biochemical pathways considerably.

  20. Shared probe design and existing microarray reanalysis using PICKY

    Directory of Open Access Journals (Sweden)

    Chou Hui-Hsien

    2010-04-01

    Full Text Available Abstract Background Large genomes contain families of highly similar genes that cannot be individually identified by microarray probes. This limitation is due to thermodynamic restrictions and cannot be resolved by any computational method. Since gene annotations are updated more frequently than microarrays, another common issue facing microarray users is that existing microarrays must be routinely reanalyzed to determine probes that are still useful with respect to the updated annotations. Results PICKY 2.0 can design shared probes for sets of genes that cannot be individually identified using unique probes. PICKY 2.0 uses novel algorithms to track sharable regions among genes and to strictly distinguish them from other highly similar but nontarget regions during thermodynamic comparisons. Therefore, PICKY does not sacrifice the quality of shared probes when choosing them. The latest PICKY 2.1 includes the new capability to reanalyze existing microarray probes against updated gene sets to determine probes that are still valid to use. In addition, more precise nonlinear salt effect estimates and other improvements are added, making PICKY 2.1 more versatile to microarray users. Conclusions Shared probes allow expressed gene family members to be detected; this capability is generally more desirable than not knowing anything about these genes. Shared probes also enable the design of cross-genome microarrays, which facilitate multiple species identification in environmental samples. The new nonlinear salt effect calculation significantly increases the precision of probes at a lower buffer salt concentration, and the probe reanalysis function improves existing microarray result interpretations.

  1. Microarray analysis of gene expression by skeletal muscle of three mouse models of Kennedy disease/spinal bulbar muscular atrophy.

    Directory of Open Access Journals (Sweden)

    Kaiguo Mo

    2010-09-01

    Full Text Available Emerging evidence implicates altered gene expression within skeletal muscle in the pathogenesis of Kennedy disease/spinal bulbar muscular atrophy (KD/SBMA. We therefore broadly characterized gene expression in skeletal muscle of three independently generated mouse models of this disease. The mouse models included a polyglutamine expanded (polyQ AR knock-in model (AR113Q, a polyQ AR transgenic model (AR97Q, and a transgenic mouse that overexpresses wild type AR solely in skeletal muscle (HSA-AR. HSA-AR mice were included because they substantially reproduce the KD/SBMA phenotype despite the absence of polyQ AR.We performed microarray analysis of lower hindlimb muscles taken from these three models relative to wild type controls using high density oligonucleotide arrays. All microarray comparisons were made with at least 3 animals in each condition, and only those genes having at least 2-fold difference and whose coefficient of variance was less than 100% were considered to be differentially expressed. When considered globally, there was a similar overlap in gene changes between the 3 models: 19% between HSA-AR and AR97Q, 21% between AR97Q and AR113Q, and 17% between HSA-AR and AR113Q, with 8% shared by all models. Several patterns of gene expression relevant to the disease process were observed. Notably, patterns of gene expression typical of loss of AR function were observed in all three models, as were alterations in genes involved in cell adhesion, energy balance, muscle atrophy and myogenesis. We additionally measured changes similar to those observed in skeletal muscle of a mouse model of Huntington's Disease, and to those common to muscle atrophy from diverse causes.By comparing patterns of gene expression in three independent models of KD/SBMA, we have been able to identify candidate genes that might mediate the core myogenic features of KD/SBMA.

  2. Detection of EBV Infection and Gene Expression in Oral Cancer from Patients in Taiwan by Microarray Analysis

    Directory of Open Access Journals (Sweden)

    Ching-Yu Yen

    2009-01-01

    Full Text Available Epstein-Barr virus is known to cause nasopharyngeal carcinoma. Although oral cavity is located close to the nasal pharynx, the pathogenetic role of Epstein-Barr virus (EBV in oral cancers is unclear. This molecular epidemiology study uses EBV genomic microarray (EBV-chip to simultaneously detect the prevalent rate and viral gene expression patterns in 57 oral squamous cell carcinoma biopsies (OSCC collected from patients in Taiwan. The majority of the specimens (82.5% were EBV-positive that probably expressed coincidently the genes for EBNAs, LMP2A and 2B, and certain structural proteins. Importantly, the genes fabricated at the spots 61 (BBRF1, BBRF2, and BBRF3 and 68 (BDLF4 and BDRF1 on EBV-chip were actively expressed in a significantly greater number of OSCC exhibiting exophytic morphology or ulceration than those tissues with deep invasive lesions (P=.0265 and .0141, resp.. The results may thus provide the lead information for understanding the role of EBV in oral cancer pathogenesis.

  3. Spot detection and image segmentation in DNA microarray data.

    Science.gov (United States)

    Qin, Li; Rueda, Luis; Ali, Adnan; Ngom, Alioune

    2005-01-01

    Following the invention of microarrays in 1994, the development and applications of this technology have grown exponentially. The numerous applications of microarray technology include clinical diagnosis and treatment, drug design and discovery, tumour detection, and environmental health research. One of the key issues in the experimental approaches utilising microarrays is to extract quantitative information from the spots, which represent genes in a given experiment. For this process, the initial stages are important and they influence future steps in the analysis. Identifying the spots and separating the background from the foreground is a fundamental problem in DNA microarray data analysis. In this review, we present an overview of state-of-the-art methods for microarray image segmentation. We discuss the foundations of the circle-shaped approach, adaptive shape segmentation, histogram-based methods and the recently introduced clustering-based techniques. We analytically show that clustering-based techniques are equivalent to the one-dimensional, standard k-means clustering algorithm that utilises the Euclidean distance.

  4. Translating microarray data for diagnostic testing in childhood leukaemia

    International Nuclear Information System (INIS)

    Hoffmann, Katrin; Firth, Martin J; Beesley, Alex H; Klerk, Nicholas H de; Kees, Ursula R

    2006-01-01

    Recent findings from microarray studies have raised the prospect of a standardized diagnostic gene expression platform to enhance accurate diagnosis and risk stratification in paediatric acute lymphoblastic leukaemia (ALL). However, the robustness as well as the format for such a diagnostic test remains to be determined. As a step towards clinical application of these findings, we have systematically analyzed a published ALL microarray data set using Robust Multi-array Analysis (RMA) and Random Forest (RF). We examined published microarray data from 104 ALL patients specimens, that represent six different subgroups defined by cytogenetic features and immunophenotypes. Using the decision-tree based supervised learning algorithm Random Forest (RF), we determined a small set of genes for optimal subgroup distinction and subsequently validated their predictive power in an independent patient cohort. We achieved very high overall ALL subgroup prediction accuracies of about 98%, and were able to verify the robustness of these genes in an independent panel of 68 specimens obtained from a different institution and processed in a different laboratory. Our study established that the selection of discriminating genes is strongly dependent on the analysis method. This may have profound implications for clinical use, particularly when the classifier is reduced to a small set of genes. We have demonstrated that as few as 26 genes yield accurate class prediction and importantly, almost 70% of these genes have not been previously identified as essential for class distinction of the six ALL subgroups. Our finding supports the feasibility of qRT-PCR technology for standardized diagnostic testing in paediatric ALL and should, in conjunction with conventional cytogenetics lead to a more accurate classification of the disease. In addition, we have demonstrated that microarray findings from one study can be confirmed in an independent study, using an entirely independent patient cohort

  5. Workflows for microarray data processing in the Kepler environment

    Science.gov (United States)

    2012-01-01

    Background Microarray data analysis has been the subject of extensive and ongoing pipeline development due to its complexity, the availability of several options at each analysis step, and the development of new analysis demands, including integration with new data sources. Bioinformatics pipelines are usually custom built for different applications, making them typically difficult to modify, extend and repurpose. Scientific workflow systems are intended to address these issues by providing general-purpose frameworks in which to develop and execute such pipelines. The Kepler workflow environment is a well-established system under continual development that is employed in several areas of scientific research. Kepler provides a flexible graphical interface, featuring clear display of parameter values, for design and modification of workflows. It has capabilities for developing novel computational components in the R, Python, and Java programming languages, all of which are widely used for bioinformatics algorithm development, along with capabilities for invoking external applications and using web services. Results We developed a series of fully functional bioinformatics pipelines addressing common tasks in microarray processing in the Kepler workflow environment. These pipelines consist of a set of tools for GFF file processing of NimbleGen chromatin immunoprecipitation on microarray (ChIP-chip) datasets and more comprehensive workflows for Affymetrix gene expression microarray bioinformatics and basic primer design for PCR experiments, which are often used to validate microarray results. Although functional in themselves, these workflows can be easily customized, extended, or repurposed to match the needs of specific projects and are designed to be a toolkit and starting point for specific applications. These workflows illustrate a workflow programming paradigm focusing on local resources (programs and data) and therefore are close to traditional shell scripting or

  6. Workflows for microarray data processing in the Kepler environment

    Directory of Open Access Journals (Sweden)

    Stropp Thomas

    2012-05-01

    Full Text Available Abstract Background Microarray data analysis has been the subject of extensive and ongoing pipeline development due to its complexity, the availability of several options at each analysis step, and the development of new analysis demands, including integration with new data sources. Bioinformatics pipelines are usually custom built for different applications, making them typically difficult to modify, extend and repurpose. Scientific workflow systems are intended to address these issues by providing general-purpose frameworks in which to develop and execute such pipelines. The Kepler workflow environment is a well-established system under continual development that is employed in several areas of scientific research. Kepler provides a flexible graphical interface, featuring clear display of parameter values, for design and modification of workflows. It has capabilities for developing novel computational components in the R, Python, and Java programming languages, all of which are widely used for bioinformatics algorithm development, along with capabilities for invoking external applications and using web services. Results We developed a series of fully functional bioinformatics pipelines addressing common tasks in microarray processing in the Kepler workflow environment. These pipelines consist of a set of tools for GFF file processing of NimbleGen chromatin immunoprecipitation on microarray (ChIP-chip datasets and more comprehensive workflows for Affymetrix gene expression microarray bioinformatics and basic primer design for PCR experiments, which are often used to validate microarray results. Although functional in themselves, these workflows can be easily customized, extended, or repurposed to match the needs of specific projects and are designed to be a toolkit and starting point for specific applications. These workflows illustrate a workflow programming paradigm focusing on local resources (programs and data and therefore are close to

  7. Workflows for microarray data processing in the Kepler environment.

    Science.gov (United States)

    Stropp, Thomas; McPhillips, Timothy; Ludäscher, Bertram; Bieda, Mark

    2012-05-17

    Microarray data analysis has been the subject of extensive and ongoing pipeline development due to its complexity, the availability of several options at each analysis step, and the development of new analysis demands, including integration with new data sources. Bioinformatics pipelines are usually custom built for different applications, making them typically difficult to modify, extend and repurpose. Scientific workflow systems are intended to address these issues by providing general-purpose frameworks in which to develop and execute such pipelines. The Kepler workflow environment is a well-established system under continual development that is employed in several areas of scientific research. Kepler provides a flexible graphical interface, featuring clear display of parameter values, for design and modification of workflows. It has capabilities for developing novel computational components in the R, Python, and Java programming languages, all of which are widely used for bioinformatics algorithm development, along with capabilities for invoking external applications and using web services. We developed a series of fully functional bioinformatics pipelines addressing common tasks in microarray processing in the Kepler workflow environment. These pipelines consist of a set of tools for GFF file processing of NimbleGen chromatin immunoprecipitation on microarray (ChIP-chip) datasets and more comprehensive workflows for Affymetrix gene expression microarray bioinformatics and basic primer design for PCR experiments, which are often used to validate microarray results. Although functional in themselves, these workflows can be easily customized, extended, or repurposed to match the needs of specific projects and are designed to be a toolkit and starting point for specific applications. These workflows illustrate a workflow programming paradigm focusing on local resources (programs and data) and therefore are close to traditional shell scripting or R

  8. Data Integration for Microarrays: Enhanced Inference for Gene Regulatory Networks

    Directory of Open Access Journals (Sweden)

    Alina Sîrbu

    2015-05-01

    Full Text Available Microarray technologies have been the basis of numerous important findings regarding gene expression in the few last decades. Studies have generated large amounts of data describing various processes, which, due to the existence of public databases, are widely available for further analysis. Given their lower cost and higher maturity compared to newer sequencing technologies, these data continue to be produced, even though data quality has been the subject of some debate. However, given the large volume of data generated, integration can help overcome some issues related, e.g., to noise or reduced time resolution, while providing additional insight on features not directly addressed by sequencing methods. Here, we present an integration test case based on public Drosophila melanogaster datasets (gene expression, binding site affinities, known interactions. Using an evolutionary computation framework, we show how integration can enhance the ability to recover transcriptional gene regulatory networks from these data, as well as indicating which data types are more important for quantitative and qualitative network inference. Our results show a clear improvement in performance when multiple datasets are integrated, indicating that microarray data will remain a valuable and viable resource for some time to come.

  9. Data Integration for Microarrays: Enhanced Inference for Gene Regulatory Networks.

    Science.gov (United States)

    Sîrbu, Alina; Crane, Martin; Ruskin, Heather J

    2015-05-14

    Microarray technologies have been the basis of numerous important findings regarding gene expression in the few last decades. Studies have generated large amounts of data describing various processes, which, due to the existence of public databases, are widely available for further analysis. Given their lower cost and higher maturity compared to newer sequencing technologies, these data continue to be produced, even though data quality has been the subject of some debate. However, given the large volume of data generated, integration can help overcome some issues related, e.g., to noise or reduced time resolution, while providing additional insight on features not directly addressed by sequencing methods. Here, we present an integration test case based on public Drosophila melanogaster datasets (gene expression, binding site affinities, known interactions). Using an evolutionary computation framework, we show how integration can enhance the ability to recover transcriptional gene regulatory networks from these data, as well as indicating which data types are more important for quantitative and qualitative network inference. Our results show a clear improvement in performance when multiple datasets are integrated, indicating that microarray data will remain a valuable and viable resource for some time to come.

  10. Microarray Meta-Analysis of RNA-Binding Protein Functions in Alternative Polyadenylation

    Science.gov (United States)

    Hu, Wenchao; Liu, Yuting; Yan, Jun

    2014-01-01

    Alternative polyadenylation (APA) is a post-transcriptional mechanism to generate diverse mRNA transcripts with different 3′UTRs from the same gene. In this study, we systematically searched for the APA events with differential expression in public mouse microarray data. Hundreds of genes with over-represented differential APA events and the corresponding experiments were identified. We further revealed that global APA differential expression occurred prevalently in tissues such as brain comparing to peripheral tissues, and biological processes such as development, differentiation and immune responses. Interestingly, we also observed widespread differential APA events in RNA-binding protein (RBP) genes such as Rbm3, Eif4e2 and Elavl1. Given the fact that RBPs are considered as the main regulators of differential APA expression, we constructed a co-expression network between APAs and RBPs using the microarray data. Further incorporation of CLIP-seq data of selected RBPs showed that Nova2 represses and Mbnl1 promotes the polyadenylation of closest poly(A) sites respectively. Altogether, our study is the first microarray meta-analysis in a mammal on the regulation of APA by RBPs that integrated massive mRNA expression data under a wide-range of biological conditions. Finally, we present our results as a comprehensive resource in an online website for the research community. PMID:24622240

  11. CAFE: an R package for the detection of gross chromosomal abnormalities from gene expression microarray data.

    Science.gov (United States)

    Bollen, Sander; Leddin, Mathias; Andrade-Navarro, Miguel A; Mah, Nancy

    2014-05-15

    The current methods available to detect chromosomal abnormalities from DNA microarray expression data are cumbersome and inflexible. CAFE has been developed to alleviate these issues. It is implemented as an R package that analyzes Affymetrix *.CEL files and comes with flexible plotting functions, easing visualization of chromosomal abnormalities. CAFE is available from https://bitbucket.org/cob87icW6z/cafe/ as both source and compiled packages for Linux and Windows. It is released under the GPL version 3 license. CAFE will also be freely available from Bioconductor. sander.h.bollen@gmail.com or nancy.mah@mdc-berlin.de Supplementary data are available at Bioinformatics online.

  12. Nanotechnology: moving from microarrays toward nanoarrays.

    Science.gov (United States)

    Chen, Hua; Li, Jun

    2007-01-01

    Microarrays are important tools for high-throughput analysis of biomolecules. The use of microarrays for parallel screening of nucleic acid and protein profiles has become an industry standard. A few limitations of microarrays are the requirement for relatively large sample volumes and elongated incubation time, as well as the limit of detection. In addition, traditional microarrays make use of bulky instrumentation for the detection, and sample amplification and labeling are quite laborious, which increase analysis cost and delays the time for obtaining results. These problems limit microarray techniques from point-of-care and field applications. One strategy for overcoming these problems is to develop nanoarrays, particularly electronics-based nanoarrays. With further miniaturization, higher sensitivity, and simplified sample preparation, nanoarrays could potentially be employed for biomolecular analysis in personal healthcare and monitoring of trace pathogens. In this chapter, it is intended to introduce the concept and advantage of nanotechnology and then describe current methods and protocols for novel nanoarrays in three aspects: (1) label-free nucleic acids analysis using nanoarrays, (2) nanoarrays for protein detection by conventional optical fluorescence microscopy as well as by novel label-free methods such as atomic force microscopy, and (3) nanoarray for enzymatic-based assay. These nanoarrays will have significant applications in drug discovery, medical diagnosis, genetic testing, environmental monitoring, and food safety inspection.

  13. Microarray mRNA expression analysis of Fanconi anemia fibroblasts.

    Science.gov (United States)

    Galetzka, D; Weis, E; Rittner, G; Schindler, D; Haaf, T

    2008-01-01

    Fanconi anemia (FA) cells are generally hypersensitive to DNA cross-linking agents, implying that mutations in the different FANC genes cause a similar DNA repair defect(s). By using a customized cDNA microarray chip for DNA repair- and cell cycle-associated genes, we identified three genes, cathepsin B (CTSB), glutaredoxin (GLRX), and polo-like kinase 2 (PLK2), that were misregulated in untreated primary fibroblasts from three unrelated FA-D2 patients, compared to six controls. Quantitative real-time RT PCR was used to validate these results and to study possible molecular links between FA-D2 and other FA subtypes. GLRX was misregulated to opposite directions in a variety of different FA subtypes. Increased CTSB and decreased PLK2 expression was found in all or almost all of the analyzed complementation groups and, therefore, may be related to the defective FA pathway. Transcriptional upregulation of the CTSB proteinase appears to be a secondary phenomenon due to proliferation differences between FA and normal fibroblast cultures. In contrast, PLK2 is known to play a pivotal role in processes that are linked to FA defects and may contribute in multiple ways to the FA phenotype: PLK2 is a target gene for TP53, is likely to function as a tumor suppressor gene in hematologic neoplasia, and Plk2(-/-) mice are small because of defective embryonal development. (c) 2008 S. Karger AG, Basel.

  14. Development and Use of Integrated Microarray-Based Genomic Technologies for Assessing Microbial Community Composition and Dynamics

    Energy Technology Data Exchange (ETDEWEB)

    J. Zhou; S.-K. Rhee; C. Schadt; T. Gentry; Z. He; X. Li; X. Liu; J. Liebich; S.C. Chong; L. Wu

    2004-03-17

    To effectively monitor microbial populations involved in various important processes, a 50-mer-based oligonucleotide microarray was developed based on known genes and pathways involved in: biodegradation, metal resistance and reduction, denitrification, nitrification, nitrogen fixation, methane oxidation, methanogenesis, carbon polymer decomposition, and sulfate reduction. This array contains approximately 2000 unique and group-specific probes with <85% similarity to their non-target sequences. Based on artificial probes, our results showed that at hybridization conditions of 50 C and 50% formamide, the 50-mer microarray hybridization can differentiate sequences having <88% similarity. Specificity tests with representative pure cultures indicated that the designed probes on the arrays appeared to be specific to their corresponding target genes. Detection limits were about 5-10ng genomic DNA in the absence of background DNA, and 50-100ng ({approx}1.3{sup o} 10{sup 7} cells) in the presence background DNA. Strong linear relationships between signal intensity and target DNA and RNA concentration were observed (r{sup 2} = 0.95-0.99). Application of this microarray to naphthalene-amended enrichments and soil microcosms demonstrated that composition of the microflora varied depending on incubation conditions. While the naphthalene-degrading genes from Rhodococcus-type microorganisms were dominant in enrichments, the genes involved in naphthalene degradation from Gram-negative microorganisms such as Ralstonia, Comamonas, and Burkholderia were most abundant in the soil microcosms (as well as those for polyaromatic hydrocarbon and nitrotoluene degradation). Although naphthalene degradation is widely known and studied in Pseudomonas, Pseudomonas genes were not detected in either system. Real-time PCR analysis of 4 representative genes was consistent with microarray-based quantification (r{sup 2} = 0.95). Currently, we are also applying this microarray to the study of several

  15. Microarray-based approach identifies microRNAs and their target functional patterns in polycystic kidney disease

    Directory of Open Access Journals (Sweden)

    Boehn Susanne NE

    2008-12-01

    Full Text Available Abstract Background MicroRNAs (miRNAs play key roles in mammalian gene expression and several cellular processes, including differentiation, development, apoptosis and cancer pathomechanisms. Recently the biological importance of primary cilia has been recognized in a number of human genetic diseases. Numerous disorders are related to cilia dysfunction, including polycystic kidney disease (PKD. Although involvement of certain genes and transcriptional networks in PKD development has been shown, not much is known how they are regulated molecularly. Results Given the emerging role of miRNAs in gene expression, we explored the possibilities of miRNA-based regulations in PKD. Here, we analyzed the simultaneous expression changes of miRNAs and mRNAs by microarrays. 935 genes, classified into 24 functional categories, were differentially regulated between PKD and control animals. In parallel, 30 miRNAs were differentially regulated in PKD rats: our results suggest that several miRNAs might be involved in regulating genetic switches in PKD. Furthermore, we describe some newly detected miRNAs, miR-31 and miR-217, in the kidney which have not been reported previously. We determine functionally related gene sets, or pathways to reveal the functional correlation between differentially expressed mRNAs and miRNAs. Conclusion We find that the functional patterns of predicted miRNA targets and differentially expressed mRNAs are similar. Our results suggest an important role of miRNAs in specific pathways underlying PKD.

  16. Microarray-based RNA profiling of breast cancer

    DEFF Research Database (Denmark)

    Larsen, Martin J; Thomassen, Mads; Tan, Qihua

    2014-01-01

    analyzed the same 234 breast cancers on two different microarray platforms. One dataset contained known batch-effects associated with the fabrication procedure used. The aim was to assess the significance of correcting for systematic batch-effects when integrating data from different platforms. We here...

  17. Robust gene selection methods using weighting schemes for microarray data analysis.

    Science.gov (United States)

    Kang, Suyeon; Song, Jongwoo

    2017-09-02

    A common task in microarray data analysis is to identify informative genes that are differentially expressed between two different states. Owing to the high-dimensional nature of microarray data, identification of significant genes has been essential in analyzing the data. However, the performances of many gene selection techniques are highly dependent on the experimental conditions, such as the presence of measurement error or a limited number of sample replicates. We have proposed new filter-based gene selection techniques, by applying a simple modification to significance analysis of microarrays (SAM). To prove the effectiveness of the proposed method, we considered a series of synthetic datasets with different noise levels and sample sizes along with two real datasets. The following findings were made. First, our proposed methods outperform conventional methods for all simulation set-ups. In particular, our methods are much better when the given data are noisy and sample size is small. They showed relatively robust performance regardless of noise level and sample size, whereas the performance of SAM became significantly worse as the noise level became high or sample size decreased. When sufficient sample replicates were available, SAM and our methods showed similar performance. Finally, our proposed methods are competitive with traditional methods in classification tasks for microarrays. The results of simulation study and real data analysis have demonstrated that our proposed methods are effective for detecting significant genes and classification tasks, especially when the given data are noisy or have few sample replicates. By employing weighting schemes, we can obtain robust and reliable results for microarray data analysis.

  18. A newly designed 45 to 60 mer oligonucleotide Agilent platform microarray for global gene expression studies of Synechocystis PCC6803: example salt stress experiment

    NARCIS (Netherlands)

    Aguirre von Wobeser, E.; Huisman, J.; Ibelings, B.; Matthijs, H.C.P.; Matthijs, H.C.P.

    2005-01-01

    A newly designed 45 to 60 mer oligonucleotide Agilent platform microarray for global gene expression studies of Synechocystis PCC6803: example salt stress experiment Eneas Aguirre-von-Wobeser 1, Jef Huisman1, Bas Ibelings2 and Hans C.P. Matthijs1 1 Universiteit van Amsterdam, Amsterdam, The

  19. Domain-oriented functional analysis based on expression profiling

    Directory of Open Access Journals (Sweden)

    Greene Jonathan

    2002-10-01

    Full Text Available Abstract Background Co-regulation of genes may imply involvement in similar biological processes or related function. Many clusters of co-regulated genes have been identified using microarray experiments. In this study, we examined co-regulated gene families using large-scale cDNA microarray experiments on the human transcriptome. Results We present a simple model, which, for each probe pair, distills expression changes into binary digits and summarizes the expression of multiple members of a gene family as the Family Regulation Ratio. The set of Family Regulation Ratios for each protein family across multiple experiments is called a Family Regulation Profile. We analyzed these Family Regulation Profiles using Pearson Correlation Coefficients and derived a network diagram portraying relationships between the Family Regulation Profiles of gene families that are well represented on the microarrays. Our strategy was cross-validated with two randomly chosen data subsets and was proven to be a reliable approach. Conclusion This work will help us to understand and identify the functional relationships between gene families and the regulatory pathways in which each family is involved. Concepts presented here may be useful for objective clustering of protein functions and deriving a comprehensive protein interaction map. Functional genomic approaches such as this may also be applicable to the elucidation of complex genetic regulatory networks.

  20. Normalization and gene p-value estimation: issues in microarray data processing.

    Science.gov (United States)

    Fundel, Katrin; Küffner, Robert; Aigner, Thomas; Zimmer, Ralf

    2008-05-28

    Numerous methods exist for basic processing, e.g. normalization, of microarray gene expression data. These methods have an important effect on the final analysis outcome. Therefore, it is crucial to select methods appropriate for a given dataset in order to assure the validity and reliability of expression data analysis. Furthermore, biological interpretation requires expression values for genes, which are often represented by several spots or probe sets on a microarray. How to best integrate spot/probe set values into gene values has so far been a somewhat neglected problem. We present a case study comparing different between-array normalization methods with respect to the identification of differentially expressed genes. Our results show that it is feasible and necessary to use prior knowledge on gene expression measurements to select an adequate normalization method for the given data. Furthermore, we provide evidence that combining spot/probe set p-values into gene p-values for detecting differentially expressed genes has advantages compared to combining expression values for spots/probe sets into gene expression values. The comparison of different methods suggests to use Stouffer's method for this purpose. The study has been conducted on gene expression experiments investigating human joint cartilage samples of osteoarthritis related groups: a cDNA microarray (83 samples, four groups) and an Affymetrix (26 samples, two groups) data set. The apparently straight forward steps of gene expression data analysis, e.g. between-array normalization and detection of differentially regulated genes, can be accomplished by numerous different methods. We analyzed multiple methods and the possible effects and thereby demonstrate the importance of the single decisions taken during data processing. We give guidelines for evaluating normalization outcomes. An overview of these effects via appropriate measures and plots compared to prior knowledge is essential for the biological

  1. Global gene expression analysis for evaluation and design of biomaterials

    Directory of Open Access Journals (Sweden)

    Nobutaka Hanagata, Taro Takemura and Takashi Minowa

    2010-01-01

    Full Text Available Comprehensive gene expression analysis using DNA microarrays has become a widespread technique in molecular biological research. In the biomaterials field, it is used to evaluate the biocompatibility or cellular toxicity of metals, polymers and ceramics. Studies in this field have extracted differentially expressed genes in the context of differences in cellular responses among multiple materials. Based on these genes, the effects of materials on cells at the molecular level have been examined. Expression data ranging from several to tens of thousands of genes can be obtained from DNA microarrays. For this reason, several tens or hundreds of differentially expressed genes are often present in different materials. In this review, we outline the principles of DNA microarrays, and provide an introduction to methods of extracting information which is useful for evaluating and designing biomaterials from comprehensive gene expression data.

  2. Global gene expression analysis for evaluation and design of biomaterials

    International Nuclear Information System (INIS)

    Hanagata, Nobutaka; Takemura, Taro; Minowa, Takashi

    2010-01-01

    Comprehensive gene expression analysis using DNA microarrays has become a widespread technique in molecular biological research. In the biomaterials field, it is used to evaluate the biocompatibility or cellular toxicity of metals, polymers and ceramics. Studies in this field have extracted differentially expressed genes in the context of differences in cellular responses among multiple materials. Based on these genes, the effects of materials on cells at the molecular level have been examined. Expression data ranging from several to tens of thousands of genes can be obtained from DNA microarrays. For this reason, several tens or hundreds of differentially expressed genes are often present in different materials. In this review, we outline the principles of DNA microarrays, and provide an introduction to methods of extracting information which is useful for evaluating and designing biomaterials from comprehensive gene expression data. (topical review)

  3. Tissue microarray immunohistochemical detection of brachyury is not a prognostic indicator in chordoma.

    Directory of Open Access Journals (Sweden)

    Linlin Zhang

    Full Text Available Brachyury is a marker for notochord-derived tissues and neoplasms, such as chordoma. However, the prognostic relevance of brachyury expression in chordoma is still unknown. The improvement of tissue microarray technology has provided the opportunity to perform analyses of tumor tissues on a large scale in a uniform and consistent manner. This study was designed with the use of tissue microarray to determine the expression of brachyury. Brachyury expression in chordoma tissues from 78 chordoma patients was analyzed by immunohistochemical staining of tissue microarray. The clinicopathologic parameters, including gender, age, location of tumor and metastatic status were evaluated. Fifty-nine of 78 (75.64% tumors showed nuclear staining for brachyury, and among them, 29 tumors (49.15% showed 1+ (<30% positive cells staining, 15 tumors (25.42% had 2+ (31% to 60% positive cells staining, and 15 tumors (25.42% demonstrated 3+ (61% to 100% positive cells staining. Brachyury nuclear staining was detected more frequently in sacral chordomas than in chordomas of the mobile spine. However, there was no significant relationship between brachyury expression and other clinical variables. By Kaplan-Meier analysis, brachyury expression failed to produce any significant relationship with the overall survival rate. In conclusion, brachyury expression is not a prognostic indicator in chordoma.

  4. Microarray expression analysis of meiosis and microsporogenesis in hexaploid bread wheat

    Directory of Open Access Journals (Sweden)

    Langridge Peter

    2006-10-01

    Full Text Available Abstract Background Our understanding of the mechanisms that govern the cellular process of meiosis is limited in higher plants with polyploid genomes. Bread wheat is an allohexaploid that behaves as a diploid during meiosis. Chromosome pairing is restricted to homologous chromosomes despite the presence of homoeologues in the nucleus. The importance of wheat as a crop and the extensive use of wild wheat relatives in breeding programs has prompted many years of cytogenetic and genetic research to develop an understanding of the control of chromosome pairing and recombination. The rapid advance of biochemical and molecular information on meiosis in model organisms such as yeast provides new opportunities to investigate the molecular basis of chromosome pairing control in wheat. However, building the link between the model and wheat requires points of data contact. Results We report here a large-scale transcriptomics study using the Affymetrix wheat GeneChip® aimed at providing this link between wheat and model systems and at identifying early meiotic genes. Analysis of the microarray data identified 1,350 transcripts temporally-regulated during the early stages of meiosis. Expression profiles with annotated transcript functions including chromatin condensation, synaptonemal complex formation, recombination and fertility were identified. From the 1,350 transcripts, 30 displayed at least an eight-fold expression change between and including pre-meiosis and telophase II, with more than 50% of these having no similarities to known sequences in NCBI and TIGR databases. Conclusion This resource is now available to support research into the molecular basis of pairing and recombination control in the complex polyploid, wheat.

  5. Expression microarray meta-analysis identifies genes associated with Ras/MAPK and related pathways in progression of muscle-invasive bladder transition cell carcinoma.

    Directory of Open Access Journals (Sweden)

    Jonathan A Ewald

    Full Text Available The effective detection and management of muscle-invasive bladder Transition Cell Carcinoma (TCC continues to be an urgent clinical challenge. While some differences of gene expression and function in papillary (Ta, superficial (T1 and muscle-invasive (≥T2 bladder cancers have been investigated, the understanding of mechanisms involved in the progression of bladder tumors remains incomplete. Statistical methods of pathway-enrichment, cluster analysis and text-mining can extract and help interpret functional information about gene expression patterns in large sets of genomic data. The public availability of patient-derived expression microarray data allows open access and analysis of large amounts of clinical data. Using these resources, we investigated gene expression differences associated with tumor progression and muscle-invasive TCC. Gene expression was calculated relative to Ta tumors to assess progression-associated differences, revealing a network of genes related to Ras/MAPK and PI3K signaling pathways with increased expression. Further, we identified genes within this network that are similarly expressed in superficial Ta and T1 stages but altered in muscle-invasive T2 tumors, finding 7 genes (COL3A1, COL5A1, COL11A1, FN1, ErbB3, MAPK10 and CDC25C whose expression patterns in muscle-invasive tumors are consistent in 5 to 7 independent outside microarray studies. Further, we found increased expression of the fibrillar collagen proteins COL3A1 and COL5A1 in muscle-invasive tumor samples and metastatic T24 cells. Our results suggest that increased expression of genes involved in mitogenic signaling may support the progression of muscle-invasive bladder tumors that generally lack activating mutations in these pathways, while expression changes of fibrillar collagens, fibronectin and specific signaling proteins are associated with muscle-invasive disease. These results identify potential biomarkers and targets for TCC treatments, and

  6. Validation of the performance of a GMO multiplex screening assay based on microarray detection

    NARCIS (Netherlands)

    Leimanis, S.; Hamels, S.; Naze, F.; Mbongolo, G.; Sneyers, M.; Hochegger, R.; Broll, H.; Roth, L.; Dallmann, K.; Micsinai, A.; Dijk, van J.P.; Kok, E.J.

    2008-01-01

    A new screening method for the detection and identification of GMO, based on the use of multiplex PCR followed by microarray, has been developed and is presented. The technology is based on the identification of quite ubiquitous GMO genetic target elements first amplified by PCR, followed by direct

  7. Towards precise classification of cancers based on robust gene functional expression profiles

    Directory of Open Access Journals (Sweden)

    Zhu Jing

    2005-03-01

    Full Text Available Abstract Background Development of robust and efficient methods for analyzing and interpreting high dimension gene expression profiles continues to be a focus in computational biology. The accumulated experiment evidence supports the assumption that genes express and perform their functions in modular fashions in cells. Therefore, there is an open space for development of the timely and relevant computational algorithms that use robust functional expression profiles towards precise classification of complex human diseases at the modular level. Results Inspired by the insight that genes act as a module to carry out a highly integrated cellular function, we thus define a low dimension functional expression profile for data reduction. After annotating each individual gene to functional categories defined in a proper gene function classification system such as Gene Ontology applied in this study, we identify those functional categories enriched with differentially expressed genes. For each functional category or functional module, we compute a summary measure (s for the raw expression values of the annotated genes to capture the overall activity level of the module. In this way, we can treat the gene expressions within a functional module as an integrative data point to replace the multiple values of individual genes. We compare the classification performance of decision trees based on functional expression profiles with the conventional gene expression profiles using four publicly available datasets, which indicates that precise classification of tumour types and improved interpretation can be achieved with the reduced functional expression profiles. Conclusion This modular approach is demonstrated to be a powerful alternative approach to analyzing high dimension microarray data and is robust to high measurement noise and intrinsic biological variance inherent in microarray data. Furthermore, efficient integration with current biological knowledge

  8. Gene selection and classification for cancer microarray data based on machine learning and similarity measures

    Directory of Open Access Journals (Sweden)

    Liu Qingzhong

    2011-12-01

    Full Text Available Abstract Background Microarray data have a high dimension of variables and a small sample size. In microarray data analyses, two important issues are how to choose genes, which provide reliable and good prediction for disease status, and how to determine the final gene set that is best for classification. Associations among genetic markers mean one can exploit information redundancy to potentially reduce classification cost in terms of time and money. Results To deal with redundant information and improve classification, we propose a gene selection method, Recursive Feature Addition, which combines supervised learning and statistical similarity measures. To determine the final optimal gene set for prediction and classification, we propose an algorithm, Lagging Prediction Peephole Optimization. By using six benchmark microarray gene expression data sets, we compared Recursive Feature Addition with recently developed gene selection methods: Support Vector Machine Recursive Feature Elimination, Leave-One-Out Calculation Sequential Forward Selection and several others. Conclusions On average, with the use of popular learning machines including Nearest Mean Scaled Classifier, Support Vector Machine, Naive Bayes Classifier and Random Forest, Recursive Feature Addition outperformed other methods. Our studies also showed that Lagging Prediction Peephole Optimization is superior to random strategy; Recursive Feature Addition with Lagging Prediction Peephole Optimization obtained better testing accuracies than the gene selection method varSelRF.

  9. The PowerAtlas: a power and sample size atlas for microarray experimental design and research

    Directory of Open Access Journals (Sweden)

    Wang Jelai

    2006-02-01

    Full Text Available Abstract Background Microarrays permit biologists to simultaneously measure the mRNA abundance of thousands of genes. An important issue facing investigators planning microarray experiments is how to estimate the sample size required for good statistical power. What is the projected sample size or number of replicate chips needed to address the multiple hypotheses with acceptable accuracy? Statistical methods exist for calculating power based upon a single hypothesis, using estimates of the variability in data from pilot studies. There is, however, a need for methods to estimate power and/or required sample sizes in situations where multiple hypotheses are being tested, such as in microarray experiments. In addition, investigators frequently do not have pilot data to estimate the sample sizes required for microarray studies. Results To address this challenge, we have developed a Microrarray PowerAtlas 1. The atlas enables estimation of statistical power by allowing investigators to appropriately plan studies by building upon previous studies that have similar experimental characteristics. Currently, there are sample sizes and power estimates based on 632 experiments from Gene Expression Omnibus (GEO. The PowerAtlas also permits investigators to upload their own pilot data and derive power and sample size estimates from these data. This resource will be updated regularly with new datasets from GEO and other databases such as The Nottingham Arabidopsis Stock Center (NASC. Conclusion This resource provides a valuable tool for investigators who are planning efficient microarray studies and estimating required sample sizes.

  10. Supervised group Lasso with applications to microarray data analysis

    Directory of Open Access Journals (Sweden)

    Huang Jian

    2007-02-01

    Full Text Available Abstract Background A tremendous amount of efforts have been devoted to identifying genes for diagnosis and prognosis of diseases using microarray gene expression data. It has been demonstrated that gene expression data have cluster structure, where the clusters consist of co-regulated genes which tend to have coordinated functions. However, most available statistical methods for gene selection do not take into consideration the cluster structure. Results We propose a supervised group Lasso approach that takes into account the cluster structure in gene expression data for gene selection and predictive model building. For gene expression data without biological cluster information, we first divide genes into clusters using the K-means approach and determine the optimal number of clusters using the Gap method. The supervised group Lasso consists of two steps. In the first step, we identify important genes within each cluster using the Lasso method. In the second step, we select important clusters using the group Lasso. Tuning parameters are determined using V-fold cross validation at both steps to allow for further flexibility. Prediction performance is evaluated using leave-one-out cross validation. We apply the proposed method to disease classification and survival analysis with microarray data. Conclusion We analyze four microarray data sets using the proposed approach: two cancer data sets with binary cancer occurrence as outcomes and two lymphoma data sets with survival outcomes. The results show that the proposed approach is capable of identifying a small number of influential gene clusters and important genes within those clusters, and has better prediction performance than existing methods.

  11. Microarray-based identification of clinically relevant vaginal bacteria in relation to bacterial vaginosis

    NARCIS (Netherlands)

    Dols, J.A.M.; Smit, P.W.; Kort, R.; Reid, G.; Schuren, F.H.J.; Tempelman, H.; Bontekoe, T.R.; Korporaal, H.; Boon, M.E.

    2011-01-01

    Objective: The objective was to examine the use of a tailor-made DNA microarray containing probes representing the vaginal microbiota to examine bacterial vaginosis. Study Design: One hundred one women attending a health center for HIV testing in South Africa were enrolled. Stained, liquid-based

  12. Microarray-based identification of clinically relevant vaginal bacteria in relation to bacterial vaginosis

    NARCIS (Netherlands)

    Dols, Joke A M; Smit, Pieter W; Kort, Remco; Reid, Gregor; Schuren, Frank H J; Tempelman, Hugo; Bontekoe, Tj Romke; Korporaal, Hans; Boon, Mathilde E

    OBJECTIVE: The objective was to examine the use of a tailor-made DNA microarray containing probes representing the vaginal microbiota to examine bacterial vaginosis. STUDY DESIGN: One hundred one women attending a health center for HIV testing in South Africa were enrolled. Stained, liquid-based

  13. Microengineering methods for cell-based microarrays and high-throughput drug-screening applications

    International Nuclear Information System (INIS)

    Xu Feng; Wu Jinhui; Wang Shuqi; Gurkan, Umut Atakan; Demirci, Utkan; Durmus, Naside Gozde

    2011-01-01

    Screening for effective therapeutic agents from millions of drug candidates is costly, time consuming, and often faces concerns due to the extensive use of animals. To improve cost effectiveness, and to minimize animal testing in pharmaceutical research, in vitro monolayer cell microarrays with multiwell plate assays have been developed. Integration of cell microarrays with microfluidic systems has facilitated automated and controlled component loading, significantly reducing the consumption of the candidate compounds and the target cells. Even though these methods significantly increased the throughput compared to conventional in vitro testing systems and in vivo animal models, the cost associated with these platforms remains prohibitively high. Besides, there is a need for three-dimensional (3D) cell-based drug-screening models which can mimic the in vivo microenvironment and the functionality of the native tissues. Here, we present the state-of-the-art microengineering approaches that can be used to develop 3D cell-based drug-screening assays. We highlight the 3D in vitro cell culture systems with live cell-based arrays, microfluidic cell culture systems, and their application to high-throughput drug screening. We conclude that among the emerging microengineering approaches, bioprinting holds great potential to provide repeatable 3D cell-based constructs with high temporal, spatial control and versatility.

  14. Microengineering methods for cell-based microarrays and high-throughput drug-screening applications

    Energy Technology Data Exchange (ETDEWEB)

    Xu Feng; Wu Jinhui; Wang Shuqi; Gurkan, Umut Atakan; Demirci, Utkan [Department of Medicine, Demirci Bio-Acoustic-MEMS in Medicine (BAMM) Laboratory, Center for Biomedical Engineering, Brigham and Women' s Hospital, Harvard Medical School, Boston, MA (United States); Durmus, Naside Gozde, E-mail: udemirci@rics.bwh.harvard.edu [School of Engineering and Division of Biology and Medicine, Brown University, Providence, RI (United States)

    2011-09-15

    Screening for effective therapeutic agents from millions of drug candidates is costly, time consuming, and often faces concerns due to the extensive use of animals. To improve cost effectiveness, and to minimize animal testing in pharmaceutical research, in vitro monolayer cell microarrays with multiwell plate assays have been developed. Integration of cell microarrays with microfluidic systems has facilitated automated and controlled component loading, significantly reducing the consumption of the candidate compounds and the target cells. Even though these methods significantly increased the throughput compared to conventional in vitro testing systems and in vivo animal models, the cost associated with these platforms remains prohibitively high. Besides, there is a need for three-dimensional (3D) cell-based drug-screening models which can mimic the in vivo microenvironment and the functionality of the native tissues. Here, we present the state-of-the-art microengineering approaches that can be used to develop 3D cell-based drug-screening assays. We highlight the 3D in vitro cell culture systems with live cell-based arrays, microfluidic cell culture systems, and their application to high-throughput drug screening. We conclude that among the emerging microengineering approaches, bioprinting holds great potential to provide repeatable 3D cell-based constructs with high temporal, spatial control and versatility.

  15. Identification of bovine leukemia virus tax function associated with host cell transcription, signaling, stress response and immune response pathway by microarray-based gene expression analysis

    Directory of Open Access Journals (Sweden)

    Arainga Mariluz

    2012-03-01

    Full Text Available Abstract Background Bovine leukemia virus (BLV is associated with enzootic bovine leukosis and is closely related to human T-cell leukemia virus type I. The Tax protein of BLV is a transcriptional activator of viral replication and a key contributor to oncogenic potential. We previously identified interesting mutant forms of Tax with elevated (TaxD247G or reduced (TaxS240P transactivation effects on BLV replication and propagation. However, the effects of these mutations on functions other than transcriptional activation are unknown. In this study, to identify genes that play a role in the cascade of signal events regulated by wild-type and mutant Tax proteins, we used a large-scale host cell gene-profiling approach. Results Using a microarray containing approximately 18,400 human mRNA transcripts, we found several alterations after the expression of Tax proteins in genes involved in many cellular functions such as transcription, signal transduction, cell growth, apoptosis, stress response, and immune response, indicating that Tax protein has multiple biological effects on various cellular environments. We also found that TaxD247G strongly regulated more genes involved in transcription, signal transduction, and cell growth functions, contrary to TaxS240P, which regulated fewer genes. In addition, the expression of genes related to stress response significantly increased in the presence of TaxS240P as compared to wild-type Tax and TaxD247G. By contrast, the largest group of downregulated genes was related to immune response, and the majority of these genes belonged to the interferon family. However, no significant difference in the expression level of downregulated genes was observed among the Tax proteins. Finally, the expression of important cellular factors obtained from the human microarray results were validated at the RNA and protein levels by real-time quantitative reverse transcription-polymerase chain reaction and western blotting

  16. Identifying significant genetic regulatory networks in the prostate cancer from microarray data based on transcription factor analysis and conditional independency

    Directory of Open Access Journals (Sweden)

    Yeh Cheng-Yu

    2009-12-01

    Full Text Available Abstract Background Prostate cancer is a world wide leading cancer and it is characterized by its aggressive metastasis. According to the clinical heterogeneity, prostate cancer displays different stages and grades related to the aggressive metastasis disease. Although numerous studies used microarray analysis and traditional clustering method to identify the individual genes during the disease processes, the important gene regulations remain unclear. We present a computational method for inferring genetic regulatory networks from micorarray data automatically with transcription factor analysis and conditional independence testing to explore the potential significant gene regulatory networks that are correlated with cancer, tumor grade and stage in the prostate cancer. Results To deal with missing values in microarray data, we used a K-nearest-neighbors (KNN algorithm to determine the precise expression values. We applied web services technology to wrap the bioinformatics toolkits and databases to automatically extract the promoter regions of DNA sequences and predicted the transcription factors that regulate the gene expressions. We adopt the microarray datasets consists of 62 primary tumors, 41 normal prostate tissues from Stanford Microarray Database (SMD as a target dataset to evaluate our method. The predicted results showed that the possible biomarker genes related to cancer and denoted the androgen functions and processes may be in the development of the prostate cancer and promote the cell death in cell cycle. Our predicted results showed that sub-networks of genes SREBF1, STAT6 and PBX1 are strongly related to a high extent while ETS transcription factors ELK1, JUN and EGR2 are related to a low extent. Gene SLC22A3 may explain clinically the differentiation associated with the high grade cancer compared with low grade cancer. Enhancer of Zeste Homolg 2 (EZH2 regulated by RUNX1 and STAT3 is correlated to the pathological stage

  17. Identifying significant genetic regulatory networks in the prostate cancer from microarray data based on transcription factor analysis and conditional independency.

    Science.gov (United States)

    Yeh, Hsiang-Yuan; Cheng, Shih-Wu; Lin, Yu-Chun; Yeh, Cheng-Yu; Lin, Shih-Fang; Soo, Von-Wun

    2009-12-21

    Prostate cancer is a world wide leading cancer and it is characterized by its aggressive metastasis. According to the clinical heterogeneity, prostate cancer displays different stages and grades related to the aggressive metastasis disease. Although numerous studies used microarray analysis and traditional clustering method to identify the individual genes during the disease processes, the important gene regulations remain unclear. We present a computational method for inferring genetic regulatory networks from micorarray data automatically with transcription factor analysis and conditional independence testing to explore the potential significant gene regulatory networks that are correlated with cancer, tumor grade and stage in the prostate cancer. To deal with missing values in microarray data, we used a K-nearest-neighbors (KNN) algorithm to determine the precise expression values. We applied web services technology to wrap the bioinformatics toolkits and databases to automatically extract the promoter regions of DNA sequences and predicted the transcription factors that regulate the gene expressions. We adopt the microarray datasets consists of 62 primary tumors, 41 normal prostate tissues from Stanford Microarray Database (SMD) as a target dataset to evaluate our method. The predicted results showed that the possible biomarker genes related to cancer and denoted the androgen functions and processes may be in the development of the prostate cancer and promote the cell death in cell cycle. Our predicted results showed that sub-networks of genes SREBF1, STAT6 and PBX1 are strongly related to a high extent while ETS transcription factors ELK1, JUN and EGR2 are related to a low extent. Gene SLC22A3 may explain clinically the differentiation associated with the high grade cancer compared with low grade cancer. Enhancer of Zeste Homolg 2 (EZH2) regulated by RUNX1 and STAT3 is correlated to the pathological stage. We provide a computational framework to reconstruct

  18. Exploring the use of internal and externalcontrols for assessing microarray technical performance

    Directory of Open Access Journals (Sweden)

    Game Laurence

    2010-12-01

    Full Text Available Abstract Background The maturing of gene expression microarray technology and interest in the use of microarray-based applications for clinical and diagnostic applications calls for quantitative measures of quality. This manuscript presents a retrospective study characterizing several approaches to assess technical performance of microarray data measured on the Affymetrix GeneChip platform, including whole-array metrics and information from a standard mixture of external spike-in and endogenous internal controls. Spike-in controls were found to carry the same information about technical performance as whole-array metrics and endogenous "housekeeping" genes. These results support the use of spike-in controls as general tools for performance assessment across time, experimenters and array batches, suggesting that they have potential for comparison of microarray data generated across species using different technologies. Results A layered PCA modeling methodology that uses data from a number of classes of controls (spike-in hybridization, spike-in polyA+, internal RNA degradation, endogenous or "housekeeping genes" was used for the assessment of microarray data quality. The controls provide information on multiple stages of the experimental protocol (e.g., hybridization, RNA amplification. External spike-in, hybridization and RNA labeling controls provide information related to both assay and hybridization performance whereas internal endogenous controls provide quality information on the biological sample. We find that the variance of the data generated from the external and internal controls carries critical information about technical performance; the PCA dissection of this variance is consistent with whole-array quality assessment based on a number of quality assurance/quality control (QA/QC metrics. Conclusions These results provide support for the use of both external and internal RNA control data to assess the technical quality of microarray

  19. Deep learning for tissue microarray image-based outcome prediction in patients with colorectal cancer

    Science.gov (United States)

    Bychkov, Dmitrii; Turkki, Riku; Haglund, Caj; Linder, Nina; Lundin, Johan

    2016-03-01

    Recent advances in computer vision enable increasingly accurate automated pattern classification. In the current study we evaluate whether a convolutional neural network (CNN) can be trained to predict disease outcome in patients with colorectal cancer based on images of tumor tissue microarray samples. We compare the prognostic accuracy of CNN features extracted from the whole, unsegmented tissue microarray spot image, with that of CNN features extracted from the epithelial and non-epithelial compartments, respectively. The prognostic accuracy of visually assessed histologic grade is used as a reference. The image data set consists of digitized hematoxylin-eosin (H and E) stained tissue microarray samples obtained from 180 patients with colorectal cancer. The patient samples represent a variety of histological grades, have data available on a series of clinicopathological variables including long-term outcome and ground truth annotations performed by experts. The CNN features extracted from images of the epithelial tissue compartment significantly predicted outcome (hazard ratio (HR) 2.08; CI95% 1.04-4.16; area under the curve (AUC) 0.66) in a test set of 60 patients, as compared to the CNN features extracted from unsegmented images (HR 1.67; CI95% 0.84-3.31, AUC 0.57) and visually assessed histologic grade (HR 1.96; CI95% 0.99-3.88, AUC 0.61). As a conclusion, a deep-learning classifier can be trained to predict outcome of colorectal cancer based on images of H and E stained tissue microarray samples and the CNN features extracted from the epithelial compartment only resulted in a prognostic discrimination comparable to that of visually determined histologic grade.

  20. Polyadenylation state microarray (PASTA) analysis.

    Science.gov (United States)

    Beilharz, Traude H; Preiss, Thomas

    2011-01-01

    Nearly all eukaryotic mRNAs terminate in a poly(A) tail that serves important roles in mRNA utilization. In the cytoplasm, the poly(A) tail promotes both mRNA stability and translation, and these functions are frequently regulated through changes in tail length. To identify the scope of poly(A) tail length control in a transcriptome, we developed the polyadenylation state microarray (PASTA) method. It involves the purification of mRNA based on poly(A) tail length using thermal elution from poly(U) sepharose, followed by microarray analysis of the resulting fractions. In this chapter we detail our PASTA approach and describe some methods for bulk and mRNA-specific poly(A) tail length measurements of use to monitor the procedure and independently verify the microarray data.

  1. Characterization of fetal cells from the maternal circulation by microarray gene expression analysis - Could the extravillous trophoblasts be a target for future cell-based non-invasive prenatal diagnosis?

    DEFF Research Database (Denmark)

    Hatt, Lotte; Brinch, Marie; Singh, Ripudaman

    2014-01-01

    stem cell microarray analysis. Results: 39 genes were identified as candidates for unique fetal cell markers. More than half of these are genes known to be expressed in the placenta, especially in extravillous trophoblasts (EVTs). Immunohistochemical staining of placental tissue confirmed CD105......Introduction: Circulating fetal cells in maternal blood provide a tool for risk-free, non-invasive prenatal diagnosis. However, fetal cells in the maternal circulation are scarce, and to effectively isolate enough of them for reliable diagnostics, it is crucial to know which fetal cell type......(s) should be targeted. Materials and Methods: Fetal cells were enriched from maternal blood by magnetic-activated cell sorting using the endothelial cell marker CD105 and identified by XY fluorescence in situ hybridization. Expression pattern was compared between fetal cells and maternal blood cells using...

  2. Protective Effect of Gwakhyangjeonggisan Herbal Acupuncture Solution in Glioblastoma Cells: Microarray Analysis of Gene Expression

    Directory of Open Access Journals (Sweden)

    Hong-Seok Lee

    2005-12-01

    Full Text Available Objectives : Neurological disorders have been one of main therapeutic targets of acupuncture. The present study investigated the protective effects of Gwakhyangjeonggisan herbal acupuncture solution (GHAS. Methods : We performed 3-(4,5-dimethylthiazol-2-yl-2,5-diphenyltetrazolium bromide (MTT assay in glioblastoma cells, and did microarray analysis with cells exposed to reactive oxigen species (ROS of hydrogen peroxide by 8.0 k Human cDNA, with cut-off level of 2-fold changes in gene expression. Results : MTT assay showed protective effect of GHAS on the glioblastoma cells exposed to hydrogen peroxide. When glioblastoma cells were exposed to hydrogen peroxide, 24 genes were downregulated. When the cells were pretreated with GHAS before exposure to hydrogen peroxide, 46 genes were downregulated. Many of the genes downregulated by hydrogen peroxide stimulation were decreased in the amount of downregulation or reversed to upregulation. Conclusions : The gene expression changes observed in the present study are supposed to be related to the protective molecular mechanism of GHAS in the glioblastoma cells exposed to ROS stress.

  3. Micro-Analyzer: automatic preprocessing of Affymetrix microarray data.

    Science.gov (United States)

    Guzzi, Pietro Hiram; Cannataro, Mario

    2013-08-01

    A current trend in genomics is the investigation of the cell mechanism using different technologies, in order to explain the relationship among genes, molecular processes and diseases. For instance, the combined use of gene-expression arrays and genomic arrays has been demonstrated as an effective instrument in clinical practice. Consequently, in a single experiment different kind of microarrays may be used, resulting in the production of different types of binary data (images and textual raw data). The analysis of microarray data requires an initial preprocessing phase, that makes raw data suitable for use on existing analysis platforms, such as the TIGR M4 (TM4) Suite. An additional challenge to be faced by emerging data analysis platforms is the ability to treat in a combined way those different microarray formats coupled with clinical data. In fact, resulting integrated data may include both numerical and symbolic data (e.g. gene expression and SNPs regarding molecular data), as well as temporal data (e.g. the response to a drug, time to progression and survival rate), regarding clinical data. Raw data preprocessing is a crucial step in analysis but is often performed in a manual and error prone way using different software tools. Thus novel, platform independent, and possibly open source tools enabling the semi-automatic preprocessing and annotation of different microarray data are needed. The paper presents Micro-Analyzer (Microarray Analyzer), a cross-platform tool for the automatic normalization, summarization and annotation of Affymetrix gene expression and SNP binary data. It represents the evolution of the μ-CS tool, extending the preprocessing to SNP arrays that were not allowed in μ-CS. The Micro-Analyzer is provided as a Java standalone tool and enables users to read, preprocess and analyse binary microarray data (gene expression and SNPs) by invoking TM4 platform. It avoids: (i) the manual invocation of external tools (e.g. the Affymetrix Power

  4. Cancer microarray data feature selection using multi-objective binary particle swarm optimization algorithm

    Science.gov (United States)

    Annavarapu, Chandra Sekhara Rao; Dara, Suresh; Banka, Haider

    2016-01-01

    Cancer investigations in microarray data play a major role in cancer analysis and the treatment. Cancer microarray data consists of complex gene expressed patterns of cancer. In this article, a Multi-Objective Binary Particle Swarm Optimization (MOBPSO) algorithm is proposed for analyzing cancer gene expression data. Due to its high dimensionality, a fast heuristic based pre-processing technique is employed to reduce some of the crude domain features from the initial feature set. Since these pre-processed and reduced features are still high dimensional, the proposed MOBPSO algorithm is used for finding further feature subsets. The objective functions are suitably modeled by optimizing two conflicting objectives i.e., cardinality of feature subsets and distinctive capability of those selected subsets. As these two objective functions are conflicting in nature, they are more suitable for multi-objective modeling. The experiments are carried out on benchmark gene expression datasets, i.e., Colon, Lymphoma and Leukaemia available in literature. The performance of the selected feature subsets with their classification accuracy and validated using 10 fold cross validation techniques. A detailed comparative study is also made to show the betterment or competitiveness of the proposed algorithm. PMID:27822174

  5. The role of metalloendopeptidases in oropharyngeal carcinomas assessed by tissue microarray.

    Science.gov (United States)

    Ribeiro, Daniel A; Nascimento, Fabio D; Fracalossi, Ana Carolina C; Noguti, Juliana; Oshima, Celina T F; Ihara, Silvia S M; Franco, Marcello F

    2011-01-01

    The goal of this study was to investigate the expression of some metalloendopeptidases in squamous cell carcinomas of the oropharynx as well as its relation to histological differentiation, staging of disease, and prognosis. Paraffin blocks from 21 primary tumors were obtained from archives of the Department of Pathology, Paulista Medical School, Federal University of Sao Paulo, UNIFESP/EPM. Immunohistochemistry was used to detect the expression of EP24.15 and EP24.16 by means of tissue microarrays. Expression of EP24.15 or EP24.16 was not correlated with the stage of disease, histopathological grading or recurrence in squamous cell carcinomas of the oropharynx. In summary, our results support the notion that EP24.15 and EP24.16 are expressed in carcinoma of the oropharynx; however, these do not appear to be suitable biomarkers for histological grading, disease stage or recurrence as depicted by tissue microarrays and immunohistochemistry.

  6. cDNA microarray screening in food safety

    International Nuclear Information System (INIS)

    Roy, Sashwati; Sen, Chandan K.

    2006-01-01

    The cDNA microarray technology and related bioinformatics tools presents a wide range of novel application opportunities. The technology may be productively applied to address food safety. In this mini-review article, we present an update highlighting the late breaking discoveries that demonstrate the vitality of cDNA microarray technology as a tool to analyze food safety with reference to microbial pathogens and genetically modified foods. In order to bring the microarray technology to mainstream food safety, it is important to develop robust user-friendly tools that may be applied in a field setting. In addition, there needs to be a standardized process for regulatory agencies to interpret and act upon microarray-based data. The cDNA microarray approach is an emergent technology in diagnostics. Its values lie in being able to provide complimentary molecular insight when employed in addition to traditional tests for food safety, as part of a more comprehensive battery of tests

  7. SCK-CEN Genomic Platform: the microarray technology

    International Nuclear Information System (INIS)

    Benotmane, R.

    2006-01-01

    The human body contains approximately 10 14 cells, wherein each one is a nucleus. The nucleus contains 2x23 chromosomes, or two complete sets of the human genome, one set coming from the mother and the other from the father. In principle each set includes 30.000-40.000 genes. If the genome was a book, it would be twenty-three chapters, called chromosomes,each chapter with several thousand stories, called genes. Each story made up of paragraphs, called exons and introns. Each paragraph made up of 3 letter words, called codons. Each word is written with letters called bases (AGCT). But the whole is written in a single very long sentence, which is the DNA molecule or deoxy nucleic acid. The usual state of DNA is two complementary strands intertwined forming a double helix. In the cell, DNA is duplicated during each cell division to ensure the transmission of the genome to the daughter cells. For expression, the DNA is transcribed to messenger RNA. The RNA is edited and finally translated to a protein, each three bases coding for one amino acid. When the whole message is translated, the chain of amino acids folds itself up into a distinctive shape that depends on its sequence. Proteins are the effectors of the genes, and are responsible for all metabolic, hormonal and enzymatic reactions in the cells. The expressed RNA determines the amount of proteins to be produced and subsequently the desired effect (strong or weak) in the cell. The microarray technology aims at quantifying the amount of RNA present in the cell from each expressed gene, and at evaluating the changes of these amounts after exposure of the cell to toxic chemicals, ionising radiation or other stress components. The global picture of expressed genes helps to understand the affected genetic pathways in the cell at the molecular level. The microarray technology is used in the Radiobiology and Microbiology topics to study the effect of ionising radiation on human cells and mouse tissue, as well as the

  8. Prenatal alcohol exposure alters gene expression in the rat brain: Experimental design and bioinformatic analysis of microarray data

    Directory of Open Access Journals (Sweden)

    Alexandre A. Lussier

    2015-09-01

    Full Text Available We previously identified gene expression changes in the prefrontal cortex and hippocampus of rats prenatally exposed to alcohol under both steady-state and challenge conditions (Lussier et al., 2015, Alcohol.: Clin. Exp. Res., 39, 251–261. In this study, adult female rats from three prenatal treatment groups (ad libitum-fed control, pair-fed, and ethanol-fed were injected with physiological saline solution or complete Freund׳s adjuvant (CFA to induce arthritis (adjuvant-induced arthritis, AA. The prefrontal cortex and hippocampus were collected 16 days (peak of arthritis or 39 days (during recovery following injection, and whole genome gene expression was assayed using Illumina׳s RatRef-12 expression microarray. Here, we provide additional metadata, detailed explanations of data pre-processing steps and quality control, as well as a basic framework for the bioinformatic analyses performed. The datasets from this study are publicly available on the GEO repository (accession number GSE63561.

  9. Gene Expression Music Algorithm-Based Characterization of the Ewing Sarcoma Stem Cell Signature

    Directory of Open Access Journals (Sweden)

    Martin Sebastian Staege

    2016-01-01

    Full Text Available Gene Expression Music Algorithm (GEMusicA is a method for the transformation of DNA microarray data into melodies that can be used for the characterization of differentially expressed genes. Using this method we compared gene expression profiles from endothelial cells (EC, hematopoietic stem cells, neuronal stem cells, embryonic stem cells (ESC, and mesenchymal stem cells (MSC and defined a set of genes that can discriminate between the different stem cell types. We analyzed the behavior of public microarray data sets from Ewing sarcoma (“Ewing family tumors,” EFT cell lines and biopsies in GEMusicA after prefiltering DNA microarray data for the probe sets from the stem cell signature. Our results demonstrate that individual Ewing sarcoma cell lines have a high similarity to ESC or EC. Ewing sarcoma cell lines with inhibited Ewing sarcoma breakpoint region 1-Friend leukemia virus integration 1 (EWSR1-FLI1 oncogene retained the similarity to ESC and EC. However, correlation coefficients between GEMusicA-processed expression data between EFT and ESC decreased whereas correlation coefficients between EFT and EC as well as between EFT and MSC increased after knockdown of EWSR1-FLI1. Our data support the concept of EFT being derived from cells with features of embryonic and endothelial cells.

  10. In vivo corrosion, tumor outcome, and microarray gene expression for two types of muscle-implanted tungsten alloys

    Energy Technology Data Exchange (ETDEWEB)

    Schuster, B.E. [U.S. Army Research Laboratory, Weapons and Materials Research Directorate, B434 Mulberry Road, Aberdeen Proving Ground, MD 21005-5609 (United States); Roszell, L.E. [U.S. Army Institute of Public Health, 5158 Blackhawk Road, Aberdeen Proving Ground, MD 21010‐5403 (United States); Murr, L.E.; Ramirez, D.A. [Department of Metallurgical and Materials Engineering, University of Texas, El Paso, TX 79968 (United States); Demaree, J.D. [U.S. Army Research Laboratory, Weapons and Materials Research Directorate, B434 Mulberry Road, Aberdeen Proving Ground, MD 21005-5609 (United States); Klotz, B.R. [Dynamic Science Inc., Aberdeen Proving Ground, MD 21005‐5609 (United States); Rosencrance, A.B.; Dennis, W.E. [U.S. Army Center for Environmental Health Research, Department of Chemistry, Ft. Detrick, MD 21702‐5010 (United States); Bao, W. [SAS Institute, Inc. SAS Campus Drive, Cary, NC 27513 (United States); Perkins, E.J. [U.S. Army Engineer Research and Development Center, 3909 Hall Ferry Road, Vicksburg MS 39180 (United States); Dillman, J.F. [U.S. Army Medical Research Institute of Chemical Defense, 3100 Ricketts Point Road, Aberdeen Proving Ground, MD 21010‐5400 (United States); Bannon, D.I., E-mail: desmond.bannon@us.army.mil [U.S. Army Institute of Public Health, 5158 Blackhawk Road, Aberdeen Proving Ground, MD 21010‐5403 (United States)

    2012-11-15

    Tungsten alloys are composed of tungsten microparticles embedded in a solid matrix of transition metals such as nickel, cobalt, or iron. To understand the toxicology of these alloys, male F344 rats were intramuscularly implanted with pellets of tungsten/nickel/cobalt, tungsten/nickel/iron, or pure tungsten, with tantalum pellets as a negative control. Between 6 and 12 months, aggressive rhabdomyosarcomas formed around tungsten/nickel/cobalt pellets, while those of tungsten/nickel/iron or pure tungsten did not cause cancers. Electron microscopy showed a progressive corrosion of the matrix phase of tungsten/nickel/cobalt pellets over 6 months, accompanied by high urinary concentrations of nickel and cobalt. In contrast, non-carcinogenic tungsten/nickel/iron pellets were minimally corroded and urinary metals were low; these pellets having developed a surface oxide layer in vivo that may have restricted the mobilization of carcinogenic nickel. Microarray analysis of tumors revealed large changes in gene expression compared with normal muscle, with biological processes involving the cell cycle significantly up‐regulated and those involved with muscle development and differentiation significantly down‐regulated. Top KEGG pathways disrupted were adherens junction, p53 signaling, and the cell cycle. Chromosomal enrichment analysis of genes showed a highly significant impact at cytoband 7q22 (chromosome 7) which included mouse double minute (MDM2) and cyclin‐dependant kinase (CDK4) as well as other genes associated with human sarcomas. In conclusion, the tumorigenic potential of implanted tungsten alloys is related to mobilization of carcinogenic metals nickel and cobalt from corroding pellets, while gene expression changes in the consequent tumors are similar to radiation induced animal sarcomas as well as sporadic human sarcomas. -- Highlights: ► Tungsten/nickel/cobalt, tungsten/nickel/iron, and pure tungsten were studied. ► Male Fischer rats implanted with

  11. In vivo corrosion, tumor outcome, and microarray gene expression for two types of muscle-implanted tungsten alloys

    International Nuclear Information System (INIS)

    Schuster, B.E.; Roszell, L.E.; Murr, L.E.; Ramirez, D.A.; Demaree, J.D.; Klotz, B.R.; Rosencrance, A.B.; Dennis, W.E.; Bao, W.; Perkins, E.J.; Dillman, J.F.; Bannon, D.I.

    2012-01-01

    Tungsten alloys are composed of tungsten microparticles embedded in a solid matrix of transition metals such as nickel, cobalt, or iron. To understand the toxicology of these alloys, male F344 rats were intramuscularly implanted with pellets of tungsten/nickel/cobalt, tungsten/nickel/iron, or pure tungsten, with tantalum pellets as a negative control. Between 6 and 12 months, aggressive rhabdomyosarcomas formed around tungsten/nickel/cobalt pellets, while those of tungsten/nickel/iron or pure tungsten did not cause cancers. Electron microscopy showed a progressive corrosion of the matrix phase of tungsten/nickel/cobalt pellets over 6 months, accompanied by high urinary concentrations of nickel and cobalt. In contrast, non-carcinogenic tungsten/nickel/iron pellets were minimally corroded and urinary metals were low; these pellets having developed a surface oxide layer in vivo that may have restricted the mobilization of carcinogenic nickel. Microarray analysis of tumors revealed large changes in gene expression compared with normal muscle, with biological processes involving the cell cycle significantly up‐regulated and those involved with muscle development and differentiation significantly down‐regulated. Top KEGG pathways disrupted were adherens junction, p53 signaling, and the cell cycle. Chromosomal enrichment analysis of genes showed a highly significant impact at cytoband 7q22 (chromosome 7) which included mouse double minute (MDM2) and cyclin‐dependant kinase (CDK4) as well as other genes associated with human sarcomas. In conclusion, the tumorigenic potential of implanted tungsten alloys is related to mobilization of carcinogenic metals nickel and cobalt from corroding pellets, while gene expression changes in the consequent tumors are similar to radiation induced animal sarcomas as well as sporadic human sarcomas. -- Highlights: ► Tungsten/nickel/cobalt, tungsten/nickel/iron, and pure tungsten were studied. ► Male Fischer rats implanted with

  12. Scaling of gene expression data allowing the comparison of different gene expression platforms

    NARCIS (Netherlands)

    van Ruissen, Fred; Schaaf, Gerben J.; Kool, Marcel; Baas, Frank; Ruijter, Jan M.

    2008-01-01

    Serial analysis of gene expression (SAGE) and microarrays have found a widespread application, but much ambiguity exists regarding the amalgamation of the data resulting from these technologies. Cross-platform utilization of gene expression data from the SAGE and microarray technology could reduce

  13. Microarray-based apoptosis gene screening technique in trichostatin A-induced drug-resisted lung cancer A549/CDDP cells

    Directory of Open Access Journals (Sweden)

    Ya-jun WANG

    2016-09-01

    Full Text Available Objective  To detect the expression profile changes of apoptosis-related genes in trichostatin A (TSA-induced drug-resisted lung cancer cells A549/CDDP by microarray, in order to screen the target genes in TSA treating cisplatin-resisted lung cancer. Methods  A549/CDDP cells were treated by TSA for 24 hours. Total RNA was extracted and reversely transcribed into cDNA. Gene expression levels were detected by the NimbleGen whole genome microarray. Differences of expression profiles between TSA-treated and control group were measured by NimbleScan 2.5 software and GO analysis. Apoptosis and proliferation related genes were screened from the expression changed genes. Results  Compared with the control group, 85 apoptosis-related genes were up-regulated and 43 growth or proliferation related genes were down-regulated in the TSA-treated group. GO analysis showed that the functions of these genes are mainly regulating apoptosis, cell resistance to chem ical stimuli protein, as well as regulating cell growth, proliferation and the biological process of maintaining the cell biological quality. TSA-activated not only the mitochondrial apoptotic pathways, but also the death receptor related apoptosis pathway, and down-regulated the drug resistance related genes BAG3 and ABCC2. Conclusion  TSA may cause the expression changes of apoptotic and proliferation genes in A549/CDDP cells, these genes may play a role in TSA treating cisplatin-resisted lung cancer. DOI: 10.11855/j.issn.0577-7402.2016.08.07

  14. MULTI-K: accurate classification of microarray subtypes using ensemble k-means clustering

    Directory of Open Access Journals (Sweden)

    Ashlock Daniel

    2009-08-01

    Full Text Available Abstract Background Uncovering subtypes of disease from microarray samples has important clinical implications such as survival time and sensitivity of individual patients to specific therapies. Unsupervised clustering methods have been used to classify this type of data. However, most existing methods focus on clusters with compact shapes and do not reflect the geometric complexity of the high dimensional microarray clusters, which limits their performance. Results We present a cluster-number-based ensemble clustering algorithm, called MULTI-K, for microarray sample classification, which demonstrates remarkable accuracy. The method amalgamates multiple k-means runs by varying the number of clusters and identifies clusters that manifest the most robust co-memberships of elements. In addition to the original algorithm, we newly devised the entropy-plot to control the separation of singletons or small clusters. MULTI-K, unlike the simple k-means or other widely used methods, was able to capture clusters with complex and high-dimensional structures accurately. MULTI-K outperformed other methods including a recently developed ensemble clustering algorithm in tests with five simulated and eight real gene-expression data sets. Conclusion The geometric complexity of clusters should be taken into account for accurate classification of microarray data, and ensemble clustering applied to the number of clusters tackles the problem very well. The C++ code and the data sets tested are available from the authors.

  15. MULTI-K: accurate classification of microarray subtypes using ensemble k-means clustering.

    Science.gov (United States)

    Kim, Eun-Youn; Kim, Seon-Young; Ashlock, Daniel; Nam, Dougu

    2009-08-22

    Uncovering subtypes of disease from microarray samples has important clinical implications such as survival time and sensitivity of individual patients to specific therapies. Unsupervised clustering methods have been used to classify this type of data. However, most existing methods focus on clusters with compact shapes and do not reflect the geometric complexity of the high dimensional microarray clusters, which limits their performance. We present a cluster-number-based ensemble clustering algorithm, called MULTI-K, for microarray sample classification, which demonstrates remarkable accuracy. The method amalgamates multiple k-means runs by varying the number of clusters and identifies clusters that manifest the most robust co-memberships of elements. In addition to the original algorithm, we newly devised the entropy-plot to control the separation of singletons or small clusters. MULTI-K, unlike the simple k-means or other widely used methods, was able to capture clusters with complex and high-dimensional structures accurately. MULTI-K outperformed other methods including a recently developed ensemble clustering algorithm in tests with five simulated and eight real gene-expression data sets. The geometric complexity of clusters should be taken into account for accurate classification of microarray data, and ensemble clustering applied to the number of clusters tackles the problem very well. The C++ code and the data sets tested are available from the authors.

  16. Short-term arginine deprivation results in large-scale modulation of hepatic gene expression in both normal and tumor cells: microarray bioinformatic analysis

    Directory of Open Access Journals (Sweden)

    Sabo Edmond

    2006-09-01

    Full Text Available Abstract Background We have reported arginine-sensitive regulation of LAT1 amino acid transporter (SLC 7A5 in normal rodent hepatic cells with loss of arginine sensitivity and high level constitutive expression in tumor cells. We hypothesized that liver cell gene expression is highly sensitive to alterations in the amino acid microenvironment and that tumor cells may differ substantially in gene sets sensitive to amino acid availability. To assess the potential number and classes of hepatic genes sensitive to arginine availability at the RNA level and compare these between normal and tumor cells, we used an Affymetrix microarray approach, a paired in vitro model of normal rat hepatic cells and a tumorigenic derivative with triplicate independent replicates. Cells were exposed to arginine-deficient or control conditions for 18 hours in medium formulated to maintain differentiated function. Results Initial two-way analysis with a p-value of 0.05 identified 1419 genes in normal cells versus 2175 in tumor cells whose expression was altered in arginine-deficient conditions relative to controls, representing 9–14% of the rat genome. More stringent bioinformatic analysis with 9-way comparisons and a minimum of 2-fold variation narrowed this set to 56 arginine-responsive genes in normal liver cells and 162 in tumor cells. Approximately half the arginine-responsive genes in normal cells overlap with those in tumor cells. Of these, the majority was increased in expression and included multiple growth, survival, and stress-related genes. GADD45, TA1/LAT1, and caspases 11 and 12 were among this group. Previously known amino acid regulated genes were among the pool in both cell types. Available cDNA probes allowed independent validation of microarray data for multiple genes. Among genes downregulated under arginine-deficient conditions were multiple genes involved in cholesterol and fatty acid metabolism. Expression of low-density lipoprotein receptor was

  17. Replicate high-density rat genome oligonucleotide microarrays reveal hundreds of regulated genes in the dorsal root ganglion after peripheral nerve injury.

    Directory of Open Access Journals (Sweden)

    Mannion James W

    2002-10-01

    Full Text Available Abstract Background Rat oligonucleotide microarrays were used to detect changes in gene expression in the dorsal root ganglion (DRG 3 days following sciatic nerve transection (axotomy. Two comparisons were made using two sets of triplicate microarrays, naïve versus naïve and naïve versus axotomy. Results Microarray variability was assessed using the naïve versus naïve comparison. These results support use of a P 1.5-fold expression change and P 1.5-fold and P in situ hybridization verified the expression of 24 transcripts. These data showed an 83% concordance rate with the arrays; most mismatches represent genes with low expression levels reflecting limits of array sensitivity. A significant correlation was found between actual mRNA differences and relative changes between microarrays (r2 = 0.8567. Temporal patterns of individual genes regulation varied. Conclusions We identify parameters for microarray analysis which reduce error while identifying many putatively regulated genes. Functional classification of these genes suggest reorganization of cell structural components, activation of genes expressed by immune and inflammatory cells and down-regulation of genes involved in neurotransmission.

  18. Microarray data and gene expression statistics for Saccharomyces cerevisiae exposed to simulated asbestos mine drainage

    Directory of Open Access Journals (Sweden)

    Heather E. Driscoll

    2017-08-01

    Full Text Available Here we describe microarray expression data (raw and normalized, experimental metadata, and gene-level data with expression statistics from Saccharomyces cerevisiae exposed to simulated asbestos mine drainage from the Vermont Asbestos Group (VAG Mine on Belvidere Mountain in northern Vermont, USA. For nearly 100 years (between the late 1890s and 1993, chrysotile asbestos fibers were extracted from serpentinized ultramafic rock at the VAG Mine for use in construction and manufacturing industries. Studies have shown that water courses and streambeds nearby have become contaminated with asbestos mine tailings runoff, including elevated levels of magnesium, nickel, chromium, and arsenic, elevated pH, and chrysotile asbestos-laden mine tailings, due to leaching and gradual erosion of massive piles of mine waste covering approximately 9 km2. We exposed yeast to simulated VAG Mine tailings leachate to help gain insight on how eukaryotic cells exposed to VAG Mine drainage may respond in the mine environment. Affymetrix GeneChip® Yeast Genome 2.0 Arrays were utilized to assess gene expression after 24-h exposure to simulated VAG Mine tailings runoff. The chemistry of mine-tailings leachate, mine-tailings leachate plus yeast extract peptone dextrose media, and control yeast extract peptone dextrose media is also reported. To our knowledge this is the first dataset to assess global gene expression patterns in a eukaryotic model system simulating asbestos mine tailings runoff exposure. Raw and normalized gene expression data are accessible through the National Center for Biotechnology Information Gene Expression Omnibus (NCBI GEO Database Series GSE89875 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE89875.

  19. Application of Microarray technology in research and diagnostics

    DEFF Research Database (Denmark)

    Helweg-Larsen, Rehannah Borup

    The overall purpose of this thesis is to evaluate the use of microarray analysis to investigate the transcriptome of human cancers and human follicular cells and define the correlation between expression of human genes and specific cancer types as well as the developmental competence of the oocyte...

  20. Advanced spot quality analysis in two-colour microarray experiments

    Directory of Open Access Journals (Sweden)

    Vetter Guillaume

    2008-09-01

    Full Text Available Abstract Background Image analysis of microarrays and, in particular, spot quantification and spot quality control, is one of the most important steps in statistical analysis of microarray data. Recent methods of spot quality control are still in early age of development, often leading to underestimation of true positive microarray features and, consequently, to loss of important biological information. Therefore, improving and standardizing the statistical approaches of spot quality control are essential to facilitate the overall analysis of microarray data and subsequent extraction of biological information. Findings We evaluated the performance of two image analysis packages MAIA and GenePix (GP using two complementary experimental approaches with a focus on the statistical analysis of spot quality factors. First, we developed control microarrays with a priori known fluorescence ratios to verify the accuracy and precision of the ratio estimation of signal intensities. Next, we developed advanced semi-automatic protocols of spot quality evaluation in MAIA and GP and compared their performance with available facilities of spot quantitative filtering in GP. We evaluated these algorithms for standardised spot quality analysis in a whole-genome microarray experiment assessing well-characterised transcriptional modifications induced by the transcription regulator SNAI1. Using a set of RT-PCR or qRT-PCR validated microarray data, we found that the semi-automatic protocol of spot quality control we developed with MAIA allowed recovering approximately 13% more spots and 38% more differentially expressed genes (at FDR = 5% than GP with default spot filtering conditions. Conclusion Careful control of spot quality characteristics with advanced spot quality evaluation can significantly increase the amount of confident and accurate data resulting in more meaningful biological conclusions.

  1. Gene Expression Profile in the Early Stage of Angiotensin II-induced Cardiac Remodeling: a Time Series Microarray Study in a Mouse Model

    Directory of Open Access Journals (Sweden)

    Meng-Qiu Dang

    2015-01-01

    Full Text Available Background/Aims: Angiotensin II (Ang II plays a critical role in the cardiac remodeling contributing to heart failure. However, the gene expression profiles induced by Ang II in the early stage of cardiac remodeling remain unknown. Methods: Wild-type male mice (C57BL/6 background, 10-weeek-old were infused with Ang II (1500 ng/kg/min for 7 days. Blood pressure was measured. Cardiac function and remodeling were examined by echocardiography, H&E and Masson staining. The time series microarrays were then conducted to detected gene expression profiles. Results: Microarray results identified that 1,489 genes were differentially expressed in the hearts at day 1, 3 and 7 of Ang II injection. These genes were further classified into 26 profiles by hierarchical cluster analysis. Of them, 4 profiles were significant (No. 19, 8, 21 and 22 and contained 904 genes. Gene Ontology showed that these genes mainly participate in metabolic process, oxidation-reduction process, extracellular matrix organization, apoptotic process, immune response, and others. Significant pathways included focal adhesion, ECM-receptor interaction, cytokine-cytokine receptor interaction, MAPK and insulin signaling pathways, which were known to play important roles in Ang II-induced cardiac remodeling. Moreover, gene co-expression networks analysis suggested that serine/cysteine peptidase inhibitor, member 1 (Serpine1, also known as PAI-1 localized in the core of the network. Conclusions: Our results indicate that many genes are mainly involved in metabolism, inflammation, cardiac fibrosis and hypertrophy. Serpine1 may play a central role in the development of Ang II-induced cardiac remodeling at the early stage.

  2. A signature-based method for indexing cell cycle phase distribution from microarray profiles

    Directory of Open Access Journals (Sweden)

    Mizuno Hideaki

    2009-03-01

    Full Text Available Abstract Background The cell cycle machinery interprets oncogenic signals and reflects the biology of cancers. To date, various methods for cell cycle phase estimation such as mitotic index, S phase fraction, and immunohistochemistry have provided valuable information on cancers (e.g. proliferation rate. However, those methods rely on one or few measurements and the scope of the information is limited. There is a need for more systematic cell cycle analysis methods. Results We developed a signature-based method for indexing cell cycle phase distribution from microarray profiles under consideration of cycling and non-cycling cells. A cell cycle signature masterset, composed of genes which express preferentially in cycling cells and in a cell cycle-regulated manner, was created to index the proportion of cycling cells in the sample. Cell cycle signature subsets, composed of genes whose expressions peak at specific stages of the cell cycle, were also created to index the proportion of cells in the corresponding stages. The method was validated using cell cycle datasets and quiescence-induced cell datasets. Analyses of a mouse tumor model dataset and human breast cancer datasets revealed variations in the proportion of cycling cells. When the influence of non-cycling cells was taken into account, "buried" cell cycle phase distributions were depicted that were oncogenic-event specific in the mouse tumor model dataset and were associated with patients' prognosis in the human breast cancer datasets. Conclusion The signature-based cell cycle analysis method presented in this report, would potentially be of value for cancer characterization and diagnostics.

  3. Applications of nanotechnology, next generation sequencing and microarrays in biomedical research.

    Science.gov (United States)

    Elingaramil, Sauli; Li, Xiaolong; He, Nongyue

    2013-07-01

    Next-generation sequencing technologies, microarrays and advances in bio nanotechnology have had an enormous impact on research within a short time frame. This impact appears certain to increase further as many biomedical institutions are now acquiring these prevailing new technologies. Beyond conventional sampling of genome content, wide-ranging applications are rapidly evolving for next-generation sequencing, microarrays and nanotechnology. To date, these technologies have been applied in a variety of contexts, including whole-genome sequencing, targeted re sequencing and discovery of transcription factor binding sites, noncoding RNA expression profiling and molecular diagnostics. This paper thus discusses current applications of nanotechnology, next-generation sequencing technologies and microarrays in biomedical research and highlights the transforming potential these technologies offer.

  4. Advanced Data Mining of Leukemia Cells Micro-Arrays

    Directory of Open Access Journals (Sweden)

    Richard S. Segall

    2009-12-01

    Full Text Available This paper provides continuation and extensions of previous research by Segall and Pierce (2009a that discussed data mining for micro-array databases of Leukemia cells for primarily self-organized maps (SOM. As Segall and Pierce (2009a and Segall and Pierce (2009b the results of applying data mining are shown and discussed for the data categories of microarray databases of HL60, Jurkat, NB4 and U937 Leukemia cells that are also described in this article. First, a background section is provided on the work of others pertaining to the applications of data mining to micro-array databases of Leukemia cells and micro-array databases in general. As noted in predecessor article by Segall and Pierce (2009a, micro-array databases are one of the most popular functional genomics tools in use today. This research in this paper is intended to use advanced data mining technologies for better interpretations and knowledge discovery as generated by the patterns of gene expressions of HL60, Jurkat, NB4 and U937 Leukemia cells. The advanced data mining performed entailed using other data mining tools such as cubic clustering criterion, variable importance rankings, decision trees, and more detailed examinations of data mining statistics and study of other self-organized maps (SOM clustering regions of workspace as generated by SAS Enterprise Miner version 4. Conclusions and future directions of the research are also presented.

  5. Prognostic meta-signature of breast cancer developed by two-stage mixture modeling of microarray data

    Directory of Open Access Journals (Sweden)

    Ghosh Debashis

    2004-12-01

    Full Text Available Abstract Background An increasing number of studies have profiled tumor specimens using distinct microarray platforms and analysis techniques. With the accumulating amount of microarray data, one of the most intriguing yet challenging tasks is to develop robust statistical models to integrate the findings. Results By applying a two-stage Bayesian mixture modeling strategy, we were able to assimilate and analyze four independent microarray studies to derive an inter-study validated "meta-signature" associated with breast cancer prognosis. Combining multiple studies (n = 305 samples on a common probability scale, we developed a 90-gene meta-signature, which strongly associated with survival in breast cancer patients. Given the set of independent studies using different microarray platforms which included spotted cDNAs, Affymetrix GeneChip, and inkjet oligonucleotides, the individually identified classifiers yielded gene sets predictive of survival in each study cohort. The study-specific gene signatures, however, had minimal overlap with each other, and performed poorly in pairwise cross-validation. The meta-signature, on the other hand, accommodated such heterogeneity and achieved comparable or better prognostic performance when compared with the individual signatures. Further by comparing to a global standardization method, the mixture model based data transformation demonstrated superior properties for data integration and provided solid basis for building classifiers at the second stage. Functional annotation revealed that genes involved in cell cycle and signal transduction activities were over-represented in the meta-signature. Conclusion The mixture modeling approach unifies disparate gene expression data on a common probability scale allowing for robust, inter-study validated prognostic signatures to be obtained. With the emerging utility of microarrays for cancer prognosis, it will be important to establish paradigms to meta

  6. Robust embryo identification using first polar body single nucleotide polymorphism microarray-based DNA fingerprinting.

    Science.gov (United States)

    Treff, Nathan R; Su, Jing; Kasabwala, Natasha; Tao, Xin; Miller, Kathleen A; Scott, Richard T

    2010-05-01

    This study sought to validate a novel, minimally invasive system for embryo tracking by single nucleotide polymorphism microarray-based DNA fingerprinting of the first polar body. First polar body-based assignments of which embryos implanted and were delivered after multiple ET were 100% consistent with previously validated embryo DNA fingerprinting-based assignments. Copyright 2010 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.

  7. A cluster merging method for time series microarray with production values.

    Science.gov (United States)

    Chira, Camelia; Sedano, Javier; Camara, Monica; Prieto, Carlos; Villar, Jose R; Corchado, Emilio

    2014-09-01

    A challenging task in time-course microarray data analysis is to cluster genes meaningfully combining the information provided by multiple replicates covering the same key time points. This paper proposes a novel cluster merging method to accomplish this goal obtaining groups with highly correlated genes. The main idea behind the proposed method is to generate a clustering starting from groups created based on individual temporal series (representing different biological replicates measured in the same time points) and merging them by taking into account the frequency by which two genes are assembled together in each clustering. The gene groups at the level of individual time series are generated using several shape-based clustering methods. This study is focused on a real-world time series microarray task with the aim to find co-expressed genes related to the production and growth of a certain bacteria. The shape-based clustering methods used at the level of individual time series rely on identifying similar gene expression patterns over time which, in some models, are further matched to the pattern of production/growth. The proposed cluster merging method is able to produce meaningful gene groups which can be naturally ranked by the level of agreement on the clustering among individual time series. The list of clusters and genes is further sorted based on the information correlation coefficient and new problem-specific relevant measures. Computational experiments and results of the cluster merging method are analyzed from a biological perspective and further compared with the clustering generated based on the mean value of time series and the same shape-based algorithm.

  8. Generalized Correlation Coefficient for Non-Parametric Analysis of Microarray Time-Course Data.

    Science.gov (United States)

    Tan, Qihua; Thomassen, Mads; Burton, Mark; Mose, Kristian Fredløv; Andersen, Klaus Ejner; Hjelmborg, Jacob; Kruse, Torben

    2017-06-06

    Modeling complex time-course patterns is a challenging issue in microarray study due to complex gene expression patterns in response to the time-course experiment. We introduce the generalized correlation coefficient and propose a combinatory approach for detecting, testing and clustering the heterogeneous time-course gene expression patterns. Application of the method identified nonlinear time-course patterns in high agreement with parametric analysis. We conclude that the non-parametric nature in the generalized correlation analysis could be an useful and efficient tool for analyzing microarray time-course data and for exploring the complex relationships in the omics data for studying their association with disease and health.

  9. Multiclass classification for skin cancer profiling based on the integration of heterogeneous gene expression series.

    Science.gov (United States)

    Gálvez, Juan Manuel; Castillo, Daniel; Herrera, Luis Javier; San Román, Belén; Valenzuela, Olga; Ortuño, Francisco Manuel; Rojas, Ignacio

    2018-01-01

    Most of the research studies developed applying microarray technology to the characterization of different pathological states of any disease may fail in reaching statistically significant results. This is largely due to the small repertoire of analysed samples, and to the limitation in the number of states or pathologies usually addressed. Moreover, the influence of potential deviations on the gene expression quantification is usually disregarded. In spite of the continuous changes in omic sciences, reflected for instance in the emergence of new Next-Generation Sequencing-related technologies, the existing availability of a vast amount of gene expression microarray datasets should be properly exploited. Therefore, this work proposes a novel methodological approach involving the integration of several heterogeneous skin cancer series, and a later multiclass classifier design. This approach is thus a way to provide the clinicians with an intelligent diagnosis support tool based on the use of a robust set of selected biomarkers, which simultaneously distinguishes among different cancer-related skin states. To achieve this, a multi-platform combination of microarray datasets from Affymetrix and Illumina manufacturers was carried out. This integration is expected to strengthen the statistical robustness of the study as well as the finding of highly-reliable skin cancer biomarkers. Specifically, the designed operation pipeline has allowed the identification of a small subset of 17 differentially expressed genes (DEGs) from which to distinguish among 7 involved skin states. These genes were obtained from the assessment of a number of potential batch effects on the gene expression data. The biological interpretation of these genes was inspected in the specific literature to understand their underlying information in relation to skin cancer. Finally, in order to assess their possible effectiveness in cancer diagnosis, a cross-validation Support Vector Machines (SVM)-based

  10. Bacterial identification and subtyping using DNA microarray and DNA sequencing.

    Science.gov (United States)

    Al-Khaldi, Sufian F; Mossoba, Magdi M; Allard, Marc M; Lienau, E Kurt; Brown, Eric D

    2012-01-01

    The era of fast and accurate discovery of biological sequence motifs in prokaryotic and eukaryotic cells is here. The co-evolution of direct genome sequencing and DNA microarray strategies not only will identify, isotype, and serotype pathogenic bacteria, but also it will aid in the discovery of new gene functions by detecting gene expressions in different diseases and environmental conditions. Microarray bacterial identification has made great advances in working with pure and mixed bacterial samples. The technological advances have moved beyond bacterial gene expression to include bacterial identification and isotyping. Application of new tools such as mid-infrared chemical imaging improves detection of hybridization in DNA microarrays. The research in this field is promising and future work will reveal the potential of infrared technology in bacterial identification. On the other hand, DNA sequencing by using 454 pyrosequencing is so cost effective that the promise of $1,000 per bacterial genome sequence is becoming a reality. Pyrosequencing technology is a simple to use technique that can produce accurate and quantitative analysis of DNA sequences with a great speed. The deposition of massive amounts of bacterial genomic information in databanks is creating fingerprint phylogenetic analysis that will ultimately replace several technologies such as Pulsed Field Gel Electrophoresis. In this chapter, we will review (1) the use of DNA microarray using fluorescence and infrared imaging detection for identification of pathogenic bacteria, and (2) use of pyrosequencing in DNA cluster analysis to fingerprint bacterial phylogenetic trees.

  11. Differential gene expression from genome-wide microarray analyses distinguishes Lohmann Selected Leghorn and Lohmann Brown layers.

    Directory of Open Access Journals (Sweden)

    Christin Habig

    Full Text Available The Lohmann Selected Leghorn (LSL and Lohmann Brown (LB layer lines have been selected for high egg production since more than 50 years and belong to the worldwide leading commercial layer lines. The objectives of the present study were to characterize the molecular processes that are different among these two layer lines using whole genome RNA expression profiles. The hens were kept in the newly developed small group housing system Eurovent German with two different group sizes. Differential expression was observed for 6,276 microarray probes (FDR adjusted P-value <0.05 among the two layer lines LSL and LB. A 2-fold or greater change in gene expression was identified on 151 probe sets. In LSL, 72 of the 151 probe sets were up- and 79 of them were down-regulated. Gene ontology (GO enrichment analysis accounting for biological processes evinced 18 GO-terms for the 72 probe sets with higher expression in LSL, especially those taking part in immune system processes and membrane organization. A total of 32 enriched GO-terms were determined among the 79 down-regulated probe sets of LSL. Particularly, these terms included phosphorus metabolic processes and signaling pathways. In conclusion, the phenotypic differences among the two layer lines LSL and LB are clearly reflected in their gene expression profiles of the cerebrum. These novel findings provide clues for genes involved in economically important line characteristics of commercial laying hens.

  12. A cell spot microarray method for production of high density siRNA transfection microarrays

    Directory of Open Access Journals (Sweden)

    Mpindi John-Patrick

    2011-03-01

    Full Text Available Abstract Background High-throughput RNAi screening is widely applied in biological research, but remains expensive, infrastructure-intensive and conversion of many assays to HTS applications in microplate format is not feasible. Results Here, we describe the optimization of a miniaturized cell spot microarray (CSMA method, which facilitates utilization of the transfection microarray technique for disparate RNAi analyses. To promote rapid adaptation of the method, the concept has been tested with a panel of 92 adherent cell types, including primary human cells. We demonstrate the method in the systematic screening of 492 GPCR coding genes for impact on growth and survival of cultured human prostate cancer cells. Conclusions The CSMA method facilitates reproducible preparation of highly parallel cell microarrays for large-scale gene knockdown analyses. This will be critical towards expanding the cell based functional genetic screens to include more RNAi constructs, allow combinatorial RNAi analyses, multi-parametric phenotypic readouts or comparative analysis of many different cell types.

  13. A novel approach to select differential pathways associated with hypertrophic cardiomyopathy based on gene co‑expression analysis.

    Science.gov (United States)

    Chen, Xiao-Min; Feng, Ming-Jun; Shen, Cai-Jie; He, Bin; Du, Xian-Feng; Yu, Yi-Bo; Liu, Jing; Chu, Hui-Min

    2017-07-01

    The present study was designed to develop a novel method for identifying significant pathways associated with human hypertrophic cardiomyopathy (HCM), based on gene co‑expression analysis. The microarray dataset associated with HCM (E‑GEOD‑36961) was obtained from the European Molecular Biology Laboratory‑European Bioinformatics Institute database. Informative pathways were selected based on the Reactome pathway database and screening treatments. An empirical Bayes method was utilized to construct co‑expression networks for informative pathways, and a weight value was assigned to each pathway. Differential pathways were extracted based on weight threshold, which was calculated using a random model. In order to assess whether the co‑expression method was feasible, it was compared with traditional pathway enrichment analysis of differentially expressed genes, which were identified using the significance analysis of microarrays package. A total of 1,074 informative pathways were screened out for subsequent investigations and their weight values were also obtained. According to the threshold of weight value of 0.01057, 447 differential pathways, including folding of actin by chaperonin containing T‑complex protein 1 (CCT)/T‑complex protein 1 ring complex (TRiC), purine ribonucleoside monophosphate biosynthesis and ubiquinol biosynthesis, were obtained. Compared with traditional pathway enrichment analysis, the number of pathways obtained from the co‑expression approach was increased. The results of the present study demonstrated that this method may be useful to predict marker pathways for HCM. The pathways of folding of actin by CCT/TRiC and purine ribonucleoside monophosphate biosynthesis may provide evidence of the underlying molecular mechanisms of HCM, and offer novel therapeutic directions for HCM.

  14. DNA microarray global gene expression analysis of influenza virus-infected chicken and duck cells

    Directory of Open Access Journals (Sweden)

    Suresh V. Kuchipudi

    2015-06-01

    Full Text Available The data described in this article pertain to the article by Kuchipudi et al. (2014 titled “Highly Pathogenic Avian Influenza Virus Infection in Chickens But Not Ducks Is Associated with Elevated Host Immune and Pro-inflammatory Responses” [1]. While infection of chickens with highly pathogenic avian influenza (HPAI H5N1 virus subtypes often leads to 100% mortality within 1 to 2 days, infection of ducks in contrast causes mild or no clinical signs. The rapid onset of fatal disease in chickens, but with no evidence of severe clinical symptoms in ducks, suggests underlying differences in their innate immune mechanisms. We used Chicken Genechip microarrays (Affymetrix to analyse the gene expression profiles of primary chicken and duck lung cells infected with a low pathogenic avian influenza (LPAI H2N3 virus and two HPAI H5N1 virus subtypes to understand the molecular basis of host susceptibility and resistance in chickens and ducks. Here, we described the experimental design, quality control and analysis that were performed on the data set. The data are publicly available through the Gene Expression Omnibus (GEOdatabase with accession number GSE33389, and the analysis and interpretation of these data are included in Kuchipudi et al. (2014 [1].

  15. Simulation of microarray data with realistic characteristics

    Directory of Open Access Journals (Sweden)

    Lehmussola Antti

    2006-07-01

    Full Text Available Abstract Background Microarray technologies have become common tools in biological research. As a result, a need for effective computational methods for data analysis has emerged. Numerous different algorithms have been proposed for analyzing the data. However, an objective evaluation of the proposed algorithms is not possible due to the lack of biological ground truth information. To overcome this fundamental problem, the use of simulated microarray data for algorithm validation has been proposed. Results We present a microarray simulation model which can be used to validate different kinds of data analysis algorithms. The proposed model is unique in the sense that it includes all the steps that affect the quality of real microarray data. These steps include the simulation of biological ground truth data, applying biological and measurement technology specific error models, and finally simulating the microarray slide manufacturing and hybridization. After all these steps are taken into account, the simulated data has realistic biological and statistical characteristics. The applicability of the proposed model is demonstrated by several examples. Conclusion The proposed microarray simulation model is modular and can be used in different kinds of applications. It includes several error models that have been proposed earlier and it can be used with different types of input data. The model can be used to simulate both spotted two-channel and oligonucleotide based single-channel microarrays. All this makes the model a valuable tool for example in validation of data analysis algorithms.

  16. Development and application of a microarray meter tool to optimize microarray experiments

    Directory of Open Access Journals (Sweden)

    Rouse Richard JD

    2008-07-01

    Full Text Available Abstract Background Successful microarray experimentation requires a complex interplay between the slide chemistry, the printing pins, the nucleic acid probes and targets, and the hybridization milieu. Optimization of these parameters and a careful evaluation of emerging slide chemistries are a prerequisite to any large scale array fabrication effort. We have developed a 'microarray meter' tool which assesses the inherent variations associated with microarray measurement prior to embarking on large scale projects. Findings The microarray meter consists of nucleic acid targets (reference and dynamic range control and probe components. Different plate designs containing identical probe material were formulated to accommodate different robotic and pin designs. We examined the variability in probe quality and quantity (as judged by the amount of DNA printed and remaining post-hybridization using three robots equipped with capillary printing pins. Discussion The generation of microarray data with minimal variation requires consistent quality control of the (DNA microarray manufacturing and experimental processes. Spot reproducibility is a measure primarily of the variations associated with printing. The microarray meter assesses array quality by measuring the DNA content for every feature. It provides a post-hybridization analysis of array quality by scoring probe performance using three metrics, a a measure of variability in the signal intensities, b a measure of the signal dynamic range and c a measure of variability of the spot morphologies.

  17. Discovery of distinctive gene expression profiles in rheumatoid synovium using cDNA microarray technology: evidence for the existence of multiple pathways of tissue destruction and repair.

    NARCIS (Netherlands)

    Kraan, TC van der Pouw; Gaalen, van FA; Huizinga, T.W.; Pieterman, E; Breedveld, F.C.; Verweij, C.L.

    2003-01-01

    Rheumatoid arthritis (RA) is a heterogeneous disease. We used cDNA microarray technology to subclassify RA patients and disclose disease pathways in rheumatoid synovium. Hierarchical clustering of gene expression data identified two main groups of tissues (RA-I and RA-II). A total of 121 genes were

  18. Microarray-Based Identification of Transcription Factor Target Genes

    NARCIS (Netherlands)

    Gorte, M.; Horstman, A.; Page, R.B.; Heidstra, R.; Stromberg, A.; Boutilier, K.A.

    2011-01-01

    Microarray analysis is widely used to identify transcriptional changes associated with genetic perturbation or signaling events. Here we describe its application in the identification of plant transcription factor target genes with emphasis on the design of suitable DNA constructs for controlling TF

  19. Increased Inhibitor of Differentiation 4 (Id4 Expression in Glioblastoma: A Tissue Microarray Study

    Directory of Open Access Journals (Sweden)

    Weifin Zeng, Elisabeth J. Rushing, Daniel P. Hartmann, Norio Azumi

    2010-01-01

    Full Text Available Background: The inhibitor of differentiation/DNA binding protein family (Id1-4 is involved in cell cycle control, tumorigenesis and angiogenesis through the negative regulation of helix-loop-helix transcription factors. Of these proteins, Id4 is known to play an important role in neural stem cell differentiation, and deregulation has been implicated in glial neoplasia. However, the expression and significance of Id4 in astrocytomas has not been fully addressed. Herein we report the differential expression of Id4 in astrocytomas of various grades using tissue microarrays (TMA and immunohistochemistry (IHC. Design: The GBM TMA was constructed from 53 archival cases at Georgetown University Hospital and a TMA with normal brain controls and grades II-III astrocytoma was obtained from Cybrdi (Rockville, MD. TMA sections were stained with Id4 antibody and the slides were scored according to the percentage of staining astrocytic nuclei (<9% -, 10-50% +, >51% ++. The Fisher Exact test was used to test for statistical significance. Results: Nuclear staining for Id4 was seen in 73.58% GBMs, 25% grade III, and 12.5% grade II astrocytomas; staining was absent in normal brain tissue. There was a statistically significant difference between GBM and grades II, III astrocytoma (p <0.01. Significant Id4 expression was not detected in normal brain. Conclusions: Our study confirms the frequent upregulation of Id4 expression in GBM, which lends support to its role in tumorigenesis, possibly in the transformation of low to high-grade astrocytoma (i.e. GBM. Further studies are warranted to determine the precise role of Id4 in glial neoplasia and its potential use in targeted therapy for GBM.

  20. Novel statistical framework to identify differentially expressed genes allowing transcriptomic background differences.

    Science.gov (United States)

    Ling, Zhi-Qiang; Wang, Yi; Mukaisho, Kenichi; Hattori, Takanori; Tatsuta, Takeshi; Ge, Ming-Hua; Jin, Li; Mao, Wei-Min; Sugihara, Hiroyuki

    2010-06-01

    Tests of differentially expressed genes (DEGs) from microarray experiments are based on the null hypothesis that genes that are irrelevant to the phenotype/stimulus are expressed equally in the target and control samples. However, this strict hypothesis is not always true, as there can be several transcriptomic background differences between target and control samples, including different cell/tissue types, different cell cycle stages and different biological donors. These differences lead to increased false positives, which have little biological/medical significance. In this article, we propose a statistical framework to identify DEGs between target and control samples from expression microarray data allowing transcriptomic background differences between these samples by introducing a modified null hypothesis that the gene expression background difference is normally distributed. We use an iterative procedure to perform robust estimation of the null hypothesis and identify DEGs as outliers. We evaluated our method using our own triplicate microarray experiment, followed by validations with reverse transcription-polymerase chain reaction (RT-PCR) and on the MicroArray Quality Control dataset. The evaluations suggest that our technique (i) results in less false positive and false negative results, as measured by the degree of agreement with RT-PCR of the same samples, (ii) can be applied to different microarray platforms and results in better reproducibility as measured by the degree of DEG identification concordance both intra- and inter-platforms and (iii) can be applied efficiently with only a few microarray replicates. Based on these evaluations, we propose that this method not only identifies more reliable and biologically/medically significant DEG, but also reduces the power-cost tradeoff problem in the microarray field. Source code and binaries freely available for download at http://comonca.org.cn/fdca/resources/softwares/deg.zip.

  1. Expression Profiling of Tyrosine Kinase Genes

    National Research Council Canada - National Science Library

    Weier, Heinz

    2000-01-01

    ... of these genes parallels the progression of tumors to a more malignant phenotype. We developed a DNA micro-array based screening system to monitor the level of expression of tyrosine kinase (tk...

  2. Gametogenesis in the Pacific oyster Crassostrea gigas: a microarrays-based analysis identifies sex and stage specific genes.

    Directory of Open Access Journals (Sweden)

    Nolwenn M Dheilly

    Full Text Available BACKGROUND: The Pacific oyster Crassostrea gigas (Mollusca, Lophotrochozoa is an alternative and irregular protandrous hermaphrodite: most individuals mature first as males and then change sex several times. Little is known about genetic and phenotypic basis of sex differentiation in oysters, and little more about the molecular pathways regulating reproduction. We have recently developed and validated a microarray containing 31,918 oligomers (Dheilly et al., 2011 representing the oyster transcriptome. The application of this microarray to the study of mollusk gametogenesis should provide a better understanding of the key factors involved in sex differentiation and the regulation of oyster reproduction. METHODOLOGY/PRINCIPAL FINDINGS: Gene expression was studied in gonads of oysters cultured over a yearly reproductive cycle. Principal component analysis and hierarchical clustering showed a significant divergence in gene expression patterns of males and females coinciding with the start of gonial mitosis. ANOVA analysis of the data revealed 2,482 genes differentially expressed during the course of males and/or females gametogenesis. The expression of 434 genes could be localized in either germ cells or somatic cells of the gonad by comparing the transcriptome of female gonads to the transcriptome of stripped oocytes and somatic tissues. Analysis of the annotated genes revealed conserved molecular mechanisms between mollusks and mammals: genes involved in chromatin condensation, DNA replication and repair, mitosis and meiosis regulation, transcription, translation and apoptosis were expressed in both male and female gonads. Most interestingly, early expressed male-specific genes included bindin and a dpy-30 homolog and female-specific genes included foxL2, nanos homolog 3, a pancreatic lipase related protein, cd63 and vitellogenin. Further functional analyses are now required in order to investigate their role in sex differentiation in oysters

  3. Design issues in toxicogenomics using DNA microarray experiment

    International Nuclear Information System (INIS)

    Lee, Kyoung-Mu; Kim, Ju-Han; Kang, Daehee

    2005-01-01

    The methods of toxicogenomics might be classified into omics study (e.g., genomics, proteomics, and metabolomics) and population study focusing on risk assessment and gene-environment interaction. In omics study, microarray is the most popular approach. Genes falling into several categories (e.g., xenobiotics metabolism, cell cycle control, DNA repair etc.) can be selected up to 20,000 according to a priori hypothesis. The appropriate type of samples and species should be selected in advance. Multiple doses and varied exposure durations are suggested to identify those genes clearly linked to toxic response. Microarray experiments can be affected by numerous nuisance variables including experimental designs, sample extraction, type of scanners, etc. The number of slides might be determined from the magnitude and variance of expression change, false-positive rate, and desired power. Instead, pooling samples is an alternative. Online databases on chemicals with known exposure-disease outcomes and genetic information can aid the interpretation of the normalized results. Gene function can be inferred from microarray data analyzed by bioinformatics methods such as cluster analysis. The population study often adopts hospital-based or nested case-control design. Biases in subject selection and exposure assessment should be minimized, and confounding bias should also be controlled for in stratified or multiple regression analysis. Optimal sample sizes are dependent on the statistical test for gene-to-environment or gene-to-gene interaction. The design issues addressed in this mini-review are crucial in conducting toxicogenomics study. In addition, integrative approach of exposure assessment, epidemiology, and clinical trial is required

  4. Improved precision and accuracy for microarrays using updated probe set definitions

    Directory of Open Access Journals (Sweden)

    Larsson Ola

    2007-02-01

    Full Text Available Abstract Background Microarrays enable high throughput detection of transcript expression levels. Different investigators have recently introduced updated probe set definitions to more accurately map probes to our current knowledge of genes and transcripts. Results We demonstrate that updated probe set definitions provide both better precision and accuracy in probe set estimates compared to the original Affymetrix definitions. We show that the improved precision mainly depends on the increased number of probes that are integrated into each probe set, but we also demonstrate an improvement when the same number of probes is used. Conclusion Updated probe set definitions does not only offer expression levels that are more accurately associated to genes and transcripts but also improvements in the estimated transcript expression levels. These results give support for the use of updated probe set definitions for analysis and meta-analysis of microarray data.

  5. AN IMPROVED FUZZY CLUSTERING ALGORITHM FOR MICROARRAY IMAGE SPOTS SEGMENTATION

    Directory of Open Access Journals (Sweden)

    V.G. Biju

    2015-11-01

    Full Text Available An automatic cDNA microarray image processing using an improved fuzzy clustering algorithm is presented in this paper. The spot segmentation algorithm proposed uses the gridding technique developed by the authors earlier, for finding the co-ordinates of each spot in an image. Automatic cropping of spots from microarray image is done using these co-ordinates. The present paper proposes an improved fuzzy clustering algorithm Possibility fuzzy local information c means (PFLICM to segment the spot foreground (FG from background (BG. The PFLICM improves fuzzy local information c means (FLICM algorithm by incorporating typicality of a pixel along with gray level information and local spatial information. The performance of the algorithm is validated using a set of simulated cDNA microarray images added with different levels of AWGN noise. The strength of the algorithm is tested by computing the parameters such as the Segmentation matching factor (SMF, Probability of error (pe, Discrepancy distance (D and Normal mean square error (NMSE. SMF value obtained for PFLICM algorithm shows an improvement of 0.9 % and 0.7 % for high noise and low noise microarray images respectively compared to FLICM algorithm. The PFLICM algorithm is also applied on real microarray images and gene expression values are computed.

  6. Macrophage Gene Expression Associated with Remodeling of the Prepartum Rat Cervix: Microarray and Pathway Analyses

    Science.gov (United States)

    Dobyns, Abigail E.; Goyal, Ravi; Carpenter, Lauren Grisham; Freeman, Tom C.; Longo, Lawrence D.; Yellon, Steven M.

    2015-01-01

    As the critical gatekeeper for birth, prepartum remodeling of the cervix is associated with increased resident macrophages (Mφ), proinflammatory processes, and extracellular matrix degradation. This study tested the hypothesis that expression of genes unique to Mφs characterizes the prepartum from unremodeled nonpregnant cervix. Perfused cervix from prepartum day 21 postbreeding (D21) or nonpregnant (NP) rats, with or without Mφs, had RNA extracted and whole genome microarray analysis performed. By subtractive analyses, expression of 194 and 120 genes related to Mφs in the cervix from D21 rats were increased and decreased, respectively. In both D21 and NP groups, 158 and 57 Mφ genes were also more or less up- or down-regulated, respectively. Mφ gene expression patterns were most strongly correlated within groups and in 5 major clustering patterns. In the cervix from D21 rats, functional categories and canonical pathways of increased expression by Mφ gene related to extracellular matrix, cell proliferation, differentiation, as well as cell signaling. Pathways were characteristic of inflammation and wound healing, e.g., CD163, CD206, and CCR2. Signatures of only inflammation pathways, e.g., CSF1R, EMR1, and MMP12 were common to both D21 and NP groups. Thus, a novel and complex balance of Mφ genes and clusters differentiated the degraded extracellular matrix and cellular genomic activities in the cervix before birth from the unremodeled state. Predicted Mφ activities, pathways, and networks raise the possibility that expression patterns of specific genes characterize and promote prepartum remodeling of the cervix for parturition at term and with preterm labor. PMID:25811906

  7. Microarray-based DNA methylation study of Ewing's sarcoma of the bone.

    Science.gov (United States)

    Park, Hye-Rim; Jung, Woon-Won; Kim, Hyun-Sook; Park, Yong-Koo

    2014-10-01

    Alterations in DNA methylation patterns are a hallmark of malignancy. However, the majority of epigenetic studies of Ewing's sarcoma have focused on the analysis of only a few candidate genes. Comprehensive studies are thus lacking and are required. The aim of the present study was to identify novel methylation markers in Ewing's sarcoma using microarray analysis. The current study reports the microarray-based DNA methylation study of 1,505 CpG sites of 807 cancer-related genes from 69 Ewing's sarcoma samples. The Illumina GoldenGate Methylation Cancer Panel I microarray was used, and with the appropriate controls (n=14), a total of 92 hypermethylated genes were identified in the Ewing's sarcoma samples. The majority of the hypermethylated genes were associated with cell adhesion, cell regulation, development and signal transduction. The overall methylation mean values were compared between patients who survived and those that did not. The overall methylation mean was significantly higher in the patients who did not survive (0.25±0.03) than in those who did (0.22±0.05) (P=0.0322). However, the overall methylation mean was not found to significantly correlate with age, gender or tumor location. GDF10 , OSM , APC and HOXA11 were the most significant differentially-methylated genes, however, their methylation levels were not found to significantly correlate with the survival rate. The DNA methylation profile of Ewing's sarcoma was characterized and 92 genes that were significantly hypermethylated were detected. A trend towards a more aggressive behavior was identified in the methylated group. The results of this study indicated that methylation may be significant in the development of Ewing's sarcoma.

  8. BioconductorBuntu: a Linux distribution that implements a web-based DNA microarray analysis server.

    Science.gov (United States)

    Geeleher, Paul; Morris, Dermot; Hinde, John P; Golden, Aaron

    2009-06-01

    BioconductorBuntu is a custom distribution of Ubuntu Linux that automatically installs a server-side microarray processing environment, providing a user-friendly web-based GUI to many of the tools developed by the Bioconductor Project, accessible locally or across a network. System installation is via booting off a CD image or by using a Debian package provided to upgrade an existing Ubuntu installation. In its current version, several microarray analysis pipelines are supported including oligonucleotide, dual-or single-dye experiments, including post-processing with Gene Set Enrichment Analysis. BioconductorBuntu is designed to be extensible, by server-side integration of further relevant Bioconductor modules as required, facilitated by its straightforward underlying Python-based infrastructure. BioconductorBuntu offers an ideal environment for the development of processing procedures to facilitate the analysis of next-generation sequencing datasets. BioconductorBuntu is available for download under a creative commons license along with additional documentation and a tutorial from (http://bioinf.nuigalway.ie).

  9. DNA microarray analyses reveal a post-irradiation differential time-dependent gene expression profile in yeast cells exposed to X-rays and gamma-rays.

    Science.gov (United States)

    Kimura, Shinzo; Ishidou, Emi; Kurita, Sakiko; Suzuki, Yoshiteru; Shibato, Junko; Rakwal, Randeep; Iwahashi, Hitoshi

    2006-07-21

    Ionizing radiation (IR) is the most enigmatic of genotoxic stress inducers in our environment that has been around from the eons of time. IR is generally considered harmful, and has been the subject of numerous studies, mostly looking at the DNA damaging effects in cells and the repair mechanisms therein. Moreover, few studies have focused on large-scale identification of cellular responses to IR, and to this end, we describe here an initial study on the transcriptional responses of the unicellular genome model, yeast (Saccharomyces cerevisiae strain S288C), by cDNA microarray. The effect of two different IR, X-rays, and gamma (gamma)-rays, was investigated by irradiating the yeast cells cultured in YPD medium with 50 Gy doses of X- and gamma-rays, followed by resuspension of the cells in YPD for time-course experiments. The samples were collected for microarray analysis at 20, 40, and 80 min after irradiation. Microarray analysis revealed a time-course transcriptional profile of changed gene expressions. Up-regulated genes belonged to the functional categories mainly related to cell cycle and DNA processing, cell rescue defense and virulence, protein and cell fate, and metabolism (X- and gamma-rays). Similarly, for X- and gamma-rays, the down-regulated genes belonged to mostly transcription and protein synthesis, cell cycle and DNA processing, control of cellular organization, cell fate, and C-compound and carbohydrate metabolism categories, respectively. This study provides for the first time a snapshot of the genome-wide mRNA expression profiles in X- and gamma-ray post-irradiated yeast cells and comparatively interprets/discusses the changed gene functional categories as effects of these two radiations vis-à-vis their energy levels.

  10. Microarray gene expression profiling and analysis in renal cell carcinoma

    Directory of Open Access Journals (Sweden)

    Sadhukhan Provash

    2004-06-01

    Full Text Available Abstract Background Renal cell carcinoma (RCC is the most common cancer in adult kidney. The accuracy of current diagnosis and prognosis of the disease and the effectiveness of the treatment for the disease are limited by the poor understanding of the disease at the molecular level. To better understand the genetics and biology of RCC, we profiled the expression of 7,129 genes in both clear cell RCC tissue and cell lines using oligonucleotide arrays. Methods Total RNAs isolated from renal cell tumors, adjacent normal tissue and metastatic RCC cell lines were hybridized to affymatrix HuFL oligonucleotide arrays. Genes were categorized into different functional groups based on the description of the Gene Ontology Consortium and analyzed based on the gene expression levels. Gene expression profiles of the tissue and cell line samples were visualized and classified by singular value decomposition. Reverse transcription polymerase chain reaction was performed to confirm the expression alterations of selected genes in RCC. Results Selected genes were annotated based on biological processes and clustered into functional groups. The expression levels of genes in each group were also analyzed. Seventy-four commonly differentially expressed genes with more than five-fold changes in RCC tissues were identified. The expression alterations of selected genes from these seventy-four genes were further verified using reverse transcription polymerase chain reaction (RT-PCR. Detailed comparison of gene expression patterns in RCC tissue and RCC cell lines shows significant differences between the two types of samples, but many important expression patterns were preserved. Conclusions This is one of the initial studies that examine the functional ontology of a large number of genes in RCC. Extensive annotation, clustering and analysis of a large number of genes based on the gene functional ontology revealed many interesting gene expression patterns in RCC. Most

  11. NM23 protein expression in colorectal carcinoma using TMA (tissue microarray: association with metastases and survival

    Directory of Open Access Journals (Sweden)

    Levindo Alves de Oliveira

    2010-12-01

    Full Text Available CONTEXT: NM23, a metastasis suppressor gene, may be associated with prognosis in patients with colorectal carcinoma. OBJECTIVE: To analyze NM23 expression and its association with the presence of lymph node and liver metastases and survival in patients operated on for colorectal carcinoma. METHODS: One hundred thirty patients operated on for colorectal carcinoma were investigated. Tissue microarray blocks containing neoplastic tissue and tumor-adjacent non-neoplastic mucosa were obtained and analyzed by immunohistochemical staining using a monoclonal anti-NM23 antibody. Immunohistochemical expression was assessed using a semiquantitative scoring method, counting the percentage of stained cells. The results were compared regarding morphological and histological characteristics of the colorectal carcinoma, presence of lymph node and liver metastases, tumor staging, and patient survival. Statistical analysis was performed using the Mann-Whitney test, the Kruskal-Wallis test and Fisher's exact test. Survival analysis was performed using the Kaplan-Meier method and the log-rank test. RESULTS: NM23 expression was higher in colorectal carcinoma tissue than in adjacent non-neoplastic mucosa (P<0.0001. NM23 protein expression did not correlate with degree of cell differentiation (P = 0.57, vascular invasion (P = 0.85, lymphatic invasion (P = 0.41, perineural infiltration (P = 0.46, staging (P = 0.19, lymph node metastases (P = 0.08, or liver metastases (P = 0.59. Disease-free survival showed significant association (P = 0.01 with the intensity of NM23 protein immunohistochemical expression in colorectal carcinoma tissue, whereas overall survival showed no association with NM23 protein expression (P = 0.13. CONCLUSIONS: NM23 protein expression was higher in neoplastic colorectal carcinoma tissue than in adjacent non-neoplastic mucosa, showing no correlation with morphological aspects, presence of lymph node or liver metastases, colorectal carcinoma

  12. A dynamic bead-based microarray for parallel DNA detection

    International Nuclear Information System (INIS)

    Sochol, R D; Lin, L; Casavant, B P; Dueck, M E; Lee, L P

    2011-01-01

    A microfluidic system has been designed and constructed by means of micromachining processes to integrate both microfluidic mixing of mobile microbeads and hydrodynamic microbead arraying capabilities on a single chip to simultaneously detect multiple bio-molecules. The prototype system has four parallel reaction chambers, which include microchannels of 18 × 50 µm 2 cross-sectional area and a microfluidic mixing section of 22 cm length. Parallel detection of multiple DNA oligonucleotide sequences was achieved via molecular beacon probes immobilized on polystyrene microbeads of 16 µm diameter. Experimental results show quantitative detection of three distinct DNA oligonucleotide sequences from the Hepatitis C viral (HCV) genome with single base-pair mismatch specificity. Our dynamic bead-based microarray offers an effective microfluidic platform to increase parallelization of reactions and improve microbead handling for various biological applications, including bio-molecule detection, medical diagnostics and drug screening

  13. Classification of Dukes' B and C colorectal cancers using expression arrays

    DEFF Research Database (Denmark)

    Frederiksen, C.M.; Knudsen, Steen; Laurberg, S.

    2003-01-01

    Purpose. Colorectal cancer is one of the most common malignancies. Substaging of the cancer is of importance not only to prognosis but also to treatment. Classification of substages based on DNA microarray technology is currently the most promising approach. We therefore investigated if gene...... expression microarrays could be used to classify colorectal tumors. Methods. We used the Affymetrix oligonucleotide arrays to analyze the expression of more than 5,000 genes in samples from the sigmoid and upper rectum of the left colon. Five samples were from normal mucosa and five samples from each...... expression of one of the most common malignancies, colorectal cancer, now seems to be within reach. The data indicates that it is possible at least to classify Dukes' B and C colorectal tumors with microarrays....

  14. Moving Toward Integrating Gene Expression Profiling into High-throughput Testing:A Gene Expression Biomarker Accurately Predicts Estrogen Receptor α Modulation in a Microarray Compendium

    Science.gov (United States)

    Microarray profiling of chemical-induced effects is being increasingly used in medium and high-throughput formats. In this study, we describe computational methods to identify molecular targets from whole-genome microarray data using as an example the estrogen receptor α (ERα), ...

  15. Immunohistochemistry - Microarray Analysis of Patients with Peritoneal Metastases of Appendiceal or Colorectal Origin

    Directory of Open Access Journals (Sweden)

    Danielle E Green

    2015-01-01

    Full Text Available BackgroundThe value of immunohistochemistry (IHC-microarray analysis of pathological specimens in the management of patients is controversial although preliminary data suggests potential benefit. We describe the characteristics of patients undergoing a commercially available IHC-microarray method in patients with peritoneal metastases (PM and the feasibility of this technique in this population.MethodsWe retrospectively analyzed consecutive patients with pathologically confirmed PM from appendiceal or colorectal primary who underwent Caris Molecular IntelligenceTM testing. IHC, microarray, FISH and mutational analysis were included and stratified by PCI score, histology and treatment characteristics. Statistical analysis was performed using non-parametric tests.ResultsOur study included 5 patients with appendiceal and 11 with colorectal PM. The median age of patients was 51 (IQR 39-65 years, with 11(68% female. The median PCI score of the patients was 17(IQR 10-25. Hyperthermic intra-peritoneal chemoperfusion (HIPEC was performed in 4 (80% patients with appendiceal primary tumors and 4 (36% with colorectal primary. KRAS mutations were encountered in 40% of appendiceal vs. 30% colorectal tumors, while BRAF mutations were seen in 40% of colorectal PM and none of the patients with appendiceal PM (p=0.06. IHC biomarker expression was not significantly different between the two primaries. Sufficient tumor for microarray analysis was found in 44% (n=7 patients, which was not associated with previous use of chemotherapy (p>0.20 for 5-FU/LV, Irinotecan and Oxaliplatin.ConclusionsIn a small sample of patients with peritoneal metastases, the feasibility and results of IHC-microarray staining based on a commercially available test is reported. The apparent high incidence of the BRAF mutation in patients with PM may potentially offer opportunities for novel therapeutics and suggest that IHC-microarray is a method that can be used in this population.

  16. Identification of potential biomarkers from microarray experiments using multiple criteria optimization

    International Nuclear Information System (INIS)

    Sánchez-Peña, Matilde L; Isaza, Clara E; Pérez-Morales, Jaileene; Rodríguez-Padilla, Cristina; Castro, José M; Cabrera-Ríos, Mauricio

    2013-01-01

    Microarray experiments are capable of determining the relative expression of tens of thousands of genes simultaneously, thus resulting in very large databases. The analysis of these databases and the extraction of biologically relevant knowledge from them are challenging tasks. The identification of potential cancer biomarker genes is one of the most important aims for microarray analysis and, as such, has been widely targeted in the literature. However, identifying a set of these genes consistently across different experiments, researches, microarray platforms, or cancer types is still an elusive endeavor. Besides the inherent difficulty of the large and nonconstant variability in these experiments and the incommensurability between different microarray technologies, there is the issue of the users having to adjust a series of parameters that significantly affect the outcome of the analyses and that do not have a biological or medical meaning. In this study, the identification of potential cancer biomarkers from microarray data is casted as a multiple criteria optimization (MCO) problem. The efficient solutions to this problem, found here through data envelopment analysis (DEA), are associated to genes that are proposed as potential cancer biomarkers. The method does not require any parameter adjustment by the user, and thus fosters repeatability. The approach also allows the analysis of different microarray experiments, microarray platforms, and cancer types simultaneously. The results include the analysis of three publicly available microarray databases related to cervix cancer. This study points to the feasibility of modeling the selection of potential cancer biomarkers from microarray data as an MCO problem and solve it using DEA. Using MCO entails a new optic to the identification of potential cancer biomarkers as it does not require the definition of a threshold value to establish significance for a particular gene and the selection of a normalization

  17. Patterns of gene expression in carp liver after exposure to a mixture of waterborne and dietary cadmium using a custom-made microarray

    International Nuclear Information System (INIS)

    Reynders, Hans; Ven, Karlijn van der; Moens, Lotte N.; Remortel, Piet van; De Coen, Wim M.; Blust, Ronny

    2006-01-01

    Gene expression changes in carp liver tissue were studied after acute (3 and 24 h) and subchronic (7 and 28 days) exposure to a mixture of waterborne (9, 105 and 480 μg/l) and dietary (9.5, 122 and 144 μg/g) cadmium, using a custom-made microarray. Suppression subtractive hybridization-PCR (SSH-PCR) was applied to isolate a set of 643 liver genes, involved in multiple biological pathways, such as energy metabolism (e.g. glucokinase), immune response (e.g. complement C3) and stress and detoxification (e.g. cytochrome P450 2F2, glutathione-S-transferase pi). These genes were subsequently spotted on glass-slides for the construction of a custom-made microarray. Resulting microarray hybridizations indicated a highly dynamic response to cadmium exposure. At low exposure concentrations (9 μg/l through water and 9.5 μg/g dry weight through food) mostly energy-related genes (e.g. glucokinase, elastase) were influenced, while a general stress response was obvious through induction of several stress-related genes, including hemopexin and cytochrome P450 2F2, at high cadmium concentrations. In addition, fish exposed to the highest cadmium concentrations showed liver damage after 7 days of exposure, as measured by elevated alanine transaminase activity in plasma and increased liver water content (wet-to-dry weight ratio). Moreover, decreased hematocrit and growth were found at the end of the experiment. Altogether this study clearly demonstrated the importance of varying exposure conditions for the characterization of the molecular impact of cadmium and showed that microarray results can provide important information, required to unravel the molecular events and responses related to cadmium exposure

  18. [Saccharomyces boulardii reduced intestinal inflammation in mice model of 2,4,6-trinitrobencene sulfonic acid induced colitis: based on microarray].

    Science.gov (United States)

    Lee, Sang Kil; Kim, Hyo Jong; Chi, Sung Gil

    2010-01-01

    Saccharomyces boulardii has been reported to be beneficial in the treatment of inflammatory bowel disease. The aim of this work was to evaluate the effect of S. boulardii in a mice model of 2,4,6-trinitrobencene sulfonic acid (TNBS) induced colitis and analyze the expression of genes in S. boulardii treated mice by microarray. BALB/c mice received TNBS or TNBS and S. boulardii treatment for 4 days. Microarray was performed on total mRNA form colon, and histologic evaluation was also performed. In mice treated with S. boulardii, the histological appearance and mortality rate were significantly restored compared with rats receiving only TNBS. Among 330 genes which were altered by both S. boulardii and TNBS (>2 folds), 193 genes were down-regulated by S. boulardii in microarray. Most of genes which were down-regulated by S. bouardii were functionally classified as inflammatory and immune response related genes. S. boulardii may reduce colonic inflammation along with regulation of inflammatory and immune responsive genes in TNBS-induced colitis.

  19. β-empirical Bayes inference and model diagnosis of microarray data

    Directory of Open Access Journals (Sweden)

    Hossain Mollah Mohammad

    2012-06-01

    Full Text Available Abstract Background Microarray data enables the high-throughput survey of mRNA expression profiles at the genomic level; however, the data presents a challenging statistical problem because of the large number of transcripts with small sample sizes that are obtained. To reduce the dimensionality, various Bayesian or empirical Bayes hierarchical models have been developed. However, because of the complexity of the microarray data, no model can explain the data fully. It is generally difficult to scrutinize the irregular patterns of expression that are not expected by the usual statistical gene by gene models. Results As an extension of empirical Bayes (EB procedures, we have developed the β-empirical Bayes (β-EB approach based on a β-likelihood measure which can be regarded as an ’evidence-based’ weighted (quasi- likelihood inference. The weight of a transcript t is described as a power function of its likelihood, fβ(yt|θ. Genes with low likelihoods have unexpected expression patterns and low weights. By assigning low weights to outliers, the inference becomes robust. The value of β, which controls the balance between the robustness and efficiency, is selected by maximizing the predictive β0-likelihood by cross-validation. The proposed β-EB approach identified six significant (p−5 contaminated transcripts as differentially expressed (DE in normal/tumor tissues from the head and neck of cancer patients. These six genes were all confirmed to be related to cancer; they were not identified as DE genes by the classical EB approach. When applied to the eQTL analysis of Arabidopsis thaliana, the proposed β-EB approach identified some potential master regulators that were missed by the EB approach. Conclusions The simulation data and real gene expression data showed that the proposed β-EB method was robust against outliers. The distribution of the weights was used to scrutinize the irregular patterns of expression and diagnose the model

  20. Construction and evaluation of yeast expression networks by database-guided predictions

    Directory of Open Access Journals (Sweden)

    Katharina Papsdorf

    2016-05-01

    Full Text Available DNA-Microarrays are powerful tools to obtain expression data on the genome-wide scale. We performed microarray experiments to elucidate the transcriptional networks, which are up- or down-regulated in response to the expression of toxic polyglutamine proteins in yeast. Such experiments initially generate hit lists containing differentially expressed genes. To look into transcriptional responses, we constructed networks from these genes. We therefore developed an algorithm, which is capable of dealing with very small numbers of microarrays by clustering the hits based on co-regulatory relationships obtained from the SPELL database. Here, we evaluate this algorithm according to several criteria and further develop its statistical capabilities. Initially, we define how the number of SPELL-derived co-regulated genes and the number of input hits influences the quality of the networks. We then show the ability of our networks to accurately predict further differentially expressed genes. Including these predicted genes into the networks improves the network quality and allows quantifying the predictive strength of the networks based on a newly implemented scoring method. We find that this approach is useful for our own experimental data sets and also for many other data sets which we tested from the SPELL microarray database. Furthermore, the clusters obtained by the described algorithm greatly improve the assignment to biological processes and transcription factors for the individual clusters. Thus, the described clustering approach, which will be available through the ClusterEx web interface, and the evaluation parameters derived from it represent valuable tools for the fast and informative analysis of yeast microarray data.

  1. Literature-aided meta-analysis of microarray data: a compendium study on muscle development and disease

    Directory of Open Access Journals (Sweden)

    van Ommen Gert-Jan B

    2008-06-01

    Full Text Available Abstract Background Comparative analysis of expression microarray studies is difficult due to the large influence of technical factors on experimental outcome. Still, the identified differentially expressed genes may hint at the same biological processes. However, manually curated assignment of genes to biological processes, such as pursued by the Gene Ontology (GO consortium, is incomplete and limited. We hypothesised that automatic association of genes with biological processes through thesaurus-controlled mining of Medline abstracts would be more effective. Therefore, we developed a novel algorithm (LAMA: Literature-Aided Meta-Analysis to quantify the similarity between transcriptomics studies. We evaluated our algorithm on a large compendium of 102 microarray studies published in the field of muscle development and disease, and compared it to similarity measures based on gene overlap and over-representation of biological processes assigned by GO. Results While the overlap in both genes and overrepresented GO-terms was poor, LAMA retrieved many more biologically meaningful links between studies, with substantially lower influence of technical factors. LAMA correctly grouped muscular dystrophy, regeneration and myositis studies, and linked patient and corresponding mouse model studies. LAMA also retrieves the connecting biological concepts. Among other new discoveries, we associated cullin proteins, a class of ubiquitinylation proteins, with genes down-regulated during muscle regeneration, whereas ubiquitinylation was previously reported to be activated during the inverse process: muscle atrophy. Conclusion Our literature-based association analysis is capable of finding hidden common biological denominators in microarray studies, and circumvents the need for raw data analysis or curated gene annotation databases.

  2. PIIKA 2: an expanded, web-based platform for analysis of kinome microarray data.

    Directory of Open Access Journals (Sweden)

    Brett Trost

    Full Text Available Kinome microarrays are comprised of peptides that act as phosphorylation targets for protein kinases. This platform is growing in popularity due to its ability to measure phosphorylation-mediated cellular signaling in a high-throughput manner. While software for analyzing data from DNA microarrays has also been used for kinome arrays, differences between the two technologies and associated biologies previously led us to develop Platform for Intelligent, Integrated Kinome Analysis (PIIKA, a software tool customized for the analysis of data from kinome arrays. Here, we report the development of PIIKA 2, a significantly improved version with new features and improvements in the areas of clustering, statistical analysis, and data visualization. Among other additions to the original PIIKA, PIIKA 2 now allows the user to: evaluate statistically how well groups of samples cluster together; identify sets of peptides that have consistent phosphorylation patterns among groups of samples; perform hierarchical clustering analysis with bootstrapping; view false negative probabilities and positive and negative predictive values for t-tests between pairs of samples; easily assess experimental reproducibility; and visualize the data using volcano plots, scatterplots, and interactive three-dimensional principal component analyses. Also new in PIIKA 2 is a web-based interface, which allows users unfamiliar with command-line tools to easily provide input and download the results. Collectively, the additions and improvements described here enhance both the breadth and depth of analyses available, simplify the user interface, and make the software an even more valuable tool for the analysis of kinome microarray data. Both the web-based and stand-alone versions of PIIKA 2 can be accessed via http://saphire.usask.ca.

  3. PIIKA 2: an expanded, web-based platform for analysis of kinome microarray data.

    Science.gov (United States)

    Trost, Brett; Kindrachuk, Jason; Määttänen, Pekka; Napper, Scott; Kusalik, Anthony

    2013-01-01

    Kinome microarrays are comprised of peptides that act as phosphorylation targets for protein kinases. This platform is growing in popularity due to its ability to measure phosphorylation-mediated cellular signaling in a high-throughput manner. While software for analyzing data from DNA microarrays has also been used for kinome arrays, differences between the two technologies and associated biologies previously led us to develop Platform for Intelligent, Integrated Kinome Analysis (PIIKA), a software tool customized for the analysis of data from kinome arrays. Here, we report the development of PIIKA 2, a significantly improved version with new features and improvements in the areas of clustering, statistical analysis, and data visualization. Among other additions to the original PIIKA, PIIKA 2 now allows the user to: evaluate statistically how well groups of samples cluster together; identify sets of peptides that have consistent phosphorylation patterns among groups of samples; perform hierarchical clustering analysis with bootstrapping; view false negative probabilities and positive and negative predictive values for t-tests between pairs of samples; easily assess experimental reproducibility; and visualize the data using volcano plots, scatterplots, and interactive three-dimensional principal component analyses. Also new in PIIKA 2 is a web-based interface, which allows users unfamiliar with command-line tools to easily provide input and download the results. Collectively, the additions and improvements described here enhance both the breadth and depth of analyses available, simplify the user interface, and make the software an even more valuable tool for the analysis of kinome microarray data. Both the web-based and stand-alone versions of PIIKA 2 can be accessed via http://saphire.usask.ca.

  4. Detection of perturbation phases and developmental stages in organisms from DNA microarray time series data.

    Directory of Open Access Journals (Sweden)

    Marianne Rooman

    Full Text Available Available DNA microarray time series that record gene expression along the developmental stages of multicellular eukaryotes, or in unicellular organisms subject to external perturbations such as stress and diauxie, are analyzed. By pairwise comparison of the gene expression profiles on the basis of a translation-invariant and scale-invariant distance measure corresponding to least-rectangle regression, it is shown that peaks in the average distance values are noticeable and are localized around specific time points. These points systematically coincide with the transition points between developmental phases or just follow the external perturbations. This approach can thus be used to identify automatically, from microarray time series alone, the presence of external perturbations or the succession of developmental stages in arbitrary cell systems. Moreover, our results show that there is a striking similarity between the gene expression responses to these a priori very different phenomena. In contrast, the cell cycle does not involve a perturbation-like phase, but rather continuous gene expression remodeling. Similar analyses were conducted using three other standard distance measures, showing that the one we introduced was superior. Based on these findings, we set up an adapted clustering method that uses this distance measure and classifies the genes on the basis of their expression profiles within each developmental stage or between perturbation phases.

  5. Design, construction and validation of a Plasmodium vivax microarray for the transcriptome profiling of clinical isolates

    KAUST Repository

    Boopathi, Pon Arunachalam

    2016-10-09

    High density oligonucleotide microarrays have been used on Plasmodium vivax field isolates to estimate whole genome expression. However, no microarray platform has been experimentally optimized for studying the transcriptome of field isolates. In the present study, we adopted both bioinformatics and experimental testing approaches to select best optimized probes suitable for detecting parasite transcripts from field samples and included them in designing a custom 15K P. vivax microarray. This microarray has long oligonucleotide probes (60 mer) that were in-situ synthesized onto glass slides using Agilent SurePrint technology and has been developed into an 8X15K format (8 identical arrays on a single slide). Probes in this array were experimentally validated and represents 4180 P. vivax genes in sense orientation, of which 1219 genes have also probes in antisense orientation. Validation of the 15K array by using field samples (n =14) has shown 99% of parasite transcript detection from any of the samples. Correlation analysis between duplicate probes (n = 85) present in the arrays showed perfect correlation (r(2) = 0.98) indicating the reproducibility. Multiple probes representing the same gene exhibited similar kind of expression pattern across the samples (positive correlation, r >= 0.6). Comparison of hybridization data with the previous studies and quantitative real-time PCR experiments were performed to highlight the microarray validation procedure. This array is unique in its design, and results indicate that the array is sensitive and reproducible. Hence, this microarray could be a valuable functional genomics tool to generate reliable expression data from P. vivax field isolates. (C) 2016 Published by Elsevier B.V.

  6. Design, construction and validation of a Plasmodium vivax microarray for the transcriptome profiling of clinical isolates

    KAUST Repository

    Boopathi, Pon Arunachalam; Subudhi, Amit; Middha, Sheetal; Acharya, Jyoti; Mugasimangalam, Raja Chinnadurai; Kochar, Sanjay Kumar; Kochar, Dhanpat Kumar; Das, Ashis

    2016-01-01

    High density oligonucleotide microarrays have been used on Plasmodium vivax field isolates to estimate whole genome expression. However, no microarray platform has been experimentally optimized for studying the transcriptome of field isolates. In the present study, we adopted both bioinformatics and experimental testing approaches to select best optimized probes suitable for detecting parasite transcripts from field samples and included them in designing a custom 15K P. vivax microarray. This microarray has long oligonucleotide probes (60 mer) that were in-situ synthesized onto glass slides using Agilent SurePrint technology and has been developed into an 8X15K format (8 identical arrays on a single slide). Probes in this array were experimentally validated and represents 4180 P. vivax genes in sense orientation, of which 1219 genes have also probes in antisense orientation. Validation of the 15K array by using field samples (n =14) has shown 99% of parasite transcript detection from any of the samples. Correlation analysis between duplicate probes (n = 85) present in the arrays showed perfect correlation (r(2) = 0.98) indicating the reproducibility. Multiple probes representing the same gene exhibited similar kind of expression pattern across the samples (positive correlation, r >= 0.6). Comparison of hybridization data with the previous studies and quantitative real-time PCR experiments were performed to highlight the microarray validation procedure. This array is unique in its design, and results indicate that the array is sensitive and reproducible. Hence, this microarray could be a valuable functional genomics tool to generate reliable expression data from P. vivax field isolates. (C) 2016 Published by Elsevier B.V.

  7. Reconstructing the temporal ordering of biological samples using microarray data.

    Science.gov (United States)

    Magwene, Paul M; Lizardi, Paul; Kim, Junhyong

    2003-05-01

    Accurate time series for biological processes are difficult to estimate due to problems of synchronization, temporal sampling and rate heterogeneity. Methods are needed that can utilize multi-dimensional data, such as those resulting from DNA microarray experiments, in order to reconstruct time series from unordered or poorly ordered sets of observations. We present a set of algorithms for estimating temporal orderings from unordered sets of sample elements. The techniques we describe are based on modifications of a minimum-spanning tree calculated from a weighted, undirected graph. We demonstrate the efficacy of our approach by applying these techniques to an artificial data set as well as several gene expression data sets derived from DNA microarray experiments. In addition to estimating orderings, the techniques we describe also provide useful heuristics for assessing relevant properties of sample datasets such as noise and sampling intensity, and we show how a data structure called a PQ-tree can be used to represent uncertainty in a reconstructed ordering. Academic implementations of the ordering algorithms are available as source code (in the programming language Python) on our web site, along with documentation on their use. The artificial 'jelly roll' data set upon which the algorithm was tested is also available from this web site. The publicly available gene expression data may be found at http://genome-www.stanford.edu/cellcycle/ and http://caulobacter.stanford.edu/CellCycle/.

  8. Isolation of Microarray-Grade Total RNA, MicroRNA, and DNA from a Single PAXgene Blood RNA Tube

    DEFF Research Database (Denmark)

    Kruhøffer, Mogens; Andersen, Lars Dyrskjøt; Voss, Thorsten

    2007-01-01

    We have developed a procedure for isolation of microRNA and genomic DNA in addition to total RNA from whole blood stabilized in PAXgene Blood RNA tubes. The procedure is based on automatic extraction on a BioRobot MDx and includes isolation of DNA from a fraction of the stabilized blood...... and recovery of small RNA species that are otherwise lost. The procedure presented here is suitable for large-scale experiments and is amenable to further automation. Procured total RNA and DNA was tested using Affymetrix Expression and single-nucleotide polymorphism GeneChips, respectively, and isolated micro......RNA was tested using spotted locked nucleic acid-based microarrays. We conclude that the yield and quality of total RNA, microRNA, and DNA from a single PAXgene blood RNA tube is sufficient for downstream microarray analysis....

  9. Genome-Wide Screening of Genes Showing Altered Expression in Liver Metastases of Human Colorectal Cancers by cDNA Microarray

    Directory of Open Access Journals (Sweden)

    Rempei Yanagawa

    2001-01-01

    Full Text Available In spite of intensive and increasingly successful attempts to determine the multiple steps involved in colorectal carcinogenesis, the mechanisms responsible for metastasis of colorectal tumors to the liver remain to be clarified. To identify genes that are candidates for involvement in the metastatic process, we analyzed genome-wide expression profiles of 10 primary colorectal cancers and their corresponding metastatic lesions by means of a cDNA microarray consisting of 9121 human genes. This analysis identified 40 genes whose expression was commonly upregulated in metastatic lesions, and 7 that were commonly downregulated. The upregulated genes encoded proteins involved in cell adhesion, or remodeling of the actin cytoskeleton. Investigation of the functions of more of the altered genes should improve our understanding of metastasis and may identify diagnostic markers and/or novel molecular targets for prevention or therapy of metastatic lesions.

  10. Development of DNA Microarrays for Metabolic Pathway and Bioprocess Monitoring

    Energy Technology Data Exchange (ETDEWEB)

    Gregory Stephanopoulos

    2004-07-31

    Transcriptional profiling experiments utilizing DNA microarrays to study the intracellular accumulation of PHB in Synechocystis has proved difficult in large part because strains that show significant differences in PHB which would justify global analysis of gene expression have not been isolated.

  11. Gene selection for microarray data classification via subspace learning and manifold regularization.

    Science.gov (United States)

    Tang, Chang; Cao, Lijuan; Zheng, Xiao; Wang, Minhui

    2017-12-19

    With the rapid development of DNA microarray technology, large amount of genomic data has been generated. Classification of these microarray data is a challenge task since gene expression data are often with thousands of genes but a small number of samples. In this paper, an effective gene selection method is proposed to select the best subset of genes for microarray data with the irrelevant and redundant genes removed. Compared with original data, the selected gene subset can benefit the classification task. We formulate the gene selection task as a manifold regularized subspace learning problem. In detail, a projection matrix is used to project the original high dimensional microarray data into a lower dimensional subspace, with the constraint that the original genes can be well represented by the selected genes. Meanwhile, the local manifold structure of original data is preserved by a Laplacian graph regularization term on the low-dimensional data space. The projection matrix can serve as an importance indicator of different genes. An iterative update algorithm is developed for solving the problem. Experimental results on six publicly available microarray datasets and one clinical dataset demonstrate that the proposed method performs better when compared with other state-of-the-art methods in terms of microarray data classification. Graphical Abstract The graphical abstract of this work.

  12. The Development of Protein Microarrays and Their Applications in DNA-Protein and Protein-Protein Interaction Analyses of Arabidopsis Transcription Factors

    Science.gov (United States)

    Gong, Wei; He, Kun; Covington, Mike; Dinesh-Kumar, S. P.; Snyder, Michael; Harmer, Stacey L.; Zhu, Yu-Xian; Deng, Xing Wang

    2009-01-01

    We used our collection of Arabidopsis transcription factor (TF) ORFeome clones to construct protein microarrays containing as many as 802 TF proteins. These protein microarrays were used for both protein-DNA and protein-protein interaction analyses. For protein-DNA interaction studies, we examined AP2/ERF family TFs and their cognate cis-elements. By careful comparison of the DNA-binding specificity of 13 TFs on the protein microarray with previous non-microarray data, we showed that protein microarrays provide an efficient and high throughput tool for genome-wide analysis of TF-DNA interactions. This microarray protein-DNA interaction analysis allowed us to derive a comprehensive view of DNA-binding profiles of AP2/ERF family proteins in Arabidopsis. It also revealed four TFs that bound the EE (evening element) and had the expected phased gene expression under clock-regulation, thus providing a basis for further functional analysis of their roles in clock regulation of gene expression. We also developed procedures for detecting protein interactions using this TF protein microarray and discovered four novel partners that interact with HY5, which can be validated by yeast two-hybrid assays. Thus, plant TF protein microarrays offer an attractive high-throughput alternative to traditional techniques for TF functional characterization on a global scale. PMID:19802365

  13. Investigation of Parameters that Affect the Success Rate of Microarray-Based Allele-Specific Hybridization Assays

    DEFF Research Database (Denmark)

    Poulsen, Lena; Søe, Martin Jensen; Moller, Lisbeth Birk

    2011-01-01

    Background: The development of microarray-based genetic tests for diseases that are caused by known mutations is becoming increasingly important. The key obstacle to developing functional genotyping assays is that such mutations need to be genotyped regardless of their location in genomic regions...

  14. Transcriptome sequencing of the Microarray Quality Control (MAQC RNA reference samples using next generation sequencing

    Directory of Open Access Journals (Sweden)

    Thierry-Mieg Danielle

    2009-06-01

    Full Text Available Abstract Background Transcriptome sequencing using next-generation sequencing platforms will soon be competing with DNA microarray technologies for global gene expression analysis. As a preliminary evaluation of these promising technologies, we performed deep sequencing of cDNA synthesized from the Microarray Quality Control (MAQC reference RNA samples using Roche's 454 Genome Sequencer FLX. Results We generated more that 3.6 million sequence reads of average length 250 bp for the MAQC A and B samples and introduced a data analysis pipeline for translating cDNA read counts into gene expression levels. Using BLAST, 90% of the reads mapped to the human genome and 64% of the reads mapped to the RefSeq database of well annotated genes with e-values ≤ 10-20. We measured gene expression levels in the A and B samples by counting the numbers of reads that mapped to individual RefSeq genes in multiple sequencing runs to evaluate the MAQC quality metrics for reproducibility, sensitivity, specificity, and accuracy and compared the results with DNA microarrays and Quantitative RT-PCR (QRTPCR from the MAQC studies. In addition, 88% of the reads were successfully aligned directly to the human genome using the AceView alignment programs with an average 90% sequence similarity to identify 137,899 unique exon junctions, including 22,193 new exon junctions not yet contained in the RefSeq database. Conclusion Using the MAQC metrics for evaluating the performance of gene expression platforms, the ExpressSeq results for gene expression levels showed excellent reproducibility, sensitivity, and specificity that improved systematically with increasing shotgun sequencing depth, and quantitative accuracy that was comparable to DNA microarrays and QRTPCR. In addition, a careful mapping of the reads to the genome using the AceView alignment programs shed new light on the complexity of the human transcriptome including the discovery of thousands of new splice variants.

  15. Accounting for one-channel depletion improves missing value imputation in 2-dye microarray data.

    Science.gov (United States)

    Ritz, Cecilia; Edén, Patrik

    2008-01-19

    For 2-dye microarray platforms, some missing values may arise from an un-measurably low RNA expression in one channel only. Information of such "one-channel depletion" is so far not included in algorithms for imputation of missing values. Calculating the mean deviation between imputed values and duplicate controls in five datasets, we show that KNN-based imputation gives a systematic bias of the imputed expression values of one-channel depleted spots. Evaluating the correction of this bias by cross-validation showed that the mean square deviation between imputed values and duplicates were reduced up to 51%, depending on dataset. By including more information in the imputation step, we more accurately estimate missing expression values.

  16. Protein expression based multimarker analysis of breast cancer samples

    International Nuclear Information System (INIS)

    Presson, Angela P; Horvath, Steve; Yoon, Nam K; Bagryanova, Lora; Mah, Vei; Alavi, Mohammad; Maresh, Erin L; Rajasekaran, Ayyappan K; Goodglick, Lee; Chia, David

    2011-01-01

    Tissue microarray (TMA) data are commonly used to validate the prognostic accuracy of tumor markers. For example, breast cancer TMA data have led to the identification of several promising prognostic markers of survival time. Several studies have shown that TMA data can also be used to cluster patients into clinically distinct groups. Here we use breast cancer TMA data to cluster patients into distinct prognostic groups. We apply weighted correlation network analysis (WGCNA) to TMA data consisting of 26 putative tumor biomarkers measured on 82 breast cancer patients. Based on this analysis we identify three groups of patients with low (5.4%), moderate (22%) and high (50%) mortality rates, respectively. We then develop a simple threshold rule using a subset of three markers (p53, Na-KATPase-β1, and TGF β receptor II) that can approximately define these mortality groups. We compare the results of this correlation network analysis with results from a standard Cox regression analysis. We find that the rule-based grouping variable (referred to as WGCNA*) is an independent predictor of survival time. While WGCNA* is based on protein measurements (TMA data), it validated in two independent Affymetrix microarray gene expression data (which measure mRNA abundance). We find that the WGCNA patient groups differed by 35% from mortality groups defined by a more conventional stepwise Cox regression analysis approach. We show that correlation network methods, which are primarily used to analyze the relationships between gene products, are also useful for analyzing the relationships between patients and for defining distinct patient groups based on TMA data. We identify a rule based on three tumor markers for predicting breast cancer survival outcomes

  17. Microarray Analyses of Genes Differentially Expressed by Diet (Black Beans and Soy Flour) during Azoxymethane-Induced Colon Carcinogenesis in Rats.

    Science.gov (United States)

    Rondini, Elizabeth A; Bennink, Maurice R

    2012-01-01

    We previously demonstrated that black bean (BB) and soy flour (SF)-based diets inhibit azoxymethane (AOM)-induced colon cancer. The objective of this study was to identify genes altered by carcinogen treatment in normal-appearing colonic mucosa and those attenuated by bean feeding. Ninety-five male F344 rats were fed control (AIN) diets upon arrival. At 4 and 5 weeks, rats were injected with AOM (15 mg/kg) or saline and one week later administered an AIN, BB-, or SF-based diet. Rats were sacrificed after 31 weeks, and microarrays were conducted on RNA isolated from the distal colonic mucosa. AOM treatment induced a number of genes involved in immunity, including several MHC II-associated antigens and innate defense genes (RatNP-3, Lyz2, Pla2g2a). BB- and SF-fed rats exhibited a higher expression of genes involved in energy metabolism and water and sodium absorption and lower expression of innate (RatNP-3, Pla2g2a, Tlr4, Dmbt1) and cell cycle-associated (Cdc2, Ccnb1, Top2a) genes. Genes involved in the extracellular matrix (Col1a1, Fn1) and innate immunity (RatNP-3, Pla2g2a) were induced by AOM in all diets, but to a lower extent in bean-fed animals. This profile suggests beans inhibit colon carcinogenesis by modulating cellular kinetics and reducing inflammation, potentially by preserving mucosal barrier function.

  18. Transcription analysis of apple fruit development using cDNA microarrays

    NARCIS (Netherlands)

    Soglio, V.; Costa, F.; Molthoff, J.W.; Weemen-Hendriks, M.; Schouten, H.J.; Gianfranceschi, L.

    2009-01-01

    The knowledge of the molecular mechanisms underlying fruit quality traits is fundamental to devise efficient marker-assisted selection strategies and to improve apple breeding. In this study, cDNA microarray technology was used to identify genes whose expression changes during fruit development and

  19. Relative impact of key sources of systematic noise in Affymetrix and Illumina gene-expression microarray experiments

    Directory of Open Access Journals (Sweden)

    Kitchen Robert R

    2011-12-01

    Full Text Available Abstract Background Systematic processing noise, which includes batch effects, is very common in microarray experiments but is often ignored despite its potential to confound or compromise experimental results. Compromised results are most likely when re-analysing or integrating datasets from public repositories due to the different conditions under which each dataset is generated. To better understand the relative noise-contributions of various factors in experimental-design, we assessed several Illumina and Affymetrix datasets for technical variation between replicate hybridisations of Universal Human Reference (UHRR and individual or pooled breast-tumour RNA. Results A varying degree of systematic noise was observed in each of the datasets, however in all cases the relative amount of variation between standard control RNA replicates was found to be greatest at earlier points in the sample-preparation workflow. For example, 40.6% of the total variation in reported expressions were attributed to replicate extractions, compared to 13.9% due to amplification/labelling and 10.8% between replicate hybridisations. Deliberate probe-wise batch-correction methods were effective in reducing the magnitude of this variation, although the level of improvement was dependent on the sources of noise included in the model. Systematic noise introduced at the chip, run, and experiment levels of a combined Illumina dataset were found to be highly dependant upon the experimental design. Both UHRR and pools of RNA, which were derived from the samples of interest, modelled technical variation well although the pools were significantly better correlated (4% average improvement and better emulated the effects of systematic noise, over all probes, than the UHRRs. The effect of this noise was not uniform over all probes, with low GC-content probes found to be more vulnerable to batch variation than probes with a higher GC-content. Conclusions The magnitude of systematic

  20. Relative impact of key sources of systematic noise in Affymetrix and Illumina gene-expression microarray experiments.

    Science.gov (United States)

    Kitchen, Robert R; Sabine, Vicky S; Simen, Arthur A; Dixon, J Michael; Bartlett, John M S; Sims, Andrew H

    2011-12-01

    Systematic processing noise, which includes batch effects, is very common in microarray experiments but is often ignored despite its potential to confound or compromise experimental results. Compromised results are most likely when re-analysing or integrating datasets from public repositories due to the different conditions under which each dataset is generated. To better understand the relative noise-contributions of various factors in experimental-design, we assessed several Illumina and Affymetrix datasets for technical variation between replicate hybridisations of Universal Human Reference (UHRR) and individual or pooled breast-tumour RNA. A varying degree of systematic noise was observed in each of the datasets, however in all cases the relative amount of variation between standard control RNA replicates was found to be greatest at earlier points in the sample-preparation workflow. For example, 40.6% of the total variation in reported expressions were attributed to replicate extractions, compared to 13.9% due to amplification/labelling and 10.8% between replicate hybridisations. Deliberate probe-wise batch-correction methods were effective in reducing the magnitude of this variation, although the level of improvement was dependent on the sources of noise included in the model. Systematic noise introduced at the chip, run, and experiment levels of a combined Illumina dataset were found to be highly dependent upon the experimental design. Both UHRR and pools of RNA, which were derived from the samples of interest, modelled technical variation well although the pools were significantly better correlated (4% average improvement) and better emulated the effects of systematic noise, over all probes, than the UHRRs. The effect of this noise was not uniform over all probes, with low GC-content probes found to be more vulnerable to batch variation than probes with a higher GC-content. The magnitude of systematic processing noise in a microarray experiment is variable

  1. SSHscreen and SSHdb, generic software for microarray based gene discovery: application to the stress response in cowpea

    Directory of Open Access Journals (Sweden)

    Oelofse Dean

    2010-04-01

    Full Text Available Abstract Background Suppression subtractive hybridization is a popular technique for gene discovery from non-model organisms without an annotated genome sequence, such as cowpea (Vigna unguiculata (L. Walp. We aimed to use this method to enrich for genes expressed during drought stress in a drought tolerant cowpea line. However, current methods were inefficient in screening libraries and management of the sequence data, and thus there was a need to develop software tools to facilitate the process. Results Forward and reverse cDNA libraries enriched for cowpea drought response genes were screened on microarrays, and the R software package SSHscreen 2.0.1 was developed (i to normalize the data effectively using spike-in control spot normalization, and (ii to select clones for sequencing based on the calculation of enrichment ratios with associated statistics. Enrichment ratio 3 values for each clone showed that 62% of the forward library and 34% of the reverse library clones were significantly differentially expressed by drought stress (adjusted p value 88% of the clones in both libraries were derived from rare transcripts in the original tester samples, thus supporting the notion that suppression subtractive hybridization enriches for rare transcripts. A set of 118 clones were chosen for sequencing, and drought-induced cowpea genes were identified, the most interesting encoding a late embryogenesis abundant Lea5 protein, a glutathione S-transferase, a thaumatin, a universal stress protein, and a wound induced protein. A lipid transfer protein and several components of photosynthesis were down-regulated by the drought stress. Reverse transcriptase quantitative PCR confirmed the enrichment ratio values for the selected cowpea genes. SSHdb, a web-accessible database, was developed to manage the clone sequences and combine the SSHscreen data with sequence annotations derived from BLAST and Blast2GO. The self-BLAST function within SSHdb grouped

  2. TF-finder: A software package for identifying transcription factors involved in biological processes using microarray data and existing knowledge base

    Directory of Open Access Journals (Sweden)

    Cui Xiaoqi

    2010-08-01

    Full Text Available Abstract Background Identification of transcription factors (TFs involved in a biological process is the first step towards a better understanding of the underlying regulatory mechanisms. However, due to the involvement of a large number of genes and complicated interactions in a gene regulatory network (GRN, identification of the TFs involved in a biology process remains to be very challenging. In reality, the recognition of TFs for a given a biological process can be further complicated by the fact that most eukaryotic genomes encode thousands of TFs, which are organized in gene families of various sizes and in many cases with poor sequence conservation except for small conserved domains. This poses a significant challenge for identification of the exact TFs involved or ranking the importance of a set of TFs to a process of interest. Therefore, new methods for recognizing novel TFs are desperately needed. Although a plethora of methods have been developed to infer regulatory genes using microarray data, it is still rare to find the methods that use existing knowledge base in particular the validated genes known to be involved in a process to bait/guide discovery of novel TFs. Such methods can replace the sometimes-arbitrary process of selection of candidate genes for experimental validation and significantly advance our knowledge and understanding of the regulation of a process. Results We developed an automated software package called TF-finder for recognizing TFs involved in a biological process using microarray data and existing knowledge base. TF-finder contains two components, adaptive sparse canonical correlation analysis (ASCCA and enrichment test, for TF recognition. ASCCA uses positive target genes to bait TFS from gene expression data while enrichment test examines the presence of positive TFs in the outcomes from ASCCA. Using microarray data from salt and water stress experiments, we showed TF-finder is very efficient in recognizing

  3. An algorithm for finding biologically significant features in microarray data based on a priori manifold learning.

    Directory of Open Access Journals (Sweden)

    Zena M Hira

    Full Text Available Microarray databases are a large source of genetic data, which, upon proper analysis, could enhance our understanding of biology and medicine. Many microarray experiments have been designed to investigate the genetic mechanisms of cancer, and analytical approaches have been applied in order to classify different types of cancer or distinguish between cancerous and non-cancerous tissue. However, microarrays are high-dimensional datasets with high levels of noise and this causes problems when using machine learning methods. A popular approach to this problem is to search for a set of features that will simplify the structure and to some degree remove the noise from the data. The most widely used approach to feature extraction is principal component analysis (PCA which assumes a multivariate Gaussian model of the data. More recently, non-linear methods have been investigated. Among these, manifold learning algorithms, for example Isomap, aim to project the data from a higher dimensional space onto a lower dimension one. We have proposed a priori manifold learning for finding a manifold in which a representative set of microarray data is fused with relevant data taken from the KEGG pathway database. Once the manifold has been constructed the raw microarray data is projected onto it and clustering and classification can take place. In contrast to earlier fusion based methods, the prior knowledge from the KEGG databases is not used in, and does not bias the classification process--it merely acts as an aid to find the best space in which to search the data. In our experiments we have found that using our new manifold method gives better classification results than using either PCA or conventional Isomap.

  4. Fibre optic microarrays.

    Science.gov (United States)

    Walt, David R

    2010-01-01

    This tutorial review describes how fibre optic microarrays can be used to create a variety of sensing and measurement systems. This review covers the basics of optical fibres and arrays, the different microarray architectures, and describes a multitude of applications. Such arrays enable multiplexed sensing for a variety of analytes including nucleic acids, vapours, and biomolecules. Polymer-coated fibre arrays can be used for measuring microscopic chemical phenomena, such as corrosion and localized release of biochemicals from cells. In addition, these microarrays can serve as a substrate for fundamental studies of single molecules and single cells. The review covers topics of interest to chemists, biologists, materials scientists, and engineers.

  5. Multiplex Detection and Genotyping of Point Mutations Involved in Charcot-Marie-Tooth Disease Using a Hairpin Microarray-Based Assay

    Directory of Open Access Journals (Sweden)

    Yasser Baaj

    2009-01-01

    Full Text Available We previously developed a highly specific method for detecting SNPs with a microarray-based system using stem-loop probes. In this paper we demonstrate that coupling a multiplexing procedure with our microarray method is possible for the simultaneous detection and genotyping of four point mutations, in three different genes, involved in Charcot-Marie-Tooth disease. DNA from healthy individuals and patients was amplified, labeled with Cy3 by multiplex PCR; and hybridized to microarrays. Spot signal intensities were 18 to 74 times greater for perfect matches than for mismatched target sequences differing by a single nucleotide (discrimination ratio for “homozygous” DNA from healthy individuals. “Heterozygous” mutant DNA samples gave signal intensity ratios close to 1 at the positions of the mutations as expected. Genotyping by this method was therefore reliable. This system now combines the principle of highly specific genotyping based on stem-loop structure probes with the advantages of multiplex analysis.

  6. Large scale aggregate microarray analysis reveals three distinct molecular subclasses of human preeclampsia.

    Science.gov (United States)

    Leavey, Katherine; Bainbridge, Shannon A; Cox, Brian J

    2015-01-01

    Preeclampsia (PE) is a life-threatening hypertensive pathology of pregnancy affecting 3-5% of all pregnancies. To date, PE has no cure, early detection markers, or effective treatments short of the removal of what is thought to be the causative organ, the placenta, which may necessitate a preterm delivery. Additionally, numerous small placental microarray studies attempting to identify "PE-specific" genes have yielded inconsistent results. We therefore hypothesize that preeclampsia is a multifactorial disease encompassing several pathology subclasses, and that large cohort placental gene expression analysis will reveal these groups. To address our hypothesis, we utilized known bioinformatic methods to aggregate 7 microarray data sets across multiple platforms in order to generate a large data set of 173 patient samples, including 77 with preeclampsia. Unsupervised clustering of these patient samples revealed three distinct molecular subclasses of PE. This included a "canonical" PE subclass demonstrating elevated expression of known PE markers and genes associated with poor oxygenation and increased secretion, as well as two other subclasses potentially representing a poor maternal response to pregnancy and an immunological presentation of preeclampsia. Our analysis sheds new light on the heterogeneity of PE patients, and offers up additional avenues for future investigation. Hopefully, our subclassification of preeclampsia based on molecular diversity will finally lead to the development of robust diagnostics and patient-based treatments for this disorder.

  7. Global pathway analysis using DNA microarrays in skeletal muscle of women with polycystic ovary syndrome

    DEFF Research Database (Denmark)

    Skov, Vibe

    2007-01-01

    (study 1), to investigate whether pioglitazone therapy could reverse abnormalities in the transcriptional profile of muscle associated with insulin resistance in skeletal muscle of obese PCOS patients (study 2), and to develop a microarray platform for global gene expression profiling (study 3). In study...... comparable to other commercial and custom made microarrays and is a cost-effective alternative especially in larger epidemiological studies....

  8. Understanding Autoimmune Mechanisms in Multiple Sclerosis Using Gene Expression Microarrays: Treatment Effect and Cytokine-related Pathways

    Directory of Open Access Journals (Sweden)

    A. Achiron

    2004-01-01

    Full Text Available Multiple sclerosis (MS is a central nervous system disease in which activated autoreactive T-cells invade the blood brain barrier and initiate an inflammatory response that leads to myelin destruction and axonal loss. The etiology of MS, as well as the mechanisms associated with its unexpected onset, the unpredictable clinical course spanning decades, and the different rates of progression leading to disability over time, remains an enigma. We have applied gene expression microarrays technology in peripheral blood mononuclear cells (PBMC to better understand MS pathogenesis and better target treatment approaches. A signature of 535 genes were found to distinguish immunomodulatory treatment effects between 13 treated and 13 untreated MS patients. In addition, the expression pattern of 1109 gene transcripts that were previously reported to significantly differentiate between MS patients and healthy subjects were further analyzed to study the effect of cytokine-related pathways on disease pathogenesis. When relative gene expression for 26 MS patients was compared to 18 healthy controls, 30 genes related to various cytokine-associated pathways were identified. These genes belong to a variety of families such as interleukins, small inducible cytokine subfamily and tumor necrosis factor ligand and receptor. Further analysis disclosed seven cytokine-associated genes within the immunomodulatory treatment signature, and two cytokine-associated genes SCYA4 (small inducible cytokine A4 and FCAR (Fc fragment of IgA, CD89 that were common to both the MS gene expression signature and the immunomodulatory treatment gene expression signature. Our results indicate that cytokine-associated genes are involved in various pathogenic pathways in MS and also related to immunomodulatory treatment effects.

  9. Microarrays for Universal Detection and Identification of Phytoplasmas

    DEFF Research Database (Denmark)

    Nicolaisen, Mogens; Nyskjold, Henriette; Bertaccini, Assunta

    2013-01-01

    Detection and identification of phytoplasmas is a laborious process often involving nested PCR followed by restriction enzyme analysis and fine-resolution gel electrophoresis. To improve throughput, other methods are needed. Microarray technology offers a generic assay that can potentially detect...... and differentiate all types of phytoplasmas in one assay. The present protocol describes a microarray-based method for identification of phytoplasmas to 16Sr group level....

  10. Brachyury, SOX-9, and Podoplanin, New Markers in the Skull Base Chordoma Vs Chondrosarcoma Differential: A Tissue Microarray Based Comparative Analysis

    Science.gov (United States)

    Oakley, GJ; Fuhrer, K; Seethala, RR

    2014-01-01

    The distinction between chondrosarcoma and chordoma of the skull base/head and neck is prognostically important; however, both have sufficient morphologic overlap to make distinction difficult. As a result of gene expression studies, additional candidate markers have been proposed to help in this distinction. Hence, we sought to evaluate the performance of new markers: brachyury, SOX-9, and podoplanin alongside the more traditional markers glial fibrillary acid protein, carcinoembryonic antigen, CD24 and epithelial membrane antigen. Paraffin blocks from 103 skull base/head and neck chondroid tumors from 70 patients were retrieved (1969-2007). Diagnoses were made based on morphology and/or whole section immunohistochemistry for cytokeratin and S100 protein yielding 79 chordomas (comprising 45 chondroid chordomas and 34 conventional chordomas), and 24 chondrosarcomas. A tissue microarray containing 0.6 mm cores of each tumor in triplicate was constructed using a manual array (MTA-1, Beecher Instruments). For visualization of staining, the ImmPRESS detection system (Vector Laboratories) with 2 - diaminobenzidine substrate was used. Sensitivities and specificities were calculated for each marker. Core loss from the microarray ranged from 25-29% yielding 66-78 viable cases per stain. The classic marker, cytokeratin, still has the best performance characteristics. When combined with brachyury, accuracy improves slightly (sensitivity and specificity for detection of chordoma 98% and 100%, respectively). Positivity for both epithelial membrane antigen and AE1/AE3 had a sensitivity of 90% and a specificity of 100% for detecting chordoma in this study. SOX-9 is apparently common to both notochordal and cartilaginous differentiation, and is not useful in the chordoma-chondrosarcoma differential diagnosis. Glial fibrillary acid protein, carcinoembryonic antigen, CD24, and epithelial membrane antigen did not outperform other markers, and are less useful in the diagnosis of

  11. Prediction of transcriptional regulatory elements for plant hormone responses based on microarray data

    Directory of Open Access Journals (Sweden)

    Yamaguchi-Shinozaki Kazuko

    2011-02-01

    Full Text Available Abstract Background Phytohormones organize plant development and environmental adaptation through cell-to-cell signal transduction, and their action involves transcriptional activation. Recent international efforts to establish and maintain public databases of Arabidopsis microarray data have enabled the utilization of this data in the analysis of various phytohormone responses, providing genome-wide identification of promoters targeted by phytohormones. Results We utilized such microarray data for prediction of cis-regulatory elements with an octamer-based approach. Our test prediction of a drought-responsive RD29A promoter with the aid of microarray data for response to drought, ABA and overexpression of DREB1A, a key regulator of cold and drought response, provided reasonable results that fit with the experimentally identified regulatory elements. With this succession, we expanded the prediction to various phytohormone responses, including those for abscisic acid, auxin, cytokinin, ethylene, brassinosteroid, jasmonic acid, and salicylic acid, as well as for hydrogen peroxide, drought and DREB1A overexpression. Totally 622 promoters that are activated by phytohormones were subjected to the prediction. In addition, we have assigned putative functions to 53 octamers of the Regulatory Element Group (REG that have been extracted as position-dependent cis-regulatory elements with the aid of their feature of preferential appearance in the promoter region. Conclusions Our prediction of Arabidopsis cis-regulatory elements for phytohormone responses provides guidance for experimental analysis of promoters to reveal the basis of the transcriptional network of phytohormone responses.

  12. Integrating Multiple Microarray Data for Cancer Pathway Analysis Using Bootstrapping K-S Test

    Directory of Open Access Journals (Sweden)

    Bing Han

    2009-01-01

    Full Text Available Previous applications of microarray technology for cancer research have mostly focused on identifying genes that are differentially expressed between a particular cancer and normal cells. In a biological system, genes perform different molecular functions and regulate various biological processes via interactions with other genes thus forming a variety of complex networks. Therefore, it is critical to understand the relationship (e.g., interactions between genes across different types of cancer in order to gain insights into the molecular mechanisms of cancer. Here we propose an integrative method based on the bootstrapping Kolmogorov-Smirnov test and a large set of microarray data produced with various types of cancer to discover common molecular changes in cells from normal state to cancerous state. We evaluate our method using three key pathways related to cancer and demonstrate that it is capable of finding meaningful alterations in gene relations.

  13. Identification and comprehensive evaluation of reference genes for RT-qPCR analysis of host gene-expression in Brassica juncea-aphid interaction using microarray data.

    Science.gov (United States)

    Ram, Chet; Koramutla, Murali Krishna; Bhattacharya, Ramcharan

    2017-07-01

    Brassica juncea is a chief oil yielding crop in many parts of the world including India. With advancement of molecular techniques, RT-qPCR based study of gene-expression has become an integral part of experimentations in crop breeding. In RT-qPCR, use of appropriate reference gene(s) is pivotal. The virtue of the reference genes, being constant in expression throughout the experimental treatments, needs to be validated case by case. Appropriate reference gene(s) for normalization of gene-expression data in B. juncea during the biotic stress of aphid infestation is not known. In the present investigation, 11 reference genes identified from microarray database of Arabidopsis-aphid interaction at a cut off FDR ≤0.1, along with two known reference genes of B. juncea, were analyzed for their expression stability upon aphid infestation. These included 6 frequently used and 5 newly identified reference genes. Ranking orders of the reference genes in terms of expression stability were calculated using advanced statistical approaches such as geNorm, NormFinder, delta Ct and BestKeeper. The analysis suggested CAC, TUA and DUF179 as the most suitable reference genes. Further, normalization of the gene-expression data of STP4 and PR1 by the most and the least stable reference gene, respectively has demonstrated importance and applicability of the recommended reference genes in aphid infested samples of B. juncea. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

  14. Optimal consistency in microRNA expression analysis using reference-gene-based normalization.

    Science.gov (United States)

    Wang, Xi; Gardiner, Erin J; Cairns, Murray J

    2015-05-01

    Normalization of high-throughput molecular expression profiles secures differential expression analysis between samples of different phenotypes or biological conditions, and facilitates comparison between experimental batches. While the same general principles apply to microRNA (miRNA) normalization, there is mounting evidence that global shifts in their expression patterns occur in specific circumstances, which pose a challenge for normalizing miRNA expression data. As an alternative to global normalization, which has the propensity to flatten large trends, normalization against constitutively expressed reference genes presents an advantage through their relative independence. Here we investigated the performance of reference-gene-based (RGB) normalization for differential miRNA expression analysis of microarray expression data, and compared the results with other normalization methods, including: quantile, variance stabilization, robust spline, simple scaling, rank invariant, and Loess regression. The comparative analyses were executed using miRNA expression in tissue samples derived from subjects with schizophrenia and non-psychiatric controls. We proposed a consistency criterion for evaluating methods by examining the overlapping of differentially expressed miRNAs detected using different partitions of the whole data. Based on this criterion, we found that RGB normalization generally outperformed global normalization methods. Thus we recommend the application of RGB normalization for miRNA expression data sets, and believe that this will yield a more consistent and useful readout of differentially expressed miRNAs, particularly in biological conditions characterized by large shifts in miRNA expression.

  15. A statistical framework for differential network analysis from microarray data

    Directory of Open Access Journals (Sweden)

    Datta Somnath

    2010-02-01

    Full Text Available Abstract Background It has been long well known that genes do not act alone; rather groups of genes act in consort during a biological process. Consequently, the expression levels of genes are dependent on each other. Experimental techniques to detect such interacting pairs of genes have been in place for quite some time. With the advent of microarray technology, newer computational techniques to detect such interaction or association between gene expressions are being proposed which lead to an association network. While most microarray analyses look for genes that are differentially expressed, it is of potentially greater significance to identify how entire association network structures change between two or more biological settings, say normal versus diseased cell types. Results We provide a recipe for conducting a differential analysis of networks constructed from microarray data under two experimental settings. At the core of our approach lies a connectivity score that represents the strength of genetic association or interaction between two genes. We use this score to propose formal statistical tests for each of following queries: (i whether the overall modular structures of the two networks are different, (ii whether the connectivity of a particular set of "interesting genes" has changed between the two networks, and (iii whether the connectivity of a given single gene has changed between the two networks. A number of examples of this score is provided. We carried out our method on two types of simulated data: Gaussian networks and networks based on differential equations. We show that, for appropriate choices of the connectivity scores and tuning parameters, our method works well on simulated data. We also analyze a real data set involving normal versus heavy mice and identify an interesting set of genes that may play key roles in obesity. Conclusions Examining changes in network structure can provide valuable information about the

  16. Tyrosine Kinase Gene Expression Profiling in Prostate Cancer

    National Research Council Canada - National Science Library

    Weier, Heinz-Ulrich

    2001-01-01

    ... of these genes parallels the progression of tumors to a more malignant phenotype. We developed a DNA micro-array based screening system to monitor the level of expression of tyrosine kinase (tk...

  17. Tyrosine Kinase Gene Expression Profiling in Prostate Cancer

    National Research Council Canada - National Science Library

    Weier, Heinz-Ulrich

    2002-01-01

    ... of these genes parallels the progression of tumors to a more malignant phenotype. We developed a DNA micro-array based screening system to monitor the level of expression of tyrosine kinase (tk...

  18. How large a training set is needed to develop a classifier for microarray data?

    Science.gov (United States)

    Dobbin, Kevin K; Zhao, Yingdong; Simon, Richard M

    2008-01-01

    A common goal of gene expression microarray studies is the development of a classifier that can be used to divide patients into groups with different prognoses, or with different expected responses to a therapy. These types of classifiers are developed on a training set, which is the set of samples used to train a classifier. The question of how many samples are needed in the training set to produce a good classifier from high-dimensional microarray data is challenging. We present a model-based approach to determining the sample size required to adequately train a classifier. It is shown that sample size can be determined from three quantities: standardized fold change, class prevalence, and number of genes or features on the arrays. Numerous examples and important experimental design issues are discussed. The method is adapted to address ex post facto determination of whether the size of a training set used to develop a classifier was adequate. An interactive web site for performing the sample size calculations is provided. We showed that sample size calculations for classifier development from high-dimensional microarray data are feasible, discussed numerous important considerations, and presented examples.

  19. A random variance model for detection of differential gene expression in small microarray experiments.

    Science.gov (United States)

    Wright, George W; Simon, Richard M

    2003-12-12

    Microarray techniques provide a valuable way of characterizing the molecular nature of disease. Unfortunately expense and limited specimen availability often lead to studies with small sample sizes. This makes accurate estimation of variability difficult, since variance estimates made on a gene by gene basis will have few degrees of freedom, and the assumption that all genes share equal variance is unlikely to be true. We propose a model by which the within gene variances are drawn from an inverse gamma distribution, whose parameters are estimated across all genes. This results in a test statistic that is a minor variation of those used in standard linear models. We demonstrate that the model assumptions are valid on experimental data, and that the model has more power than standard tests to pick up large changes in expression, while not increasing the rate of false positives. This method is incorporated into BRB-ArrayTools version 3.0 (http://linus.nci.nih.gov/BRB-ArrayTools.html). ftp://linus.nci.nih.gov/pub/techreport/RVM_supplement.pdf

  20. Gene expression microarray profiles of cumulus cells in lean and overweight-obese polycystic ovary syndrome patients.

    Science.gov (United States)

    Kenigsberg, Shlomit; Bentov, Yaakov; Chalifa-Caspi, Vered; Potashnik, Gad; Ofir, Rivka; Birk, Ohad S

    2009-02-01

    The aim of this work was to study gene expression patterns of cultured cumulus cells from lean and overweight-obese polycystic ovary syndrome (PCOS) patients using genome-wide oligonucleotide microarray. The study included 25 patients undergoing in vitro fertilization and intra-cytoplasmic sperm injection: 12 diagnosed with PCOS and 13 matching controls. Each of the groups was subdivided into lean (body mass index (BMI) 27) subgroups. The following comparisons of gene expression data were made: lean PCOS versus lean controls, lean PCOS versus overweight PCOS, all PCOS versus all controls, overweight PCOS versus overweight controls, overweight controls versus lean controls and all overweight versus all lean. The largest number of differentially expressed genes (DEGs), with fold change (FC) |FC| >or= 1.5 and P-value lean PCOS versus lean controls comparison (487) with most of these genes being down-regulated in PCOS. The second largest group of DEGs originated from the comparison of lean PCOS versus overweight PCOS (305). The other comparisons resulted in a much smaller number of DEGs (174, 109, 125 and 12, respectively). In the comparison of lean PCOS with lean controls, most DEGs were transcription factors and components of the extracellular matrix and two pathways, Wnt/beta-catenin and mitogen-activated protein kinase. When comparing overweight PCOS with overweight controls, most DEGs were of pathways related to insulin signaling, metabolism and energy production. The finding of unique gene expression patterns in cumulus cells from the two PCOS subtypes is in agreement with other studies that have found the two to be separate entities with potentially different pathophysiologies.

  1. MICROARRAY IMAGE GRIDDING USING GRID LINE REFINEMENT TECHNIQUE

    Directory of Open Access Journals (Sweden)

    V.G. Biju

    2015-05-01

    Full Text Available An important stage in microarray image analysis is gridding. Microarray image gridding is done to locate sub arrays in a microarray image and find co-ordinates of spots within each sub array. For accurate identification of spots, most of the proposed gridding methods require human intervention. In this paper a fully automatic gridding method which enhances spot intensity in the preprocessing step as per a histogram based threshold method is used. The gridding step finds co-ordinates of spots from horizontal and vertical profile of the image. To correct errors due to the grid line placement, a grid line refinement technique is proposed. The algorithm is applied on different image databases and results are compared based on spot detection accuracy and time. An average spot detection accuracy of 95.06% depicts the proposed method’s flexibility and accuracy in finding the spot co-ordinates for different database images.

  2. Comparison of Nanostring nCounter® Data on FFPE Colon Cancer Samples and Affymetrix Microarray Data on Matched Frozen Tissues.

    Directory of Open Access Journals (Sweden)

    Xi Chen

    Full Text Available The prognosis of colorectal cancer (CRC stage II and III patients remains a challenge due to the difficulties of finding robust biomarkers suitable for testing clinical samples. The majority of published gene signatures of CRC have been generated on fresh frozen colorectal tissues. Because collection of frozen tissue is not practical for routine surgical pathology practice, a clinical test that improves prognostic capabilities beyond standard pathological staging of colon cancer will need to be designed for formalin-fixed paraffin-embedded (FFPE tissues. The NanoString nCounter® platform is a gene expression analysis tool developed for use with FFPE-derived samples. We designed a custom nCounter® codeset based on elements from multiple published fresh frozen tissue microarray-based prognostic gene signatures for colon cancer, and we used this platform to systematically compare gene expression data from FFPE with matched microarray array data from frozen tissues. Our results show moderate correlation of gene expression between two platforms and discovery of a small subset of genes as candidate biomarkers for colon cancer prognosis that are detectable and quantifiable in FFPE tissue sections.

  3. Gene expression profiling of acute myeloid leukemia samples from adult patients with AML-M1 and -M2 through boutique microarrays, real-time PCR and droplet digital PCR.

    Science.gov (United States)

    Handschuh, Luiza; Kaźmierczak, Maciej; Milewski, Marek C; Góralski, Michał; Łuczak, Magdalena; Wojtaszewska, Marzena; Uszczyńska-Ratajczak, Barbara; Lewandowski, Krzysztof; Komarnicki, Mieczysław; Figlerowicz, Marek

    2018-03-01

    Acute myeloid leukemia (AML) is the most common and severe form of acute leukemia diagnosed in adults. Owing to its heterogeneity, AML is divided into classes associated with different treatment outcomes and specific gene expression profiles. Based on previous studies on AML, in this study, we designed and generated an AML-array containing 900 oligonucleotide probes complementary to human genes implicated in hematopoietic cell differentiation and maturation, proliferation, apoptosis and leukemic transformation. The AML-array was used to hybridize 118 samples from 33 patients with AML of the M1 and M2 subtypes of the French-American‑British (FAB) classification and 15 healthy volunteers (HV). Rigorous analysis of the microarray data revealed that 83 genes were differentially expressed between the patients with AML and the HV, including genes not yet discussed in the context of AML pathogenesis. The most overexpressed genes in AML were STMN1, KITLG, CDK6, MCM5, KRAS, CEBPA, MYC, ANGPT1, SRGN, RPLP0, ENO1 and SET, whereas the most underexpressed genes were IFITM1, LTB, FCN1, BIRC3, LYZ, ADD3, S100A9, FCER1G, PTRPE, CD74 and TMSB4X. The overexpression of the CPA3 gene was specific for AML with mutated NPM1 and FLT3. Although the microarray-based method was insufficient to differentiate between any other AML subgroups, quantitative PCR approaches enabled us to identify 3 genes (ANXA3, S100A9 and WT1) whose expression can be used to discriminate between the 2 studied AML FAB subtypes. The expression levels of the ANXA3 and S100A9 genes were increased, whereas those of WT1 were decreased in the AML-M2 compared to the AML-M1 group. We also examined the association between the STMN1, CAT and ABL1 genes, and the FLT3 and NPM1 mutation status. FLT3+/NPM1- AML was associated with the highest expression of STMN1, and ABL1 was upregulated in FLT3+ AML and CAT in FLT3- AML, irrespectively of the NPM1 mutation status. Moreover, our results indicated that CAT and WT1

  4. A molecular beacon microarray based on a quantum dot label for detecting single nucleotide polymorphisms.

    Science.gov (United States)

    Guo, Qingsheng; Bai, Zhixiong; Liu, Yuqian; Sun, Qingjiang

    2016-03-15

    In this work, we report the application of streptavidin-coated quantum dot (strAV-QD) in molecular beacon (MB) microarray assays by using the strAV-QD to label the immobilized MB, avoiding target labeling and meanwhile obviating the use of amplification. The MBs are stem-loop structured oligodeoxynucleotides, modified with a thiol and a biotin at two terminals of the stem. With the strAV-QD labeling an "opened" MB rather than a "closed" MB via streptavidin-biotin reaction, a sensitive and specific detection of label-free target DNA sequence is demonstrated by the MB microarray, with a signal-to-background ratio of 8. The immobilized MBs can be perfectly regenerated, allowing the reuse of the microarray. The MB microarray also is able to detect single nucleotide polymorphisms, exhibiting genotype-dependent fluorescence signals. It is demonstrated that the MB microarray can perform as a 4-to-2 encoder, compressing the genotype information into two outputs. Copyright © 2015 Elsevier B.V. All rights reserved.

  5. A Microarray Study of Carpet-Shell Clam (Ruditapes decussatus Shows Common and Organ-Specific Growth-Related Gene Expression Differences in Gills and Digestive Gland

    Directory of Open Access Journals (Sweden)

    Carlos Saavedra

    2017-11-01

    Full Text Available Growth rate is one of the most important traits from the point of view of individual fitness and commercial production in mollusks, but its molecular and physiological basis is poorly known. We have studied differential gene expression related to differences in growth rate in adult individuals of the commercial marine clam Ruditapes decussatus. Gene expression in the gills and the digestive gland was analyzed in 5 fast-growing and five slow-growing animals by means of an oligonucleotide microarray containing 14,003 probes. A total of 356 differentially expressed genes (DEG were found. We tested the hypothesis that differential expression might be concentrated at the growth control gene core (GCGC, i.e., the set of genes that underlie the molecular mechanisms of genetic control of tissue and organ growth and body size, as demonstrated in model organisms. The GCGC includes the genes coding for enzymes of the insulin/insulin-like growth factor signaling pathway (IIS, enzymes of four additional signaling pathways (Raf/Ras/Mapk, Jnk, TOR, and Hippo, and transcription factors acting at the end of those pathways. Only two out of 97 GCGC genes present in the microarray showed differential expression, indicating a very little contribution of GCGC genes to growth-related differential gene expression. Forty eight DEGs were shared by both organs, with gene ontology (GO annotations corresponding to transcription regulation, RNA splicing, sugar metabolism, protein catabolism, immunity, defense against pathogens, and fatty acid biosynthesis. GO term enrichment tests indicated that genes related to growth regulation, development and morphogenesis, extracellular matrix proteins, and proteolysis were overrepresented in the gills. In the digestive gland overrepresented GO terms referred to gene expression control through chromatin rearrangement, RAS-related small GTPases, glucolysis, and energy metabolism. These analyses suggest a relevant role of, among others

  6. The prognostic implication of the expression of EGFR, p53, cyclin D1, Bcl-2 and p16 in primary locally advanced oral squamous cell carcinoma cases: a tissue microarray study.

    Science.gov (United States)

    Solomon, Monica Charlotte; Vidyasagar, M S; Fernandes, Donald; Guddattu, Vasudev; Mathew, Mary; Shergill, Ankur Kaur; Carnelio, Sunitha; Chandrashekar, Chetana

    2016-12-01

    Oral squamous cell carcinomas comprise a heterogeneous tumor cell population with varied molecular characteristics, which makes prognostication of these tumors a complex and challenging issue. Thus, molecular profiling of these tumors is advantageous for an accurate prognostication and treatment planning. This is a retrospective study on a cohort of primary locally advanced oral squamous cell carcinomas (n = 178) of an Indian rural population. The expression of EGFR, p53, cyclin D1, Bcl-2 and p16 in a cohort of primary locally advanced oral squamous cell carcinomas was evaluated. A potential biomarker that can predict the tumor response to treatment was identified. Formalin-fixed paraffin-embedded tumor blocks of (n = 178) of histopathologically diagnosed cases of locally advanced oral squamous cell carcinomas were selected. Tissue microarray blocks were constructed with 2 cores of 2 mm diameter from each tumor block. Four-micron-thick sections were cut from these tissue microarray blocks. These tissue microarray sections were immunohistochemically stained for EGFR, p53, Bcl-2, cyclin D1 and p16. In this cohort, EGFR was the most frequently expressed 150/178 (84%) biomarker of the cases. Kaplan-Meier analysis showed a significant association (p = 0.038) between expression of p53 and a poor prognosis. A Poisson regression analysis showed that tumors that expressed p53 had a two times greater chance of recurrence (unadjusted IRR-95% CI 2.08 (1.03, 4.5), adjusted IRR-2.29 (1.08, 4.8) compared with the tumors that did not express this biomarker. Molecular profiling of oral squamous cell carcinomas will enable us to categorize our patients into more realistic risk groups. With biologically guided tumor characterization, personalized treatment protocols can be designed for individual patients, which will improve the quality of life of these patients.

  7. Integrative missing value estimation for microarray data.

    Science.gov (United States)

    Hu, Jianjun; Li, Haifeng; Waterman, Michael S; Zhou, Xianghong Jasmine

    2006-10-12

    Missing value estimation is an important preprocessing step in microarray analysis. Although several methods have been developed to solve this problem, their performance is unsatisfactory for datasets with high rates of missing data, high measurement noise, or limited numbers of samples. In fact, more than 80% of the time-series datasets in Stanford Microarray Database contain less than eight samples. We present the integrative Missing Value Estimation method (iMISS) by incorporating information from multiple reference microarray datasets to improve missing value estimation. For each gene with missing data, we derive a consistent neighbor-gene list by taking reference data sets into consideration. To determine whether the given reference data sets are sufficiently informative for integration, we use a submatrix imputation approach. Our experiments showed that iMISS can significantly and consistently improve the accuracy of the state-of-the-art Local Least Square (LLS) imputation algorithm by up to 15% improvement in our benchmark tests. We demonstrated that the order-statistics-based integrative imputation algorithms can achieve significant improvements over the state-of-the-art missing value estimation approaches such as LLS and is especially good for imputing microarray datasets with a limited number of samples, high rates of missing data, or very noisy measurements. With the rapid accumulation of microarray datasets, the performance of our approach can be further improved by incorporating larger and more appropriate reference datasets.

  8. Correction of technical bias in clinical microarray data improves concordance with known biological information

    DEFF Research Database (Denmark)

    Eklund, Aron Charles; Szallasi, Zoltan Imre

    2008-01-01

    The performance of gene expression microarrays has been well characterized using controlled reference samples, but the performance on clinical samples remains less clear. We identified sources of technical bias affecting many genes in concert, thus causing spurious correlations in clinical data...... sets and false associations between genes and clinical variables. We developed a method to correct for technical bias in clinical microarray data, which increased concordance with known biological relationships in multiple data sets....

  9. Normalization for triple-target microarray experiments

    Directory of Open Access Journals (Sweden)

    Magniette Frederic

    2008-04-01

    Full Text Available Abstract Background Most microarray studies are made using labelling with one or two dyes which allows the hybridization of one or two samples on the same slide. In such experiments, the most frequently used dyes are Cy3 and Cy5. Recent improvements in the technology (dye-labelling, scanner and, image analysis allow hybridization up to four samples simultaneously. The two additional dyes are Alexa488 and Alexa494. The triple-target or four-target technology is very promising, since it allows more flexibility in the design of experiments, an increase in the statistical power when comparing gene expressions induced by different conditions and a scaled down number of slides. However, there have been few methods proposed for statistical analysis of such data. Moreover the lowess correction of the global dye effect is available for only two-color experiments, and even if its application can be derived, it does not allow simultaneous correction of the raw data. Results We propose a two-step normalization procedure for triple-target experiments. First the dye bleeding is evaluated and corrected if necessary. Then the signal in each channel is normalized using a generalized lowess procedure to correct a global dye bias. The normalization procedure is validated using triple-self experiments and by comparing the results of triple-target and two-color experiments. Although the focus is on triple-target microarrays, the proposed method can be used to normalize p differently labelled targets co-hybridized on a same array, for any value of p greater than 2. Conclusion The proposed normalization procedure is effective: the technical biases are reduced, the number of false positives is under control in the analysis of differentially expressed genes, and the triple-target experiments are more powerful than the corresponding two-color experiments. There is room for improving the microarray experiments by simultaneously hybridizing more than two samples.

  10. Printing Proteins as Microarrays for High-Throughput Function Determination

    Science.gov (United States)

    MacBeath, Gavin; Schreiber, Stuart L.

    2000-09-01

    Systematic efforts are currently under way to construct defined sets of cloned genes for high-throughput expression and purification of recombinant proteins. To facilitate subsequent studies of protein function, we have developed miniaturized assays that accommodate extremely low sample volumes and enable the rapid, simultaneous processing of thousands of proteins. A high-precision robot designed to manufacture complementary DNA microarrays was used to spot proteins onto chemically derivatized glass slides at extremely high spatial densities. The proteins attached covalently to the slide surface yet retained their ability to interact specifically with other proteins, or with small molecules, in solution. Three applications for protein microarrays were demonstrated: screening for protein-protein interactions, identifying the substrates of protein kinases, and identifying the protein targets of small molecules.

  11. Identifying Fishes through DNA Barcodes and Microarrays.

    Directory of Open Access Journals (Sweden)

    Marc Kochzius

    2010-09-01

    Full Text Available International fish trade reached an import value of 62.8 billion Euro in 2006, of which 44.6% are covered by the European Union. Species identification is a key problem throughout the life cycle of fishes: from eggs and larvae to adults in fisheries research and control, as well as processed fish products in consumer protection.This study aims to evaluate the applicability of the three mitochondrial genes 16S rRNA (16S, cytochrome b (cyt b, and cytochrome oxidase subunit I (COI for the identification of 50 European marine fish species by combining techniques of "DNA barcoding" and microarrays. In a DNA barcoding approach, neighbour Joining (NJ phylogenetic trees of 369 16S, 212 cyt b, and 447 COI sequences indicated that cyt b and COI are suitable for unambiguous identification, whereas 16S failed to discriminate closely related flatfish and gurnard species. In course of probe design for DNA microarray development, each of the markers yielded a high number of potentially species-specific probes in silico, although many of them were rejected based on microarray hybridisation experiments. None of the markers provided probes to discriminate the sibling flatfish and gurnard species. However, since 16S-probes were less negatively influenced by the "position of label" effect and showed the lowest rejection rate and the highest mean signal intensity, 16S is more suitable for DNA microarray probe design than cty b and COI. The large portion of rejected COI-probes after hybridisation experiments (>90% renders the DNA barcoding marker as rather unsuitable for this high-throughput technology.Based on these data, a DNA microarray containing 64 functional oligonucleotide probes for the identification of 30 out of the 50 fish species investigated was developed. It represents the next step towards an automated and easy-to-handle method to identify fish, ichthyoplankton, and fish products.

  12. Cross-platform comparison of microarray data using order restricted inference

    Science.gov (United States)

    Klinglmueller, Florian; Tuechler, Thomas; Posch, Martin

    2013-01-01

    Motivation Titration experiments measuring the gene expression from two different tissues, along with total RNA mixtures of the pure samples, are frequently used for quality evaluation of microarray technologies. Such a design implies that the true mRNA expression of each gene, is either constant or follows a monotonic trend between the mixtures, applying itself to the use of order restricted inference procedures. Exploiting only the postulated monotonicity of titration designs, we propose three statistical analysis methods for the validation of high-throughput genetic data and corresponding preprocessing techniques. Results Our methods allow for inference of accuracy, repeatability and cross-platform agreement, with minimal required assumptions regarding the underlying data generating process. Therefore, they are readily applicable to all sorts of genetic high-throughput data independent of the degree of preprocessing. An application to the EMERALD dataset was used to demonstrate how our methods provide a rich spectrum of easily interpretable quality metrics and allow the comparison of different microarray technologies and normalization methods. The results are on par with previous work, but provide additional new insights that cast doubt on the utility of popular preprocessing techniques, specifically concerning the EMERALD projects dataset. Availability All datasets are available on EBI’s ArrayExpress web site (http://www.ebi.ac.uk/microarray-as/ae/) under accession numbers E-TABM-536, E-TABM-554 and E-TABM-555. Source code implemented in C and R is available at: http://statistics.msi.meduniwien.ac.at/float/cross_platform/. Methods for testing and variance decomposition have been made available in the R-package orQA, which can be downloaded and installed from CRAN http://cran.r-project.org. PMID:21317143

  13. Microarray-based genotyping of Salmonella: Inter-laboratory evaluation of reproducibility and standardization potential

    DEFF Research Database (Denmark)

    Grønlund, Hugo Ahlm; Riber, Leise; Vigre, Håkan

    2011-01-01

    Bacterial food-borne infections in humans caused by Salmonella spp. are considered a crucial food safety issue. Therefore, it is important for the risk assessments of Salmonella to consider the genomic variationamong different isolates in order to control pathogen-induced infections. Microarray...... critical methodology parameters that differed between the two labs were identified. These related to printing facilities, choice of hybridization buffer,wash buffers used following the hybridization and choice of procedure for purifying genomic DNA. Critical parameters were randomized in a four......DNA and different wash buffers. However, less agreement (Kappa=0.2–0.6) between microarray results were observed when using different hybridization buffers, indicating this parameter as being highly criticalwhen transferring a standard microarray assay between laboratories. In conclusion, this study indicates...

  14. Transcriptome analysis in non-model species: a new method for the analysis of heterologous hybridization on microarrays

    Directory of Open Access Journals (Sweden)

    Jouventin Pierre

    2010-05-01

    Full Text Available Abstract Background Recent developments in high-throughput methods of analyzing transcriptomic profiles are promising for many areas of biology, including ecophysiology. However, although commercial microarrays are available for most common laboratory models, transcriptome analysis in non-traditional model species still remains a challenge. Indeed, the signal resulting from heterologous hybridization is low and difficult to interpret because of the weak complementarity between probe and target sequences, especially when no microarray dedicated to a genetically close species is available. Results We show here that transcriptome analysis in a species genetically distant from laboratory models is made possible by using MAXRS, a new method of analyzing heterologous hybridization on microarrays. This method takes advantage of the design of several commercial microarrays, with different probes targeting the same transcript. To illustrate and test this method, we analyzed the transcriptome of king penguin pectoralis muscle hybridized to Affymetrix chicken microarrays, two organisms separated by an evolutionary distance of approximately 100 million years. The differential gene expression observed between different physiological situations computed by MAXRS was confirmed by real-time PCR on 10 genes out of 11 tested. Conclusions MAXRS appears to be an appropriate method for gene expression analysis under heterologous hybridization conditions.

  15. SAMSN1 is highly expressed and associated with a poor survival in glioblastoma multiforme.

    Directory of Open Access Journals (Sweden)

    Yong Yan

    Full Text Available OBJECTIVES: To study the expression pattern and prognostic significance of SAMSN1 in glioma. METHODS: Affymetrix and Arrystar gene microarray data in the setting of glioma was analyzed to preliminarily study the expression pattern of SAMSN1 in glioma tissues, and Hieratical clustering of gene microarray data was performed to filter out genes that have prognostic value in malignant glioma. Survival analysis by Kaplan-Meier estimates stratified by SAMSN1 expression was then made based on the data of more than 500 GBM cases provided by The Cancer Genome Atlas (TCGA project. At last, we detected the expression of SAMSN1 in large numbers of glioma and normal brain tissue samples using Tissue Microarray (TMA. Survival analysis by Kaplan-Meier estimates in each grade of glioma was stratified by SAMSN1 expression. Multivariate survival analysis was made by Cox proportional hazards regression models in corresponding groups of glioma. RESULTS: With the expression data of SAMSN1 and 68 other genes, high-grade glioma could be classified into two groups with clearly different prognoses. Gene and large sample tissue microarrays showed high expression of SAMSN1 in glioma particularly in GBM. Survival analysis based on the TCGA GBM data matrix and TMA multi-grade glioma dataset found that SAMSN1 expression was closely related to the prognosis of GBM, either PFS or OS (P<0.05. Multivariate survival analysis with Cox proportional hazards regression models confirmed that high expression of SAMSN1 was a strong risk factor for PFS and OS of GBM patients. CONCLUSION: SAMSN1 is over-expressed in glioma as compared with that found in normal brains, especially in GBM. High expression of SAMSN1 is a significant risk factor for the progression free and overall survival of GBM.

  16. Oral tongue cancer gene expression profiling: Identification of novel potential prognosticators by oligonucleotide microarray analysis

    International Nuclear Information System (INIS)

    Estilo, Cherry L; Boyle, Jay O; Kraus, Dennis H; Patel, Snehal; Shaha, Ashok R; Wong, Richard J; Huryn, Joseph M; Shah, Jatin P; Singh, Bhuvanesh; O-charoenrat, Pornchai; Talbot, Simon; Socci, Nicholas D; Carlson, Diane L; Ghossein, Ronald; Williams, Tijaana; Yonekawa, Yoshihiro; Ramanathan, Yegnanarayana

    2009-01-01

    The present study is aimed at identifying potential candidate genes as prognostic markers in human oral tongue squamous cell carcinoma (SCC) by large scale gene expression profiling. The gene expression profile of patients (n=37) with oral tongue SCC were analyzed using Affymetrix HG-U95Av2 high-density oligonucleotide arrays. Patients (n=20) from which there were available tumor and matched normal mucosa were grouped into stage (early vs. late) and nodal disease (node positive vs. node negative) subgroups and genes differentially expressed in tumor vs. normal and between the subgroups were identified. Three genes, GLUT3, HSAL2, and PACE4, were selected for their potential biological significance in a larger cohort of 49 patients via quantitative real-time RT-PCR. Hierarchical clustering analyses failed to show significant segregation of patients. In patients (n=20) with available tumor and matched normal mucosa, 77 genes were found to be differentially expressed (P< 0.05) in the tongue tumor samples compared to their matched normal controls. Among the 45 over-expressed genes, MMP-1 encoding interstitial collagenase showed the highest level of increase (average: 34.18 folds). Using the criterion of two-fold or greater as overexpression, 30.6%, 24.5% and 26.5% of patients showed high levels of GLUT3, HSAL2 and PACE4, respectively. Univariate analyses demonstrated that GLUT3 over-expression correlated with depth of invasion (P<0.0001), tumor size (P=0.024), pathological stage (P=0.009) and recurrence (P=0.038). HSAL2 was positively associated with depth of invasion (P=0.015) and advanced T stage (P=0.047). In survival studies, only GLUT3 showed a prognostic value with disease-free (P=0.049), relapse-free (P=0.002) and overall survival (P=0.003). PACE4 mRNA expression failed to show correlation with any of the relevant parameters. The characterization of genes identified to be significant predictors of prognosis by oligonucleotide microarray and further validation by

  17. Fascin and EMMPRIN expression in primary mucinous tumors of ovary: a tissue microarray study.

    Science.gov (United States)

    Alici, Omer; Kefeli, Mehmet; Yildiz, Levent; Baris, Sancar; Karagoz, Filiz; Kandemir, Bedri

    2014-12-01

    The aim of this study was to compare the expressions of fascin and EMMPRIN in primary malignant, borderline and benign mucinous ovarian tumors, and to investigate the relationship of these markers with tumor progression and their applicability to differential diagnosis. An immunohistochemical study was performed for fascin and EMMPRIN using the tissue microarray technique. Eighty-one cases were included in the study; there were 37 benign, 25 borderline and 19 malignant primary mucinous ovarian tumors. For each case, a total staining score was determined, consisting of scores for extent of staining and intensity of staining. The cases were allocated to negative, weakly positive and strongly positive staining categories, according to the total staining score. Both of the markers were significantly negative in benign tumors as compared with borderline and malignant tumors. There was no significant difference between borderline and malignant groups for both markers. Sixty-eight percent of malignant tumors were stained positive by fascin, while this rate was 40% for borderline mucinous tumors. All malignant tumors were strongly stained positive for EMMPRIN, while this rate was 92% for borderline mucinous tumors. The rest of the cases stained weakly positive. No significant difference in staining score was found between fascin and EMMPRIN expression. In ovarian primary mucinous tumors, fascin and EMMPRIN may play an important role in tumor progression from benign tumor to carcinoma. In that context, EMMPRIN and fascin expression may have potential application in the differential diagnosis of some diagnostically problematic mucinous ovarian tumors. However, the differential diagnostic applicability of EMMPRIN appears to be more limited than that of fascin due to its wide spectrum of staining in mucinous ovarian tumors. Copyright © 2014 Elsevier GmbH. All rights reserved.

  18. GTI: a novel algorithm for identifying outlier gene expression profiles from integrated microarray datasets.

    Directory of Open Access Journals (Sweden)

    John Patrick Mpindi

    Full Text Available BACKGROUND: Meta-analysis of gene expression microarray datasets presents significant challenges for statistical analysis. We developed and validated a new bioinformatic method for the identification of genes upregulated in subsets of samples of a given tumour type ('outlier genes', a hallmark of potential oncogenes. METHODOLOGY: A new statistical method (the gene tissue index, GTI was developed by modifying and adapting algorithms originally developed for statistical problems in economics. We compared the potential of the GTI to detect outlier genes in meta-datasets with four previously defined statistical methods, COPA, the OS statistic, the t-test and ORT, using simulated data. We demonstrated that the GTI performed equally well to existing methods in a single study simulation. Next, we evaluated the performance of the GTI in the analysis of combined Affymetrix gene expression data from several published studies covering 392 normal samples of tissue from the central nervous system, 74 astrocytomas, and 353 glioblastomas. According to the results, the GTI was better able than most of the previous methods to identify known oncogenic outlier genes. In addition, the GTI identified 29 novel outlier genes in glioblastomas, including TYMS and CDKN2A. The over-expression of these genes was validated in vivo by immunohistochemical staining data from clinical glioblastoma samples. Immunohistochemical data were available for 65% (19 of 29 of these genes, and 17 of these 19 genes (90% showed a typical outlier staining pattern. Furthermore, raltitrexed, a specific inhibitor of TYMS used in the therapy of tumour types other than glioblastoma, also effectively blocked cell proliferation in glioblastoma cell lines, thus highlighting this outlier gene candidate as a potential therapeutic target. CONCLUSIONS/SIGNIFICANCE: Taken together, these results support the GTI as a novel approach to identify potential oncogene outliers and drug targets. The algorithm is

  19. Association Study between BDNF Gene Polymorphisms and Autism by Three-Dimensional Gel-Based Microarray

    Directory of Open Access Journals (Sweden)

    Zuhong Lu

    2009-06-01

    Full Text Available Single nucleotide polymorphisms (SNPs are important markers which can be used in association studies searching for susceptible genes of complex diseases. High-throughput methods are needed for SNP genotyping in a large number of samples. In this study, we applied polyacrylamide gel-based microarray combined with dual-color hybridization for association study of four BDNF polymorphisms with autism. All the SNPs in both patients and controls could be analyzed quickly and correctly. Among four SNPs, only C270T polymorphism showed significant differences in the frequency of the allele (χ2 = 7.809, p = 0.005 and genotype (χ2 = 7.800, p = 0.020. In the haplotype association analysis, there was significant difference in global haplotype distribution between the groups (χ2 = 28.19,p = 3.44e-005. We suggest that BDNF has a possible role in the pathogenesis of autism. The study also show that the polyacrylamide gel-based microarray combined with dual-color hybridization is a rapid, simple and high-throughput method for SNPs genotyping, and can be used for association study of susceptible gene with disorders in large samples.

  20. Microarray analysis of DNA damage repair gene expression profiles in cervical cancer cells radioresistant to 252Cf neutron and X-rays

    International Nuclear Information System (INIS)

    Qing, Yi; Wang, Ge; Wang, Dong; Yang, Xue-Qin; Zhong, Zhao-Yang; Lei, Xin; Xie, Jia-Yin; Li, Meng-Xia; Xiang, De-Bing; Li, Zeng-Peng; Yang, Zhen-Zhou

    2010-01-01

    The aim of the study was to obtain stable radioresistant sub-lines from the human cervical cancer cell line HeLa by prolonged exposure to 252 Cf neutron and X-rays. Radioresistance mechanisms were investigated in the resulting cells using microarray analysis of DNA damage repair genes. HeLa cells were treated with fractionated 252 Cf neutron and X-rays, with a cumulative dose of 75 Gy each, over 8 months, yielding the sub-lines HeLaNR and HeLaXR. Radioresistant characteristics were detected by clone formation assay, ultrastructural observations, cell doubling time, cell cycle distribution, and apoptosis assay. Gene expression patterns of the radioresistant sub-lines were studied through microarray analysis and verified by Western blotting and real-time PCR. The radioresistant sub-lines HeLaNR and HeLaXR were more radioresisitant to 252 Cf neutron and X-rays than parental HeLa cells by detecting their radioresistant characteristics, respectively. Compared to HeLa cells, the expression of 24 genes was significantly altered by at least 2-fold in HeLaNR cells. Of these, 19 genes were up-regulated and 5 down-regulated. In HeLaXR cells, 41 genes were significantly altered by at least 2-fold; 38 genes were up-regulated and 3 down-regulated. Chronic exposure of cells to ionizing radiation induces adaptive responses that enhance tolerance of ionizing radiation and allow investigations of cellular radioresistance mechanisms. The insights gained into the molecular mechanisms activated by these 'radioresistance' genes will lead to new therapeutic targets for cervical cancer

  1. The pathogenesis shared between abdominal aortic aneurysms and intracranial aneurysms: a microarray analysis.

    Science.gov (United States)

    Wang, Wen; Li, Hao; Zhao, Zheng; Wang, Haoyuan; Zhang, Dong; Zhang, Yan; Lan, Qing; Wang, Jiangfei; Cao, Yong; Zhao, Jizong

    2018-04-01

    Abdominal aortic aneurysms (AAAs) and intracranial saccular aneurysms (IAs) are the most common types of aneurysms. This study was to investigate the common pathogenesis shared between these two kinds of aneurysms. We collected 12 IAs samples and 12 control arteries from the Beijing Tiantan Hospital and performed microarray analysis. In addition, we utilized the microarray datasets of IAs and AAAs from the Gene Expression Omnibus (GEO), in combination with our microarray results, to generate messenger RNA expression profiles for both AAAs and IAs in our study. Functional exploration and protein-protein interaction (PPI) analysis were performed. A total of 727 common genes were differentially expressed (404 was upregulated; 323 was downregulated) for both AAAs and IAs. The GO and pathway analyses showed that the common dysregulated genes were mainly enriched in vascular smooth muscle contraction, muscle contraction, immune response, defense response, cell activation, IL-6 signaling and chemokine signaling pathways, etc. The further protein-protein analysis identified 35 hub nodes, including TNF, IL6, MAPK13, and CCL5. These hub node genes were enriched in inflammatory response, positive regulation of IL-6 production, chemokine signaling pathway, and T/B cell receptor signaling pathway. Our study will gain new insight into the molecular mechanisms for the pathogenesis of both types of aneurysms and provide new therapeutic targets for the patients harboring AAAs and IAs.

  2. CoPub: a literature-based keyword enrichment tool for microarray data analysis.

    Science.gov (United States)

    Frijters, Raoul; Heupers, Bart; van Beek, Pieter; Bouwhuis, Maurice; van Schaik, René; de Vlieg, Jacob; Polman, Jan; Alkema, Wynand

    2008-07-01

    Medline is a rich information source, from which links between genes and keywords describing biological processes, pathways, drugs, pathologies and diseases can be extracted. We developed a publicly available tool called CoPub that uses the information in the Medline database for the biological interpretation of microarray data. CoPub allows batch input of multiple human, mouse or rat genes and produces lists of keywords from several biomedical thesauri that are significantly correlated with the set of input genes. These lists link to Medline abstracts in which the co-occurring input genes and correlated keywords are highlighted. Furthermore, CoPub can graphically visualize differentially expressed genes and over-represented keywords in a network, providing detailed insight in the relationships between genes and keywords, and revealing the most influential genes as highly connected hubs. CoPub is freely accessible at http://services.nbic.nl/cgi-bin/copub/CoPub.pl.

  3. Microarray Analysis Reveals Higher Gestational Folic Acid Alters Expression of Genes in the Cerebellum of Mice Offspring—A Pilot Study

    Directory of Open Access Journals (Sweden)

    Subit Barua

    2015-01-01

    Full Text Available Folate is a water-soluble vitamin that is critical for nucleotide synthesis and can modulate methylation of DNA by altering one-carbon metabolism. Previous studies have shown that folate status during pregnancy is associated with various congenital defects including the risk of aberrant neural tube closure. Maternal exposure to a methyl supplemented diet also can alter DNA methylation and gene expression, which may influence the phenotype of offspring. We investigated if higher gestational folic acid (FA in the diet dysregulates the expression of genes in the cerebellum of offspring in C57BL/6 J mice. One week before gestation and throughout the pregnancy, groups of dams were supplemented with FA either at 2 mg/kg or 20 mg/kg of diet. Microarray analysis was used to investigate the genome wide gene expression profile in the cerebellum from day old pups. Our results revealed that exposure to the higher dose FA diet during gestation dysregulated expression of several genes in the cerebellum of both male and female pups. Several transcription factors, imprinted genes, neuro-developmental genes and genes associated with autism spectrum disorder exhibited altered expression levels. These findings suggest that higher gestational FA potentially dysregulates gene expression in the offspring brain and such changes may adversely alter fetal programming and overall brain development.

  4. Microarray analysis of gene expression alteration in human middle ear epithelial cells induced by micro particle.

    Science.gov (United States)

    Song, Jae-Jun; Kwon, Jee Young; Park, Moo Kyun; Seo, Young Rok

    2013-10-01

    The primary aim of this study is to reveal the effect of particulate matter (PM) on the human middle ear epithelial cell (HMEEC). The HMEEC was treated with PM (300 μg/ml) for 24 h. Total RNA was extracted and used for microarray analysis. Molecular pathways among differentially expressed genes were further analyzed by using Pathway Studio 9.0 software. For selected genes, the changes in gene expression were confirmed by real-time PCR. A total of 611 genes were regulated by PM. Among them, 366 genes were up-regulated, whereas 245 genes were down-regulated. Up-regulated genes were mainly involved in cellular processes, including reactive oxygen species generation, cell proliferation, apoptosis, cell differentiation, inflammatory response and immune response. Down-regulated genes affected several cellular processes, including cell differentiation, cell cycle, proliferation, apoptosis and cell migration. A total of 21 genes were discovered as crucial components in potential signaling networks containing 2-fold up regulated genes. Four genes, VEGFA, IL1B, CSF2 and HMOX1 were revealed as key mediator genes among the up-regulated genes. A total of 25 genes were revealed as key modulators in the signaling pathway associated with 2-fold down regulated genes. Four genes, including IGF1R, TIMP1, IL6 and FN1, were identified as the main modulator genes. We identified the differentially expressed genes in PM-treated HMEEC, whose expression profile may provide a useful clue for the understanding of environmental pathophysiology of otitis media. Our work indicates that air pollution, like PM, plays an important role in the pathogenesis of otitis media. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  5. High-Dimensional Additive Hazards Regression for Oral Squamous Cell Carcinoma Using Microarray Data: A Comparative Study

    Directory of Open Access Journals (Sweden)

    Omid Hamidi

    2014-01-01

    Full Text Available Microarray technology results in high-dimensional and low-sample size data sets. Therefore, fitting sparse models is substantial because only a small number of influential genes can reliably be identified. A number of variable selection approaches have been proposed for high-dimensional time-to-event data based on Cox proportional hazards where censoring is present. The present study applied three sparse variable selection techniques of Lasso, smoothly clipped absolute deviation and the smooth integration of counting, and absolute deviation for gene expression survival time data using the additive risk model which is adopted when the absolute effects of multiple predictors on the hazard function are of interest. The performances of used techniques were evaluated by time dependent ROC curve and bootstrap .632+ prediction error curves. The selected genes by all methods were highly significant (P<0.001. The Lasso showed maximum median of area under ROC curve over time (0.95 and smoothly clipped absolute deviation showed the lowest prediction error (0.105. It was observed that the selected genes by all methods improved the prediction of purely clinical model indicating the valuable information containing in the microarray features. So it was concluded that used approaches can satisfactorily predict survival based on selected gene expression measurements.

  6. A new efficient statistical test for detecting variability in the gene expression data.

    Science.gov (United States)

    Mathur, Sunil; Dolo, Samuel

    2008-08-01

    DNA microarray technology allows researchers to monitor the expressions of thousands of genes under different conditions. The detection of differential gene expression under two different conditions is very important in microarray studies. Microarray experiments are multi-step procedures and each step is a potential source of variance. This makes the measurement of variability difficult because approach based on gene-by-gene estimation of variance will have few degrees of freedom. It is highly possible that the assumption of equal variance for all the expression levels may not hold. Also, the assumption of normality of gene expressions may not hold. Thus it is essential to have a statistical procedure which is not based on the normality assumption and also it can detect genes with differential variance efficiently. The detection of differential gene expression variance will allow us to identify experimental variables that affect different biological processes and accuracy of DNA microarray measurements.In this article, a new nonparametric test for scale is developed based on the arctangent of the ratio of two expression levels. Most of the tests available in literature require the assumption of normal distribution, which makes them inapplicable in many situations, and it is also hard to verify the suitability of the normal distribution assumption for the given data set. The proposed test does not require the assumption of the distribution for the underlying population and hence makes it more practical and widely applicable. The asymptotic relative efficiency is calculated under different distributions, which show that the proposed test is very powerful when the assumption of normality breaks down. Monte Carlo simulation studies are performed to compare the power of the proposed test with some of the existing procedures. It is found that the proposed test is more powerful than commonly used tests under almost all the distributions considered in the study. A

  7. Integrative missing value estimation for microarray data

    Directory of Open Access Journals (Sweden)

    Zhou Xianghong

    2006-10-01

    Full Text Available Abstract Background Missing value estimation is an important preprocessing step in microarray analysis. Although several methods have been developed to solve this problem, their performance is unsatisfactory for datasets with high rates of missing data, high measurement noise, or limited numbers of samples. In fact, more than 80% of the time-series datasets in Stanford Microarray Database contain less than eight samples. Results We present the integrative Missing Value Estimation method (iMISS by incorporating information from multiple reference microarray datasets to improve missing value estimation. For each gene with missing data, we derive a consistent neighbor-gene list by taking reference data sets into consideration. To determine whether the given reference data sets are sufficiently informative for integration, we use a submatrix imputation approach. Our experiments showed that iMISS can significantly and consistently improve the accuracy of the state-of-the-art Local Least Square (LLS imputation algorithm by up to 15% improvement in our benchmark tests. Conclusion We demonstrated that the order-statistics-based integrative imputation algorithms can achieve significant improvements over the state-of-the-art missing value estimation approaches such as LLS and is especially good for imputing microarray datasets with a limited number of samples, high rates of missing data, or very noisy measurements. With the rapid accumulation of microarray datasets, the performance of our approach can be further improved by incorporating larger and more appropriate reference datasets.

  8. Identification and prognostic value of anterior gradient protein 2 expression in breast cancer based on tissue microarray.

    Science.gov (United States)

    Guo, Jilong; Gong, Guohua; Zhang, Bin

    2017-07-01

    Breast cancer has attracted substantial attention as one of the major cancers causing death in women. It is crucial to find potential biomarkers of prognostic value in breast cancer. In this study, the expression pattern of anterior gradient protein 2 in breast cancer was identified based on the main molecular subgroups. Through analysis of 69 samples from the Gene Expression Omnibus database, we found that anterior gradient protein 2 expression was significantly higher in non-triple-negative breast cancer tissues compared with normal tissues and triple-negative breast cancer tissues (p gradient protein 2 expression pattern. Furthermore, we performed immunohistochemical analysis. The quantification results revealed that anterior gradient protein 2 is highly expressed in non-triple-negative breast cancer (grade 3 excluded) and grade 1 + 2 (triple-negative breast cancer excluded) tumours compared with normal tissues. Anterior gradient protein 2 was significantly highly expressed in non-triple-negative breast cancer (grade 3 excluded) and non-triple-negative breast cancer tissues compared with triple-negative breast cancer tissues (p gradient protein 2 was significantly highly expressed in grade 1 + 2 (triple-negative breast cancer excluded) and grade 1 + 2 tissues compared with grade 3 tissues (p gradient protein 2 expression was significantly associated with histologic type, histological grade, oestrogen status and progesterone status. Univariate analysis of clinicopathological variables showed that anterior gradient protein 2 expression, tumour size and lymph node status were significantly correlated with overall survival in patients with grade 1 and 2 tumours. Cox multivariate analysis revealed anterior gradient protein 2 as a putative independent indicator of unfavourable outcomes (p = 0.031). All these data clearly showed that anterior gradient protein 2 is highly expressed in breast cancer and can be regarded as a putative biomarker for

  9. Microarray analysis of thioacetamide-treated type 1 diabetic rats

    International Nuclear Information System (INIS)

    Devi, Sachin S.; Mehendale, Harihara M.

    2006-01-01

    It is well known that diabetes imparts high sensitivity to numerous hepatotoxicants. Previously, we have shown that a normally non-lethal dose of thioacetamide (TA, 300 mg/kg) causes 90% mortality in type 1 diabetic (DB) rats due to inhibited tissue repair allowing progression of liver injury. On the other hand, DB rats exposed to 30 mg TA/kg exhibit delayed tissue repair and delayed recovery from injury. The objective of this study was to investigate the mechanism of impaired tissue repair and progression of liver injury in TA-treated DB rats by using cDNA microarray. Gene expression pattern was examined at 0, 6, and 12 h after TA challenge, and selected mechanistic leads from microarray experiments were confirmed by real-time RT-PCR and further investigated at protein level over the time course of 0 to 36 h after TA treatment. Diabetic condition itself increased gene expression of proteases and decreased gene expression of protease inhibitors. Administration of 300 mg TA/kg to DB rats further elevated gene expression of proteases and suppressed gene expression of protease inhibitors, explaining progression of liver injury in DB rats after TA treatment. Inhibited expression of genes involved in cell division cycle (cyclin D1, IGFBP-1, ras, E2F) was observed after exposure of DB rats to 300 mg TA/kg, explaining inhibited tissue repair in these rats. On the other hand, DB rats receiving 30 mg TA/kg exhibit delayed expression of genes involved in cell division cycle, explaining delayed tissue repair in these rats. In conclusion, impaired cyclin D1 signaling along with increased proteases and decreased protease inhibitors may explain impaired tissue repair that leads to progression of liver injury initiated by TA in DB rats

  10. Dynamic association rules for gene expression data analysis.

    Science.gov (United States)

    Chen, Shu-Chuan; Tsai, Tsung-Hsien; Chung, Cheng-Han; Li, Wen-Hsiung

    2015-10-14

    The purpose of gene expression analysis is to look for the association between regulation of gene expression levels and phenotypic variations. This association based on gene expression profile has been used to determine whether the induction/repression of genes correspond to phenotypic variations including cell regulations, clinical diagnoses and drug development. Statistical analyses on microarray data have been developed to resolve gene selection issue. However, these methods do not inform us of causality between genes and phenotypes. In this paper, we propose the dynamic association rule algorithm (DAR algorithm) which helps ones to efficiently select a subset of significant genes for subsequent analysis. The DAR algorithm is based on association rules from market basket analysis in marketing. We first propose a statistical way, based on constructing a one-sided confidence interval and hypothesis testing, to determine if an association rule is meaningful. Based on the proposed statistical method, we then developed the DAR algorithm for gene expression data analysis. The method was applied to analyze four microarray datasets and one Next Generation Sequencing (NGS) dataset: the Mice Apo A1 dataset, the whole genome expression dataset of mouse embryonic stem cells, expression profiling of the bone marrow of Leukemia patients, Microarray Quality Control (MAQC) data set and the RNA-seq dataset of a mouse genomic imprinting study. A comparison of the proposed method with the t-test on the expression profiling of the bone marrow of Leukemia patients was conducted. We developed a statistical way, based on the concept of confidence interval, to determine the minimum support and minimum confidence for mining association relationships among items. With the minimum support and minimum confidence, one can find significant rules in one single step. The DAR algorithm was then developed for gene expression data analysis. Four gene expression datasets showed that the proposed

  11. A Microarray Study of Middle Cerebral Occlusion Rat Brain with Acupuncture Intervention

    Directory of Open Access Journals (Sweden)

    Chao Zhang

    2015-01-01

    Full Text Available Microarray analysis was used to investigate the changes of gene expression of ischemic stroke and acupuncture intervention in middle cerebral artery occlusion (MCAo rat brain. Results showed that acupuncture intervention had a remarkable improvement in neural deficit score, cerebral blood flow, and cerebral infarction volume of MCAo rats. Microarray analysis showed that a total of 627 different expression genes were regulated in ischemic stroke. 417 genes were upregulated and 210 genes were downregulated. A total of 361 different expression genes were regulated after acupuncture intervention. Three genes were upregulated and 358 genes were downregulated. The expression of novel genes after acupuncture intervention, including Tph1 and Olr883, was further analyzed by Real-Time Quantitative Polymerase Chain Reaction (RT-PCR. Upregulation of Tph1 and downregulation of Olr883 indicated that the therapeutic effect of acupuncture for ischemic stroke may be closely related to the suppression of poststroke depression and regulation of olfactory transduction. In conclusion, the present study may enrich our understanding of the multiple pathological process of ischemic brain injury and indicate possible mechanisms of acupuncture on ischemic stroke.

  12. DNA microarray analysis of fim mutations in Escherichia coli

    DEFF Research Database (Denmark)

    Schembri, Mark; Ussery, David; Workman, Christopher

    2002-01-01

    Bacterial adhesion is often mediated by complex polymeric surface structures referred to as fimbriae. Type I fimbriae of Escherichia coli represent the archetypical and best characterised fimbrial system. These adhesive organelles mediate binding to D-mannose and are directly associated...... we have used DNA microarray analysis to examine the molecular events involved in response to fimbrial gene expression in E. coli K-12. Observed differential expression levels of the fim genes were in good agreement with our current knowledge of the stoichiometry of type I fimbriae. Changes in fim...

  13. Calling biomarkers in milk using a protein microarray on your smartphone

    NARCIS (Netherlands)

    Ludwig, S.K.J.; Tokarski, Christian; Lang, Stefan N.; Ginkel, Van L.A.; Zhu, Hongying; Ozcan, Aydogan; Nielen, M.W.F.

    2015-01-01

    Here we present the concept of a protein microarray-based fluorescence immunoassay for multiple biomarker detection in milk extracts by an ordinary smartphone. A multiplex immunoassay was designed on a microarray chip, having built-in positive and negative quality controls. After the immunoassay

  14. A study of metaheuristic algorithms for high dimensional feature selection on microarray data

    Science.gov (United States)

    Dankolo, Muhammad Nasiru; Radzi, Nor Haizan Mohamed; Sallehuddin, Roselina; Mustaffa, Noorfa Haszlinna

    2017-11-01

    Microarray systems enable experts to examine gene profile at molecular level using machine learning algorithms. It increases the potentials of classification and diagnosis of many diseases at gene expression level. Though, numerous difficulties may affect the efficiency of machine learning algorithms which includes vast number of genes features comprised in the original data. Many of these features may be unrelated to the intended analysis. Therefore, feature selection is necessary to be performed in the data pre-processing. Many feature selection algorithms are developed and applied on microarray which including the metaheuristic optimization algorithms. This paper discusses the application of the metaheuristics algorithms for feature selection in microarray dataset. This study reveals that, the algorithms have yield an interesting result with limited resources thereby saving computational expenses of machine learning algorithms.

  15. Microarray evaluation of gene expression profiles in inflamed and healthy human dental pulp: the role of IL1beta and CD40 in pulp inflammation.

    Science.gov (United States)

    Gatta, V; Zizzari, V L; Dd ' Amico, V; Salini, L; D' Aurora, M; Franchi, S; Antonucci, I; Sberna, M T; Gherlone, E; Stuppia, L; Tetè, S

    2012-01-01

    Dental pulp undergoes a number of changes passing from healthy status to inflammation due to deep decay. These changes are regulated by several genes resulting differently expressed in inflamed and healthy dental pulp, and the knowledge of the processes underlying this differential expression is of great relevance in the identification of the pathogenesis of the disease. In this study, the gene expression profile of inflamed and healthy dental pulps were compared by microarray analysis, and data obtained were analyzed by Ingenuity Pathway Analysis (IPA) software. This analysis allows to focus on a variety of genes, typically expressed in inflamed tissues. The comparison analysis showed an increased expression of several genes in inflamed pulp, among which IL1β and CD40 resulted of particular interest. These results indicate that gene expression profile of human dental pulp in different physiological and pathological conditions may become an useful tool for improving our knowledge about processes regulating pulp inflammation.

  16. Detection of NASBA amplified bacterial tmRNA molecules on SLICSel designed microarray probes

    Directory of Open Access Journals (Sweden)

    Toome Kadri

    2011-02-01

    Full Text Available Abstract Background We present a comprehensive technological solution for bacterial diagnostics using tmRNA as a marker molecule. A robust probe design algorithm for microbial detection microarray is implemented. The probes were evaluated for specificity and, combined with NASBA (Nucleic Acid Sequence Based Amplification amplification, for sensitivity. Results We developed a new web-based program SLICSel for the design of hybridization probes, based on nearest-neighbor thermodynamic modeling. A SLICSel minimum binding energy difference criterion of 4 kcal/mol was sufficient to design of Streptococcus pneumoniae tmRNA specific microarray probes. With lower binding energy difference criteria, additional hybridization specificity tests on the microarray were needed to eliminate non-specific probes. Using SLICSel designed microarray probes and NASBA we were able to detect S. pneumoniae tmRNA from a series of total RNA dilutions equivalent to the RNA content of 0.1-10 CFU. Conclusions The described technological solution and both its separate components SLICSel and NASBA-microarray technology independently are applicative for many different areas of microbial diagnostics.

  17. Detection of NASBA amplified bacterial tmRNA molecules on SLICSel designed microarray probes

    LENUS (Irish Health Repository)

    Scheler, Ott

    2011-02-28

    Abstract Background We present a comprehensive technological solution for bacterial diagnostics using tmRNA as a marker molecule. A robust probe design algorithm for microbial detection microarray is implemented. The probes were evaluated for specificity and, combined with NASBA (Nucleic Acid Sequence Based Amplification) amplification, for sensitivity. Results We developed a new web-based program SLICSel for the design of hybridization probes, based on nearest-neighbor thermodynamic modeling. A SLICSel minimum binding energy difference criterion of 4 kcal\\/mol was sufficient to design of Streptococcus pneumoniae tmRNA specific microarray probes. With lower binding energy difference criteria, additional hybridization specificity tests on the microarray were needed to eliminate non-specific probes. Using SLICSel designed microarray probes and NASBA we were able to detect S. pneumoniae tmRNA from a series of total RNA dilutions equivalent to the RNA content of 0.1-10 CFU. Conclusions The described technological solution and both its separate components SLICSel and NASBA-microarray technology independently are applicative for many different areas of microbial diagnostics.

  18. A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability

    NARCIS (Netherlands)

    Sontrop, H.M.J.; Moerland, P.D.; Van den Ham, R.; Reinders, M.J.T.; Verhaegh, W.F.J.

    2009-01-01

    Background: Large discrepancies in signature composition and outcome concordance have been observed between different microarray breast cancer expression profiling studies. This is often ascribed to differences in array platform as well as biological variability. We conjecture that other reasons for

  19. A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability

    NARCIS (Netherlands)

    Sontrop, Herman M. J.; Moerland, Perry D.; van den Ham, René; Reinders, Marcel J. T.; Verhaegh, Wim F. J.

    2009-01-01

    Large discrepancies in signature composition and outcome concordance have been observed between different microarray breast cancer expression profiling studies. This is often ascribed to differences in array platform as well as biological variability. We conjecture that other reasons for the

  20. Quantitative inference of dynamic regulatory pathways via microarray data

    Directory of Open Access Journals (Sweden)

    Chen Bor-Sen

    2005-03-01

    Full Text Available Abstract Background The cellular signaling pathway (network is one of the main topics of organismic investigations. The intracellular interactions between genes in a signaling pathway are considered as the foundation of functional genomics. Thus, what genes and how much they influence each other through transcriptional binding or physical interactions are essential problems. Under the synchronous measures of gene expression via a microarray chip, an amount of dynamic information is embedded and remains to be discovered. Using a systematically dynamic modeling approach, we explore the causal relationship among genes in cellular signaling pathways from the system biology approach. Results In this study, a second-order dynamic model is developed to describe the regulatory mechanism of a target gene from the upstream causality point of view. From the expression profile and dynamic model of a target gene, we can estimate its upstream regulatory function. According to this upstream regulatory function, we would deduce the upstream regulatory genes with their regulatory abilities and activation delays, and then link up a regulatory pathway. Iteratively, these regulatory genes are considered as target genes to trace back their upstream regulatory genes. Then we could construct the regulatory pathway (or network to the genome wide. In short, we can infer the genetic regulatory pathways from gene-expression profiles quantitatively, which can confirm some doubted paths or seek some unknown paths in a regulatory pathway (network. Finally, the proposed approach is validated by randomly reshuffling the time order of microarray data. Conclusion We focus our algorithm on the inference of regulatory abilities of the identified causal genes, and how much delay before they regulate the downstream genes. With this information, a regulatory pathway would be built up using microarray data. In the present study, two signaling pathways, i.e. circadian regulatory

  1. Expression of the G protein-coupled estrogen receptor (GPER in endometriosis: a tissue microarray study

    Directory of Open Access Journals (Sweden)

    Samartzis Nicolas

    2012-04-01

    Full Text Available Abstract Background The G protein-coupled estrogen receptor (GPER is thought to be involved in non-genomic estrogen responses as well as processes such as cell proliferation and migration. In this study, we analyzed GPER expression patterns from endometriosis samples and normal endometrial tissue samples and compared these expression profiles to those of the classical sex hormone receptors. Methods A tissue microarray, which included 74 samples from different types of endometriosis (27 ovarian, 19 peritoneal and 28 deep-infiltrating and 30 samples from normal endometrial tissue, was used to compare the expression levels of the GPER, estrogen receptor (ER-alpha, ER-beta and progesterone receptor (PR. The immunoreactive score (IRS was calculated separately for epithelium and stroma as the product of the staining intensity and the percentage of positive cells. The expression levels of the hormonal receptors were dichotomized into low (IRS  =6 expression groups. Results The mean epithelial IRS (+/−standard deviation, range of cytoplasmic GPER expression was 1.2 (+/−1.7, 0–4 in normal endometrium and 5.1 (+/−3.5, 0–12 in endometriosis (p p = 0.71, of ER-alpha 10.6 (+/−2.4, 3–12 and 9.8 (+/−3.0, 2–12; p = 0.26, of ER-beta 2.4 (+/−2.2; 0–8 and 5.6 (+/−2.6; 0–10; p p p p = 0.001, of ER-beta 1.8 (+/−2.0; 0–8 and 5.4 (+/−2.5; 0–10; p p���= 0.044, respectively. Cytoplasmic GPER expression was not detectable in the stroma of endometrium and endometriosis. The observed frequency of high epithelial cytoplasmic GPER expression levels was 50% (n = 30/60 in the endometriosis and none (0/30 in the normal endometrium samples (p p = 0.01, as compared to peritoneal (9/18, 50% or deep-infiltrating endometriotic lesions (7/22, 31.8%. The frequency of high stromal nuclear GPER expression levels was 100% (n = 74/74 in endometriosis and 76.7% (n = 23/30 in normal endometrium (p

  2. Tissue microarrays for testing basal biomarkers in familial breast cancer cases

    Directory of Open Access Journals (Sweden)

    Rozany Mucha Dufloth

    Full Text Available CONTEXT AND OBJECTIVE: The proteins p63, p-cadherin and CK5 are consistently expressed by the basal and myoepithelial cells of the breast, although their expression in sporadic and familial breast cancer cases has yet to be fully defined. The aim here was to study the basal immunopro-file of a breast cancer case series using tissue microarray technology. DESIGN AND SETTING: This was a cross-sectional study at Universidade Estadual de Campinas, Brazil, and the Institute of Pathology and Mo-lecular Immunology, Porto, Portugal. METHODS: Immunohistochemistry using the antibodies p63, CK5 and p-cadherin, and also estrogen receptor (ER and Human Epidermal Receptor Growth Factor 2 (HER2, was per-formed on 168 samples from a breast cancer case series. The criteria for identifying women at high risk were based on those of the Breast Cancer Linkage Consortium. RESULTS: Familial tumors were more frequently positive for the p-cadherin (p = 0.0004, p63 (p < 0.0001 and CK5 (p < 0.0001 than was sporadic cancer. Moreover, familial tumors had coexpression of the basal biomarkers CK5+/ p63+, grouped two by two (OR = 34.34, while absence of coexpression (OR = 0.13 was associ-ated with the sporadic cancer phenotype. CONCLUSION: Familial breast cancer was found to be associated with basal biomarkers, using tissue microarray technology. Therefore, characterization of the familial breast cancer phenotype will improve the understanding of breast carcinogenesis.

  3. An Entropy-based gene selection method for cancer classification using microarray data

    Directory of Open Access Journals (Sweden)

    Krishnan Arun

    2005-03-01

    Full Text Available Abstract Background Accurate diagnosis of cancer subtypes remains a challenging problem. Building classifiers based on gene expression data is a promising approach; yet the selection of non-redundant but relevant genes is difficult. The selected gene set should be small enough to allow diagnosis even in regular clinical laboratories and ideally identify genes involved in cancer-specific regulatory pathways. Here an entropy-based method is proposed that selects genes related to the different cancer classes while at the same time reducing the redundancy among the genes. Results The present study identifies a subset of features by maximizing the relevance and minimizing the redundancy of the selected genes. A merit called normalized mutual information is employed to measure the relevance and the redundancy of the genes. In order to find a more representative subset of features, an iterative procedure is adopted that incorporates an initial clustering followed by data partitioning and the application of the algorithm to each of the partitions. A leave-one-out approach then selects the most commonly selected genes across all the different runs and the gene selection algorithm is applied again to pare down the list of selected genes until a minimal subset is obtained that gives a satisfactory accuracy of classification. The algorithm was applied to three different data sets and the results obtained were compared to work done by others using the same data sets Conclusion This study presents an entropy-based iterative algorithm for selecting genes from microarray data that are able to classify various cancer sub-types with high accuracy. In addition, the feature set obtained is very compact, that is, the redundancy between genes is reduced to a large extent. This implies that classifiers can be built with a smaller subset of genes.

  4. Partial Least Squares Based Gene Expression Analysis in EBV- Positive and EBV-Negative Posttransplant Lymphoproliferative Disorders.

    Science.gov (United States)

    Wu, Sa; Zhang, Xin; Li, Zhi-Ming; Shi, Yan-Xia; Huang, Jia-Jia; Xia, Yi; Yang, Hang; Jiang, Wen-Qi

    2013-01-01

    Post-transplant lymphoproliferative disorder (PTLD) is a common complication of therapeutic immunosuppression after organ transplantation. Gene expression profile facilitates the identification of biological difference between Epstein-Barr virus (EBV) positive and negative PTLDs. Previous studies mainly implemented variance/regression analysis without considering unaccounted array specific factors. The aim of this study is to investigate the gene expression difference between EBV positive and negative PTLDs through partial least squares (PLS) based analysis. With a microarray data set from the Gene Expression Omnibus database, we performed PLS based analysis. We acquired 1188 differentially expressed genes. Pathway and Gene Ontology enrichment analysis identified significantly over-representation of dysregulated genes in immune response and cancer related biological processes. Network analysis identified three hub genes with degrees higher than 15, including CREBBP, ATXN1, and PML. Proteins encoded by CREBBP and PML have been reported to be interact with EBV before. Our findings shed light on expression distinction of EBV positive and negative PTLDs with the hope to offer theoretical support for future therapeutic study.

  5. Analysis of baseline gene expression levels from ...

    Science.gov (United States)

    The use of gene expression profiling to predict chemical mode of action would be enhanced by better characterization of variance due to individual, environmental, and technical factors. Meta-analysis of microarray data from untreated or vehicle-treated animals within the control arm of toxicogenomics studies has yielded useful information on baseline fluctuations in gene expression. A dataset of control animal microarray expression data was assembled by a working group of the Health and Environmental Sciences Institute's Technical Committee on the Application of Genomics in Mechanism Based Risk Assessment in order to provide a public resource for assessments of variability in baseline gene expression. Data from over 500 Affymetrix microarrays from control rat liver and kidney were collected from 16 different institutions. Thirty-five biological and technical factors were obtained for each animal, describing a wide range of study characteristics, and a subset were evaluated in detail for their contribution to total variability using multivariate statistical and graphical techniques. The study factors that emerged as key sources of variability included gender, organ section, strain, and fasting state. These and other study factors were identified as key descriptors that should be included in the minimal information about a toxicogenomics study needed for interpretation of results by an independent source. Genes that are the most and least variable, gender-selectiv

  6. A reverse-phase protein microarray-based screen identifies host signaling dynamics upon Burkholderia spp. infection

    Directory of Open Access Journals (Sweden)

    Chih-Yuan eChiang

    2015-07-01

    Full Text Available Burkholderia is a diverse genus of Gram-negative bacteria that cause high mortality rate in humans and cattle. The lack of effective therapeutic treatments poses serious public health threats. Insights toward host-Burkholderia spp. interaction are critical in understanding the pathogenesis of the infection as well as identifying therapeutic targets for drug development. Reverse-phase protein microarray (RPMA technology was previously proven to characterize novel biomarkers and molecular signatures associated with infectious diseases and cancers. In the present study, this technology was utilized to interrogate changes in host protein expression and post-translational phosphorylation events in macrophages infected with a collection of geographically diverse strains of Burkholderia spp. The expression or phosphorylation state of 25 proteins was altered during Burkholderia spp. infections and of which eight proteins were selected for further validation by immunoblotting. Kinetic expression patterns of phosphorylated AMPK-α1, Src, and GSK3β suggested the importance of their roles in regulating Burkholderia spp. mediated innate immune responses. Modulating inflammatory responses by perturbing AMPK-α1, Src, and GSK3β activities may provide novel therapeutic targets for future treatments.

  7. Application of microarray and functional-based screening methods for the detection of antimicrobial resistance genes in the microbiomes of healthy humans.

    Directory of Open Access Journals (Sweden)

    Roderick M Card

    Full Text Available The aim of this study was to screen for the presence of antimicrobial resistance genes within the saliva and faecal microbiomes of healthy adult human volunteers from five European countries. Two non-culture based approaches were employed to obviate potential bias associated with difficult to culture members of the microbiota. In a gene target-based approach, a microarray was employed to screen for the presence of over 70 clinically important resistance genes in the saliva and faecal microbiomes. A total of 14 different resistance genes were detected encoding resistances to six antibiotic classes (aminoglycosides, β-lactams, macrolides, sulphonamides, tetracyclines and trimethoprim. The most commonly detected genes were erm(B, blaTEM, and sul2. In a functional-based approach, DNA prepared from pooled saliva samples was cloned into Escherichia coli and screened for expression of resistance to ampicillin or sulphonamide, two of the most common resistances found by array. The functional ampicillin resistance screen recovered genes encoding components of a predicted AcrRAB efflux pump. In the functional sulphonamide resistance screen, folP genes were recovered encoding mutant dihydropteroate synthase, the target of sulphonamide action. The genes recovered from the functional screens were from the chromosomes of commensal species that are opportunistically pathogenic and capable of exchanging DNA with related pathogenic species. Genes identified by microarray were not recovered in the activity-based screen, indicating that these two methods can be complementary in facilitating the identification of a range of resistance mechanisms present within the human microbiome. It also provides further evidence of the diverse reservoir of resistance mechanisms present in bacterial populations in the human gut and saliva. In future the methods described in this study can be used to monitor changes in the resistome in response to antibiotic therapy.

  8. The EADGENE Microarray Data Analysis Workshop

    DEFF Research Database (Denmark)

    de Koning, Dirk-Jan; Jaffrézic, Florence; Lund, Mogens Sandø

    2007-01-01

    Microarray analyses have become an important tool in animal genomics. While their use is becoming widespread, there is still a lot of ongoing research regarding the analysis of microarray data. In the context of a European Network of Excellence, 31 researchers representing 14 research groups from...... 10 countries performed and discussed the statistical analyses of real and simulated 2-colour microarray data that were distributed among participants. The real data consisted of 48 microarrays from a disease challenge experiment in dairy cattle, while the simulated data consisted of 10 microarrays...... statistical weights, to omitting a large number of spots or omitting entire slides. Surprisingly, these very different approaches gave quite similar results when applied to the simulated data, although not all participating groups analysed both real and simulated data. The workshop was very successful...

  9. Microarray Expression Profile of Circular RNAs in Heart Tissue of Mice with Myocardial Infarction-Induced Heart Failure

    Directory of Open Access Journals (Sweden)

    Hong-Jin Wu

    2016-06-01

    Full Text Available Background/Aims: Myocardial infarction (MI is a serious complication of atherosclerosis associated with increasing mortality attributable to heart failure. This study is aimed to assess the global changes in and characteristics of the transcriptome of circular RNAs (circRNAs in heart tissue during MI induced heart failure (HF. Methods: Using a post-myocardial infarction (MI model of HF in mice, we applied microarray assay to examine the transcriptome of circRNAs deregulated in the heart during HF. We confirmed the changes in circRNAs by quantitative PCR. Results: We revealed and confirmed a number of circRNAs that were deregulated during HF, which suggests a potential role of circRNAs in HF. Conclusions: The distinct expression patterns of circulatory circRNAs during HF indicate that circRNAs may actively respond to stress and thus serve as biomarkers of HF diagnosis and treatment.

  10. Microarray-based identification and RT-PCR test screening for epithelial-specific mRNAs in peripheral blood of patients with colon cancer

    Directory of Open Access Journals (Sweden)

    Coppola Domenico

    2006-10-01

    Full Text Available Abstract Background The efficacy of screening for colorectal cancer using a simple blood-based assay for the detection of tumor cells disseminated in the circulation at an early stage of the disease is gaining positive feedback from several lines of research. This method seems able to reduce colorectal cancer mortality and may replace colonoscopy as the most effective means of detecting colonic lesions. Methods In this work, we present a new microarray-based high-throughput screening method to identifying candidate marker mRNAs for the early detection of epithelial cells diluted in peripheral blood cells. This method includes 1. direct comparison of different samples of colonic mucosa and of blood cells to identify consistent epithelial-specific mRNAs from among 20,000 cDNA assayed by microarray slides; 2. identification of candidate marker mRNAs by data analysis, which allowed selection of only 10 putative differentially expressed genes; 3. Selection of some of the most suitable mRNAs (TMEM69, RANBP3 and PRSS22 that were assayed in blood samples from normal subjects and patients with colon cancer as possible markers for the presence of epithelial cells in the blood, using reverse transcription – polymerase chain reaction (RT-PCR. Results Our present results seem to provide an indication, for the first time obtained by genome-scale screening, that a suitable and consistent colon epithelium mRNA marker may be difficult to identify. Conclusion The design of new approaches to identify such markers is warranted.

  11. Association of adipocyte genes with ASP expression: a microarray analysis of subcutaneous and omental adipose tissue in morbidly obese subjects

    Directory of Open Access Journals (Sweden)

    Lu HuiLing

    2010-01-01

    Full Text Available Abstract Background Prevalence of obesity is increasing to pandemic proportions. However, obese subjects differ in insulin resistance, adipokine production and co-morbidities. Based on fasting plasma analysis, obese subjects were grouped as Low Acylation Stimulating protein (ASP and Triglyceride (TG (LAT vs High ASP and TG (HAT. Subcutaneous (SC and omental (OM adipose tissues (n = 21 were analysed by microarray, and biologic pathways in lipid metabolism and inflammation were specifically examined. Methods LAT and HAT groups were matched in age, obesity, insulin, and glucose, and had similar expression of insulin-related genes (InsR, IRS-1. ASP related genes tended to be increased in the HAT group and were correlated (factor B, adipsin, complement C3, p Results HAT adipose tissue demonstrated increased lipid related genes for storage (CD36, DGAT1, DGAT2, SCD1, FASN, and LPL, lipolysis (HSL, CES1, perilipin, fatty acid binding proteins (FABP1, FABP3 and adipocyte differentiation markers (CEBPα, CEBPβ, PPARγ. By contrast, oxidation related genes were decreased (AMPK, UCP1, CPT1, FABP7. HAT subjects had increased anti-inflammatory genes TGFB1, TIMP1, TIMP3, and TIMP4 while proinflammatory PIG7 and MMP2 were also significantly increased; all genes, p Conclusion Taken together, the profile of C5L2 receptor, ASP gene expression and metabolic factors in adipose tissue from morbidly obese HAT subjects suggests a compensatory response associated with the increased plasma ASP and TG.

  12. Microarray analysis of expression of cell death-associated genes in rat spinal cord cells exposed to cyclic tensile stresses in vitro

    Directory of Open Access Journals (Sweden)

    Roberts Sally

    2010-07-01

    Full Text Available Abstract Background The application of mechanical insults to the spinal cord results in profound cellular and molecular changes, including the induction of neuronal cell death and altered gene expression profiles. Previous studies have described alterations in gene expression following spinal cord injury, but the specificity of this response to mechanical stimuli is difficult to investigate in vivo. Therefore, we have investigated the effect of cyclic tensile stresses on cultured spinal cord cells from E15 Sprague-Dawley rats, using the FX3000® Flexercell Strain Unit. We examined cell morphology and viability over a 72 hour time course. Microarray analysis of gene expression was performed using the Affymetrix GeneChip System®, where categorization of identified genes was performed using the Gene Ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG systems. Changes in expression of 12 genes were validated with quantitative real-time reverse transcription polymerase chain reaction (RT-PCR. Results The application of cyclic tensile stress reduced the viability of cultured spinal cord cells significantly in a dose- and time-dependent manner. Increasing either the strain or the strain rate independently was associated with significant decreases in spinal cord cell survival. There was no clear evidence of additive effects of strain level with strain rate. GO analysis identified 44 candidate genes which were significantly related to "apoptosis" and 17 genes related to "response to stimulus". KEGG analysis identified changes in the expression levels of 12 genes of the mitogen-activated protein kinase (MAPK signaling pathway, which were confirmed to be upregulated by RT-PCR analysis. Conclusions We have demonstrated that spinal cord cells undergo cell death in response to cyclic tensile stresses, which were dose- and time-dependent. In addition, we have identified the up regulation of various genes, in particular of the MAPK pathway, which

  13. Gene expression of the endolymphatic sac

    DEFF Research Database (Denmark)

    Friis, Morten; Martin-Bertelsen, Tomas; Friis-Hansen, Lennart

    2011-01-01

    that the endolymphatic sac has multiple and diverse functions in the inner ear. Objectives:The objective of this study was to provide a comprehensive review of the genes expressed in the endolymphatic sac in the rat and perform a functional characterization based on measured mRNA abundance. Methods:Microarray technology...

  14. Investigating the effect of paralogs on microarray gene-set analysis

    LENUS (Irish Health Repository)

    Faure, Andre J

    2011-01-24

    Abstract Background In order to interpret the results obtained from a microarray experiment, researchers often shift focus from analysis of individual differentially expressed genes to analyses of sets of genes. These gene-set analysis (GSA) methods use previously accumulated biological knowledge to group genes into sets and then aim to rank these gene sets in a way that reflects their relative importance in the experimental situation in question. We suspect that the presence of paralogs affects the ability of GSA methods to accurately identify the most important sets of genes for subsequent research. Results We show that paralogs, which typically have high sequence identity and similar molecular functions, also exhibit high correlation in their expression patterns. We investigate this correlation as a potential confounding factor common to current GSA methods using Indygene http:\\/\\/www.cbio.uct.ac.za\\/indygene, a web tool that reduces a supplied list of genes so that it includes no pairwise paralogy relationships above a specified sequence similarity threshold. We use the tool to reanalyse previously published microarray datasets and determine the potential utility of accounting for the presence of paralogs. Conclusions The Indygene tool efficiently removes paralogy relationships from a given dataset and we found that such a reduction, performed prior to GSA, has the ability to generate significantly different results that often represent novel and plausible biological hypotheses. This was demonstrated for three different GSA approaches when applied to the reanalysis of previously published microarray datasets and suggests that the redundancy and non-independence of paralogs is an important consideration when dealing with GSA methodologies.

  15. Gene expression profiling to characterize sediment toxicity – a pilot study using Caenorhabditis elegans whole genome microarrays

    Directory of Open Access Journals (Sweden)

    Reifferscheid Georg

    2009-04-01

    Full Text Available Abstract Background Traditionally, toxicity of river sediments is assessed using whole sediment tests with benthic organisms. The challenge, however, is the differentiation between multiple effects caused by complex contaminant mixtures and the unspecific toxicity endpoints such as survival, growth or reproduction. The use of gene expression profiling facilitates the identification of transcriptional changes at the molecular level that are specific to the bio-available fraction of pollutants. Results In this pilot study, we exposed the nematode Caenorhabditis elegans to three sediments of German rivers with varying (low, medium and high levels of heavy metal and organic contamination. Beside chemical analysis, three standard bioassays were performed: reproduction of C. elegans, genotoxicity (Comet assay and endocrine disruption (YES test. Gene expression was profiled using a whole genome DNA-microarray approach to identify overrepresented functional gene categories and derived cellular processes. Disaccharide and glycogen metabolism were found to be affected, whereas further functional pathways, such as oxidative phosphorylation, ribosome biogenesis, metabolism of xenobiotics, aging and several developmental processes were found to be differentially regulated only in response to the most contaminated sediment. Conclusion This study demonstrates how ecotoxicogenomics can identify transcriptional responses in complex mixture scenarios to distinguish different samples of river sediments.

  16. Expression Comparison of Oil Biosynthesis Genes in Oil Palm Mesocarp Tissue Using Custom Array

    Directory of Open Access Journals (Sweden)

    Yick Ching Wong

    2014-11-01

    Full Text Available Gene expression changes that occur during mesocarp development are a major research focus in oil palm research due to the economic importance of this tissue and the relatively rapid increase in lipid content to very high levels at fruit ripeness. Here, we report the development of a transcriptome-based 105,000-probe oil palm mesocarp microarray. The expression of genes involved in fatty acid (FA and triacylglycerol (TAG assembly, along with the tricarboxylic acid cycle (TCA and glycolysis pathway at 16 Weeks After Anthesis (WAA exhibited significantly higher signals compared to those obtained from a cross-species hybridization to the Arabidopsis (p-value < 0.01, and rice (p-value < 0.01 arrays. The oil palm microarray data also showed comparable correlation of expression (r2 = 0.569, p < 0.01 throughout mesocarp development to transcriptome (RNA sequencing data, and improved correlation over quantitative real-time PCR (qPCR (r2 = 0.721, p < 0.01 of the same RNA samples. The results confirm the advantage of the custom microarray over commercially available arrays derived from model species. We demonstrate the utility of this custom microarray to gain a better understanding of gene expression patterns in the oil palm mesocarp that may lead to increasing future oil yield.

  17. Expression Comparison of Oil Biosynthesis Genes in Oil Palm Mesocarp Tissue Using Custom Array

    Science.gov (United States)

    Wong, Yick Ching; Kwong, Qi Bin; Lee, Heng Leng; Ong, Chuang Kee; Mayes, Sean; Chew, Fook Tim; Appleton, David R.; Kulaveerasingam, Harikrishna

    2014-01-01

    Gene expression changes that occur during mesocarp development are a major research focus in oil palm research due to the economic importance of this tissue and the relatively rapid increase in lipid content to very high levels at fruit ripeness. Here, we report the development of a transcriptome-based 105,000-probe oil palm mesocarp microarray. The expression of genes involved in fatty acid (FA) and triacylglycerol (TAG) assembly, along with the tricarboxylic acid cycle (TCA) and glycolysis pathway at 16 Weeks After Anthesis (WAA) exhibited significantly higher signals compared to those obtained from a cross-species hybridization to the Arabidopsis (p-value < 0.01), and rice (p-value < 0.01) arrays. The oil palm microarray data also showed comparable correlation of expression (r2 = 0.569, p < 0.01) throughout mesocarp development to transcriptome (RNA sequencing) data, and improved correlation over quantitative real-time PCR (qPCR) (r2 = 0.721, p < 0.01) of the same RNA samples. The results confirm the advantage of the custom microarray over commercially available arrays derived from model species. We demonstrate the utility of this custom microarray to gain a better understanding of gene expression patterns in the oil palm mesocarp that may lead to increasing future oil yield. PMID:27600348

  18. Differential binding of calmodulin-related proteins to their targets revealed through high-density Arabidopsis protein microarrays

    Science.gov (United States)

    Popescu, Sorina C.; Popescu, George V.; Bachan, Shawn; Zhang, Zimei; Seay, Montrell; Gerstein, Mark; Snyder, Michael; Dinesh-Kumar, S. P.

    2007-01-01

    Calmodulins (CaMs) are the most ubiquitous calcium sensors in eukaryotes. A number of CaM-binding proteins have been identified through classical methods, and many proteins have been predicted to bind CaMs based on their structural homology with known targets. However, multicellular organisms typically contain many CaM-like (CML) proteins, and a global identification of their targets and specificity of interaction is lacking. In an effort to develop a platform for large-scale analysis of proteins in plants we have developed a protein microarray and used it to study the global analysis of CaM/CML interactions. An Arabidopsis thaliana expression collection containing 1,133 ORFs was generated and used to produce proteins with an optimized medium-throughput plant-based expression system. Protein microarrays were prepared and screened with several CaMs/CMLs. A large number of previously known and novel CaM/CML targets were identified, including transcription factors, receptor and intracellular protein kinases, F-box proteins, RNA-binding proteins, and proteins of unknown function. Multiple CaM/CML proteins bound many binding partners, but the majority of targets were specific to one or a few CaMs/CMLs indicating that different CaM family members function through different targets. Based on our analyses, the emergent CaM/CML interactome is more extensive than previously predicted. Our results suggest that calcium functions through distinct CaM/CML proteins to regulate a wide range of targets and cellular activities. PMID:17360592

  19. Facilitating RNA structure prediction with microarrays.

    Science.gov (United States)

    Kierzek, Elzbieta; Kierzek, Ryszard; Turner, Douglas H; Catrina, Irina E

    2006-01-17

    Determining RNA secondary structure is important for understanding structure-function relationships and identifying potential drug targets. This paper reports the use of microarrays with heptamer 2'-O-methyl oligoribonucleotides to probe the secondary structure of an RNA and thereby improve the prediction of that secondary structure. When experimental constraints from hybridization results are added to a free-energy minimization algorithm, the prediction of the secondary structure of Escherichia coli 5S rRNA improves from 27 to 92% of the known canonical base pairs. Optimization of buffer conditions for hybridization and application of 2'-O-methyl-2-thiouridine to enhance binding and improve discrimination between AU and GU pairs are also described. The results suggest that probing RNA with oligonucleotide microarrays can facilitate determination of secondary structure.

  20. A microarray analysis of two distinct lymphatic endothelial cell populations

    Directory of Open Access Journals (Sweden)

    Bernhard Schweighofer

    2015-06-01

    Full Text Available We have recently identified lymphatic endothelial cells (LECs to form two morphologically different populations, exhibiting significantly different surface protein expression levels of podoplanin, a major surface marker for this cell type. In vitro shockwave treatment (IVSWT of LECs resulted in enrichment of the podoplaninhigh cell population and was accompanied by markedly increased cell proliferation, as well as 2D and 3D migration. Gene expression profiles of these distinct populations were established using Affymetrix microarray analyses. Here we provide additional details about our dataset (NCBI GEO accession number GSE62510 and describe how we analyzed the data to identify differently expressed genes in these two LEC populations.