WorldWideScience

Sample records for gene array-based analysis

  1. Gene expression profiles in squamous cell cervical carcinoma using array-based comparative genomic hybridization analysis.

    Science.gov (United States)

    Choi, Y-W; Bae, S M; Kim, Y-W; Lee, H N; Kim, Y W; Park, T C; Ro, D Y; Shin, J C; Shin, S J; Seo, J-S; Ahn, W S

    2007-01-01

    Our aim was to identify novel genomic regions of interest and provide highly dynamic range information on correlation between squamous cell cervical carcinoma and its related gene expression patterns by a genome-wide array-based comparative genomic hybridization (array-CGH). We analyzed 15 cases of cervical cancer from KangNam St Mary's Hospital of the Catholic University of Korea. Microdissection assay was performed to obtain DNA samples from paraffin-embedded cervical tissues of cancer as well as of the adjacent normal tissues. The bacterial artificial chromosome (BAC) array used in this study consisted of 1440 human BACs and the space among the clones was 2.08 Mb. All the 15 cases of cervical cancer showed the differential changes of the cervical cancer-associated genetic alterations. The analysis limit of average gains and losses was 53%. A significant positive correlation was found in 8q24.3, 1p36.32, 3q27.1, 7p21.1, 11q13.1, and 3p14.2 changes through the cervical carcinogenesis. The regions of high level of gain were 1p36.33-1p36.32, 8q24.3, 16p13.3, 1p36.33, 3q27.1, and 7p21.1. And the regions of homozygous loss were 2q12.1, 22q11.21, 3p14.2, 6q24.3, 7p15.2, and 11q25. In the high level of gain regions, GSDMDC1, RECQL4, TP73, ABCF3, ALG3, HDAC9, ESRRA, and RPS6KA4 were significantly correlated with cervical cancer. The genes encoded by frequently lost clones were PTPRG, GRM7, ZDHHC3, EXOSC7, LRP1B, and NR3C2. Therefore, array-CGH analyses showed that specific genomic alterations were maintained in cervical cancer that were critical to the malignant phenotype and may give a chance to find out possible target genes present in the gained or lost clones.

  2. 1-Mb resolution array-based comparative genomic hybridization using a BAC clone set optimized for cancer gene analysis

    NARCIS (Netherlands)

    Greshock, J; Naylor, TL; Margolin, A; Diskin, S; Cleaver, SH; Futreal, PA; deJong, PJ; Zhao, SY; Liebman, M; Weber, BL

    2004-01-01

    Array-based comparative genomic hybridization (aCGH) is a recently developed tool for genome-wide determination of DNA copy number alterations. This technology has tremendous potential for disease-gene discovery in cancer and developmental disorders as well as numerous other applications. However, w

  3. Global analysis of candidate genes important for fitness in a competitive biofilm using DNA-array-based transposon mapping.

    Science.gov (United States)

    Junker, Lauren M; Peters, Joseph E; Hay, Anthony G

    2006-08-01

    Escherichia coli strain PHL628 was subjected to saturating Tn5 transposon mutagenesis and then grown under competitive planktonic or biofilm conditions. The locations of transposon insertions from the remaining cells were then mapped on a gene array. The results from the array mapping indicated that 4.5 % of the E. coli genome was important under these conditions. Specifically, 114 genes were identified as important for the biofilm lifestyle, whereas 80 genes were important for the planktonic lifestyle. Four broad functional categories were identified as biofilm-important. These included genes encoding cell structures, small-molecule transport, energy metabolism and regulatory functions. For one of these genes, arcA, an insertion mutant was generated and its biofilm-related phenotype was examined. Results from both the transposon array and insertion mutagenesis indicated that arcA, which is known to be a negative response regulator of genes in aerobic pathways, was important for competitiveness in E. coli PHL628 biofilms. This work also demonstrated that ligation-mediated PCR, coupled with array-based transposon mapping, was an effective tool for identifying a large variety of candidate genes that are important for biofilm fitness.

  4. Functional gene array-based analysis of microbial community structure in groundwaters with a gradient of contaminant levels

    Energy Technology Data Exchange (ETDEWEB)

    Waldron, P.J.; Wu, L.; Van Nostrand, J.D.; Schadt, C.W.; Watson, D.B.; Jardine, P.M.; Palumbo, A.V.; Hazen, T.C.; Zhou, J.

    2009-06-15

    To understand how contaminants affect microbial community diversity, heterogeneity, and functional structure, six groundwater monitoring wells from the Field Research Center of the U.S. Department of Energy Environmental Remediation Science Program (ERSP; Oak Ridge, TN), with a wide range of pH, nitrate, and heavy metal contamination were investigated. DNA from the groundwater community was analyzed with a functional gene array containing 2006 probes to detect genes involved in metal resistance, sulfate reduction, organic contaminant degradation, and carbon and nitrogen cycling. Microbial diversity decreased in relation to the contamination levels of the wells. Highly contaminated wells had lower gene diversity but greater signal intensity than the pristine well. The microbial composition was heterogeneous, with 17-70% overlap between different wells. Metal-resistant and metal-reducing microorganisms were detected in both contaminated and pristine wells, suggesting the potential for successful bioremediation of metal-contaminated groundwaters. In addition, results of Mantel tests and canonical correspondence analysis indicate that nitrate, sulfate, pH, uranium, and technetium have a significant (p < 0.05) effect on microbial community structure. This study provides an overall picture of microbial community structure in contaminated environments with functional gene arrays by showing that diversity and heterogeneity can vary greatly in relation to contamination.

  5. Evolutionary insights from suffix array-based genome sequence analysis

    Indian Academy of Sciences (India)

    Anindya Poddar; Nagasuma Chandra; Madhavi Ganapathiraju; K Sekar; Judith Klein-Seetharaman; Raj Reddy; N Balakrishnan

    2007-08-01

    Gene and protein sequence analyses, central components of studies in modern biology are easily amenable to string matching and pattern recognition algorithms. The growing need of analysing whole genome sequences more efficiently and thoroughly, has led to the emergence of new computational methods. Suffix trees and suffix arrays are data structures, well known in many other areas and are highly suited for sequence analysis too. Here we report an improvement to the design of construction of suffix arrays. Enhancement in versatility and scalability, enabled by this approach, is demonstrated through the use of real-life examples. The scalability of the algorithm to whole genomes renders it suitable to address many biologically interesting problems. One example is the evolutionary insight gained by analysing unigrams, bi-grams and higher n-grams, indicating that the genetic code has a direct influence on the overall composition of the genome. Further, different proteomes have been analysed for the coverage of the possible peptide space, which indicate that as much as a quarter of the total space at the tetra-peptide level is left un-sampled in prokaryotic organisms, although almost all tri-peptides can be seen in one protein or another in a proteome. Besides, distinct patterns begin to emerge for the counts of particular tetra and higher peptides, indicative of a ‘meaning’ for tetra and higher n-grams. The toolkit has also been used to demonstrate the usefulness of identifying repeats in whole proteomes efficiently. As an example, 16 members of one COG, coded by the genome of Mycobacterium tuberculosis H37Rv have been found to contain a repeating sequence of 300 amino acids.

  6. Sample processing for DNA chip array-based analysis of enterohemorrhagic Escherichia coli (EHEC

    Directory of Open Access Journals (Sweden)

    Enfors Sven-Olof

    2008-10-01

    Full Text Available Abstract Background Exploitation of DNA-based analyses of microbial pathogens, and especially simultaneous typing of several virulence-related genes in bacteria is becoming an important objective of public health these days. Results A procedure for sample processing for a confirmative analysis of enterohemorrhagic Escherichia coli (EHEC on a single colony with DNA chip array was developed and is reported here. The protocol includes application of fragmented genomic DNA from ultrasonicated colonies. The sample processing comprises first 2.5 min of ultrasonic treatment, DNA extraction (2×, and afterwards additional 5 min ultrasonication. Thus, the total sample preparation time for a confirmative analysis of EHEC is nearly 10 min. Additionally, bioinformatic revisions were performed in order to design PCR primers and array probes specific to most conservative regions of the EHEC-associated genes. Six strains with distinct pathogenic properties were selected for this study. At last, the EHEC chip array for a parallel and simultaneous detection of genes etpC-stx1-stx2-eae was designed and examined. This should permit to sense all currently accessible variants of the selected sequences in EHEC types and subtypes. Conclusion In order to implement the DNA chip array-based analysis for direct EHEC detection the sample processing was established in course of this work. However, this sample preparation mode may also be applied to other types of EHEC DNA-based sensing systems.

  7. Association between chromosomal aberration of COX8C and tethered spinal cord syndrome: array-based comparative genomic hybridization analysis

    Directory of Open Access Journals (Sweden)

    Qiu-jiong Zhao

    2016-01-01

    Full Text Available Copy number variations have been found in patients with neural tube abnormalities. In this study, we performed genome-wide screening using high-resolution array-based comparative genomic hybridization in three children with tethered spinal cord syndrome and two healthy parents. Of eight copy number variations, four were non-polymorphic. These non-polymorphic copy number variations were associated with Angelman and Prader-Willi syndromes, and microcephaly. Gene function enrichment analysis revealed that COX8C, a gene associated with metabolic disorders of the nervous system, was located in the copy number variation region of Patient 1. Our results indicate that array-based comparative genomic hybridization can be used to diagnose tethered spinal cord syndrome. Our results may help determine the pathogenesis of tethered spinal cord syndrome and prevent occurrence of this disease.

  8. Stochastic segmentation models for array-based comparative genomic hybridization data analysis.

    Science.gov (United States)

    Lai, Tze Leung; Xing, Haipeng; Zhang, Nancy

    2008-04-01

    Array-based comparative genomic hybridization (array-CGH) is a high throughput, high resolution technique for studying the genetics of cancer. Analysis of array-CGH data typically involves estimation of the underlying chromosome copy numbers from the log fluorescence ratios and segmenting the chromosome into regions with the same copy number at each location. We propose for the analysis of array-CGH data, a new stochastic segmentation model and an associated estimation procedure that has attractive statistical and computational properties. An important benefit of this Bayesian segmentation model is that it yields explicit formulas for posterior means, which can be used to estimate the signal directly without performing segmentation. Other quantities relating to the posterior distribution that are useful for providing confidence assessments of any given segmentation can also be estimated by using our method. We propose an approximation method whose computation time is linear in sequence length which makes our method practically applicable to the new higher density arrays. Simulation studies and applications to real array-CGH data illustrate the advantages of the proposed approach.

  9. Array-based transcriptional analysis of Clostridium sporogenes UC9000 during germination, cell outgrowth and vegetative life.

    Science.gov (United States)

    Bassi, Daniela; Cappa, Fabrizio; Cocconcelli, Pier Sandro

    2013-02-01

    The members of the genus Clostridium, including the spore-forming anaerobic bacteria, have a complex and strictly regulated life cycle, but very little is known about the genetic pathways involved in the different stages of their life cycle. Clostridium sporogenes, a Gram-positive bacterium usually involved in food spoilage and frequently isolated from late blowing cheese, is genetically indistinguishable from the proteolytic Clostridium botulinum. As the non-neurotoxic counterpart, it is often used as an exemplar for the toxic subtypes. In this work, we performed a microscopic study combined with a custom array-based analysis of the C. sporogenes cycle, from dormant spores to the early stationary phase. We identified a total of 211 transcripts in spores, validating the hypothesis that mRNAs are abundant in spores and the pattern of mRNA expression is strikingly different from that present in growing cells. The spore transcripts included genes responsible for different life-sustaining functions, suggesting there was transcript entrapment or basic poly-functional gene activation for future steps. In addition, 3 h after the beginning of the germination process, 20% of the total up-regulated genes were temporally expressed in germinating spores. The vegetative condition appeared to be more active in terms of gene transcription and protein synthesis than the spore, and genes coding for germination and sporulation factors seemed to be expressed at this point. These results suggest that spores are not silent entities, and a broader knowledge of the genetic pathways involved in the Clostridium life cycle could provide a better understanding of pathogenic clostridia types.

  10. Design of a microchannel-nanochannel-microchannel array based nanoelectroporation system for precise gene transfection.

    Science.gov (United States)

    Gao, Keliang; Li, Lei; He, Lingna; Hinkle, Kevin; Wu, Yun; Ma, Junyu; Chang, Lingqian; Zhao, Xi; Perez, Daniel Gallego; Eckardt, Sigrid; McLaughlin, John; Liu, Boyu; Farson, Dave F; Lee, L James

    2014-03-12

    A micro/nano-fabrication process of a nanochannel electroporation (NEP) array and its application for precise delivery of plasmid for non-viral gene transfection is described. A dip-combing device is optimized to produce DNA nanowires across a microridge array patterned on the polydimethylsiloxane (PDMS) surface with a yield up to 95%. Molecular imprinting based on a low viscosity resin, 1,4-butanediol diacrylate (1,4-BDDA), adopted to convert the microridge-nanowire-microridge array into a microchannel-nanochannel-microchannel (MNM) array. Secondary machining by femtosecond laser ablation is applied to shorten one side of microchannels from 3000 to 50 μm to facilitate cell loading and unloading. The biochip is then sealed in a packaging case with reservoirs and microfluidic channels to enable cell and plasmid loading, and to protect the biochip from leakage and contamination. The package case can be opened for cell unloading after NEP to allow for the follow-up cell culture and analysis. These NEP cases can be placed in a spinning disc and up to ten discs can be piled together for spinning. The resulting centrifugal force can simultaneously manipulate hundreds or thousands of cells into microchannels of NEP arrays within 3 minutes. To demonstrate its application, a 13 kbp OSKM plasmid of induced pluripotent stem cell (iPSC) is injected into mouse embryonic fibroblasts cells (MEFCs). Fluorescence detection of transfected cells within the NEP biochips shows that the delivered dosage is high and much more uniform compared with similar gene transfection carried out by the conventional bulk electroporation (BEP) method.

  11. Clinical array-based karyotyping of breast cancer with equivocal HER2 status resolves gene copy number and reveals chromosome 17 complexity

    Directory of Open Access Journals (Sweden)

    Zadeh Soheila

    2010-07-01

    Full Text Available Abstract Background HER2 gene copy status, and concomitant administration of trastuzumab (Herceptin, remains one of the best examples of targeted cancer therapy based on understanding the genomic etiology of disease. However, newly diagnosed breast cancer cases with equivocal HER2 results present a challenge for the oncologist who must make treatment decisions despite the patient's unresolved HER2 status. In some cases both immunohistochemistry (IHC and fluorescence in situ hybridization (FISH are reported as equivocal, whereas in other cases IHC results and FISH are discordant for positive versus negative results. The recent validation of array-based, molecular karyotyping for clinical oncology testing provides an alternative method for determination of HER2 gene copy number status in cases remaining unresolved by traditional methods. Methods In the current study, DNA extracted from 20 formalin fixed paraffin embedded (FFPE tissue samples from newly diagnosed cases of invasive ductal carcinoma referred to our laboratory with unresolved HER2 status, were analyzed using a clinically validated genomic array containing 127 probes covering the HER2 amplicon, the pericentromeric regions, and both chromosome 17 arms. Results Array-based comparative genomic hybridization (array CGH analysis of chromosome 17 resolved HER2 gene status in [20/20] (100% of cases and revealed additional chromosome 17 copy number changes in [18/20] (90% of cases. Array CGH analysis also revealed two false positives and one false negative by FISH due to "ratio skewing" caused by chromosomal gains and losses in the centromeric region. All cases with complex rearrangements of chromosome 17 showed genome-wide chromosomal instability. Conclusions These results illustrate the analytical power of array-based genomic analysis as a clinical laboratory technique for resolution of HER2 status in breast cancer cases with equivocal results. The frequency of complex chromosome 17

  12. Exclusion of APC and VHL gene deletions by array-based comparative hybridization in two patients with microscopically visible chromosomal aberrations.

    Science.gov (United States)

    Wallerstein, Robert J; Brooks, Susan Sklower; Streck, Deanna L; Kurvathi, Rohini; Toruner, Gokce A

    2007-10-15

    Karyotyping is a major component of the genetic work-up of patients with dysmorphism. Cytogenetic aberrations close to a known tumor suppressor gene raise important clinical issues because deletion of that tumor suppressor gene can cause genetic predisposition to cancer. We present two cancer-free dysmorphic patients with karyotypes of 46,XX,del(5)(q15q22.3) and 46,XX,del(3)(p25.2~pter). These deletions are close to the APC and VHL genes that confer susceptibility to familial Adenomatous polyposis (OMIM #17510) and von-Hippel-Lindau syndrome (OMIM #193300), respectively. The array-based comparative genomic hybridization (array-CGH) analysis using a custom Agilent 44K oligonucleotide array demonstrated an interstitial 20.7-megabase (Mb) deletion on 5q (chr5: 89,725,638-110,491,345) and a terminal 9.45-Mb deletion on 3p (chr3:pter-9,450,984). According to the March 2006 human reference sequence, the APC gene is located at chr5: 112,101,483-112,209,835 and the VHL gene is located at chr3: 10,158,319-10,168,746. These results indicate that the APC gene is 2,300 kilobases (kb) and the VHL gene is 700 kb away from deleted regions. Southern blot analysis for APC and VHL genes were negative, consistent with array-CGH findings. These results demonstrate the power of array-CCH to assess potential tumor suppressor gene involvement and cancer risk in patients with microscopically visible deletions in areas near tumor suppressors.

  13. Sample processing for DNA chip array-based analysis of enterohemorrhagic Escherichia coli (EHEC)

    OpenAIRE

    Enfors Sven-Olof; Wegrzyn Grzegorz; Basselet Pascal; Gabig-Ciminska Magdalena

    2008-01-01

    Abstract Background Exploitation of DNA-based analyses of microbial pathogens, and especially simultaneous typing of several virulence-related genes in bacteria is becoming an important objective of public health these days. Results A procedure for sample processing for a confirmative analysis of enterohemorrhagic Escherichia coli (EHEC) on a single colony with DNA chip array was developed and is reported here. The protocol includes application of fragmented genomic DNA from ultrasonicated co...

  14. MethLAB: a graphical user interface package for the analysis of array-based DNA methylation data.

    Science.gov (United States)

    Kilaru, Varun; Barfield, Richard T; Schroeder, James W; Smith, Alicia K; Conneely, Karen N

    2012-03-01

    Recent evidence suggests that DNA methylation changes may underlie numerous complex traits and diseases. The advent of commercial, array-based methods to interrogate DNA methylation has led to a profusion of epigenetic studies in the literature. Array-based methods, such as the popular Illumina GoldenGate and Infinium platforms, estimate the proportion of DNA methylated at single-base resolution for thousands of CpG sites across the genome. These arrays generate enormous amounts of data, but few software resources exist for efficient and flexible analysis of these data. We developed a software package called MethLAB (http://genetics.emory.edu/conneely/MethLAB) using R, an open source statistical language that can be edited to suit the needs of the user. MethLAB features a graphical user interface (GUI) with a menu-driven format designed to efficiently read in and manipulate array-based methylation data in a user-friendly manner. MethLAB tests for association between methylation and relevant phenotypes by fitting a separate linear model for each CpG site. These models can incorporate both continuous and categorical phenotypes and covariates, as well as fixed or random batch or chip effects. MethLAB accounts for multiple testing by controlling the false discovery rate (FDR) at a user-specified level. Standard output includes a spreadsheet-ready text file and an array of publication-quality figures. Considering the growing interest in and availability of DNA methylation data, there is a great need for user-friendly open source analytical tools. With MethLAB, we present a timely resource that will allow users with no programming experience to implement flexible and powerful analyses of DNA methylation data.

  15. Array-based FMR1 sequencing and deletion analysis in patients with a fragile X syndrome-like phenotype.

    Directory of Open Access Journals (Sweden)

    Stephen C Collins

    Full Text Available BACKGROUND: Fragile X syndrome (FXS is caused by loss of function mutations in the FMR1 gene. Trinucleotide CGG-repeat expansions, resulting in FMR1 gene silencing, are the most common mutations observed at this locus. Even though the repeat expansion mutation is a functional null mutation, few conventional mutations have been identified at this locus, largely due to the clinical laboratory focus on the repeat tract. METHODOLOGY/PRINCIPAL FINDINGS: To more thoroughly evaluate the frequency of conventional mutations in FXS-like patients, we used an array-based method to sequence FMR1 in 51 unrelated males exhibiting several features characteristic of FXS but with normal CGG-repeat tracts of FMR1. One patient was identified with a deletion in FMR1, but none of the patients were found to have other conventional mutations. CONCLUSIONS/SIGNIFICANCE: These data suggest that missense mutations in FMR1 are not a common cause of the FXS phenotype in patients who have normal-length CGG-repeat tracts. However, screening for small deletions of FMR1 may be of clinically utility.

  16. Design of Smart Ion-Selective Electrode Arrays Based on Source Separation through Nonlinear Independent Component Analysis

    Directory of Open Access Journals (Sweden)

    Duarte L.T.

    2014-03-01

    Full Text Available The development of chemical sensor arrays based on Blind Source Separation (BSS provides a promising solution to overcome the interference problem associated with Ion-Selective Electrodes (ISE. The main motivation behind this new approach is to ease the time-demanding calibration stage. While the first works on this problem only considered the case in which the ions under analysis have equal valences, the present work aims at developing a BSS technique that works when the ions have different charges. In this situation, the resulting mixing model belongs to a particular class of nonlinear systems that have never been studied in the BSS literature. In order to tackle this sort of mixing process, we adopted a recurrent network as separating system. Moreover, concerning the BSS learning strategy, we develop a mutual information minimization approach based on the notion of the differential of the mutual information. The method works requires a batch operation, and, thus, can be used to perform off-line analysis. The validity of our approach is supported by experiments where the mixing model parameters were extracted from actual data.

  17. Fiber array based hyperspectral Raman imaging for chemical selective analysis of malaria-infected red blood cells

    Energy Technology Data Exchange (ETDEWEB)

    Brückner, Michael [Leibniz Institute of Photonic Technology, 07745 Jena (Germany); Becker, Katja [Justus Liebig University Giessen, Biochemistry and Molecular Biology, 35392 Giessen (Germany); Popp, Jürgen [Leibniz Institute of Photonic Technology, 07745 Jena (Germany); Friedrich Schiller University Jena, Institute for Physical Chemistry, 07745 Jena (Germany); Friedrich Schiller University Jena, Abbe Centre of Photonics, 07745 Jena (Germany); Frosch, Torsten, E-mail: torsten.frosch@uni-jena.de [Leibniz Institute of Photonic Technology, 07745 Jena (Germany); Friedrich Schiller University Jena, Institute for Physical Chemistry, 07745 Jena (Germany); Friedrich Schiller University Jena, Abbe Centre of Photonics, 07745 Jena (Germany)

    2015-09-24

    A new setup for Raman spectroscopic wide-field imaging is presented. It combines the advantages of a fiber array based spectral translator with a tailor-made laser illumination system for high-quality Raman chemical imaging of sensitive biological samples. The Gaussian-like intensity distribution of the illuminating laser beam is shaped by a square-core optical multimode fiber to a top-hat profile with very homogeneous intensity distribution to fulfill the conditions of Koehler. The 30 m long optical fiber and an additional vibrator efficiently destroy the polarization and coherence of the illuminating light. This homogeneous, incoherent illumination is an essential prerequisite for stable quantitative imaging of complex biological samples. The fiber array translates the two-dimensional lateral information of the Raman stray light into separated spectral channels with very high contrast. The Raman image can be correlated with a corresponding white light microscopic image of the sample. The new setup enables simultaneous quantification of all Raman spectra across the whole spatial area with very good spectral resolution and thus outperforms other Raman imaging approaches based on scanning and tunable filters. The unique capabilities of the setup for fast, gentle, sensitive, and selective chemical imaging of biological samples were applied for automated hemozoin analysis. A special algorithm was developed to generate Raman images based on the hemozoin distribution in red blood cells without any influence from other Raman scattering. The new imaging setup in combination with the robust algorithm provides a novel, elegant way for chemical selective analysis of the malaria pigment hemozoin in early ring stages of Plasmodium falciparum infected erythrocytes. - Highlights: • Raman hyperspectral imaging allows for chemical selective analysis of biological samples with spatial heterogeneity. • A homogeneous, incoherent illumination is essential for reliable

  18. Fiber array based hyperspectral Raman imaging for chemical selective analysis of malaria-infected red blood cells.

    Science.gov (United States)

    Brückner, Michael; Becker, Katja; Popp, Jürgen; Frosch, Torsten

    2015-09-24

    A new setup for Raman spectroscopic wide-field imaging is presented. It combines the advantages of a fiber array based spectral translator with a tailor-made laser illumination system for high-quality Raman chemical imaging of sensitive biological samples. The Gaussian-like intensity distribution of the illuminating laser beam is shaped by a square-core optical multimode fiber to a top-hat profile with very homogeneous intensity distribution to fulfill the conditions of Koehler. The 30 m long optical fiber and an additional vibrator efficiently destroy the polarization and coherence of the illuminating light. This homogeneous, incoherent illumination is an essential prerequisite for stable quantitative imaging of complex biological samples. The fiber array translates the two-dimensional lateral information of the Raman stray light into separated spectral channels with very high contrast. The Raman image can be correlated with a corresponding white light microscopic image of the sample. The new setup enables simultaneous quantification of all Raman spectra across the whole spatial area with very good spectral resolution and thus outperforms other Raman imaging approaches based on scanning and tunable filters. The unique capabilities of the setup for fast, gentle, sensitive, and selective chemical imaging of biological samples were applied for automated hemozoin analysis. A special algorithm was developed to generate Raman images based on the hemozoin distribution in red blood cells without any influence from other Raman scattering. The new imaging setup in combination with the robust algorithm provides a novel, elegant way for chemical selective analysis of the malaria pigment hemozoin in early ring stages of Plasmodium falciparum infected erythrocytes.

  19. Genetic characterization of dogs via chromosomal analysis and array-based comparative genomic hybridization (aCGH).

    Science.gov (United States)

    Müller, M H; Reimann-Berg, N; Bullerdiek, J; Murua Escobar, H

    2012-01-01

    The results of cytogenetic and molecular cytogenetic investigations revealed similarities in genetic background and biological behaviour between tumours and genetic diseases of humans and dogs. These findings classify the dog a good and accepted model for human cancers such as osteosarcomas, mammary carcinomas, oral melanomas and others. With the appearance of new studies and advances in canine genome sequencing, the number of known homologies in diseases between these species raised and still is expected to increase. In this context, array-based comparative genomic hybridization (aCGH) provides a novel tool to rapidly characterize numerical aberrations in canine tumours or to detect copy number aberrations between different breeds. As it is possible to spot probes covering the whole genome on each chip to discover copy number aberrations of all chromosomes simultaneously, this method is time-saving and cost-effective - considering the relation of costs and the amount of data obtained. Complemented with traditional methods like karyotyping and fluorescence in situ hybridization (FISH) analyses, the aCGH is able to provide new insights into the underlying causes of canine carcinogenesis.

  20. Specific lectin biomarkers for isolation of human pluripotent stem cells identified through array-based glycomic analysis

    Institute of Scientific and Technical Information of China (English)

    Yu-Chieh Wang; Trevor R Leonardo; Ying Liu; Suzanne E Peterson; Louise C Laurent; Shinya Yamanaka; Jeanne F Loring; Masato Nakagawa; Ibon Garitaonandia; Ileana Slavin; Gulsah Altun; Robert M Lacharite; Kristopher L Nazor; Ha T Tran; Candace L Lynch

    2011-01-01

    Rapid and dependable methods for isolating human pluripotent stem cell (hPSC) populations are urgently needed for quality control in basic research and in cell-based therapy applications.Using lectin arrays,we analyzed glycoproteins extracted from 26 hPSC samples and 22 differentiated cell samples,and identified a small group of iectins with distinctive binding signatures that were sufficient to distinguish hPSCs from a variety of non-pluripotent cell types.These specific biomarkers were shared by all the 12 human embryonic stem cell and the 14 human induced pluripotent stem cell samples examined,regardless of the laboratory of origin,the culture conditions,the somatic cell type reprogrammed,or the reprogramming method used.We demonstrated a practical application of specific lectin binding by detecting hPSCs within a differentiated cell population with lectin-mediated staining followed by fluorescence microscopy and flow cytometry,and by enriching and purging viable hPSCs from mixed cell populations using lectin-mediated cell separation.Global gene expression analysis showed pluripotency-associated differential expression of specific fucosyltransferases and sialyltransferases,which may underlie these differences in protein glycosylation and lectin binding.Taken together,our results show that protein glycosylation differs considerably between pluripotent and non-pluripotent cells,and demonstrate that lectins may be used as biomarkers to monitor pluripotency in stem cell populations and for removal of viable hPSCs from mixed cell populations.

  1. Torque analysis and measurements of a permanent magnet type Eddy current brake with a Halbach magnet array based on analytical magnetic field calculations

    Science.gov (United States)

    Park, Min-Gyu; Choi, Jang-Young; Shin, Hyeon-Jae; Jang, Seok-Myeong

    2014-05-01

    This paper presents the torque analysis and measurements of a permanent magnet (PM) type eddy current brake (ECB) with a Halbach magnet array based on analytical magnetic field calculations. On the basis of a magnetic vector potential and using a two-dimensional (2D) polar coordinate system, the analytical solution for magnetic flux density, including the eddy current reaction is evaluated. Based on these solutions, the magnetic torque is also determined analytically. A 2D finite element analysis is employed to validate the method used. Practical issues in the analytical study of the PM type ECBs, such as the maximum braking torque, the required rotor speed, and the segment-dependent, are fully discussed. Finally, the braking torque as a function of the rotor speed is measured to verify the results of the analytical study.

  2. Array-based comparative genomic hybridization is more informative than conventional karyotyping and fluorescence in situ hybridization in the analysis of first-trimester spontaneous abortion

    Directory of Open Access Journals (Sweden)

    Gao Jinsong

    2012-07-01

    Full Text Available Abstract Background Array-based comparative genomic hybridization (aCGH is a new technique for detecting submicroscopic deletions and duplications, and can overcome many of the limitations associated with classic cytogenetic analysis. However, its clinical use in spontaneous abortion needs comprehensive evaluation. We used aCGH to investigate chromosomal imbalances in 100 spontaneous abortions and compared the results with G-banding karyotyping and fluorescence in situ hybridization (FISH. Inconsistent results were verified by quantitative fluorescence PCR. Results Abnormalities were detected in 61 cases. aCGH achieved the highest detection rate (93.4%, 57/61 compared with traditional karyotyping (77%, 47/61 and FISH analysis (68.9%, 42/61. aCGH identified all chromosome abnormalities reported by traditional karyotyping and interphase FISH analysis, with the exception of four triploids. It also detected three additional aneuploidy cases in 37 specimens with ‘normal’ karyotypes, one mosaicism and 10 abnormalities in 14 specimens that failed to grow in vitro. Conclusions aCGH analysis circumvents many limitations in traditional karyotyping or FISH. The accuracy and efficiency of aCGH in spontaneous abortions highlights its clinical usefulness for the future. As aborted tissues have the potential to be contaminated with maternal cells, the threshold value of detection in aCGH should be lowered to avoid false negatives.

  3. Differentiation and classification of bacteria using vancomycin functionalized silver nanorods array based surface-enhanced raman spectroscopy an chemometric analysis

    Science.gov (United States)

    The intrinsic surface-enhanced Raman scattering (SERS) was used for differentiating and classifying bacterial species with chemometric data analysis. Such differentiation has often been conducted with an insufficient sample population and strong interference from the food matrices. To address these ...

  4. Quantitative Analysis of Rat Dorsal Root Ganglion Neurons Cultured on Microelectrode Arrays Based on Fluorescence Microscopy Image Processing.

    Science.gov (United States)

    Mari, João Fernando; Saito, José Hiroki; Neves, Amanda Ferreira; Lotufo, Celina Monteiro da Cruz; Destro-Filho, João-Batista; Nicoletti, Maria do Carmo

    2015-12-01

    Microelectrode Arrays (MEA) are devices for long term electrophysiological recording of extracellular spontaneous or evocated activities on in vitro neuron culture. This work proposes and develops a framework for quantitative and morphological analysis of neuron cultures on MEAs, by processing their corresponding images, acquired by fluorescence microscopy. The neurons are segmented from the fluorescence channel images using a combination of segmentation by thresholding, watershed transform, and object classification. The positioning of microelectrodes is obtained from the transmitted light channel images using the circular Hough transform. The proposed method was applied to images of dissociated culture of rat dorsal root ganglion (DRG) neuronal cells. The morphological and topological quantitative analysis carried out produced information regarding the state of culture, such as population count, neuron-to-neuron and neuron-to-microelectrode distances, soma morphologies, neuron sizes, neuron and microelectrode spatial distributions. Most of the analysis of microscopy images taken from neuronal cultures on MEA only consider simple qualitative analysis. Also, the proposed framework aims to standardize the image processing and to compute quantitative useful measures for integrated image-signal studies and further computational simulations. As results show, the implemented microelectrode identification method is robust and so are the implemented neuron segmentation and classification one (with a correct segmentation rate up to 84%). The quantitative information retrieved by the method is highly relevant to assist the integrated signal-image study of recorded electrophysiological signals as well as the physical aspects of the neuron culture on MEA. Although the experiments deal with DRG cell images, cortical and hippocampal cell images could also be processed with small adjustments in the image processing parameter estimation.

  5. A platform for analysis of nanoscale liquids with an integrated sensor array based on 2-d material

    CERN Document Server

    Engel, M; Neumann, R F; Giro, R; Feger, C; Avouris, P; Steiner, M

    2016-01-01

    Analysis of nanoscale liquids, including wetting and flow phenomena, is a scientific challenge with far reaching implications for industrial technologies. We report the conception, development, and application of an integrated platform for the experimental characterization of liquids at the nanometer scale. The platform combines the sensing functionalities of an integrated, two-dimensional electronic device array with in situ application of highly sensitive optical micro-spectroscopy and atomic force microscopy. We demonstrate the performance capabilities of the platform with an embodiment based on an array of integrated, optically transparent graphene sensors. The application of electronic and optical sensing in the platform allows for differentiating between liquids electronically, for determining a liquid's molecular fingerprint, and for monitoring surface wetting dynamics in real time. In order to explore the platform's sensitivity limits, we record topographies and optical spectra of individual, spatiall...

  6. Array based detection of antibiotic resistance genes in Gram negative bacteria isolated from retail poultry meat in the UK and Ireland.

    Science.gov (United States)

    McNeece, Grainne; Naughton, Violetta; Woodward, Martin J; Dooley, James S G; Naughton, Patrick J

    2014-06-02

    The use of antibiotics in birds and animals intended for human consumption within the European Union (EU) and elsewhere has been subject to regulation prohibiting the use of antimicrobials as growth promoters and the use of last resort antibiotics in an attempt to reduce the spread of multi-resistant Gram negative bacteria. Given the inexorable spread of antibiotic resistance there is an increasing need for improved monitoring of our food. Using selective media, Gram negative bacteria were isolated from retail chicken of UK-Intensively reared (n=27), Irish-Intensively reared (n=19) and UK-Free range (n=30) origin and subjected to an oligonucleotide based array system for the detection of 47 clinically relevant antibiotic resistance genes (ARGs) and two integrase genes. High incidences of β-lactamase genes were noted in all sample types, acc (67%), cmy (80%), fox (55%) and tem (40%) while chloramphenicol resistant determinants were detected in bacteria from the UK poultry portions and were absent in bacteria from the Irish samples. Denaturing Gradient Gel Electrophoresis (DGGE) was used to qualitatively analyse the Gram negative population in the samples and showed the expected diversity based on band stabbing and DNA sequencing. The array system proved to be a quick method for the detection of antibiotic resistance gene (ARG) burden within a mixed Gram negative bacterial population.

  7. Array-based techniques for fingerprinting medicinal herbs

    Directory of Open Access Journals (Sweden)

    Xue Charlie

    2011-05-01

    Full Text Available Abstract Poor quality control of medicinal herbs has led to instances of toxicity, poisoning and even deaths. The fundamental step in quality control of herbal medicine is accurate identification of herbs. Array-based techniques have recently been adapted to authenticate or identify herbal plants. This article reviews the current array-based techniques, eg oligonucleotides microarrays, gene-based probe microarrays, Suppression Subtractive Hybridization (SSH-based arrays, Diversity Array Technology (DArT and Subtracted Diversity Array (SDA. We further compare these techniques according to important parameters such as markers, polymorphism rates, restriction enzymes and sample type. The applicability of the array-based methods for fingerprinting depends on the availability of genomics and genetics of the species to be fingerprinted. For the species with few genome sequence information but high polymorphism rates, SDA techniques are particularly recommended because they require less labour and lower material cost.

  8. Array-based photoacoustic spectroscopy

    Science.gov (United States)

    Autrey, S. Thomas; Posakony, Gerald J.; Chen, Yu

    2005-03-22

    Methods and apparatus for simultaneous or sequential, rapid analysis of multiple samples by photoacoustic spectroscopy are disclosed. A photoacoustic spectroscopy sample array including a body having at least three recesses or affinity masses connected thereto is used in conjunction with a photoacoustic spectroscopy system. At least one acoustic detector is positioned near the recesses or affinity masses for detection of acoustic waves emitted from species of interest within the recesses or affinity masses.

  9. Array-based comparative genomic hybridization analysis reveals chromosomal copy number aberrations associated with clinical outcome in canine diffuse large B-cell lymphoma.

    Directory of Open Access Journals (Sweden)

    Arianna Aricò

    Full Text Available Canine Diffuse Large B-cell Lymphoma (cDLBCL is an aggressive cancer with variable clinical response. Despite recent attempts by gene expression profiling to identify the dog as a potential animal model for human DLBCL, this tumor remains biologically heterogeneous with no prognostic biomarkers to predict prognosis. The aim of this work was to identify copy number aberrations (CNAs by high-resolution array comparative genomic hybridization (aCGH in 12 dogs with newly diagnosed DLBCL. In a subset of these dogs, the genetic profiles at the end of therapy and at relapse were also assessed. In primary DLBCLs, 90 different genomic imbalances were counted, consisting of 46 gains and 44 losses. Two gains in chr13 were significantly correlated with clinical stage. In addition, specific regions of gains and losses were significantly associated to duration of remission. In primary DLBCLs, individual variability was found, however 14 recurrent CNAs (>30% were identified. Losses involving IGK, IGL and IGH were always found, and gains along the length of chr13 and chr31 were often observed (>41%. In these segments, MYC, LDHB, HSF1, KIT and PDGFRα are annotated. At the end of therapy, dogs in remission showed four new CNAs, whereas three new CNAs were observed in dogs at relapse compared with the previous profiles. One ex novo CNA, involving TCR, was present in dogs in remission after therapy, possibly induced by the autologous vaccine. Overall, aCGH identified small CNAs associated with outcome, which, along with future expression studies, may reveal target genes relevant to cDLBCL.

  10. Abductive Inference using Array-Based Logic

    DEFF Research Database (Denmark)

    Frisvad, Jeppe Revall; Falster, Peter; Møller, Gert L.;

    The notion of abduction has found its usage within a wide variety of AI fields. Computing abductive solutions has, however, shown to be highly intractable in logic programming. To avoid this intractability we present a new approach to logicbased abduction; through the geometrical view of data...... employed in array-based logic we embrace abduction in a simple structural operation. We argue that a theory of abduction on this form allows for an implementation which, at runtime, can perform abductive inference quite efficiently on arbitrary rules of logic representing knowledge of finite domains....

  11. Low Power Systolic Array Based Digital Filter for DSP Applications

    Directory of Open Access Journals (Sweden)

    S. Karthick

    2015-01-01

    Full Text Available Main concepts in DSP include filtering, averaging, modulating, and correlating the signals in digital form to estimate characteristic parameter of a signal into a desirable form. This paper presents a brief concept of low power datapath impact for Digital Signal Processing (DSP based biomedical application. Systolic array based digital filter used in signal processing of electrocardiogram analysis is presented with datapath architectural innovations in low power consumption perspective. Implementation was done with ASIC design methodology using TSMC 65 nm technological library node. The proposed systolic array filter has reduced leakage power up to 8.5% than the existing filter architectures.

  12. Research on battery array based on solar power

    Science.gov (United States)

    Li, Junhong

    2017-03-01

    Almost all of the energy of solar power supply system comes from solar energy, which is a kind of pollution-free green energy, using independent photovoltaic system as base station power supply. In this paper, taking the solar power system as the research object, we made MATLAB simulation analysis of the independent solar photovoltaic system battery array. The simulation results showed that the output voltage and the output current of the solar array based on solar power system are affected by the illumination intensity and temperature change. In addition, it also showed that at any temperature and illumination intensity, there will a largest output power.

  13. Low Power Systolic Array Based Digital Filter for DSP Applications.

    Science.gov (United States)

    Karthick, S; Valarmathy, S; Prabhu, E

    2015-01-01

    Main concepts in DSP include filtering, averaging, modulating, and correlating the signals in digital form to estimate characteristic parameter of a signal into a desirable form. This paper presents a brief concept of low power datapath impact for Digital Signal Processing (DSP) based biomedical application. Systolic array based digital filter used in signal processing of electrocardiogram analysis is presented with datapath architectural innovations in low power consumption perspective. Implementation was done with ASIC design methodology using TSMC 65 nm technological library node. The proposed systolic array filter has reduced leakage power up to 8.5% than the existing filter architectures.

  14. Cell array-based intracellular localization screening reveals novel functional features of human chromosome 21 proteins

    Directory of Open Access Journals (Sweden)

    Kahlem Pascal

    2006-06-01

    Full Text Available Abstract Background Trisomy of human chromosome 21 (Chr21 results in Down's syndrome, a complex developmental and neurodegenerative disease. Molecular analysis of Down's syndrome, however, poses a particular challenge, because the aneuploid region of Chr21 contains many genes of unknown function. Subcellular localization of human Chr21 proteins may contribute to further understanding of the functions and regulatory mechanisms of the genes that code for these proteins. Following this idea, we used a transfected-cell array technique to perform a rapid and cost-effective analysis of the intracellular distribution of Chr 21 proteins. Results We chose 89 genes that were distributed over the majority of 21q, ranging from RBM11 (14.5 Mb to MCM3AP (46.6 Mb, with part of them expressed aberrantly in the Down's syndrome mouse model. Open reading frames of these genes were cloned into a mammalian expression vector with an amino-terminal His6 tag. All of the constructs were arrayed on glass slides and reverse transfected into HEK293T cells for protein expression. Co-localization detection using a set of organelle markers was carried out for each Chr21 protein. Here, we report the subcellular localization properties of 52 proteins. For 34 of these proteins, their localization is described for the first time. Furthermore, the alteration in cell morphology and growth as a result of protein over-expression for claudin-8 and claudin-14 genes has been characterized. Conclusion The cell array-based protein expression and detection approach is a cost-effective platform for large-scale functional analyses, including protein subcellular localization and cell phenotype screening. The results from this study reveal novel functional features of human Chr21 proteins, which should contribute to further understanding of the molecular pathology of Down's syndrome.

  15. Transcriptome sequencing study implicates immune-related genes differentially expressed in schizophrenia: new data and a meta-analysis

    Science.gov (United States)

    Sanders, A R; Drigalenko, E I; Duan, J; Moy, W; Freda, J; Göring, H H H; Gejman, P V

    2017-01-01

    We undertook an RNA sequencing (RNAseq)-based transcriptomic profiling study on lymphoblastoid cell lines of a European ancestry sample of 529 schizophrenia cases and 660 controls, and found 1058 genes to be differentially expressed by affection status. These differentially expressed genes were enriched for involvement in immunity, especially the 697 genes with higher expression in cases. Comparing the current RNAseq transcriptomic profiling to our previous findings in an array-based study of 268 schizophrenia cases and 446 controls showed a highly significant positive correlation over all genes. Fifteen (18%) of the 84 genes with significant (false discovery rateRNAseq and array data sets (797 cases and 1106 controls) showed 169 additional genes (besides those found in the primary RNAseq-based analysis) to be differentially expressed, and provided further evidence of immune gene enrichment. In addition to strengthening our previous array-based gene expression differences in schizophrenia cases versus controls and providing transcriptomic support for some genes implicated by other approaches for schizophrenia, our study detected new genes differentially expressed in schizophrenia. We highlight RNAseq-based differential expression of various genes involved in neurodevelopment and/or neuronal function, and discuss caveats of the approach. PMID:28418402

  16. Gene set analysis for GWAS

    DEFF Research Database (Denmark)

    Debrabant, Birgit; Soerensen, Mette

    2014-01-01

    Abstract We discuss the use of modified Kolmogorov-Smirnov (KS) statistics in the context of gene set analysis and review corresponding null and alternative hypotheses. Especially, we show that, when enhancing the impact of highly significant genes in the calculation of the test statistic...... parameter and the genesis and distribution of the gene-level statistics, and illustrate the effects of differential weighting in a real-life example....

  17. Function analysis of unknown genes

    DEFF Research Database (Denmark)

    Rogowska-Wrzesinska, A.

    2002-01-01

      This thesis entitled "Function analysis of unknown genes" presents the use of proteome analysis for the characterisation of yeast (Saccharomyces cerevisiae) genes and their products (proteins especially those of unknown function). This study illustrates that proteome analysis can be used...... to describe different aspects of molecular biology of the cell, to study changes that occur in the cell due to overexpression or deletion of a gene and to identify various protein modifications. The biological questions and the results of the described studies show the diversity of the information that can...... genes and proteins. It reports the first global proteome database collecting 36 yeast single gene deletion mutants and selecting over 650 differences between analysed mutants and the wild type strain. The obtained results show that two-dimensional gel electrophoresis and mass spectrometry based proteome...

  18. Improving molecular diagnosis of aniridia and WAGR syndrome using customized targeted array-based CGH

    Science.gov (United States)

    Vallespín, Elena; Villaverde, Cristina; Martín-Arenas, Rubén; Vélez-Monsalve, Camilo; Lorda-Sánchez, Isabel; Nevado, Julián; Trujillo-Tiebas, María José; Lapunzina, Pablo; Ayuso, Carmen; Corton, Marta

    2017-01-01

    Chromosomal deletions at 11p13 are a frequent cause of congenital Aniridia, a rare pan-ocular genetic disease, and of WAGR syndrome, accounting up to 30% of cases. First-tier genetic testing for newborn with aniridia, to detect 11p13 rearrangements, includes Multiplex Ligation-dependent Probe Amplification (MLPA) and karyotyping. However, neither of these approaches allow obtaining a complete picture of the high complexity of chromosomal deletions and breakpoints in aniridia. Here, we report the development and validation of a customized targeted array-based comparative genomic hybridization, so called WAGR-array, for comprehensive high-resolution analysis of CNV in the WAGR locus. Our approach increased the detection rate in a Spanish cohort of 38 patients with aniridia, WAGR syndrome and other related ocular malformations, allowing to characterize four undiagnosed aniridia cases, and to confirm MLPA findings in four additional patients. For all patients, breakpoints were accurately established and a contiguous deletion syndrome, involving a large number of genes, was identified in three patients. Moreover, we identified novel microdeletions affecting 3' PAX6 regulatory regions in three families with isolated aniridia. This tool represents a good strategy for the genetic diagnosis of aniridia and associated syndromes, allowing for a more accurate CNVs detection, as well as a better delineation of breakpoints. Our results underline the clinical importance of performing exhaustive and accurate analysis of chromosomal rearrangements for patients with aniridia, especially newborns and those without defects in PAX6 after diagnostic screening. PMID:28231309

  19. Chromatin analysis of occluded genes

    Science.gov (United States)

    Lee, Jae Hyun; Gaetz, Jedidiah; Bugarija, Branimir; Fernandes, Croydon J.; Snyder, Gregory E.; Bush, Eliot C.; Lahn, Bruce T.

    2009-01-01

    We recently described two opposing states of transcriptional competency. One is termed ‘competent’ whereby a gene is capable of responding to trans-acting transcription factors of the cell, such that it is active if appropriate transcriptional activators are present, though it can also be silent if activators are absent or repressors are present. The other is termed ‘occluded’ whereby a gene is silenced by cis-acting, chromatin-based mechanisms in a manner that blocks it from responding to trans-acting factors, such that it is silent even when activators are present in the cellular milieu. We proposed that gene occlusion is a mechanism by which differentiated cells stably maintain their phenotypic identities. Here, we describe chromatin analysis of occluded genes. We found that DNA methylation plays a causal role in maintaining occlusion for a subset of occluded genes. We further examined a variety of other chromatin marks typically associated with transcriptional silencing, including histone variants, covalent histone modifications and chromatin-associated proteins. Surprisingly, we found that although many of these marks are robustly linked to silent genes (which include both occluded genes and genes that are competent but silent), none is linked specifically to occluded genes. Although the observation does not rule out a possible causal role of these chromatin marks in occlusion, it does suggest that these marks might be secondary effect rather than primary cause of the silent state in many genes. PMID:19380460

  20. High frequency of submicroscopic chromosomal imbalances in patients with syndromic craniosynostosis detected by a combined approach of microsatellite segregation analysis, multiplex ligation-dependent probe amplification and array-based comparative genome hybridisation.

    NARCIS (Netherlands)

    Jehee, F.S.; Krepischi-Santos, A.C.; Rocha, K.M.; Cavalcanti, D.P.; Kim, C.A.; Bertola, D.R.; Alonso, L.G.; D'Angelo, C.S.; Mazzeu, J.F.; Froyen, G.; Lugtenberg, D.; Vianna-Morgante, A.M.; Rosenberg, C.; Passos-Bueno, M.R.

    2008-01-01

    We present the first comprehensive study, to our knowledge, on genomic chromosomal analysis in syndromic craniosynostosis. In total, 45 patients with craniosynostotic disorders were screened with a variety of methods including conventional karyotype, microsatellite segregation analysis, subtelomeric

  1. Microarray-based analysis for hepatocellular carcinoma: From gene expression profiling to new challenges

    Institute of Scientific and Technical Information of China (English)

    Yutaka Midorikawa; Masatoshi Makuuchi; Wei Tang; Hiroyuki Aburatani

    2007-01-01

    Accumulation of mutations and alterations in the expression of various genes result in carcinogenesis, and the development of microarray technology has enabled us to identify the comprehensive gene expression alterations in oncogenesis. Many studies have applied this technology for hepatocellular carcinoma (HCC), and identified a number of candidate genes useful as biomarkers in cancer staging, prediction of recurrence and prognosis, and treatment selection. Some of these target molecules have been used to develop new serum diagnostic markers and therapeutic targets against HCC to benefit patients. Previously, we compared gene expression profiling data with classification based on clinicopathological features, such as hepatitis viral infection or liver cancer progression. The next era of gene expression analysis will require systematic integration of expression profiles with other types of biological information, such as genomic locus, gene function, and sequence information. We have reported integration between expression profiles and locus information, which is effective in detecting structural genomic abnormalities, such as chromosomal gains and losses, in which we showed that gene expression profiles are subject to chromosomal bias. Furthermore, array-based comparative genomic hybridization analysis and allelic dosage analysis using genotyping arrays for HCC were also reviewed, with comparison of conventional methods.

  2. Comparative genomic analysis of eutherian kallikrein genes

    Directory of Open Access Journals (Sweden)

    Marko Premzl

    2017-03-01

    Full Text Available The present study made attempts to update and revise eutherian kallikrein genes implicated in major physiological and pathological processes and in medical molecular diagnostics. Using eutherian comparative genomic analysis protocol and free available genomic sequence assemblies, the tests of reliability of eutherian public genomic sequences annotated most comprehensive curated third party data gene data set of eutherian kallikrein genes including 121 complete coding sequences among 335 potential coding sequences. The present analysis first described 13 major gene clusters of eutherian kallikrein genes, and explained their differential gene expansion patterns. One updated classification and nomenclature of eutherian kallikrein genes was proposed, as new framework of future experiments.

  3. Gene set analysis for longitudinal gene expression data

    Directory of Open Access Journals (Sweden)

    Piepho Hans-Peter

    2011-07-01

    Full Text Available Abstract Background Gene set analysis (GSA has become a successful tool to interpret gene expression profiles in terms of biological functions, molecular pathways, or genomic locations. GSA performs statistical tests for independent microarray samples at the level of gene sets rather than individual genes. Nowadays, an increasing number of microarray studies are conducted to explore the dynamic changes of gene expression in a variety of species and biological scenarios. In these longitudinal studies, gene expression is repeatedly measured over time such that a GSA needs to take into account the within-gene correlations in addition to possible between-gene correlations. Results We provide a robust nonparametric approach to compare the expressions of longitudinally measured sets of genes under multiple treatments or experimental conditions. The limiting distributions of our statistics are derived when the number of genes goes to infinity while the number of replications can be small. When the number of genes in a gene set is small, we recommend permutation tests based on our nonparametric test statistics to achieve reliable type I error and better power while incorporating unknown correlations between and within-genes. Simulation results demonstrate that the proposed method has a greater power than other methods for various data distributions and heteroscedastic correlation structures. This method was used for an IL-2 stimulation study and significantly altered gene sets were identified. Conclusions The simulation study and the real data application showed that the proposed gene set analysis provides a promising tool for longitudinal microarray analysis. R scripts for simulating longitudinal data and calculating the nonparametric statistics are posted on the North Dakota INBRE website http://ndinbre.org/programs/bioinformatics.php. Raw microarray data is available in Gene Expression Omnibus (National Center for Biotechnology Information with

  4. NMD Microarray Analysis for Rapid Genome-Wide Screen of Mutated Genes in Cancer

    Directory of Open Access Journals (Sweden)

    Maija Wolf

    2005-01-01

    Full Text Available Gene mutations play a critical role in cancer development and progression, and their identification offers possibilities for accurate diagnostics and therapeutic targeting. Finding genes undergoing mutations is challenging and slow, even in the post-genomic era. A new approach was recently developed by Noensie and Dietz to prioritize and focus the search, making use of nonsense-mediated mRNA decay (NMD inhibition and microarray analysis (NMD microarrays in the identification of transcripts containing nonsense mutations. We combined NMD microarrays with array-based CGH (comparative genomic hybridization in order to identify inactivation of tumor suppressor genes in cancer. Such a “mutatomics” screening of prostate cancer cell lines led to the identification of inactivating mutations in the EPHB2 gene. Up to 8% of metastatic uncultured prostate cancers also showed mutations of this gene whose loss of function may confer loss of tissue architecture. NMD microarray analysis could turn out to be a powerful research method to identify novel mutated genes in cancer cell lines, providing targets that could then be further investigated for their clinical relevance and therapeutic potential.

  5. Copy number variation analysis implicates the cell polarity gene glypican 5 as a human spina bifida candidate gene

    Science.gov (United States)

    Bassuk, Alexander G.; Muthuswamy, Lakshmi B.; Boland, Riley; Smith, Tiffany L.; Hulstrand, Alissa M.; Northrup, Hope; Hakeman, Matthew; Dierdorff, Jason M.; Yung, Christina K.; Long, Abby; Brouillette, Rachel B.; Au, Kit Sing; Gurnett, Christina; Houston, Douglas W.; Cornell, Robert A.; Manak, J. Robert

    2013-01-01

    Neural tube defects (NTDs) are common birth defects of complex etiology. Family and population-based studies have confirmed a genetic component to NTDs. However, despite more than three decades of research, the genes involved in human NTDs remain largely unknown. We tested the hypothesis that rare copy number variants (CNVs), especially de novo germline CNVs, are a significant risk factor for NTDs. We used array-based comparative genomic hybridization (aCGH) to identify rare CNVs in 128 Caucasian and 61 Hispanic patients with non-syndromic lumbar-sacral myelomeningocele. We also performed aCGH analysis on the parents of affected individuals with rare CNVs where parental DNA was available (42 sets). Among the eight de novo CNVs that we identified, three generated copy number changes of entire genes. One large heterozygous deletion removed 27 genes, including PAX3, a known spina bifida-associated gene. A second CNV altered genes (PGPD8, ZC3H6) for which little is known regarding function or expression. A third heterozygous deletion removed GPC5 and part of GPC6, genes encoding glypicans. Glypicans are proteoglycans that modulate the activity of morphogens such as Sonic Hedgehog (SHH) and bone morphogenetic proteins (BMPs), both of which have been implicated in NTDs. Additionally, glypicans function in the planar cell polarity (PCP) pathway, and several PCP genes have been associated with NTDs. Here, we show that GPC5 orthologs are expressed in the neural tube, and that inhibiting their expression in frog and fish embryos results in NTDs. These results implicate GPC5 as a gene required for normal neural tube development. PMID:23223018

  6. Analysis of MIMO antenna array based on electromagnetic vector sensor%基于电磁矢量传感器的 MIMO 天线阵列系统研究

    Institute of Scientific and Technical Information of China (English)

    周杰; 邱琳; 菊池久和

    2013-01-01

    EVS(electromagnetic vetor sensor) signal processing method was combined with traditional MIMO signal processing method and three-dimensional channel model of multi-antenna array was set up. The algorithm of multiple signal classification (MUSIC) was adopted to make spatial spectrum estimation for direction of arrival (DOA) signal, analytical expressions of three-dimensional spatial channel was derived base on EVS for MIMO receiver system and the relationship between EVS signal processing and MIMO multipath channel correlation was clarified. To compare with tra-ditional MIMO antenna array, EVS array possesses abilities of obtaining multidimensional polarization information, processing spatial and polarization domain for arrival signal and degrading effects of spatial factors on MIMO channel capacity. Theoretical analysis and simulation results indicate that EVS array provide more advantages than traditional scalar sensor array on enhancing MIMO system performance.%  将电磁矢量传感器(EVS, electromagnetic vetor sensor)信号处理法与传统 MIMO 信号处理有机地结合,建立了基于 EVS 的多天线三维信道模型。采用多重信号分类(MUSIC, multiple signal classification)算法对 MIMO 的达波信号方向(DOA, direction of arrival)进行空间谱估计,导出基于 EVS 的三维空间信道解析式,阐明了 EVS 信号处理与 MIMO 多径信道相关性的关系。与传统标量传感器阵列(SSA, scalar sensor array)MIMO 天线阵列比较, EVS 阵列能获取达波信号的多维极化信息,同时具有空间域和极化信号处理能力。因此可缓解空间多径信道相关性,使空间极化分量的相关性趋于零值,而且使 MIMO 系统性能受空间结构的影响较小。理论分析和仿真结果表明在提高 MIMO 天线系统性能上,基于 EVS 阵列的系统比 SSA 系统具有更高的优越性。

  7. Semiconductor Sensor Array Based Electronic Nose for Milk, Rancid Milk and Yoghurt Odors Identification

    Science.gov (United States)

    Botre, B.; Gharpure, D.; Shaligram, A.; Sadistap, S.

    2009-05-01

    This paper presents the use semiconductor sensor array based electronic nose for the identification of milk, rancid milk and yoghurt odors. A low cost sensor array, serial data acquisition system and E-nose software package (ESP) tool are used to generate the database, feature extraction and normalization. The MLP NN is trained using the NeuroSolutions for the identification. The network has successfully classified milk, rancid milk and yoghurt odors with 96% success rate. A sensitivity analysis is done to test the performance of the sensor data in the trained network

  8. 基于相控阵的结构损伤范围监测与图像表征%Phased array based structural damage range monitoring and imaging analysis

    Institute of Scientific and Technical Information of China (English)

    孙亚杰; 张永宏; 刘青山; 张自嘉

    2014-01-01

    本文针对碳纤维复合材料机翼盒段监测损伤信号微弱难以有效辨识及结构损伤大小的量化问题,提出基于相控阵的碳纤维复合材料结构损伤识别成像与损伤量化方法。利用Lamb波信号在特定方向上的干涉,实现对结构的定向扫描,提高信号的信噪比;通过划分扇环计算其面积进而量化损伤区域大小,同时分析时间延迟执行过程并分析其对损伤识别误差的影响。采用相控阵监测原理在某型无人机碳纤维复合材料机翼盒段上进行实验研究,识别结构中的损伤,对监测结果进行成像,不仅显示结构中损伤的位置并计算量化损伤区域大小,实验研究证明,采用相控阵原理能够有效精确地识别碳纤维复合材料机翼盒段中损伤,图像表征清晰且量化损伤范围准确。%Aiming at the problems that the monitoring damage signals travelling in the carbon fiber composite wing box are usually very weak and the damage size is difficult to be detected quantitively, a phased array principle based structural damage identification imaging and damage quantization method for carbon fiber composite structure is proposed. Using the Lamb wave interference taking place in a given direction, the signal directional scanning in the structure is achieved to improve the SNR (signal to noise ratio) of the signal. The damage area size is calculated and quantified through dividing the fan rings. The error analysis of the time delay process is analyzed and its impact on the damage identification error is also analyzed. The experiment was con-ducted on unmanned aerial vehicle (UAV) carbon fiber composite wing box using the phased array monitoring principle. The structural damage is identified with this method. The recognition result is drawn on a mapped image and the damage area position and size in the mapped image are calculated. The experiment study result on the UAV wing box shows that this method

  9. Small Area Array-Based LED Luminaire Design

    Energy Technology Data Exchange (ETDEWEB)

    Thomas Yuan

    2008-01-09

    This report contains a summary of technical achievements during a three-year project to demonstrate high efficiency LED luminaire designs based on small area array-based gallium nitride diodes. Novel GaN-based LED array designs are described, specifically addressing the thermal, optical, electrical and mechanical requirements for the incorporation of such arrays into viable solid-state LED luminaires. This work resulted in the demonstration of an integrated luminaire prototype of 1000 lumens cool white light output with reflector shaped beams and efficacy of 89.4 lm/W at CCT of 6000oK and CRI of 73; and performance of 903 lumens warm white light output with reflector shaped beams and efficacy of 63.0 lm/W at CCT of 2800oK and CRI of 82. In addition, up to 1275 lumens cool white light output at 114.2 lm/W and 1156 lumens warm white light output at 76.5 lm/W were achieved if the reflector was not used. The success to integrate small area array-based LED designs and address thermal, optical, electrical and mechanical requirements was clearly achieved in these luminaire prototypes with outstanding performance and high efficiency.

  10. Genome-wide Analysis of Gene Regulation

    DEFF Research Database (Denmark)

    Chen, Yun

    cells are capable of regulating their gene expression, so that each cell can only express a particular set of genes yielding limited numbers of proteins with specialized functions. Therefore a rigid control of differential gene expression is necessary for cellular diversity. On the other hand, aberrant...... gene regulation will disrupt the cell’s fundamental processes, which in turn can cause disease. Hence, understanding gene regulation is essential for deciphering the code of life. Along with the development of high throughput sequencing (HTS) technology and the subsequent large-scale data analysis......, genome-wide assays have increased our understanding of gene regulation significantly. This thesis describes the integration and analysis of HTS data across different important aspects of gene regulation. Gene expression can be regulated at different stages when the genetic information is passed from gene...

  11. Tin Oxide Nanorod Array-Based Electrochemical Hydrogen Peroxide Biosensor

    Directory of Open Access Journals (Sweden)

    Liu Jinping

    2010-01-01

    Full Text Available Abstract SnO2 nanorod array grown directly on alloy substrate has been employed as the working electrode of H2O2 biosensor. Single-crystalline SnO2 nanorods provide not only low isoelectric point and enough void spaces for facile horseradish peroxidase (HRP immobilization but also numerous conductive channels for electron transport to and from current collector; thus, leading to direct electrochemistry of HRP. The nanorod array-based biosensor demonstrates high H2O2 sensing performance in terms of excellent sensitivity (379 μA mM−1 cm−2, low detection limit (0.2 μM and high selectivity with the apparent Michaelis–Menten constant estimated to be as small as 33.9 μM. Our work further demonstrates the advantages of ordered array architecture in electrochemical device application and sheds light on the construction of other high-performance enzymatic biosensors.

  12. Radiation-hard/high-speed array-based optical engine

    Science.gov (United States)

    Gan, K. K.; Buchholz, P.; Heidbrink, S.; Kagan, H. P.; Kass, R. D.; Moore, J.; Smith, D. S.; Vogt, M.; Ziolkowski, M.

    2016-12-01

    We have designed and fabricated a compact array-based optical engine for transmitting data at 10 Gb/s. The device consists of a 4-channel ASIC driving a VCSEL (Vertical Cavity Surface Emitting Laser) array in an optical package. The ASIC is designed using only core transistors in a 65 nm CMOS process to enhance the radiation-hardness. The ASIC contains an 8-bit DAC to control the bias and modulation currents of the individual channels in the VCSEL array. The DAC settings are stored in SEU (single event upset) tolerant registers. Several devices were irradiated with 24 GeV/c protons and the performance of the devices is satisfactory after the irradiation.

  13. Fiber-array based optogenetic prosthetic system for stimulation therapy

    Science.gov (United States)

    Gu, Ling; Cote, Chris; Tejeda, Hector; Mohanty, Samarendra

    2012-02-01

    Recent advent of optogenetics has enabled activation of genetically-targeted neuronal cells using low intensity blue light with high temporal precision. Since blue light is attenuated rapidly due to scattering and absorption in neural tissue, optogenetic treatment of neurological disorders may require stimulation of specific cell types in multiple regions of the brain. Further, restoration of certain neural functions (vision, and auditory etc) requires accurate spatio-temporal stimulation patterns rather than just precise temporal stimulation. In order to activate multiple regions of the central nervous system in 3D, here, we report development of an optogenetic prosthetic comprising of array of fibers coupled to independently-controllable LEDs. This design avoids direct contact of LEDs with the brain tissue and thus does not require electrical and heat isolation, which can non-specifically stimulate and damage the local brain regions. The intensity, frequency, and duty cycle of light pulses from each fiber in the array was controlled independently using an inhouse developed LabView based program interfaced with a microcontroller driving the individual LEDs. While the temporal profile of the light pulses was controlled by varying the current driving the LED, the beam profile emanating from each fiber tip could be sculpted by microfabrication of the fiber tip. The fiber array was used to stimulate neurons, expressing channelrhodopsin-2, in different locations within the brain or retina. Control of neural activity in the mice cortex, using the fiber-array based prosthetic, is evaluated from recordings made with multi-electrode array (MEA). We also report construction of a μLED array based prosthetic for spatio-temporal stimulation of cortex.

  14. Gene set analysis using variance component tests

    Science.gov (United States)

    2013-01-01

    Background Gene set analyses have become increasingly important in genomic research, as many complex diseases are contributed jointly by alterations of numerous genes. Genes often coordinate together as a functional repertoire, e.g., a biological pathway/network and are highly correlated. However, most of the existing gene set analysis methods do not fully account for the correlation among the genes. Here we propose to tackle this important feature of a gene set to improve statistical power in gene set analyses. Results We propose to model the effects of an independent variable, e.g., exposure/biological status (yes/no), on multiple gene expression values in a gene set using a multivariate linear regression model, where the correlation among the genes is explicitly modeled using a working covariance matrix. We develop TEGS (Test for the Effect of a Gene Set), a variance component test for the gene set effects by assuming a common distribution for regression coefficients in multivariate linear regression models, and calculate the p-values using permutation and a scaled chi-square approximation. We show using simulations that type I error is protected under different choices of working covariance matrices and power is improved as the working covariance approaches the true covariance. The global test is a special case of TEGS when correlation among genes in a gene set is ignored. Using both simulation data and a published diabetes dataset, we show that our test outperforms the commonly used approaches, the global test and gene set enrichment analysis (GSEA). Conclusion We develop a gene set analyses method (TEGS) under the multivariate regression framework, which directly models the interdependence of the expression values in a gene set using a working covariance. TEGS outperforms two widely used methods, GSEA and global test in both simulation and a diabetes microarray data. PMID:23806107

  15. Comparative genomic analysis of soybean flowering genes.

    Directory of Open Access Journals (Sweden)

    Chol-Hee Jung

    Full Text Available Flowering is an important agronomic trait that determines crop yield. Soybean is a major oilseed legume crop used for human and animal feed. Legumes have unique vegetative and floral complexities. Our understanding of the molecular basis of flower initiation and development in legumes is limited. Here, we address this by using a computational approach to examine flowering regulatory genes in the soybean genome in comparison to the most studied model plant, Arabidopsis. For this comparison, a genome-wide analysis of orthologue groups was performed, followed by an in silico gene expression analysis of the identified soybean flowering genes. Phylogenetic analyses of the gene families highlighted the evolutionary relationships among these candidates. Our study identified key flowering genes in soybean and indicates that the vernalisation and the ambient-temperature pathways seem to be the most variant in soybean. A comparison of the orthologue groups containing flowering genes indicated that, on average, each Arabidopsis flowering gene has 2-3 orthologous copies in soybean. Our analysis highlighted that the CDF3, VRN1, SVP, AP3 and PIF3 genes are paralogue-rich genes in soybean. Furthermore, the genome mapping of the soybean flowering genes showed that these genes are scattered randomly across the genome. A paralogue comparison indicated that the soybean genes comprising the largest orthologue group are clustered in a 1.4 Mb region on chromosome 16 of soybean. Furthermore, a comparison with the undomesticated soybean (Glycine soja revealed that there are hundreds of SNPs that are associated with putative soybean flowering genes and that there are structural variants that may affect the genes of the light-signalling and ambient-temperature pathways in soybean. Our study provides a framework for the soybean flowering pathway and insights into the relationship and evolution of flowering genes between a short-day soybean and the long-day plant

  16. Application of multidisciplinary analysis to gene expression.

    Energy Technology Data Exchange (ETDEWEB)

    Wang, Xuefel (University of New Mexico, Albuquerque, NM); Kang, Huining (University of New Mexico, Albuquerque, NM); Fields, Chris (New Mexico State University, Las Cruces, NM); Cowie, Jim R. (New Mexico State University, Las Cruces, NM); Davidson, George S.; Haaland, David Michael; Sibirtsev, Valeriy (New Mexico State University, Las Cruces, NM); Mosquera-Caro, Monica P. (University of New Mexico, Albuquerque, NM); Xu, Yuexian (University of New Mexico, Albuquerque, NM); Martin, Shawn Bryan; Helman, Paul (University of New Mexico, Albuquerque, NM); Andries, Erik (University of New Mexico, Albuquerque, NM); Ar, Kerem (University of New Mexico, Albuquerque, NM); Potter, Jeffrey (University of New Mexico, Albuquerque, NM); Willman, Cheryl L. (University of New Mexico, Albuquerque, NM); Murphy, Maurice H. (University of New Mexico, Albuquerque, NM)

    2004-01-01

    Molecular analysis of cancer, at the genomic level, could lead to individualized patient diagnostics and treatments. The developments to follow will signal a significant paradigm shift in the clinical management of human cancer. Despite our initial hopes, however, it seems that simple analysis of microarray data cannot elucidate clinically significant gene functions and mechanisms. Extracting biological information from microarray data requires a complicated path involving multidisciplinary teams of biomedical researchers, computer scientists, mathematicians, statisticians, and computational linguists. The integration of the diverse outputs of each team is the limiting factor in the progress to discover candidate genes and pathways associated with the molecular biology of cancer. Specifically, one must deal with sets of significant genes identified by each method and extract whatever useful information may be found by comparing these different gene lists. Here we present our experience with such comparisons, and share methods developed in the analysis of an infant leukemia cohort studied on Affymetrix HG-U95A arrays. In particular, spatial gene clustering, hyper-dimensional projections, and computational linguistics were used to compare different gene lists. In spatial gene clustering, different gene lists are grouped together and visualized on a three-dimensional expression map, where genes with similar expressions are co-located. In another approach, projections from gene expression space onto a sphere clarify how groups of genes can jointly have more predictive power than groups of individually selected genes. Finally, online literature is automatically rearranged to present information about genes common to multiple groups, or to contrast the differences between the lists. The combination of these methods has improved our understanding of infant leukemia. While the complicated reality of the biology dashed our initial, optimistic hopes for simple answers from

  17. Filtering Genes for Cluster and Network Analysis

    Directory of Open Access Journals (Sweden)

    Parkhomenko Elena

    2009-06-01

    Full Text Available Abstract Background Prior to cluster analysis or genetic network analysis it is customary to filter, or remove genes considered to be irrelevant from the set of genes to be analyzed. Often genes whose variation across samples is less than an arbitrary threshold value are deleted. This can improve interpretability and reduce bias. Results This paper introduces modular models for representing network structure in order to study the relative effects of different filtering methods. We show that cluster analysis and principal components are strongly affected by filtering. Filtering methods intended specifically for cluster and network analysis are introduced and compared by simulating modular networks with known statistical properties. To study more realistic situations, we analyze simulated "real" data based on well-characterized E. coli and S. cerevisiae regulatory networks. Conclusion The methods introduced apply very generally, to any similarity matrix describing gene expression. One of the proposed methods, SUMCOV, performed well for all models simulated.

  18. Determination of pork spoilage by colorimetric gas sensor array based on natural pigments.

    Science.gov (United States)

    Huang, Xiao-wei; Zou, Xiao-bo; Shi, Ji-yong; Guo, Yanin; Zhao, Jie-wen; Zhang, Jianchun; Hao, Limin

    2014-02-15

    A new colorimetric gas-sensor array based on four natural pigments, that were extracted from spinach (Spinacia oleracea), red radish (Raphanus sativus L.), winter jasmine (Jasminum nudiflorum), and black rice (Oryza sativa L. indica), was developed for pork freshness evaluation. A colour change profile for each sample was obtained by differentiating the images of the sensor array before and after exposure to the odour of sample. The total viable count (TVC) per gram of pork was obtained by classical microbiological plating methods, and the biogenic amines were measured by HPLC. Biogenic amine index (BAI) for the determination of meat freshness was developed from the sum of putrescine and cadaverine. The colour change profiles were analysed using principal component analysis and correlated with conventional methods (BAI, TVC). A partial least squares (PLS) prediction model was obtained with r=0.854 and 0.933 for BAI and TVC, respectively.

  19. Using GenePattern for Gene Expression Analysis

    Science.gov (United States)

    Kuehn, Heidi; Liberzon, Arthur; Reich, Michael; Mesirov, Jill P.

    2013-01-01

    The abundance of genomic data now available in biomedical research has stimulated the development of sophisticated statistical methods for interpreting the data, and of special visualization tools for displaying the results in a concise and meaningful manner. However, biologists often find these methods and tools difficult to understand and use correctly. GenePattern is a freely available software package that addresses this issue by providing more than 100 analysis and visualization tools for genomic research in a comprehensive user-friendly environment for users at all levels of computational experience and sophistication. This unit demonstrates how to prepare and analyze microarray data in GenePattern. PMID:18551415

  20. Array-based GNSS Ionospheric Sensing: Estimability and Precision Analyses

    Science.gov (United States)

    Teunissen, Peter

    2016-04-01

    Array-based GNSS Ionospheric Sensing: Estimability and Precision Analyses PJG Teunissen1,2, A Khodabandeh1 and B Zhang1 1GNSS Research Centre, Curtin University, Perth, Australia 2Geoscience and Remote Sensing, Delft University of Technology, The Netherlands Introduction: The Global Navigation Satellite Systems (GNSS) have proved to be an effective means of measuring the Earth's ionosphere. The well-known geometry-free linear combinations of the GNSS data serve as the input of an external ionospheric model to capture both the spatial and temporal characteristics of the ionosphere. Next to the slant ionospheric delays experienced by the GNSS antennas, the geometry-free combinations also contain additional unknown delays that are caused by the presence of the carrier-phase ambiguous cycles and/or the code instrumental delays. That the geometry-free combinations, without an external ionospheric model, cannot unbiasedly determine the slant ionospheric delays reveals the lack of information content in the GNSS data. Motivation and objectives: With the advent of modernized multi-frequency signals, one is confronted with many different combinations of the GNSS data that are capable of sensing the ionosphere. Owing to such diversity and the lack of information content in the GNSS data, various estimable ionospheric delays of different interpretations (and of different precision) can therefore be formed. How such estimable ionospheric delays should be interpreted and the extent to which they contribute to the precision of the unbiased slant ionosphere are the topics of this contribution. Approach and results: In this contribution, we apply S-system theory to study the estimability and precision of the estimable slant ionospheric delays that are measured by the multi-frequency GNSS data. Two different S-systems are presented, leading to two different estimable parameters of different precision: 1) the phase-driven ionospheric delays and 2) the code-driven ionospheric delays

  1. Risk analysis of colorectal cancer incidence by gene expression analysis

    Science.gov (United States)

    Shangkuan, Wei-Chuan; Lin, Hung-Che; Chang, Yu-Tien; Jian, Chen-En; Fan, Hueng-Chuen; Chen, Kang-Hua; Liu, Ya-Fang; Hsu, Huan-Ming; Chou, Hsiu-Ling; Yao, Chung-Tay

    2017-01-01

    Background Colorectal cancer (CRC) is one of the leading cancers worldwide. Several studies have performed microarray data analyses for cancer classification and prognostic analyses. Microarray assays also enable the identification of gene signatures for molecular characterization and treatment prediction. Objective Microarray gene expression data from the online Gene Expression Omnibus (GEO) database were used to to distinguish colorectal cancer from normal colon tissue samples. Methods We collected microarray data from the GEO database to establish colorectal cancer microarray gene expression datasets for a combined analysis. Using the Prediction Analysis for Microarrays (PAM) method and the GSEA MSigDB resource, we analyzed the 14,698 genes that were identified through an examination of their expression values between normal and tumor tissues. Results Ten genes (ABCG2, AQP8, SPIB, CA7, CLDN8, SCNN1B, SLC30A10, CD177, PADI2, and TGFBI) were found to be good indicators of the candidate genes that correlate with CRC. From these selected genes, an average of six significant genes were obtained using the PAM method, with an accuracy rate of 95%. The results demonstrate the potential of utilizing a model with the PAM method for data mining. After a detailed review of the published reports, the results confirmed that the screened candidate genes are good indicators for cancer risk analysis using the PAM method. Conclusions Six genes were selected with 95% accuracy to effectively classify normal and colorectal cancer tissues. We hope that these results will provide the basis for new research projects in clinical practice that aim to rapidly assess colorectal cancer risk using microarray gene expression analysis. PMID:28229027

  2. Gene set analysis for interpreting genetic studies

    DEFF Research Database (Denmark)

    Pers, Tune H

    2016-01-01

    Interpretation of genome-wide association study (GWAS) results is lacking behind the discovery of new genetic associations. Consequently, there is an urgent need for data-driven methods for interpreting genetic association studies. Gene set analysis (GSA) can identify aetiologic pathways and func......Interpretation of genome-wide association study (GWAS) results is lacking behind the discovery of new genetic associations. Consequently, there is an urgent need for data-driven methods for interpreting genetic association studies. Gene set analysis (GSA) can identify aetiologic pathways...

  3. Array based characterization of a terminal deletion involving chromosome subband 15q26.2: an emerging syndrome associated with growth retardation, cardiac defects and developmental delay

    Directory of Open Access Journals (Sweden)

    Björkhem Gudrun

    2008-01-01

    Full Text Available Abstract Background Subtelomeric regions are gene rich and deletions in these chromosomal segments have been demonstrated to account for approximately 2.5% of patients displaying mental retardation with or without association of dysmorphic features. However, cases that report de novo terminal deletions on chromosome arm 15q are rare. Methods In this study we present the first example of a detailed molecular genetic mapping of a de novo deletion in involving 15q26.2-qter, caused by the formation of a dicentric chromosome 15, using metaphase FISH and tiling resolution (32 k genome-wide array-based comparative genomic hybridization (CGH. Results After an initial characterization of the dicentric chromosome by metaphase FISH, array CGH analysis mapped the terminal deletion to encompass a 6.48 megabase (Mb region, ranging from 93.86–100.34 Mb on chromosome 15. Conclusion In conclusion, we present an additional case to the growing family of reported cases with 15q26-deletion, thoroughly characterized at the molecular cytogenetic level. In the deleted regions, four candidate genes responsible for the phenotype of the patient could be delineated: IGFR1, MEF2A, CHSY1, and TM2D3. Further characterization of additional patients harboring similar 15q-aberrations might hopefully in the future lead to the description of a clear cut clinically recognizable syndrome.

  4. Using Array-Based Comparative Genomic Hybridization to Diagnose Pallister-Killian Syndrome.

    Science.gov (United States)

    Lee, Mi Na; Lee, Jiwon; Yu, Hee Joon; Lee, Jeehun; Kim, Sun Hee

    2017-01-01

    Pallister-Killian syndrome (PKS) is a rare multisystem disorder characterized by isochromosome 12p and tissue-limited mosaic tetrasomy 12p. In this study, we diagnosed three pediatric patients who were suspicious of having PKS using array-based comparative genomic hybridization (array CGH) and FISH analyses performed on peripheral lymphocytes. Patients 1 and 2 presented with craniofacial dysmorphic features, hypotonia, and a developmental delay. Array CGH revealed two to three copies of 12p in patient 1 and three copies in patient 2. FISH analysis showed trisomy or tetrasomy 12p. Patient 3, who had clinical features comparable to those of patients 1 and 2, was diagnosed by using FISH analysis alone. Here, we report three patients with mosaic tetrasomy 12p. There have been only reported cases diagnosed by chromosome analysis and FISH analysis on skin fibroblast or amniotic fluid. To our knowledge, patient 1 was the first case diagnosed by using array CGH performed on peripheral lymphocytes in Korea.

  5. Integrated Analysis of Genome-Wide Copy Number Alterations and Gene Expression Profiling of Lung Cancer in Xuanwei, China

    Science.gov (United States)

    Zhang, Yanliang; Xue, Qiuyue; Pan, Guoqing; Meng, Qing H.; Tuo, Xiaoyu; Cai, Xuemei; Chen, Zhenghui; Li, Ya; Huang, Tao; Duan, Xincen; Duan, Yong

    2017-01-01

    Objectives Lung cancer in Xuanwei (LCXW), China, is known throughout the world for its distinctive characteristics, but little is known about its pathogenesis. The purpose of this study was to screen potential novel “driver genes” in LCXW. Methods Genome-wide DNA copy number alterations (CNAs) were detected by array-based comparative genomic hybridization and differentially expressed genes (DEGs) by gene expression microarrays in 8 paired LCXW and non-cancerous lung tissues. Candidate driver genes were screened by integrated analysis of CNAs and DEGs. The candidate genes were further validated by real-time quantitative polymerase chain reaction. Results Large numbers of CNAs and DEGs were detected, respectively. Some of the most frequently occurring CNAs included gains at 5p15.33-p15.32, 5p15.1-p14.3, and 5p14.3-p14.2 and losses at 11q24.3, 21q21.1, 21q22.12-q22.13, and 21q22.2. Integrated analysis of CNAs and DEGs identified 24 candidate genes with frequent copy number gains and concordant upregulation, which were considered potential oncogenes, including CREB3L4, TRIP13, and CCNE2. In addition, the analysis identified 19 candidate genes with a negative association between copy number change and expression change, considered potential tumor suppressor genes, including AHRR, NKD2, and KLF10. One of the most studied oncogenes, MYC, may not play a carcinogenic role in LCXW. Conclusions This integrated analysis of CNAs and DEGs identified several potential novel LCXW-related genes, laying an important foundation for further research on the pathogenesis of LCXW and identification of novel biomarkers or therapeutic targets. PMID:28056099

  6. Multidimensional gene set analysis of genomic data.

    Directory of Open Access Journals (Sweden)

    David Montaner

    Full Text Available Understanding the functional implications of changes in gene expression, mutations, etc., is the aim of most genomic experiments. To achieve this, several functional profiling methods have been proposed. Such methods study the behaviour of different gene modules (e.g. gene ontology terms in response to one particular variable (e.g. differential gene expression. In spite to the wealth of information provided by functional profiling methods, a common limitation to all of them is their inherent unidimensional nature. In order to overcome this restriction we present a multidimensional logistic model that allows studying the relationship of gene modules with different genome-scale measurements (e.g. differential expression, genotyping association, methylation, copy number alterations, heterozygosity, etc. simultaneously. Moreover, the relationship of such functional modules with the interactions among the variables can also be studied, which produces novel results impossible to be derived from the conventional unidimensional functional profiling methods. We report sound results of gene sets associations that remained undetected by the conventional one-dimensional gene set analysis in several examples. Our findings demonstrate the potential of the proposed approach for the discovery of new cell functionalities with complex dependences on more than one variable.

  7. Function analysis of unknown genes

    DEFF Research Database (Denmark)

    Rogowska-Wrzesinska, A.

    2002-01-01

    leading to decreased growth rate, decreased glucose metabolism, decreased amino acid and protein synthesis and increased protein degradation. Some of these responses define a new type of stress that results from changes in the internal cell environment by overexpression of a membrane protein. Chapter 5...... that have been post-translationally modified by N- or C-terminal truncation and we show that this protein processing is not random and shows a specific pattern for a given yeast strain. Chapter 7 illustrates the construction of yeast proteome database and its potential application in characterising yeast...... analysis is a powerful tool to study yeast proteome and the complex proteome database gives a broad view on the molecular cell biology of yeast. The global database approach allows combining proteome data from different mutants and experiment conditions (e.g. heat stress, phosphate labelling, N...

  8. 75 FR 32484 - Array-Based Cytogenetic Tests: Questions on Performance Evaluation, Result Reporting and...

    Science.gov (United States)

    2010-06-08

    ... Performance Evaluation, Result Reporting and Interpretation. The purpose of the public meeting is to seek input on challenges related to performance evaluation, determination of clinical significance, result... HUMAN SERVICES Food and Drug Administration Array-Based Cytogenetic Tests: Questions on...

  9. Genome-wide Analysis of Gene Regulation

    DEFF Research Database (Denmark)

    Chen, Yun

    IP-seq and small RNA-seq, we delineated the landscape of the promoters with bidirectional transcriptions that yield steady-state RNA in only one directions (Paper III). A subsequent motif analysis enabled us to uncover specific DNA signals – early polyA sites – that make RNA on the reverse strand sensitive...... they regulated or if the sites had global elevated usage rates by multiple TFs. Using RNA-seq, 5’end-seq in combination with depletion of 5’exonuclease as well as nonsensemediated decay (NMD) factors, we systematically analyzed NMD substrates as well as their degradation intermediates in human cells (Paper V......). Gene enrichment analysis on the detected NMD substrates revealed an unappreciated NMD-based regulatory mechanism of the genes hosting multiple intronic snoRNAs, which can facilitate differential expression of individual snoRNAs from a single host gene locus. Finally, supported by RNA-seq and small RNA-seq...

  10. Analysis of pools of targeted Salmonella deletion mutants identifies novel genes affecting fitness during competitive infection in mice.

    Directory of Open Access Journals (Sweden)

    Carlos A Santiviago

    2009-07-01

    Full Text Available Pools of mutants of minimal complexity but maximal coverage of genes of interest facilitate screening for genes under selection in a particular environment. We constructed individual deletion mutants in 1,023 Salmonella enterica serovar Typhimurium genes, including almost all genes found in Salmonella but not in related genera. All mutations were confirmed simultaneously using a novel amplification strategy to produce labeled RNA from a T7 RNA polymerase promoter, introduced during the construction of each mutant, followed by hybridization of this labeled RNA to a Typhimurium genome tiling array. To demonstrate the ability to identify fitness phenotypes using our pool of mutants, the pool was subjected to selection by intraperitoneal injection into BALB/c mice and subsequent recovery from spleens. Changes in the representation of each mutant were monitored using T7 transcripts hybridized to a novel inexpensive minimal microarray. Among the top 120 statistically significant spleen colonization phenotypes, more than 40 were mutations in genes with no previously known role in this model. Fifteen phenotypes were tested using individual mutants in competitive assays of intraperitoneal infection in mice and eleven were confirmed, including the first two examples of attenuation for sRNA mutants in Salmonella. We refer to the method as Array-based analysis of cistrons under selection (ABACUS.

  11. Analysis of pools of targeted Salmonella deletion mutants identifies novel genes affecting fitness during competitive infection in mice.

    Directory of Open Access Journals (Sweden)

    Carlos A Santiviago

    2009-07-01

    Full Text Available Pools of mutants of minimal complexity but maximal coverage of genes of interest facilitate screening for genes under selection in a particular environment. We constructed individual deletion mutants in 1,023 Salmonella enterica serovar Typhimurium genes, including almost all genes found in Salmonella but not in related genera. All mutations were confirmed simultaneously using a novel amplification strategy to produce labeled RNA from a T7 RNA polymerase promoter, introduced during the construction of each mutant, followed by hybridization of this labeled RNA to a Typhimurium genome tiling array. To demonstrate the ability to identify fitness phenotypes using our pool of mutants, the pool was subjected to selection by intraperitoneal injection into BALB/c mice and subsequent recovery from spleens. Changes in the representation of each mutant were monitored using T7 transcripts hybridized to a novel inexpensive minimal microarray. Among the top 120 statistically significant spleen colonization phenotypes, more than 40 were mutations in genes with no previously known role in this model. Fifteen phenotypes were tested using individual mutants in competitive assays of intraperitoneal infection in mice and eleven were confirmed, including the first two examples of attenuation for sRNA mutants in Salmonella. We refer to the method as Array-based analysis of cistrons under selection (ABACUS.

  12. Molecular analysis of the glucocerebrosidase gene locus

    Energy Technology Data Exchange (ETDEWEB)

    Winfield, S.L.; Martin, B.M.; Fandino, A. [Clinical Neuroscience Branch, Bethesda, MD (United States)] [and others

    1994-09-01

    Gaucher disease is due to a deficiency in the activity of the lysosomal enzyme glucocerebrosidase. Both the functional gene for this enzyme and a pseudogene are located in close proximity on chromosome 1q21. Analysis of the mutations present in patient samples has suggested interaction between the functional gene and the pseudogene in the origin of mutant genotypes. To investigate the involvement of regions flanking the functional gene and pseudogene in the origin of mutations found in Gaucher disease, a YAC clone containing DNA from this locus has been subcloned and characterized. The original YAC containing {approximately}360 kb was truncated with the use of fragmentation plasmids to about 85 kb. A lambda library derived from this YAC was screened to obtain clones containing glucocerebrosidase sequences. PCR amplification was used to identify subclones containing 5{prime}, central, or 3{prime} sequences of the functional gene or of the pseudogene. Clones spanning the entire distance from the last exon of the functional gene to intron 1 of the pseudogene, the 5{prime} end of the functional gene and 16 kb of 5{prime} flanking region and approximately 15 kb of 3{prime} flanking region of the pseudogene were sequenced. Sequence data from 48 kb of intergenic and flanking regions of the glucocerebrosidase gene and its pseudogene has been generated. A large number of Alu sequences and several simple repeats have been found. Two of these repeats exhibit fragment length polymorphism. There is almost 100% homology between the 3{prime} flanking regions of the functional gene and the pseudogene, extending to about 4 kb past the termination codons. A much lower degree of homology is observed in the 5{prime} flanking region. Patient samples are currently being screened for polymorphisms in these flanking regions.

  13. 微阵列比较基因组杂交技术在自然流产遗传学分析中的应用%Application of array-based comparative genomic hybridization technique in genetic analysis of ;patients with spontaneous abortion

    Institute of Scientific and Technical Information of China (English)

    楚艳; 吴东; 侯巧芳; 霍晓东; 高越; 王涛; 王红丹; 杨艳丽; 廖世秀

    2016-01-01

    目的:探讨微阵列比较基因组杂交(array-CGH)技术在自然流产组织染色体分析中的应用,为自然流产的遗传咨询和临床诊治提供指导。方法选取2013年11月至2016年1月在河南省人民医院就诊的自然流产患者382例,收集流产绒毛或胎儿组织,采用array-CGH技术对流产绒毛或胎儿组织的全基因组拷贝数进行检测,并同时行细胞培养和传统G显带染色体核型分析,比较G显带染色体核型分析及array-CGH的结果。结果 array-CGH技术成功获得结果382例,检测成功率为100.0%(382/382),染色体异常检出率为46.6%(178/382);染色体核型分析技术成功获得结果281例,检测成功率为73.6%(281/382),染色体异常检出率为40.2%(113/281);array-CGH均高于染色体核型分析技术。array-CGH检测出的178例染色体异常中,染色体数目异常163例(91.6%,163/178);染色体结构异常15例(8.4%,15/178),其中10例同时出现了染色体微重复和微缺失的流产胚胎中有4例被证实父母一方为染色体平衡易位携带者。染色体核型分析检出的113例染色体异常中,染色体数目异常108例(95.6%,108/113),染色体结构异常5例(4.4%,5/113)。两种方法的结果不一致有3例,其中2例为三倍体、1例为性染色体低比例嵌合,array-CGH均漏检为正常。结论 array-CGH技术用于自然流产胚胎组织的染色体分析成功率高,对标本的取材要求远低于传统染色体核型分析技术,且分辨率高、准确快速,可以作为流产组织遗传学诊断的一线技术。%Objective To investigate the value of array-based comparative genomic hybridization (array-CGH) technique for the detection of chromosomal analysis of miscarried embryo, and to provide genetic counseling for couples with spontaneous abortion. Methods Totally 382 patients who underwent miscarriage were enrolled in this study. All

  14. A Spatiotemporal Indexing Approach for Efficient Processing of Big Array-Based Climate Data with MapReduce

    Science.gov (United States)

    Li, Zhenlong; Hu, Fei; Schnase, John L.; Duffy, Daniel Q.; Lee, Tsengdar; Bowen, Michael K.; Yang, Chaowei

    2016-01-01

    Climate observations and model simulations are producing vast amounts of array-based spatiotemporal data. Efficient processing of these data is essential for assessing global challenges such as climate change, natural disasters, and diseases. This is challenging not only because of the large data volume, but also because of the intrinsic high-dimensional nature of geoscience data. To tackle this challenge, we propose a spatiotemporal indexing approach to efficiently manage and process big climate data with MapReduce in a highly scalable environment. Using this approach, big climate data are directly stored in a Hadoop Distributed File System in its original, native file format. A spatiotemporal index is built to bridge the logical array-based data model and the physical data layout, which enables fast data retrieval when performing spatiotemporal queries. Based on the index, a data-partitioning algorithm is applied to enable MapReduce to achieve high data locality, as well as balancing the workload. The proposed indexing approach is evaluated using the National Aeronautics and Space Administration (NASA) Modern-Era Retrospective Analysis for Research and Applications (MERRA) climate reanalysis dataset. The experimental results show that the index can significantly accelerate querying and processing (10 speedup compared to the baseline test using the same computing cluster), while keeping the index-to-data ratio small (0.0328). The applicability of the indexing approach is demonstrated by a climate anomaly detection deployed on a NASA Hadoop cluster. This approach is also able to support efficient processing of general array-based spatiotemporal data in various geoscience domains without special configuration on a Hadoop cluster.

  15. High-resolution dynamic pressure sensor array based on piezo-phototronic effect tuned photoluminescence imaging.

    Science.gov (United States)

    Peng, Mingzeng; Li, Zhou; Liu, Caihong; Zheng, Qiang; Shi, Xieqing; Song, Ming; Zhang, Yang; Du, Shiyu; Zhai, Junyi; Wang, Zhong Lin

    2015-03-24

    A high-resolution dynamic tactile/pressure display is indispensable to the comprehensive perception of force/mechanical stimulations such as electronic skin, biomechanical imaging/analysis, or personalized signatures. Here, we present a dynamic pressure sensor array based on pressure/strain tuned photoluminescence imaging without the need for electricity. Each sensor is a nanopillar that consists of InGaN/GaN multiple quantum wells. Its photoluminescence intensity can be modulated dramatically and linearly by small strain (0-0.15%) owing to the piezo-phototronic effect. The sensor array has a high pixel density of 6350 dpi and exceptional small standard deviation of photoluminescence. High-quality tactile/pressure sensing distribution can be real-time recorded by parallel photoluminescence imaging without any cross-talk. The sensor array can be inexpensively fabricated over large areas by semiconductor product lines. The proposed dynamic all-optical pressure imaging with excellent resolution, high sensitivity, good uniformity, and ultrafast response time offers a suitable way for smart sensing, micro/nano-opto-electromechanical systems.

  16. Gene expression analysis of flax seed development

    Directory of Open Access Journals (Sweden)

    Sharpe Andrew

    2011-04-01

    Full Text Available Abstract Background Flax, Linum usitatissimum L., is an important crop whose seed oil and stem fiber have multiple industrial applications. Flax seeds are also well-known for their nutritional attributes, viz., omega-3 fatty acids in the oil and lignans and mucilage from the seed coat. In spite of the importance of this crop, there are few molecular resources that can be utilized toward improving seed traits. Here, we describe flax embryo and seed development and generation of comprehensive genomic resources for the flax seed. Results We describe a large-scale generation and analysis of expressed sequences in various tissues. Collectively, the 13 libraries we have used provide a broad representation of genes active in developing embryos (globular, heart, torpedo, cotyledon and mature stages seed coats (globular and torpedo stages and endosperm (pooled globular to torpedo stages and genes expressed in flowers, etiolated seedlings, leaves, and stem tissue. A total of 261,272 expressed sequence tags (EST (GenBank accessions LIBEST_026995 to LIBEST_027011 were generated. These EST libraries included transcription factor genes that are typically expressed at low levels, indicating that the depth is adequate for in silico expression analysis. Assembly of the ESTs resulted in 30,640 unigenes and 82% of these could be identified on the basis of homology to known and hypothetical genes from other plants. When compared with fully sequenced plant genomes, the flax unigenes resembled poplar and castor bean more than grape, sorghum, rice or Arabidopsis. Nearly one-fifth of these (5,152 had no homologs in sequences reported for any organism, suggesting that this category represents genes that are likely unique to flax. Digital analyses revealed gene expression dynamics for the biosynthesis of a number of important seed constituents during seed development. Conclusions We have developed a foundational database of expressed sequences and collection of plasmid

  17. Hierarchical Parallelization of Gene Differential Association Analysis

    Directory of Open Access Journals (Sweden)

    Dwarkadas Sandhya

    2011-09-01

    Full Text Available Abstract Background Microarray gene differential expression analysis is a widely used technique that deals with high dimensional data and is computationally intensive for permutation-based procedures. Microarray gene differential association analysis is even more computationally demanding and must take advantage of multicore computing technology, which is the driving force behind increasing compute power in recent years. In this paper, we present a two-layer hierarchical parallel implementation of gene differential association analysis. It takes advantage of both fine- and coarse-grain (with granularity defined by the frequency of communication parallelism in order to effectively leverage the non-uniform nature of parallel processing available in the cutting-edge systems of today. Results Our results show that this hierarchical strategy matches data sharing behavior to the properties of the underlying hardware, thereby reducing the memory and bandwidth needs of the application. The resulting improved efficiency reduces computation time and allows the gene differential association analysis code to scale its execution with the number of processors. The code and biological data used in this study are downloadable from http://www.urmc.rochester.edu/biostat/people/faculty/hu.cfm. Conclusions The performance sweet spot occurs when using a number of threads per MPI process that allows the working sets of the corresponding MPI processes running on the multicore to fit within the machine cache. Hence, we suggest that practitioners follow this principle in selecting the appropriate number of MPI processes and threads within each MPI process for their cluster configurations. We believe that the principles of this hierarchical approach to parallelization can be utilized in the parallelization of other computationally demanding kernels.

  18. Functional analysis of plastid-encoded genes

    OpenAIRE

    Swiatek, Magdalena

    2002-01-01

    Plastid chromosomes from the variety of plant species contain several conserved open reading frames of unknown function, which most probably represent functional genes. The primary aim of this thesis was the analysis of the role of two such ORFs, designated ycfs or hypothetical chloroplast reading frames, namely ycf9 (ORF62) and ycf10 (ORF229, cemA). Both were analyzed in Nicotiana tabacum (tobacco) via their inactivation using biolistic plastid transformation. A new experiment...

  19. Computational Analysis of PTEN Gene Mutation

    Directory of Open Access Journals (Sweden)

    Siew-Kien Mah

    2012-01-01

    Full Text Available Post-genomic data can be efficiently analyzed using computational tools. It has the advantage over the biochemical and biophysical methods in term of higher coverage. In this research, we adopted a computational analysis on PTEN gene mutation.  Mutation in PTEN is responsible for many human diseases. The results of this research provide insights into the protein domains of PTEN and the distribution of mutation.

  20. Globaltest and GOEAST: two different approaches for Gene Ontology analysis

    NARCIS (Netherlands)

    Hulsegge, B.; Kommadath, A.; Smits, M.A.

    2009-01-01

    Background Gene set analysis is a commonly used method for analysing microarray data by considering groups of functionally related genes instead of individual genes. Here we present the use of two gene set analysis approaches: Globaltest and GOEAST. Globaltest is a method for testing whether sets of

  1. Separate enrichment analysis of pathways for up- and downregulated genes.

    Science.gov (United States)

    Hong, Guini; Zhang, Wenjing; Li, Hongdong; Shen, Xiaopei; Guo, Zheng

    2014-03-06

    Two strategies are often adopted for enrichment analysis of pathways: the analysis of all differentially expressed (DE) genes together or the analysis of up- and downregulated genes separately. However, few studies have examined the rationales of these enrichment analysis strategies. Using both microarray and RNA-seq data, we show that gene pairs with functional links in pathways tended to have positively correlated expression levels, which could result in an imbalance between the up- and downregulated genes in particular pathways. We then show that the imbalance could greatly reduce the statistical power for finding disease-associated pathways through the analysis of all-DE genes. Further, using gene expression profiles from five types of tumours, we illustrate that the separate analysis of up- and downregulated genes could identify more pathways that are really pertinent to phenotypic difference. In conclusion, analysing up- and downregulated genes separately is more powerful than analysing all of the DE genes together.

  2. Transcriptional analysis of Pleurotus ostreatus laccase genes.

    Science.gov (United States)

    Pezzella, Cinzia; Lettera, Vincenzo; Piscitelli, Alessandra; Giardina, Paola; Sannia, Giovanni

    2013-01-01

    Fungal laccases (p-diphenol:oxygen oxidoreductase; EC 1.10.3.2) are multi-copper-containing oxidases that catalyse the oxidation of a great variety of phenolic compounds and aromatic amines through simultaneous reduction of molecular oxygen to water. Fungi generally produce several laccase isoenzymes encoded by complex multi-gene families. The Pleurotus ostreatus genome encodes 11 putative laccase coding genes, and only six different laccase isoenzymes have been isolated and characterised so far. Laccase expression was found to be regulated by culture conditions and developmental stages even if the redundancy of these genes still raises the question about their respective functions in vivo. In this context, laccase transcript profiling analysis has been used to unravel the physiological role played by the different isoforms produced by P. ostreatus. Even if reported results depict a complex picture of the transcriptional responses exhibited by the analysed laccase genes, they were allowed to speculate on the isoform role in vivo. Among the produced laccases, LACC10 (POXC) seems to play a major role during vegetative growth, since its transcription is downregulated when the fungus starts the fructification process. Furthermore, a new tessera has been added to the puzzling mosaic of the heterodimeric laccase LACC2 (POXA3). LACC2 small subunit seems to play an additional physiological role during fructification, beside that of LACC2 complex activation/stabilisation.

  3. Analysis of gene expression in rabbit muscle

    Directory of Open Access Journals (Sweden)

    Alena Gálová

    2014-02-01

    Full Text Available Increasing consumer knowledge of the link between diet and health has raised the demand for high quality food. Meat and meat products may be considered as irreplaceable in human nutrition. Breeding livestock to higher content of lean meat and the use of modern hybrids entails problems with the quality of meat. Analysing of livestock genomes could get us a great deal of important information, which may significantly affect the improvement process. Domestic animals are invaluable resources for study of the molecular architecture of complex traits. Although the mapping of quantitative trait loci (QTL responsible for economically important traits in domestic animals has achieved remarkable results in recent decades, not all of the genetic variation in the complex traits has been captured because of the low density of markers used in QTL mapping studies. The genome wide association study (GWAS, which utilizes high-density single-nucleotide polymorphism (SNP, provides a new way to tackle this issue. New technologies now allow producing microarrays containing thousands of hybridization probes on a single membrane or other solid support. We used microarray analysis to study gene expression in rabbit muscle during different developmental age stages. The outputs from GeneSpring GX sotware are presented in this work. After the evaluation of gene expression in rabbits, will be selected genes of interest in relation to meat quality parameters and will be further analyzed by the available methods of molecular biology and genetics.

  4. Gene expression analysis on small numbers of invasive cells collected by chemotaxis from primary mammary tumors of the mouse

    Directory of Open Access Journals (Sweden)

    Segall Jeffrey E

    2003-08-01

    Full Text Available Abstract Background cDNA microarrays have the potential to identify the genes involved in invasion and metastasis. However, when used with whole tumor tissue, the results average the expression patterns of different cell types. We have combined chemotaxis-based cell collection of the invasive subpopulation of cells within the primary tumor with array-based gene expression analysis to identify the genes necessary for the process of carcinoma cell invasion. Results Invasive cells were collected from live primary tumors using microneedles containing chemotactic growth factors to mimic chemotactic signals thought to be present in the primary tumor. When used with mammary tumors of rats and mice, carcinoma cells and macrophages constitute the invasive cell population. Microbeads conjugated with monoclonal anti-CD11b (Mac-1α antibodies were used to separate macrophages from carcinoma cells. We utilized PCR-based cDNA amplification from small number of cells and compared it to the quality and complexity of conventionally generated cDNA to determine if amplified cDNA could be used with fidelity for array analysis of this cell population. These techniques showed a very high level of correlation indicating that the PCR based amplification technique yields a cDNA population that resembles, with high fidelity, the original template population present in the small number of cells used to prepare the cDNA for use with the chip. Conclusions The specific collection of invasive cells from a primary tumor and the analysis of gene expression in these cells are is now possible. By further comparing the gene expression patterns of cells collected by invasion into microneedles with that of carcinoma cells obtained from the whole primary tumor, the blood, and whole metastatic tumors, genes that contribute to the invasive process in carcinoma cells may be identified.

  5. Gene expression analysis identifies global gene dosage sensitivity in cancer

    DEFF Research Database (Denmark)

    Fehrmann, Rudolf S. N.; Karjalainen, Juha M.; Krajewska, Malgorzata;

    2015-01-01

    expression. We reanalyzed 77,840 expression profiles and observed a limited set of 'transcriptional components' that describe well-known biology, explain the vast majority of variation in gene expression and enable us to predict the biological function of genes. On correcting expression profiles...... for these components, we observed that the residual expression levels (in 'functional genomic mRNA' profiling) correlated strongly with copy number. DNA copy number correlated positively with expression levels for 99% of all abundantly expressed human genes, indicating global gene dosage sensitivity. By applying...

  6. Genomic profiling of oral squamous cell carcinoma by array-based comparative genomic hybridization.

    Directory of Open Access Journals (Sweden)

    Shunichi Yoshioka

    Full Text Available We designed a study to investigate genetic relationships between primary tumors of oral squamous cell carcinoma (OSCC and their lymph node metastases, and to identify genomic copy number aberrations (CNAs related to lymph node metastasis. For this purpose, we collected a total of 42 tumor samples from 25 patients and analyzed their genomic profiles by array-based comparative genomic hybridization. We then compared the genetic profiles of metastatic primary tumors (MPTs with their paired lymph node metastases (LNMs, and also those of LNMs with non-metastatic primary tumors (NMPTs. Firstly, we found that although there were some distinctive differences in the patterns of genomic profiles between MPTs and their paired LNMs, the paired samples shared similar genomic aberration patterns in each case. Unsupervised hierarchical clustering analysis grouped together 12 of the 15 MPT-LNM pairs. Furthermore, similarity scores between paired samples were significantly higher than those between non-paired samples. These results suggested that MPTs and their paired LNMs are composed predominantly of genetically clonal tumor cells, while minor populations with different CNAs may also exist in metastatic OSCCs. Secondly, to identify CNAs related to lymph node metastasis, we compared CNAs between grouped samples of MPTs and LNMs, but were unable to find any CNAs that were more common in LNMs. Finally, we hypothesized that subpopulations carrying metastasis-related CNAs might be present in both the MPT and LNM. Accordingly, we compared CNAs between NMPTs and LNMs, and found that gains of 7p, 8q and 17q were more common in the latter than in the former, suggesting that these CNAs may be involved in lymph node metastasis of OSCC. In conclusion, our data suggest that in OSCCs showing metastasis, the primary and metastatic tumors share similar genomic profiles, and that cells in the primary tumor may tend to metastasize after acquiring metastasis-associated CNAs.

  7. Detection of genomic imbalances in microdissected Hodgkin and Reed-Sternberg cells of classical Hodgkin's lymphoma by array-based comparative genomic hybridization.

    Science.gov (United States)

    Hartmann, Sylvia; Martin-Subero, José I; Gesk, Stefan; Hüsken, Julia; Giefing, Maciej; Nagel, Inga; Riemke, Jennifer; Chott, Andreas; Klapper, Wolfram; Parrens, Marie; Merlio, Jean-Philippe; Küppers, Ralf; Bräuninger, Andreas; Siebert, Reiner; Hansmann, Martin-Leo

    2008-09-01

    Cytogenetic analysis of classical Hodgkin's lymphoma is limited by the low content of the neoplastic Hodgkin-Reed-Sternberg cells in the affected tissues. However, available cytogenetic data point to an extreme karyotype complexity. To obtain insights into chromosomal imbalances in classical Hodgkin's lymphoma, we applied array-based comparative genomic hybridization (array comparative genomic hybridization) using DNA from microdissected Hodgkin-Reed-Sternberg cells. To avoid biases introduced by DNA amplification for array comparative genomic hybridization, cHL cases rich in Hodgkin-Reed-Sternberg cells were selected. DNA obtained from approximately 100,000 microdissected Hodgkin-Reed-Sternberg cells of each of ten classical Hodgkin's lymphoma cases was hybridized onto commercial 105 K oligonucleotide comparative genomic hybridization microarrays. Selected imbalances were confirmed by interphase cytogenetics and quantitative polymerase chain reaction analysis and further studied in an independent series of classical Hodgkin's lymphoma. Gains identified in at least five cHL affected 2p12-16, 5q15-23, 6p22, 8q13, 8q24, 9p21-24, 9q34, 12q13-14, 17q12, 19p13, 19q13 and 20q11 whereas losses recurrent in at least five cases involved Xp21, 6q23-24 and 13q22. Copy number changes of selected genes and a small deletion (156 kb) of the CDKN2B (p15) gene were confirmed by interphase cytogenetics and polymerase chain reaction analysis, respectively. Several gained regions included genes constitutively expressed in cHL. Among these, gains of STAT6 (12q13), NOTCH1 (9q34) and JUNB (19p13) were present in additional cHL with the usual low Hodgkin-Reed-Sternberg cell content. The present study demonstrates that array comparative genomic hybridization of microdissected Hodgkin-Reed-Sternberg cells is suitable for identifying and characterizing chromosomal imbalances. Regions affected by genomic changes in Hodgkin-Reed-Sternberg cells recurrently include genes constitutively

  8. Gene coexpression network analysis as a source of functional annotation for rice genes.

    Directory of Open Access Journals (Sweden)

    Kevin L Childs

    Full Text Available With the existence of large publicly available plant gene expression data sets, many groups have undertaken data analyses to construct gene coexpression networks and functionally annotate genes. Often, a large compendium of unrelated or condition-independent expression data is used to construct gene networks. Condition-dependent expression experiments consisting of well-defined conditions/treatments have also been used to create coexpression networks to help examine particular biological processes. Gene networks derived from either condition-dependent or condition-independent data can be difficult to interpret if a large number of genes and connections are present. However, algorithms exist to identify modules of highly connected and biologically relevant genes within coexpression networks. In this study, we have used publicly available rice (Oryza sativa gene expression data to create gene coexpression networks using both condition-dependent and condition-independent data and have identified gene modules within these networks using the Weighted Gene Coexpression Network Analysis method. We compared the number of genes assigned to modules and the biological interpretability of gene coexpression modules to assess the utility of condition-dependent and condition-independent gene coexpression networks. For the purpose of providing functional annotation to rice genes, we found that gene modules identified by coexpression analysis of condition-dependent gene expression experiments to be more useful than gene modules identified by analysis of a condition-independent data set. We have incorporated our results into the MSU Rice Genome Annotation Project database as additional expression-based annotation for 13,537 genes, 2,980 of which lack a functional annotation description. These results provide two new types of functional annotation for our database. Genes in modules are now associated with groups of genes that constitute a collective functional

  9. Analysis of Duplicate Genes in Soybean

    Institute of Scientific and Technical Information of China (English)

    C.M. Cai; K.J. Van; M.Y. Kim; S.H. Lee

    2007-01-01

    @@ Gene duplication is a major determinant of the size and gene complement of eukaryotic genomes (Lockton and Gaut, 2005). There are a number of different ways in which duplicate genes can arise (Sankoff, 2001), but the most spectacular method of gene duplication may be whole genome duplication via polyploidization.

  10. Genome-wide analysis of homeobox genes from Mesobuthus martensii reveals Hox gene duplication in scorpions.

    Science.gov (United States)

    Di, Zhiyong; Yu, Yao; Wu, Yingliang; Hao, Pei; He, Yawen; Zhao, Huabin; Li, Yixue; Zhao, Guoping; Li, Xuan; Li, Wenxin; Cao, Zhijian

    2015-06-01

    Homeobox genes belong to a large gene group, which encodes the famous DNA-binding homeodomain that plays a key role in development and cellular differentiation during embryogenesis in animals. Here, one hundred forty-nine homeobox genes were identified from the Asian scorpion, Mesobuthus martensii (Chelicerata: Arachnida: Scorpiones: Buthidae) based on our newly assembled genome sequence with approximately 248 × coverage. The identified homeobox genes were categorized into eight classes including 82 families: 67 ANTP class genes, 33 PRD genes, 11 LIM genes, five POU genes, six SINE genes, 14 TALE genes, five CUT genes, two ZF genes and six unclassified genes. Transcriptome data confirmed that more than half of the genes were expressed in adults. The homeobox gene diversity of the eight classes is similar to the previously analyzed Mandibulata arthropods. Interestingly, it is hypothesized that the scorpion M. martensii may have two Hox clusters. The first complete genome-wide analysis of homeobox genes in Chelicerata not only reveals the repertoire of scorpion, arachnid and chelicerate homeobox genes, but also shows some insights into the evolution of arthropod homeobox genes.

  11. Comprehensive genome characterization of solitary fibrous tumors using high-resolution array-based comparative genomic hybridization.

    Science.gov (United States)

    Bertucci, François; Bouvier-Labit, Corinne; Finetti, Pascal; Adélaïde, José; Metellus, Philippe; Mokhtari, Karima; Decouvelaere, Anne-Valérie; Miquel, Catherine; Jouvet, Anne; Figarella-Branger, Dominique; Pedeutour, Florence; Chaffanet, Max; Birnbaum, Daniel

    2013-02-01

    Solitary fibrous tumors (SFTs) are rare spindle cell tumors with limited therapeutic options. Their molecular basis is poorly known. No consistent cytogenetic abnormality has been reported. We used high-resolution whole-genome array-based comparative genomic hybridization (Agilent 244K oligonucleotide chips) to profile 47 samples, meningeal in >75% of cases. Few copy number aberrations (CNAs) were observed. Sixty-eight percent of samples did not show any gene CNA after exclusion of probes located in regions with referenced copy number variation (CNV). Only low-level CNAs were observed. The genomic profiles were very homogeneous among samples. No molecular class was revealed by clustering of DNA copy numbers. All cases displayed a "simplex" profile. No recurrent CNA was identified. Imbalances occurring in >20%, such as the gain of 8p11.23-11.22 region, contained known CNVs. The 13q14.11-13q31.1 region (lost in 4% of cases) was the largest altered region and contained the lowest percentage of genes with referenced CNVs. A total of 425 genes without CNV showed copy number transition in at least one sample, but only but only 1 in at least 10% of samples. The genomic profiles of meningeal and extra-meningeal cases did not show any differences.

  12. Sequencing and Gene Expression Analysis of Leishmania tropica LACK Gene.

    Directory of Open Access Journals (Sweden)

    Nour Hammoudeh

    2014-12-01

    Full Text Available Leishmania Homologue of receptors for Activated C Kinase (LACK antigen is a 36-kDa protein, which provokes a very early immune response against Leishmania infection. There are several reports on the expression of LACK through different life-cycle stages of genus Leishmania, but only a few of them have focused on L.tropica.The present study provides details of the cloning, DNA sequencing and gene expression of LACK in this parasite species. First, several local isolates of Leishmania parasites were typed in our laboratory using PCR technique to verify of Leishmania parasite species. After that, LACK gene was amplified and cloned into a vector for sequencing. Finally, the expression of this molecule in logarithmic and stationary growth phase promastigotes, as well as in amastigotes, was evaluated by Reverse Transcription-PCR (RT-PCR technique.The typing result confirmed that all our local isolates belong to L.tropica. LACK gene sequence was determined and high similarity was observed with the sequences of other Leishmania species. Furthermore, the expression of LACK gene in both promastigotes and amastigotes forms was confirmed.Overall, the data set the stage for future studies of the properties and immune role of LACK gene products.

  13. RNA amplification for successful gene profiling analysis

    Directory of Open Access Journals (Sweden)

    Wang Ena

    2005-07-01

    Full Text Available Abstract The study of clinical samples is often limited by the amount of material available to study. While proteins cannot be multiplied in their natural form, DNA and RNA can be amplified from small specimens and used for high-throughput analyses. Therefore, genetic studies offer the best opportunity to screen for novel insights of human pathology when little material is available. Precise estimates of DNA copy numbers in a given specimen are necessary. However, most studies investigate static variables such as the genetic background of patients or mutations within pathological specimens without a need to assess proportionality of expression among different genes throughout the genome. Comparative genomic hybridization of DNA samples represents a crude exception to this rule since genomic amplification or deletion is compared among different specimens directly. For gene expression analysis, however, it is critical to accurately estimate the proportional expression of distinct RNA transcripts since such proportions directly govern cell function by modulating protein expression. Furthermore, comparative estimates of relative RNA expression at different time points portray the response of cells to environmental stimuli, indirectly informing about broader biological events affecting a particular tissue in physiological or pathological conditions. This cognitive reaction of cells is similar to the detection of electroencephalographic patterns which inform about the status of the brain in response to external stimuli. As our need to understand human pathophysiology at the global level increases, the development and refinement of technologies for high fidelity messenger RNA amplification have become the focus of increasing interest during the past decade. The need to increase the abundance of RNA has been met not only for gene specific amplification, but, most importantly for global transcriptome wide, unbiased amplification. Now gene

  14. Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data

    Directory of Open Access Journals (Sweden)

    Tintle Nathan L

    2012-08-01

    Full Text Available Abstract Background Statistical analyses of whole genome expression data require functional information about genes in order to yield meaningful biological conclusions. The Gene Ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG are common sources of functionally grouped gene sets. For bacteria, the SEED and MicrobesOnline provide alternative, complementary sources of gene sets. To date, no comprehensive evaluation of the data obtained from these resources has been performed. Results We define a series of gene set consistency metrics directly related to the most common classes of statistical analyses for gene expression data, and then perform a comprehensive analysis of 3581 Affymetrix® gene expression arrays across 17 diverse bacteria. We find that gene sets obtained from GO and KEGG demonstrate lower consistency than those obtained from the SEED and MicrobesOnline, regardless of gene set size. Conclusions Despite the widespread use of GO and KEGG gene sets in bacterial gene expression data analysis, the SEED and MicrobesOnline provide more consistent sets for a wide variety of statistical analyses. Increased use of the SEED and MicrobesOnline gene sets in the analysis of bacterial gene expression data may improve statistical power and utility of expression data.

  15. Genome-wide analysis of DNA methylation, copy number variation, and gene expression in monozygotic twins discordant for primary biliary cirrhosis

    Directory of Open Access Journals (Sweden)

    Carlo eSelmi

    2014-03-01

    Full Text Available Primary biliary cirrhosis (PBC is an uncommon autoimmune disease with a homogeneous clinical phenotype that reflects incomplete disease concordance in monozygotic (MZ twins. We have taken advantage of a unique collection consisting of genomic DNA and mRNA from peripheral blood cells of female MZ twins (n=3 sets and sisters of similar age (n=8 pairs discordant for disease. We performed a genome-wide study to investigate differences in (i DNA methylation (using a custom tiled 4-plex array containing tiled 50-mers 19,084 randomly chosen methylation sites, (ii copy number variation (CNV (with a chip including markers derived from the 1000 Genomes Project, all three HapMap phases, and recently published studies, and/or (iii gene expression (by whole-genome expression arrays. Based on the results obtained from these three approaches we utilized quantitative PCR to compare the expression of candidate genes. Importantly, our data support consistent differences in discordant twins and siblings for the (i methylation profiles of 60 gene regions, (ii CNV of 10 genes, and (iii the expression of 2 interferon-dependent genes. Quantitative PCR analysis showed that 17 of these genes are differentially expressed in discordant sibling pairs. In conclusion, we report that MZ twins and sisters discordant for PBC manifest particular epigenetic differences and highlight the value of the epigenetic study of twins.

  16. Gene expression profile analysis of type 2 diabetic mouse liver.

    Directory of Open Access Journals (Sweden)

    Fang Zhang

    Full Text Available Liver plays a key role in glucose metabolism and homeostasis, and impaired hepatic glucose metabolism contributes to the development of type 2 diabetes. However, the precise gene expression profile of diabetic liver and its association with diabetes and related diseases are yet to be further elucidated. In this study, we detected the gene expression profile by high-throughput sequencing in 9-week-old normal and type 2 diabetic db/db mouse liver. Totally 12132 genes were detected, and 2627 genes were significantly changed in diabetic mouse liver. Biological process analysis showed that the upregulated genes in diabetic mouse liver were mainly enriched in metabolic processes. Surprisingly, the downregulated genes in diabetic mouse liver were mainly enriched in immune-related processes, although all the altered genes were still mainly enriched in metabolic processes. Similarly, KEGG pathway analysis showed that metabolic pathways were the major pathways altered in diabetic mouse liver, and downregulated genes were enriched in immune and cancer pathways. Analysis of the key enzyme genes in fatty acid and glucose metabolism showed that some key enzyme genes were significantly increased and none of the detected key enzyme genes were decreased. In addition, FunDo analysis showed that liver cancer and hepatitis were most likely to be associated with diabetes. Taken together, this study provides the digital gene expression profile of diabetic mouse liver, and demonstrates the main diabetes-associated hepatic biological processes, pathways, key enzyme genes in fatty acid and glucose metabolism and potential hepatic diseases.

  17. Mutation analysis of the preproghrelin gene

    DEFF Research Database (Denmark)

    Larsen, Lesli H; Gjesing, Anette P; Sørensen, Thorkild I A;

    2005-01-01

    To investigate the preproghrelin gene for variants and their association with obesity and type 2 diabetes.......To investigate the preproghrelin gene for variants and their association with obesity and type 2 diabetes....

  18. DNA micro-array-based identification of bile-responsive genes in Lactobacillus plantarum

    NARCIS (Netherlands)

    Bron, P.A.; Molenaar, D.; Vos, de W.M.; Kleerebezem, M.

    2006-01-01

    The purpose of this study was to determine the global transcriptional response in a food-associated lactic acid bacterium during bile stress. Methods and Results:¿ Clone-based DNA micro-arrays were employed to describe the global transcriptional response of Lactobacillus plantarum WCFS1 towards 0·1%

  19. A three-step workflow procedure for the interpretation of array-based comparative genome hybridization results in patients with idiopathic mental retardation and congenital anomalies.

    Science.gov (United States)

    Poot, Martin; Hochstenbach, Ron

    2010-08-01

    One of the aims of clinical genetics is to identify gene mutations or genomic rearrangements that may underlie complex presentations of phenotypic features, such as multiple congenital malformations and mental retardation. During the decade after publication of the first article on array-based comparative genome hybridization, this technique has supplemented karyotyping as the prime genome-wide screening method in patients with idiopathic multiple congenital malformations and mental retardation. The use of this novel, discovery-based, approach has dramatically increased the detection rate of genomic imbalances. Array-based comparative genome hybridization detects copy number changes in the genome of patients and healthy subjects, some of which may represent phenotypically neutral copy number variations. This prompts the need for properly distinguishing between those copy number changes that may contribute to the clinical phenotype amid a pool of neutral copy number variations. We briefly review the characteristics of copy number changes in relation to their clinical relevance. Second, we discuss several published workflow schemes to identify copy number changes putatively contributing to the phenotype, and third, we propose a three-step procedure aiming to rapidly evaluate copy number changes on a case-by-case basis as to their potential contribution to the phenotype of patients with idiopathic multiple congenital malformations and mental retardation. This workflow is gene-centered and should aid in identification of disease-related candidate genes and in estimating the recurrence risk for the disorder in the family.

  20. Identification of Human HK Genes and Gene Expression Regulation Study in Cancer from Transcriptomics Data Analysis

    Science.gov (United States)

    Zhang, Zhang; Liu, Jingxing; Wu, Jiayan; Yu, Jun

    2013-01-01

    The regulation of gene expression is essential for eukaryotes, as it drives the processes of cellular differentiation and morphogenesis, leading to the creation of different cell types in multicellular organisms. RNA-Sequencing (RNA-Seq) provides researchers with a powerful toolbox for characterization and quantification of transcriptome. Many different human tissue/cell transcriptome datasets coming from RNA-Seq technology are available on public data resource. The fundamental issue here is how to develop an effective analysis method to estimate expression pattern similarities between different tumor tissues and their corresponding normal tissues. We define the gene expression pattern from three directions: 1) expression breadth, which reflects gene expression on/off status, and mainly concerns ubiquitously expressed genes; 2) low/high or constant/variable expression genes, based on gene expression level and variation; and 3) the regulation of gene expression at the gene structure level. The cluster analysis indicates that gene expression pattern is higher related to physiological condition rather than tissue spatial distance. Two sets of human housekeeping (HK) genes are defined according to cell/tissue types, respectively. To characterize the gene expression pattern in gene expression level and variation, we firstly apply improved K-means algorithm and a gene expression variance model. We find that cancer-associated HK genes (a HK gene is specific in cancer group, while not in normal group) are expressed higher and more variable in cancer condition than in normal condition. Cancer-associated HK genes prefer to AT-rich genes, and they are enriched in cell cycle regulation related functions and constitute some cancer signatures. The expression of large genes is also avoided in cancer group. These studies will help us understand which cell type-specific patterns of gene expression differ among different cell types, and particularly for cancer. PMID:23382867

  1. Comprehensive analysis of plant rapid alkalization factor (RALF) genes.

    Science.gov (United States)

    Sharma, Arti; Hussain, Adil; Mun, Bong-Gyu; Imran, Qari Muhammad; Falak, Noreen; Lee, Sang-Uk; Kim, Jae Young; Hong, Jeum Kyu; Loake, Gary John; Ali, Asad; Yun, Byung-Wook

    2016-09-01

    Receptor mediated signal carriers play a critical role in the regulation of plant defense and development. Rapid alkalization factor (RALF) proteins potentially comprise important signaling components which may have a key role in plant biology. The RALF gene family contains large number of genes in several plant species, however, only a few RALF genes have been characterized to date. In this study, an extensive database search identified 39, 43, 34 and 18 RALF genes in Arabidopsis, rice, maize and soybean, respectively. These RALF genes were found to be highly conserved across the 4 plant species. A comprehensive analysis including the chromosomal location, gene structure, subcellular location, conserved motifs, protein structure, protein-ligand interaction and promoter analysis was performed. RALF genes from four plant species were divided into 7 groups based on phylogenetic analysis. In silico expression analysis of these genes, using microarray and EST data, revealed that these genes exhibit a variety of expression patterns. Furthermore, RALF genes showed distinct expression patterns of transcript accumulation in vivo following nitrosative and oxidative stresses in Arabidopsis. Predicted interaction between RALF and heme ligand also showed that RALF proteins may contribute towards transporting or scavenging oxygen moieties. This suggests a possible role for RALF genes during changes in cellular redox status. Collectively, our data provides a valuable resource to prime future research in the role of RALF genes in plant growth and development.

  2. Self-Contained Statistical Analysis of Gene Sets

    Science.gov (United States)

    Cannon, Judy L.; Ricoy, Ulises M.; Johnson, Christopher

    2016-01-01

    Microarrays are a powerful tool for studying differential gene expression. However, lists of many differentially expressed genes are often generated, and unraveling meaningful biological processes from the lists can be challenging. For this reason, investigators have sought to quantify the statistical probability of compiled gene sets rather than individual genes. The gene sets typically are organized around a biological theme or pathway. We compute correlations between different gene set tests and elect to use Fisher’s self-contained method for gene set analysis. We improve Fisher’s differential expression analysis of a gene set by limiting the p-value of an individual gene within the gene set to prevent a small percentage of genes from determining the statistical significance of the entire set. In addition, we also compute dependencies among genes within the set to determine which genes are statistically linked. The method is applied to T-ALL (T-lineage Acute Lymphoblastic Leukemia) to identify differentially expressed gene sets between T-ALL and normal patients and T-ALL and AML (Acute Myeloid Leukemia) patients. PMID:27711232

  3. Time-Course Gene Set Analysis for Longitudinal Gene Expression Data.

    Directory of Open Access Journals (Sweden)

    Boris P Hejblum

    2015-06-01

    Full Text Available Gene set analysis methods, which consider predefined groups of genes in the analysis of genomic data, have been successfully applied for analyzing gene expression data in cross-sectional studies. The time-course gene set analysis (TcGSA introduced here is an extension of gene set analysis to longitudinal data. The proposed method relies on random effects modeling with maximum likelihood estimates. It allows to use all available repeated measurements while dealing with unbalanced data due to missing at random (MAR measurements. TcGSA is a hypothesis driven method that identifies a priori defined gene sets with significant expression variations over time, taking into account the potential heterogeneity of expression within gene sets. When biological conditions are compared, the method indicates if the time patterns of gene sets significantly differ according to these conditions. The interest of the method is illustrated by its application to two real life datasets: an HIV therapeutic vaccine trial (DALIA-1 trial, and data from a recent study on influenza and pneumococcal vaccines. In the DALIA-1 trial TcGSA revealed a significant change in gene expression over time within 69 gene sets during vaccination, while a standard univariate individual gene analysis corrected for multiple testing as well as a standard a Gene Set Enrichment Analysis (GSEA for time series both failed to detect any significant pattern change over time. When applied to the second illustrative data set, TcGSA allowed the identification of 4 gene sets finally found to be linked with the influenza vaccine too although they were found to be associated to the pneumococcal vaccine only in previous analyses. In our simulation study TcGSA exhibits good statistical properties, and an increased power compared to other approaches for analyzing time-course expression patterns of gene sets. The method is made available for the community through an R package.

  4. GIFtS: annotation landscape analysis with GeneCards

    Directory of Open Access Journals (Sweden)

    Dalah Irina

    2009-10-01

    Full Text Available Abstract Background Gene annotation is a pivotal component in computational genomics, encompassing prediction of gene function, expression analysis, and sequence scrutiny. Hence, quantitative measures of the annotation landscape constitute a pertinent bioinformatics tool. GeneCards® is a gene-centric compendium of rich annotative information for over 50,000 human gene entries, building upon 68 data sources, including Gene Ontology (GO, pathways, interactions, phenotypes, publications and many more. Results We present the GeneCards Inferred Functionality Score (GIFtS which allows a quantitative assessment of a gene's annotation status, by exploiting the unique wealth and diversity of GeneCards information. The GIFtS tool, linked from the GeneCards home page, facilitates browsing the human genome by searching for the annotation level of a specified gene, retrieving a list of genes within a specified range of GIFtS value, obtaining random genes with a specific GIFtS value, and experimenting with the GIFtS weighting algorithm for a variety of annotation categories. The bimodal shape of the GIFtS distribution suggests a division of the human gene repertoire into two main groups: the high-GIFtS peak consists almost entirely of protein-coding genes; the low-GIFtS peak consists of genes from all of the categories. Cluster analysis of GIFtS annotation vectors provides the classification of gene groups by detailed positioning in the annotation arena. GIFtS also provide measures which enable the evaluation of the databases that serve as GeneCards sources. An inverse correlation is found (for GIFtS>25 between the number of genes annotated by each source, and the average GIFtS value of genes associated with that source. Three typical source prototypes are revealed by their GIFtS distribution: genome-wide sources, sources comprising mainly highly annotated genes, and sources comprising mainly poorly annotated genes. The degree of accumulated knowledge for a

  5. Integrated analysis of gene expression by association rules discovery

    Directory of Open Access Journals (Sweden)

    Carazo Jose M

    2006-02-01

    Full Text Available Abstract Background Microarray technology is generating huge amounts of data about the expression level of thousands of genes, or even whole genomes, across different experimental conditions. To extract biological knowledge, and to fully understand such datasets, it is essential to include external biological information about genes and gene products to the analysis of expression data. However, most of the current approaches to analyze microarray datasets are mainly focused on the analysis of experimental data, and external biological information is incorporated as a posterior process. Results In this study we present a method for the integrative analysis of microarray data based on the Association Rules Discovery data mining technique. The approach integrates gene annotations and expression data to discover intrinsic associations among both data sources based on co-occurrence patterns. We applied the proposed methodology to the analysis of gene expression datasets in which genes were annotated with metabolic pathways, transcriptional regulators and Gene Ontology categories. Automatically extracted associations revealed significant relationships among these gene attributes and expression patterns, where many of them are clearly supported by recently reported work. Conclusion The integration of external biological information and gene expression data can provide insights about the biological processes associated to gene expression programs. In this paper we show that the proposed methodology is able to integrate multiple gene annotations and expression data in the same analytic framework and extract meaningful associations among heterogeneous sources of data. An implementation of the method is included in the Engene software package.

  6. Phylogenetic analysis of cubilin (CUBN) gene.

    Science.gov (United States)

    Shaik, Abjal Pasha; Alsaeed, Abbas H; Kiranmayee, S; Bammidi, Vk; Sultana, Asma

    2013-01-01

    Cubilin, (CUBN; also known as intrinsic factor-cobalamin receptor [Homo sapiens Entrez Pubmed ref NM_001081.3; NG_008967.1; GI: 119606627]), located in the epithelium of intestine and kidney acts as a receptor for intrinsic factor - vitamin B12 complexes. Mutations in CUBN may play a role in autosomal recessive megaloblastic anemia. The current study investigated the possible role of CUBN in evolution using phylogenetic testing. A total of 588 BLAST hits were found for the cubilin query sequence and these hits showed putative conserved domain, CUB superfamily (as on 27(th) Nov 2012). A first-pass phylogenetic tree was constructed to identify the taxa which most often contained the CUBN sequences. Following this, we narrowed down the search by manually deleting sequences which were not CUBN. A repeat phylogenetic analysis of 25 taxa was performed using PhyML, RAxML and TreeDyn softwares to confirm that CUBN is a conserved protein emphasizing its importance as an extracellular domain and being present in proteins mostly known to be involved in development in many chordate taxa but not found in prokaryotes, plants and yeast.. No horizontal gene transfers have been found between different taxa.

  7. Gene expression profile analysis of human intervertebral disc degeneration

    OpenAIRE

    Kai Chen; Dajiang Wu; Xiaodong Zhu; Haijian Ni; Xianzhao Wei; Ningfang Mao; Yang Xie; Yunfei Niu; Ming Li

    2013-01-01

    In this study, we used microarray analysis to investigate the biogenesis and progression of intervertebral disc degeneration. The gene expression profiles of 37 disc tissue samples obtained from patients with herniated discs and degenerative disc disease collected by the National Cancer Institute Cooperative Tissue Network were analyzed. Differentially expressed genes between more and less degenerated discs were identified by significant analysis of microarray. A total of 555 genes were signi...

  8. Bioinformatics analysis of estrogen-responsive genes

    Science.gov (United States)

    Handel, Adam E.

    2016-01-01

    Estrogen is a steroid hormone that plays critical roles in a myriad of intracellular pathways. The expression of many genes is regulated through the steroid hormone receptors ESR1 and ESR2. These bind to DNA and modulate the expression of target genes. Identification of estrogen target genes is greatly facilitated by the use of transcriptomic methods, such as RNA-seq and expression microarrays, and chromatin immunoprecipitation with massively parallel sequencing (ChIP-seq). Combining transcriptomic and ChIP-seq data enables a distinction to be drawn between direct and indirect estrogen target genes. This chapter will discuss some methods of identifying estrogen target genes that do not require any expertise in programming languages or complex bioinformatics. PMID:26585125

  9. Analysis of multiplex gene expression maps obtained by voxelation

    Directory of Open Access Journals (Sweden)

    Smith Desmond J

    2009-04-01

    Full Text Available Abstract Background Gene expression signatures in the mammalian brain hold the key to understanding neural development and neurological disease. Researchers have previously used voxelation in combination with microarrays for acquisition of genome-wide atlases of expression patterns in the mouse brain. On the other hand, some work has been performed on studying gene functions, without taking into account the location information of a gene's expression in a mouse brain. In this paper, we present an approach for identifying the relation between gene expression maps obtained by voxelation and gene functions. Results To analyze the dataset, we chose typical genes as queries and aimed at discovering similar gene groups. Gene similarity was determined by using the wavelet features extracted from the left and right hemispheres averaged gene expression maps, and by the Euclidean distance between each pair of feature vectors. We also performed a multiple clustering approach on the gene expression maps, combined with hierarchical clustering. Among each group of similar genes and clusters, the gene function similarity was measured by calculating the average gene function distances in the gene ontology structure. By applying our methodology to find similar genes to certain target genes we were able to improve our understanding of gene expression patterns and gene functions. By applying the clustering analysis method, we obtained significant clusters, which have both very similar gene expression maps and very similar gene functions respectively to their corresponding gene ontologies. The cellular component ontology resulted in prominent clusters expressed in cortex and corpus callosum. The molecular function ontology gave prominent clusters in cortex, corpus callosum and hypothalamus. The biological process ontology resulted in clusters in cortex, hypothalamus and choroid plexus. Clusters from all three ontologies combined were most prominently expressed in

  10. Heat-inducible RNAi for gene functional analysis in plants.

    Science.gov (United States)

    Masclaux, Frédéric; Galaud, Jean-Philippe

    2011-01-01

    Controlling gene expression during plant development is an efficient method to explore gene function and RNA interference (RNAi) is now considered as a powerful technology for gene functional analysis. However, constitutive gene silencing cannot be used with genes involved in fundamental processes such as embryo viability or plant growth and alternative silencing strategies avoiding these limitations should be preferred. Tissue-specific and inducible promoters, able to control gene expression at spatial and/or temporal level, can be used to circumvent viability problems. In this chapter, after a rapid overview of the inducible promoters currently used for transgenic approaches in plants, we describe a method we have developed to study gene function by heat-inducible RNAi. This system is easy to use and complementary to those based on chemical gene inducer treatments and might be useful for both research and biotechnological applications.

  11. Database for exchangeable gene trap clones: pathway and gene ontology analysis of exchangeable gene trap clone mouse lines.

    Science.gov (United States)

    Araki, Masatake; Nakahara, Mai; Muta, Mayumi; Itou, Miharu; Yanai, Chika; Yamazoe, Fumika; Miyake, Mikiko; Morita, Ayaka; Araki, Miyuki; Okamoto, Yoshiyuki; Nakagata, Naomi; Yoshinobu, Kumiko; Yamamura, Ken-ichi; Araki, Kimi

    2014-02-01

    Gene trapping in embryonic stem (ES) cells is a proven method for large-scale random insertional mutagenesis in the mouse genome. We have established an exchangeable gene trap system, in which a reporter gene can be exchanged for any other DNA of interest through Cre/mutant lox-mediated recombination. We isolated trap clones, analyzed trapped genes, and constructed the database for Exchangeable Gene Trap Clones (EGTC) [http://egtc.jp]. The number of registered ES cell lines was 1162 on 31 August 2013. We also established 454 mouse lines from trap ES clones and deposited them in the mouse embryo bank at the Center for Animal Resources and Development, Kumamoto University, Japan. The EGTC database is the most extensive academic resource for gene-trap mouse lines. Because we used a promoter-trap strategy, all trapped genes were expressed in ES cells. To understand the general characteristics of the trapped genes in the EGTC library, we used Kyoto Encyclopedia of Genes and Genomes (KEGG) for pathway analysis and found that the EGTC ES clones covered a broad range of pathways. We also used Gene Ontology (GO) classification data provided by Mouse Genome Informatics (MGI) to compare the functional distribution of genes in each GO term between trapped genes in the EGTC mouse lines and total genes annotated in MGI. We found the functional distributions for the trapped genes in the EGTC mouse lines and for the RefSeq genes for the whole mouse genome were similar, indicating that the EGTC mouse lines had trapped a wide range of mouse genes. © 2014 The Authors Development, Growth & Differentiation © 2014 Japanese Society of Developmental Biologists.

  12. The limitations of simple gene set enrichment analysis assuming gene independence.

    Science.gov (United States)

    Tamayo, Pablo; Steinhardt, George; Liberzon, Arthur; Mesirov, Jill P

    2016-02-01

    Since its first publication in 2003, the Gene Set Enrichment Analysis method, based on the Kolmogorov-Smirnov statistic, has been heavily used, modified, and also questioned. Recently a simplified approach using a one-sample t-test score to assess enrichment and ignoring gene-gene correlations was proposed by Irizarry et al. 2009 as a serious contender. The argument criticizes Gene Set Enrichment Analysis's nonparametric nature and its use of an empirical null distribution as unnecessary and hard to compute. We refute these claims by careful consideration of the assumptions of the simplified method and its results, including a comparison with Gene Set Enrichment Analysis's on a large benchmark set of 50 datasets. Our results provide strong empirical evidence that gene-gene correlations cannot be ignored due to the significant variance inflation they produced on the enrichment scores and should be taken into account when estimating gene set enrichment significance. In addition, we discuss the challenges that the complex correlation structure and multi-modality of gene sets pose more generally for gene set enrichment methods.

  13. Genome-wide analysis of homeobox gene family in legumes: identification, gene duplication and expression profiling.

    Science.gov (United States)

    Bhattacharjee, Annapurna; Ghangal, Rajesh; Garg, Rohini; Jain, Mukesh

    2015-01-01

    Homeobox genes encode transcription factors that are known to play a major role in different aspects of plant growth and development. In the present study, we identified homeobox genes belonging to 14 different classes in five legume species, including chickpea, soybean, Medicago, Lotus and pigeonpea. The characteristic differences within homeodomain sequences among various classes of homeobox gene family were quite evident. Genome-wide expression analysis using publicly available datasets (RNA-seq and microarray) indicated that homeobox genes are differentially expressed in various tissues/developmental stages and under stress conditions in different legumes. We validated the differential expression of selected chickpea homeobox genes via quantitative reverse transcription polymerase chain reaction. Genome duplication analysis in soybean indicated that segmental duplication has significantly contributed in the expansion of homeobox gene family. The Ka/Ks ratio of duplicated homeobox genes in soybean showed that several members of this family have undergone purifying selection. Moreover, expression profiling indicated that duplicated genes might have been retained due to sub-functionalization. The genome-wide identification and comprehensive gene expression profiling of homeobox gene family members in legumes will provide opportunities for functional analysis to unravel their exact role in plant growth and development.

  14. A GeneTrek analysis of the maize genome.

    Science.gov (United States)

    Liu, Renyi; Vitte, Clémentine; Ma, Jianxin; Mahama, A Assibi; Dhliwayo, Thanda; Lee, Michael; Bennetzen, Jeffrey L

    2007-07-10

    Analysis of the sequences of 74 randomly selected BACs demonstrated that the maize nuclear genome contains approximately 37,000 candidate genes with homologues in other plant species. An additional approximately 5,500 predicted genes are severely truncated and probably pseudogenes. The distribution of genes is uneven, with approximately 30% of BACs containing no genes. BAC gene density varies from 0 to 7.9 per 100 kb, whereas most gene islands contain only one gene. The average number of genes per gene island is 1.7. Only 72% of these genes show collinearity with the rice genome. Particular LTR retrotransposon families (e.g., Gyma) are enriched on gene-free BACs, most of which do not come from pericentromeres or other large heterochromatic regions. Gene-containing BACs are relatively enriched in different families of LTR retrotransposons (e.g., Ji). Two major bursts of LTR retrotransposon activity in the last 2 million years are responsible for the large size of the maize genome, but only the more recent of these is well represented in gene-containing BACs, suggesting that LTR retrotransposons are more efficiently removed in these domains. The results demonstrate that sample sequencing and careful annotation of a few randomly selected BACs can provide a robust description of a complex plant genome.

  15. Rice Transcriptome Analysis to Identify Possible Herbicide Quinclorac Detoxification Genes

    Directory of Open Access Journals (Sweden)

    Wenying eXu

    2015-09-01

    Full Text Available Quinclorac is a highly selective auxin-type herbicide, and is widely used in the effective control of barnyard grass in paddy rice fields, improving the world’s rice yield. The herbicide mode of action of quinclorac has been proposed and hormone interactions affect quinclorac signaling. Because of widespread use, quinclorac may be transported outside rice fields with the drainage waters, leading to soil and water pollution and environmental health problems.In this study, we used 57K Affymetrix rice whole-genome array to identify quinclorac signaling response genes to study the molecular mechanisms of action and detoxification of quinclorac in rice plants. Overall, 637 probe sets were identified with differential expression levels under either 6 or 24 h of quinclorac treatment. Auxin-related genes such as GH3 and OsIAAs responded to quinclorac treatment. Gene Ontology analysis showed that genes of detoxification-related family genes were significantly enriched, including cytochrome P450, GST, UGT, and ABC and drug transporter genes. Moreover, real-time RT-PCR analysis showed that top candidate P450 families such as CYP81, CYP709C and CYP72A genes were universally induced by different herbicides. Some Arabidopsis genes for the same P450 family were up-regulated under quinclorac treatment.We conduct rice whole-genome GeneChip analysis and the first global identification of quinclorac response genes. This work may provide potential markers for detoxification of quinclorac and biomonitors of environmental chemical pollution.

  16. Pathogenic Network Analysis Predicts Candidate Genes for Cervical Cancer

    Directory of Open Access Journals (Sweden)

    Yun-Xia Zhang

    2016-01-01

    Full Text Available Purpose. The objective of our study was to predicate candidate genes in cervical cancer (CC using a network-based strategy and to understand the pathogenic process of CC. Methods. A pathogenic network of CC was extracted based on known pathogenic genes (seed genes and differentially expressed genes (DEGs between CC and normal controls. Subsequently, cluster analysis was performed to identify the subnetworks in the pathogenic network using ClusterONE. Each gene in the pathogenic network was assigned a weight value, and then candidate genes were obtained based on the weight distribution. Eventually, pathway enrichment analysis for candidate genes was performed. Results. In this work, a total of 330 DEGs were identified between CC and normal controls. From the pathogenic network, 2 intensely connected clusters were extracted, and a total of 52 candidate genes were detected under the weight values greater than 0.10. Among these candidate genes, VIM had the highest weight value. Moreover, candidate genes MMP1, CDC45, and CAT were, respectively, enriched in pathway in cancer, cell cycle, and methane metabolism. Conclusion. Candidate pathogenic genes including MMP1, CDC45, CAT, and VIM might be involved in the pathogenesis of CC. We believe that our results can provide theoretical guidelines for future clinical application.

  17. A sequence-based approach to identify reference genes for gene expression analysis

    Directory of Open Access Journals (Sweden)

    Chari Raj

    2010-08-01

    Full Text Available Abstract Background An important consideration when analyzing both microarray and quantitative PCR expression data is the selection of appropriate genes as endogenous controls or reference genes. This step is especially critical when identifying genes differentially expressed between datasets. Moreover, reference genes suitable in one context (e.g. lung cancer may not be suitable in another (e.g. breast cancer. Currently, the main approach to identify reference genes involves the mining of expression microarray data for highly expressed and relatively constant transcripts across a sample set. A caveat here is the requirement for transcript normalization prior to analysis, and measurements obtained are relative, not absolute. Alternatively, as sequencing-based technologies provide digital quantitative output, absolute quantification ensues, and reference gene identification becomes more accurate. Methods Serial analysis of gene expression (SAGE profiles of non-malignant and malignant lung samples were compared using a permutation test to identify the most stably expressed genes across all samples. Subsequently, the specificity of the reference genes was evaluated across multiple tissue types, their constancy of expression was assessed using quantitative RT-PCR (qPCR, and their impact on differential expression analysis of microarray data was evaluated. Results We show that (i conventional references genes such as ACTB and GAPDH are highly variable between cancerous and non-cancerous samples, (ii reference genes identified for lung cancer do not perform well for other cancer types (breast and brain, (iii reference genes identified through SAGE show low variability using qPCR in a different cohort of samples, and (iv normalization of a lung cancer gene expression microarray dataset with or without our reference genes, yields different results for differential gene expression and subsequent analyses. Specifically, key established pathways in lung

  18. An Array-based Approach to Modelling Production Management System Architectures

    DEFF Research Database (Denmark)

    Falster, Peter

    2000-01-01

    Several proposals to a conceptual framework for production management architecture are briefly reviewed. It is suggested that an array-based approach and a classic engineering-economic model, is used as tools for a conceptualisation of ideas. Traditional architectural design is usually based...... on a geometrical thinking. Accordingly, elements from measurement and array theory are introduced, but in a more abstract way than traditionally connected with 3D-geometry. The paper concludes that a few set of concepts, like products, resources, activities, events, stages, etc. can be synthesized and analogies...

  19. Green electron-beam pumped laser arrays based on II-VI nanostructures

    Energy Technology Data Exchange (ETDEWEB)

    Zverev, M.M.; Gamov, N.A.; Zdanova, E.V.; Studionov, V.B.; Peregoudov, D.V. [Moscow State Inst. of Radio Engineering, Electronics and Automations, Moscow (Russian Federation); Ivanov, S.V.; Sedova, I.V.; Gronin, S.V.; Sorokin, S.V.; Kop' ev, P.S. [Ioffe Physical Technical Inst., RAS Polytekhnicheskaya, St. Petersburg (Russian Federation); Olikhov, I.M. [Gamma Co. Ltd., SRC Platan, Fryazino (Russian Federation)

    2010-06-15

    Room temperature electron-beam pumped (U = 15-26 keV) green lasers and laser arrays based on multiple quantum well II-VI structures with an extended up to 2 {mu}m waveguide have been studied. The maximum achieved output pulse power is as high as 31 and 630 W per facet from a single 0.24-mm-wide laser element at the cavity length of 0.4 mm and a laser array consisting of 26 elements, respectively. (Abstract Copyright [2010], Wiley Periodicals, Inc.)

  20. A DSRPCL-SVM Approach to Informative Gene Analysis

    Institute of Scientific and Technical Information of China (English)

    Wei Xiong; Zhibin Cai; Jinwen Ma

    2008-01-01

    Microarray data based tumor diagnosis is a very interesting topic in bioinformatics. One of the key problems is the discovery and analysis of informative genes of a tumor. Although there are many elaborate approaches to this problem, it is still difficult to select a reasonable set of informative genes for tumor diagnosis only with microarray data. In this paper, we classify the genes expressed through microarray data into a number of clusters via the distance sensitive rival penalized competitive learning (DSRPCL) algorithm and then detect the informative gene cluster or set with the help of support vector machine (SVM). Moreover, the critical or powerful informative genes can be found through further classifications and detections on the obtained informative gene clusters. It is well demonstrated by experiments on the colon, leukemia, and breast cancer datasets that our proposed DSRPCL-SVM approach leads to a reasonable selection of informative genes for tumor diagnosis.

  1. Bioinformatics analysis of the gene expression profile in Bladder carcinoma

    Directory of Open Access Journals (Sweden)

    Jing Xiao

    2013-01-01

    Full Text Available Bladder carcinoma, which has the ninth highest incidence among malignant tumors in the world, is a complex, multifactorial disease. The malignant transformation of bladder cells results from DNA mutations and alterations in gene expression levels. In this work, we used a bioinformatics approach to investigate the molecular mechanisms of bladder carcinoma. Biochips downloaded from the Gene Expression Omnibus (GEO were used to analyze the gene expression profile in urinary bladder cells from individuals with carcinoma. The gene expression profile of normal genomes was used as a control. The analysis of gene expression revealed important alterations in genes involved in biological processes and metabolic pathways. We also identified some small molecules capable of reversing the altered gene expression in bladder carcinoma; these molecules could provide a basis for future therapies for the treatment of this disease.

  2. Serial Analysis of Gene Expression: Applications in Human Studies

    OpenAIRE

    2004-01-01

    Serial analysis of gene expression (SAGE) is a powerful tool, which provides quantitative and comprehensive expression profile of genes in a given cell population. It works by isolating short fragments of genetic information from the expressed genes that are present in the cell being studied. These short sequences, called SAGE tags, are linked together for efficient sequencing. The frequency of each SAGE tag in the cloned multimers directly reflects the transcript abundance. Therefore, SAGE r...

  3. Identification of candidate genes in osteoporosis by integrated microarray analysis

    OpenAIRE

    Li, J J; Wang, B. Q.; Fei, Q.; Yang, Y; Li, D.

    2017-01-01

    Objectives In order to screen the altered gene expression profile in peripheral blood mononuclear cells of patients with osteoporosis, we performed an integrated analysis of the online microarray studies of osteoporosis. Methods We searched the Gene Expression Omnibus (GEO) database for microarray studies of peripheral blood mononuclear cells in patients with osteoporosis. Subsequently, we integrated gene expression data sets from multiple microarray studies to obtain differentially expressed...

  4. Cluster Analysis of Gene Expression Data

    CERN Document Server

    Domany, E

    2002-01-01

    The expression levels of many thousands of genes can be measured simultaneously by DNA microarrays (chips). This novel experimental tool has revolutionized research in molecular biology and generated considerable excitement. A typical experiment uses a few tens of such chips, each dedicated to a single sample - such as tissue extracted from a particular tumor. The results of such an experiment contain several hundred thousand numbers, that come in the form of a table, of several thousand rows (one for each gene) and 50 - 100 columns (one for each sample). We developed a clustering methodology to mine such data. In this review I provide a very basic introduction to the subject, aimed at a physics audience with no prior knowledge of either gene expression or clustering methods. I explain what genes are, what is gene expression and how it is measured by DNA chips. Next I explain what is meant by "clustering" and how we analyze the massive amounts of data from such experiments, and present results obtained from a...

  5. When noisy neighbors are a blessing: analysis of gene expression noise identifies coregulated genes

    NARCIS (Netherlands)

    Junker, J.P.; van Oudenaarden, A.

    2012-01-01

    In this issue of Molecular Cell, Stewart-Ornstein et al. (2012) use systematic pair-wise correlation analysis of expression noise in a large number of yeast genes to identify clusters of functionally related genes and signaling pathways responsible for elevated noise.

  6. DAVID Knowledgebase: a gene-centered database integrating heterogeneous gene annotation resources to facilitate high-throughput gene functional analysis

    Directory of Open Access Journals (Sweden)

    Baseler Michael W

    2007-11-01

    Full Text Available Abstract Background Due to the complex and distributed nature of biological research, our current biological knowledge is spread over many redundant annotation databases maintained by many independent groups. Analysts usually need to visit many of these bioinformatics databases in order to integrate comprehensive annotation information for their genes, which becomes one of the bottlenecks, particularly for the analytic task associated with a large gene list. Thus, a highly centralized and ready-to-use gene-annotation knowledgebase is in demand for high throughput gene functional analysis. Description The DAVID Knowledgebase is built around the DAVID Gene Concept, a single-linkage method to agglomerate tens of millions of gene/protein identifiers from a variety of public genomic resources into DAVID gene clusters. The grouping of such identifiers improves the cross-reference capability, particularly across NCBI and UniProt systems, enabling more than 40 publicly available functional annotation sources to be comprehensively integrated and centralized by the DAVID gene clusters. The simple, pair-wise, text format files which make up the DAVID Knowledgebase are freely downloadable for various data analysis uses. In addition, a well organized web interface allows users to query different types of heterogeneous annotations in a high-throughput manner. Conclusion The DAVID Knowledgebase is designed to facilitate high throughput gene functional analysis. For a given gene list, it not only provides the quick accessibility to a wide range of heterogeneous annotation data in a centralized location, but also enriches the level of biological information for an individual gene. Moreover, the entire DAVID Knowledgebase is freely downloadable or searchable at http://david.abcc.ncifcrf.gov/knowledgebase/.

  7. Bioinformatics analysis and detection of gelatinase encoded gene in Lysinibacillussphaericus

    Science.gov (United States)

    Repin, Rul Aisyah Mat; Mutalib, Sahilah Abdul; Shahimi, Safiyyah; Khalid, Rozida Mohd.; Ayob, Mohd. Khan; Bakar, Mohd. Faizal Abu; Isa, Mohd Noor Mat

    2016-11-01

    In this study, we performed bioinformatics analysis toward genome sequence of Lysinibacillussphaericus (L. sphaericus) to determine gene encoded for gelatinase. L. sphaericus was isolated from soil and gelatinase species-specific bacterium to porcine and bovine gelatin. This bacterium offers the possibility of enzymes production which is specific to both species of meat, respectively. The main focus of this research is to identify the gelatinase encoded gene within the bacteria of L. Sphaericus using bioinformatics analysis of partially sequence genome. From the research study, three candidate gene were identified which was, gelatinase candidate gene 1 (P1), NODE_71_length_93919_cov_158.931839_21 which containing 1563 base pair (bp) in size with 520 amino acids sequence; Secondly, gelatinase candidate gene 2 (P2), NODE_23_length_52851_cov_190.061386_17 which containing 1776 bp in size with 591 amino acids sequence; and Thirdly, gelatinase candidate gene 3 (P3), NODE_106_length_32943_cov_169.147919_8 containing 1701 bp in size with 566 amino acids sequence. Three pairs of oligonucleotide primers were designed and namely as, F1, R1, F2, R2, F3 and R3 were targeted short sequences of cDNA by PCR. The amplicons were reliably results in 1563 bp in size for candidate gene P1 and 1701 bp in size for candidate gene P3. Therefore, the results of bioinformatics analysis of L. Sphaericus resulting in gene encoded gelatinase were identified.

  8. An improved method for functional similarity analysis of genes based on Gene Ontology.

    Science.gov (United States)

    Tian, Zhen; Wang, Chunyu; Guo, Maozu; Liu, Xiaoyan; Teng, Zhixia

    2016-12-23

    Measures of gene functional similarity are essential tools for gene clustering, gene function prediction, evaluation of protein-protein interaction, disease gene prioritization and other applications. In recent years, many gene functional similarity methods have been proposed based on the semantic similarity of GO terms. However, these leading approaches may make errorprone judgments especially when they measure the specificity of GO terms as well as the IC of a term set. Therefore, how to estimate the gene functional similarity reliably is still a challenging problem. We propose WIS, an effective method to measure the gene functional similarity. First of all, WIS computes the IC of a term by employing its depth, the number of its ancestors as well as the topology of its descendants in the GO graph. Secondly, WIS calculates the IC of a term set by means of considering the weighted inherited semantics of terms. Finally, WIS estimates the gene functional similarity based on the IC overlap ratio of term sets. WIS is superior to some other representative measures on the experiments of functional classification of genes in a biological pathway, collaborative evaluation of GO-based semantic similarity measures, protein-protein interaction prediction and correlation with gene expression. Further analysis suggests that WIS takes fully into account the specificity of terms and the weighted inherited semantics of terms between GO terms. The proposed WIS method is an effective and reliable way to compare gene function. The web service of WIS is freely available at http://nclab.hit.edu.cn/WIS/ .

  9. Genome-wide gene expression analysis of anguillid herpesvirus 1

    NARCIS (Netherlands)

    Beurden, van S.J.; Peeters, B.P.H.; Rottier, P.J.M.; Davison, A.A.; Engelsma, M.Y.

    2013-01-01

    Background Whereas temporal gene expression in mammalian herpesviruses has been studied extensively, little is known about gene expression in fish herpesviruses. Here we report a genome-wide transcription analysis of a fish herpesvirus, anguillid herpesvirus 1, in cell culture, studied during the

  10. Global Analysis of Horizontal Gene Transfer in Fusarium verticillioides

    Science.gov (United States)

    The co-occurrence of microbes within plants and other specialized niches may facilitate horizontal gene transfer (HGT) affecting host-pathogen interactions. We recently identified fungal-to-fungal HGTs involving metabolic gene clusters. For a global analysis of HGTs in the maize pathogen Fusarium ve...

  11. Frequency domain analysis of noise in autoregulated gene circuits

    Science.gov (United States)

    Simpson, Michael L.; Cox, Chris D.; Sayler, Gary S.

    2003-01-01

    We describe a frequency domain technique for the analysis of intrinsic noise within negatively autoregulated gene circuits. This approach is based on the transfer function around the feedback loop (loop transmission) and the equivalent noise bandwidth of the system. The loop transmission, T, is shown to be a determining factor of the dynamics and the noise behavior of autoregulated gene circuits, and this T-based technique provides a simple and flexible method for the analysis of noise arising from any source within the gene circuit. We show that negative feedback not only reduces the variance of the noise in the protein concentration, but also shifts this noise to higher frequencies where it may have a negligible effect on the noise behavior of following gene circuits within a cascade. This predicted effect is demonstrated through the exact stochastic simulation of a two-gene cascade. The analysis elucidates important aspects of gene circuit structure that control functionality, and may provide some insights into selective pressures leading to this structure. The resulting analytical relationships have a simple form, making them especially useful as synthetic gene circuit design equations. With the exception of the linearization of Hill kinetics, this technique is general and may be applied to the analysis or design of networks of higher complexity. This utility is demonstrated through the exact stochastic simulation of an autoregulated two-gene cascade operating near instability. PMID:12671069

  12. A general modular framework for gene set enrichment analysis

    Directory of Open Access Journals (Sweden)

    Strimmer Korbinian

    2009-02-01

    Full Text Available Abstract Background Analysis of microarray and other high-throughput data on the basis of gene sets, rather than individual genes, is becoming more important in genomic studies. Correspondingly, a large number of statistical approaches for detecting gene set enrichment have been proposed, but both the interrelations and the relative performance of the various methods are still very much unclear. Results We conduct an extensive survey of statistical approaches for gene set analysis and identify a common modular structure underlying most published methods. Based on this finding we propose a general framework for detecting gene set enrichment. This framework provides a meta-theory of gene set analysis that not only helps to gain a better understanding of the relative merits of each embedded approach but also facilitates a principled comparison and offers insights into the relative interplay of the methods. Conclusion We use this framework to conduct a computer simulation comparing 261 different variants of gene set enrichment procedures and to analyze two experimental data sets. Based on the results we offer recommendations for best practices regarding the choice of effective procedures for gene set enrichment analysis.

  13. Knowledge-guided gene ranking by coordinative component analysis.

    Science.gov (United States)

    Wang, Chen; Xuan, Jianhua; Li, Huai; Wang, Yue; Zhan, Ming; Hoffman, Eric P; Clarke, Robert

    2010-03-30

    In cancer, gene networks and pathways often exhibit dynamic behavior, particularly during the process of carcinogenesis. Thus, it is important to prioritize those genes that are strongly associated with the functionality of a network. Traditional statistical methods are often inept to identify biologically relevant member genes, motivating researchers to incorporate biological knowledge into gene ranking methods. However, current integration strategies are often heuristic and fail to incorporate fully the true interplay between biological knowledge and gene expression data. To improve knowledge-guided gene ranking, we propose a novel method called coordinative component analysis (COCA) in this paper. COCA explicitly captures those genes within a specific biological context that are likely to be expressed in a coordinative manner. Formulated as an optimization problem to maximize the coordinative effort, COCA is designed to first extract the coordinative components based on a partial guidance from knowledge genes and then rank the genes according to their participation strengths. An embedded bootstrapping procedure is implemented to improve statistical robustness of the solutions. COCA was initially tested on simulation data and then on published gene expression microarray data to demonstrate its improved performance as compared to traditional statistical methods. Finally, the COCA approach has been applied to stem cell data to identify biologically relevant genes in signaling pathways. As a result, the COCA approach uncovers novel pathway members that may shed light into the pathway deregulation in cancers. We have developed a new integrative strategy to combine biological knowledge and microarray data for gene ranking. The method utilizes knowledge genes for a guidance to first extract coordinative components, and then rank the genes according to their contribution related to a network or pathway. The experimental results show that such a knowledge-guided strategy

  14. Simultaneous determination of pH, urea, acetylcholine and heavy metals using array-based enzymatic optical biosensor.

    Science.gov (United States)

    Tsai, Hsiao-chung; Doong, Ruey-an

    2005-03-15

    An array-based optical biosensor for the simultaneous analysis of multiple samples in the presence of unrelated multi-analytes was fabricated. Urease and acetylcholinesterase (AChE) were used as model enzymes and were co-entrapped with the sensing probe, FITC-dextran, in the sol-gel matrix to measure pH, urea, acetylcholine (ACh) and heavy metals (enzyme inhibitors). Environmental and biological samples spiked with metal ions were also used to evaluate the application of the array biosensor to real samples. The biosensor exhibited high specificity in identifying multiple analytes. No obvious cross-interference was observed when a 50-spot array biosensor was used for simultaneous analysis of multiple samples in the presence of multiple analytes. The sensing system can determine pH over a dynamic range from 4 to 8.5. The limits of detection (LODs) of 2.5-50 microM with a dynamic range of 2-3 orders of magnitude for urea and ACh measurements were obtained. Moreover, the urease-encapsulated array biosensor was used to detect heavy metals. The analytical ranges of Cd(II), Cu(II), and Hg(II) were between 10 nM and 100 mM. When real samples were spiked with heavy metals, the array biosensor also exhibited potential effectiveness in screening enzyme inhibitors.

  15. Analysis of reelin as a candidate gene for autism.

    Science.gov (United States)

    Bonora, E; Beyer, K S; Lamb, J A; Parr, J R; Klauck, S M; Benner, A; Paolucci, M; Abbott, A; Ragoussis, I; Poustka, A; Bailey, A J; Monaco, A P

    2003-10-01

    Genetic studies indicate that chromosome 7q is likely to contain an autism susceptibility locus (AUTS1). We have followed a positional candidate gene approach to identify relevant gene(s) and report here the analysis of reelin (RELN), a gene located under our peak of linkage. Screening RELN for DNA changes identified novel missense variants absent in a large control group; however, the low frequency of these mutations does not explain the relatively strong linkage results on 7q. Furthermore, analysis of a previously reported triplet repeat polymorphism and intragenic single nucleotide polymorphisms, using the transmission disequilibrium test, provided no evidence for association with autism in IMGSAC and German singleton families. The analysis of RELN suggests that it probably does not play a major role in autism aetiology, although further analysis of several missense mutations is warranted in additional affected individuals.

  16. Identification of housekeeping genes suitable for gene expression analysis in Jian carp (Cyprinus carpio var. jian).

    Science.gov (United States)

    Tang, Yong-kai; Yu, Ju-hua; Xu, Pao; Li, Jian-lin; Li, Hong-xia; Ren, Hong-tao

    2012-10-01

    Jian carp (Cyprinus carpio var. jian) is an important economic fish species cultured in China. In this report, we performed a systematic analysis to identify an appropriate housekeeping (HK) gene for the study of gene expression in Jian carp. For this purpose, partial DNA sequences of four potential candidate genes (elongation factor 1 alpha (EF-1α), glyceraldehyde-3-phosphate (GAPDH), beta-actin (ACTB), and 18S ribosomal RNA (18S rRNA) were isolated, and their expression levels were studied using RNA extracted from nine tissues (forebrain, hypothalamus, liver, fore-intestine, hind-intestine, ovary, muscle, heart, kidney) in juvenile and adult Jian carp. Gene expression levels were quantified by quantitative real time RT-PCR (qRT-PCR), and expression stability was evaluated by comparing the coefficients of variation (CV) of the Ct values. The results showed that EF-1α was the most suitable HK gene in all tissues of juvenile and adult Jian carp. However, at distinct juvenile and adult developmental stages, there was not a single optimal gene for normalization of expression levels in all tissues. EF-1α was the most stable gene only in forebrain, hypothalamus, liver, heart, and kidney. These results provide data that can be expected to aid gene expression analysis in Jian carp research, but underline the importance of identifying the optimal HK gene for each new experimental paradigm.

  17. Investigation of gene expression profiles in coronary heart disease and functional analysis of target gene

    Institute of Scientific and Technical Information of China (English)

    YIN HuiJun; MA Xiaoduan; JIANG YueRong; SHI DaZhuo; CHEN KeJi

    2009-01-01

    The research outlined here includes constitution of the differential gene expression profile by means of oligonucleotide gene microarray and functional analysis of the target gene for coronary heart disease (CHD). In a microarray screening experiment, the predominance of inflammation-and immune-related genes is presented in the expression profile of 107 differential genes based on the analysis of gene ontology and gene pathway. IL-8, an inflammatory factor, is identified as one of the genes that were markedly up-regulated in CHD. The plasma level of IL-8 is significantly raised in patients with CHD (n = 30) compared with healthy controls (n = 40), which underscores the clinical relevance of the in vitro finding. The further functional analysis shows that IL-8 affects platelet aggregation percentage, ex-pression of CD62p and platelet aggregation morphology in 12 healthy volunteers to some extent. These findings suggest the relevance of inflammation and immune responses to CHD at the DNA level. Moreover, IL-8 may be involved in the pathogenesis of CHD through the pathway of platelet activation.

  18. Gene set analysis of the EADGENE chicken data-set

    DEFF Research Database (Denmark)

    Skarman, Axel; Jiang, Li; Hornshøj, Henrik

    2009-01-01

     Abstract Background: Gene set analysis is considered to be a way of improving our biological interpretation of the observed expression patterns. This paper describes different methods applied to analyse expression data from a chicken DNA microarray dataset. Results: Applying different gene set...... analyses to the chicken expression data led to different ranking of the Gene Ontology terms tested. A method for prediction of possible annotations was applied. Conclusion: Biological interpretation based on gene set analyses dependent on the statistical method used. Methods for predicting the possible...

  19. Gene Expression Divergence and Evolutionary Analysis of the Drosomycin Gene Family in Drosophila melanogaster

    Directory of Open Access Journals (Sweden)

    Xiao-Juan Deng

    2009-01-01

    Full Text Available Drosomycin (Drs encoding an inducible 44-residue antifungal peptide is clustered with six additional genes, Dro1, Dro2, Dro3, Dro4, Dro5, and Dro6, forming a multigene family on the 3L chromosome arm in Drosophila melanogaster. To get further insight into the regulation of each member of the drosomycin gene family, here we investigated gene expression patterns of this family by either microbe-free injury or microbial challenges using real time RT-PCR. The results indicated that among the seven drosomycin genes, Drs, Dro2, Dro3, Dro4, and Dro5 showed constitutive expressions. Three out of five, Dro2, Dro3, and Dro5, were able to be upregulated by simple injury. Interestingly, Drs is an only gene strongly upregulated when Drosophila was infected with microbes. In contrast to these five genes, Dro1 and Dro6 were not transcribed at all in either noninfected or infected flies. Furthermore, by 5′ rapid amplification of cDNA ends, two transcription start sites were identified in Drs and Dro2, and one in Dro3, Dro4, and Dro5. In addition, NF-κB binding sites were found in promoter regions of Drs, Dro2, Dro3, and Dro5, indicating the importance of NF-κB binding sites for the inducibility of drosomycin genes. Based on the analyses of flanking sequences of each gene in D. melanogaster and phylogenetic relationship of drosomycins in D. melanogaster species-group, we concluded that gene duplications were involved in the formation of the drosomycin gene family. The possible evolutionary fates of drosomycin genes were discussed according to the combining analysis of gene expression pattern, gene structure, and functional divergence of these genes.

  20. Dynamic association rules for gene expression data analysis.

    Science.gov (United States)

    Chen, Shu-Chuan; Tsai, Tsung-Hsien; Chung, Cheng-Han; Li, Wen-Hsiung

    2015-10-14

    The purpose of gene expression analysis is to look for the association between regulation of gene expression levels and phenotypic variations. This association based on gene expression profile has been used to determine whether the induction/repression of genes correspond to phenotypic variations including cell regulations, clinical diagnoses and drug development. Statistical analyses on microarray data have been developed to resolve gene selection issue. However, these methods do not inform us of causality between genes and phenotypes. In this paper, we propose the dynamic association rule algorithm (DAR algorithm) which helps ones to efficiently select a subset of significant genes for subsequent analysis. The DAR algorithm is based on association rules from market basket analysis in marketing. We first propose a statistical way, based on constructing a one-sided confidence interval and hypothesis testing, to determine if an association rule is meaningful. Based on the proposed statistical method, we then developed the DAR algorithm for gene expression data analysis. The method was applied to analyze four microarray datasets and one Next Generation Sequencing (NGS) dataset: the Mice Apo A1 dataset, the whole genome expression dataset of mouse embryonic stem cells, expression profiling of the bone marrow of Leukemia patients, Microarray Quality Control (MAQC) data set and the RNA-seq dataset of a mouse genomic imprinting study. A comparison of the proposed method with the t-test on the expression profiling of the bone marrow of Leukemia patients was conducted. We developed a statistical way, based on the concept of confidence interval, to determine the minimum support and minimum confidence for mining association relationships among items. With the minimum support and minimum confidence, one can find significant rules in one single step. The DAR algorithm was then developed for gene expression data analysis. Four gene expression datasets showed that the proposed

  1. Frequency domain analysis of noise in autoregulated gene circuits

    OpenAIRE

    Simpson, Michael L.; Cox, Chris D.; Sayler, Gary S.

    2003-01-01

    We describe a frequency domain technique for the analysis of intrinsic noise within negatively autoregulated gene circuits. This approach is based on the transfer function around the feedback loop (loop transmission) and the equivalent noise bandwidth of the system. The loop transmission, T, is shown to be a determining factor of the dynamics and the noise behavior of autoregulated gene circuits, and this T-based technique provides a simple and flexible method for the analysis of noise arisin...

  2. Gastric Cancer Associated Genes Identified by an Integrative Analysis of Gene Expression Data

    Science.gov (United States)

    Jiang, Bing; Li, Shuwen; Jiang, Zhi

    2017-01-01

    Gastric cancer is one of the most severe complex diseases with high morbidity and mortality in the world. The molecular mechanisms and risk factors for this disease are still not clear since the cancer heterogeneity caused by different genetic and environmental factors. With more and more expression data accumulated nowadays, we can perform integrative analysis for these data to understand the complexity of gastric cancer and to identify consensus players for the heterogeneous cancer. In the present work, we screened the published gene expression data and analyzed them with integrative tool, combined with pathway and gene ontology enrichment investigation. We identified several consensus differentially expressed genes and these genes were further confirmed with literature mining; at last, two genes, that is, immunoglobulin J chain and C-X-C motif chemokine ligand 17, were screened as novel gastric cancer associated genes. Experimental validation is proposed to further confirm this finding. PMID:28232943

  3. Identifying suitable reference genes for gene expression analysis in developing skeletal muscle in pigs

    Directory of Open Access Journals (Sweden)

    Guanglin Niu

    2016-12-01

    Full Text Available The selection of suitable reference genes is crucial to accurately evaluate and normalize the relative expression level of target genes for gene function analysis. However, commonly used reference genes have variable expression levels in developing skeletal muscle. There are few reports that systematically evaluate the expression stability of reference genes across prenatal and postnatal developing skeletal muscle in mammals. Here, we used quantitative PCR to examine the expression levels of 15 candidate reference genes (ACTB, GAPDH, RNF7, RHOA, RPS18, RPL32, PPIA, H3F3, API5, B2M, AP1S1, DRAP1, TBP, WSB, and VAPB in porcine skeletal muscle at 26 different developmental stages (15 prenatal and 11 postnatal periods. We evaluated gene expression stability using the computer algorithms geNorm, NormFinder, and BestKeeper. Our results indicated that GAPDH and ACTB had the greatest variability among the candidate genes across prenatal and postnatal stages of skeletal muscle development. RPS18, API5, and VAPB had stable expression levels in prenatal stages, whereas API5, RPS18, RPL32, and H3F3 had stable expression levels in postnatal stages. API5 and H3F3 expression levels had the greatest stability in all tested prenatal and postnatal stages, and were the most appropriate reference genes for gene expression normalization in developing skeletal muscle. Our data provide valuable information for gene expression analysis during different stages of skeletal muscle development in mammals. This information can provide a valuable guide for the analysis of human diseases.

  4. Identifying suitable reference genes for gene expression analysis in developing skeletal muscle in pigs.

    Science.gov (United States)

    Niu, Guanglin; Yang, Yalan; Zhang, YuanYuan; Hua, Chaoju; Wang, Zishuai; Tang, Zhonglin; Li, Kui

    2016-01-01

    The selection of suitable reference genes is crucial to accurately evaluate and normalize the relative expression level of target genes for gene function analysis. However, commonly used reference genes have variable expression levels in developing skeletal muscle. There are few reports that systematically evaluate the expression stability of reference genes across prenatal and postnatal developing skeletal muscle in mammals. Here, we used quantitative PCR to examine the expression levels of 15 candidate reference genes (ACTB, GAPDH, RNF7, RHOA, RPS18, RPL32, PPIA, H3F3, API5, B2M, AP1S1, DRAP1, TBP, WSB, and VAPB) in porcine skeletal muscle at 26 different developmental stages (15 prenatal and 11 postnatal periods). We evaluated gene expression stability using the computer algorithms geNorm, NormFinder, and BestKeeper. Our results indicated that GAPDH and ACTB had the greatest variability among the candidate genes across prenatal and postnatal stages of skeletal muscle development. RPS18, API5, and VAPB had stable expression levels in prenatal stages, whereas API5, RPS18, RPL32, and H3F3 had stable expression levels in postnatal stages. API5 and H3F3 expression levels had the greatest stability in all tested prenatal and postnatal stages, and were the most appropriate reference genes for gene expression normalization in developing skeletal muscle. Our data provide valuable information for gene expression analysis during different stages of skeletal muscle development in mammals. This information can provide a valuable guide for the analysis of human diseases.

  5. Gene Bionetwork Analysis of Ovarian Primordial Follicle Development

    Science.gov (United States)

    Nilsson, Eric E.; Savenkova, Marina I.; Schindler, Ryan; Zhang, Bin; Schadt, Eric E.; Skinner, Michael K.

    2010-01-01

    Ovarian primordial follicles are critical for female reproduction and comprise a finite pool of gametes arrested in development. A systems biology approach was used to identify regulatory gene networks essential for primordial follicle development. Transcriptional responses to eight different growth factors known to influence primordial follicles were used to construct a bionetwork of regulatory genes involved in rat primordial follicle development. Over 1,500 genes were found to be regulated by the various growth factors and a network analysis identified critical gene modules involved in a number of signaling pathways and cellular processes. A set of 55 genes was identified as potential critical regulators of these gene modules, and a sub-network associated with development was determined. Within the network two previously identified regulatory genes were confirmed (i.e., Pdgfa and Fgfr2) and a new factor was identified, connective tissue growth factor (CTGF). CTGF was tested in ovarian organ cultures and found to stimulate primordial follicle development. Therefore, the relevant gene network associated with primordial follicle development was validated and the critical genes and pathways involved in this process were identified. This is one of the first applications of network analysis to a normal developmental process. These observations provide insights into potential therapeutic targets for preventing ovarian disease and promoting female reproduction. PMID:20661288

  6. Genome-Wide Detection and Analysis of Multifunctional Genes

    Science.gov (United States)

    Pritykin, Yuri; Ghersi, Dario; Singh, Mona

    2015-01-01

    Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655

  7. Gene bionetwork analysis of ovarian primordial follicle development.

    Directory of Open Access Journals (Sweden)

    Eric E Nilsson

    Full Text Available Ovarian primordial follicles are critical for female reproduction and comprise a finite pool of gametes arrested in development. A systems biology approach was used to identify regulatory gene networks essential for primordial follicle development. Transcriptional responses to eight different growth factors known to influence primordial follicles were used to construct a bionetwork of regulatory genes involved in rat primordial follicle development. Over 1,500 genes were found to be regulated by the various growth factors and a network analysis identified critical gene modules involved in a number of signaling pathways and cellular processes. A set of 55 genes was identified as potential critical regulators of these gene modules, and a sub-network associated with development was determined. Within the network two previously identified regulatory genes were confirmed (i.e., Pdgfa and Fgfr2 and a new factor was identified, connective tissue growth factor (CTGF. CTGF was tested in ovarian organ cultures and found to stimulate primordial follicle development. Therefore, the relevant gene network associated with primordial follicle development was validated and the critical genes and pathways involved in this process were identified. This is one of the first applications of network analysis to a normal developmental process. These observations provide insights into potential therapeutic targets for preventing ovarian disease and promoting female reproduction.

  8. Prediction and analysis of retinoblastoma related genes through gene ontology and KEGG.

    Science.gov (United States)

    Li, Zhen; Li, Bi-Qing; Jiang, Min; Chen, Lei; Zhang, Jian; Liu, Lin; Huang, Tao

    2013-01-01

    One of the most important and challenging problems in biomedicine is how to predict the cancer related genes. Retinoblastoma (RB) is the most common primary intraocular malignancy usually occurring in childhood. Early detection of RB could reduce the morbidity and promote the probability of disease-free survival. Therefore, it is of great importance to identify RB genes. In this study, we developed a computational method to predict RB related genes based on Dagging, with the maximum relevance minimum redundancy (mRMR) method followed by incremental feature selection (IFS). 119 RB genes were compiled from two previous RB related studies, while 5,500 non-RB genes were randomly selected from Ensemble genes. Ten datasets were constructed based on all these RB and non-RB genes. Each gene was encoded with a 13,126-dimensional vector including 12,887 Gene Ontology enrichment scores and 239 KEGG enrichment scores. Finally, an optimal feature set including 1061 GO terms and 8 KEGG pathways was obtained. Analysis showed that these features were closely related to RB. It is anticipated that the method can be applied to predict the other cancer related genes as well.

  9. Prediction and Analysis of Retinoblastoma Related Genes through Gene Ontology and KEGG

    Directory of Open Access Journals (Sweden)

    Zhen Li

    2013-01-01

    Full Text Available One of the most important and challenging problems in biomedicine is how to predict the cancer related genes. Retinoblastoma (RB is the most common primary intraocular malignancy usually occurring in childhood. Early detection of RB could reduce the morbidity and promote the probability of disease-free survival. Therefore, it is of great importance to identify RB genes. In this study, we developed a computational method to predict RB related genes based on Dagging, with the maximum relevance minimum redundancy (mRMR method followed by incremental feature selection (IFS. 119 RB genes were compiled from two previous RB related studies, while 5,500 non-RB genes were randomly selected from Ensemble genes. Ten datasets were constructed based on all these RB and non-RB genes. Each gene was encoded with a 13,126-dimensional vector including 12,887 Gene Ontology enrichment scores and 239 KEGG enrichment scores. Finally, an optimal feature set including 1061 GO terms and 8 KEGG pathways was obtained. Analysis showed that these features were closely related to RB. It is anticipated that the method can be applied to predict the other cancer related genes as well.

  10. A microfluidic DNA computing processor for gene expression analysis and gene drug synthesis.

    Science.gov (United States)

    Zhang, Yu; Yu, Hao; Qin, Jianhua; Lin, Bingcheng

    2009-11-06

    Boolean logic performs a logical operation on one or more logic input and produces a single logic output. Here, we describe a microfluidic DNA computing processor performing Boolean logic operations for gene expression analysis and gene drug synthesis. Multiple cancer-related genes were used as input molecules. Their expression levels were identified by interacting with the computing related DNA strands, which were designed according to the sequences of cancer-related genes and the suicide gene. When all the expressions of the cancer-related genes fit in with the diagnostic criteria, positive diagnosis would be confirmed and then a complete suicide gene (gene drug) could be synthesized as an output molecule. Microfluidic chip was employed as an effective platform to realize the computing process by integrating multistep biochemical reactions involving hybridization, displacement, denaturalization, and ligation. By combining the specific design of the computing related molecules and the integrated functions of the microfluidics, the microfluidic DNA computing processor is able to analyze the multiple gene expressions simultaneously and realize the corresponding gene drug synthesis with simplicity and fast speed, which demonstrates the potential of this platform for DNA computing in biomedical applications.

  11. Gene-network analysis identifies susceptibility genes related to glycobiology in autism.

    Directory of Open Access Journals (Sweden)

    Bert van der Zwaag

    Full Text Available The recent identification of copy-number variation in the human genome has opened up new avenues for the discovery of positional candidate genes underlying complex genetic disorders, especially in the field of psychiatric disease. One major challenge that remains is pinpointing the susceptibility genes in the multitude of disease-associated loci. This challenge may be tackled by reconstruction of functional gene-networks from the genes residing in these loci. We applied this approach to autism spectrum disorder (ASD, and identified the copy-number changes in the DNA of 105 ASD patients and 267 healthy individuals with Illumina Humanhap300 Beadchips. Subsequently, we used a human reconstructed gene-network, Prioritizer, to rank candidate genes in the segmental gains and losses in our autism cohort. This analysis highlighted several candidate genes already known to be mutated in cognitive and neuropsychiatric disorders, including RAI1, BRD1, and LARGE. In addition, the LARGE gene was part of a sub-network of seven genes functioning in glycobiology, present in seven copy-number changes specifically identified in autism patients with limited co-morbidity. Three of these seven copy-number changes were de novo in the patients. In autism patients with a complex phenotype and healthy controls no such sub-network was identified. An independent systematic analysis of 13 published autism susceptibility loci supports the involvement of genes related to glycobiology as we also identified the same or similar genes from those loci. Our findings suggest that the occurrence of genomic gains and losses of genes associated with glycobiology are important contributors to the development of ASD.

  12. Validation of suitable reference genes for quantitative gene expression analysis in Panax ginseng

    Directory of Open Access Journals (Sweden)

    Meizhen eWang

    2016-01-01

    Full Text Available Reverse transcription-qPCR (RT-qPCR has become a popular method for gene expression studies. Its results require data normalization by housekeeping genes. No single gene is proved to be stably expressed under all experimental conditions. Therefore, systematic evaluation of reference genes is necessary. With the aim to identify optimum reference genes for RT-qPCR analysis of gene expression in different tissues of Panax ginseng and the seedlings grown under heat stress, we investigated the expression stability of eight candidate reference genes, including elongation factor 1-beta (EF1-β, elongation factor 1-gamma (EF1-γ, eukaryotic translation initiation factor 3G (IF3G, eukaryotic translation initiation factor 3B (IF3B, actin (ACT, actin11 (ACT11, glyceraldehyde-3-phosphate dehydrogenase (GAPDH and cyclophilin ABH-like protein (CYC, using four widely used computational programs: geNorm, Normfinder, BestKeeper, and the comparative ΔCt method. The results were then integrated using the web-based tool RefFinder. As a result, EF1-γ, IF3G and EF1-β were the three most stable genes in different tissues of P. ginseng, while IF3G, ACT11 and GAPDH were the top three-ranked genes in seedlings treated with heat. Using three better reference genes alone or in combination as internal control, we examined the expression profiles of MAR, a multiple function-associated mRNA-like non-coding RNA (mlncRNA in P. ginseng. Taken together, we recommended EF1-γ/IF3G and IF3G/ACT11 as the suitable pair of reference genes for RT-qPCR analysis of gene expression in different tissues of P. ginseng and the seedlings grown under heat stress, respectively. The results serve as a foundation for future studies on P. ginseng functional genomics.

  13. Candidate gene copy number analysis by PCR and multicapillary electrophoresis.

    Science.gov (United States)

    Szantai, Eszter; Elek, Zsuzsanna; Guttman, András; Sasvari-Szekely, Maria

    2009-04-01

    Genetic polymorphisms are often considered as risk factors of complex diseases serving as valuable and easily detectable biomarkers, also stable during the whole lifespan. A novel type of genetic polymorphism has been identified just recently, referred to as gene copy number variation (CNV) or copy number polymorphism. CNV of glycogen synthase kinase 3 beta and its adjacent gene, Nr1i2 (pregnane X receptor isoform), has been reported to associate with bipolar depression. In our study we introduced multicapillary electrophoresis for gene copy number analysis as an affordable alternative to real-time PCR quantification with TaqMan gene probes. Our results show the reliability of the developed method based on conventional PCR followed by separation of products by multicapillary electrophoresis with quantitative evaluation. This method can be readily implemented for the analysis of candidate gene CNVs in high throughput clinical laboratories and also in personalized medicine care of depression-related risk factors.

  14. Global gene expression analysis for evaluation and design of biomaterials

    Directory of Open Access Journals (Sweden)

    Nobutaka Hanagata, Taro Takemura and Takashi Minowa

    2010-01-01

    Full Text Available Comprehensive gene expression analysis using DNA microarrays has become a widespread technique in molecular biological research. In the biomaterials field, it is used to evaluate the biocompatibility or cellular toxicity of metals, polymers and ceramics. Studies in this field have extracted differentially expressed genes in the context of differences in cellular responses among multiple materials. Based on these genes, the effects of materials on cells at the molecular level have been examined. Expression data ranging from several to tens of thousands of genes can be obtained from DNA microarrays. For this reason, several tens or hundreds of differentially expressed genes are often present in different materials. In this review, we outline the principles of DNA microarrays, and provide an introduction to methods of extracting information which is useful for evaluating and designing biomaterials from comprehensive gene expression data.

  15. FARO server: Meta-analysis of gene expression by matching gene expression signatures to a compendium of public gene expression data

    DEFF Research Database (Denmark)

    Manijak, Mieszko P.; Nielsen, Henrik Bjørn

    2011-01-01

    BACKGROUND: Although, systematic analysis of gene annotation is a powerful tool for interpreting gene expression data, it sometimes is blurred by incomplete gene annotation, missing expression response of key genes and secondary gene expression responses. These shortcomings may be partially...... circumvented by instead matching gene expression signatures to signatures of other experiments. FINDINGS: To facilitate this we present the Functional Association Response by Overlap (FARO) server, that match input signatures to a compendium of 242 gene expression signatures, extracted from more than 1700...

  16. In Silico Analysis of FMR1 Gene Missense SNPs.

    Science.gov (United States)

    Tekcan, Akin

    2016-06-01

    The FMR1 gene, a member of the fragile X-related gene family, is responsible for fragile X syndrome (FXS). Missense single-nucleotide polymorphisms (SNPs) are responsible for many complex diseases. The effect of FMR1 gene missense SNPs is unknown. The aim of this study, using in silico techniques, was to analyze all known missense mutations that can affect the functionality of the FMR1 gene, leading to mental retardation (MR) and FXS. Data on the human FMR1 gene were collected from the Ensembl database (release 81), National Centre for Biological Information dbSNP Short Genetic Variations database, 1000 Genomes Browser, and NHLBI Exome Sequencing Project Exome Variant Server. In silico analysis was then performed. One hundred-twenty different missense SNPs of the FMR1 gene were determined. Of these, 11.66 % of the FMR1 gene missense SNPs were in highly conserved domains, and 83.33 % were in domains with high variety. The results of the in silico prediction analysis showed that 31.66 % of the FMR1 gene SNPs were disease related and that 50 % of SNPs had a pathogenic effect. The results of the structural and functional analysis revealed that although the R138Q mutation did not seem to have a damaging effect on the protein, the G266E and I304N SNPs appeared to disturb the interaction between the domains and affect the function of the protein. This is the first study to analyze all missense SNPs of the FMR1 gene. The results indicate the applicability of a bioinformatics approach to FXS and other FMR1-related diseases. I think that the analysis of FMR1 gene missense SNPs using bioinformatics methods would help diagnosis of FXS and other FMR1-related diseases.

  17. Microarray gene expression profiling and analysis in renal cell carcinoma

    Directory of Open Access Journals (Sweden)

    Sadhukhan Provash

    2004-06-01

    Full Text Available Abstract Background Renal cell carcinoma (RCC is the most common cancer in adult kidney. The accuracy of current diagnosis and prognosis of the disease and the effectiveness of the treatment for the disease are limited by the poor understanding of the disease at the molecular level. To better understand the genetics and biology of RCC, we profiled the expression of 7,129 genes in both clear cell RCC tissue and cell lines using oligonucleotide arrays. Methods Total RNAs isolated from renal cell tumors, adjacent normal tissue and metastatic RCC cell lines were hybridized to affymatrix HuFL oligonucleotide arrays. Genes were categorized into different functional groups based on the description of the Gene Ontology Consortium and analyzed based on the gene expression levels. Gene expression profiles of the tissue and cell line samples were visualized and classified by singular value decomposition. Reverse transcription polymerase chain reaction was performed to confirm the expression alterations of selected genes in RCC. Results Selected genes were annotated based on biological processes and clustered into functional groups. The expression levels of genes in each group were also analyzed. Seventy-four commonly differentially expressed genes with more than five-fold changes in RCC tissues were identified. The expression alterations of selected genes from these seventy-four genes were further verified using reverse transcription polymerase chain reaction (RT-PCR. Detailed comparison of gene expression patterns in RCC tissue and RCC cell lines shows significant differences between the two types of samples, but many important expression patterns were preserved. Conclusions This is one of the initial studies that examine the functional ontology of a large number of genes in RCC. Extensive annotation, clustering and analysis of a large number of genes based on the gene functional ontology revealed many interesting gene expression patterns in RCC. Most

  18. Array-based identification of triple-negative breast cancer cells using fluorescent nanodot-graphene oxide complexes.

    Science.gov (United States)

    Tao, Yu; Auguste, Debra T

    2016-07-15

    Early and accurate diagnosis of breast cancer holds great promise to improve treatability and curability. Here, we report the usage of six luminescent nanodot-graphene oxide complexes as novel fluorescent nanoprobes in a sensing array capable of effectively identifying healthy, cancerous, and metastatic human breast cells. The sensory system is based on the utilization of nanoprobe-graphene oxide sensor elements that can be disrupted in the presence of breast cells to give fluorescent readouts. Using this multichannel sensor, we have successfully identified breast cancer cells and distinguished between estrogen receptor positive, human epidermal growth factor receptor-2 positive, and triple negative phenotypes. This approach also allows cell identification at high sensitivity (200 cells) with high reproducibility. The unknown cell sample analysis indicates that the sensor is able to identify 49 out of 50 breast cell samples correctly, with a detection accuracy of 98%. Taken together, this array-based luminescent nanoprobe-graphene oxide sensing platform presents a useful cell screening tool with potential applications in biomedical diagnostics.

  19. Fine mapping and candidate gene analysis of purple pericarp gene Pb in rice (Oryza sativa L.)

    Institute of Scientific and Technical Information of China (English)

    2007-01-01

    Purple rice is a type of rice with anthocyanins deposited in its grain pericarp. The rice Pb gene controlling purple pericarp character is known to be on chromosome 4, and the purple color is dominant over white color. In this study, we fine mapped the Pb gene using two F2 segregating populations, i.e. Pei'ai 64S (white) × Yunanheixiannuo (purple) and Pei'ai 64S × Chuanheinuo (purple). In the first-pass mapping, the Pb gene was located in the region downstream the SSR marker RM3820. In the fine mapping, the candidate region was saturated with InDel and CAPS markers developed specifically for this study. Eventually, the Pb gene was mapped within the 25-kb region delimited by the upstream marker RID3 and the downstream marker RID4. The delimited region contained two annotated genes, Ra and bhlh16 (TIGR Rice Genome, R.5). The former is a homologue of the Myc transcription factor Lc controlling anthocyanin biosynthesis in maize, and the latter is a homologue of the TT8 gene, which is also an Myc transcription factor gene controlling the pericarp pigmentation in Arabidopsis thaliana. Sequence analysis showed that the exon 7 of the Ra gene of Yunanheixiannuo and Chuanheinuo had a 2-bp (GT) deletion compared with those of the white rice varieties Pei'ai 64S, 9311 and Nipponbare. A CAPS marker, CAPSRa, was developed according to the GT deletion for analysis of the two F2 segregating populations and 106 rice lines. The results showed that all F2 plants with white pericarp, and all non-purple rice lines (63 white and 22 red) contained no GT deletion, but all 20 purple rice lines contained the GT deletion. These results suggested that the Ra gene may be the Pb gene and the purple pericarp characteristic of rice is caused by the GT deletion within exon 7 of the Ra gene.

  20. Microarray MAPH: accurate array-based detection of relative copy number in genomic DNA

    Directory of Open Access Journals (Sweden)

    Chan Alan

    2006-06-01

    Full Text Available Abstract Background Current methods for measurement of copy number do not combine all the desirable qualities of convenience, throughput, economy, accuracy and resolution. In this study, to improve the throughput associated with Multiplex Amplifiable Probe Hybridisation (MAPH we aimed to develop a modification based on the 3-Dimensional, Flow-Through Microarray Platform from PamGene International. In this new method, electrophoretic analysis of amplified products is replaced with photometric analysis of a probed oligonucleotide array. Copy number analysis of hybridised probes is based on a dual-label approach by comparing the intensity of Cy3-labelled MAPH probes amplified from test samples co-hybridised with similarly amplified Cy5-labelled reference MAPH probes. The key feature of using a hybridisation-based end point with MAPH is that discrimination of amplified probes is based on sequence and not fragment length. Results In this study we showed that microarray MAPH measurement of PMP22 gene dosage correlates well with PMP22 gene dosage determined by capillary MAPH and that copy number was accurately reported in analyses of DNA from 38 individuals, 12 of which were known to have Charcot-Marie-Tooth disease type 1A (CMT1A. Conclusion Measurement of microarray-based endpoints for MAPH appears to be of comparable accuracy to electrophoretic methods, and holds the prospect of fully exploiting the potential multiplicity of MAPH. The technology has the potential to simplify copy number assays for genes with a large number of exons, or of expanded sets of probes from dispersed genomic locations.

  1. GSMA: Gene Set Matrix Analysis, An Automated Method for Rapid Hypothesis Testing of Gene Expression Data

    Directory of Open Access Journals (Sweden)

    Chris Cheadle

    2007-01-01

    Full Text Available Background: Microarray technology has become highly valuable for identifying complex global changes in gene expression patterns. The assignment of functional information to these complex patterns remains a challenging task in effectively interpreting data and correlating results from across experiments, projects and laboratories. Methods which allow the rapid and robust evaluation of multiple functional hypotheses increase the power of individual researchers to data mine gene expression data more efficiently.Results: We have developed (gene set matrix analysis GSMA as a useful method for the rapid testing of group-wise up- or downregulation of gene expression simultaneously for multiple lists of genes (gene sets against entire distributions of gene expression changes (datasets for single or multiple experiments. The utility of GSMA lies in its flexibility to rapidly poll gene sets related by known biological function or as designated solely by the end-user against large numbers of datasets simultaneously.Conclusions: GSMA provides a simple and straightforward method for hypothesis testing in which genes are tested by groups across multiple datasets for patterns of expression enrichment.

  2. SIGNATURE: A workbench for gene expression signature analysis

    Directory of Open Access Journals (Sweden)

    Chang Jeffrey T

    2011-11-01

    Full Text Available Abstract Background The biological phenotype of a cell, such as a characteristic visual image or behavior, reflects activities derived from the expression of collections of genes. As such, an ability to measure the expression of these genes provides an opportunity to develop more precise and varied sets of phenotypes. However, to use this approach requires computational methods that are difficult to implement and apply, and thus there is a critical need for intelligent software tools that can reduce the technical burden of the analysis. Tools for gene expression analyses are unusually difficult to implement in a user-friendly way because their application requires a combination of biological data curation, statistical computational methods, and database expertise. Results We have developed SIGNATURE, a web-based resource that simplifies gene expression signature analysis by providing software, data, and protocols to perform the analysis successfully. This resource uses Bayesian methods for processing gene expression data coupled with a curated database of gene expression signatures, all carried out within a GenePattern web interface for easy use and access. Conclusions SIGNATURE is available for public use at http://genepattern.genome.duke.edu/signature/.

  3. Gene Ontology-Based Analysis of Zebrafish Omics Data Using the Web Tool Comparative Gene Ontology.

    Science.gov (United States)

    Ebrahimie, Esmaeil; Fruzangohar, Mario; Moussavi Nik, Seyyed Hani; Newman, Morgan

    2017-09-05

    Gene Ontology (GO) analysis is a powerful tool in systems biology, which uses a defined nomenclature to annotate genes/proteins within three categories: "Molecular Function," "Biological Process," and "Cellular Component." GO analysis can assist in revealing functional mechanisms underlying observed patterns in transcriptomic, genomic, and proteomic data. The already extensive and increasing use of zebrafish for modeling genetic and other diseases highlights the need to develop a GO analytical tool for this organism. The web tool Comparative GO was originally developed for GO analysis of bacterial data in 2013 ( www.comparativego.com ). We have now upgraded and elaborated this web tool for analysis of zebrafish genetic data using GOs and annotations from the Gene Ontology Consortium.

  4. High-resolution genome-wide array-based comparative genome hybridization reveals cryptic chromosome changes in AML and MDS cases with trisomy 8 as the sole cytogenetic aberration.

    Science.gov (United States)

    Paulsson, K; Heidenblad, M; Strömbeck, B; Staaf, J; Jönsson, G; Borg, A; Fioretos, T; Johansson, B

    2006-05-01

    Although trisomy 8 as the sole chromosome aberration is the most common numerical abnormality in acute myeloid leukemia (AML) and myelodysplastic syndromes (MDS), little is known about its pathogenetic effects. Considering that +8 is a frequent secondary change in AML/MDS, cryptic--possibly primary--genetic aberrations may occur in cases with trisomy 8 as the apparently single anomaly. However, no such hidden anomalies have been reported. We performed a high-resolution genome-wide array-based comparative genome hybridization (array CGH) analysis of 10 AML/MDS cases with isolated +8, utilizing a 32K bacterial artificial chromosome array set, providing >98% coverage of the genome with a resolution of 100 kb. Array CGH revealed intrachromosomal imbalances, not corresponding to known genomic copy number polymorphisms, in 4/10 cases, comprising nine duplications and hemizygous deletions ranging in size from 0.5 to 2.2 Mb. A 1.8 Mb deletion at 7p14.1, which had occurred prior to the +8, was identified in MDS transforming to AML. Furthermore, a deletion including ETV6 was present in one case. The remaining seven imbalances involved more than 40 genes. The present results show that cryptic genetic abnormalities are frequent in trisomy 8-positive AML/MDS cases and that +8 as the sole cytogenetic aberration is not always the primary genetic event.

  5. Functional analysis of fungal polyketide biosynthesis genes.

    Science.gov (United States)

    Fujii, Isao

    2010-05-01

    Fungal polyketides have huge structural diversity from simple aromatics to highly modified complex reduced-type compounds. Despite such diversty, single modular iterative type I polyketide synthases (iPKSs) are responsible for their carbon skeleton construction. Using heterologous expression systems, we have studied on ATX, a 6-methylsalicylic acid synthase from Aspergillus terreus as a model iPKS. In addition, iPKS functions involved in fungal spore pigment biosynthesis were analyzed together with polyketide-shortening enzymes that convert products of PKSs to shorter ketides by hydrolytic C-C bond cleavage. In our studies on reducing-type iPKSs, we cloned and expressed PKS genes, pksN, pksF, pksK and sol1 from Alternaria solani. The sol gene cluster was found to be involved in solanapyrone biosynthesis and sol5 was identified to encode solanapyrone synthase, a Diels-Alder enzyme. Our fungal PKS studies were further extended to identify the function of PKS-nonribosomal peptide synthase involved in cyclopiazonic acid biosynthesis.

  6. Reconstructability analysis as a tool for identifying gene-gene interactions in studies of human diseases.

    Science.gov (United States)

    Shervais, Stephen; Kramer, Patricia L; Westaway, Shawn K; Cox, Nancy J; Zwick, Martin

    2010-01-01

    There are a number of common human diseases for which the genetic component may include an epistatic interaction of multiple genes. Detecting these interactions with standard statistical tools is difficult because there may be an interaction effect, but minimal or no main effect. Reconstructability analysis (RA) uses Shannon's information theory to detect relationships between variables in categorical datasets. We applied RA to simulated data for five different models of gene-gene interaction, and find that even with heritability levels as low as 0.008, and with the inclusion of 50 non-associated genes in the dataset, we can identify the interacting gene pairs with an accuracy of > or =80%. We applied RA to a real dataset of type 2 non-insulin-dependent diabetes (NIDDM) cases and controls, and closely approximated the results of more conventional single SNP disease association studies. In addition, we replicated prior evidence for epistatic interactions between SNPs on chromosomes 2 and 15.

  7. GenePublisher: automated analysis of DNA microarray data

    DEFF Research Database (Denmark)

    Knudsen, Steen; Workman, Christopher; Sicheritz-Ponten, T.

    2003-01-01

    GenePublisher, a system for automatic analysis of data from DNA microarray experiments, has been implemented with a web interface at http://www.cbs.dtu.dk/services/GenePublisher. Raw data are uploaded to the server together with aspecification of the data. The server performs normalization......, statistical analysis and visualization of the data. The results are run against databases of signal transduction pathways, metabolic pathways and promoter sequences in order to extract more information. The results of the entire analysis are summarized in report form and returned to the user....

  8. Gene expression profiling of human erythroid progenitors by micro-serial analysis of gene expression.

    Science.gov (United States)

    Fujishima, Naohito; Hirokawa, Makoto; Aiba, Namiko; Ichikawa, Yoshikazu; Fujishima, Masumi; Komatsuda, Atsushi; Suzuki, Yoshiko; Kawabata, Yoshinari; Miura, Ikuo; Sawada, Ken-ichi

    2004-10-01

    We compared the expression profiles of highly purified human CD34+ cells and erythroid progenitor cells by micro-serial analysis of gene expression (microSAGE). Human CD34+ cells were purified from granulocyte colony-stimulating factor-mobilized blood stem cells, and erythroid progenitors were obtained by cultivating these cells in the presence of stem cell factor, interleukin 3, and erythropoietin. Our 10,202 SAGE tags allowed us to identify 1354 different transcripts appearing more than once. Erythroid progenitor cells showed increased expression of LRBA, EEF1A1, HSPCA, PILRB, RANBP1, NACA, and SMURF. Overexpression of HSPCA was confirmed by real-time polymerase chain reaction analysis. MicroSAGE revealed an unexpected preferential expression of several genes in erythroid progenitor cells in addition to the known functional genes, including hemoglobins. Our results provide reference data for future studies of gene expression in various hematopoietic disorders, including myelodysplastic syndrome and leukemia.

  9. Independent component analysis of Alzheimer's DNA microarray gene expression data

    Directory of Open Access Journals (Sweden)

    Vanderburg Charles R

    2009-01-01

    Full Text Available Abstract Background Gene microarray technology is an effective tool to investigate the simultaneous activity of multiple cellular pathways from hundreds to thousands of genes. However, because data in the colossal amounts generated by DNA microarray technology are usually complex, noisy, high-dimensional, and often hindered by low statistical power, their exploitation is difficult. To overcome these problems, two kinds of unsupervised analysis methods for microarray data: principal component analysis (PCA and independent component analysis (ICA have been developed to accomplish the task. PCA projects the data into a new space spanned by the principal components that are mutually orthonormal to each other. The constraint of mutual orthogonality and second-order statistics technique within PCA algorithms, however, may not be applied to the biological systems studied. Extracting and characterizing the most informative features of the biological signals, however, require higher-order statistics. Results ICA is one of the unsupervised algorithms that can extract higher-order statistical structures from data and has been applied to DNA microarray gene expression data analysis. We performed FastICA method on DNA microarray gene expression data from Alzheimer's disease (AD hippocampal tissue samples and consequential gene clustering. Experimental results showed that the ICA method can improve the clustering results of AD samples and identify significant genes. More than 50 significant genes with high expression levels in severe AD were extracted, representing immunity-related protein, metal-related protein, membrane protein, lipoprotein, neuropeptide, cytoskeleton protein, cellular binding protein, and ribosomal protein. Within the aforementioned categories, our method also found 37 significant genes with low expression levels. Moreover, it is worth noting that some oncogenes and phosphorylation-related proteins are expressed in low levels. In

  10. Identification of candidate genes in osteoporosis by integrated microarray analysis

    Science.gov (United States)

    Li, J. J.; Wang, B. Q.; Yang, Y.; Li, D.

    2016-01-01

    Objectives In order to screen the altered gene expression profile in peripheral blood mononuclear cells of patients with osteoporosis, we performed an integrated analysis of the online microarray studies of osteoporosis. Methods We searched the Gene Expression Omnibus (GEO) database for microarray studies of peripheral blood mononuclear cells in patients with osteoporosis. Subsequently, we integrated gene expression data sets from multiple microarray studies to obtain differentially expressed genes (DEGs) between patients with osteoporosis and normal controls. Gene function analysis was performed to uncover the functions of identified DEGs. Results A total of three microarray studies were selected for integrated analysis. In all, 1125 genes were found to be significantly differentially expressed between osteoporosis patients and normal controls, with 373 upregulated and 752 downregulated genes. Positive regulation of the cellular amino metabolic process (gene ontology (GO): 0033240, false discovery rate (FDR) = 1.00E + 00) was significantly enriched under the GO category for biological processes, while for molecular functions, flavin adenine dinucleotide binding (GO: 0050660, FDR = 3.66E-01) and androgen receptor binding (GO: 0050681, FDR = 6.35E-01) were significantly enriched. DEGs were enriched in many osteoporosis-related signalling pathways, including those of mitogen-activated protein kinase (MAPK) and calcium. Protein-protein interaction (PPI) network analysis showed that the significant hub proteins contained ubiquitin specific peptidase 9, X-linked (Degree = 99), ubiquitin specific peptidase 19 (Degree = 57) and ubiquitin conjugating enzyme E2 B (Degree = 57). Conclusion Analysis of gene function of identified differentially expressed genes may expand our understanding of fundamental mechanisms leading to osteoporosis. Moreover, significantly enriched pathways, such as MAPK and calcium, may involve in osteoporosis through osteoblastic differentiation and

  11. APOE gene polymorphism analysis in Barranquilla, Colombia.

    Science.gov (United States)

    Ruiz, Martha; Arias, Isis; Rolón, Gloria; Hernández, Enio; Garavito, Pilar; Silvera-Redondo, Carlos

    2016-03-03

    The genetic variability present in the APOE gene polymorphism is considered an important factor associated with predisposition to diseases affecting lipid metabolism, as well as heart diseases and Alzheimer's disease, among others. Understanding it as a risk factor in different populations and ethnic groups is a useful tool.  To analyze the APOE gene polymorphism and determine allelic and genotypic frequencies of a representative sample of population from Barranquilla, Colombia.  We performed a descriptive and comparative study. The sample size was 227 unrelated individuals from Barranquilla, Colombia.  The most frequent allele was the ε3, with 85%, followed by the ε4 allele (13%) and ε2 (1.8%). The genotypes found were: ε3/ε3: 71.8%, ε3/ε4: 24.2%, ε2/ε3: 2.2%, ε2/ε4: 1.3% and ε4/ε4: 0.4%. The ε2/ε2 genotype was not found in this study. The sample exhibited the Hardy-Weinberg equilibrium.  The frequency of the ε3 allele and the ε3/ε3 genotype was similar to that reported in the literature in countries like Brazil, Mexico, Colombia, and in some Colombian Amerindian ethnic groups. The ε2/ε2 genotype was absent. This result is consistent with those found in other population groups worldwide. The frequency of the ε4 allele and the genotypes associated in this population could be related to the presence of diseases such as hypercholesterolemia, myocardial infarction and Alzheimer.

  12. Analysis on Gene Expression Data using Confidence Margin

    Science.gov (United States)

    Yoshioka, Michifumi; Shimoda, Norihiro; Omatu, Shigeru

    Due to recent advances in biotechnology, we can get activation level of each gene within an organism at a particular point of time. The data is called “gene expression data”. Analysis of gene expression data can provide understanding and insight into gene function and regulatory mechanism. However, these tasks are made more difficult from the empirical nature of array data and the overwhelming number of gene feature. One of our previous works in our field is GASVM. GASVM is a hybrid method of Genetic Algorithm and Support Vector Machine by Saberi et al.(1). GASVM has a large computational cost and a possibility of overfitting. Therefore, we have introduced a new criterion “Confidence Margin” and proposed a new method using it. The experimental result using two famous datasets confirmed the effectiveness of our proposed method.

  13. Gene profile analysis of osteoblast genes differentially regulated by histone deacetylase inhibitors

    Directory of Open Access Journals (Sweden)

    Lamblin Anne-Francoise

    2007-10-01

    Full Text Available Abstract Background Osteoblast differentiation requires the coordinated stepwise expression of multiple genes. Histone deacetylase inhibitors (HDIs accelerate the osteoblast differentiation process by blocking the activity of histone deacetylases (HDACs, which alter gene expression by modifying chromatin structure. We previously demonstrated that HDIs and HDAC3 shRNAs accelerate matrix mineralization and the expression of osteoblast maturation genes (e.g. alkaline phosphatase, osteocalcin. Identifying other genes that are differentially regulated by HDIs might identify new pathways that contribute to osteoblast differentiation. Results To identify other osteoblast genes that are altered early by HDIs, we incubated MC3T3-E1 preosteoblasts with HDIs (trichostatin A, MS-275, or valproic acid for 18 hours in osteogenic conditions. The promotion of osteoblast differentiation by HDIs in this experiment was confirmed by osteogenic assays. Gene expression profiles relative to vehicle-treated cells were assessed by microarray analysis with Affymetrix GeneChip 430 2.0 arrays. The regulation of several genes by HDIs in MC3T3-E1 cells and primary osteoblasts was verified by quantitative real-time PCR. Nine genes were differentially regulated by at least two-fold after exposure to each of the three HDIs and six were verified by PCR in osteoblasts. Four of the verified genes (solute carrier family 9 isoform 3 regulator 1 (Slc9a3r1, sorbitol dehydrogenase 1, a kinase anchor protein, and glutathione S-transferase alpha 4 were induced. Two genes (proteasome subunit, beta type 10 and adaptor-related protein complex AP-4 sigma 1 were suppressed. We also identified eight growth factors and growth factor receptor genes that are significantly altered by each of the HDIs, including Frizzled related proteins 1 and 4, which modulate the Wnt signaling pathway. Conclusion This study identifies osteoblast genes that are regulated early by HDIs and indicates pathways that

  14. Bioinformatics Analysis of the Duck Enteritis Virus UL54 Gene

    Directory of Open Access Journals (Sweden)

    Chaoyue Liu

    2014-04-01

    Full Text Available In this study, we analyze the Duck Enteritis Virus (DEV UL54 gene, which has been isolated and identified in our lab (GenBank accession NO EU071033, to help deeply research on DEV. DNA sequence analysis showed that the identified ORF which composed of 1377 bp nucleotides encoded 458 amino acids with a predicted Mr. of 51.75 kDa. Multiple sequence alignment suggested that the UL54 gene was highly conserved in Alphaherpesvirinae and was similar to the other herpesviral UL54 gene. Phylogenetic analysis of the DEV UL54 gene revealed that DEV had a close evolutionary relationship with Gallid, Herpesvirus 2 (GaHV-2, Gallid Herpesvirus 3 (GaHV-3, Meleagrid Herpesvirus1 (MeHV-1 and should belong to a single cluster within the Alphaherpesvirinae subfamily.

  15. Genome-level identification, gene expression, and comparative analysis of porcine ß-defensin genes

    Directory of Open Access Journals (Sweden)

    Choi Min-Kyeung

    2012-11-01

    Full Text Available Abstract Background Beta-defensins (β-defensins are innate immune peptides with evolutionary conservation across a wide range of species and has been suggested to play important roles in innate immune reactions against pathogens. However, the complete β-defensin repertoire in the pig has not been fully addressed. Result A BLAST analysis was performed against the available pig genomic sequence in the NCBI database to identify β-defensin-related sequences using previously reported β-defensin sequences of pigs, humans, and cattle. The porcine β-defensin gene clusters were mapped to chromosomes 7, 14, 15 and 17. The gene expression analysis of 17 newly annotated porcine β-defensin genes across 15 tissues using semi-quantitative reverse transcription polymerase chain reaction (RT-PCR showed differences in their tissue distribution, with the kidney and testis having the largest pBD expression repertoire. We also analyzed single nucleotide polymorphisms (SNPs in the mature peptide region of pBD genes from 35 pigs of 7 breeds. We found 8 cSNPs in 7 pBDs. Conclusion We identified 29 porcine β-defensin (pBD gene-like sequences, including 17 unreported pBDs in the porcine genome. Comparative analysis of β-defensin genes in the pig genome with those in human and cattle genomes showed structural conservation of β-defensin syntenic regions among these species.

  16. Genes responsive to elevated CO2 concentrations in triploid white poplar and integrated gene network analysis.

    Directory of Open Access Journals (Sweden)

    Juanjuan Liu

    Full Text Available BACKGROUND: The atmospheric CO2 concentration increases every year. While the effects of elevated CO2 on plant growth, physiology and metabolism have been studied, there is now a pressing need to understand the molecular mechanisms of how plants will respond to future increases in CO2 concentration using genomic techniques. PRINCIPAL FINDINGS: Gene expression in triploid white poplar ((Populus tomentosa ×P. bolleana ×P. tomentosa leaves was investigated using the Affymetrix poplar genome gene chip, after three months of growth in controlled environment chambers under three CO2 concentrations. Our physiological findings showed the growth, assessed as stem diameter, was significantly increased, and the net photosynthetic rate was decreased in elevated CO2 concentrations. The concentrations of four major endogenous hormones appeared to actively promote plant development. Leaf tissues under elevated CO2 concentrations had 5,127 genes with different expression patterns in comparison to leaves under the ambient CO2 concentration. Among these, 8 genes were finally selected for further investigation by using randomized variance model corrective ANOVA analysis, dynamic gene expression profiling, gene network construction, and quantitative real-time PCR validation. Among the 8 genes in the network, aldehyde dehydrogenase and pyruvate kinase were situated in the core and had interconnections with other genes. CONCLUSIONS: Under elevated CO2 concentrations, 8 significantly changed key genes involved in metabolism and responding to stimulus of external environment were identified. These genes play crucial roles in the signal transduction network and show strong correlations with elevated CO2 exposure. This study provides several target genes, further investigation of which could provide an initial step for better understanding the molecular mechanisms of plant acclimation and evolution in future rising CO2 concentrations.

  17. Human gene correlation analysis (HGCA): a tool for the identification of transcriptionally co-expressed genes.

    Science.gov (United States)

    Michalopoulos, Ioannis; Pavlopoulos, Georgios A; Malatras, Apostolos; Karelas, Alexandros; Kostadima, Myrto-Areti; Schneider, Reinhard; Kossida, Sophia

    2012-06-06

    Bioinformatics and high-throughput technologies such as microarray studies allow the measure of the expression levels of large numbers of genes simultaneously, thus helping us to understand the molecular mechanisms of various biological processes in a cell. We calculate the Pearson Correlation Coefficient (r-value) between probe set signal values from Affymetrix Human Genome Microarray samples and cluster the human genes according to the r-value correlation matrix using the Neighbour Joining (NJ) clustering method. A hyper-geometric distribution is applied on the text annotations of the probe sets to quantify the term overrepresentations. The aim of the tool is the identification of closely correlated genes for a given gene of interest and/or the prediction of its biological function, which is based on the annotations of the respective gene cluster. Human Gene Correlation Analysis (HGCA) is a tool to classify human genes according to their coexpression levels and to identify overrepresented annotation terms in correlated gene groups. It is available at: http://biobank-informatics.bioacademy.gr/coexpression/.

  18. Analysis of pan-genome to identify the core genes and essential genes of Brucella spp.

    Science.gov (United States)

    Yang, Xiaowen; Li, Yajie; Zang, Juan; Li, Yexia; Bie, Pengfei; Lu, Yanli; Wu, Qingmin

    2016-04-01

    Brucella spp. are facultative intracellular pathogens, that cause a contagious zoonotic disease, that can result in such outcomes as abortion or sterility in susceptible animal hosts and grave, debilitating illness in humans. For deciphering the survival mechanism of Brucella spp. in vivo, 42 Brucella complete genomes from NCBI were analyzed for the pan-genome and core genome by identification of their composition and function of Brucella genomes. The results showed that the total 132,143 protein-coding genes in these genomes were divided into 5369 clusters. Among these, 1710 clusters were associated with the core genome, 1182 clusters with strain-specific genes and 2477 clusters with dispensable genomes. COG analysis indicated that 44 % of the core genes were devoted to metabolism, which were mainly responsible for energy production and conversion (COG category C), and amino acid transport and metabolism (COG category E). Meanwhile, approximately 35 % of the core genes were in positive selection. In addition, 1252 potential essential genes were predicted in the core genome by comparison with a prokaryote database of essential genes. The results suggested that the core genes in Brucella genomes are relatively conservation, and the energy and amino acid metabolism play a more important role in the process of growth and reproduction in Brucella spp. This study might help us to better understand the mechanisms of Brucella persistent infection and provide some clues for further exploring the gene modules of the intracellular survival in Brucella spp.

  19. Analysis of gene and protein name synonyms in Entrez Gene and UniProtKB resources

    KAUST Repository

    Arkasosy, Basil

    2013-05-11

    Ambiguity in texts is a well-known problem: words can carry several meanings, and hence, can be read and interpreted differently. This is also true in the biological literature; names of biological concepts, such as genes and proteins, might be ambiguous, referring in some cases to more than one gene or one protein, or in others, to both genes and proteins at the same time. Public biological databases give a very useful insight about genes and proteins information, including their names. In this study, we made a thorough analysis of the nomenclatures of genes and proteins in two data sources and for six different species. We developed an automated process that parses, extracts, processes and stores information available in two major biological databases: Entrez Gene and UniProtKB. We analysed gene and protein synonyms, their types, frequencies, and the ambiguities within a species, in between data sources and cross-species. We found that at least 40% of the cross-species ambiguities are caused by names that are already ambiguous within the species. Our study shows that from the six species we analysed (Homo Sapiens, Mus Musculus, Arabidopsis Thaliana, Oryza Sativa, Bacillus Subtilis and Pseudomonas Fluorescens), rice (Oriza Sativa) has the best naming model in Entrez Gene database, with low ambiguities between data sources and cross-species.

  20. Semi-supervised consensus clustering for gene expression data analysis

    OpenAIRE

    Wang, Yunli; Pan, Youlian

    2014-01-01

    Background Simple clustering methods such as hierarchical clustering and k-means are widely used for gene expression data analysis; but they are unable to deal with noise and high dimensionality associated with the microarray gene expression data. Consensus clustering appears to improve the robustness and quality of clustering results. Incorporating prior knowledge in clustering process (semi-supervised clustering) has been shown to improve the consistency between the data partitioning and do...

  1. Cloning and Analysis of Genes SAMS From Glycine soja

    Institute of Scientific and Technical Information of China (English)

    FAN Jinping; BAI Xi; LI Yong; JI Wei; WANG Xi; ZHU Yanming

    2008-01-01

    S-adenosylmethionine (SAM) plays important role in trans-methyl reactions. Under the condition of drought (30%PEG),salinity (200 mmol·L-1 NaCI) and low temperature (4℃),total RNA was extracted from the leaf and the first strand of eDNA was synthesized with reverse transcription.S-adenosylmethionine synthetase gene (SAMS gene) was amplified by PeR with the first strand eDNA as template and a pair of primers which was based on constructed ESTs sequence.Full-length SAMS gene sequence was obtained by BLAST comparison. According to the analysis, completed sequence of SAMS gene was integrality.The sequence of the SAMS gene was 1185 bp in length with an opening reading frame (ORF) encoding 394 amino acids.The cDNA sequence showed a significant homology to the SAM genes from Phaseolus lunatus (89%),Medicago sativa (85%).A prokaryotie expression vectors based on pET-32b had been constructed and prokaryotie expression was analyzed in order to lay a strong foundation for resist adversity function analysis through situation of genie expression analysis.

  2. Gene Expression Signature in Endemic Osteoarthritis by Microarray Analysis

    Directory of Open Access Journals (Sweden)

    Xi Wang

    2015-05-01

    Full Text Available Kashin-Beck Disease (KBD is an endemic osteochondropathy with an unknown pathogenesis. Diagnosis of KBD is effective only in advanced cases, which eliminates the possibility of early treatment and leads to an inevitable exacerbation of symptoms. Therefore, we aim to identify an accurate blood-based gene signature for the detection of KBD. Previously published gene expression profile data on cartilage and peripheral blood mononuclear cells (PBMCs from adults with KBD were compared to select potential target genes. Microarray analysis was conducted to evaluate the expression of the target genes in a cohort of 100 KBD patients and 100 healthy controls. A gene expression signature was identified using a training set, which was subsequently validated using an independent test set with a minimum redundancy maximum relevance (mRMR algorithm and support vector machine (SVM algorithm. Fifty unique genes were differentially expressed between KBD patients and healthy controls. A 20-gene signature was identified that distinguished between KBD patients and controls with 90% accuracy, 85% sensitivity, and 95% specificity. This study identified a 20-gene signature that accurately distinguishes between patients with KBD and controls using peripheral blood samples. These results promote the further development of blood-based genetic biomarkers for detection of KBD.

  3. 基于RAP的垂直线列阵时反定位研究%Research on time reversal positioning of vertical line array based on RAP

    Institute of Scientific and Technical Information of China (English)

    王鸿吉; 韩建辉; 杨日杰

    2015-01-01

    在分析可靠声路径( RAP )的物理机理的基础上,将RAP与时间反转处理相结合,研究了基于RAP的垂直水听器阵时反定位问题,并在Munk声速剖面下的进行了仿真分析,结果表明了利用RAP进行垂直线列阵时反定位的有效性,通过适当提高阵元数量和间距可增强时反聚焦效果。%On the basis of analyzing physical mechanism of reliable acoustic path( RAP ) and time reversal processing is combined. Time reversal positioning problem of vertical hydrophone array based on RAP is researched. Simulation analysis under the condition of Munk sound speed profile is carried out,and the result shows validity of positioning of vertical line array based on RAP,improving sensor number and distance between sensors can enhance time-reversal energy focus effect.

  4. Gene family level comparative analysis of gene expression in mammals validates the ortholog conjecture.

    Science.gov (United States)

    Rogozin, Igor B; Managadze, David; Shabalina, Svetlana A; Koonin, Eugene V

    2014-04-01

    The ortholog conjecture (OC), which is central to functional annotation of genomes, posits that orthologous genes are functionally more similar than paralogous genes at the same level of sequence divergence. However, a recent study challenged the OC by reporting a greater functional similarity, in terms of Gene Ontology (GO) annotations and expression profiles, among within-species paralogs compared with orthologs. These findings were taken to indicate that functional similarity of homologous genes is primarily determined by the cellular context of the genes, rather than evolutionary history. However, several subsequent studies suggest that GO annotations and microarray data could artificially inflate functional similarity between paralogs from the same organism. We sought to test the OC using approaches distinct from those used in previous studies. Analysis of a large RNAseq data set from multiple human and mouse tissues shows that expression similarity (correlations coefficients, rank's, or Z-scores) between orthologs is substantially greater than that for between-species paralogs with the same sequence divergence, in agreement with the OC and the results of recent detailed analyses. These findings are further corroborated by a fine-grain analysis in which expression profiles of orthologs and paralogs were compared separately for individual gene families. Expression profiles of within-species paralogs are more strongly correlated than profiles of orthologs but it is shown that this is caused by high background noise, that is, correlation between profiles of unrelated genes in the same organism. Z-scores and rank scores show a nonmonotonic dependence of expression profile similarity on sequence divergence. This complexity of gene expression evolution after duplication might be at least partially caused by selection for protein dosage rebalancing following gene duplication.

  5. Haplotype sharing analysis with SNPs in candidate genes : The genetic analysis workshop 12 example

    NARCIS (Netherlands)

    Fischer, C; Beckmann, L; Majoram, P; Meerman, GT; Chang-Claude, J

    2003-01-01

    Haplotype sharing analysis was used to investigate the association of affection status with single nucleotide polymorphism (SNP) haplotypes within candidate gene 1 in one sample each from the isolated and the general population of Genetic Analysis Workshop (GAW) 12 simulated data. Gene 1 has direct

  6. Genome-wide gene-gene interaction analysis for next-generation sequencing.

    Science.gov (United States)

    Zhao, Jinying; Zhu, Yun; Xiong, Momiao

    2016-03-01

    The critical barrier in interaction analysis for next-generation sequencing (NGS) data is that the traditional pairwise interaction analysis that is suitable for common variants is difficult to apply to rare variants because of their prohibitive computational time, large number of tests and low power. The great challenges for successful detection of interactions with NGS data are (1) the demands in the paradigm of changes in interaction analysis; (2) severe multiple testing; and (3) heavy computations. To meet these challenges, we shift the paradigm of interaction analysis between two SNPs to interaction analysis between two genomic regions. In other words, we take a gene as a unit of analysis and use functional data analysis techniques as dimensional reduction tools to develop a novel statistic to collectively test interaction between all possible pairs of SNPs within two genome regions. By intensive simulations, we demonstrate that the functional logistic regression for interaction analysis has the correct type 1 error rates and higher power to detect interaction than the currently used methods. The proposed method was applied to a coronary artery disease dataset from the Wellcome Trust Case Control Consortium (WTCCC) study and the Framingham Heart Study (FHS) dataset, and the early-onset myocardial infarction (EOMI) exome sequence datasets with European origin from the NHLBI's Exome Sequencing Project. We discovered that 6 of 27 pairs of significantly interacted genes in the FHS were replicated in the independent WTCCC study and 24 pairs of significantly interacted genes after applying Bonferroni correction in the EOMI study.

  7. Gene expression correlation analysis predicts involvement of high- and low-confidence risk genes in different stages of prostate carcinogenesis.

    Science.gov (United States)

    Yano, Kojiro

    2010-12-01

    Whole genome association studies have identified many loci associated with the risk of prostate cancer (PC). However, very few of the genes associated with these loci have been related to specific processes of prostate carcinogenesis. Therefore I inferred biological functions associated with these risk genes using gene expression correlation analysis. PC risk genes reported in the literature were classified as having high (Plow (Phigh-confidence genes and other genes in the microarray dataset, whereas correlation between low-confidence genes and other genes in PC showed smaller decrease. Genes involved in developmental processes were significantly correlated with all risk gene categories. Ectoderm development genes, which may be related to squamous metaplasia, and genes enriched in fetal prostate stem cells (PSCs) showed strong association with the high-confidence genes. The association between the PSC genes and the low-confidence genes was weak, but genes related to neural system genes showed strong association with low-confidence genes. The high-confidence risk genes may be associated with an early stage of prostate carcinogenesis, possibly involving PSCs and squamous metaplasia. The low-confidence genes may be involved in a later stage of carcinogenesis. © 2010 Wiley-Liss, Inc.

  8. Functional analysis of prognostic gene expression network genes in metastatic breast cancer models.

    Directory of Open Access Journals (Sweden)

    Thomas R Geiger

    Full Text Available Identification of conserved co-expression networks is a useful tool for clustering groups of genes enriched for common molecular or cellular functions [1]. The relative importance of genes within networks can frequently be inferred by the degree of connectivity, with those displaying high connectivity being significantly more likely to be associated with specific molecular functions [2]. Previously we utilized cross-species network analysis to identify two network modules that were significantly associated with distant metastasis free survival in breast cancer. Here, we validate one of the highly connected genes as a metastasis associated gene. Tpx2, the most highly connected gene within a proliferation network specifically prognostic for estrogen receptor positive (ER+ breast cancers, enhances metastatic disease, but in a tumor autonomous, proliferation-independent manner. Histologic analysis suggests instead that variation of TPX2 levels within disseminated tumor cells may influence the transition between dormant to actively proliferating cells in the secondary site. These results support the co-expression network approach for identification of new metastasis-associated genes to provide new information regarding the etiology of breast cancer progression and metastatic disease.

  9. Functional gene group analysis identifies synaptic gene groups as risk factor for schizophrenia.

    Science.gov (United States)

    Lips, E S; Cornelisse, L N; Toonen, R F; Min, J L; Hultman, C M; Holmans, P A; O'Donovan, M C; Purcell, S M; Smit, A B; Verhage, M; Sullivan, P F; Visscher, P M; Posthuma, D

    2012-10-01

    Schizophrenia is a highly heritable disorder with a polygenic pattern of inheritance and a population prevalence of ~1%. Previous studies have implicated synaptic dysfunction in schizophrenia. We tested the accumulated association of genetic variants in expert-curated synaptic gene groups with schizophrenia in 4673 cases and 4965 healthy controls, using functional gene group analysis. Identifying groups of genes with similar cellular function rather than genes in isolation may have clinical implications for finding additional drug targets. We found that a group of 1026 synaptic genes was significantly associated with the risk of schizophrenia (P=7.6 × 10(-11)) and more strongly associated than 100 randomly drawn, matched control groups of genetic variants (P<0.01). Subsequent analysis of synaptic subgroups suggested that the strongest association signals are derived from three synaptic gene groups: intracellular signal transduction (P=2.0 × 10(-4)), excitability (P=9.0 × 10(-4)) and cell adhesion and trans-synaptic signaling (P=2.4 × 10(-3)). These results are consistent with a role of synaptic dysfunction in schizophrenia and imply that impaired intracellular signal transduction in synapses, synaptic excitability and cell adhesion and trans-synaptic signaling play a role in the pathology of schizophrenia.

  10. A multicolor panel of novel lentiviral "gene ontology" (LeGO) vectors for functional gene analysis.

    Science.gov (United States)

    Weber, Kristoffer; Bartsch, Udo; Stocking, Carol; Fehse, Boris

    2008-04-01

    Functional gene analysis requires the possibility of overexpression, as well as downregulation of one, or ideally several, potentially interacting genes. Lentiviral vectors are well suited for this purpose as they ensure stable expression of complementary DNAs (cDNAs), as well as short-hairpin RNAs (shRNAs), and can efficiently transduce a wide spectrum of cell targets when packaged within the coat proteins of other viruses. Here we introduce a multicolor panel of novel lentiviral "gene ontology" (LeGO) vectors designed according to the "building blocks" principle. Using a wide spectrum of different fluorescent markers, including drug-selectable enhanced green fluorescent protein (eGFP)- and dTomato-blasticidin-S resistance fusion proteins, LeGO vectors allow simultaneous analysis of multiple genes and shRNAs of interest within single, easily identifiable cells. Furthermore, each functional module is flanked by unique cloning sites, ensuring flexibility and individual optimization. The efficacy of these vectors for analyzing multiple genes in a single cell was demonstrated in several different cell types, including hematopoietic, endothelial, and neural stem and progenitor cells, as well as hepatocytes. LeGO vectors thus represent a valuable tool for investigating gene networks using conditional ectopic expression and knock-down approaches simultaneously.

  11. LINGUISTIC ANALYSIS OF THE NUCLEOPROTEIN GENE OF INFLUENZA A VIRUS

    Energy Technology Data Exchange (ETDEWEB)

    A. SKOURIKHINE; T. BURR

    2000-05-01

    We applied linguistic analysis approach, specifically N-grams, to classify nucleotide and amino acids sequences of nucleoprotein (NP) gene of the Influenza A virus isolated from a range of hosts and geographic regions. We considered letter frequency (1-grams), letter pairs frequency (2-grams) and triplets' frequency (3-grams). Classification trees based on 1,2,3-grams variables were constructed for the same NP nucleotide and amino acids strains and their classification efficiency were compared with the clustering obtained using phylogenetic analysis. The results have shown that disregarding positional information for a NP gene can provide the same level of recognition accuracy like alternative more complex classification techniques.

  12. High-throughput Binary Vectors for Plant Gene Function Analysis

    Institute of Scientific and Technical Information of China (English)

    Zhi-Yong Lei; Ping Zhao; Min-Jie Cao; Rong Cui; Xi Chen; Li-Zhong Xiong; Qi-Fa Zhang; David J. Oliver; Cheng-Bin Xiang

    2007-01-01

    A series of high-throughput binary cloning vectors were constructed to facilitate gene function analysis in higher plants. This vector series consists of plasmids designed for plant expression, promoter analysis, gene silencing,and green fluorescent protein fusions for protein localization. These vectors provide for high-throughput and efficient cloning utilizing sites for λ phage integrase/excisionase. In addition, unique restriction sites are incorporated in a multiple cloning site and enable promoter replacement. The entire vector series are available with complete sequence information and detailed annotations and are freely distributed to the scientific community for non-commercial uses.

  13. Meta-analysis based variable selection for gene expression data.

    Science.gov (United States)

    Li, Quefeng; Wang, Sijian; Huang, Chiang-Ching; Yu, Menggang; Shao, Jun

    2014-12-01

    Recent advance in biotechnology and its wide applications have led to the generation of many high-dimensional gene expression data sets that can be used to address similar biological questions. Meta-analysis plays an important role in summarizing and synthesizing scientific evidence from multiple studies. When the dimensions of datasets are high, it is desirable to incorporate variable selection into meta-analysis to improve model interpretation and prediction. According to our knowledge, all existing methods conduct variable selection with meta-analyzed data in an "all-in-or-all-out" fashion, that is, a gene is either selected in all of studies or not selected in any study. However, due to data heterogeneity commonly exist in meta-analyzed data, including choices of biospecimens, study population, and measurement sensitivity, it is possible that a gene is important in some studies while unimportant in others. In this article, we propose a novel method called meta-lasso for variable selection with high-dimensional meta-analyzed data. Through a hierarchical decomposition on regression coefficients, our method not only borrows strength across multiple data sets to boost the power to identify important genes, but also keeps the selection flexibility among data sets to take into account data heterogeneity. We show that our method possesses the gene selection consistency, that is, when sample size of each data set is large, with high probability, our method can identify all important genes and remove all unimportant genes. Simulation studies demonstrate a good performance of our method. We applied our meta-lasso method to a meta-analysis of five cardiovascular studies. The analysis results are clinically meaningful.

  14. Designed microtremor array based actual measurement and analysis of strong ground motion at Palu city, Indonesia

    Science.gov (United States)

    Thein, Pyi Soe; Pramumijoyo, Subagyo; Brotopuspito, Kirbani Sri; Wilopo, Wahyu; Kiyono, Junji; Setianto, Agung; Putra, Rusnardi Rahmat

    2015-04-01

    In this study, we investigated the strong ground motion characteristics under Palu City, Indonesia. The shear wave velocity structures evaluated by eight microtremors measurement are the most applicable to determine the thickness of sediments and average shear wave velocity with Vs ≤ 300 m/s. Based on subsurface underground structure models identified, earthquake ground motion was estimated in the future Palu-Koro earthquake by using statistical green's function method. The seismic microzonation parameters were carried out by considering several significant controlling factors on ground response at January 23, 2005 earthquake.

  15. Designed microtremor array based actual measurement and analysis of strong ground motion at Palu city, Indonesia

    Energy Technology Data Exchange (ETDEWEB)

    Thein, Pyi Soe, E-mail: pyisoethein@yahoo.com [Geology Department, Yangon University (Myanmar); Pramumijoyo, Subagyo; Wilopo, Wahyu; Setianto, Agung [Geological Engineering Department, Gadjah Mada University (Indonesia); Brotopuspito, Kirbani Sri [Physics Department, Gadjah Mada University (Indonesia); Kiyono, Junji; Putra, Rusnardi Rahmat [Graduate School of Global Environmental Studies, Kyoto University (Japan)

    2015-04-24

    In this study, we investigated the strong ground motion characteristics under Palu City, Indonesia. The shear wave velocity structures evaluated by eight microtremors measurement are the most applicable to determine the thickness of sediments and average shear wave velocity with Vs ≤ 300 m/s. Based on subsurface underground structure models identified, earthquake ground motion was estimated in the future Palu-Koro earthquake by using statistical green’s function method. The seismic microzonation parameters were carried out by considering several significant controlling factors on ground response at January 23, 2005 earthquake.

  16. Gene expression patterns combined with network analysis identify hub genes associated with bladder cancer.

    Science.gov (United States)

    Bi, Dongbin; Ning, Hao; Liu, Shuai; Que, Xinxiang; Ding, Kejia

    2015-06-01

    To explore molecular mechanisms of bladder cancer (BC), network strategy was used to find biomarkers for early detection and diagnosis. The differentially expressed genes (DEGs) between bladder carcinoma patients and normal subjects were screened using empirical Bayes method of the linear models for microarray data package. Co-expression networks were constructed by differentially co-expressed genes and links. Regulatory impact factors (RIF) metric was used to identify critical transcription factors (TFs). The protein-protein interaction (PPI) networks were constructed by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and clusters were obtained through molecular complex detection (MCODE) algorithm. Centralities analyses for complex networks were performed based on degree, stress and betweenness. Enrichment analyses were performed based on Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Co-expression networks and TFs (based on expression data of global DEGs and DEGs in different stages and grades) were identified. Hub genes of complex networks, such as UBE2C, ACTA2, FABP4, CKS2, FN1 and TOP2A, were also obtained according to analysis of degree. In gene enrichment analyses of global DEGs, cell adhesion, proteinaceous extracellular matrix and extracellular matrix structural constituent were top three GO terms. ECM-receptor interaction, focal adhesion, and cell cycle were significant pathways. Our results provide some potential underlying biomarkers of BC. However, further validation is required and deep studies are needed to elucidate the pathogenesis of BC. Copyright © 2015 Elsevier Ltd. All rights reserved.

  17. A new gene superfamily of pathogen-response (repat) genes in Lepidoptera: classification and expression analysis.

    Science.gov (United States)

    Navarro-Cerrillo, G; Hernández-Martínez, P; Vogel, H; Ferré, J; Herrero, S

    2013-01-01

    Repat (REsponse to PAThogens) genes were first identified in the midgut of Spodoptera exigua (Lepidoptera: Noctuidae) in response to Bacillus thuringiensis and baculovirus exposure. Since then, additional repat gene homologs have been identified in different studies. In this study the comprehensive larval transcriptome from S. exigua was analyzed for the presence of novel repat-homolog sequences. These analyses revealed the presence of at least 46 repat genes in S. exigua, establishing a new gene superfamily in this species. Phylogenetic analysis and studies of conserved motifs in these hypothetical proteins have allowed their classification in two main classes, αREPAT and βREPAT. Studies on the transcriptional response of repat genes have shown that αREPAT and βREPAT differ in their sequence but also in the pattern of regulation. The αREPAT were mainly regulated in response to the Cry1Ca toxin from B. thuringiensis but not to the increase in the midgut microbiota load. In contrast, βREPAT were neither responding to Cry1Ca toxin nor to midgut microbiota. Differential expression between midgut stem cells and the whole midgut tissue was studied for the different repat genes revealing changes in the gene expression distribution between midgut stem cells and midgut tissue in response to midgut microbiota. This high diversity found in their sequence and in their expression profile suggests that REPAT proteins may be involved in multiple processes that could be of relevance for the understanding of the insect gut physiology.

  18. Quantitative DNA methylation analysis of candidate genes in cervical cancer.

    Directory of Open Access Journals (Sweden)

    Erin M Siegel

    Full Text Available Aberrant DNA methylation has been observed in cervical cancer; however, most studies have used non-quantitative approaches to measure DNA methylation. The objective of this study was to quantify methylation within a select panel of genes previously identified as targets for epigenetic silencing in cervical cancer and to identify genes with elevated methylation that can distinguish cancer from normal cervical tissues. We identified 49 women with invasive squamous cell cancer of the cervix and 22 women with normal cytology specimens. Bisulfite-modified genomic DNA was amplified and quantitative pyrosequencing completed for 10 genes (APC, CCNA, CDH1, CDH13, WIF1, TIMP3, DAPK1, RARB, FHIT, and SLIT2. A Methylation Index was calculated as the mean percent methylation across all CpG sites analyzed per gene (~4-9 CpG site per sequence. A binary cut-point was defined at >15% methylation. Sensitivity, specificity and area under ROC curve (AUC of methylation in individual genes or a panel was examined. The median methylation index was significantly higher in cases compared to controls in 8 genes, whereas there was no difference in median methylation for 2 genes. Compared to HPV and age, the combination of DNA methylation level of DAPK1, SLIT2, WIF1 and RARB with HPV and age significantly improved the AUC from 0.79 to 0.99 (95% CI: 0.97-1.00, p-value = 0.003. Pyrosequencing analysis confirmed that several genes are common targets for aberrant methylation in cervical cancer and DNA methylation level of four genes appears to increase specificity to identify cancer compared to HPV detection alone. Alterations in DNA methylation of specific genes in cervical cancers, such as DAPK1, RARB, WIF1, and SLIT2, may also occur early in cervical carcinogenesis and should be evaluated.

  19. Quantitative DNA methylation analysis of candidate genes in cervical cancer.

    Science.gov (United States)

    Siegel, Erin M; Riggs, Bridget M; Delmas, Amber L; Koch, Abby; Hakam, Ardeshir; Brown, Kevin D

    2015-01-01

    Aberrant DNA methylation has been observed in cervical cancer; however, most studies have used non-quantitative approaches to measure DNA methylation. The objective of this study was to quantify methylation within a select panel of genes previously identified as targets for epigenetic silencing in cervical cancer and to identify genes with elevated methylation that can distinguish cancer from normal cervical tissues. We identified 49 women with invasive squamous cell cancer of the cervix and 22 women with normal cytology specimens. Bisulfite-modified genomic DNA was amplified and quantitative pyrosequencing completed for 10 genes (APC, CCNA, CDH1, CDH13, WIF1, TIMP3, DAPK1, RARB, FHIT, and SLIT2). A Methylation Index was calculated as the mean percent methylation across all CpG sites analyzed per gene (~4-9 CpG site) per sequence. A binary cut-point was defined at >15% methylation. Sensitivity, specificity and area under ROC curve (AUC) of methylation in individual genes or a panel was examined. The median methylation index was significantly higher in cases compared to controls in 8 genes, whereas there was no difference in median methylation for 2 genes. Compared to HPV and age, the combination of DNA methylation level of DAPK1, SLIT2, WIF1 and RARB with HPV and age significantly improved the AUC from 0.79 to 0.99 (95% CI: 0.97-1.00, p-value = 0.003). Pyrosequencing analysis confirmed that several genes are common targets for aberrant methylation in cervical cancer and DNA methylation level of four genes appears to increase specificity to identify cancer compared to HPV detection alone. Alterations in DNA methylation of specific genes in cervical cancers, such as DAPK1, RARB, WIF1, and SLIT2, may also occur early in cervical carcinogenesis and should be evaluated.

  20. Identifying glioblastoma gene networks based on hypergeometric test analysis.

    Directory of Open Access Journals (Sweden)

    Vasileios Stathias

    Full Text Available Patient specific therapy is emerging as an important possibility for many cancer patients. However, to identify such therapies it is essential to determine the genomic and transcriptional alterations present in one tumor relative to control samples. This presents a challenge since use of a single sample precludes many standard statistical analysis techniques. We reasoned that one means of addressing this issue is by comparing transcriptional changes in one tumor with those observed in a large cohort of patients analyzed by The Cancer Genome Atlas (TCGA. To test this directly, we devised a bioinformatics pipeline to identify differentially expressed genes in tumors resected from patients suffering from the most common malignant adult brain tumor, glioblastoma (GBM. We performed RNA sequencing on tumors from individual GBM patients and filtered the results through the TCGA database in order to identify possible gene networks that are overrepresented in GBM samples relative to controls. Importantly, we demonstrate that hypergeometric-based analysis of gene pairs identifies gene networks that validate experimentally. These studies identify a putative workflow for uncovering differentially expressed patient specific genes and gene networks for GBM and other cancers.

  1. Functional analysis of soybean genes involved in flavonoid biosynthesis by virus-induced gene silencing.

    Science.gov (United States)

    Nagamatsu, Atsushi; Masuta, Chikara; Senda, Mineo; Matsuura, Hideyuki; Kasai, Atsushi; Hong, Jin-Sung; Kitamura, Keisuke; Abe, Jun; Kanazawa, Akira

    2007-11-01

    Virus-induced gene silencing (VIGS) is a powerful tool for functional analysis of genes in plants. A wide-host-range VIGS vector, which was developed based on the Cucumber mosaic virus (CMV), was tested for its ability to silence endogenous genes involved in flavonoid biosynthesis in soybean. Symptomless infection was established using a pseudorecombinant virus, which enabled detection of specific changes in metabolite content by VIGS. It has been demonstrated that the yellow seed coat phenotype of various cultivated soybean lines that lack anthocyanin pigmentation is induced by natural degradation of chalcone synthase (CHS) mRNA. When soybean plants with brown seed coats were infected with a virus that contains the CHS gene sequence, the colour of the seed coats changed to yellow, which indicates that the naturally occurring RNA silencing is reproduced by VIGS. In addition, CHS VIGS consequently led to a decrease in isoflavone content in seeds. VIGS was also tested on the putative flavonoid 3'-hydroxylase (F3'H) gene in the pathway. This experiment resulted in a decrease in the content of quercetin relative to kaempferol in the upper leaves after viral infection, which suggests that the putative gene actually encodes the F3'H protein. In both experiments, a marked decrease in the target mRNA and accumulation of short interfering RNAs were detected, indicating that sequence-specific mRNA degradation was induced. The present report is a successful demonstration of the application of VIGS for genes involved in flavonoid biosynthesis in plants; the CMV-based VIGS system provides an efficient tool for functional analysis of soybean genes.

  2. Prioritisation and network analysis of Crohn's disease susceptibility genes.

    Directory of Open Access Journals (Sweden)

    Daniele Muraro

    Full Text Available Recent Genome-Wide Association Studies (GWAS have revealed numerous Crohn's disease susceptibility genes and a key challenge now is in understanding how risk polymorphisms in associated genes might contribute to development of this disease. For a gene to contribute to disease phenotype, its risk variant will likely adversely communicate with a variety of other gene products to result in dysregulation of common signaling pathways. A vital challenge is to elucidate pathways of potentially greatest influence on pathological behaviour, in a manner recognizing how multiple relevant genes may yield integrative effect. In this work we apply mathematical analysis of networks involving the list of recently described Crohn's susceptibility genes, to prioritise pathways in relation to their potential development of this disease. Prioritisation was performed by applying a text mining and a diffusion based method (GRAIL, GPEC. Prospective biological significance of the resulting prioritised list of proteins is highlighted by changes in their gene expression levels in Crohn's patients intestinal tissue in comparison with healthy donors.

  3. CANDIDATE GENE ANALYSIS IN ISRAELI SOLDIERS WITH STRESS FRACTURES

    Directory of Open Access Journals (Sweden)

    Ran Yanovich

    2012-03-01

    Full Text Available To investigate the association of polymorphisms within candidate genes which we hypothesized may contribute to stress fracture predisposition, a case-control, cross- sectional study design was employed. Genotyping 268 Single Nucleotide Polymorphisms- SNPs within 17 genes in 385 Israeli young male and female recruits (182 with and 203 without stress fractures. Twenty-five polymorphisms within 9 genes (NR3C1, ANKH, VDR, ROR2, CALCR, IL6, COL1A2, CBG, and LRP4 showed statistically significant differences (p < 0.05 in the distribution between stress fracture cases and non stress fracture controls. Seventeen genetic variants were associated with an increased stress fracture risk, and eight variants with a decreased stress fracture risk. None of the SNP associations remained significant after correcting for multiple comparisons (false discovery rate- FDR. Our findings suggest that genes may be involved in stress fracture pathogenesis. Specifically, the CALCR and the VDR genes are intriguing candidates. The putative involvement of these genes in stress fracture predisposition requires analysis of more cases and controls and sequencing the relevant genomic regions, in order to define the specific gene mutations

  4. Comparative and functional analysis of cardiovascular-related genes

    Energy Technology Data Exchange (ETDEWEB)

    Cheng, Jan-Fang; Pennacchio, Len A.

    2003-09-01

    The ability to detect putative cis-regulatory elements in cardiovascular-related genes has been accelerated by the availability of genomic sequence data from numerous vertebrate species and the recent development of comparative genomic tools. This improvement is anticipated to lead to a better understanding of the complex regulatory architecture of cardiovascular (CV) genes and how genetic variants in these non-coding regions can potentially play a role in cardiovascular disease. This manuscript reviews a recently established database dedicated to the comparative sequence analysis of 250 human CV genes of known importance, 37 of which currently contain sequence comparison data for organisms beyond those of human, mouse and rat. These data have provided a glimpse into the variety of possible insights from deep vertebrate sequence comparisons and the identification of putative gene regulatory elements.

  5. Putative psychosis genes in the prefrontal cortex: combined analysis of gene expression microarrays

    Directory of Open Access Journals (Sweden)

    Yolken Robert H

    2008-11-01

    Full Text Available Abstract Background Recent studies have shown similarities between schizophrenia and bipolar disorder in phenotypes and in genotypes, and those studies have contributed to an ongoing re-evaluation of the traditional dichotomy between schizophrenia and bipolar disorder. Bipolar disorder with psychotic features may be closely related to schizophrenia and therefore, psychosis may be an alternative phenotype compared to the traditional diagnosis categories. Methods We performed a cross-study analysis of 7 gene expression microarrays that include both psychosis and non-psychosis subjects. These studies include over 400 microarray samples (163 individual subjects on 3 different Affymetrix microarray platforms. Results We found that 110 transcripts are differentially regulated (p Conclusion This study demonstrates the advantages of cross-study analysis in detecting consensus changes in gene expression across multiple microarray studies. Differential gene expression between individuals with and without psychosis suggests that psychosis may be a useful phenotypic variable to complement the traditional diagnosis categories.

  6. Coupled Two-Way Clustering Analysis of Gene Microarray Data

    CERN Document Server

    Getz, G; Domany, E

    2000-01-01

    We present a novel coupled two-way clustering approach to gene microarray data analysis. The main idea is to identify subsets of the genes and samples, such that when one of these is used to cluster the other, stable and significant partitions emerge. The search for such subsets is a computationally complex task: we present an algorithm, based on iterative clustering, which performs such a search. This analysis is especially suitable for gene microarray data, where the contributions of a variety of biological mechanisms to the gene expression levels are entangled in a large body of experimental data. The method was applied to two gene microarray data sets, on colon cancer and leukemia. By identifying relevant subsets of the data and focusing on them we were able to discover partitions and correlations that were masked and hidden when the full dataset was used in the analysis. Some of these partitions have clear biological interpretation; others can serve to identify possible directions for future research.

  7. Coupled two-way clustering analysis of gene microarray data

    Science.gov (United States)

    Getz, Gad; Levine, Erel; Domany, Eytan

    2000-10-01

    We present a coupled two-way clustering approach to gene microarray data analysis. The main idea is to identify subsets of the genes and samples, such that when one of these is used to cluster the other, stable and significant partitions emerge. The search for such subsets is a computationally complex task. We present an algorithm, based on iterative clustering, that performs such a search. This analysis is especially suitable for gene microarray data, where the contributions of a variety of biological mechanisms to the gene expression levels are entangled in a large body of experimental data. The method was applied to two gene microarray data sets, on colon cancer and leukemia. By identifying relevant subsets of the data and focusing on them we were able to discover partitions and correlations that were masked and hidden when the full dataset was used in the analysis. Some of these partitions have clear biological interpretation; others can serve to identify possible directions for future research.

  8. Differentially expressed genes in pancreatic ductal adenocarcinomas identified through serial analysis of gene expression

    DEFF Research Database (Denmark)

    Hustinx, Steven R; Cao, Dengfeng; Maitra, Anirban;

    2004-01-01

    Serial analysis of gene expression (SAGE) is a powerful tool for the discovery of novel tumor markers. The publicly available online SAGE libraries of normal and neoplastic tissues (http://www.ncbi.nlm.nih.gov/SAGE/) have recently been expanded; in addition, a more complete annotation of the human...

  9. DGCA: A comprehensive R package for Differential Gene Correlation Analysis.

    Science.gov (United States)

    McKenzie, Andrew T; Katsyv, Igor; Song, Won-Min; Wang, Minghui; Zhang, Bin

    2016-11-15

    Dissecting the regulatory relationships between genes is a critical step towards building accurate predictive models of biological systems. A powerful approach towards this end is to systematically study the differences in correlation between gene pairs in more than one distinct condition. In this study we develop an R package, DGCA (for Differential Gene Correlation Analysis), which offers a suite of tools for computing and analyzing differential correlations between gene pairs across multiple conditions. To minimize parametric assumptions, DGCA computes empirical p-values via permutation testing. To understand differential correlations at a systems level, DGCA performs higher-order analyses such as measuring the average difference in correlation and multiscale clustering analysis of differential correlation networks. Through a simulation study, we show that the straightforward z-score based method that DGCA employs significantly outperforms the existing alternative methods for calculating differential correlation. Application of DGCA to the TCGA RNA-seq data in breast cancer not only identifies key changes in the regulatory relationships between TP53 and PTEN and their target genes in the presence of inactivating mutations, but also reveals an immune-related differential correlation module that is specific to triple negative breast cancer (TNBC). DGCA is an R package for systematically assessing the difference in gene-gene regulatory relationships under different conditions. This user-friendly, effective, and comprehensive software tool will greatly facilitate the application of differential correlation analysis in many biological studies and thus will help identification of novel signaling pathways, biomarkers, and targets in complex biological systems and diseases.

  10. Array-based comparative genomic hybridization for genome-wide screening of DNA copy number in bladder tumors.

    NARCIS (Netherlands)

    Veltman, J.A.; Fridlyand, J.; Pejavar, S.; Olshen, A.B.; Korkola, J.E.; Vries, S. de; Carroll, P.; Kuo, W.L.; Pinkel, D.; Albertson, D.; Cordon-Cardo, C.; Jain, A.N.; Waldman, F.M.

    2003-01-01

    Genome-wide copy number profiles were characterized in 41 primary bladder tumors using array-based comparative genomic hybridization (array CGH). In addition to previously identified alterations in large chromosomal regions, alterations were identified in many small genomic regions, some with high-l

  11. Integrative analysis of RUNX1 downstream pathways and target genes

    Science.gov (United States)

    Michaud, Joëlle; Simpson, Ken M; Escher, Robert; Buchet-Poyau, Karine; Beissbarth, Tim; Carmichael, Catherine; Ritchie, Matthew E; Schütz, Frédéric; Cannon, Ping; Liu, Marjorie; Shen, Xiaofeng; Ito, Yoshiaki; Raskind, Wendy H; Horwitz, Marshall S; Osato, Motomi; Turner, David R; Speed, Terence P; Kavallaris, Maria; Smyth, Gordon K; Scott, Hamish S

    2008-01-01

    Background The RUNX1 transcription factor gene is frequently mutated in sporadic myeloid and lymphoid leukemia through translocation, point mutation or amplification. It is also responsible for a familial platelet disorder with predisposition to acute myeloid leukemia (FPD-AML). The disruption of the largely unknown biological pathways controlled by RUNX1 is likely to be responsible for the development of leukemia. We have used multiple microarray platforms and bioinformatic techniques to help identify these biological pathways to aid in the understanding of why RUNX1 mutations lead to leukemia. Results Here we report genes regulated either directly or indirectly by RUNX1 based on the study of gene expression profiles generated from 3 different human and mouse platforms. The platforms used were global gene expression profiling of: 1) cell lines with RUNX1 mutations from FPD-AML patients, 2) over-expression of RUNX1 and CBFβ, and 3) Runx1 knockout mouse embryos using either cDNA or Affymetrix microarrays. We observe that our datasets (lists of differentially expressed genes) significantly correlate with published microarray data from sporadic AML patients with mutations in either RUNX1 or its cofactor, CBFβ. A number of biological processes were identified among the differentially expressed genes and functional assays suggest that heterozygous RUNX1 point mutations in patients with FPD-AML impair cell proliferation, microtubule dynamics and possibly genetic stability. In addition, analysis of the regulatory regions of the differentially expressed genes has for the first time systematically identified numerous potential novel RUNX1 target genes. Conclusion This work is the first large-scale study attempting to identify the genetic networks regulated by RUNX1, a master regulator in the development of the hematopoietic system and leukemia. The biological pathways and target genes controlled by RUNX1 will have considerable importance in disease progression in both

  12. Biclustering methods: biological relevance and application in gene expression analysis.

    Directory of Open Access Journals (Sweden)

    Ali Oghabian

    Full Text Available DNA microarray technologies are used extensively to profile the expression levels of thousands of genes under various conditions, yielding extremely large data-matrices. Thus, analyzing this information and extracting biologically relevant knowledge becomes a considerable challenge. A classical approach for tackling this challenge is to use clustering (also known as one-way clustering methods where genes (or respectively samples are grouped together based on the similarity of their expression profiles across the set of all samples (or respectively genes. An alternative approach is to develop biclustering methods to identify local patterns in the data. These methods extract subgroups of genes that are co-expressed across only a subset of samples and may feature important biological or medical implications. In this study we evaluate 13 biclustering and 2 clustering (k-means and hierarchical methods. We use several approaches to compare their performance on two real gene expression data sets. For this purpose we apply four evaluation measures in our analysis: (1 we examine how well the considered (biclustering methods differentiate various sample types; (2 we evaluate how well the groups of genes discovered by the (biclustering methods are annotated with similar Gene Ontology categories; (3 we evaluate the capability of the methods to differentiate genes that are known to be specific to the particular sample types we study and (4 we compare the running time of the algorithms. In the end, we conclude that as long as the samples are well defined and annotated, the contamination of the samples is limited, and the samples are well replicated, biclustering methods such as Plaid and SAMBA are useful for discovering relevant subsets of genes and samples.

  13. Exploiting gene families for phylogenomic analysis of myzostomid transcriptome data.

    Directory of Open Access Journals (Sweden)

    Stefanie Hartmann

    Full Text Available BACKGROUND: In trying to understand the evolutionary relationships of organisms, the current flood of sequence data offers great opportunities, but also reveals new challenges with regard to data quality, the selection of data for subsequent analysis, and the automation of steps that were once done manually for single-gene analyses. Even though genome or transcriptome data is available for representatives of most bilaterian phyla, some enigmatic taxa still have an uncertain position in the animal tree of life. This is especially true for myzostomids, a group of symbiotic (or parasitic protostomes that are either placed with annelids or flatworms. METHODOLOGY: Based on similarity criteria, Illumina-based transcriptome sequences of one myzostomid were compared to protein sequences of one additional myzostomid and 29 reference metazoa and clustered into gene families. These families were then used to investigate the phylogenetic position of Myzostomida using different approaches: Alignments of 989 sequence families were concatenated, and the resulting superalignment was analyzed under a Maximum Likelihood criterion. We also used all 1,878 gene trees with at least one myzostomid sequence for a supertree approach: the individual gene trees were computed and then reconciled into a species tree using gene tree parsimony. CONCLUSIONS: Superalignments require strictly orthologous genes, and both the gene selection and the widely varying amount of data available for different taxa in our dataset may cause anomalous placements and low bootstrap support. In contrast, gene tree parsimony is designed to accommodate multilocus gene families and therefore allows a much more comprehensive data set to be analyzed. Results of this supertree approach showed a well-resolved phylogeny, in which myzostomids were part of the annelid radiation, and major bilaterian taxa were found to be monophyletic.

  14. Integrative analysis of RUNX1 downstream pathways and target genes

    Directory of Open Access Journals (Sweden)

    Liu Marjorie

    2008-07-01

    Full Text Available Abstract Background The RUNX1 transcription factor gene is frequently mutated in sporadic myeloid and lymphoid leukemia through translocation, point mutation or amplification. It is also responsible for a familial platelet disorder with predisposition to acute myeloid leukemia (FPD-AML. The disruption of the largely unknown biological pathways controlled by RUNX1 is likely to be responsible for the development of leukemia. We have used multiple microarray platforms and bioinformatic techniques to help identify these biological pathways to aid in the understanding of why RUNX1 mutations lead to leukemia. Results Here we report genes regulated either directly or indirectly by RUNX1 based on the study of gene expression profiles generated from 3 different human and mouse platforms. The platforms used were global gene expression profiling of: 1 cell lines with RUNX1 mutations from FPD-AML patients, 2 over-expression of RUNX1 and CBFβ, and 3 Runx1 knockout mouse embryos using either cDNA or Affymetrix microarrays. We observe that our datasets (lists of differentially expressed genes significantly correlate with published microarray data from sporadic AML patients with mutations in either RUNX1 or its cofactor, CBFβ. A number of biological processes were identified among the differentially expressed genes and functional assays suggest that heterozygous RUNX1 point mutations in patients with FPD-AML impair cell proliferation, microtubule dynamics and possibly genetic stability. In addition, analysis of the regulatory regions of the differentially expressed genes has for the first time systematically identified numerous potential novel RUNX1 target genes. Conclusion This work is the first large-scale study attempting to identify the genetic networks regulated by RUNX1, a master regulator in the development of the hematopoietic system and leukemia. The biological pathways and target genes controlled by RUNX1 will have considerable importance in disease

  15. HLA class II genes: typing by DNA analysis.

    Science.gov (United States)

    Bidwell, J L; Bidwell, E A; Bradley, B A

    1990-04-01

    A detailed understanding of the structure and function of the human major histocompatibility complex (MHC) has ensued from studies by molecular biologist during the last decade. Virtually all of the HLA genes have now been cloned, and the nucleotide sequences of their different allelic forms have been determined. Typing for these HLA alleles is a fundamental prerequisite for tissue matching in allogeneic organ transplantation. Until very recently, typing procedures have been dominated by serological and cellular methods. The availability of cloned DNA from HLA genes has now permitted the technique of restriction fragment length polymorphism (RFLP) analysis to be applied, with remarkable success and advantage, to phenotyping of both HLA Class I and Class II determinants. For the HLA Class II genes DR and DQ, a simple two-stage RFLP analysis permits the accurate identification of all specificities defined by serology, and of many which are defined by cellular typing. At the present time, however, RFLP typing of HLA Class I genes is not as practicable or as informative as that for HLA Class II genes. The present clinical applications of HLA-DR and DQ RFLP typing are predominantly in phenotyping of living donors, including selection of HLA-matched volunteer bone marrow donors, in allograft survival studies, and in studies of HLA Class II-associated diseases. However, the time taken to perform RFLP analysis precludes its use for the typing of cadaveric kidney donors. Nucleotide sequence data for the alleles of HLA Class II genes have now permitted the development of allele-specific oligonucleotide (ASO) typing, a second category of DNA analysis. This has been greatly facilitated by the ability to amplify specific HLA Class II DNA 'target' sequences using the polymerase chain reaction (PCR) technique. The accuracy of DNA typing techniques should ensure that this methodology will eventually replace conventional HLA phenotyping.

  16. A readout integrated circuit based on DBI-CTIA and cyclic ADC for MEMS-array-based focal plane

    Science.gov (United States)

    Miao, Liu; Dong, Wu; Zheyao, Wang

    2016-11-01

    A readout integrated circuit (ROIC) for a MEMS (microelectromechanical system)-array-based focal plane (MAFP) intended for imaging applications is presented. The ROIC incorporates current sources for diode detectors, scanners, timing sequence controllers, differential buffered injection-capacitive trans-impedance amplifier (DBI-CTIA) and 10-bit cyclic ADCs, and is integrated with MAFP using 3-D integration technology. A small-signal equivalent model is built to include thermal detectors into circuit simulations. The biasing current is optimized in terms of signal-to-noise ratio and power consumption. Layout design is tailored to fulfill the requirements of 3-D integration and to adapt to the size of MAFP elements, with not all but only the 2 bottom metal layers to complete nearly all the interconnections in DBI-CTIA and ADC in a 40 μm wide column. Experimental chips are designed and fabricated in a 0.35 μm CMOS mixed signal process, and verified in a code density test of which the results indicate a (0.29/-0.31) LSB differential nonlinearity (DNL) and a (0.61/-0.45) LSB integral nonlinearity (INL). Spectrum analysis shows that the effective number of bits (ENOB) is 9.09. The ROIC consumes 248 mW of power at most if not to cut off quiescent current paths when not needed. Project supported by by National Natural Science Foundation of China (No. 61271130), the Beijing Municipal Science and Tech Project (No. D13110100290000), the Tsinghua University Initiative Scientific Research Program (No. 20131089225), and the Shenzhen Science and Technology Development Fund (No. CXZZ20130322170740736).

  17. Semiparametric bayesian analysis of gene-environment interactions

    OpenAIRE

    Lobach, I.

    2010-01-01

    A key component to prevention and control of complex diseases, such as cancer, diabetes, hypertension, is to analyze the genetic and environmental factors that lead to the development of these complex diseases. We propose a Bayesian approach for analysis of gene-environment interactions that efficiently models information available in the observed data and a priori biomedical knowledge.

  18. Analysis of gene expression during development: lessons from the Apicomplexa.

    Science.gov (United States)

    Boyle, Jon P; Saeij, Jeroen P J; Cleary, Michael D; Boothroyd, John C

    2006-05-01

    Apicomplexans are responsible for significant human and animal disease worldwide, including malaria and toxoplasmosis. Herein we summarize recent advances in gene expression analysis in these eukaryotic pathogens, especially with respect to their developmental biology, and discuss the impact this work may have on the development of new vaccines and chemotherapeutics.

  19. Detection and sequence analysis of accessory gene regulator genes of Staphylococcus pseudintermedius isolates

    Directory of Open Access Journals (Sweden)

    M. Ananda Chitra

    2015-07-01

    Full Text Available Background: Staphylococcus pseudintermedius (SP is the major pathogenic species of dogs involved in a wide variety of skin and soft tissue infections. The accessory gene regulator (agr locus of Staphylococcus aureus has been extensively studied, and it influences the expression of many virulence genes. It encodes a two-component signal transduction system that leads to down-regulation of surface proteins and up-regulation of secreted proteins during in vitro growth of S. aureus. The objective of this study was to detect and sequence analyzing the AgrA, B, and D of SP isolated from canine skin infections. Materials and Methods: In this study, we have isolated and identified SP from canine pyoderma and otitis cases by polymerase chain reaction (PCR and confirmed by PCR-restriction fragment length polymorphism. Primers for SP agrA and agrBD genes were designed using online primer designing software and BLAST searched for its specificity. Amplification of the agr genes was carried out for 53 isolates of SP by PCR and sequencing of agrA, B, and D were carried out for five isolates and analyzed using DNAstar and Mega5.2 software. Results: A total of 53 (59% SP isolates were obtained from 90 samples. 15 isolates (28% were confirmed to be methicillinresistant SP (MRSP with the detection of the mecA gene. Accessory gene regulator A, B, and D genes were detected in all the SP isolates. Complete nucleotide sequences of the above three genes for five isolates were submitted to GenBank, and their accession numbers are from KJ133557 to KJ133571. AgrA amino acid sequence analysis showed that it is mainly made of alpha-helices and is hydrophilic in nature. AgrB is a transmembrane protein, and AgrD encodes the precursor of the autoinducing peptide (AIP. Sequencing of the agrD gene revealed that the 5 canine SP strains tested could be divided into three Agr specificity groups (RIPTSTGFF, KIPTSTGFF, and RIPISTGFF based on the putative AIP produced by each strain

  20. Analysis of gene evolution and metabolic pathways using the Candida Gene Order Browser

    Science.gov (United States)

    2010-01-01

    Background Candida species are the most common cause of opportunistic fungal infection worldwide. Recent sequencing efforts have provided a wealth of Candida genomic data. We have developed the Candida Gene Order Browser (CGOB), an online tool that aids comparative syntenic analyses of Candida species. CGOB incorporates all available Candida clade genome sequences including two Candida albicans isolates (SC5314 and WO-1) and 8 closely related species (Candida dubliniensis, Candida tropicalis, Candida parapsilosis, Lodderomyces elongisporus, Debaryomyces hansenii, Pichia stipitis, Candida guilliermondii and Candida lusitaniae). Saccharomyces cerevisiae is also included as a reference genome. Results CGOB assignments of homology were manually curated based on sequence similarity and synteny. In total CGOB includes 65617 genes arranged into 13625 homology columns. We have also generated improved Candida gene sets by merging/removing partial genes in each genome. Interrogation of CGOB revealed that the majority of tandemly duplicated genes are under strong purifying selection in all Candida species. We identified clusters of adjacent genes involved in the same metabolic pathways (such as catabolism of biotin, galactose and N-acetyl glucosamine) and we showed that some clusters are species or lineage-specific. We also identified one example of intron gain in C. albicans. Conclusions Our analysis provides an important resource that is now available for the Candida community. CGOB is available at http://cgob.ucd.ie. PMID:20459735

  1. Analysis of gene evolution and metabolic pathways using the Candida Gene Order Browser

    LENUS (Irish Health Repository)

    Fitzpatrick, David A

    2010-05-10

    Abstract Background Candida species are the most common cause of opportunistic fungal infection worldwide. Recent sequencing efforts have provided a wealth of Candida genomic data. We have developed the Candida Gene Order Browser (CGOB), an online tool that aids comparative syntenic analyses of Candida species. CGOB incorporates all available Candida clade genome sequences including two Candida albicans isolates (SC5314 and WO-1) and 8 closely related species (Candida dubliniensis, Candida tropicalis, Candida parapsilosis, Lodderomyces elongisporus, Debaryomyces hansenii, Pichia stipitis, Candida guilliermondii and Candida lusitaniae). Saccharomyces cerevisiae is also included as a reference genome. Results CGOB assignments of homology were manually curated based on sequence similarity and synteny. In total CGOB includes 65617 genes arranged into 13625 homology columns. We have also generated improved Candida gene sets by merging\\/removing partial genes in each genome. Interrogation of CGOB revealed that the majority of tandemly duplicated genes are under strong purifying selection in all Candida species. We identified clusters of adjacent genes involved in the same metabolic pathways (such as catabolism of biotin, galactose and N-acetyl glucosamine) and we showed that some clusters are species or lineage-specific. We also identified one example of intron gain in C. albicans. Conclusions Our analysis provides an important resource that is now available for the Candida community. CGOB is available at http:\\/\\/cgob.ucd.ie.

  2. Analysis of gene evolution and metabolic pathways using the Candida Gene Order Browser

    Directory of Open Access Journals (Sweden)

    Byrne Kevin P

    2010-05-01

    Full Text Available Abstract Background Candida species are the most common cause of opportunistic fungal infection worldwide. Recent sequencing efforts have provided a wealth of Candida genomic data. We have developed the Candida Gene Order Browser (CGOB, an online tool that aids comparative syntenic analyses of Candida species. CGOB incorporates all available Candida clade genome sequences including two Candida albicans isolates (SC5314 and WO-1 and 8 closely related species (Candida dubliniensis, Candida tropicalis, Candida parapsilosis, Lodderomyces elongisporus, Debaryomyces hansenii, Pichia stipitis, Candida guilliermondii and Candida lusitaniae. Saccharomyces cerevisiae is also included as a reference genome. Results CGOB assignments of homology were manually curated based on sequence similarity and synteny. In total CGOB includes 65617 genes arranged into 13625 homology columns. We have also generated improved Candida gene sets by merging/removing partial genes in each genome. Interrogation of CGOB revealed that the majority of tandemly duplicated genes are under strong purifying selection in all Candida species. We identified clusters of adjacent genes involved in the same metabolic pathways (such as catabolism of biotin, galactose and N-acetyl glucosamine and we showed that some clusters are species or lineage-specific. We also identified one example of intron gain in C. albicans. Conclusions Our analysis provides an important resource that is now available for the Candida community. CGOB is available at http://cgob.ucd.ie.

  3. Evaluation of reference genes for gene expression analysis using quantitative RT-PCR in Azospirillum brasilense.

    Directory of Open Access Journals (Sweden)

    Mary McMillan

    Full Text Available Azospirillum brasilense is a nitrogen fixing bacterium that has been shown to have various beneficial effects on plant growth and yield. Under normal conditions A. brasilense exists in a motile flagellated form, which, under starvation or stress conditions, can undergo differentiation into an encapsulated, cyst-like form. Quantitative RT-PCR can be used to analyse changes in gene expression during this differentiation process. The accuracy of quantification of mRNA levels by qRT-PCR relies on the normalisation of data against stably expressed reference genes. No suitable set of reference genes has yet been described for A. brasilense. Here we evaluated the expression of ten candidate reference genes (16S rRNA, gapB, glyA, gyrA, proC, pykA, recA, recF, rpoD, and tpiA in wild-type and mutant A. brasilense strains under different culture conditions, including conditions that induce differentiation. Analysis with the software programs BestKeeper, NormFinder and GeNorm indicated that gyrA, glyA and recA are the most stably expressed reference genes in A. brasilense. The results also suggested that the use of two reference genes (gyrA and glyA is sufficient for effective normalisation of qRT-PCR data.

  4. Evaluation of reference genes for gene expression analysis using quantitative RT-PCR in Azospirillum brasilense.

    Science.gov (United States)

    McMillan, Mary; Pereg, Lily

    2014-01-01

    Azospirillum brasilense is a nitrogen fixing bacterium that has been shown to have various beneficial effects on plant growth and yield. Under normal conditions A. brasilense exists in a motile flagellated form, which, under starvation or stress conditions, can undergo differentiation into an encapsulated, cyst-like form. Quantitative RT-PCR can be used to analyse changes in gene expression during this differentiation process. The accuracy of quantification of mRNA levels by qRT-PCR relies on the normalisation of data against stably expressed reference genes. No suitable set of reference genes has yet been described for A. brasilense. Here we evaluated the expression of ten candidate reference genes (16S rRNA, gapB, glyA, gyrA, proC, pykA, recA, recF, rpoD, and tpiA) in wild-type and mutant A. brasilense strains under different culture conditions, including conditions that induce differentiation. Analysis with the software programs BestKeeper, NormFinder and GeNorm indicated that gyrA, glyA and recA are the most stably expressed reference genes in A. brasilense. The results also suggested that the use of two reference genes (gyrA and glyA) is sufficient for effective normalisation of qRT-PCR data.

  5. An LED-array-based range imaging system used for enhancing three-dimensional imaging

    Science.gov (United States)

    Wang, Huanqin; Xu, Jun; He, Deyong; Zhao, Tianpeng; Wang, Anting; Ming, Hai; Kong, Deyi

    2010-11-01

    An LED-array-based range imaging system is proposed for three-dimensional (3-D) shape measurement. The range image is obtained by time-division electronic scanning of the LED Time-of-Flight (TOF) range finders in array, and no complex mechanical scanning is needed. By combining with a low cost CCD/CMOS sensor for capturing the twodimensional (2-D) image, the proposed range imaging system can be used to accomplish a high quality 3-D imaging. A sophisticated co-lens optical path is designed to assure the natural registration between the range image and 2-D image. Experimental tests for evaluation of the imaging system performance are described. It was found that the 3-D images can be acquired at a rate of 10 frames per second with a depth resolution better than 5mm in the range of 50 - 1000mm, which is sufficient for many practical applications, including the obstacle detection in robotics, machine automation, 3-D vision, virtual reality games and 3-D video.

  6. Optical properties of plasmonic nanoantenna arrays based on H-shaped nanoparticles with extended arms

    Science.gov (United States)

    Turkmen, Mustafa; Aslan, Erdem

    2014-03-01

    In this study, we present the optical properties of a plasmonic nanoantenna array based on H-shaped gold nanoparticles with extended arms, which can be used for infrared detection applications. Plasmonic nanoantennas operating at the infrared and visible region provide a unique way to capture, control and manipulate light at the nanoscale through the excitation of collective electron oscillations known as surface plasmons. The unit cell of proposed antenna consists of one H-shaped nanostructure and two extended arms located on the lateral sides of this nanostructure. We will demonstrate the proposed antenna has a dual band spectral response and the locations of the resonance frequencies can be adjusted by changing the geometrical dimensions of both the H-shaped nanoparticles and the extended arms. Theoretical calculations of the reflectance spectra of the nanoantenna array are performed by using simulation software, which utilizes Finite Difference Time Domain (FDTD) method. In order to show the sensing capacity of the structure, the effect of the dielectric medium on the resonance frequency is also determined. The results show that the proposed antenna can be utilized for infrared sensing applications.

  7. Flexible Neural Electrode Array Based-on Porous Graphene for Cortical Microstimulation and Sensing

    Science.gov (United States)

    Lu, Yichen; Lyu, Hongming; Richardson, Andrew G.; Lucas, Timothy H.; Kuzum, Duygu

    2016-09-01

    Neural sensing and stimulation have been the backbone of neuroscience research, brain-machine interfaces and clinical neuromodulation therapies for decades. To-date, most of the neural stimulation systems have relied on sharp metal microelectrodes with poor electrochemical properties that induce extensive damage to the tissue and significantly degrade the long-term stability of implantable systems. Here, we demonstrate a flexible cortical microelectrode array based on porous graphene, which is capable of efficient electrophysiological sensing and stimulation from the brain surface, without penetrating into the tissue. Porous graphene electrodes show superior impedance and charge injection characteristics making them ideal for high efficiency cortical sensing and stimulation. They exhibit no physical delamination or degradation even after 1 million biphasic stimulation cycles, confirming high endurance. In in vivo experiments with rodents, same array is used to sense brain activity patterns with high spatio-temporal resolution and to control leg muscles with high-precision electrical stimulation from the cortical surface. Flexible porous graphene array offers a minimally invasive but high efficiency neuromodulation scheme with potential applications in cortical mapping, brain-computer interfaces, treatment of neurological disorders, where high resolution and simultaneous recording and stimulation of neural activity are crucial.

  8. Wireless Remote Monitoring of Glucose Using a Functionalized ZnO Nanowire Arrays Based Sensor

    Directory of Open Access Journals (Sweden)

    Magnus Willander

    2011-08-01

    Full Text Available This paper presents a prototype wireless remote glucose monitoring system interfaced with a ZnO nanowire arrays-based glucose sensor, glucose oxidase enzyme immobilized onto ZnO nanowires in conjunction with a Nafion® membrane coating, which can be effectively applied for the monitoring of glucose levels in diabetics. Global System for Mobile Communications (GSM services like General Packet Radio Service (GPRS and Short Message Service (SMS have been proven to be logical and cost effective methods for gathering data from remote locations. A communication protocol that facilitates remote data collection using SMS has been utilized for monitoring a patient’s sugar levels. In this study, we demonstrate the remote monitoring of the glucose levels with existing GPRS/GSM network infra-structures using our proposed functionalized ZnO nanowire arrays sensors integrated with standard readily available mobile phones. The data can be used for centralized monitoring and other purposes. Such applications can reduce health care costs and allow caregivers to monitor and support to their patients remotely, especially those located in rural areas.

  9. Ultrahigh-dimensional variable selection method for whole-genome gene-gene interaction analysis

    Directory of Open Access Journals (Sweden)

    Ueki Masao

    2012-05-01

    Full Text Available Abstract Background Genome-wide gene-gene interaction analysis using single nucleotide polymorphisms (SNPs is an attractive way for identification of genetic components that confers susceptibility of human complex diseases. Individual hypothesis testing for SNP-SNP pairs as in common genome-wide association study (GWAS however involves difficulty in setting overall p-value due to complicated correlation structure, namely, the multiple testing problem that causes unacceptable false negative results. A large number of SNP-SNP pairs than sample size, so-called the large p small n problem, precludes simultaneous analysis using multiple regression. The method that overcomes above issues is thus needed. Results We adopt an up-to-date method for ultrahigh-dimensional variable selection termed the sure independence screening (SIS for appropriate handling of numerous number of SNP-SNP interactions by including them as predictor variables in logistic regression. We propose ranking strategy using promising dummy coding methods and following variable selection procedure in the SIS method suitably modified for gene-gene interaction analysis. We also implemented the procedures in a software program, EPISIS, using the cost-effective GPGPU (General-purpose computing on graphics processing units technology. EPISIS can complete exhaustive search for SNP-SNP interactions in standard GWAS dataset within several hours. The proposed method works successfully in simulation experiments and in application to real WTCCC (Wellcome Trust Case–control Consortium data. Conclusions Based on the machine-learning principle, the proposed method gives powerful and flexible genome-wide search for various patterns of gene-gene interaction.

  10. Comprehensive analysis of gene expression patterns of hedgehog-related genes

    Directory of Open Access Journals (Sweden)

    Baillie David

    2006-10-01

    Full Text Available Abstract Background The Caenorhabditis elegans genome encodes ten proteins that share sequence similarity with the Hedgehog signaling molecule through their C-terminal autoprocessing Hint/Hog domain. These proteins contain novel N-terminal domains, and C. elegans encodes dozens of additional proteins containing only these N-terminal domains. These gene families are called warthog, groundhog, ground-like and quahog, collectively called hedgehog (hh-related genes. Previously, the expression pattern of seventeen genes was examined, which showed that they are primarily expressed in the ectoderm. Results With the completion of the C. elegans genome sequence in November 2002, we reexamined and identified 61 hh-related ORFs. Further, we identified 49 hh-related ORFs in C. briggsae. ORF analysis revealed that 30% of the genes still had errors in their predictions and we improved these predictions here. We performed a comprehensive expression analysis using GFP fusions of the putative intergenic regulatory sequence with one or two transgenic lines for most genes. The hh-related genes are expressed in one or a few of the following tissues: hypodermis, seam cells, excretory duct and pore cells, vulval epithelial cells, rectal epithelial cells, pharyngeal muscle or marginal cells, arcade cells, support cells of sensory organs, and neuronal cells. Using time-lapse recordings, we discovered that some hh-related genes are expressed in a cyclical fashion in phase with molting during larval development. We also generated several translational GFP fusions, but they did not show any subcellular localization. In addition, we also studied the expression patterns of two genes with similarity to Drosophila frizzled, T23D8.1 and F27E11.3A, and the ortholog of the Drosophila gene dally-like, gpn-1, which is a heparan sulfate proteoglycan. The two frizzled homologs are expressed in a few neurons in the head, and gpn-1 is expressed in the pharynx. Finally, we compare the

  11. Network Analysis of Human Genes Influencing Susceptibility to Mycobacterial Infections.

    Directory of Open Access Journals (Sweden)

    Ettie M Lipner

    Full Text Available Tuberculosis and nontuberculous mycobacterial infections constitute a high burden of pulmonary disease in humans, resulting in over 1.5 million deaths per year. Building on the premise that genetic factors influence the instance, progression, and defense of infectious disease, we undertook a systems biology approach to investigate relationships among genetic factors that may play a role in increased susceptibility or control of mycobacterial infections. We combined literature and database mining with network analysis and pathway enrichment analysis to examine genes, pathways, and networks, involved in the human response to Mycobacterium tuberculosis and nontuberculous mycobacterial infections. This approach allowed us to examine functional relationships among reported genes, and to identify novel genes and enriched pathways that may play a role in mycobacterial susceptibility or control. Our findings suggest that the primary pathways and genes influencing mycobacterial infection control involve an interplay between innate and adaptive immune proteins and pathways. Signaling pathways involved in autoimmune disease were significantly enriched as revealed in our networks. Mycobacterial disease susceptibility networks were also examined within the context of gene-chemical relationships, in order to identify putative drugs and nutrients with potential beneficial immunomodulatory or anti-mycobacterial effects.

  12. Evolutionary Analysis of Minor Histocompatibility Genes In Hydra

    KAUST Repository

    Aalismail, Nojood

    2016-05-01

    Hydra is a simple freshwater solitary polyp used as a model system to study evolutionary aspects. The immune response of this organism has not been studied extensively and the immune response genes have not been identified and characterized. On the other hand, immune response has been investigated and genetic analysis has been initiated in other lower invertebrates. In the present study we took initiative to study the self/nonself recognition in hydra and its relation to the immune response. Moreover, performing phylogenetic analysis to look for annotated immune genes in hydra gave us a potential to analyze the expression of minor histocompatibility genes that have been shown to play a major role in grafting and transplantation in mammals. Here we obtained the cDNA library that shows expression of minor histocompatibility genes and confirmed that the annotated sequences in databases are actually present. In addition, grafting experiments suggested, although still preliminary, that homograft showed less rejection response than in heterograft. Involvement of possible minor histocompatibility gene orthologous in immune response was examined by qPCR.

  13. Polymorphism analysis of csd gene in six Apis mellifera subspecies.

    Science.gov (United States)

    Wang, Zilong; Liu, Zhiyong; Wu, Xiaobo; Yan, Weiyu; Zeng, Zhijiang

    2012-03-01

    The complementary sex determination (csd) gene is the primary gene determining the gender of honey bees (Apis spp). In this study we analyzed the polymorphism of csd gene in six Apis mellifera subspecies. The genomic region 3 of csd gene in these six A. mellifera was cloned, and identified. A total of 79 haplotypes were obtained from these six subspecies. Analysis showed that region 3 of csd gene has a high level of polymorphism in all the six A. mellifera subspecies. The A. m. anatolica subspecies has a slightly higher nucleotide diversity (π) than other subspecies, while the π values showed no significant difference among the other five subspecies. The phylogenetic tree showed that all the csd haplotypes from different A. mellifera subspecies are scattered throughout the tree, without forming six different clades. Population differentiation analysis showed that there are significant genetic differentiations among some of the subspecies. The NJ phylogenetic tree showed that the A. m. caucasica and A. m. carnica have the closest relationship, followed by A. m. ssp, A. m. ligustica, A. m. carpatica and A. m. anatolica that were gathered in the tree in turn.

  14. Association analysis of chromosome 1 migraine candidate genes

    Directory of Open Access Journals (Sweden)

    MacMillan John

    2007-08-01

    Full Text Available Abstract Background Migraine with aura (MA is a subtype of typical migraine. Migraine with aura (MA also encompasses a rare severe subtype Familial Hemiplegic Migraine (FHM with several known genetic loci. The type 2 FHM (FHM-2 susceptibility locus maps to chromosome 1q23 and mutations in the ATP1A2 gene at this site have recently been implicated. We have previously provided evidence of linkage of typical migraine (predominantly MA to microsatellite markers on chromosome 1, in the 1q31 and 1q23 regions. In this study, we have undertaken a large genomic investigation involving candidate genes that lie within the chromosome 1q23 and 1q31 regions using an association analysis approach. Methods We have genotyped a large population of case-controls (243 unrelated Caucasian migraineurs versus 243 controls examining a set of 5 single nucleotide polymorphisms (SNPs and the Fas Ligand dinucleotide repeat marker, located within the chromosome 1q23 and 1q31 regions. Results Several genes have been studied including membrane protein (ATP 1 subtype A4 and FasL, cytoplasmic glycoprotein (CASQ 1 genes and potassium (KCN J9 and KCN J10 and calcium (CACNA1E channel genes in 243 migraineurs (including 85% MA and 15% of migraine without aura (MO and 243 matched controls. After correction for multiple testing, chi-square results showed non-significant P values (P > 0.008 across all SNPs (and a CA repeat tested in these different genes, however results with the KCN J10 marker gave interesting results (P = 0.02 that may be worth exploring further in other populations. Conclusion These results do not show a significant role for the tested candidate gene variants and also do not support the hypothesis that a common chromosome 1 defective gene influences both FHM and the more common forms of migraine.

  15. Genomic analysis of primordial dwarfism reveals novel disease genes.

    Science.gov (United States)

    Shaheen, Ranad; Faqeih, Eissa; Ansari, Shinu; Abdel-Salam, Ghada; Al-Hassnan, Zuhair N; Al-Shidi, Tarfa; Alomar, Rana; Sogaty, Sameera; Alkuraya, Fowzan S

    2014-02-01

    Primordial dwarfism (PD) is a disease in which severely impaired fetal growth persists throughout postnatal development and results in stunted adult size. The condition is highly heterogeneous clinically, but the use of certain phenotypic aspects such as head circumference and facial appearance has proven helpful in defining clinical subgroups. In this study, we present the results of clinical and genomic characterization of 16 new patients in whom a broad definition of PD was used (e.g., 3M syndrome was included). We report a novel PD syndrome with distinct facies in two unrelated patients, each with a different homozygous truncating mutation in CRIPT. Our analysis also reveals, in addition to mutations in known PD disease genes, the first instance of biallelic truncating BRCA2 mutation causing PD with normal bone marrow analysis. In addition, we have identified a novel locus for Seckel syndrome based on a consanguineous multiplex family and identified a homozygous truncating mutation in DNA2 as the likely cause. An additional novel PD disease candidate gene XRCC4 was identified by autozygome/exome analysis, and the knockout mouse phenotype is highly compatible with PD. Thus, we add a number of novel genes to the growing list of PD-linked genes, including one which we show to be linked to a novel PD syndrome with a distinct facial appearance. PD is extremely heterogeneous genetically and clinically, and genomic tools are often required to reach a molecular diagnosis.

  16. Cloning,sequencing and phylogenic analysis of duck prion gene

    Institute of Scientific and Technical Information of China (English)

    WANG Qigui; ZHANG Lei; HU Xiaoxiang; FAN Baoliang; LI Ning; LI Hui; WU Changxin

    2004-01-01

    Duck prion gene was cloned and sequenced. Similar to mammalian prion protein (PrP), duck prion is encoded by a single exon of a single copy in genome, which was confirmed by Southern blot analysis. All of the structural features of mammalian PrP were also identified in the duck PrP. Compared with mammalian PrP, it exhibited a 30 % of general similarity. When compared with chicken PrP, it showed a higher homology of 97%. A phylogenetic tree was constructed to trace evolution of prion gene in animals.

  17. GECKO: a complete large-scale gene expression analysis platform

    Directory of Open Access Journals (Sweden)

    Heuer Michael

    2004-12-01

    Full Text Available Abstract Background Gecko (Gene Expression: Computation and Knowledge Organization is a complete, high-capacity centralized gene expression analysis system, developed in response to the needs of a distributed user community. Results Based on a client-server architecture, with a centralized repository of typically many tens of thousands of Affymetrix scans, Gecko includes automatic processing pipelines for uploading data from remote sites, a data base, a computational engine implementing ~ 50 different analysis tools, and a client application. Among available analysis tools are clustering methods, principal component analysis, supervised classification including feature selection and cross-validation, multi-factorial ANOVA, statistical contrast calculations, and various post-processing tools for extracting data at given error rates or significance levels. On account of its open architecture, Gecko also allows for the integration of new algorithms. The Gecko framework is very general: non-Affymetrix and non-gene expression data can be analyzed as well. A unique feature of the Gecko architecture is the concept of the Analysis Tree (actually, a directed acyclic graph, in which all successive results in ongoing analyses are saved. This approach has proven invaluable in allowing a large (~ 100 users and distributed community to share results, and to repeatedly return over a span of years to older and potentially very complex analyses of gene expression data. Conclusions The Gecko system is being made publicly available as free software http://sourceforge.net/projects/geckoe. In totality or in parts, the Gecko framework should prove useful to users and system developers with a broad range of analysis needs.

  18. Membrane gene ontology bias in sequencing and microarray obtained by housekeeping-gene analysis.

    Science.gov (United States)

    Zhang, Yijuan; Akintola, Oluwafemi S; Liu, Ken J A; Sun, Bingyun

    2016-01-10

    Microarray (MA) and high-throughput sequencing are two commonly used detection systems for global gene expression profiling. Although these two systems are frequently used in parallel, the differences in their final results have not been examined thoroughly. Transcriptomic analysis of housekeeping (HK) genes provides a unique opportunity to reliably examine the technical difference between these two systems. We investigated here the structure, genome location, expression quantity, microarray probe coverage, as well as biological functions of differentially identified human HK genes by 9 MA and 6 sequencing studies. These in-depth analyses allowed us to discover, for the first time, a subset of transcripts encoding membrane, cell surface and nuclear proteins that were prone to differential identification by the two platforms. We hope that the discovery can aid the future development of these technologies for comprehensive transcriptomic studies. Copyright © 2015 Elsevier B.V. All rights reserved.

  19. Analysis of tumor suppressor genes based on gene ontology and the KEGG pathway.

    Science.gov (United States)

    Yang, Jing; Chen, Lei; Kong, Xiangyin; Huang, Tao; Cai, Yu-Dong

    2014-01-01

    Cancer is a serious disease that causes many deaths every year. We urgently need to design effective treatments to cure this disease. Tumor suppressor genes (TSGs) are a type of gene that can protect cells from becoming cancerous. In view of this, correct identification of TSGs is an alternative method for identifying effective cancer therapies. In this study, we performed gene ontology (GO) and pathway enrichment analysis of the TSGs and non-TSGs. Some popular feature selection methods, including minimum redundancy maximum relevance (mRMR) and incremental feature selection (IFS), were employed to analyze the enrichment features. Accordingly, some GO terms and KEGG pathways, such as biological adhesion, cell cycle control, genomic stability maintenance and cell death regulation, were extracted, which are important factors for identifying TSGs. We hope these findings can help in building effective prediction methods for identifying TSGs and thereby, promoting the discovery of effective cancer treatments.

  20. Whole Gene Capture Analysis of 15 CRC Susceptibility Genes in Suspected Lynch Syndrome Patients.

    Directory of Open Access Journals (Sweden)

    Anne M L Jansen

    Full Text Available Lynch Syndrome (LS is caused by pathogenic germline variants in one of the mismatch repair (MMR genes. However, up to 60% of MMR-deficient colorectal cancer cases are categorized as suspected Lynch Syndrome (sLS because no pathogenic MMR germline variant can be identified, which leads to difficulties in clinical management. We therefore analyzed the genomic regions of 15 CRC susceptibility genes in leukocyte DNA of 34 unrelated sLS patients and 11 patients with MLH1 hypermethylated tumors with a clear family history.Using targeted next-generation sequencing, we analyzed the entire non-repetitive genomic sequence, including intronic and regulatory sequences, of 15 CRC susceptibility genes. In addition, tumor DNA from 28 sLS patients was analyzed for somatic MMR variants.Of 1979 germline variants found in the leukocyte DNA of 34 sLS patients, one was a pathogenic variant (MLH1 c.1667+1delG. Leukocyte DNA of 11 patients with MLH1 hypermethylated tumors was negative for pathogenic germline variants in the tested CRC susceptibility genes and for germline MLH1 hypermethylation. Somatic DNA analysis of 28 sLS tumors identified eight (29% cases with two pathogenic somatic variants, one with a VUS predicted to pathogenic and LOH, and nine cases (32% with one pathogenic somatic variant (n = 8 or one VUS predicted to be pathogenic (n = 1.This is the first study in sLS patients to include the entire genomic sequence of CRC susceptibility genes. An underlying somatic or germline MMR gene defect was identified in ten of 34 sLS patients (29%. In the remaining sLS patients, the underlying genetic defect explaining the MMRdeficiency in their tumors might be found outside the genomic regions harboring the MMR and other known CRC susceptibility genes.

  1. FlyPhy: a phylogenomic analysis platform for Drosophila genes and gene families

    Directory of Open Access Journals (Sweden)

    Bao Qiyu

    2009-04-01

    Full Text Available Abstract Background The availability of 12 fully sequenced Drosophila species genomes provides an excellent opportunity to explore the evolutionary mechanism, structure and function of gene families in Drosophila. Currently, several important resources, such as FlyBase, FlyMine and DroSpeGe, have been devoted to integrating genetic, genomic, and functional data of Drosophila into a well-organized form. However, all of these resources are gene-centric and lack the information of the gene families in Drosophila. Description FlyPhy is a comprehensive phylogenomic analysis platform devoted to analyzing the genes and gene families in Drosophila. Genes were classified into families using a graph-based Markov Clustering algorithm and extensively annotated by a number of bioinformatic tools, such as basic sequence features, functional category, gene ontology terms, domain organization and sequence homolog to other databases. FlyPhy provides a simple and user-friendly web interface to allow users to browse and retrieve the information at multiple levels. An outstanding feature of the FlyPhy is that all the retrieved results can be added to a workset for further data manipulation. For the data stored in the workset, multiple sequence alignment, phylogenetic tree construction and visualization can be easily performed to investigate the sequence variation of each given family and to explore its evolutionary mechanism. Conclusion With the above functionalities, FlyPhy will be a useful resource and convenient platform for the Drosophila research community. The FlyPhy is available at http://bioinformatics.zj.cn/fly/.

  2. Candidate pathways and genes for prostate cancer: a meta-analysis of gene expression data

    Directory of Open Access Journals (Sweden)

    Efstathiou Eleni

    2009-08-01

    Full Text Available Abstract Backgound The genetic mechanisms of prostate tumorigenesis remain poorly understood, but with the advent of gene expression array capabilities, we can now produce a large amount of data that can be used to explore the molecular and genetic mechanisms of prostate tumorigenesis. Methods We conducted a meta-analysis of gene expression data from 18 gene array datasets targeting transition from normal to localized prostate cancer and from localized to metastatic prostate cancer. We functionally annotated the top 500 differentially expressed genes and identified several candidate pathways associated with prostate tumorigeneses. Results We found the top differentially expressed genes to be clustered in pathways involving integrin-based cell adhesion: integrin signaling, the actin cytoskeleton, cell death, and cell motility pathways. We also found integrins themselves to be downregulated in the transition from normal prostate tissue to primary localized prostate cancer. Based on the results of this study, we developed a collagen hypothesis of prostate tumorigenesis. According to this hypothesis, the initiating event in prostate tumorigenesis is the age-related decrease in the expression of collagen genes and other genes encoding integrin ligands. This concomitant depletion of integrin ligands leads to the accumulation of ligandless integrin and activation of integrin-associated cell death. To escape integrin-associated death, cells suppress the expression of integrins, which in turn alters the actin cytoskeleton, elevates cell motility and proliferation, and disorganizes prostate histology, contributing to the histologic progression of prostate cancer and its increased metastasizing potential. Conclusion The results of this study suggest that prostate tumor progression is associated with the suppression of integrin-based cell adhesion. Suppression of integrin expression driven by integrin-mediated cell death leads to increased cell

  3. [Hydrophidae identification through analysis on Cyt b gene barcode].

    Science.gov (United States)

    Liao, Li-xi; Zeng, Ke-wu; Tu, Peng-fei

    2015-08-01

    Hydrophidae, one of the precious traditional Chinese medicines, is generally drily preserved to prevent corruption, but it is hard to identify the species of Hydrophidae through the appearance because of the change due to the drying process. The identification through analysis on gene barcode, a new technique in species identification, can avoid the problem. The gene barcodes of the 6 species of Hydrophidae like Lapemis hardwickii were aquired through DNA extraction and gene sequencing. These barcodes were then in sequence alignment and test the identification efficency by BLAST. Our results revealed that the barcode sequences performed high identification efficiency, and had obvious difference between intra- and inter-species. These all indicated that Cyt b DNA barcoding can confirm the Hydrophidae identification.

  4. Array-based comparative genomic hybridization facilitates identification of breakpoints of a novel der(1)t(1;18)(p36.3;q23)dn in a child presenting with mental retardation.

    Science.gov (United States)

    Lennon, P A; Cooper, M L; Curtis, M A; Lim, C; Ou, Z; Patel, A; Cheung, S W; Bacino, C A

    2006-06-01

    Monosomy of distal 1p36 represents the most common terminal deletion in humans and results in one of the most frequently diagnosed mental retardation syndromes. This deletion is considered a contiguous gene deletion syndrome, and has been shown to vary in deletion sizes that contribute to the spectrum of phenotypic anomalies seen in patients with monosomy 1p36. We report on an 8-year-old female with characteristics of the monosomy 1p36 syndrome who demonstrated a novel der(1)t(1;18)(p36.3;q23). Initial G-banded karyotype analysis revealed a deleted chromosome 1, with a breakpoint within 1p36.3. Subsequent FISH and array-based comparative genomic hybridization not only confirmed and partially characterized the deletion of chromosome 1p36.3, but also uncovered distal trisomy for 18q23. In this patient, the duplicated 18q23 is translocated onto the deleted 1p36.3 region, suggesting telomere capture. Molecular characterization of this novel der(1)t(1;18)(p36.3;q23), guided by our clinical array-comparative genomic hybridization, demonstrated a 3.2 Mb terminal deletion of chromosome 1p36.3 and a 200 kb duplication of 18q23 onto the deleted 1p36.3, presumably stabilizing the deleted chromosome 1. DNA sequence analysis around the breakpoints demonstrated no homology, and therefore this telomere capture of distal 18q is apparently the result of a non-homologous recombination. Partial trisomy for 18q23 has not been previously reported. The importance of mapping the breakpoints of all balanced and unbalanced translocations found in the clinical laboratory, when phenotypic abnormalities are found, is discussed.

  5. Lists2Networks: Integrated analysis of gene/protein lists

    Directory of Open Access Journals (Sweden)

    Ma'ayan Avi

    2010-02-01

    Full Text Available Abstract Background Systems biologists are faced with the difficultly of analyzing results from large-scale studies that profile the activity of many genes, RNAs and proteins, applied in different experiments, under different conditions, and reported in different publications. To address this challenge it is desirable to compare the results from different related studies such as mRNA expression microarrays, genome-wide ChIP-X, RNAi screens, proteomics and phosphoproteomics experiments in a coherent global framework. In addition, linking high-content multilayered experimental results with prior biological knowledge can be useful for identifying functional themes and form novel hypotheses. Results We present Lists2Networks, a web-based system that allows users to upload lists of mammalian genes/proteins onto a server-based program for integrated analysis. The system includes web-based tools to manipulate lists with different set operations, to expand lists using existing mammalian networks of protein-protein interactions, co-expression correlation, or background knowledge co-annotation correlation, as well as to apply gene-list enrichment analyses against many gene-list libraries of prior biological knowledge such as pathways, gene ontology terms, kinase-substrate, microRNA-mRAN, and protein-protein interactions, metabolites, and protein domains. Such analyses can be applied to several lists at once against many prior knowledge libraries of gene-lists associated with specific annotations. The system also contains features that allow users to export networks and share lists with other users of the system. Conclusions Lists2Networks is a user friendly web-based software system expected to significantly ease the computational analysis process for experimental systems biologists employing high-throughput experiments at multiple layers of regulation. The system is freely available at http://www.lists2networks.org.

  6. CHO gene expression profiling in biopharmaceutical process analysis and design.

    Science.gov (United States)

    Schaub, Jochen; Clemens, Christoph; Schorn, Peter; Hildebrandt, Tobias; Rust, Werner; Mennerich, Detlev; Kaufmann, Hitto; Schulz, Torsten W

    2010-02-01

    Increase in both productivity and product yields in biopharmaceutical process development with recombinant protein producing mammalian cells can be mainly attributed to the advancements in cell line development, media, and process optimization. Only recently, genome-scale technologies enable a system-level analysis to elucidate the complex biomolecular basis of protein production in mammalian cells promising an increased process understanding and the deduction of knowledge-based approaches for further process optimization. Here, the use of gene expression profiling for the analysis of a low titer (LT) and high titer (HT) fed batch process using the same IgG producing CHO cell line was investigated. We found that gene expression (i) significantly differed in HT versus LT process conditions due to differences in applied chemically defined, serum-free media, (ii) changed over the time course of the fed batch processes, and that (iii) both metabolic pathways and 14 biological functions such as cellular growth or cell death were affected. Furthermore, detailed analysis of metabolism in a standard process format revealed the potential use of transcriptomics for rational media design as is shown for the case of lipid metabolism where the product titer could be increased by about 20% based on a lipid modified basal medium. The results demonstrate that gene expression profiling can be an important tool for mammalian biopharmaceutical process analysis and optimization.

  7. Basic performance evaluation of a Si-PM array-based LGSO phoswich DOI block detector for a high-resolution small animal PET system.

    Science.gov (United States)

    Yamamoto, Seiichi

    2013-07-01

    The silicon photomultiplier (Si-PM) is a promising photodetector for PET. However, it remains unclear whether Si-PM can be used for a depth-of-interaction (DOI) detector based on the decay time differences of the scintillator where pulse shape analysis is used. For clarification, we tested the Hamamatsu 4 × 4 Si-PM array (S11065-025P) combined with scintillators that used different decay times to develop DOI block detectors using the pulse shape analysis. First, Ce-doped Gd(2)SiO(5) (GSO) scintillators of 0.5 mol% Ce were arranged in a 4 × 4 matrix and were optically coupled to the center of each pixel of the Si-PM array for measurement of the energy resolution as well as its gain variations according to the temperature. Then two types of Ce-doped Lu(1.9)Gd(0.1)Si0(5) (LGSO) scintillators, 0.025 mol% Ce (decay time: ~31 ns) and 0.75 mol% Ce (decay time: ~46 ns), were optically coupled in the DOI direction, arranged in a 11 × 7 matrix, and optically coupled to a Si-PM array for testing of the possibility of a high-resolution DOI detector. The energy resolution of the Si-PM array-based GSO block detector was 18 ± 4.4 % FWHM for a Cs-137 gamma source (662 keV). Less than 1 mm crystals were clearly resolved in the position map of the LGSO DOI block detector. The peak-to-valley ratio (P/V) derived from the pulse shape spectra of the LGSO DOI block detector was 2.2. These results confirmed that Si-PM array-based DOI block detectors are promising for high-resolution small animal PET systems.

  8. Microarray gene expression analysis of uterosacral ligaments in uterine prolapse.

    Science.gov (United States)

    Ak, Handan; Zeybek, Burak; Atay, Sevcan; Askar, Niyazi; Akdemir, Ali; Aydin, Hikmet Hakan

    2016-11-01

    Pelvic organ prolapse (POP) is a major health problem that impairs the quality of life with a wide clinical spectrum. Since the uterosacral ligaments provide primary support for the uterus and the upper vagina, we hypothesize that the disruption of these ligaments may lead to a loss of support and eventually contribute to POP. In this study, we therefore investigated whether there are any differences in the transcription profile of uterosacral ligaments in patients with POP when compared to those of the control samples. Seventeen women with POP and 8 non-POP controls undergoing hysterectomy for benign conditions were included in the study. Affymetrix® Gene Chip microarrays (Human Hu 133 plus 2.0) were used for whole genome gene expression profiling analysis. There was 1 significantly down-regulated gene, NKX2-3 in patients with POP compared to the controls (p=4.28464e-013). KIF11 gene was found to be significantly down-regulated in patients with ≥3 deliveries compared to patients with <3 deliveries (p=0.0156237). UGT1A1 (p=2.43388e-005), SCARB1 (p=1.19001e-006) and NKX2-3 (p=2.17966e-013) genes were found to be significantly down-regulated in the premenopausal patients compared to the premenopausal controls. UGT1A1 gene was also found to be significantly down-regulated in the post menopausal patients compared to the postmenopausal controls (p=0.0005). This study provides evidence for a significant down-regulation of the genes that take role in cell cycle, proliferation and embryonic development along with cell adhesion process on the development of POP for the first time. Copyright © 2016 The Canadian Society of Clinical Chemists. Published by Elsevier Inc. All rights reserved.

  9. Molecular analysis of immunoglobulin genes in multiple myeloma.

    Science.gov (United States)

    Kosmas, C; Stamatopoulos, K; Stavroyianni, N; Belessi, C; Viniou, N; Yataganas, X

    1999-04-01

    The study of immunoglobulin genes in multiple myeloma over the last five years has provided important information regarding biology, ontogenetic location, disease evolution, pathogenic consequences and tumor-specific therapeutic intervention with idiotypic vaccination. Detailed analysis of V(H) genes has revealed clonal relationship between switch variants expressed by the bone marrow plasma cell and myeloma progenitors in the marrow and peripheral blood. V(H) gene usage is biased against V4-34 (encoding antibodies with cold agglutinin specificity; anti-l/i) explaining the absence of autoimmune phenomena in myeloma compared to other B-cell lymphoproliferative disorders. V(H) genes accumulate somatic hypermutations following a distribution compatible with antigen selection, but with no intraclonal heterogeneity. V(L) genes indicate a bias in usage of VkappaI family members and somatic hypermutation, in line with antigen selection, of the expressed Vkappa genes is higher than any other B-cell lymphoid disorder. A complementary imprint of antigen selection as evidenced by somatic hypermutation of either the V(H) or V(L) clonogenic genes has been observed. The absence of ongoing somatic mutations in either V(H) or V(L) genes gives rise to the notion that the cell of origin in myeloma is a post-germinal center memory B-cell. Clinical application of sensitive PCR methods in order to detect clonal immunoglobulin gene rearrangements has made relevant the monitoring and follow-up of minimal residual disease in stem cell autografts and after myeloablative therapy. The fact that surface immunoglobulin V(H) and V(L) sequences constitute unique tumor-specific antigenic determinants has stimulated investigators to devise strategies aiming to generate active specific immunity against the idiotype of malignant B-cells in myeloma by constructing vaccines based on expressed single-chain Fv fragments, DNA plasmids carrying V(H)+V(L) clonogenic genes for naked DNA vaccination, or

  10. Comparative analysis of ADS gene promoter in seven Artemisia species

    Indian Academy of Sciences (India)

    Mojtaba Ranjbar; Mohammad Reza Naghavi; Hoshang Alizadeh

    2014-12-01

    Artemisinin is the most effective antimalarial drug that is derived from Artemisia annua. Amorpha-4,11-diene synthase (ADS) controls the first committed step in artemisinin biosynthesis. The ADS gene expression is regulated by transcription factors which bind to the cis-acting elements on the ADS promoter and are probably responsible for the ADS gene expression difference in the Artemisia species. To identify the elements that are significantly involved in ADS gene expression, the ADS gene promoter of the seven Artemisia species was isolated and comparative analysis was performed on the ADS promoter sequences of these species. Results revealed that some of the cis-elements were unique or in terms of number were more in the high artemisinin producer species, A. annua, than the other species. We have reported that the light-responsive elements, W-box, CAAT-box, 5′-UTR py-rich stretch, TATA-box sequence and tandem repeat sequences have been identified as important factors in the increased expression of ADS gene.

  11. Identification of oral cancer related candidate genes by integrating protein-protein interactions, gene ontology, pathway analysis and immunohistochemistry.

    Science.gov (United States)

    Kumar, Ravindra; Samal, Sabindra K; Routray, Samapika; Dash, Rupesh; Dixit, Anshuman

    2017-05-30

    In the recent years, bioinformatics methods have been reported with a high degree of success for candidate gene identification. In this milieu, we have used an integrated bioinformatics approach assimilating information from gene ontologies (GO), protein-protein interaction (PPI) and network analysis to predict candidate genes related to oral squamous cell carcinoma (OSCC). A total of 40973 PPIs were considered for 4704 cancer-related genes to construct human cancer gene network (HCGN). The importance of each node was measured in HCGN by ten different centrality measures. We have shown that the top ranking genes are related to a significantly higher number of diseases as compared to other genes in HCGN. A total of 39 candidate oral cancer target genes were predicted by combining top ranked genes and the genes corresponding to significantly enriched oral cancer related GO terms. Initial verification using literature and available experimental data indicated that 29 genes were related with OSCC. A detailed pathway analysis led us to propose a role for the selected candidate genes in the invasion and metastasis in OSCC. We further validated our predictions using immunohistochemistry (IHC) and found that the gene FLNA was upregulated while the genes ARRB1 and HTT were downregulated in the OSCC tissue samples.

  12. Network analysis of genes and their association with diseases.

    Science.gov (United States)

    Kontou, Panagiota I; Pavlopoulou, Athanasia; Dimou, Niki L; Pavlopoulos, Georgios A; Bagos, Pantelis G

    2016-09-15

    A plethora of network-based approaches within the Systems Biology universe have been applied, to date, to investigate the underlying molecular mechanisms of various human diseases. In the present study, we perform a bipartite, topological and clustering graph analysis in order to gain a better understanding of the relationships between human genetic diseases and the relationships between the genes that are implicated in them. For this purpose, disease-disease and gene-gene networks were constructed from combined gene-disease association networks. The latter, were created by collecting and integrating data from three diverse resources, each one with different content covering from rare monogenic disorders to common complex diseases. This data pluralism enabled us to uncover important associations between diseases with unrelated phenotypic manifestations but with common genetic origin. For our analysis, the topological attributes and the functional implications of the individual networks were taken into account and are shortly discussed. We believe that some observations of this study could advance our understanding regarding the etiology of a disease with distinct pathological manifestations, and simultaneously provide the springboard for the development of preventive and therapeutic strategies and its underlying genetic mechanisms.

  13. Bioinformatics Analysis of MAPKKK Family Genes in Medicago truncatula

    Science.gov (United States)

    Li, Wei; Xu, Hanyun; Liu, Ying; Song, Lili; Guo, Changhong; Shu, Yongjun

    2016-01-01

    Mitogen-activated protein kinase kinase kinase (MAPKKK) is a component of the MAPK cascade pathway that plays an important role in plant growth, development, and response to abiotic stress, the functions of which have been well characterized in several plant species, such as Arabidopsis, rice, and maize. In this study, we performed genome-wide and systemic bioinformatics analysis of MAPKKK family genes in Medicago truncatula. In total, there were 73 MAPKKK family members identified by search of homologs, and they were classified into three subfamilies, MEKK, ZIK, and RAF. Based on the genomic duplication function, 72 MtMAPKKK genes were located throughout all chromosomes, but they cluster in different chromosomes. Using microarray data and high-throughput sequencing-data, we assessed their expression profiles in growth and development processes; these results provided evidence for exploring their important functions in developmental regulation, especially in the nodulation process. Furthermore, we investigated their expression in abiotic stresses by RNA-seq, which confirmed their critical roles in signal transduction and regulation processes under stress. In summary, our genome-wide, systemic characterization and expressional analysis of MtMAPKKK genes will provide insights that will be useful for characterizing the molecular functions of these genes in M. truncatula. PMID:27049397

  14. Bioinformatics Analysis of MAPKKK Family Genes in Medicago truncatula

    Directory of Open Access Journals (Sweden)

    Wei Li

    2016-04-01

    Full Text Available Mitogen‐activated protein kinase kinase kinase (MAPKKK is a component of the MAPK cascade pathway that plays an important role in plant growth, development, and response to abiotic stress, the functions of which have been well characterized in several plant species, such as Arabidopsis, rice, and maize. In this study, we performed genome‐wide and systemic bioinformatics analysis of MAPKKK family genes in Medicago truncatula. In total, there were 73 MAPKKK family members identified by search of homologs, and they were classified into three subfamilies, MEKK, ZIK, and RAF. Based on the genomic duplication function, 72 MtMAPKKK genes were located throughout all chromosomes, but they cluster in different chromosomes. Using microarray data and high‐throughput sequencing‐data, we assessed their expression profiles in growth and development processes; these results provided evidence for exploring their important functions in developmental regulation, especially in the nodulation process. Furthermore, we investigated their expression in abiotic stresses by RNA‐seq, which confirmed their critical roles in signal transduction and regulation processes under stress. In summary, our genome‐wide, systemic characterization and expressional analysis of MtMAPKKK genes will provide insights that will be useful for characterizing the molecular functions of these genes in M. truncatula.

  15. Evaluation of real-time PCR endogenous control genes for analysis of gene expression in bovine endometrium

    Directory of Open Access Journals (Sweden)

    Mitchell Murray D

    2009-11-01

    Full Text Available Abstract Background Quantitative real-time PCR gene expression results are generally normalised using endogenous control genes. These reference genes should be expressed at a constant level across all sample groups in a study, and should not be influenced by study treatments or conditions. There has been no systematic investigation of endogenous control genes for bovine endometrium to date. The suitability of both commonly used and novel endogenous control genes was evaluated in this study, with the latter being selected from stably expressed transcripts identified through microarray analysis of bovine endometrium. Fifteen candidate endogenous control genes were assessed across different tissue subtypes in pregnant and cycling Holstein-Friesian dairy cows from two divergent genetic backgrounds. Results The expression profiles of five commonly used endogenous control genes (GAPDH, PPIA, RPS9, RPS15A, and UXT and 10 experimentally derived candidate endogenous control genes (SUZ12, C2ORF29, ZNF131, ACTR1A, HDAC1, SLC30A6, CNOT7, DNAJC17, BBS2, and RANBP10 were analysed across 44 samples to determine the most stably expressed gene. Gene stability was assessed using the statistical algorithms GeNorm and Normfinder. All genes presented with low overall variability (0.87 to 1.48% CV of Cq. However, when used to normalise a differentially expressed gene (oxytocin receptor - OXTR in the samples, the reported relative gene expression levels were significantly affected by the control gene chosen. Based on the results of this analysis, SUZ12 is proposed as the most appropriate control gene for use in bovine endometrium during early pregnancy or the oestrus cycle. Conclusion This study establishes the suitability of novel endogenous control genes for comparing expression levels in endometrial tissues of pregnant and cycling bovines, and demonstrates the utility of microarray analysis as a method for identifying endogenous control gene candidates.

  16. Meta Analysis of Gene Expression Data within and Across Species.

    Science.gov (United States)

    Fierro, Ana C; Vandenbussche, Filip; Engelen, Kristof; Van de Peer, Yves; Marchal, Kathleen

    2008-12-01

    Since the second half of the 1990s, a large number of genome-wide analyses have been described that study gene expression at the transcript level. To this end, two major strategies have been adopted, a first one relying on hybridization techniques such as microarrays, and a second one based on sequencing techniques such as serial analysis of gene expression (SAGE), cDNA-AFLP, and analysis based on expressed sequence tags (ESTs). Despite both types of profiling experiments becoming routine techniques in many research groups, their application remains costly and laborious. As a result, the number of conditions profiled in individual studies is still relatively small and usually varies from only two to few hundreds of samples for the largest experiments. More and more, scientific journals require the deposit of these high throughput experiments in public databases upon publication. Mining the information present in these databases offers molecular biologists the possibility to view their own small-scale analysis in the light of what is already available. However, so far, the richness of the public information remains largely unexploited. Several obstacles such as the correct association between ESTs and microarray probes with the corresponding gene transcript, the incompleteness and inconsistency in the annotation of experimental conditions, and the lack of standardized experimental protocols to generate gene expression data, all impede the successful mining of these data. Here, we review the potential and difficulties of combining publicly available expression data from respectively EST analyses and microarray experiments. With examples from literature, we show how meta-analysis of expression profiling experiments can be used to study expression behavior in a single organism or between organisms, across a wide range of experimental conditions. We also provide an overview of the methods and tools that can aid molecular biologists in exploiting these public data.

  17. Analysis of diabetic retinopathy biomarker VEGF gene by computational approaches

    OpenAIRE

    Jayashree Sadasivam; Ramesh, N.; K. Vijayalakshmi; Vinni Viridi; Shiva prasad

    2012-01-01

    Diabetic retinopathy, the most common diabetic eye disease, is caused by changes in the blood vessels of the retina which remains the major cause. It is characterized by vascular permeability and increased tissue ischemia and angiogenesis. One of the biomarker for Diabetic retinopathy has been identified as Vascular Endothelial Growth Factor ( VEGF )gene by computational analysis. VEGF is a sub-family of growth factors, the platelet-derived growth factor family of cystine-knot growth factors...

  18. Multi-membership gene regulation in pathway based microarray analysis

    OpenAIRE

    2011-01-01

    This article is available through the Brunel Open Access Publishing Fund. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Background: Gene expression analysis has been intensively researched for more than a decade. Recently, there has been elevated interest in the inte...

  19. Trypanosome Surface Antigen Genes: Analysis Using Recombinant DNA.

    Science.gov (United States)

    1984-06-15

    different VATs. These cDNAs were used to screen the genomic libraries . The cloned probes were examined by nucleotide sequence analysis and used to...examination of the consequence of - antigenic variation on the structure of the 1.1, 1.D and 1.11 VSG -, gene sequences. In addition, all genomic libraries have...immunoprecipitated by heterologous VAt specific antisera (preprint 2, figure 3; preprint 3, figure 1). These data presented in preprints 2 and Genomic

  20. Canine Mammary Carcinomas: A Comparative Analysis of Altered Gene Expression

    Directory of Open Access Journals (Sweden)

    Farruk M. Lutful Kabir

    2015-12-01

    Full Text Available Breast cancer represents the second most frequent neoplasm in humans and sexually intact female dogs after lung and skin cancers, respectively. Many similar features in human and dog cancers including, spontaneous development, clinical presentation, tumor heterogeneity, disease progression and response to conventional therapies have supported development of this comparative model as an alternative to mice. The highly conserved similarities between canine and human genomes are also key to this comparative analysis, especially when compared to the murine genome. Studies with canine mammary tumor (CMT models have shown a strong genetic correlation with their human counterparts, particularly in terms of altered expression profiles of cell cycle regulatory genes, tumor suppressor and oncogenes and also a large group of non-coding RNAs or microRNAs (miRNAs. Because CMTs are considered predictive intermediate models for human breast cancer, similarities in genetic alterations and cancer predisposition between humans and dogs have raised further interest. Many cancer-associated genetic defects critical to mammary tumor development and oncogenic determinants of metastasis have been reported and appear to be similar in both species. Comparative analysis of deregulated gene sets or cancer signaling pathways has shown that a significant proportion of orthologous genes are comparably up- or down-regulated in both human and dog breast tumors. Particularly, a group of cell cycle regulators called cyclin-dependent kinase inhibitors (CKIs acting as potent tumor suppressors are frequently defective in CMTs. Interestingly, comparative analysis of coding sequences has also shown that these genes are highly conserved in mammals in terms of their evolutionary divergence from a common ancestor. Moreover, co-deletion and/or homozygous loss of the INK4A/ARF/INK4B (CDKN2A/B locus, encoding three members of the CKI tumor suppressor gene families (p16/INK4A, p14ARF and p15

  1. Understanding gene expression in coronary artery disease through global profiling, network analysis and independent validation of key candidate genes

    Indian Academy of Sciences (India)

    Prathima Arvind; Shanker Jayashree; Srikarthika Jambunathan; Jiny Nair; Vijay V. Kakkar

    2015-12-01

    Molecular mechanism underlying the patho-physiology of coronary artery disease (CAD) is complex. We used global expression profiling combined with analysis of biological network to dissect out potential genes and pathways associated with CAD in a representative case–control Asian Indian cohort. We initially performed blood transcriptomics profiling in 20 subjects, including 10 CAD patients and 10 healthy controls on the Agilent microarray platform. Data was analysed with Gene Spring Gx12.5, followed by network analysis using David v 6.7 and Reactome databases. The most significant differentially expressed genes from microarray were independently validated by real time PCR in 97 cases and 97 controls. A total of 190 gene transcripts showed significant differential expression (fold change > 2, P < 0.05) between the cases and the controls of which 142 genes were upregulated and 48 genes were downregulated. Genes associated with inflammation, immune response, cell regula- tion, proliferation and apoptotic pathways were enriched, while inflammatory and immune response genes were displayed as hubs in the network, having greater number of interactions with the neighbouring genes. Expression of 1/2/3, 8, 1, 2, 69, , , 4, 42, 58, and 42 genes were independently validated; 1/2/3 and 8 showed >8-fold higher expression in cases relative to the controls implying their important role in CAD. In conclusion, global gene expression profiling combined with network analysis can help in identifying key genes and pathways for CAD.

  2. LEGO: a novel method for gene set over-representation analysis by incorporating network-based gene weights.

    Science.gov (United States)

    Dong, Xinran; Hao, Yun; Wang, Xiao; Tian, Weidong

    2016-01-11

    Pathway or gene set over-representation analysis (ORA) has become a routine task in functional genomics studies. However, currently widely used ORA tools employ statistical methods such as Fisher's exact test that reduce a pathway into a list of genes, ignoring the constitutive functional non-equivalent roles of genes and the complex gene-gene interactions. Here, we develop a novel method named LEGO (functional Link Enrichment of Gene Ontology or gene sets) that takes into consideration these two types of information by incorporating network-based gene weights in ORA analysis. In three benchmarks, LEGO achieves better performance than Fisher and three other network-based methods. To further evaluate LEGO's usefulness, we compare LEGO with five gene expression-based and three pathway topology-based methods using a benchmark of 34 disease gene expression datasets compiled by a recent publication, and show that LEGO is among the top-ranked methods in terms of both sensitivity and prioritization for detecting target KEGG pathways. In addition, we develop a cluster-and-filter approach to reduce the redundancy among the enriched gene sets, making the results more interpretable to biologists. Finally, we apply LEGO to two lists of autism genes, and identify relevant gene sets to autism that could not be found by Fisher.

  3. Gene network analysis in a pediatric cohort identifies novel lung function genes.

    Directory of Open Access Journals (Sweden)

    Bruce A Ong

    Full Text Available Lung function is a heritable trait and serves as an important clinical predictor of morbidity and mortality for pulmonary conditions in adults, however, despite its importance, no studies have focused on uncovering pediatric-specific loci influencing lung function. To identify novel genetic determinants of pediatric lung function, we conducted a genome-wide association study (GWAS of four pulmonary function traits, including FVC, FEV1, FEV1/FVC and FEF25-75% in 1556 children. Further, we carried out gene network analyses for each trait including all SNPs with a P-value of <1.0 × 10(-3 from the individual GWAS. The GWAS identified SNPs with notable trends towards association with the pulmonary function measures, including the previously described INTS12 locus association with FEV1 (pmeta=1.41 × 10(-7. The gene network analyses identified 34 networks of genes associated with pulmonary function variables in Caucasians. Of those, the glycoprotein gene network reached genome-wide significance for all four variables. P-value range pmeta=6.29 × 10(-4 - 2.80 × 10(-8 on meta-analysis. In this study, we report on specific pathways that are significantly associated with pediatric lung function at genome-wide significance. In addition, we report the first loci associated with lung function in both pediatric Caucasian and African American populations.

  4. Comparative genomic analysis of sixty mycobacteriophage genomes: Genome clustering, gene acquisition and gene size

    Science.gov (United States)

    Hatfull, Graham F.; Jacobs-Sera, Deborah; Lawrence, Jeffrey G.; Pope, Welkin H.; Russell, Daniel A.; Ko, Ching-Chung; Weber, Rebecca J.; Patel, Manisha C.; Germane, Katherine L.; Edgar, Robert H.; Hoyte, Natasha N.; Bowman, Charles A.; Tantoco, Anthony T.; Paladin, Elizabeth C.; Myers, Marlana S.; Smith, Alexis L.; Grace, Molly S.; Pham, Thuy T.; O'Brien, Matthew B.; Vogelsberger, Amy M.; Hryckowian, Andrew J.; Wynalek, Jessica L.; Donis-Keller, Helen; Bogel, Matt W.; Peebles, Craig L.; Cresawn, Steve G.; Hendrix, Roger W.

    2010-01-01

    Mycobacteriophages are viruses that infect mycobacterial hosts. Expansion of a collection of sequenced phage genomes to a total of sixty – all infecting a common bacterial host – provides further insight into their diversity and evolution. Of the sixty phage genomes, 55 can be grouped into nine clusters according to their nucleotide sequence similarities, five of which can be further divided into subclusters; five genomes do not cluster with other phages. The sequence diversity between genomes within a cluster varies greatly; for example, the six genomes in cluster D share more than 97.5% average nucleotide similarity with each other. In contrast, similarity between the two genomes in Cluster I is barely detectable by diagonal plot analysis. The total of 6,858 predicted ORFs have been grouped into 1523 phamilies (phams) of related sequences, 46% of which possess only a single member. Only 18.8% of the phams have sequence similarity to non-mycobacteriophage database entries and fewer than 10% of all phams can be assigned functions based on database searching or synteny. Genome clustering facilitates the identification of genes that are in greatest genetic flux and are more likely to have been exchanged horizontally in relatively recent evolutionary time. Although mycobacteriophage genes exhibit smaller average size than genes of their host (205 residues compared to 315), phage genes in higher flux average only ∼100 amino acids, suggesting that the primary units of genetic exchange correspond to single protein domains. PMID:20064525

  5. Global Analysis of miRNA Gene Clusters and Gene Families Reveals Dynamic and Coordinated Expression

    Directory of Open Access Journals (Sweden)

    Li Guo

    2014-01-01

    Full Text Available To further understand the potential expression relationships of miRNAs in miRNA gene clusters and gene families, a global analysis was performed in 4 paired tumor (breast cancer and adjacent normal tissue samples using deep sequencing datasets. The compositions of miRNA gene clusters and families are not random, and clustered and homologous miRNAs may have close relationships with overlapped miRNA species. Members in the miRNA group always had various expression levels, and even some showed larger expression divergence. Despite the dynamic expression as well as individual difference, these miRNAs always indicated consistent or similar deregulation patterns. The consistent deregulation expression may contribute to dynamic and coordinated interaction between different miRNAs in regulatory network. Further, we found that those clustered or homologous miRNAs that were also identified as sense and antisense miRNAs showed larger expression divergence. miRNA gene clusters and families indicated important biological roles, and the specific distribution and expression further enrich and ensure the flexible and robust regulatory network.

  6. Tissue Non-Specific Genes and Pathways Associated with Diabetes: An Expression Meta-Analysis.

    Science.gov (United States)

    Mei, Hao; Li, Lianna; Liu, Shijian; Jiang, Fan; Griswold, Michael; Mosley, Thomas

    2017-01-21

    We performed expression studies to identify tissue non-specific genes and pathways of diabetes by meta-analysis. We searched curated datasets of the Gene Expression Omnibus (GEO) database and identified 13 and five expression studies of diabetes and insulin responses at various tissues, respectively. We tested differential gene expression by empirical Bayes-based linear method and investigated gene set expression association by knowledge-based enrichment analysis. Meta-analysis by different methods was applied to identify tissue non-specific genes and gene sets. We also proposed pathway mapping analysis to infer functions of the identified gene sets, and correlation and independent analysis to evaluate expression association profile of genes and gene sets between studies and tissues. Our analysis showed that PGRMC1 and HADH genes were significant over diabetes studies, while IRS1 and MPST genes were significant over insulin response studies, and joint analysis showed that HADH and MPST genes were significant over all combined data sets. The pathway analysis identified six significant gene sets over all studies. The KEGG pathway mapping indicated that the significant gene sets are related to diabetes pathogenesis. The results also presented that 12.8% and 59.0% pairwise studies had significantly correlated expression association for genes and gene sets, respectively; moreover, 12.8% pairwise studies had independent expression association for genes, but no studies were observed significantly different for expression association of gene sets. Our analysis indicated that there are both tissue specific and non-specific genes and pathways associated with diabetes pathogenesis. Compared to the gene expression, pathway association tends to be tissue non-specific, and a common pathway influencing diabetes development is activated through different genes at different tissues.

  7. Association and linkage analysis of aluminum tolerance genes in maize.

    Directory of Open Access Journals (Sweden)

    Allison M Krill

    Full Text Available BACKGROUND: Aluminum (Al toxicity is a major worldwide constraint to crop productivity on acidic soils. Al becomes soluble at low pH, inhibiting root growth and severely reducing yields. Maize is an important staple food and commodity crop in acidic soil regions, especially in South America and Africa where these soils are very common. Al exclusion and intracellular tolerance have been suggested as two important mechanisms for Al tolerance in maize, but little is known about the underlying genetics. METHODOLOGY: An association panel of 282 diverse maize inbred lines and three F2 linkage populations with approximately 200 individuals each were used to study genetic variation in this complex trait. Al tolerance was measured as net root growth in nutrient solution under Al stress, which exhibited a wide range of variation between lines. Comparative and physiological genomics-based approaches were used to select 21 candidate genes for evaluation by association analysis. CONCLUSIONS: Six candidate genes had significant results from association analysis, but only four were confirmed by linkage analysis as putatively contributing to Al tolerance: Zea mays AltSB like (ZmASL, Zea mays aluminum-activated malate transporter2 (ALMT2, S-adenosyl-L-homocysteinase (SAHH, and Malic Enzyme (ME. These four candidate genes are high priority subjects for follow-up biochemical and physiological studies on the mechanisms of Al tolerance in maize. Immediately, elite haplotype-specific molecular markers can be developed for these four genes and used for efficient marker-assisted selection of superior alleles in Al tolerance maize breeding programs.

  8. [IT15 gene analysis in two pedigrees of Huntington's disease].

    Science.gov (United States)

    Zhang, Bao-Rong; Song, Fei; Yin, Xin-Zhen; Xia, Kun; Tian, Jun; Huang, Jian-Zheng; Xia, Jia-Hui

    2006-11-01

    To investigate the relationship between the clinical features and (CAG)n trinucleotide repeats in two pedigrees of Chinese Huntington's disease (HD). Clinical and neuroimaging features, the age of disease onset and pattern of transmission of the patients were studied in the two pedigrees of HD. Genomic DNA of 42 family members was used for amplification of the (CAG)n repeats of IT15 gene by PCR. The numbers of (CAG)n were determined by electrophoresis through a 6% polyacrylamide gel and direct sequence analysis. Results showed that patients in pedigree 1 were absent of the typical triad of HD symptoms or caudate atrophy. A total of 9 (5 patients and 4 asymptomatic) out of 18 family members had 40-50 (CAG)n repeats in the IT15 gene. In pedigree 2, all the patients were characterized by a triad of symptoms, including motor disturbance, cognitive impairment and psychiatric features. Three patients and two asymptomatic relatives had more than 50 (CAG)n repeats in the IT15 gene. In conclusion, the clinical symptoms are partly determined by (CAG)n repeats in the IT15 gene. The age of onset was correlated with (CAG)n repeats over 50, and the phenomenon called "anticipation" was found to have played a role.

  9. An efficient RNA interference screening strategy for gene functional analysis

    Directory of Open Access Journals (Sweden)

    Chang Chih-Hung

    2012-09-01

    Full Text Available Abstract Background RNA interference (RNAi is commonly applied in genome-scale gene functional screens. However, a one-on-one RNAi analysis that targets each gene is cost-ineffective and laborious. Previous studies have indicated that siRNAs can also affect RNAs that are near-perfectly complementary, and this phenomenon has been termed an off-target effect. This phenomenon implies that it is possible to silence several genes simultaneously with a carefully designed siRNA. Results We propose a strategy that is combined with a heuristic algorithm to design suitable siRNAs that can target multiple genes and a group testing method that would reduce the number of required RNAi experiments in a large-scale RNAi analysis. To verify the efficacy of our strategy, we used the Orchid expressed sequence tag data as a case study to screen the putative transcription factors that are involved in plant disease responses. According to our computation, 94 qualified siRNAs were sufficient to examine all of the predicated 229 transcription factors. In addition, among the 94 computer-designed siRNAs, an siRNA that targets both TF15 (a previously identified transcription factor that is involved in the plant disease-response pathway and TF21 was introduced into orchids. The experimental results showed that this siRNA can simultaneously silence TF15 and TF21, and application of our strategy successfully confirmed that TF15 is involved in plant defense responses. Interestingly, our second-round analysis, which used an siRNA specific to TF21, indicated that TF21 is a previously unidentified transcription factor that is related to plant defense responses. Conclusions Our computational results showed that it is possible to screen all genes with fewer experiments than would be required for the traditional one-on-one RNAi screening. We also verified that our strategy is capable of identifying genes that are involved in a specific phenotype.

  10. Identification of housekeeping genes suitable for gene expression analysis in the pearl mussel, Hyriopsis cumingii, during biomineralization.

    Science.gov (United States)

    Bai, Zhiyi; Lin, Jingyun; Ma, Keyi; Wang, Guiling; Niu, Donghong; Li, Jiale

    2014-08-01

    Quantitative real-time polymerase chain reaction is a sensitive technique for quantifying gene expression levels. One or more appropriate reference genes must be selected to accurately compare mRNA transcript levels across different samples and tissues. The freshwater pearl, Hyriopsis cumingii (Lea), is an important economic species cultured in China. To date, no reference genes for gene expression analysis in this species have been validated. This study aimed to compare the relative expression of seven housekeeping genes across different tissue types and in the mantle or pearl sac during three biomineralization processes: seasonal shell growth, shell healing and pearl-sac formation in H. cumingii. Three programs evaluated the expression stabilities of the seven genes: BestKeeper, geNorm and NormFinder. The beta actin gene (ACTB), commonly used as a housekeeping gene in many studies, was the least stable. The expressions of Ubiquitin (Ubi) and Ribosomal protein L18 (Rpl18) and Elongation factor 1-alpha (EF1α) were more stable than the remaining four genes. Therefore, we suggest that Ubi, Rpl18 and EF1α are suitable reference genes. The three selected reference genes are expected to facilitate analysis of gene expressions during shell or pearl formation in H. cumingii.

  11. Molecular Characterization and Expression Analysis of Equine ( Gene in Horse (

    Directory of Open Access Journals (Sweden)

    Ki-Duk Song

    2014-05-01

    Full Text Available The objective of this study was to determine the molecular characteristics of the horse vascular endothelial growth factor alpha gene (VEGFα by constructing a phylogenetic tree, and to investigate gene expression profiles in tissues and blood leukocytes after exercise for development of suitable biomarkers. Using published amino acid sequences of other vertebrate species (human, chimpanzee, mouse, rat, cow, pig, chicken and dog, we constructed a phylogenetic tree which showed that equine VEGFα belonged to the same clade of the pig VEGFα. Analysis for synonymous (Ks and non-synonymous substitution ratios (Ka revealed that the horse VEGFα underwent positive selection. RNA was extracted from blood samples before and after exercise and different tissue samples of three horses. Expression analyses using reverse transcription-polymerase chain reaction (RT-PCR and quantitative-polymerase chain reaction (qPCR showed ubiquitous expression of VEGFα mRNA in skeletal muscle, kidney, thyroid, lung, appendix, colon, spinal cord, and heart tissues. Analysis of differential expression of VEGFα gene in blood leukocytes after exercise indicated a unimodal pattern. These results will be useful in developing biomarkers that can predict the recovery capacity of racing horses.

  12. Polymorphism Interaction Analysis (PIA: a method for investigating complex gene-gene interactions

    Directory of Open Access Journals (Sweden)

    Chanock Stephen J

    2008-03-01

    Full Text Available Abstract Background The risk of common diseases is likely determined by the complex interplay between environmental and genetic factors, including single nucleotide polymorphisms (SNPs. Traditional methods of data analysis are poorly suited for detecting complex interactions due to sparseness of data in high dimensions, which often occurs when data are available for a large number of SNPs for a relatively small number of samples. Validation of associations observed using multiple methods should be implemented to minimize likelihood of false-positive associations. Moreover, high-throughput genotyping methods allow investigators to genotype thousands of SNPs at one time. Investigating associations for each individual SNP or interactions between SNPs using traditional approaches is inefficient and prone to false positives. Results We developed the Polymorphism Interaction Analysis tool (PIA version 2.0 to include different approaches for ranking and scoring SNP combinations, to account for imbalances between case and control ratios, stratify on particular factors, and examine associations of user-defined pathways (based on SNP or gene with case status. PIA v. 2.0 detected 2-SNP interactions as the highest ranking model 77% of the time, using simulated data sets of genetic models of interaction (minor allele frequency = 0.2; heritability = 0.01; N = 1600 generated previously [Velez DR, White BC, Motsinger AA, Bush WS, Ritchie MD, Williams SM, Moore JH: A balanced accuracy function for epistasis modeling in imbalanced datasets using multifactor dimensionality reduction. Genet Epidemiol 2007, 31:306–315.]. Interacting SNPs were detected in both balanced (20 SNPs and imbalanced data (case:control 1:2 and 1:4, 10 SNPs in the context of non-interacting SNPs. Conclusion PIA v. 2.0 is a useful tool for exploring gene*gene or gene*environment interactions and identifying a small number of putative associations which may be investigated further using other

  13. κMicroarray analysis of relative gene expression stability for selection of internal reference genes in the rhesus macaque brain

    Directory of Open Access Journals (Sweden)

    Urbanski Henryk F

    2010-06-01

    Full Text Available Abstract Background Normalization of gene expression data refers to the comparison of expression values using reference standards that are consistent across all conditions of an experiment. In PCR studies, genes designated as "housekeeping genes" have been used as internal reference genes under the assumption that their expression is stable and independent of experimental conditions. However, verification of this assumption is rarely performed. Here we assess the use of gene microarray analysis to facilitate selection of internal reference sequences with higher expression stability across experimental conditions than can be expected using traditional selection methods. We recently demonstrated that relative gene expression from qRT-PCR data normalized using GAPDH, ALG9 and RPL13A expression values mirrored relative expression using quantile normalization in Robust Multichip Analysis (RMA on the Affymetrix® GeneChip® rhesus Macaque Genome Array. Having shown that qRT-PCR and Affymetrix® GeneChip® data from the same hormone replacement therapy (HRT study yielded concordant results, we used quantile-normalized gene microarray data to identify the most stably expressed among probe sets for prospective internal reference genes across three brain regions from the HRT study and an additional study of normally menstruating rhesus macaques (cycle study. Gene selection was limited to 575 previously published human "housekeeping" genes. Twelve animals were used per study, and three brain regions were analyzed from each animal. Gene expression stabilities were determined using geNorm, NormFinder and BestKeeper software packages. Results Sequences co-annotated for ribosomal protein S27a (RPS27A, and ubiquitin were among the most stably expressed under all conditions and selection criteria used for both studies. Higher annotation quality on the human GeneChip® facilitated more targeted analysis than could be accomplished using the rhesus GeneChip®. In

  14. A comparative analysis of biclustering algorithms for gene expression data.

    Science.gov (United States)

    Eren, Kemal; Deveci, Mehmet; Küçüktunç, Onur; Çatalyürek, Ümit V

    2013-05-01

    The need to analyze high-dimension biological data is driving the development of new data mining methods. Biclustering algorithms have been successfully applied to gene expression data to discover local patterns, in which a subset of genes exhibit similar expression levels over a subset of conditions. However, it is not clear which algorithms are best suited for this task. Many algorithms have been published in the past decade, most of which have been compared only to a small number of algorithms. Surveys and comparisons exist in the literature, but because of the large number and variety of biclustering algorithms, they are quickly outdated. In this article we partially address this problem of evaluating the strengths and weaknesses of existing biclustering methods. We used the BiBench package to compare 12 algorithms, many of which were recently published or have not been extensively studied. The algorithms were tested on a suite of synthetic data sets to measure their performance on data with varying conditions, such as different bicluster models, varying noise, varying numbers of biclusters and overlapping biclusters. The algorithms were also tested on eight large gene expression data sets obtained from the Gene Expression Omnibus. Gene Ontology enrichment analysis was performed on the resulting biclusters, and the best enrichment terms are reported. Our analyses show that the biclustering method and its parameters should be selected based on the desired model, whether that model allows overlapping biclusters, and its robustness to noise. In addition, we observe that the biclustering algorithms capable of finding more than one model are more successful at capturing biologically relevant clusters.

  15. Analysis of deterministic cyclic gene regulatory network models with delays

    CERN Document Server

    Ahsen, Mehmet Eren; Niculescu, Silviu-Iulian

    2015-01-01

    This brief examines a deterministic, ODE-based model for gene regulatory networks (GRN) that incorporates nonlinearities and time-delayed feedback. An introductory chapter provides some insights into molecular biology and GRNs. The mathematical tools necessary for studying the GRN model are then reviewed, in particular Hill functions and Schwarzian derivatives. One chapter is devoted to the analysis of GRNs under negative feedback with time delays and a special case of a homogenous GRN is considered. Asymptotic stability analysis of GRNs under positive feedback is then considered in a separate chapter, in which conditions leading to bi-stability are derived. Graduate and advanced undergraduate students and researchers in control engineering, applied mathematics, systems biology and synthetic biology will find this brief to be a clear and concise introduction to the modeling and analysis of GRNs.

  16. GeneMesh: a web-based microarray analysis tool for relating differentially expressed genes to MeSH terms

    Directory of Open Access Journals (Sweden)

    Argraves W Scott

    2010-04-01

    Full Text Available Abstract Background An important objective of DNA microarray-based gene expression experimentation is determining inter-relationships that exist between differentially expressed genes and biological processes, molecular functions, cellular components, signaling pathways, physiologic processes and diseases. Results Here we describe GeneMesh, a web-based program that facilitates analysis of DNA microarray gene expression data. GeneMesh relates genes in a query set to categories available in the Medical Subject Headings (MeSH hierarchical index. The interface enables hypothesis driven relational analysis to a specific MeSH subcategory (e.g., Cardiovascular System, Genetic Processes, Immune System Diseases etc. or unbiased relational analysis to broader MeSH categories (e.g., Anatomy, Biological Sciences, Disease etc.. Genes found associated with a given MeSH category are dynamically linked to facilitate tabular and graphical depiction of Entrez Gene information, Gene Ontology information, KEGG metabolic pathway diagrams and intermolecular interaction information. Expression intensity values of groups of genes that cluster in relation to a given MeSH category, gene ontology or pathway can be displayed as heat maps of Z score-normalized values. GeneMesh operates on gene expression data derived from a number of commercial microarray platforms including Affymetrix, Agilent and Illumina. Conclusions GeneMesh is a versatile web-based tool for testing and developing new hypotheses through relating genes in a query set (e.g., differentially expressed genes from a DNA microarray experiment to descriptors making up the hierarchical structure of the National Library of Medicine controlled vocabulary thesaurus, MeSH. The system further enhances the discovery process by providing links between sets of genes associated with a given MeSH category to a rich set of html linked tabular and graphic information including Entrez Gene summaries, gene ontologies

  17. Carotenoid biosynthetic genes in Brassica rapa: comparative genomic analysis, phylogenetic analysis, and expression profiling

    OpenAIRE

    Li, Peirong; Zhang, Shujiang; Zhang, Shifan; Li, Fei; Zhang, Hui; Cheng, Feng; Wu, Jian; Wang, Xiaowu; Sun, Rifei

    2015-01-01

    Background Carotenoids are isoprenoid compounds synthesized by all photosynthetic organisms. Despite much research on carotenoid biosynthesis in the model plant Arabidopsis thaliana, there is a lack of information on the carotenoid pathway in Brassica rapa. To better understand its carotenoid biosynthetic pathway, we performed a systematic analysis of carotenoid biosynthetic genes at the genome level in B. rapa. Results We identified 67 carotenoid biosynthetic genes in B. rapa, which were ort...

  18. Large Scale Gene Expression Meta-Analysis Reveals Tissue-Specific, Sex-Biased Gene Expression in Humans

    Science.gov (United States)

    Mayne, Benjamin T.; Bianco-Miotto, Tina; Buckberry, Sam; Breen, James; Clifton, Vicki; Shoubridge, Cheryl; Roberts, Claire T.

    2016-01-01

    The severity and prevalence of many diseases are known to differ between the sexes. Organ specific sex-biased gene expression may underpin these and other sexually dimorphic traits. To further our understanding of sex differences in transcriptional regulation, we performed meta-analyses of sex biased gene expression in multiple human tissues. We analyzed 22 publicly available human gene expression microarray data sets including over 2500 samples from 15 different tissues and 9 different organs. Briefly, by using an inverse-variance method we determined the effect size difference of gene expression between males and females. We found the greatest sex differences in gene expression in the brain, specifically in the anterior cingulate cortex, (1818 genes), followed by the heart (375 genes), kidney (224 genes), colon (218 genes), and thyroid (163 genes). More interestingly, we found different parts of the brain with varying numbers and identity of sex-biased genes, indicating that specific cortical regions may influence sexually dimorphic traits. The majority of sex-biased genes in other tissues such as the bladder, liver, lungs, and pancreas were on the sex chromosomes or involved in sex hormone production. On average in each tissue, 32% of autosomal genes that were expressed in a sex-biased fashion contained androgen or estrogen hormone response elements. Interestingly, across all tissues, we found approximately two-thirds of autosomal genes that were sex-biased were not under direct influence of sex hormones. To our knowledge this is the largest analysis of sex-biased gene expression in human tissues to date. We identified many sex-biased genes that were not under the direct influence of sex chromosome genes or sex hormones. These may provide targets for future development of sex-specific treatments for diseases.

  19. Large scale gene expression meta-analysis reveals tissue-specific, sex-biased gene expression in humans

    Directory of Open Access Journals (Sweden)

    Benjamin Mayne

    2016-10-01

    Full Text Available The severity and prevalence of many diseases are known to differ between the sexes. Organ specific sex-biased gene expression may underpin these and other sexually dimorphic traits. To further our understanding of sex differences in transcriptional regulation, we performed meta-analyses of sex biased gene expression in multiple human tissues. We analysed 22 publicly available human gene expression microarray data sets including over 2500 samples from 15 different tissues and 9 different organs. Briefly, by using an inverse-variance method we determined the effect size difference of gene expression between males and females. We found the greatest sex differences in gene expression in the brain, specifically in the anterior cingulate cortex, (1818 genes, followed by the heart (375 genes, kidney (224 genes, colon (218 genes and thyroid (163 genes. More interestingly, we found different parts of the brain with varying numbers and identity of sex-biased genes, indicating that specific cortical regions may influence sexually dimorphic traits. The majority of sex-biased genes in other tissues such as the bladder, liver, lungs and pancreas were on the sex chromosomes or involved in sex hormone production. On average in each tissue, 32% of autosomal genes that were expressed in a sex-biased fashion contained androgen or estrogen hormone response elements. Interestingly, across all tissues, we found approximately two-thirds of autosomal genes that were sex-biased were not under direct influence of sex hormones. To our knowledge this is the largest analysis of sex-biased gene expression in human tissues to date. We identified many sex-biased genes that were not under the direct influence of sex chromosome genes or sex hormones. These may provide targets for future development of sex-specific treatments for diseases.

  20. CONSIDERATIONS FOR OPTIMIZING MICROBIOME ANALYSIS USING A MARKER GENE

    Directory of Open Access Journals (Sweden)

    Jacobo de la Cuesta-Zuluaga

    2016-08-01

    Full Text Available Next generation sequencing technologies have found a widespread use in the study of host–microbe interactions due to the increase in their throughput and their ever-decreasing costs. The analysis of human-associated microbial communities using a marker gene, particularly the 16S rRNA, has greatly benefited from these technologies, the human gut microbiome research being a remarkable example of such analysis that has greatly expanded our understanding of microbe-mediated human health and disease, metabolism and food absorption. 16S studies go through a series of in vitro and in silico steps that can greatly influence their outcomes. However, the lack of a standardized workflow has led to uncertainties regarding the transparency and reproducibility of gut microbiome studies. We here discuss the most common challenges in the archetypical 16S rRNA workflow, including the extraction of total DNA, its use as template in PCR with primers that amplify specific hypervariable regions of the gene, amplicon sequencing, the denoising and removal of low quality reads, the detection and removal of chimeric sequences, the clustering of high-quality sequences into operational taxonomic units (OTUs and their taxonomic classification. We recommend the essential technical information that should be conveyed in publications for reproducibility of results and encourage non-experts to include procedures and available tools that mitigate most of the problems encountered in microbiome analysis.

  1. Transcript analysis of 250 novel yeast genes from chromosome XIV.

    Science.gov (United States)

    Planta, R J; Brown, A J; Cadahia, J L; Cerdan, M E; de Jonge, M; Gent, M E; Hayes, A; Kolen, C P; Lombardia, L J; Sefton, M; Oliver, S G; Thevelein, J; Tournu, H; van Delft, Y J; Verbart, D J; Winderickx, J

    1999-03-15

    The European Functional Analysis Network (EUROFAN) is systematically analysing the function of novel Saccharomyces cerevisiae genes revealed by genome sequencing. As part of this effort our consortium has performed a detailed transcript analysis for 250 novel ORFs on chromosome XIV. All transcripts were quantified by Northern analysis under three quasi-steady-state conditions (exponential growth on rich fermentative, rich non-fermentative, and minimal fermentative media) and eight transient conditions (glucose derepression, glucose upshift, stationary phase, nitrogen starvation, osmo-stress, heat-shock, and two control conditions). Transcripts were detected for 82% of the 250 ORFs, and only one ORF did not yield a transcript of the expected length (YNL285w). Transcripts ranged from low (62%), moderate (16%) to high abundance (2%) relative to the ACT1 mRNA. The levels of 73% of the 206 chromosome XIV transcripts detected fluctuated in response to the transient states tested. However, only a small number responded strongly to the transients: eight ORFs were induced upon glucose upshift; five were repressed by glucose; six were induced in response to nitrogen starvation; three were induced in stationary phase; five were induced by osmo-stress; four were induced by heat-shock. These data provide useful clues about the general function of these ORFs and add to our understanding of gene regulation on a genome-wide basis.

  2. Gene Regulation, Modulation, and Their Applications in Gene Expression Data Analysis

    Directory of Open Access Journals (Sweden)

    Mario Flores

    2013-01-01

    Full Text Available Common microarray and next-generation sequencing data analysis concentrate on tumor subtype classification, marker detection, and transcriptional regulation discovery during biological processes by exploring the correlated gene expression patterns and their shared functions. Genetic regulatory network (GRN based approaches have been employed in many large studies in order to scrutinize for dysregulation and potential treatment controls. In addition to gene regulation and network construction, the concept of the network modulator that has significant systemic impact has been proposed, and detection algorithms have been developed in past years. Here we provide a unified mathematic description of these methods, followed with a brief survey of these modulator identification algorithms. As an early attempt to extend the concept to new RNA regulation mechanism, competitive endogenous RNA (ceRNA, into a modulator framework, we provide two applications to illustrate the network construction, modulation effect, and the preliminary finding from these networks. Those methods we surveyed and developed are used to dissect the regulated network under different modulators. Not limit to these, the concept of “modulation” can adapt to various biological mechanisms to discover the novel gene regulation mechanisms.

  3. Sub-megabase resolution tiling (SMRT array-based comparative genomic hybridization profiling reveals novel gains and losses of chromosomal regions in Hodgkin Lymphoma and Anaplastic Large Cell Lymphoma cell lines

    Directory of Open Access Journals (Sweden)

    Lam Wan L

    2008-01-01

    Full Text Available Abstract Background Hodgkin lymphoma (HL and Anaplastic Large Cell Lymphoma (ALCL, are forms of malignant lymphoma defined by unique morphologic, immunophenotypic, genotypic, and clinical characteristics, but both overexpress CD30. We used sub-megabase resolution tiling (SMRT array-based comparative genomic hybridization to screen HL-derived cell lines (KMH2 and L428 and ALCL cell lines (DEL and SR-786 in order to identify disease-associated gene copy number gains and losses. Results Significant copy number gains and losses were observed on several chromosomes in all four cell lines. Assessment of copy number alterations with 26,819 DNA segments identified an average of 20 genetic alterations. Of the recurrent minimally altered regions identified, 11 (55% were within previously published regions of chromosomal alterations in HL and ALCL cell lines while 9 (45% were novel alterations not previously reported. HL cell lines L428 and KMH2 shared gains in chromosome cytobands 2q23.1-q24.2, 7q32.2-q36.3, 9p21.3-p13.3, 12q13.13-q14.1, and losses in 13q12.13-q12.3, and 18q21.32-q23. ALCL cell lines SR-786 and DEL, showed gains in cytobands 5p15.32-p14.3, 20p12.3-q13.11, and 20q13.2-q13.32. Both pairs of HL and ALCL cell lines showed losses in 18q21.32-18q23. Conclusion This study is considered to be the first one describing HL and ALCL cell line genomes at sub-megabase resolution. This high-resolution analysis allowed us to propose novel candidate target genes that could potentially contribute to the pathogenesis of HL and ALCL. FISH was used to confirm the amplification of all three isoforms of the trypsin gene (PRSS1/PRSS2/PRSS3 in KMH2 and L428 (HL and DEL (ALCL cell lines. These are novel findings that have not been previously reported in the lymphoma literature, and opens up an entirely new area of research that has not been previously associated with lymphoma biology. The findings raise interesting possibilities about the role of signaling

  4. Rapid analysis of colipase gene variants by multicapillary electrophoresis.

    Science.gov (United States)

    Jaczó, Zsuzsanna; Pál, Eszter; Dénes, Réka; Somogyi, Anikó; Sasvári-Székely, Mária; Guttman, András; Rónai, Zsolt

    2015-06-01

    Despite of the fact that the Human Genome Project was completed more than a decade ago, identification of the genetic background of polygenic diseases is still challenging. Several somewhat different approaches are available to investigate inheritable factors of complex phenotypes, all require, however efficient, high-throughput techniques for SNP genotyping. In this paper, we report a robust and reliable multiplex PCR-RFLP for genotype and haplotype analysis of six SNPs (rs41270082, rs3748051, rs142027015, rs3748048, rs73404011, and rs72925892) of the colipase (CLPS) gene. A multicapillary (12 capillaries) electrophoresis unit was used for high throughput and sensitive analysis of the digestion fragments. A Microsoft Excel-based spreadsheet was designed for the flexible visualization and evaluation of the electrophoretic separations, which is readily adaptable for any kind of electrophoresis application. Haplotype analysis of the two loci localized in close proximity of each other was carried out by molecular method, extended haplotypes including all five SNPs in the 5' upstream region were calculated. The techniques were applied in a case-control association study of type 2 diabetes mellitus. Although, single marker analysis did not reveal any significant association, it was observed that the rare GGCCG haplotype of the five 5' upstream region SNPs was about three times more frequent among patients compared to healthy control population. Our results demonstrated the applicability of multicapillary CGE in large-scale, high-throughput SNP analysis, and suggested that the CLPS gene polymorphisms might be considered as genetic risk factor for type 2 diabetes mellitus.

  5. Importance of rare gene copy number alterations for personalized tumor characterization and survival analysis.

    Science.gov (United States)

    Seifert, Michael; Friedrich, Betty; Beyer, Andreas

    2016-10-03

    It has proven exceedingly difficult to ascertain rare copy number alterations (CNAs) that may have strong effects in individual tumors. We show that a regulatory network inferred from gene expression and gene copy number data of 768 human cancer cell lines can be used to quantify the impact of patient-specific CNAs on survival signature genes. A focused analysis of tumors from six tissues reveals that rare patient-specific gene CNAs often have stronger effects on signature genes than frequent gene CNAs. Further comparison to a related network-based approach shows that the integration of indirectly acting gene CNAs significantly improves the survival analysis.

  6. Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression

    Directory of Open Access Journals (Sweden)

    Melo Francisco

    2006-11-01

    Full Text Available Abstract Background In this study, we present a robust and reliable computational method for tag-to-gene assignment in serial analysis of gene expression (SAGE. The method relies on current genome information and annotation, incorporation of several new features, and key improvements over alternative methods, all of which are important to determine gene expression levels more accurately. The method provides a complete annotation of potential virtual SAGE tags within a genome, along with an estimation of their confidence for experimental observation that ranks tags that present multiple matches in the genome. Results We applied this method to the Saccharomyces cerevisiae genome, producing the most thorough and accurate annotation of potential virtual SAGE tags that is available today for this organism. The usefulness of this method is exemplified by the significant reduction of ambiguous cases in existing experimental SAGE data. In addition, we report new insights from the analysis of existing SAGE data. First, we found that experimental SAGE tags mapping onto introns, intron-exon boundaries, and non-coding RNA elements are observed in all available SAGE data. Second, a significant fraction of experimental SAGE tags was found to map onto genomic regions currently annotated as intergenic. Third, a significant number of existing experimental SAGE tags for yeast has been derived from truncated cDNAs, which are synthesized through oligo-d(T priming to internal poly-(A regions during reverse transcription. Conclusion We conclude that an accurate and unambiguous tag mapping process is essential to increase the quality and the amount of information that can be extracted from SAGE experiments. This is supported by the results obtained here and also by the large impact that the erroneous interpretation of these data could have on downstream applications.

  7. Conditional promoters for analysis of essential genes in Zymoseptoria tritici.

    Science.gov (United States)

    Kilaru, S; Ma, W; Schuster, M; Courbot, M; Steinberg, G

    2015-06-01

    Development of new fungicides, needed for sustainable control of fungal plant pathogens, requires identification of novel anti-fungal targets. Essential fungal-specific proteins are good candidates, but due to their importance, gene deletion mutants are not viable. Consequently, their cellular role often remains elusive. This hindrance can be overcome by the use of conditional mutants, where expression is controlled by an inducible/repressible promoter. Here, we introduce 5 inducible/repressible promoter systems to study essential genes in the wheat pathogen Zymoseptoria tritici. We fused the gene for enhanced green-fluorescent protein (egfp) to the promoter region of Z. tritici nitrate reductase (Pnar1; induced by nitrogen and repressed by ammonium), 1,4-β-endoxylanase A (Pex1A; induced by xylose and repressed by maltodextrin), l-arabinofuranosidase B (PlaraB; induced by arabinose and repressed by glucose), galactose-1-phosphate uridylyltransferase 7 (Pgal7; induced by galactose and repressed by glucose) and isocitrate lyase (Picl1; induced by sodium acetate and repressed by glucose). This was followed by quantitative analysis of cytoplasmic reporter fluorescence under induced and repressed conditions. We show that Pnar1, PlaraB and Pex1A drive very little or no egfp expression when repressed, but induce moderate protein production when induced. In contrast, Pgal7 and Picl1 show considerable egfp expression when repressed, and were strongly induced in the presence of their inducers. Normalising the expression levels of all promoters to that of the α-tubulin promoter Ptub2 revealed that PlaraB was the weakest promoter (∼20% of Ptub2), whereas Picl1 strongly expressed the reporter (∼250% of Ptub2). The use of these tools promises a better understanding of essential genes, which will help developing novel control strategies that protect wheat from Z. tritici.

  8. Canonical correlation analysis for gene-based pleiotropy discovery.

    Directory of Open Access Journals (Sweden)

    Jose A Seoane

    2014-10-01

    Full Text Available Genome-wide association studies have identified a wealth of genetic variants involved in complex traits and multifactorial diseases. There is now considerable interest in testing variants for association with multiple phenotypes (pleiotropy and for testing multiple variants for association with a single phenotype (gene-based association tests. Such approaches can increase statistical power by combining evidence for association over multiple phenotypes or genetic variants respectively. Canonical Correlation Analysis (CCA measures the correlation between two sets of multidimensional variables, and thus offers the potential to combine these two approaches. To apply CCA, we must restrict the number of attributes relative to the number of samples. Hence we consider modules of genetic variation that can comprise a gene, a pathway or another biologically relevant grouping, and/or a set of phenotypes. In order to do this, we use an attribute selection strategy based on a binary genetic algorithm. Applied to a UK-based prospective cohort study of 4286 women (the British Women's Heart and Health Study, we find improved statistical power in the detection of previously reported genetic associations, and identify a number of novel pleiotropic associations between genetic variants and phenotypes. New discoveries include gene-based association of NSF with triglyceride levels and several genes (ACSM3, ERI2, IL18RAP, IL23RAP and NRG1 with left ventricular hypertrophy phenotypes. In multiple-phenotype analyses we find association of NRG1 with left ventricular hypertrophy phenotypes, fibrinogen and urea and pleiotropic relationships of F7 and F10 with Factor VII, Factor IX and cholesterol levels.

  9. GeneAnalytics: An Integrative Gene Set Analysis Tool for Next Generation Sequencing, RNAseq and Microarray Data.

    Science.gov (United States)

    Ben-Ari Fuchs, Shani; Lieder, Iris; Stelzer, Gil; Mazor, Yaron; Buzhor, Ella; Kaplan, Sergey; Bogoch, Yoel; Plaschkes, Inbar; Shitrit, Alina; Rappaport, Noa; Kohn, Asher; Edgar, Ron; Shenhav, Liraz; Safran, Marilyn; Lancet, Doron; Guan-Golan, Yaron; Warshawsky, David; Shtrichman, Ronit

    2016-03-01

    Postgenomics data are produced in large volumes by life sciences and clinical applications of novel omics diagnostics and therapeutics for precision medicine. To move from "data-to-knowledge-to-innovation," a crucial missing step in the current era is, however, our limited understanding of biological and clinical contexts associated with data. Prominent among the emerging remedies to this challenge are the gene set enrichment tools. This study reports on GeneAnalytics™ ( geneanalytics.genecards.org ), a comprehensive and easy-to-apply gene set analysis tool for rapid contextualization of expression patterns and functional signatures embedded in the postgenomics Big Data domains, such as Next Generation Sequencing (NGS), RNAseq, and microarray experiments. GeneAnalytics' differentiating features include in-depth evidence-based scoring algorithms, an intuitive user interface and proprietary unified data. GeneAnalytics employs the LifeMap Science's GeneCards suite, including the GeneCards®--the human gene database; the MalaCards-the human diseases database; and the PathCards--the biological pathways database. Expression-based analysis in GeneAnalytics relies on the LifeMap Discovery®--the embryonic development and stem cells database, which includes manually curated expression data for normal and diseased tissues, enabling advanced matching algorithm for gene-tissue association. This assists in evaluating differentiation protocols and discovering biomarkers for tissues and cells. Results are directly linked to gene, disease, or cell "cards" in the GeneCards suite. Future developments aim to enhance the GeneAnalytics algorithm as well as visualizations, employing varied graphical display items. Such attributes make GeneAnalytics a broadly applicable postgenomics data analyses and interpretation tool for translation of data to knowledge-based innovation in various Big Data fields such as precision medicine, ecogenomics, nutrigenomics, pharmacogenomics, vaccinomics

  10. Data Preprocessing in Cluster Analysis of Gene Expression

    Institute of Scientific and Technical Information of China (English)

    杨春梅; 万柏坤; 高晓峰

    2003-01-01

    Considering that the DNA microarray technology has generated explosive gene expression data and that it is urgent to analyse and to visualize such massive datasets with efficient methods, we investigate the data preprocessing methods used in cluster analysis, normalization or logarithm of the matrix, by using hierarchical clustering, principal component analysis (PCA) and self-organizing maps (SOMs). The results illustrate that when using the Euclidean distance as measuring metrics, logarithm of relative expression level is the best preprocessing method, while data preprocessed by normalization cannot attain the expected results because the data structure is ruined. If there are only a few principal components, the PCA is an effective method to extract the frame structure, while SOMs are more suitable for a specific structure.

  11. Suitable reference genes for accurate gene expression analysis in parsley (Petroselinum crispum for abiotic stresses and hormone stimuli

    Directory of Open Access Journals (Sweden)

    Meng-Yao Li

    2016-09-01

    Full Text Available Parsley is one of the most important vegetable in Apiaceae family and widely used in food industry, medicinal and cosmetic. The recent studies in parsley are mainly focus on chemical composition, further research involving the analysis of the gene functions and expressions will be required. qPCR is a powerful method for detecting very low quantities of target transcript levels and widely used for gene expression studies. To ensure the accuracy of results, a suitable reference gene is necessary for expression normalization. In this study, three software geNorm, NormFinder, and BestKeeper were used to evaluate the expression stabilities of eight candidate reference genes (GAPDH, ACTIN, eIF-4α, SAND, UBC, TIP41, EF-1α, and TUB under various conditions including abiotic stresses (heat, cold, salt, and drought and hormone stimuli treatments (GA, SA, MeJA, and ABA. The results showed that EF-1α and TUB were identified as the most stable genes for abiotic stresses, while EF-1α, GAPDH, and TUB were the top three choices for hormone stimuli treatments. Moreover, EF-1α and TUB were the most stable reference genes across all the tested samples, while UBC was the least stable one. The expression analysis of PcDREB1 and PcDREB2 further verified that the selected stable reference genes were suitable for gene expression normalization. This study provides a guideline for selection the suitable reference genes in gene expression in parsley.

  12. Molecular characterization and analysis of the porcine NURR1 gene

    Directory of Open Access Journals (Sweden)

    Knud Larsen

    2016-12-01

    Here we report the isolation and characterization of porcine NURR1 cDNA. The NURR1 cDNA was RT-PCR cloned using NURR1-specific oligonucleotide primers derived from in silico sequences. The porcine NURR1 cDNA encodes a polypeptide of 598 amino acids, displaying a very high similarity with bovine, human and mouse (99% NURR1 protein. Expression analysis revealed a differential NURR1 mRNA expression in various organs and tissues. NURR1 transcripts could be detected as early as at 60 days of embryo development in different brain tissues. A significant increase in NURR1 transcript in the cerebellum and a decrease in NURR1 transcript in the basal ganglia was observed during embryo development. The porcine NURR1 gene was mapped to chromosome 15. Two missense mutations were found in exon 3, the first coding exon of NURR1. Methylation analysis of the porcine NURR1 gene body revealed a high methylation degree in brain tissue, whereas methylation of the promoter was very low. A decrease in DNA methylation in a discrete region of the NURR1 promoter was observed in pig frontal cortex during pig embryo development. This observation correlated with an increase in NURR1 transcripts. Therefore, methylation might be a determinant of NURR1 expression at certain time points in embryo development.

  13. Gene expression analysis of in vivo fluorescent cells.

    Directory of Open Access Journals (Sweden)

    Konstantin Khodosevich

    Full Text Available BACKGROUND: The analysis of gene expression for tissue homogenates is of limited value because of the considerable cell heterogeneity in tissues. However, several methods are available to isolate a cell type of interest from a complex tissue, the most reliable one being Laser Microdissection (LMD. Cells may be distinguished by their morphology or by specific antigens, but the obligatory staining often results in RNA degradation. Alternatively, particular cell types can be detected in vivo by expression of fluorescent proteins from cell type-specific promoters. METHODOLOGY/PRINCIPAL FINDINGS: We developed a technique for fixing in vivo fluorescence in brain cells and isolating them by LMD followed by an optimized RNA isolation procedure. RNA isolated from these cells was of equal quality as from unfixed frozen tissue, with clear 28S and 18S rRNA bands of a mass ratio of approximately 2ratio1. We confirmed the specificity of the amplified RNA from the microdissected fluorescent cells as well as its usefulness and reproducibility for microarray hybridization and quantitative real-time PCR (qRT-PCR. CONCLUSIONS/SIGNIFICANCE: Our technique guarantees the isolation of sufficient high quality RNA obtained from specific cell populations of the brain expressing soluble fluorescent marker, which is a critical prerequisite for subsequent gene expression studies by microarray analysis or qRT-PCR.

  14. Transcriptional analysis of exopolysaccharides biosynthesis gene clusters in Lactobacillus plantarum.

    Science.gov (United States)

    Vastano, Valeria; Perrone, Filomena; Marasco, Rosangela; Sacco, Margherita; Muscariello, Lidia

    2016-04-01

    Exopolysaccharides (EPS) from lactic acid bacteria contribute to specific rheology and texture of fermented milk products and find applications also in non-dairy foods and in therapeutics. Recently, four clusters of genes (cps) associated with surface polysaccharide production have been identified in Lactobacillus plantarum WCFS1, a probiotic and food-associated lactobacillus. These clusters are involved in cell surface architecture and probably in release and/or exposure of immunomodulating bacterial molecules. Here we show a transcriptional analysis of these clusters. Indeed, RT-PCR experiments revealed that the cps loci are organized in five operons. Moreover, by reverse transcription-qPCR analysis performed on L. plantarum WCFS1 (wild type) and WCFS1-2 (ΔccpA), we demonstrated that expression of three cps clusters is under the control of the global regulator CcpA. These results, together with the identification of putative CcpA target sequences (catabolite responsive element CRE) in the regulatory region of four out of five transcriptional units, strongly suggest for the first time a role of the master regulator CcpA in EPS gene transcription among lactobacilli.

  15. Array-based DNA methylation profiling for breast cancer subtype discrimination.

    Directory of Open Access Journals (Sweden)

    Ilse Van der Auwera

    Full Text Available BACKGROUND: Abnormal DNA methylation is well established for breast cancer and contributes to its progression by silencing tumor suppressor genes. DNA methylation profiling platforms might provide an alternative approach to expression microarrays for accurate breast tumor subtyping. We sought to determine whether the distinction of the inflammatory breast cancer (IBC phenotype from the non-IBC phenotype by transcriptomics could be sustained by methylomics. METHODOLOGY/PRINCIPAL FINDINGS: We performed methylation profiling on a cohort of IBC (N = 19 and non-IBC (N = 43 samples using the Illumina Infinium Methylation Assay. These results were correlated with gene expression profiles. Methylation values allowed separation of breast tumor samples into high and low methylation groups. This separation was significantly related to DNMT3B mRNA levels. The high methylation group was enriched for breast tumor samples from patients with distant metastasis and poor prognosis, as predicted by the 70-gene prognostic signature. Furthermore, this tumor group tended to be enriched for IBC samples (54% vs. 24% and samples with a high genomic grade index (67% vs. 38%. A set of 16 CpG loci (14 genes correctly classified 97% of samples into the low or high methylation group. Differentially methylated genes appeared to be mainly related to focal adhesion, cytokine-cytokine receptor interactions, Wnt signaling pathway, chemokine signaling pathways and metabolic processes. Comparison of IBC with non-IBC led to the identification of only four differentially methylated genes (TJP3, MOGAT2, NTSR2 and AGT. A significant correlation between methylation values and gene expression was shown for 4,981 of 6,605 (75% genes. CONCLUSIONS/SIGNIFICANCE: A subset of clinical samples of breast cancer was characterized by high methylation levels, which coincided with increased DNMT3B expression. Furthermore, an association was observed with molecular signatures indicative of

  16. Reference gene selection for gene expression analysis of oocytes collected from dairy cattle and buffaloes during winter and summer.

    Directory of Open Access Journals (Sweden)

    Carolina Habermann Macabelli

    Full Text Available Oocytes from dairy cattle and buffaloes have severely compromised developmental competence during summer. While analysis of gene expression is a powerful technique for understanding the factors affecting developmental hindrance in oocytes, analysis by real-time reverse transcription PCR (RT-PCR relies on the correct normalization by reference genes showing stable expression. Furthermore, several studies have found that genes commonly used as reference standards do not behave as expected depending on cell type and experimental design. Hence, it is recommended to evaluate expression stability of candidate reference genes for a specific experimental condition before employing them as internal controls. In acknowledgment of the importance of seasonal effects on oocyte gene expression, the aim of this study was to evaluate the stability of expression levels of ten well-known reference genes (ACTB, GAPDH, GUSB, HIST1H2AG, HPRT1, PPIA, RPL15, SDHA, TBP and YWHAZ using oocytes collected from different categories of dairy cattle and buffaloes during winter and summer. A normalization factor was provided for cattle (RPL15, PPIA and GUSB and buffaloes (YWHAZ, GUSB and GAPDH based on the expression of the three most stable reference genes in each species. Normalization of non-reference target genes by these reference genes was shown to be considerably different from normalization by less stable reference genes, further highlighting the need for careful selection of internal controls. Therefore, due to the high variability of reference genes among experimental groups, we conclude that data normalized by internal controls can be misleading and should be compared to not normalized data or to data normalized by an external control in order to better interpret the biological relevance of gene expression analysis.

  17. Global gene expression analysis of the zoonotic parasite Trichinella spiralis revealed novel genes in host parasite interaction.

    Directory of Open Access Journals (Sweden)

    Xiaolei Liu

    Full Text Available BACKGROUND: Trichinellosis is a typical food-borne zoonotic disease which is epidemic worldwide and the nematode Trichinella spiralis is the main pathogen. The life cycle of T. spiralis contains three developmental stages, i.e. adult worms, new borne larva (new borne L1 larva and muscular larva (infective L1 larva. Stage-specific gene expression in the parasites has been investigated with various immunological and cDNA cloning approaches, whereas the genome-wide transcriptome and expression features of the parasite have been largely unknown. The availability of the genome sequence information of T. spiralis has made it possible to deeply dissect parasite biology in association with global gene expression and pathogenesis. METHODOLOGY AND PRINCIPAL FINDINGS: In this study, we analyzed the global gene expression patterns in the three developmental stages of T. spiralis using digital gene expression (DGE analysis. Almost 15 million sequence tags were generated with the Illumina RNA-seq technology, producing expression data for more than 9,000 genes, covering 65% of the genome. The transcriptome analysis revealed thousands of differentially expressed genes within the genome, and importantly, a panel of genes encoding functional proteins associated with parasite invasion and immuno-modulation were identified. More than 45% of the genes were found to be transcribed from both strands, indicating the importance of RNA-mediated gene regulation in the development of the parasite. Further, based on gene ontological analysis, over 3000 genes were functionally categorized and biological pathways in the three life cycle stage were elucidated. CONCLUSIONS AND SIGNIFICANCE: The global transcriptome of T. spiralis in three developmental stages has been profiled, and most gene activity in the genome was found to be developmentally regulated. Many metabolic and biological pathways have been revealed. The findings of the differential expression of several protein

  18. Reference Gene Selection for Gene Expression Analysis of Oocytes Collected from Dairy Cattle and Buffaloes during Winter and Summer

    Science.gov (United States)

    Gimenes, Lindsay Unno; de Carvalho, Nelcio Antonio Tonizza; Soares, Júlia Gleyci; Ayres, Henderson; Ferraz, Márcio Leão; Watanabe, Yeda Fumie; Watanabe, Osnir Yoshime; Sangalli, Juliano Rodrigues; Smith, Lawrence Charles; Baruselli, Pietro Sampaio; Meirelles, Flávio Vieira; Chiaratti, Marcos Roberto

    2014-01-01

    Oocytes from dairy cattle and buffaloes have severely compromised developmental competence during summer. While analysis of gene expression is a powerful technique for understanding the factors affecting developmental hindrance in oocytes, analysis by real-time reverse transcription PCR (RT-PCR) relies on the correct normalization by reference genes showing stable expression. Furthermore, several studies have found that genes commonly used as reference standards do not behave as expected depending on cell type and experimental design. Hence, it is recommended to evaluate expression stability of candidate reference genes for a specific experimental condition before employing them as internal controls. In acknowledgment of the importance of seasonal effects on oocyte gene expression, the aim of this study was to evaluate the stability of expression levels of ten well-known reference genes (ACTB, GAPDH, GUSB, HIST1H2AG, HPRT1, PPIA, RPL15, SDHA, TBP and YWHAZ) using oocytes collected from different categories of dairy cattle and buffaloes during winter and summer. A normalization factor was provided for cattle (RPL15, PPIA and GUSB) and buffaloes (YWHAZ, GUSB and GAPDH) based on the expression of the three most stable reference genes in each species. Normalization of non-reference target genes by these reference genes was shown to be considerably different from normalization by less stable reference genes, further highlighting the need for careful selection of internal controls. Therefore, due to the high variability of reference genes among experimental groups, we conclude that data normalized by internal controls can be misleading and should be compared to not normalized data or to data normalized by an external control in order to better interpret the biological relevance of gene expression analysis. PMID:24676354

  19. Left-handed compact MIMO antenna array based on wire spiral resonator for 5-GHz wireless applications

    Science.gov (United States)

    Alqadami, Abdulrahman Shueai Mohsen; Jamlos, Mohd Faizal; Soh, Ping Jack; Rahim, Sharul Kamal Abdul; Narbudowicz, Adam

    2017-01-01

    A compact coplanar waveguide-fed multiple-input multiple-output antenna array based on the left-handed wire loaded spiral resonators (SR) is presented. The proposed antenna consists of a 2 × 2 wire SR with two symmetrical microstrip feed lines, each line exciting a 1 × 2 wire SR. Left-handed metamaterial unit cells are placed on its reverse side and arranged in a 2 × 3 array. A reflection coefficient of less than -16 dB and mutual coupling of less than -28 dB are achieved at 5.15 GHz WLAN band.

  20. Cloning and Expression Analysis of Mlo Gene from Pericallis hybrida B. Nord

    Institute of Scientific and Technical Information of China (English)

    Wang Jin-gang; Li Wei; Ren Hong-wei; Lv Yuan-da; Bai Ding; Xu Jing

    2014-01-01

    The full-length Mlo gene was obtained by reverse transcription polymerase chain reaction (RT-PCR) and RACE. The result of sequence analysis indicated that Mlo gene from Pericallis hybrida B. Nord. contained about 1 296 bp open reading frame and encoded 431 amino acids. According to the comparison of the exogenous gene sequences by BLAST analysis and phylogenetic analysis, Mlo gene shared over 85%nucleotide homology and 98%amino acid homology. Finally, through semi-quantitative-PCR and fluorescence quantitative analysis, we found that Mlo gene showed the highest expression levels in leaves and the lowest in roots after inoculated with powdery mildew pathogen for different days.

  1. Identify the signature genes for diagnose of uveal melanoma by weight gene co-expression network analysis

    Institute of Scientific and Technical Information of China (English)

    Kai; Shi; Zhi-Tong; Bing; Gui-Qun; Cao; Ling; Guo; Ya-Na; Cao; Hai-Ou; Jiang; Mei-Xia; Zhang

    2015-01-01

    AIM: To identify and understand the relationship between co-expression pattern and clinic traits in uveal melanoma, weighted gene co-expression network analysis(WGCNA) is applied to investigate the gene expression levels and patient clinic features. Uveal melanoma is the most common primary eye tumor in adults. Although many studies have identified some important genes and pathways that were relevant to progress of uveal melanoma, the relationship between co-expression and clinic traits in systems level of uveal melanoma is unclear yet. We employ WGCNA to investigate the relationship underlying molecular and phenotype in this study.METHODS: Gene expression profile of uveal melanoma and patient clinic traits were collected from the Gene Expression Omnibus(GEO) database. The gene co-expression is calculated by WGCNA that is the R package software. The package is used to analyze the correlation between pairs of expression levels of genes.The function of the genes were annotated by gene ontology(GO).RESULTS: In this study, we identified four co-expression modules significantly correlated with clinictraits. Module blue positively correlated with radiotherapy treatment. Module purple positively correlates with tumor location(sclera) and negatively correlates with patient age. Module red positively correlates with sclera and negatively correlates with thickness of tumor. Module black positively correlates with the largest tumor diameter(LTD). Additionally, we identified the hug gene(top connectivity with other genes) in each module. The hub gene RPS15 A, PTGDS, CD53 and MSI2 might play a vital role in progress of uveal melanoma.CONCLUSION: From WGCNA analysis and hub gene calculation, we identified RPS15 A, PTGDS, CD53 and MSI2 might be target or diagnosis for uveal melanoma.

  2. GeneAnalytics: An Integrative Gene Set Analysis Tool for Next Generation Sequencing, RNAseq and Microarray Data

    Science.gov (United States)

    Ben-Ari Fuchs, Shani; Lieder, Iris; Mazor, Yaron; Buzhor, Ella; Kaplan, Sergey; Bogoch, Yoel; Plaschkes, Inbar; Shitrit, Alina; Rappaport, Noa; Kohn, Asher; Edgar, Ron; Shenhav, Liraz; Safran, Marilyn; Lancet, Doron; Guan-Golan, Yaron; Warshawsky, David; Shtrichman, Ronit

    2016-01-01

    Abstract Postgenomics data are produced in large volumes by life sciences and clinical applications of novel omics diagnostics and therapeutics for precision medicine. To move from “data-to-knowledge-to-innovation,” a crucial missing step in the current era is, however, our limited understanding of biological and clinical contexts associated with data. Prominent among the emerging remedies to this challenge are the gene set enrichment tools. This study reports on GeneAnalytics™ (geneanalytics.genecards.org), a comprehensive and easy-to-apply gene set analysis tool for rapid contextualization of expression patterns and functional signatures embedded in the postgenomics Big Data domains, such as Next Generation Sequencing (NGS), RNAseq, and microarray experiments. GeneAnalytics' differentiating features include in-depth evidence-based scoring algorithms, an intuitive user interface and proprietary unified data. GeneAnalytics employs the LifeMap Science's GeneCards suite, including the GeneCards®—the human gene database; the MalaCards—the human diseases database; and the PathCards—the biological pathways database. Expression-based analysis in GeneAnalytics relies on the LifeMap Discovery®—the embryonic development and stem cells database, which includes manually curated expression data for normal and diseased tissues, enabling advanced matching algorithm for gene–tissue association. This assists in evaluating differentiation protocols and discovering biomarkers for tissues and cells. Results are directly linked to gene, disease, or cell “cards” in the GeneCards suite. Future developments aim to enhance the GeneAnalytics algorithm as well as visualizations, employing varied graphical display items. Such attributes make GeneAnalytics a broadly applicable postgenomics data analyses and interpretation tool for translation of data to knowledge-based innovation in various Big Data fields such as precision medicine, ecogenomics, nutrigenomics

  3. Identify the signature genes for diagnose of uveal melanoma by weight gene co-expression network analysis

    Directory of Open Access Journals (Sweden)

    Kai Shi

    2015-04-01

    Full Text Available AIM: To identify and understand the relationship between co-expression pattern and clinic traits in uveal melanoma, weighted gene co-expression network analysis (WGCNA is applied to investigate the gene expression levels and patient clinic features. Uveal melanoma is the most common primary eye tumor in adults. Although many studies have identified some important genes and pathways that were relevant to progress of uveal melanoma, the relationship between co-expression and clinic traits in systems level of uveal melanoma is unclear yet. We employ WGCNA to investigate the relationship underlying molecular and phenotype in this study. METHODS: Gene expression profile of uveal melanoma and patient clinic traits were collected from the Gene Expression Omnibus (GEO database. The gene co-expression is calculated by WGCNA that is the R package software. The package is used to analyze the correlation between pairs of expression levels of genes. The function of the genes were annotated by gene ontology (GO. RESULTS: In this study, we identified four co-expression modules significantly correlated with clinic traits. Module blue positively correlated with radiotherapy treatment. Module purple positively correlates with tumor location (sclera and negatively correlates with patient age. Module red positively correlates with sclera and negatively correlates with thickness of tumor. Module black positively correlates with the largest tumor diameter (LTD. Additionally, we identified the hug gene (top connectivity with other genes in each module. The hub gene RPS15A, PTGDS, CD53 and MSI2 might play a vital role in progress of uveal melanoma. CONCLUSION: From WGCNA analysis and hub gene calculation, we identified RPS15A, PTGDS, CD53 and MSI2 might be target or diagnosis for uveal melanoma.

  4. Molecular cloning and SNP association analysis of chicken PMCH gene.

    Science.gov (United States)

    Sun, Guirong; Li, Ming; Li, Hong; Tian, Yadong; Chen, Qixin; Bai, Yichun; Kang, Xiangtao

    2013-08-01

    The pre-melanin-concentrating hormone (PMCH) gene is an important gene functionally concerning the regulations of body fat content, feeding behavior and energy balance. In this study, the full-length cDNA of chicken PMCH gene was amplified by SMART RACE method. The single nucleotide polymorphisms (SNPs) in the PMCH gene were screened by comparative sequence analysis. The obtained non-synonymous coding SNPs (ncSNPs) were designed for genotyping firstly. Its effects on growth, carcass characteristics and meat quality traits were investigated employing the F2 resource population of Gushi chicken crossed with Anak broiler by AluI CRS-PCR-RFLP. Our results indicated that the cDNA of chicken PMCH shared 67.25 and 66.47% homology with that of human and bovine PMCH, respectively. The deduced amino acid sequence of chicken PMCH (163 amino acids) were 52.07 and 50.89% identical to those of human and bovine PMCH, respectively. The PMCH protein sequence is predicted to have several functional domains, including pro-MCH, CSP, IL7, XPGI and some low complexity sequence. It has 8 phosphorylation sites and no signal peptide sequence. gga-miR-18a, gga-miR-18b, gga-miR-499 microRNA targeting site was predicted in the 3' untranslated region of chicken PMCH mRNA. In addition, a total of seven SNPs including an ncSNP and a synonymous coding SNP, were identified in the PMCH gene. The ncSNP c.81 A>T was found to be in moderate polymorphic state (polymorphic index=0.365), and the frequencies for genotype AA, AB and BB were 0.3648, 0.4682 and 0.1670, respectively. Significant associations between the locus and shear force of breast and leg were observed. This polymorphic site may serve as a useful target for the marker assisted selection of the growth and meat quality traits in chicken.

  5. Gene expression analysis in human breast cancer associated blood vessels.

    Directory of Open Access Journals (Sweden)

    Dylan T Jones

    Full Text Available Angiogenesis is essential for solid tumour growth, whilst the molecular profiles of tumour blood vessels have been reported to be different between cancer types. Although presently available anti-angiogenic strategies are providing some promise for the treatment of some cancers it is perhaps not surprisingly that, none of the anti-angiogenic agents available work on all tumours. Thus, the discovery of novel anti-angiogenic targets, relevant to individual cancer types, is required. Using Affymetrix microarray analysis of laser-captured, CD31-positive blood vessels we have identified 63 genes that are upregulated significantly (5-72 fold in angiogenic blood vessels associated with human invasive ductal carcinoma (IDC of the breast as compared with blood vessels in normal human breast. We tested the angiogenic capacity of a subset of these genes. Genes were selected based on either their known cellular functions, their enriched expression in endothelial cells and/or their sensitivity to anti-VEGF treatment; all features implicating their involvement in angiogenesis. For example, RRM2, a ribonucleotide reductase involved in DNA synthesis, was upregulated 32-fold in IDC-associated blood vessels; ATF1, a nuclear activating transcription factor involved in cellular growth and survival was upregulated 23-fold in IDC-associated blood vessels and HEX-B, a hexosaminidase involved in the breakdown of GM2 gangliosides, was upregulated 8-fold in IDC-associated blood vessels. Furthermore, in silico analysis confirmed that AFT1 and HEX-B also were enriched in endothelial cells when compared with non-endothelial cells. None of these genes have been reported previously to be involved in neovascularisation. However, our data establish that siRNA depletion of Rrm2, Atf1 or Hex-B had significant anti-angiogenic effects in VEGF-stimulated ex vivo mouse aortic ring assays. Overall, our results provide proof-of-principle that our approach can identify a cohort of

  6. Met is the most frequently amplified gene in endometriosis-associated ovarian clear cell adenocarcinoma and correlates with worsened prognosis.

    Directory of Open Access Journals (Sweden)

    Yoriko Yamashita

    Full Text Available Clear cell adenocarcinoma of the ovary (OCC is a chemo-resistant tumor with a relatively poor prognosis and is frequently associated with endometriosis. Although it is assumed that oxidative stress plays some role in the malignant transformation of this tumor, the characteristic molecular events leading to carcinogenesis remain unknown. In this study, an array-based comparative genomic hybridization (CGH analysis revealed Met gene amplification in 4/13 OCC primary tumors and 2/8 OCC cell lines. Amplification of the AKT2 gene, which is a downstream component of the Met/PI3K signaling pathway, was also observed in 5/21 samples by array-based CGH analysis. In one patient, both the Met and AKT2 genes were amplified. These findings were confirmed using fluorescence in situ hybridization, real-time quantitative PCR, immunoblotting, and immunohistochemistry. In total, 73 OCC cases were evaluated using real-time quantitative PCR; 37.0% demonstrated Met gene amplification (>4 copies, and 8.2% had AKT2 amplification. Furthermore, stage 1 and 2 patients with Met gene amplification had significantly worse survival than patients without Met gene amplification (p<0.05. Met knockdown by shRNA resulted in reduced viability of OCC cells with Met amplification due to increased apoptosis and cellular senescence, suggesting that the Met signaling pathway plays an important role in OCC carcinogenesis. Thus, we believe that targeted inhibition of the Met pathway may be a promising treatment for OCC.

  7. GeneNet Toolbox for MATLAB: a flexible platform for the analysis of gene connectivity in biological networks.

    Science.gov (United States)

    Taylor, Avigail; Steinberg, Julia; Andrews, Tallulah S; Webber, Caleb

    2015-02-01

    We present GeneNet Toolbox for MATLAB (also available as a set of standalone applications for Linux). The toolbox, available as command-line or with a graphical user interface, enables biologists to assess connectivity among a set of genes of interest ('seed-genes') within a biological network of their choosing. Two methods are implemented for calculating the significance of connectivity among seed-genes: 'seed randomization' and 'network permutation'. Options include restricting analyses to a specified subnetwork of the primary biological network, and calculating connectivity from the seed-genes to a second set of interesting genes. Pre-analysis tools help the user choose the best connectivity-analysis algorithm for their network. The toolbox also enables visualization of the connections among seed-genes. GeneNet Toolbox functions execute in reasonable time for very large networks (∼10 million edges) on a desktop computer. GeneNet Toolbox is open source and freely available from http://avigailtaylor.github.io/gntat14. Supplementary data are available at Bioinformatics online. avigail.taylor@dpag.ox.ac.uk. © The Author 2014. Published by Oxford University Press.

  8. Sparse canonical correlation analysis for identifying, connecting and completing gene-expression networks

    NARCIS (Netherlands)

    Waaijenborg, S.; Zwinderman, A.H.

    2009-01-01

    ABSTRACT: BACKGROUND: We generalized penalized canonical correlation analysis for analyzing microarray gene-expression measurements for checking completeness of known metabolic pathways and identifying candidate genes for incorporation in the pathway. We used Wold's method for calculation of the can

  9. Microarray analysis of genes affected by salt stress in tomato | Zhou ...

    African Journals Online (AJOL)

    Microarray analysis of genes affected by salt stress in tomato. ... African Journal of Environmental Science and Technology ... key enzyme genes in the metabolic pathways of carbohydrates, amino acids, and fatty acids, were also affected by ...

  10. Analysis of bacterial xylose isomerase gene diversity using gene-targeted metagenomics.

    Science.gov (United States)

    Nurdiani, Dini; Ito, Michihiro; Maruyama, Toru; Terahara, Takeshi; Mori, Tetsushi; Ugawa, Shin; Takeyama, Haruko

    2015-08-01

    Bacterial xylose isomerases (XI) are promising resources for efficient biofuel production from xylose in lignocellulosic biomass. Here, we investigated xylose isomerase gene (xylA) diversity in three soil metagenomes differing in plant vegetation and geographical location, using an amplicon pyrosequencing approach and two newly-designed primer sets. A total of 158,555 reads from three metagenomic DNA replicates for each soil sample were classified into 1127 phylotypes, detected in triplicate and defined by 90% amino acid identity. The phylotype coverage was estimated to be within the range of 84.0-92.7%. The xylA gene phylotypes obtained were phylogenetically distributed across the two known xylA groups. They shared 49-100% identities with their closest-related XI sequences in GenBank. Phylotypes demonstrating analysis, suggesting soil-specific xylA genotypes and taxonomic compositions. The differences among xylA members and their compositions in the soil were strongly correlated with 16S rRNA variation between soil samples, also assessed by amplicon pyrosequencing. This is the first report of xylA diversity in environmental samples assessed by amplicon pyrosequencing. Our data provide information regarding xylA diversity in nature, and can be a basis for the screening of novel xylA genotypes for practical applications.

  11. Molecular characterization and gene functional analysis of Dicer-2 gene from Nilaparvata lugens(Hemiptera:Geometroidea)

    Institute of Scientific and Technical Information of China (English)

    Xiao-Yi Zhang; Kai Lu; Jia-Liang Zhou; Qiang Zhou

    2013-01-01

    Nilaparvata lugens(St((A)l)(Hemiptera: Geometroidea),a serious rice pest in many countries of Asia,causes a great loss in rice production every year.RNA interference (RNAi)is a powerful technology for gene function study in insects and a potential tool for pest control.As a core component of RNAi pathway,Dicer-2(Dcr-2)protein determines the production of small interfering RNA(siRNA)and is crucial for the efficiency of RNAi.In this study,the full-length complementary DNA(cDNA)ofN.lugens Dcr-2(NlDcr-2)was first cloned and analyzed,and then the RNAi experiment was conducted to explore the function of NIDcr-2 gene.The complete Dcr-2 cDNA ofN.lugens was 4971 bp in length with an open reading frame(ORF)of 1,656 amino acids.Phylogenetic and protein domain analysis showed that the predicted NlDcr-2 protein was similar to Tribolium castaneum.In the RNAi experiment,the messenger RNA level of NIDcr-2 was significantly reduced by NlDcr-2double-stranded RNA(dsRNA)(dsDcr-2).Fifty-five per cent decrease of NlDcr-2 was found after 4 days of unremitting feeding.No significant effect was observed on the development ofN.lugens after dsRNA ingestion.

  12. Characterization of the global profile of genes expressed in cervical epithelium by Serial Analysis of Gene Expression (SAGE)

    Science.gov (United States)

    Pérez-Plasencia, Carlos; Riggins, Gregory; Vázquez-Ortiz, Guelaguetza; Moreno, José; Arreola, Hugo; Hidalgo, Alfredo; Piña-Sanchez, Patricia; Salcedo, Mauricio

    2005-01-01

    Background Serial Analysis of Gene Expression (SAGE) is a new technique that allows a detailed and profound quantitative and qualitative knowledge of gene expression profile, without previous knowledge of sequence of analyzed genes. We carried out a modification of SAGE methodology (microSAGE), useful for the analysis of limited quantities of tissue samples, on normal human cervical tissue obtained from a donor without histopathological lesions. Cervical epithelium is constituted mainly by cervical keratinocytes which are the targets of human papilloma virus (HPV), where persistent HPV infection of cervical epithelium is associated with an increase risk for developing cervical carcinomas (CC). Results We report here a transcriptome analysis of cervical tissue by SAGE, derived from 30,418 sequenced tags that provide a wealth of information about the gene products involved in normal cervical epithelium physiology, as well as genes not previously found in uterine cervix tissue involved in the process of epidermal differentiation. Conclusion This first comprehensive and profound analysis of uterine cervix transcriptome, should be useful for the identification of genes involved in normal cervix uterine function, and candidate genes associated with cervical carcinoma. PMID:16171524

  13. Characterization of the global profile of genes expressed in cervical epithelium by Serial Analysis of Gene Expression (SAGE

    Directory of Open Access Journals (Sweden)

    Piña-Sanchez Patricia

    2005-09-01

    Full Text Available Abstract Background Serial Analysis of Gene Expression (SAGE is a new technique that allows a detailed and profound quantitative and qualitative knowledge of gene expression profile, without previous knowledge of sequence of analyzed genes. We carried out a modification of SAGE methodology (microSAGE, useful for the analysis of limited quantities of tissue samples, on normal human cervical tissue obtained from a donor without histopathological lesions. Cervical epithelium is constituted mainly by cervical keratinocytes which are the targets of human papilloma virus (HPV, where persistent HPV infection of cervical epithelium is associated with an increase risk for developing cervical carcinomas (CC. Results We report here a transcriptome analysis of cervical tissue by SAGE, derived from 30,418 sequenced tags that provide a wealth of information about the gene products involved in normal cervical epithelium physiology, as well as genes not previously found in uterine cervix tissue involved in the process of epidermal differentiation. Conclusion This first comprehensive and profound analysis of uterine cervix transcriptome, should be useful for the identification of genes involved in normal cervix uterine function, and candidate genes associated with cervical carcinoma.

  14. Functional annotation and identification of candidate disease genes by computational analysis of normal tissue gene expression data.

    Directory of Open Access Journals (Sweden)

    Laura Miozzi

    Full Text Available BACKGROUND: High-throughput gene expression data can predict gene function through the "guilt by association" principle: coexpressed genes are likely to be functionally associated. METHODOLOGY/PRINCIPAL FINDINGS: We analyzed publicly available expression data on normal human tissues. The analysis is based on the integration of data obtained with two experimental platforms (microarrays and SAGE and of various measures of dissimilarity between expression profiles. The building blocks of the procedure are the Ranked Coexpression Groups (RCG, small sets of tightly coexpressed genes which are analyzed in terms of functional annotation. Functionally characterized RCGs are selected by means of the majority rule and used to predict new functional annotations. Functionally characterized RCGs are enriched in groups of genes associated to similar phenotypes. We exploit this fact to find new candidate disease genes for many OMIM phenotypes of unknown molecular origin. CONCLUSIONS/SIGNIFICANCE: We predict new functional annotations for many human genes, showing that the integration of different data sets and coexpression measures significantly improves the scope of the results. Combining gene expression data, functional annotation and known phenotype-gene associations we provide candidate genes for several genetic diseases of unknown molecular basis.

  15. ASSOCIATION ANALYSIS OF POLYMORPHISMS OF ACE GENE AND AGT GENE WITH ESSENTIAL HYPERTENSION IN CHINESE HAN'S POPULATION

    Institute of Scientific and Technical Information of China (English)

    刘英; 周文郁; 侯淑琴; 邱长春

    1998-01-01

    Objective. To investigate whether the polymorphisms in the angiotensin converting enxyme (ACE) gene and angiotensinogen (AGT) gene are associated with essential hypertension. Methods. A case-contrul study was carried out using 103 hypertensive (HT) and 131 normotensive (NT) subjects. The insertion/daletion(I/D) polymorphism of the ACE gene and the methionine→threonine variant at position 235 (M235T) of the AGT gene were determined by the polymerase chain reaction (PCR) technique and PCR/restriction fragment length polymorphism (PCR/RFLP) analysis, respectively. Results. The differences of D allele frequency and genotype distribution of the ACE gene between NT and HT groups were statistically significant (X2= 18.12,P<0. 005). The T235 allele frequeacy of the AGT gene was 69% in NT Chinese group (approximately 1.38 to 1.64 fold that in Caucasians), and was greater in female HT than in NT (0.82 vs 0. 72, X2=8. 1,P<0.025). A corralation between M235T molecular variant of the AGT gene and I/D molecular variant of ACE gene to hypertension was found. Concluions. The possession of D allele of the ACE gene might be a marker for predisposition to hypertension. The T235 allele of the AGT gene was more common in Chinese than in Caucasians, and might contribute to the risk for hypertension in female Chinese.

  16. Gene identification for risk of relapse in stage I lung adenocarcinoma patients: a combined methodology of gene expression profiling and computational gene network analysis.

    Science.gov (United States)

    Ludovini, Vienna; Bianconi, Fortunato; Siggillino, Annamaria; Piobbico, Danilo; Vannucci, Jacopo; Metro, Giulio; Chiari, Rita; Bellezza, Guido; Puma, Francesco; Della Fazia, Maria Agnese; Servillo, Giuseppe; Crinò, Lucio

    2016-05-24

    Risk assessment and treatment choice remains a challenge in early non-small-cell lung cancer (NSCLC). The aim of this study was to identify novel genes involved in the risk of early relapse (ER) compared to no relapse (NR) in resected lung adenocarcinoma (AD) patients using a combination of high throughput technology and computational analysis. We identified 18 patients (n.13 NR and n.5 ER) with stage I AD. Frozen samples of patients in ER, NR and corresponding normal lung (NL) were subjected to Microarray technology and quantitative-PCR (Q-PCR). A gene network computational analysis was performed to select predictive genes. An independent set of 79 ADs stage I samples was used to validate selected genes by Q-PCR.From microarray analysis we selected 50 genes, using the fold change ratio of ER versus NR. They were validated both in pool and individually in patient samples (ER and NR) by Q-PCR. Fourteen increased and 25 decreased genes showed a concordance between two methods. They were used to perform a computational gene network analysis that identified 4 increased (HOXA10, CLCA2, AKR1B10, FABP3) and 6 decreased (SCGB1A1, PGC, TFF1, PSCA, SPRR1B and PRSS1) genes. Moreover, in an independent dataset of ADs samples, we showed that both high FABP3 expression and low SCGB1A1 expression was associated with a worse disease-free survival (DFS).Our results indicate that it is possible to define, through gene expression and computational analysis, a characteristic gene profiling of patients with an increased risk of relapse that may become a tool for patient selection for adjuvant therapy.

  17. Gene-based analysis of regionally enriched cortical genes in GWAS data sets of cognitive traits and psychiatric disorders.

    Directory of Open Access Journals (Sweden)

    Kari M Ersland

    Full Text Available BACKGROUND: Despite its estimated high heritability, the genetic architecture leading to differences in cognitive performance remains poorly understood. Different cortical regions play important roles in normal cognitive functioning and impairment. Recently, we reported on sets of regionally enriched genes in three different cortical areas (frontomedial, temporal and occipital cortices of the adult rat brain. It has been suggested that genes preferentially, or specifically, expressed in one region or organ reflect functional specialisation. Employing a gene-based approach to the analysis, we used the regionally enriched cortical genes to mine a genome-wide association study (GWAS of the Norwegian Cognitive NeuroGenetics (NCNG sample of healthy adults for association to nine psychometric tests measures. In addition, we explored GWAS data sets for the serious psychiatric disorders schizophrenia (SCZ (n = 3 samples and bipolar affective disorder (BP (n = 3 samples, to which cognitive impairment is linked. PRINCIPAL FINDINGS: At the single gene level, the temporal cortex enriched gene RAR-related orphan receptor B (RORB showed the strongest overall association, namely to a test of verbal intelligence (Vocabulary, P = 7.7E-04. We also applied gene set enrichment analysis (GSEA to test the candidate genes, as gene sets, for enrichment of association signal in the NCNG GWAS and in GWASs of BP and of SCZ. We found that genes differentially expressed in the temporal cortex showed a significant enrichment of association signal in a test measure of non-verbal intelligence (Reasoning in the NCNG sample. CONCLUSION: Our gene-based approach suggests that RORB could be involved in verbal intelligence differences, while the genes enriched in the temporal cortex might be important to intellectual functions as measured by a test of reasoning in the healthy population. These findings warrant further replication in independent samples on cognitive traits.

  18. Gene dosage, expression, and ontology analysis identifies driver genes in the carcinogenesis and chemoradioresistance of cervical cancer.

    Science.gov (United States)

    Lando, Malin; Holden, Marit; Bergersen, Linn C; Svendsrud, Debbie H; Stokke, Trond; Sundfør, Kolbein; Glad, Ingrid K; Kristensen, Gunnar B; Lyng, Heidi

    2009-11-01

    Integrative analysis of gene dosage, expression, and ontology (GO) data was performed to discover driver genes in the carcinogenesis and chemoradioresistance of cervical cancers. Gene dosage and expression profiles of 102 locally advanced cervical cancers were generated by microarray techniques. Fifty-two of these patients were also analyzed with the Illumina expression method to confirm the gene expression results. An independent cohort of 41 patients was used for validation of gene expressions associated with clinical outcome. Statistical analysis identified 29 recurrent gains and losses and 3 losses (on 3p, 13q, 21q) associated with poor outcome after chemoradiotherapy. The intratumor heterogeneity, assessed from the gene dosage profiles, was low for these alterations, showing that they had emerged prior to many other alterations and probably were early events in carcinogenesis. Integration of the alterations with gene expression and GO data identified genes that were regulated by the alterations and revealed five biological processes that were significantly overrepresented among the affected genes: apoptosis, metabolism, macromolecule localization, translation, and transcription. Four genes on 3p (RYBP, GBE1) and 13q (FAM48A, MED4) correlated with outcome at both the gene dosage and expression level and were satisfactorily validated in the independent cohort. These integrated analyses yielded 57 candidate drivers of 24 genetic events, including novel loci responsible for chemoradioresistance. Further mapping of the connections among genetic events, drivers, and biological processes suggested that each individual event stimulates specific processes in carcinogenesis through the coordinated control of multiple genes. The present results may provide novel therapeutic opportunities of both early and advanced stage cervical cancers.

  19. Sparse canonical correlation analysis for identifying, connecting and completing gene-expression networks

    Directory of Open Access Journals (Sweden)

    Zwinderman Aeilko H

    2009-09-01

    Full Text Available Abstract Background We generalized penalized canonical correlation analysis for analyzing microarray gene-expression measurements for checking completeness of known metabolic pathways and identifying candidate genes for incorporation in the pathway. We used Wold's method for calculation of the canonical variates, and we applied ridge penalization to the regression of pathway genes on canonical variates of the non-pathway genes, and the elastic net to the regression of non-pathway genes on the canonical variates of the pathway genes. Results We performed a small simulation to illustrate the model's capability to identify new candidate genes to incorporate in the pathway: in our simulations it appeared that a gene was correctly identified if the correlation with the pathway genes was 0.3 or more. We applied the methods to a gene-expression microarray data set of 12, 209 genes measured in 45 patients with glioblastoma, and we considered genes to incorporate in the glioma-pathway: we identified more than 25 genes that correlated > 0.9 with canonical variates of the pathway genes. Conclusion We concluded that penalized canonical correlation analysis is a powerful tool to identify candidate genes in pathway analysis.

  20. Comparative analysis of gene expression by microarray analysis of male and female flowers of Asparagus officinalis.

    Science.gov (United States)

    Gao, Wu-Jun; Li, Shu-Fen; Zhang, Guo-Jun; Wang, Ning-Na; Deng, Chuan-Liang; Lu, Long-Dou

    2013-01-01

    To identify rapidly a number of genes probably involved in sex determination and differentiation of the dioecious plant Asparagus officinalis, gene expression profiles in early flower development for male and female plants were investigated by microarray assay with 8,665 probes. In total, 638 male-biased and 543 female-biased genes were identified. These genes with biased-expression for male and female were involved in a variety of processes associated with molecular functions, cellular components, and biological processes, suggesting that a complex mechanism underlies the sex development of asparagus. Among the differentially expressed genes involved in the reproductive process, a number of genes associated with floral development were identified. Reverse transcription-PCR was performed for validation, and the results were largely consistent with those obtained by microarray analysis. The findings of this study might contribute to understanding of the molecular mechanisms of sex determination and differentiation in dioecious asparagus and provide a foundation for further studies of this plant.

  1. Analysis of nicastrin gene phylogeny and expression in zebrafish.

    Science.gov (United States)

    Lim, Anne; Moussavi Nik, Seyyed Hani; Ebrahimie, Esmaeil; Lardelli, Michael

    2015-06-01

    NICASTRIN is a component of the aspartyl protease γ-secretase complex which is involved in intramembranous cleavage of type I transmembrane proteins, notably the Notch receptor proteins and the AMYLOID BETA A4 PRECURSOR PROTEIN (APP). This study aimed to characterize the orthologue of the human NICASTRIN (NCSTN) gene in zebrafish, an advantageous model organism for the study of human disease. Zebrafish Nicastrin protein was predicted to possess the conserved glutamate 333 residue and DYIGS motif of human NCSTN that are important for substrate recognition/processing in γ-secretase. Quantitative real-time RT-PCR revealed the profile of relative zebrafish nicastrin (ncstn) transcript levels in embryos at different times during development and in adult tissues. The analysis of synteny conservation revealed local rearrangements of ncstn and another gene, mpz, relative to copa, and pex19. In situ hybridization showed higher relative levels of ncstn transcripts in the developing brain and otic vesicles of embryos at 24 and 48 h post fertilization, respectively. Our observations are consistent with a role for Ncstn protein in Notch signaling within the proliferative ventricular zone of the developing central nervous system.

  2. Inverse bifurcation analysis: application to simple gene systems

    Directory of Open Access Journals (Sweden)

    Schuster Peter

    2006-07-01

    Full Text Available Abstract Background Bifurcation analysis has proven to be a powerful method for understanding the qualitative behavior of gene regulatory networks. In addition to the more traditional forward problem of determining the mapping from parameter space to the space of model behavior, the inverse problem of determining model parameters to result in certain desired properties of the bifurcation diagram provides an attractive methodology for addressing important biological problems. These include understanding how the robustness of qualitative behavior arises from system design as well as providing a way to engineer biological networks with qualitative properties. Results We demonstrate that certain inverse bifurcation problems of biological interest may be cast as optimization problems involving minimal distances of reference parameter sets to bifurcation manifolds. This formulation allows for an iterative solution procedure based on performing a sequence of eigen-system computations and one-parameter continuations of solutions, the latter being a standard capability in existing numerical bifurcation software. As applications of the proposed method, we show that the problem of maximizing regions of a given qualitative behavior as well as the reverse engineering of bistable gene switches can be modelled and efficiently solved.

  3. Analysis of gene translation using a communications theory approach.

    Science.gov (United States)

    Al Bataineh, Mohammad; Huang, Lun; Alonso, Maria; Menhart, Nick; Atkin, Guillermo E

    2010-01-01

    Rapid advances in both genomic data acquisition and computational technology have encouraged the development and use of advanced engineering methods in the field of bioinformatics and computational genomics. Processes in molecular biology can be modeled through the use of these methods. Such processes include identification and annotation of all the functional elements in the genome, including genes and regulatory sequences, which are a fundamental challenge in genomics and computational biology. Since regulatory elements are often short and variable, their identification and discovery using computational algorithms is difficult. However, significant advances have been made in the computational methods for modeling and detection of DNA regulatory elements. This paper proposes a novel use of techniques and principles from communications engineering, coding, and information theory for modeling, identification, and analysis of genomic regulatory elements and biological sequences. The methods proposed are not only able to identify regulatory elements (REs) at their exact locations, but can also "interestingly" distinguish coding from non-coding regions. Therefore, the proposed methods can be utilized to identify genes in the mRNA sequence.

  4. From array-based hybridization of Helicobacter pylori isolates to the complete genome sequence of an isolate associated with MALT lymphoma

    Directory of Open Access Journals (Sweden)

    Mégraud Francis

    2010-06-01

    Full Text Available Abstract Background elicobacter pylori infection is associated with several gastro-duodenal inflammatory diseases of various levels of severity. To determine whether certain combinations of genetic markers can be used to predict the clinical source of the infection, we analyzed well documented and geographically homogenous clinical isolates using a comparative genomics approach. Results A set of 254 H. pylori genes was used to perform array-based comparative genomic hybridization among 120 French H. pylori strains associated with chronic gastritis (n = 33, duodenal ulcers (n = 27, intestinal metaplasia (n = 17 or gastric extra-nodal marginal zone B-cell MALT lymphoma (n = 43. Hierarchical cluster analyses of the DNA hybridization values allowed us to identify a homogeneous subpopulation of strains that clustered exclusively with cagPAI minus MALT lymphoma isolates. The genome sequence of B38, a representative of this MALT lymphoma strain-cluster, was completed, fully annotated, and compared with the six previously released H. pylori genomes (i.e. J99, 26695, HPAG1, P12, G27 and Shi470. B38 has the smallest H. pylori genome described thus far (1,576,758 base pairs containing 1,528 CDSs; it contains the vacAs2m2 allele and lacks the genes encoding the major virulence factors (absence of cagPAI, babB, babC, sabB, and homB. Comparative genomics led to the identification of very few sequences that are unique to the B38 strain (9 intact CDSs and 7 pseudogenes. Pair-wise genomic synteny comparisons between B38 and the 6 H. pylori sequenced genomes revealed an almost complete co-linearity, never seen before between the genomes of strain Shi470 (a Peruvian isolate and B38. Conclusion These isolates are deprived of the main H. pylori virulence factors characterized previously, but are nonetheless associated with gastric neoplasia.

  5. Spectral analysis of Gene co-expression network of Zebrafish

    CERN Document Server

    Jalan, S; Bhojwani, J; Li, B; Zhang, L; Lan, S H; Gong, Z

    2012-01-01

    We analyze the gene expression data of Zebrafish under the combined framework of complex networks and random matrix theory. The nearest neighbor spacing distribution of the corresponding matrix spectra follows random matrix predictions of Gaussian orthogonal statistics. Based on the eigenvector analysis we can divide the spectra into two parts, first part for which the eigenvector localization properties match with the random matrix theory predictions, and the second part for which they show deviation from the theory and hence are useful to understand the system dependent properties. Spectra with the localized eigenvectors can be characterized into three groups based on the eigenvalues. We explore the position of localized nodes from these different categories. Using an overlap measure, we find that the top contributing nodes in the different groups carry distinguished structural features. Furthermore, the top contributing nodes of the different localized eigenvectors corresponding to the lower eigenvalue reg...

  6. Random matrix analysis of localization properties of gene coexpression network.

    Science.gov (United States)

    Jalan, Sarika; Solymosi, Norbert; Vattay, Gábor; Li, Baowen

    2010-04-01

    We analyze gene coexpression network under the random matrix theory framework. The nearest-neighbor spacing distribution of the adjacency matrix of this network follows Gaussian orthogonal statistics of random matrix theory (RMT). Spectral rigidity test follows random matrix prediction for a certain range and deviates afterwards. Eigenvector analysis of the network using inverse participation ratio suggests that the statistics of bulk of the eigenvalues of network is consistent with those of the real symmetric random matrix, whereas few eigenvalues are localized. Based on these IPR calculations, we can divide eigenvalues in three sets: (a) The nondegenerate part that follows RMT. (b) The nondegenerate part, at both ends and at intermediate eigenvalues, which deviates from RMT and expected to contain information about important nodes in the network. (c) The degenerate part with zero eigenvalue, which fluctuates around RMT-predicted value. We identify nodes corresponding to the dominant modes of the corresponding eigenvectors and analyze their structural properties.

  7. Genome-wide identification and analysis of the MADS-box gene family in sesame.

    Science.gov (United States)

    Wei, Xin; Wang, Linhai; Yu, Jingyin; Zhang, Yanxin; Li, Donghua; Zhang, Xiurong

    2015-09-10

    MADS-box genes encode transcription factors that play crucial roles in plant growth and development. Sesame (Sesamum indicum L.) is an oil crop that contributes to the daily oil and protein requirements of almost half of the world's population; therefore, a genome-wide analysis of the MADS-box gene family is needed. Fifty-seven MADS-box genes were identified from 14 linkage groups of the sesame genome. Analysis of phylogenetic relationships with Arabidopsis thaliana, Utricularia gibba and Solanum lycopersicum MADS-box genes was performed. Sesame MADS-box genes were clustered into four groups: 28 MIKC(c)-type, 5 MIKC(⁎)-type, 14 Mα-type and 10 Mγ-type. Gene structure analysis revealed from 1 to 22 exons of sesame MADS-box genes. The number of exons in type II MADS-box genes greatly exceeded the number in type I genes. Motif distribution analysis of sesame MADS-box genes also indicated that type II MADS-box genes contained more motifs than type I genes. These results suggested that type II sesame MADS-box genes had more complex structures. By analyzing expression profiles of MADS-box genes in seven sesame transcriptomes, we determined that MIKC(C)-type MADS-box genes played significant roles in sesame flower and seed development. Although most MADS-box genes in the same clade showed similar expression features, some gene functions were diversified from the orthologous Arabidopsis genes. This research will contribute to uncovering the role of MADS-box genes in sesame development. Copyright © 2015 Elsevier B.V. All rights reserved.

  8. Asthma susceptible genes in Chinese population: A meta-analysis

    Directory of Open Access Journals (Sweden)

    He Chao

    2010-09-01

    Full Text Available Abstract Background Published data regarding the associations between genetic variants and asthma risk in Chinese population were inconclusive. The aim of this study was to investigate asthma susceptible genes in Chinese population. Methods The authors conducted 18 meta-analyzes for 18 polymorphisms in 13 genes from eighty-two publications. Results Seven polymorphisms were found being associated with risk of asthma, namely: A Disintegrin and Metalloprotease 33 (ADAM33 T1-C/T (odds ratio [OR] = 6.07, 95% confidence interval [CI]: 2.69-13.73, Angiotensin-Converting Enzyme (ACE D/I (OR = 3.85, 95%CI: 2.49-5.94, High-affinity IgE receptor β chain (FcεRIβ -6843G/A (OR = 1.49, 95%CI: 1.01-2.22, Interleukin 13(IL-13 -1923C/T (OR = 2.99, 95%CI: 2.12-4.24, IL-13 -2044A/G (OR = 1.49, 95%CI: 1.07-2.08, Regulated upon Activation, Normal T cell Expressed and Secreted (RANTES -28C/G (OR = 1.64, 95%CI: 1.09-2.46, Tumor Necrosis Factor-α (TNF-α -308G/A(OR = 1.42, 95%CI: 1.09, 1.85. After subgroup analysis by age, the ACE D/I, β2-Adrenergic Receptor (β2-AR -79G/C, TNF-α -308G/A, Interleukin 4 receptor(IL-4R -1902G/A and IL-13 -1923C/T polymorphisms were found significantly associated with asthma risk in Chinese children. In addition, the ACE D/I, FcεRIβ -6843G/A, TNF-α -308G/A, IL-13 -1923C/T and IL-13 -2044A/G polymorphisms were associated with asthma risk in Chinese adults. Conclusion ADAM33, FcεRIβ, RANTES, TNF-α, ACE, β2-AR, IL-4R and IL-13 genes could be proposed as asthma susceptible genes in Chinese population. Given the limited number of studies, more data are required to validate these associations.

  9. Cartilage-selective genes identified in genome-scale analysis of non-cartilage and cartilage gene expression

    Directory of Open Access Journals (Sweden)

    Cohn Zachary A

    2007-06-01

    Full Text Available Abstract Background Cartilage plays a fundamental role in the development of the human skeleton. Early in embryogenesis, mesenchymal cells condense and differentiate into chondrocytes to shape the early skeleton. Subsequently, the cartilage anlagen differentiate to form the growth plates, which are responsible for linear bone growth, and the articular chondrocytes, which facilitate joint function. However, despite the multiplicity of roles of cartilage during human fetal life, surprisingly little is known about its transcriptome. To address this, a whole genome microarray expression profile was generated using RNA isolated from 18–22 week human distal femur fetal cartilage and compared with a database of control normal human tissues aggregated at UCLA, termed Celsius. Results 161 cartilage-selective genes were identified, defined as genes significantly expressed in cartilage with low expression and little variation across a panel of 34 non-cartilage tissues. Among these 161 genes were cartilage-specific genes such as cartilage collagen genes and 25 genes which have been associated with skeletal phenotypes in humans and/or mice. Many of the other cartilage-selective genes do not have established roles in cartilage or are novel, unannotated genes. Quantitative RT-PCR confirmed the unique pattern of gene expression observed by microarray analysis. Conclusion Defining the gene expression pattern for cartilage has identified new genes that may contribute to human skeletogenesis as well as provided further candidate genes for skeletal dysplasias. The data suggest that fetal cartilage is a complex and transcriptionally active tissue and demonstrate that the set of genes selectively expressed in the tissue has been greatly underestimated.

  10. Identification and testing of reference genes for Sesame gene expression analysis by quantitative real-time PCR.

    Science.gov (United States)

    Wei, Libin; Miao, Hongmei; Zhao, Ruihong; Han, Xiuhua; Zhang, Tide; Zhang, Haiyang

    2013-03-01

    Sesame (Sesamum indicum L.) is an ancient and important oilseed crop. However, few sesame reference genes have been selected for quantitative real-time PCR until now. Screening and validating reference genes is a requisite for gene expression normalization in sesame functional genomics research. In this study, ten candidate reference genes, i.e., SiACT, SiUBQ6, SiTUB, Si18S rRNA, SiEF1α, SiCYP, SiHistone, SiDNAJ, SiAPT and SiGAPDH, were chosen and examined systematically in 32 sesame samples. Three qRT-PCR analysis methods, i.e., geNorm, NormFinder and BestKeeper, were evaluated systematically. Results indicated that all ten candidate reference genes could be used as reference genes in sesame. SiUBQ6 and SiAPT were the optimal reference genes for sesame plant development; SiTUB was suitable for sesame vegetative tissue development, SiDNAJ for pathogen treatment, SiHistone for abiotic stress, SiUBQ6 for bud development and SiACT for seed germination. As for hormone treatment and seed development, SiHistone, SiCYP, SiDNAJ or SiUBQ6, as well as SiACT, SiDNAJ, SiTUB or SiAPT, could be used as reference gene, respectively. To illustrate the suitability of these reference genes, we analyzed the expression variation of three functional sesame genes of SiSS, SiLEA and SiGH in different organs using the optimal qRT-PCR system for the first time. The stability levels of optimal and worst reference genes screened for seed development, anther sterility and plant development were validated in the qRT-PCR normalization. Our results provided a reference gene application guideline for sesame gene expression characterization using qRT-PCR system.

  11. MDM2 SNP309, gene-gene interaction, and tumor susceptibility: an updated meta-analysis

    Directory of Open Access Journals (Sweden)

    Wu Wei

    2011-05-01

    Full Text Available Abstract Background The tumor suppressor gene p53 is involved in multiple cellular pathways including apoptosis, transcriptional control, and cell cycle regulation. In the last decade it has been demonstrated that the single nucleotide polymorphism (SNP at codon 72 of the p53 gene is associated with the risk for development of various neoplasms. MDM2 SNP309 is a single nucleotide T to G polymorphism located in the MDM2 gene promoter. From the time that this well-characterized functional polymorphism was identified, a variety of case-control studies have been published that investigate the possible association between MDM2 SNP309 and cancer risk. However, the results of the published studies, as well as the subsequent meta-analyses, remain contradictory. Methods To investigate whether currently published epidemiological studies can clarify the potential interaction between MDM2 SNP309 and the functional genetic variant in p53 codon72 (Arg72Pro and p53 mutation status, we performed a meta-analysis of the risk estimate on 27,813 cases with various tumor types and 30,295 controls. Results The data we reviewed indicated that variant homozygote 309GG and heterozygote 309TG were associated with a significant increased risk of all tumor types (homozygote comparison: odds ratio (OR = 1.25, 95% confidence interval (CI = 1.13-1.37; heterozygote comparison: OR = 1.10, 95% CI = 1.03-1.17. We also found that the combination of GG and Pro/Pro, TG and Pro/Pro, GG and Arg/Arg significantly increased the risk of cancer (OR = 3.38, 95% CI = 1.77-6.47; OR = 1.88, 95% CI = 1.26-2.81; OR = 1.96, 95% CI = 1.01-3.78, respectively. In a stratified analysis by tumor location, we also found a significant increased risk in brain, liver, stomach and uterus cancer (OR = 1.47, 95% CI = 1.06-2.03; OR = 2.24, 95%CI = 1.57-3.18; OR = 1.54, 95%CI = 1.04-2.29; OR = 1.34, 95%CI = 1.07-1.29, respectively. However, no association was seen between MDM2 SNP309 and tumor susceptibility

  12. Gene Dosage Analysis in a Clinical Environment: Gene-Targeted Microarrays as the Platform-of-Choice

    Directory of Open Access Journals (Sweden)

    Donald R. Love

    2013-03-01

    Full Text Available The role of gene deletion and duplication in the aetiology of disease has become increasingly evident over the last decade. In addition to the classical deletion/duplication disorders diagnosed using molecular techniques, such as Duchenne Muscular Dystrophy and Charcot-Marie-Tooth Neuropathy Type 1A, the significance of partial or whole gene deletions in the pathogenesis of a large number single-gene disorders is becoming more apparent. A variety of dosage analysis methods are available to the diagnostic laboratory but the widespread application of many of these techniques is limited by the expense of the kits/reagents and restrictive targeting to a particular gene or portion of a gene. These limitations are particularly important in the context of a small diagnostic laboratory with modest sample throughput. We have developed a gene-targeted, custom-designed comparative genomic hybridisation (CGH array that allows twelve clinical samples to be interrogated simultaneously for exonic deletions/duplications within any gene (or panel of genes on the array. We report here on the use of the array in the analysis of a series of clinical samples processed by our laboratory over a twelve-month period. The array has proven itself to be robust, flexible and highly suited to the diagnostic environment.

  13. Comparison of buccal and blood-derived canine DNA, either native or whole genome amplified, for array-based genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Lawley Cynthia

    2011-06-01

    Full Text Available Abstract Background The availability of array-based genotyping platforms for single nucleotide polymorphisms (SNPs for the canine genome has expanded the opportunities to undertake genome-wide association (GWA studies to identify the genetic basis for Mendelian and complex traits. Whole blood as the source of high quality DNA is undisputed but often proves impractical for collection of the large numbers of samples necessary to discover the loci underlying complex traits. Further, many countries prohibit the collection of blood from dogs unless medically necessary thereby restricting access to critical control samples from healthy dogs. Alternate sources of DNA, typically from buccal cytobrush extractions, while convenient, have been suggested to have low yield and perform poorly in GWA. Yet buccal cytobrushes provide a cost-effective means of collecting DNA, are readily accepted by dog owners, and represent a large resource base in many canine genetics laboratories. To increase the DNA quantities, whole genome amplification (WGA can be performed. Thus, the present study assessed the utility of buccal-derived DNA as well as whole genome amplification in comparison to blood samples for use on the most recent iteration of the canine HD SNP array (Illumina. Findings In both buccal and blood samples, whether whole genome amplified or not, 97% of the samples had SNP call rates in excess of 80% indicating that the vast majority of the SNPs would be suitable to perform association studies regardless of the DNA source. Similarly, there were no significant differences in marker intensity measurements between buccal and blood samples for copy number variations (CNV analysis. Conclusions All DNA samples assayed, buccal or blood, native or whole genome amplified, are appropriate for use in array-based genome-wide association studies. The concordance between subsets of dogs for which both buccal and blood samples, or those samples whole genome amplified, was

  14. Microarray analysis of potential genes in the pathogenesis of recurrent oral ulcer.

    Science.gov (United States)

    Han, Jingying; He, Zhiwei; Li, Kun; Hou, Lu

    2015-01-01

    Recurrent oral ulcer seriously threatens patients' daily life and health. This study investigated potential genes and pathways that participate in the pathogenesis of recurrent oral ulcer by high throughput bioinformatic analysis. RT-PCR and Western blot were applied to further verify screened interleukins effect. Recurrent oral ulcer related genes were collected from websites and papers, and further found out from Human Genome 280 6.0 microarray data. Each pathway of recurrent oral ulcer related genes were got through chip hybridization. RT-PCR was applied to test four recurrent oral ulcer related genes to verify the microarray data. Data transformation, scatter plot, clustering analysis, and expression pattern analysis were used to analyze recurrent oral ulcer related gene expression changes. Recurrent oral ulcer gene microarray was successfully established. Microarray showed that 551 genes involved in recurrent oral ulcer activity and 196 genes were recurrent oral ulcer related genes. Of them, 76 genes up-regulated, 62 genes down-regulated, and 58 genes up-/down-regulated. Total expression level up-regulated 752 times (60%) and down-regulated 485 times (40%). IL-2 plays an important role in the occurrence, development and recurrence of recurrent oral ulcer on the mRNA and protein levels. Gene microarray can be used to analyze potential genes and pathways in recurrent oral ulcer. IL-2 may be involved in the pathogenesis of recurrent oral ulcer.

  15. GeneAnalytics Pathway Analysis and Genetic Overlap among Autism Spectrum Disorder, Bipolar Disorder and Schizophrenia

    Directory of Open Access Journals (Sweden)

    Naveen S. Khanzada

    2017-02-01

    Full Text Available Bipolar disorder (BPD and schizophrenia (SCH show similar neuropsychiatric behavioral disturbances, including impaired social interaction and communication, seen in autism spectrum disorder (ASD with multiple overlapping genetic and environmental influences implicated in risk and course of illness. GeneAnalytics software was used for pathway analysis and genetic profiling to characterize common susceptibility genes obtained from published lists for ASD (792 genes, BPD (290 genes and SCH (560 genes. Rank scores were derived from the number and nature of overlapping genes, gene-disease association, tissue specificity and gene functions subdivided into categories (e.g., diseases, tissues or functional pathways. Twenty-three genes were common to all three disorders and mapped to nine biological Superpathways including Circadian entrainment (10 genes, score = 37.0, Amphetamine addiction (five genes, score = 24.2, and Sudden infant death syndrome (six genes, score = 24.1. Brain tissues included the medulla oblongata (11 genes, score = 2.1, thalamus (10 genes, score = 2.0 and hypothalamus (nine genes, score = 2.0 with six common genes (BDNF, DRD2, CHRNA7, HTR2A, SLC6A3, and TPH2. Overlapping genes impacted dopamine and serotonin homeostasis and signal transduction pathways, impacting mood, behavior and physical activity level. Converging effects on pathways governing circadian rhythms support a core etiological relationship between neuropsychiatric illnesses and sleep disruption with hypoxia and central brain stem dysfunction.

  16. An approach for the identification of targets specific to bone metastasis using cancer genes interactome and gene ontology analysis.

    Science.gov (United States)

    Vashisht, Shikha; Bagler, Ganesh

    2012-01-01

    Metastasis is one of the most enigmatic aspects of cancer pathogenesis and is a major cause of cancer-associated mortality. Secondary bone cancer (SBC) is a complex disease caused by metastasis of tumor cells from their primary site and is characterized by intricate interplay of molecular interactions. Identification of targets for multifactorial diseases such as SBC, the most frequent complication of breast and prostate cancers, is a challenge. Towards achieving our aim of identification of targets specific to SBC, we constructed a 'Cancer Genes Network', a representative protein interactome of cancer genes. Using graph theoretical methods, we obtained a set of key genes that are relevant for generic mechanisms of cancers and have a role in biological essentiality. We also compiled a curated dataset of 391 SBC genes from published literature which serves as a basis of ontological correlates of secondary bone cancer. Building on these results, we implement a strategy based on generic cancer genes, SBC genes and gene ontology enrichment method, to obtain a set of targets that are specific to bone metastasis. Through this study, we present an approach for probing one of the major complications in cancers, namely, metastasis. The results on genes that play generic roles in cancer phenotype, obtained by network analysis of 'Cancer Genes Network', have broader implications in understanding the role of molecular regulators in mechanisms of cancers. Specifically, our study provides a set of potential targets that are of ontological and regulatory relevance to secondary bone cancer.

  17. An approach for the identification of targets specific to bone metastasis using cancer genes interactome and gene ontology analysis.

    Directory of Open Access Journals (Sweden)

    Shikha Vashisht

    Full Text Available Metastasis is one of the most enigmatic aspects of cancer pathogenesis and is a major cause of cancer-associated mortality. Secondary bone cancer (SBC is a complex disease caused by metastasis of tumor cells from their primary site and is characterized by intricate interplay of molecular interactions. Identification of targets for multifactorial diseases such as SBC, the most frequent complication of breast and prostate cancers, is a challenge. Towards achieving our aim of identification of targets specific to SBC, we constructed a 'Cancer Genes Network', a representative protein interactome of cancer genes. Using graph theoretical methods, we obtained a set of key genes that are relevant for generic mechanisms of cancers and have a role in biological essentiality. We also compiled a curated dataset of 391 SBC genes from published literature which serves as a basis of ontological correlates of secondary bone cancer. Building on these results, we implement a strategy based on generic cancer genes, SBC genes and gene ontology enrichment method, to obtain a set of targets that are specific to bone metastasis. Through this study, we present an approach for probing one of the major complications in cancers, namely, metastasis. The results on genes that play generic roles in cancer phenotype, obtained by network analysis of 'Cancer Genes Network', have broader implications in understanding the role of molecular regulators in mechanisms of cancers. Specifically, our study provides a set of potential targets that are of ontological and regulatory relevance to secondary bone cancer.

  18. Bioinformatics analysis to screen the key prognostic genes in ovarian cancer.

    Science.gov (United States)

    Li, Li; Cai, Shengyun; Liu, Shengnan; Feng, Hao; Zhang, Junjie

    2017-04-13

    Ovarian cancer (OC) is a gynecological oncology that has a poor prognosis and high mortality. This study is conducted to identify the key genes implicated in the prognosis of OC by bioinformatic analysis. Gene expression data (including 568 primary OC tissues, 17 recurrent OC tissues, and 8 adjacent normal tissues) and the relevant clinical information of OC patients were downloaded from The Cancer Genome Atlas database. After data preprocessing, cluster analysis was conducted using the ConsensusClusterPlus package in R. Using the limma package in R, differential analysis was performed to identify feature genes. Based on Kaplan-Meier (KM) survival analysis, prognostic seed genes were selected from the feature genes. After key prognostic genes were further screened by cluster analysis and KM survival analysis, they were performed functional enrichment analysis and multivariate survival analysis. Using the survival package in R, cox regression analysis was conducted for the microarray data of GSE17260 to validate the key prognostic genes. A total of 3668 feature genes were obtained, among which 75 genes were identified as prognostic seed genes. Then, 25 key prognostic genes were screened, including AXL, FOS, KLF6, WDR77, DUSP1, GADD45B, and SLIT3. Especially, AXL and SLIT3 were enriched in ovulation cycle. Multivariate survival analysis showed that the key prognostic genes could effectively differentiate the samples and were significantly associated with prognosis. Additionally, GSE17260 confirmed that the key prognostic genes were associated with the prognosis of OC. AXL, FOS, KLF6, WDR77, DUSP1, GADD45B, and SLIT3 might affect the prognosis of OC.

  19. Development of gene diagnosis for diabetes and cholecystitis based on gene analysis of CCK-A receptor

    Energy Technology Data Exchange (ETDEWEB)

    Kono, Akira [National Kyushu Cancer Center, Fukuoka (Japan)

    1999-02-01

    Base sequence analysis of CCKAR gene (a gene of A-type receptor for cholecystokinin) from OLETF rat, a model rat for insulin-independent diabetes was made based on the base sequence of wild CCKAR gene, which had been clarified in the previous year. From the pancreas of OLETF rat, DNA was extracted and transduced into {lambda}phage after fragmentation to construct the gene library of OLETF. Then, {lambda}phage DNA clone bound with labelled cDNA of CCKAR gene was analyzed and the gene structure was compared with that of the wild gene. It was demonstrated that CCKAR gene of OLETF had a deletion (6800 b.p.) ranging from the promoter region to the Exon 2, suggesting that CCKAR gene is not functional in OLETF rat. The whole sequence of this mutant gene was registered into Japan DNA Bank (D 50610). Then, F{sub 2} offspring rats were obtained through crossing OLETF (female) and F344 (male) and the time course-changes in the blood glucose level after glucose loading were compared among them. The blood glucose level after glucose loading was significantly higher in the homo-mutant F{sub 2} (CCKAR,-/-) as well as the parent OLETF rat than hetero-mutant F{sub 2} (CCKARm-/+) or the wild rat (CCKAR,+/+). This suggests that CCKAR gene might be involved in the control of blood glucose level and an alteration of the expression level or the functions of CCKAR gene might affect the blood glucose level. (M.N.)

  20. Advantages of Array-Based Technologies for Pre-Emptive Pharmacogenomics Testing

    Directory of Open Access Journals (Sweden)

    Al Shahandeh

    2016-05-01

    Full Text Available As recognised by the National Institutes of Health (NIH Precision Medicine Initiative (PMI, microarray technology currently provides a rapid, inexpensive means of identifying large numbers of known genomic variants or gene transcripts in experimental and clinical settings. However new generation sequencing techniques are now being introduced in many clinical genetic contexts, particularly where novel mutations are involved. While these methods can be valuable for screening a restricted set of genes for known or novel mutations, implementation of whole genome sequencing in clinical practice continues to present challenges. Even very accurate high-throughput methods with small error rates can generate large numbers of false negative or false positive errors due to the high numbers of simultaneous readings. Additional validation is likely to be required for safe use of any such methods in clinical settings. Custom-designed arrays can offer advantages for screening for common, known mutations and, in this context, may currently be better suited for accredited, quality-controlled clinical genetic screening services, as illustrated by their successful application in several large-scale pre-emptive pharmacogenomics programs now underway. Excessive, inappropriate use of next-generation sequencing may waste scarce research funds and other resources. Microarrays presently remain the technology of choice in applications that require fast, cost-effective genome-wide screening of variants of known importance, particularly for large sample sizes. This commentary considers some of the applications where microarrays continue to offer advantages over next-generation sequencing technologies.

  1. Analysis of snail genes in the crustacean Parhyale hawaiensis: insight into snail gene family evolution.

    Science.gov (United States)

    Hannibal, Roberta L; Price, Alivia L; Parchem, Ronald J; Patel, Nipam H

    2012-05-01

    The transcriptional repressor snail was first discovered in Drosophila melanogaster, where it initially plays a role in gastrulation and mesoderm formation, and later plays a role in neurogenesis. Among arthropods, this role of snail appears to be conserved in the insects Tribolium and Anopheles gambiae, but not in the chelicerates Cupiennius salei and Achaearanea tepidariorum, the myriapod Glomeris marginata, or the Branchiopod crustacean Daphnia magna. These data imply that within arthropoda, snail acquired its role in gastrulation and mesoderm formation in the insect lineage. However, crustaceans are a diverse group with several major taxa, making analysis of more crustaceans necessary to potentially understand the ancestral role of snail in Pancrustacea (crustaceans + insects) and thus in the ancestor of insects as well. To address these questions, we examined the snail family in the Malacostracan crustacean Parhyale hawaiensis. We found three snail homologs, Ph-snail1, Ph-snail2 and Ph-snail3, and one scratch homolog, Ph-scratch. Parhyale snail genes are expressed after gastrulation, during germband formation and elongation. Ph-snail1, Ph-snail2, and Ph-snail3 are expressed in distinct patterns in the neuroectoderm. Ph-snail1 is the only Parhyale snail gene expressed in the mesoderm, where its expression cycles in the mesodermal stem cells, called mesoteloblasts. The mesoteloblasts go through a series of cycles, where each cycle is composed of a migration phase and a division phase. Ph-snail1 is expressed during the migration phase, but not during the division phase. We found that as each mesoteloblast division produces one segment's worth of mesoderm, Ph-snail1 expression is linked to both the cell cycle and the segmental production of mesoderm.

  2. Refining transcriptional programs in kidney development by integration of deep RNA-sequencing and array-based spatial profiling

    Directory of Open Access Journals (Sweden)

    Rumballe Bree A

    2011-09-01

    Full Text Available Abstract Background The developing mouse kidney is currently the best-characterized model of organogenesis at a transcriptional level. Detailed spatial maps have been generated for gene expression profiling combined with systematic in situ screening. These studies, however, fall short of capturing the transcriptional complexity arising from each locus due to the limited scope of microarray-based technology, which is largely based on "gene-centric" models. Results To address this, the polyadenylated RNA and microRNA transcriptomes of the 15.5 dpc mouse kidney were profiled using strand-specific RNA-sequencing (RNA-Seq to a depth sufficient to complement spatial maps from pre-existing microarray datasets. The transcriptional complexity of RNAs arising from mouse RefSeq loci was catalogued; including 3568 alternatively spliced transcripts and 532 uncharacterized alternate 3' UTRs. Antisense expressions for 60% of RefSeq genes was also detected including uncharacterized non-coding transcripts overlapping kidney progenitor markers, Six2 and Sall1, and were validated by section in situ hybridization. Analysis of genes known to be involved in kidney development, particularly during mesenchymal-to-epithelial transition, showed an enrichment of non-coding antisense transcripts extended along protein-coding RNAs. Conclusion The resulting resource further refines the transcriptomic cartography of kidney organogenesis by integrating deep RNA sequencing data with locus-based information from previously published expression atlases. The added resolution of RNA-Seq has provided the basis for a transition from classical gene-centric models of kidney development towards more accurate and detailed "transcript-centric" representations, which highlights the extent of transcriptional complexity of genes that direct complex development events.

  3. A Preliminary Genetic Analysis of Complement 3 Gene and Schizophrenia.

    Directory of Open Access Journals (Sweden)

    Jianliang Ni

    Full Text Available Complement pathway activation was found to occur frequently in schizophrenia, and complement 3 (C3 plays a major role in this process. Previous studies have provided evidence for the possible role of C3 in the development of schizophrenia. In this study, we hypothesized that the gene encoding C3 (C3 may confer susceptibility to schizophrenia in Han Chinese. We analyzed 7 common single nucleotide polymorphisms (SNPs of C3 in 647 schizophrenia patients and 687 healthy controls. Peripheral C3 mRNA expression level was measured in 23 drug-naïve patients with schizophrenia and 24 controls. Two SNPs (rs1047286 and rs2250656 that deviated from Hardy-Weinberg equilibrium were excluded for further analysis. Among the remaining 5 SNPs, there was no significant difference in allele and genotype frequencies between the patient and control groups. Logistic regression analysis showed no significant SNP-gender interaction in either dominant model or recessive model. There was no significant difference in the level of peripheral C3 expression between the drug-naïve schizophrenia patients and healthy controls. In conclusion, the results of this study do not support C3 as a major genetic susceptibility factor in schizophrenia. Other factors in AP may have critical roles in schizophrenia and be worthy of further investigation.

  4. Paralogous histidine biosynthetic genes: evolutionary analysis of the Saccharomyces cerevisiae HIS6 and HIS7 genes.

    Science.gov (United States)

    Fani, R; Tamburini, E; Mori, E; Lazcano, A; Liò, P; Barberio, C; Casalone, E; Cavalieri, D; Perito, B; Polsinelli, M

    1997-09-15

    The HIS6 gene from Saccharomyces cerevisiae strain YNN282 is able to complement both the S. cerevisiae his6 and the Escherichia coli hisA mutations. The cloning and the nucleotide sequence indicated that this gene encodes a putative phosphoribosyl-5-amino-1-phosphoribosyl-4-imidazolecarboxiamide isomerase (5' Pro-FAR isomerase, EC 5.3.1.16) of 261 amino acids, with a molecular weight of 29,554. The HIS6 gene product shares a significant degree of sequence similarity with the prokaryotic HisA proteins and HisF proteins, and with the C-terminal domain of the S. cerevisiae HIS7 protein (homologous to HisF), indicating that the yeast HIS6 and HIS7 genes are paralogous. Moreover, the HIS6 gene is organized into two homologous modules half the size of the entire gene, typical of all the known prokaryotic hisA and hisF genes. The structure of the yeast HIS6 gene supports the two-step evolutionary model suggested by Fani et al. (J. Mol. Evol. 1994; 38: 489-495) to explain the present-day hisA and hisF genes. According to this idea, the hisF gene originated from the duplication of an ancestral hisA gene which, in turn, was the result of an earlier gene elongation event involving an ancestral module half the size of the extant gene. Results reported in this paper also suggest that these two successive paralogous gene duplications took probably place in the early steps of molecular evolution of the histidine pathway, well before the diversification of the three domains, and that this pathway was one of the metabolic activities of the last common ancestor. The molecular evolution of the yeast HIS6 and HIS7 genes is also discussed.

  5. Comparative analysis of Haemophilus influenzae hifA (pilin) genes.

    Science.gov (United States)

    Clemans, D L; Marrs, C F; Patel, M; Duncan, M; Gilsdorf, J R

    1998-02-01

    Adherence of Haemophilus influenzae to epithelial cells plays a central role in colonization and is the first step in infection with this organism. Pili, which are large polymorphic surface proteins, have been shown to mediate the binding of H. influenzae to cells of the human respiratory tract. Earlier experiments have demonstrated that the major epitopes of H. influenzae pili are highly conformational and immunologically heterogenous; their subunit pilins are, however, immunologically homogenous. To define the extent of structural variation in pilins, which polymerize to form pili, the pilin genes (hifA) of 26 type a to f and 16 nontypeable strains of H. influenzae were amplified by PCR and subjected to restriction fragment length polymorphism (RFLP) analysis with AluI and RsaI. Six different RFLP patterns were identified. Four further RFLP patterns were identified from published hifA sequences from five nontypeable H. influenzae strains. Two patterns contained only nontypeable isolates; one of these contained H. influenzae biotype aegyptius strains F3031 and F3037. Another pattern contained predominantly H. influenzae type f strains. All other patterns were displayed by a variety of capsular and noncapsular types. Sequence analysis of selected hifA genes confirmed the 10 RFLP patterns and showed strong identity among representatives displaying the same RFLP patterns. In addition, the immunologic reactivity of pili with antipilus antisera correlated with the groupings of strains based on hifA RFLP patterns. Those strains that show greater reactivity with antiserum directed against H. influenzae type b strain M43 pili tend to fall into one RFLP pattern (pattern 3); while those strains that show equal or greater reactivity with antiserum directed against H. influenzae type b strain Eagan pili tend to fall in a different RFLP pattern (pattern 1). Sequence analysis of representative HifA pilins from typeable and nontypeable H. influenzae identified several highly

  6. Logic design for array-based circuits a structured design methodology

    CERN Document Server

    White, D E

    1992-01-01

    This book will show you how to approach the design covering everything from the circuit specification to the final design acceptance, including what support you can expect, sizing, timing analysis, power and packaging, various simulations, design verification, and design submission.

  7. Prioritization of Susceptibility Genes for Ectopic Pregnancy by Gene Network Analysis.

    Science.gov (United States)

    Liu, Ji-Long; Zhao, Miao

    2016-02-01

    Ectopic pregnancy is a very dangerous complication of pregnancy, affecting 1%-2% of all reported pregnancies. Due to ethical constraints on human biopsies and the lack of suitable animal models, there has been little success in identifying functionally important genes in the pathogenesis of ectopic pregnancy. In the present study, we developed a random walk-based computational method named TM-rank to prioritize ectopic pregnancy-related genes based on text mining data and gene network information. Using a defined threshold value, we identified five top-ranked genes: VEGFA (vascular endothelial growth factor A), IL8 (interleukin 8), IL6 (interleukin 6), ESR1 (estrogen receptor 1) and EGFR (epidermal growth factor receptor). These genes are promising candidate genes that can serve as useful diagnostic biomarkers and therapeutic targets. Our approach represents a novel strategy for prioritizing disease susceptibility genes.

  8. A combined analysis of microarray gene expression studies of the human prefrontal cortex identifies genes implicated in schizophrenia.

    Science.gov (United States)

    Pérez-Santiago, Josué; Diez-Alarcia, Rebeca; Callado, Luis F; Zhang, Jin X; Chana, Gursharan; White, Cory H; Glatt, Stephen J; Tsuang, Ming T; Everall, Ian P; Meana, J Javier; Woelk, Christopher H

    2012-11-01

    Small cohort sizes and modest levels of gene expression changes in brain tissue have plagued the statistical approaches employed in microarray studies investigating the mechanism of schizophrenia. To combat these problems a combined analysis of six prior microarray studies was performed to facilitate the robust statistical analysis of gene expression data from the dorsolateral prefrontal cortex of 107 patients with schizophrenia and 118 healthy subjects. Multivariate permutation tests identified 144 genes that were differentially expressed between schizophrenia and control groups. Seventy of these genes were identified as differentially expressed in at least one component microarray study but none of these individual studies had the power to identify the remaining 74 genes, demonstrating the utility of a combined approach. Gene ontology terms and biological pathways that were significantly enriched for differentially expressed genes were related to neuronal cell-cell signaling, mesenchymal induction, and mitogen-activated protein kinase signaling, which have all previously been associated with the etiopathogenesis of schizophrenia. The differential expression of BAG3, C4B, EGR1, MT1X, NEUROD6, SST and S100A8 was confirmed by real-time quantitative PCR in an independent cohort using postmortem human prefrontal cortex samples. Comparison of gene expression between schizophrenic subjects with and without detectable levels of antipsychotics in their blood suggests that the modulation of MT1X and S100A8 may be the result of drug exposure. In conclusion, this combined analysis has resulted in a statistically robust identification of genes whose dysregulation may contribute to the mechanism of schizophrenia.

  9. FTO gene polymorphisms and obesity risk: a meta-analysis

    Directory of Open Access Journals (Sweden)

    Li Xiaobo

    2011-06-01

    Full Text Available Abstract Background The pathogenesis of obesity is reportedly related to variations in the fat mass and an obesity-associated gene (FTO; however, as the number of reports increases, particularly with respect to varying ethnicities, there is a need to determine more precisely the effect sizes in each ethnic group. In addition, some reports have claimed ethnic-specific associations with alternative SNPs, and to that end there has been a degree of confusion. Methods We searched PubMed, MEDLINE, Web of Science, EMBASE, and BIOSIS Preview to identify studies investigating the associations between the five polymorphisms and obesity risk. Individual study odds ratios (OR and their 95% confidence intervals (CI were estimated using per-allele comparison. Summary ORs were estimated using a random effects model. Results We identified 59 eligible case-control studies in 27 articles, investigating 41,734 obesity cases and 69,837 healthy controls. Significant associations were detected between obesity risk and the five polymorphisms: rs9939609 (OR: 1.31, 95% CI: 1.26 to 1.36, rs1421085 (OR: 1.43, 95% CI: 1.33 to 1.53, rs8050136 (OR: 1.25, 95% CI: 1.13 to 1.38, rs17817449 (OR: 1.54, 95% CI: 1.41 to 1.68, and rs1121980 (OR: 1.34, 95% CI: 1.10 to 1.62. Begg's and Egger's tests provided no evidence of publication bias for the polymorphisms except rs1121980. There is evidence of higher heterogeneity, with I2 test values ranging from 38.1% to 84.5%. Conclusions This meta-analysis suggests that FTO may represent a low-penetrance susceptible gene for obesity risk. Individual studies with large sample size are needed to further evaluate the associations between the polymorphisms and obesity risk in various ethnic populations.

  10. Literature Mining and Ontology based Analysis of Host-Brucella Gene-Gene Interaction Network.

    Science.gov (United States)

    Karadeniz, İlknur; Hur, Junguk; He, Yongqun; Özgür, Arzucan

    2015-01-01

    Brucella is an intracellular bacterium that causes chronic brucellosis in humans and various mammals. The identification of host-Brucella interaction is crucial to understand host immunity against Brucella infection and Brucella pathogenesis against host immune responses. Most of the information about the inter-species interactions between host and Brucella genes is only available in the text of the scientific publications. Many text-mining systems for extracting gene and protein interactions have been proposed. However, only a few of them have been designed by considering the peculiarities of host-pathogen interactions. In this paper, we used a text mining approach for extracting host-Brucella gene-gene interactions from the abstracts of articles in PubMed. The gene-gene interactions here represent the interactions between genes and/or gene products (e.g., proteins). The SciMiner tool, originally designed for detecting mammalian gene/protein names in text, was extended to identify host and Brucella gene/protein names in the abstracts. Next, sentence-level and abstract-level co-occurrence based approaches, as well as sentence-level machine learning based methods, originally designed for extracting intra-species gene interactions, were utilized to extract the interactions among the identified host and Brucella genes. The extracted interactions were manually evaluated. A total of 46 host-Brucella gene interactions were identified and represented as an interaction network. Twenty four of these interactions were identified from sentence-level processing. Twenty two additional interactions were identified when abstract-level processing was performed. The Interaction Network Ontology (INO) was used to represent the identified interaction types at a hierarchical ontology structure. Ontological modeling of specific gene-gene interactions demonstrates that host-pathogen gene-gene interactions occur at experimental conditions which can be ontologically represented. Our

  11. Phylogenetic and evolutionary analysis of NBS-encoding genes in Rutaceae fruit crops.

    Science.gov (United States)

    Xu, Qiang; Biswas, Manosh Kumar; Lan, Hong; Zeng, Wenfang; Liu, Chaoyang; Xu, Jidi; Deng, Xiuxin

    2011-02-01

    The nucleotide-binding site leucine-rich repeat (NBS-LRR) genes are the largest class of disease resistance genes in plants. However, our understanding of the evolution of NBS-LRR genes in Rutaceae fruit crops is rather limited. We report an evolutionary study of 103 NBS-encoding genes isolated from Poncirus trifoliata (trifoliate orange), Citrus reticulata (tangerine) and their F(1) progeny. In all, 58 of the sequences contained a continuous open reading frame. Phylogenetic analysis classified the 58 NBS genes into nine clades, eight of which were genus specific. This was taken to imply that most of the ancestors of these NBS genes evolved after the genus split. The motif pattern of the 58 NBS-encoding genes was consistent with their phylogenetic profile. An extended phylogenetic analysis, incorporating citrus NBS genes from the public database, classified 95 citrus NBS genes into six clades, half of which were genus specific. RFLP analysis showed that citrus NBS-encoding genes have been evolving rapidly, and that they are unstable when passed through an intergeneric cross. Of 32 NBS-encoding genes tracked by gene-specific PCR, 24 showed segregation distortion among a set of 94 F(1) individuals. This study provides new insight into the evolution of Rutaceae NBS genes and their behaviour following an intergeneric cross.

  12. High-throughput analysis of candidate imprinted genes and allele-specific gene expression in the human term placenta

    Directory of Open Access Journals (Sweden)

    Clark Taane G

    2010-04-01

    Full Text Available Abstract Background Imprinted genes show expression from one parental allele only and are important for development and behaviour. This extreme mode of allelic imbalance has been described for approximately 56 human genes. Imprinting status is often disrupted in cancer and dysmorphic syndromes. More subtle variation of gene expression, that is not parent-of-origin specific, termed 'allele-specific gene expression' (ASE is more common and may give rise to milder phenotypic differences. Using two allele-specific high-throughput technologies alongside bioinformatics predictions, normal term human placenta was screened to find new imprinted genes and to ascertain the extent of ASE in this tissue. Results Twenty-three family trios of placental cDNA, placental genomic DNA (gDNA and gDNA from both parents were tested for 130 candidate genes with the Sequenom MassArray system. Six genes were found differentially expressed but none imprinted. The Illumina ASE BeadArray platform was then used to test 1536 SNPs in 932 genes. The array was enriched for the human orthologues of 124 mouse candidate genes from bioinformatics predictions and 10 human candidate imprinted genes from EST database mining. After quality control pruning, a total of 261 informative SNPs (214 genes remained for analysis. Imprinting with maternal expression was demonstrated for the lymphocyte imprinted gene ZNF331 in human placenta. Two potential differentially methylated regions (DMRs were found in the vicinity of ZNF331. None of the bioinformatically predicted candidates tested showed imprinting except for a skewed allelic expression in a parent-specific manner observed for PHACTR2, a neighbour of the imprinted PLAGL1 gene. ASE was detected for two or more individuals in 39 candidate genes (18%. Conclusions Both Sequenom and Illumina assays were sensitive enough to study imprinting and strong allelic bias. Previous bioinformatics approaches were not predictive of new imprinted genes

  13. Fine Mapping and Candidate Gene Analysis of Resistance Gene RSC3Q to Soybean mosaic virus in Qihuang 1

    Institute of Scientific and Technical Information of China (English)

    Zheng gui-jie; Yang Yong-qing; Ma Ying; Yang Xiao-feng; Chen Shan-yu; Ren Rui; Wang Da-gang; Yang Zhong-lu; ZhI hai-jian

    2014-01-01

    Soybean mosaic virus (SMV) disease is one of the most destructive viral diseases in soybean (Glycine max (L.) Merr.). SMV strain SC3 is the major prevalent strain in huang-huai and Yangtze valleys, China. The soybean cultivar Qihuang 1 is of a rich resistance spectrum and has a wide range of application in breeding programs in China. In this study, F1, F2 and F2:3 from Qihuang 1×nannong 1138-2 were used to study inheritance and linkage mapping of the SC3 resistance gene in Qihuang 1. The secondary F2 population and near isogenic lines (nILs) derived from residual heterozygous lines (RhLs) of Qihuang 1×nannong 1138-2 were separatively used in the ifne mapping and candidate gene analysis of the resistance gene. Results indicated that a single dominant gene (designated RSC3Q) controls resistance, which was located on chromosome 13. Two genomic-simple sequence repeat (SSR) markers BARCSOYSSR_13_1114 and BARCSOYSSR_13_1136 were found lfanking the two sides of the RSC3Q. The interval between the two markers was 651 kb. Quantitative real-time PCR analysis of the candidate genes showed that ifve genes (Glyma13g25730, 25750, 25950, 25970 and 26000) were likely involved in soybean SMV resistance. These results would have utility in cloning of RSC3Q resistance candidate gene and marker-assisted selection (MaS) in resistance breeding to SMV.

  14. Molecular evidence of lateral gene transfer in rpoB gene of Mycobacterium yongonense strains via multilocus sequence analysis.

    Directory of Open Access Journals (Sweden)

    Byoung-Jun Kim

    Full Text Available Recently, a novel species, Mycobacterium yongonense (DSM 45126(T, was introduced and while it is phylogenetically related to Mycobacterium intracellulare, it has a distinct RNA polymerase β-subunit gene (rpoB sequence that is identical to that of Mycobacterium parascrofulaceum, which is a distantly related scotochromogen, which suggests the acquisition of the rpoB gene via a potential lateral gene transfer (LGT event. The aims of this study are to prove the presence of the LGT event in the rpoB gene of the M. yongonense strains via multilocus sequence analysis (MLSA. In order to determine the potential of an LGT event in the rpoB gene of the M. yongonense, the MLSA based on full rpoB sequences (3447 or 3450 bp and on partial sequences of five other targets [16S rRNA (1383 or 1395 bp, hsp65 (603 bp, dnaJ (192 bp, recA (1053 bp, and sodA (501 bp] were conducted. Incongruences between the phylogenetic analysis of the full rpoB and the five other genes in a total of three M. yongonense strains [two clinical strains (MOTT-12 and MOTT-27 and one type strain (DSM 45126(T] were observed, suggesting that rpoB gene of three M. yongonense strains may have been acquired very recently via an LGT event from M. parascrofulaceum, which is a distantly related scotochromogen.

  15. A Novel Approach to Functional Analysis of the Ribulose Bisphosphate Carboxylase Small Subunit Gene by Agrobacterium-Mediated Gene Silencing

    Institute of Scientific and Technical Information of China (English)

    Xiao-Fu Zhou; Peng-Da Ma; Ren-Hou Wang; Bo Liu; Xing-Zhi Wang

    2006-01-01

    A novel approach to virus-induced post-transcriptional gene silencing for studying the function of the ribulose bisphosphate carboxylase small subunlt (rbcS) gene was established and optimized using potato virus X vector and Nicotiana benthamiana as experimental material. The analysis of silencing phenomena,transcriptional level, protein expression, and pigment measurement showed that the expression of the rbcS endogenous gene was inactivated by the expression of a 500-bp homologous cDNA fragment carried in the virus vector.

  16. Random matrix analysis for gene interaction networks in cancer cells

    CERN Document Server

    Kikkawa, Ayumi

    2016-01-01

    Motivation: The investigation of topological modifications of the gene interaction networks in cancer cells is essential for understanding the desease. We study gene interaction networks in various human cancer cells with the random matrix theory. This study is based on the Cancer Network Galaxy (TCNG) database which is the repository of huge gene interactions inferred by Bayesian network algorithms from 256 microarray experimental data downloaded from NCBI GEO. The original GEO data are provided by the high-throughput microarray expression experiments on various human cancer cells. We apply the random matrix theory to the computationally inferred gene interaction networks in TCNG in order to detect the universality in the topology of the gene interaction networks in cancer cells. Results: We found the universal behavior in almost one half of the 256 gene interaction networks in TCNG. The distribution of nearest neighbor level spacing of the gene interaction matrix becomes the Wigner distribution when the net...

  17. Comparative analysis of chromatin landscape in regulatory regions of human housekeeping and tissue specific genes

    Directory of Open Access Journals (Sweden)

    Dasgupta Dipayan

    2005-05-01

    Full Text Available Abstract Background Global regulatory mechanisms involving chromatin assembly and remodelling in the promoter regions of genes is implicated in eukaryotic transcription control especially for genes subjected to spatial and temporal regulation. The potential to utilise global regulatory mechanisms for controlling gene expression might depend upon the architecture of the chromatin in and around the gene. In-silico analysis can yield important insights into this aspect, facilitating comparison of two or more classes of genes comprising of a large number of genes within each group. Results In the present study, we carried out a comparative analysis of chromatin characteristics in terms of the scaffold/matrix attachment regions, nucleosome formation potential and the occurrence of repetitive sequences, in the upstream regulatory regions of housekeeping and tissue specific genes. Our data show that putative scaffold/matrix attachment regions are more abundant and nucleosome formation potential is higher in the 5' regions of tissue specific genes as compared to the housekeeping genes. Conclusion The differences in the chromatin features between the two groups of genes indicate the involvement of chromatin organisation in the control of gene expression. The presence of global regulatory mechanisms mediated through chromatin organisation can decrease the burden of invoking gene specific regulators for maintenance of the active/silenced state of gene expression. This could partially explain the lower number of genes estimated in the human genome.

  18. Modular Analysis of Peripheral Blood Gene Expression in Rheumatoid Arthritis Captures Reproducible Gene Expression Changes in TNF Responders

    Science.gov (United States)

    Oswald, Michaela; Curran, Mark; Lamberth, Sarah; Townsend, Robert; Hamilton, Jennifer D.; Chernoff, David N.; Carulli, John; Townsend, Michael; Weinblatt, Michael; Kern, Marlena; Pond, Cassandra; Lee, Annette; Gregersen, Peter K.

    2015-01-01

    Objective To establish whether the analysis of whole blood gene expression can be useful in predicting or monitoring response to anti-TNF therapy in RA. Methods Whole blood RNA (PAXgene) was obtained at baseline and 14 weeks on three independent cohorts with a combined total of 250 patients with rheumatoid arthritis beginning anti-TNF therapy. We employed an approach to gene expression analysis that is based on gene expression “modules”. Results Good and Moderate Responders by EULAR criteria exhibited highly significant and consistent changes in multiple gene expression modules using a hyper geometric analysis after 14 weeks of therapy. Strikingly, non responders exhibited very little change in any modules, despite exposure to TNF blockade. These patterns of change were highly consistent across all three cohorts, indicating that immunological changes after TNF treatment are specific to the combination of both drug exposure and responder status. In contrast, modular patterns of gene expression did not exhibit consistent differences between responders and non-responders at baseline in the three cohorts. Conclusions These data provide evidence that using gene expression modules related to inflammatory disease may provide a valuable method for objective monitoring of the response of RA patients who are treated with TNF inhibitors. PMID:25371395

  19. Association analysis of dyslipidemia-related genes in diabetic nephropathy.

    Directory of Open Access Journals (Sweden)

    Gareth J McKay

    Full Text Available Type 1 diabetes (T1D increases risk of the development of microvascular complications and cardiovascular disease (CVD. Dyslipidemia is a common risk factor in the pathogenesis of both CVD and diabetic nephropathy (DN, with CVD identified as the primary cause of death in patients with DN. In light of this commonality, we assessed single nucleotide polymorphisms (SNPs in thirty-seven key genetic loci previously associated with dyslipidemia in a T1D cohort using a case-control design. SNPs (n = 53 were genotyped using Sequenom in 1467 individuals with T1D (718 cases with proteinuric nephropathy and 749 controls without nephropathy i.e. normal albumin excretion. Cases and controls were white and recruited from the UK and Ireland. Association analyses were performed using PLINK to compare allele frequencies in cases and controls. In a sensitivity analysis, samples from control individuals with reduced renal function (estimated glomerular filtration rate<60 ml/min/1.73 m(2 were excluded. Correction for multiple testing was performed by permutation testing. A total of 1394 samples passed quality control filters. Following regression analysis adjusted by collection center, gender, duration of diabetes, and average HbA1c, two SNPs were significantly associated with DN. rs4420638 in the APOC1 region (odds ratio [OR] = 1.51; confidence intervals [CI]: 1.19-1.91; P = 0.001 and rs1532624 in CETP (OR = 0.82; CI: 0.69-0.99; P = 0.034; rs4420638 was also significantly associated in a sensitivity analysis (P = 0.016 together with rs7679 (P = 0.027. However, no association was significant following correction for multiple testing. Subgroup analysis of end-stage renal disease status failed to reveal any association. Our results suggest common variants associated with dyslipidemia are not strongly associated with DN in T1D among white individuals. Our findings, cannot entirely exclude these key genes which are central to the process of dyslipidemia, from

  20. Expression Analysis of Genes in the Nif Cluster of Clostridium beijerinckii

    OpenAIRE

    2007-01-01

    The nif genes of Clostridium beijerinckii NRRL B593 occupy a region of about 16 kilobases. Besides the two glnB-like genes, five other genes are interspersed between the nifNB and the nifVw genes. An expression analysis of the nif genes in nitrogen-fixing and non-nitrogen-fixing cells with probes generated from various regions of the nif cluster by northern blot analysis revealed the presence of four different transcripts in nitrogen-fixing cells. Two of these transcripts had the predicted si...

  1. Prediction and analysis of essential genes using the enrichments of gene ontology and KEGG pathways.

    Science.gov (United States)

    Chen, Lei; Zhang, Yu-Hang; Wang, ShaoPeng; Zhang, YunHua; Huang, Tao; Cai, Yu-Dong

    2017-01-01

    Identifying essential genes in a given organism is important for research on their fundamental roles in organism survival. Furthermore, if possible, uncovering the links between core functions or pathways with these essential genes will further help us obtain deep insight into the key roles of these genes. In this study, we investigated the essential and non-essential genes reported in a previous study and extracted gene ontology (GO) terms and biological pathways that are important for the determination of essential genes. Through the enrichment theory of GO and KEGG pathways, we encoded each essential/non-essential gene into a vector in which each component represented the relationship between the gene and one GO term or KEGG pathway. To analyze these relationships, the maximum relevance minimum redundancy (mRMR) was adopted. Then, the incremental feature selection (IFS) and support vector machine (SVM) were employed to extract important GO terms and KEGG pathways. A prediction model was built simultaneously using the extracted GO terms and KEGG pathways, which yielded nearly perfect performance, with a Matthews correlation coefficient of 0.951, for distinguishing essential and non-essential genes. To fully investigate the key factors influencing the fundamental roles of essential genes, the 21 most important GO terms and three KEGG pathways were analyzed in detail. In addition, several genes was provided in this study, which were predicted to be essential genes by our prediction model. We suggest that this study provides more functional and pathway information on the essential genes and provides a new way to investigate related problems.

  2. Identification of hub genes of pneumocyte senescence induced by thoracic irradiation using weighted gene co-expression network analysis

    Science.gov (United States)

    XING, YONGHUA; ZHANG, JUNLING; LU, LU; LI, DEGUAN; WANG, YUEYING; HUANG, SONG; LI, CHENGCHENG; ZHANG, ZHUBO; LI, JIANGUO; MENG, AIMIN

    2016-01-01

    Irradiation commonly causes pneumocyte senescence, which may lead to severe fatal lung injury characterized by pulmonary dysfunction and respiratory failure. However, the molecular mechanism underlying the induction of pneumocyte senescence by irradiation remains to be elucidated. In the present study, weighted gene co-expression network analysis (WGCNA) was used to screen for differentially expressed genes, and to identify the hub genes and gene modules, which may be critical for senescence. A total of 2,916 differentially expressed genes were identified between the senescence and non-senescence groups following thoracic irradiation. In total, 10 gene modules associated with cell senescence were detected, and six hub genes were identified, including B-cell scaffold protein with ankyrin repeats 1, translocase of outer mitochondrial membrane 70 homolog A, actin filament-associated protein 1, Cd84, Nuf2 and nuclear factor erythroid 2. These genes were markedly associated with cell proliferation, cell division and cell cycle arrest. The results of the present study demonstrated that WGCNA of microarray data may provide further insight into the molecular mechanism underlying pneumocyte senescence. PMID:26572216

  3. Genome-scale analysis of positional clustering of mouse testis-specific genes

    Directory of Open Access Journals (Sweden)

    Lee Bernett TK

    2005-01-01

    Full Text Available Abstract Background Genes are not randomly distributed on a chromosome as they were thought even after removal of tandem repeats. The positional clustering of co-expressed genes is known in prokaryotes and recently reported in several eukaryotic organisms such as Caenorhabditis elegans, Drosophila melanogaster, and Homo sapiens. In order to further investigate the mode of tissue-specific gene clustering in higher eukaryotes, we have performed a genome-scale analysis of positional clustering of the mouse testis-specific genes. Results Our computational analysis shows that a large proportion of testis-specific genes are clustered in groups of 2 to 5 genes in the mouse genome. The number of clusters is much higher than expected by chance even after removal of tandem repeats. Conclusion Our result suggests that testis-specific genes tend to cluster on the mouse chromosomes. This provides another piece of evidence for the hypothesis that clusters of tissue-specific genes do exist.

  4. Cluster Analysis and Significance of Novel Genes Related to Molecular Classification of Glioma

    Institute of Scientific and Technical Information of China (English)

    Juxiang Chen; Yicheng Lu; Guohan Hu; Kehua Sun; Chun Luo; Meiqing Lou; Kang Ying; Yao Li

    2005-01-01

    OBJECTIVE To screen differentially expressed genes in the development of human glioma and establish a primary molecular classification of glioma based on gene expression using cDNA microarrays.METHODS Brain specimens were obtained from 18 patients with glioma, 10males and 8 females, ages 14~62 with an average age of 44.4. The total RNAs of these glioma specimens and two specimens of donated brain of normal adults were extracted. BioStarH140S microarrays (including 8,347old genes and 5,592 novel genes) were adopted and hybridized with probes which were prepared from the total RNAs. Differentially expressed genes between normal tissues and glioma tissues were assayed after scanning cDNA microarrays with ScanArray4000. Northern hybridization and in situ hybridization (ISH) were used to identify functions of novel genes. Those differentially expressed genes were studied with a Hierarchical method and molecular classification of glioma was preliminary carried out.RESULTS Among the 13,939 target genes, there were 1,200 (8.61%)differentially expressed genes, of which 395 (2.83%) were novel genes. A total of 348 genes were up-regulated and 852 genes were down-regulated in the gliomas. The results of bioinformatical analysis, Northern hybridization and ISH revealed that those novel genes were highly associated with gliomas. There were multiple genes, such as the MAP gene、cytoskeleton & matrix motility genes, etc, which were of relevance to classification by the Hierarchical method. Molecular classification of glioma using a Hierarchical cluster was in accordance with pathology and suggested a molecular process of tumorigenesis and development.CONCLUSION Multiple genes play important roles in development of glioma. cDNA microarray technology is a powerful technique in screening for differentially expressed genes between two different kinds of tissues. Further analysis of gene expression and novel genes would be helpful to understand the molecular mechanism of glioma

  5. Identification and structural analysis of a novel snoRNA gene cluster from Arabidopsis thaliana

    Institute of Scientific and Technical Information of China (English)

    2000-01-01

    A Z2 snoRNA gene cluster,consisting of four antisense snoRNA genes, was identified from Arabidopsis thaliana. The sequence and structural analysis showed that the Z2 snoRNA gene cluster might be transcribed as a polycistronic precursor from an upstream promoter, and the intergenic spacers of the gene cluster encode the 'hairpin' structures similar to the processing recognition signals of yeast Saccharomyces cerevisiae polycistronic snoRNA precursor. The results also revealed that plant snoRNA gene with multiple copies is a characteristic in common, and provides a good system for further revealing the transcription and expression mechanism of plant snoRNA gene cluster.

  6. Global mass spectrometry and transcriptomics array based drug profiling provides novel insight into glucosamine induced endoplasmic reticulum stress

    DEFF Research Database (Denmark)

    Carvalho, Ana Sofia; Ribeiro, Helena; Voabil, Paula;

    2014-01-01

    We investigated the molecular effects of glucosamine supplements, a popular and safe alternative to nonsteroidal anti-inflammatory drugs, for decreasing pain, inflammation, and maintaining healthy joints. Numerous studies have reported an array of molecular effects after glucosamine treatment. We...... questioned whether the differences in the effects observed in previous studies were associated with the focus on a specific subproteome or with the use of specific cell lines or tissues. To address this question, global mass spectrometry- and transcription array-based glucosamine drug profiling was performed...... mainly observed glucosamine induced O-GlcNAcylation/O-GalNAcylation (O-HexNAcylation); however, we also observed global and local changes in acetylation, methylation, and phosphorylation. For example, our data provides two additional examples of "yin-yang" between phosphorylation and O...

  7. Global identification and expression analysis of stress-responsive genes of the Argonaute family in apple.

    Science.gov (United States)

    Xu, Ruirui; Liu, Caiyun; Li, Ning; Zhang, Shizhong

    2016-12-01

    Argonaute (AGO) proteins, which are found in yeast, animals, and plants, are the core molecules of the RNA-induced silencing complex. These proteins play important roles in plant growth, development, and responses to biotic stresses. The complete analysis and classification of the AGO gene family have been recently reported in different plants. Nevertheless, systematic analysis and expression profiling of these genes have not been performed in apple (Malus domestica). Approximately 15 AGO genes were identified in the apple genome. The phylogenetic tree, chromosome location, conserved protein motifs, gene structure, and expression of the AGO gene family in apple were analyzed for gene prediction. All AGO genes were phylogenetically clustered into four groups (i.e., AGO1, AGO4, MEL1/AGO5, and ZIPPY/AGO7) with the AGO genes of Arabidopsis. These groups of the AGO gene family were statistically analyzed and compared among 31 plant species. The predicted apple AGO genes are distributed across nine chromosomes at different densities and include three segment duplications. Expression studies indicated that 15 AGO genes exhibit different expression patterns in at least one of the tissues tested. Additionally, analysis of gene expression levels indicated that the genes are mostly involved in responses to NaCl, PEG, heat, and low-temperature stresses. Hence, several candidate AGO genes are involved in different aspects of physiological and developmental processes and may play an important role in abiotic stress responses in apple. To the best of our knowledge, this study is the first to report a comprehensive analysis of the apple AGO gene family. Our results provide useful information to understand the classification and putative functions of these proteins, especially for gene members that may play important roles in abiotic stress responses in M. hupehensis.

  8. A Field Programmable Gate Array Based Software Defined Radio Design for the Space Environment

    Science.gov (United States)

    2009-12-01

    Signal Analysis, Software Defined Radio (SDR), System Generator , Fast Fourier Transform (FFT), Field Programmable Gate Array (FPGA), Xilinx, Virtex...5 B. COMPUTING TOOLS ...................................................................................7 1. System Generator .................................................................................7... System Generator Configuration Options [After 16]........................18 Figure 8. FFTv1.0, Triple memory configuration [From 22

  9. DNA microarray analysis of genes differentially expressed in adipocyte differentiation

    Indian Academy of Sciences (India)

    Chunyan Yin; Yanfeng Xiao; Wei Zhang; Erdi Xu; Weihua Liu; Xiaoqing Yi; Ming Chang

    2014-06-01

    In the present study, the human liposarcoma cell line SW872 was used to identify global changes in gene expression profiles occurring during adipogenesis. We further explored some of the genes expressed during the late phase of adipocyte differentiation. These genes may play a major role in promoting excessive proliferation and accumulation of lipid droplets, which contribute to the development of obesity. By using microarray-based technology, we examined differential gene expression in early differentiated adipocytes and late differentiated adipocytes. Validated genes exhibited a ≥ 10-fold increase in the late phase of adipocyte differentiation by polymerase chain reaction (RT-PCR). Compared with undifferentiated preadipocytes, we found that 763 genes were increased in early differentiated adipocytes, and 667 genes were increased in later differentiated adipocytes. Furthermore, 21 genes were found being expressed 10-fold higher in the late phase of adipocyte differentiation. The results were in accordance with the RT-PCR test, which validated 11 genes, namely, CIDEC, PID1, LYRM1, ADD1, PPAR2, ANGPTL4, ADIPOQ, ACOX1, FIP1L1, MAP3K2 and PEX14. Most of these genes were found being expressed in the later phase of adipocyte differentiation involved in obesity-related diseases. The findings may help to better understand the mechanism of obesity and related diseases.

  10. DNA microarray analysis of genes differentially expressed in adipocyte differentiation.

    Science.gov (United States)

    Yin, Chunyan; Xiao, Yanfeng; Zhang, Wei; Xu, Erdi; Liu, Weihua; Yi, Xiaoqing; Chang, Ming

    2014-06-01

    In the present study, the human liposarcoma cell line SW872 was used to identify global changes in gene expression profiles occurring during adipogenesis. We further explored some of the genes expressed during the late phase of adipocyte differentiation. These genes may play a major role in promoting excessive proliferation and accumulation of lipid droplets, which contribute to the development of obesity. By using microarray-based technology, we examined differential gene expression in early differentiated adipocytes and late differentiated adipocytes. Validated genes exhibited a greater than or equal to 10-fold increase in the late phase of adipocyte differentiation by polymerase chain reaction (RT-PCR). Compared with undifferentiated preadipocytes, we found that 763 genes were increased in early differentiated adipocytes, and 667 genes were increased in later differentiated adipocytes. Furthermore, 21 genes were found being expressed 10-fold higher in the late phase of adipocyte differentiation. The results were in accordance with the RTPCR test, which validated 11 genes, namely, CIDEC, PID1, LYRM1, ADD1, PPAR?2, ANGPTL4, ADIPOQ, ACOX1, FIP1L1, MAP3K2 and PEX14. Most of these genes were found being expressed in the later phase of adipocyte differentiation involved in obesity-related diseases. The findings may help to better understand the mechanism of obesity and related diseases.

  11. Analysis of SRY Gene in 8 Cases of Sex Abnormality

    Institute of Scientific and Technical Information of China (English)

    王慧; 腾云; 田虹; 陈燕; 杨真荣; 唐艳平

    2004-01-01

    In order to investigate the relationship between sex dysplasia and sex-determining region Y (SRY) gene, 8 patients with sexual abnormality were analyzed by cytogenetic and molecular genetic methods. Fluorescence in situ hybridization (FISH) using PY3.4, X alpha satellite, and SRY probes was performed in each case to analyze the sex chromosome translocation and gene translocation. SRY gene was amplified by polymerase chain reaction (PCR) and its mutation was detected by direct sequencing. The results showed that among 8 patients, 5 were positive for SRY and the remaining negative for SRY. In the patients positive for SRY genes, 3 presented testes and the left 2streak ovaries. In the patients negative for SRY, only one case presented testes, while 2 ovaries.Direct sequencing demonstrated that all SRY genes were normal in the patients positive for SRY genes. FISH technique demonstrated that SRY genes translocated from Ypter to Xpter in 2 46,XX phenotypic males positive for SRY genes. It was concluded that SRY gene is strongly involved in.male sex determination, while a sequence of other genes may be taken into account in sexual development.

  12. Prediction and Analysis of Retinoblastoma Related Genes through Gene Ontology and KEGG

    OpenAIRE

    Zhen Li; Bi-Qing Li; Min Jiang; Lei Chen; Jian Zhang; Lin Liu; Tao Huang

    2013-01-01

    One of the most important and challenging problems in biomedicine is how to predict the cancer related genes. Retinoblastoma (RB) is the most common primary intraocular malignancy usually occurring in childhood. Early detection of RB could reduce the morbidity and promote the probability of disease-free survival. Therefore, it is of great importance to identify RB genes. In this study, we developed a computational method to predict RB related genes based on Dagging, with the maximum relevance...

  13. Gene-Wide Analysis Detects Two New Susceptibility Genes for Alzheimer's Disease

    OpenAIRE

    Escott-Price, Valentina; Bellenguez, Céline; Wang, Li-San; Choi, Seung-Hoan; Harold, Denise; Jones, Lesley; Holmans, Peter Alan; Gerrish, Amy; Vedernikov, Alexey; Richards, Alexander; DeStefano, Anita L.; Lambert, Jean-Charles; Ibrahim-Verbaas, Carla A.; Naj, Adam C; Sims, Rebecca

    2014-01-01

    PUBLISHED BACKGROUND: Alzheimer's disease is a common debilitating dementia with known heritability, for which 20 late onset susceptibility loci have been identified, but more remain to be discovered. This study sought to identify new susceptibility genes, using an alternative gene-wide analytical approach which tests for patterns of association within genes, in the powerful genome-wide association dataset of the International Genomics of Alzheimer's Project Consortium, comprising over...

  14. Transcriptome analysis reveals key differentially expressed genes involved in wheat grain development

    Institute of Scientific and Technical Information of China (English)

    Yonglong Yu; Dong Zhu; Chaoying Ma; Hui Cao; Yaping Wang; Yanhao Xu; Wenying Zhang; Yueming Yan

    2016-01-01

    Wheat seed development is an important physiological process of seed maturation and directly affects wheat yield and quality. In this study, we performed dynamic transcriptome microarray analysis of an elite Chinese bread wheat cultivar (Jimai 20) during grain development using the GeneChip Wheat Genome Array. Grain morphology and scanning electron microscope observations showed that the period of 11–15 days post-anthesis (DPA) was a key stage for the synthesis and accumulation of seed starch. Genome-wide transcriptional profiling and significance analysis of microarrays revealed that the period from 11 to 15 DPA was more important than the 15–20 DPA stage for the synthesis and accumulation of nutritive reserves. Series test of cluster analysis of differential genes revealed five statistically significant gene expression profiles. Gene ontology annotation and enrichment analysis gave further informa-tion about differentially expressed genes, and MapMan analysis revealed expression changes within functional groups during seed development. Metabolic pathway network analysis showed that major and minor metabolic pathways regulate one another to ensure regular seed development and nutritive reserve accumulation. We performed gene co-expression network analysis to identify genes that play vital roles in seed development and identified several key genes involved in important metabolic pathways. The transcriptional expression of eight key genes involved in starch and protein synthesis and stress defense was further validated by qRT-PCR. Our results provide new insight into the molecular mechanisms of wheat seed development and the determinants of yield and quality.

  15. Transcriptome analysis reveals key differentially expressed genes involved in wheat grain development

    Directory of Open Access Journals (Sweden)

    Yonglong Yu

    2016-04-01

    Full Text Available Wheat seed development is an important physiological process of seed maturation and directly affects wheat yield and quality. In this study, we performed dynamic transcriptome microarray analysis of an elite Chinese bread wheat cultivar (Jimai 20 during grain development using the GeneChip Wheat Genome Array. Grain morphology and scanning electron microscope observations showed that the period of 11–15 days post-anthesis (DPA was a key stage for the synthesis and accumulation of seed starch. Genome-wide transcriptional profiling and significance analysis of microarrays revealed that the period from 11 to 15 DPA was more important than the 15–20 DPA stage for the synthesis and accumulation of nutritive reserves. Series test of cluster analysis of differential genes revealed five statistically significant gene expression profiles. Gene ontology annotation and enrichment analysis gave further information about differentially expressed genes, and MapMan analysis revealed expression changes within functional groups during seed development. Metabolic pathway network analysis showed that major and minor metabolic pathways regulate one another to ensure regular seed development and nutritive reserve accumulation. We performed gene co-expression network analysis to identify genes that play vital roles in seed development and identified several key genes involved in important metabolic pathways. The transcriptional expression of eight key genes involved in starch and protein synthesis and stress defense was further validated by qRT-PCR. Our results provide new insight into the molecular mechanisms of wheat seed development and the determinants of yield and quality.

  16. Analysis of Gene Expression Responses to a Infection in Rugao Chicken Intestine Using GeneChips

    Directory of Open Access Journals (Sweden)

    D. Q. Luan

    2012-02-01

    Full Text Available Poultry products are an important source of Salmonella enterica. An effective way to reduce food poisoning due to Salmonella would be to breed chickens more resistant to infection. Unfortunately host responses to Salmonella are complex with many factors involved. To learn more about responses to Salmonella in young chickens of 2 wk old, a cDNA Microarray containing 13,319 probes was performed to compare gene expression profiles between two chicken groups under control and Salmonella infected conditions. Newly hatched chickens were orally infected with S. enterica serovar Enteritidis. Since the intestine is one of the important barriers the bacteria encounter after oral inoculation, intestine gene expression was investigated at 2 wk old. There were 588 differentially expressed genes detected, of which 276 were known genes, and of the total number 266 were up-regulated and 322 were down-regulated. Differences in gene expression between the two chicken groups were found in control as well as Salmonella infected conditions indicating a difference in the intestine development between the two chicken groups which might be linked to the difference in Salmonella susceptibility. The differential expressions of 4 genes were confirmed by quantitative real-time PCR and the results indicated that the expression changes of these genes were generally consistent with the results of GeneChips. The findings in this study have lead to the identification of novel genes and possible cellular pathways, which are host dependent.

  17. Meta-analysis of differentiating mouse embryonic stem cell gene expression kinetics reveals early change of a small gene set.

    Directory of Open Access Journals (Sweden)

    Clive H Glover

    2006-11-01

    Full Text Available Stem cell differentiation involves critical changes in gene expression. Identification of these should provide endpoints useful for optimizing stem cell propagation as well as potential clues about mechanisms governing stem cell maintenance. Here we describe the results of a new meta-analysis methodology applied to multiple gene expression datasets from three mouse embryonic stem cell (ESC lines obtained at specific time points during the course of their differentiation into various lineages. We developed methods to identify genes with expression changes that correlated with the altered frequency of functionally defined, undifferentiated ESC in culture. In each dataset, we computed a novel statistical confidence measure for every gene which captured the certainty that a particular gene exhibited an expression pattern of interest within that dataset. This permitted a joint analysis of the datasets, despite the different experimental designs. Using a ranking scheme that favored genes exhibiting patterns of interest, we focused on the top 88 genes whose expression was consistently changed when ESC were induced to differentiate. Seven of these (103728_at, 8430410A17Rik, Klf2, Nr0b1, Sox2, Tcl1, and Zfp42 showed a rapid decrease in expression concurrent with a decrease in frequency of undifferentiated cells and remained predictive when evaluated in additional maintenance and differentiating protocols. Through a novel meta-analysis, this study identifies a small set of genes whose expression is useful for identifying changes in stem cell frequencies in cultures of mouse ESC. The methods and findings have broader applicability to understanding the regulation of self-renewal of other stem cell types.

  18. MiningABs: mining associated biomarkers across multi-connected gene expression datasets.

    Science.gov (United States)

    Cheng, Chun-Pei; DeBoever, Christopher; Frazer, Kelly A; Liu, Yu-Cheng; Tseng, Vincent S

    2014-06-08

    Human disease often arises as a consequence of alterations in a set of associated genes rather than alterations to a set of unassociated individual genes. Most previous microarray-based meta-analyses identified disease-associated genes or biomarkers independent of genetic interactions. Therefore, in this study, we present the first meta-analysis method capable of taking gene combination effects into account to efficiently identify associated biomarkers (ABs) across different microarray platforms. We propose a new meta-analysis approach called MiningABs to mine ABs across different array-based datasets. The similarity between paired probe sequences is quantified as a bridge to connect these datasets together. The ABs can be subsequently identified from an "improved" common logit model (c-LM) by combining several sibling-like LMs in a heuristic genetic algorithm selection process. Our approach is evaluated with two sets of gene expression datasets: i) 4 esophageal squamous cell carcinoma and ii) 3 hepatocellular carcinoma datasets. Based on an unbiased reciprocal test, we demonstrate that each gene in a group of ABs is required to maintain high cancer sample classification accuracy, and we observe that ABs are not limited to genes common to all platforms. Investigating the ABs using Gene Ontology (GO) enrichment, literature survey, and network analyses indicated that our ABs are not only strongly related to cancer development but also highly connected in a diverse network of biological interactions. The proposed meta-analysis method called MiningABs is able to efficiently identify ABs from different independently performed array-based datasets, and we show its validity in cancer biology via GO enrichment, literature survey and network analyses. We postulate that the ABs may facilitate novel target and drug discovery, leading to improved clinical treatment. Java source code, tutorial, example and related materials are available at "http://sourceforge.net/projects/miningabs/".

  19. Molecular characterization, sequence analysis and tissue expression of a porcine gene – MOSPD2

    Directory of Open Access Journals (Sweden)

    Yang Jie

    2017-01-01

    Full Text Available The full-length cDNA sequence of a porcine gene, MOSPD2, was amplified using the rapid amplification of cDNA ends method based on a pig expressed sequence tag sequence which was highly homologous to the coding sequence of the human MOSPD2 gene. Sequence prediction analysis revealed that the open reading frame of this gene encodes a protein of 491 amino acids that has high homology with the motile sperm domain-containing protein 2 (MOSPD2 of five species: horse (89%, human (90%, chimpanzee (89%, rhesus monkey (89% and mouse (85%; thus, it could be defined as a porcine MOSPD2 gene. This novel porcine gene was assigned GeneID: 100153601. This gene is structured in 15 exons and 14 introns as revealed by computer-assisted analysis. The phylogenetic analysis revealed that the porcine MOSPD2 gene has a closer genetic relationship with the MOSPD2 gene of horse. Tissue expression analysis indicated that the porcine MOSPD2 gene is generally and differentially expressed in the spleen, muscle, skin, kidney, lung, liver, fat and heart. Our experiment is the first to establish the primary foundation for further research on the porcine MOSPD2 gene.

  20. New cyt b gene universal primer set for forensic analysis.

    Science.gov (United States)

    Lopez-Oceja, A; Gamarra, D; Borragan, S; Jiménez-Moreno, S; de Pancorbo, M M

    2016-07-01

    Analysis of mitochondrial DNA, and in particular the cytochrome b gene (cyt b), has become an essential tool for species identification in routine forensic practice. In cases of degraded samples, where the DNA is fractionated, universal primers that are highly efficient for the amplification of the target region are necessary. Therefore, in the present study a new universal cyt b primer set with high species identification capabilities, even in samples with highly degraded DNA, has been developed. In order to achieve this objective, the primers were designed following the alignment of complete sequences of the cyt b from 751 species from the Class of Mammalia listed in GenBank. A highly variable region of 148bp flanked by highly conserved sequences was chosen for placing the primers. The effectiveness of the new pair of primers was examined in 63 animal species belonging to 38 Families from 14 Orders and 5 Classes (Mammalia, Aves, Reptilia, Actinopterygii, and Malacostraca). Species determination was possible in all cases, which shows that the fragment analyzed provided a high capability for species identification. Furthermore, to ensure the efficiency of the 148bp fragment, the intraspecific variability was analyzed by calculating the concordance between individuals with the BLAST tool from the NCBI (National Center for Biotechnological Information). The intraspecific concordance levels were superior to 97% in all species. Likewise, the phylogenetic information from the selected fragment was confirmed by obtaining the phylogenetic tree from the sequences of the species analyzed. Evidence of the high power of phylogenetic discrimination of the analyzed fragment of the cyt b was obtained, as 93.75% of the species were grouped within their corresponding Orders. Finally, the analysis of 40 degraded samples with small-size DNA fragments showed that the new pair of primers permits identifying the species, even when the DNA is highly degraded as it is very common in

  1. Gene structural analysis and expression of human renal dipeptidase

    Energy Technology Data Exchange (ETDEWEB)

    Satoh, Susumu; Ohtsuka, Kazuyuki; Keida, Yuriko; Kusunoki, Chihiro; Niwa, Mineo; Kohsaka, Masanobu (Fujisawa Pharmaceutical Company, Ltd., Osaka (Japan)); Konta, Yoshiyuki (Hirosaki Univ. (Japan))

    Human renal dipeptidase cDNA and genomic DNA were isolated from human kidney cDNA and genomic libraries, respectively. The human renal dipeptidase gene has a total length of approximately 6 kb and consists of ten exons and nine introns. The exons and cDNA each encode the 411 amino acid residues of the precursor protein, including 16 amino acid residues of signal sequence and a hydrophobic carboxyl terminal sequence for the attachment of a phosphatidylinositol glycan. Although the cDNA was slightly different from the cDNA reported by Adachi et al. (1990), the differences observed suggest, by comparison with human genomic DNA, that it may not represent an allelic variant but a cloning artifact. The recombinant human renal dipeptidase was produced on the surface of transfected L929 cells and had the same character as native renal dipeptidase. Northern blotting hybridization analysis showed that renal dipeptidase mRNA is only transcribed in kidney. 21 refs., 5 figs., 2 tabs.

  2. Phylogenetic analysis of uroporphyrinogen III synthase (UROS) gene.

    Science.gov (United States)

    Shaik, Abjal Pasha; Alsaeed, Abbas H; Sultana, Asma

    2012-01-01

    The uroporphyrinogen III synthase (UROS) enzyme (also known as hydroxymethylbilane hydrolyase) catalyzes the cyclization of hydroxymethylbilane to uroporphyrinogen III during heme biosynthesis. A deficiency of this enzyme is associated with the very rare Gunther's disease or congenital erythropoietic porphyria, an autosomal recessive inborn error of metabolism. The current study investigated the possible role of UROS (Homo sapiens [EC: 4.2.1.75; 265 aa; 1371 bp mRNA; Entrez Pubmed ref NP_000366.1, NM_000375.2]) in evolution by studying the phylogenetic relationship and divergence of this gene using computational methods. The UROS protein sequences from various taxa were retrieved from GenBank database and were compared using Clustal-W (multiple sequence alignment) with defaults and a first-pass phylogenetic tree was built using neighbor-joining method as in DELTA BLAST 2.2.27+ version. A total of 163 BLAST hits were found for the uroporphyrinogen III synthase query sequence and these hits showed putative conserved domain, HemD superfamily (as on 14(th) Nov 2012). We then narrowed down the search by manually deleting the proteins which were not UROS sequences and sequences belonging to phyla other than Chordata were deleted. A repeat phylogenetic analysis of 39 taxa was performed using PhyML and TreeDyn software to confirm that UROS is a highly conserved protein with approximately 85% conserved sequences in almost all chordate taxons emphasizing its importance in heme synthesis.

  3. Development of field programmable gate array-based reactor trip functions using systems engineering approach

    Energy Technology Data Exchange (ETDEWEB)

    Jung, Jae Cheon; Ahmed, Ibrahim [Nuclear Power Plant Engineering, KEPCO International Nuclear Graduate School, Ulsan (Korea, Republic of)

    2016-08-15

    Design engineering process for field programmable gate array (FPGA)-based reactor trip functions are developed in this work. The process discussed in this work is based on the systems engineering approach. The overall design process is effectively implemented by combining with design and implementation processes. It transforms its overall development process from traditional V-model to Y-model. This approach gives the benefit of concurrent engineering of design work with software implementation. As a result, it reduces development time and effort. The design engineering process consisted of five activities, which are performed and discussed: needs/systems analysis; requirement analysis; functional analysis; design synthesis; and design verification and validation. Those activities are used to develop FPGA-based reactor bistable trip functions that trigger reactor trip when the process input value exceeds the setpoint. To implement design synthesis effectively, a model-based design technique is implied. The finite-state machine with data path structural modeling technique together with very high speed integrated circuit hardware description language and the Aldec Active-HDL tool are used to design, model, and verify the reactor bistable trip functions for nuclear power plants.

  4. Screening Reliable Reference Genes for RT-qPCR Analysis of Gene Expression in Moringa oleifera

    Science.gov (United States)

    Deng, Li-Ting; Wu, Yu-Ling; Li, Jun-Cheng; OuYang, Kun-Xi; Ding, Mei-Mei; Zhang, Jun-Jie; Li, Shu-Qi; Lin, Meng-Fei; Chen, Han-Bin; Hu, Xin-Sheng; Chen, Xiao-Yang

    2016-01-01

    Moringa oleifera is a promising plant species for oil and forage, but its genetic improvement is limited. Our current breeding program in this species focuses on exploiting the functional genes associated with important agronomical traits. Here, we screened reliable reference genes for accurately quantifying the expression of target genes using the technique of real-time quantitative polymerase chain reaction (RT-qPCR) in M. oleifera. Eighteen candidate reference genes were selected from a transcriptome database, and their expression stabilities were examined in 90 samples collected from the pods in different developmental stages, various tissues, and the roots and leaves under different conditions (low or high temperature, sodium chloride (NaCl)- or polyethyleneglycol (PEG)- simulated water stress). Analyses with geNorm, NormFinder and BestKeeper algorithms revealed that the reliable reference genes differed across sample designs and that ribosomal protein L1 (RPL1) and acyl carrier protein 2 (ACP2) were the most suitable reference genes in all tested samples. The experiment results demonstrated the significance of using the properly validated reference genes and suggested the use of more than one reference gene to achieve reliable expression profiles. In addition, we applied three isotypes of the superoxide dismutase (SOD) gene that are associated with plant adaptation to abiotic stress to confirm the efficacy of the validated reference genes under NaCl and PEG water stresses. Our results provide a valuable reference for future studies on identifying important functional genes from their transcriptional expressions via RT-qPCR technique in M. oleifera. PMID:27541138

  5. Screening Reliable Reference Genes for RT-qPCR Analysis of Gene Expression in Moringa oleifera.

    Science.gov (United States)

    Deng, Li-Ting; Wu, Yu-Ling; Li, Jun-Cheng; OuYang, Kun-Xi; Ding, Mei-Mei; Zhang, Jun-Jie; Li, Shu-Qi; Lin, Meng-Fei; Chen, Han-Bin; Hu, Xin-Sheng; Chen, Xiao-Yang

    2016-01-01

    Moringa oleifera is a promising plant species for oil and forage, but its genetic improvement is limited. Our current breeding program in this species focuses on exploiting the functional genes associated with important agronomical traits. Here, we screened reliable reference genes for accurately quantifying the expression of target genes using the technique of real-time quantitative polymerase chain reaction (RT-qPCR) in M. oleifera. Eighteen candidate reference genes were selected from a transcriptome database, and their expression stabilities were examined in 90 samples collected from the pods in different developmental stages, various tissues, and the roots and leaves under different conditions (low or high temperature, sodium chloride (NaCl)- or polyethyleneglycol (PEG)- simulated water stress). Analyses with geNorm, NormFinder and BestKeeper algorithms revealed that the reliable reference genes differed across sample designs and that ribosomal protein L1 (RPL1) and acyl carrier protein 2 (ACP2) were the most suitable reference genes in all tested samples. The experiment results demonstrated the significance of using the properly validated reference genes and suggested the use of more than one reference gene to achieve reliable expression profiles. In addition, we applied three isotypes of the superoxide dismutase (SOD) gene that are associated with plant adaptation to abiotic stress to confirm the efficacy of the validated reference genes under NaCl and PEG water stresses. Our results provide a valuable reference for future studies on identifying important functional genes from their transcriptional expressions via RT-qPCR technique in M. oleifera.

  6. Comparative genomic analysis of Drosophila melanogaster and vector mosquito developmental genes.

    Directory of Open Access Journals (Sweden)

    Susanta K Behura

    Full Text Available Genome sequencing projects have presented the opportunity for analysis of developmental genes in three vector mosquito species: Aedes aegypti, Culex quinquefasciatus, and Anopheles gambiae. A comparative genomic analysis of developmental genes in Drosophila melanogaster and these three important vectors of human disease was performed in this investigation. While the study was comprehensive, special emphasis centered on genes that 1 are components of developmental signaling pathways, 2 regulate fundamental developmental processes, 3 are critical for the development of tissues of vector importance, 4 function in developmental processes known to have diverged within insects, and 5 encode microRNAs (miRNAs that regulate developmental transcripts in Drosophila. While most fruit fly developmental genes are conserved in the three vector mosquito species, several genes known to be critical for Drosophila development were not identified in one or more mosquito genomes. In other cases, mosquito lineage-specific gene gains with respect to D. melanogaster were noted. Sequence analyses also revealed that numerous repetitive sequences are a common structural feature of Drosophila and mosquito developmental genes. Finally, analysis of predicted miRNA binding sites in fruit fly and mosquito developmental genes suggests that the repertoire of developmental genes targeted by miRNAs is species-specific. The results of this study provide insight into the evolution of developmental genes and processes in dipterans and other arthropods, serve as a resource for those pursuing analysis of mosquito development, and will promote the design and refinement of functional analysis experiments.

  7. Gene set enrichment analysis for non-monotone association and multiple experimental categories

    OpenAIRE

    Heinloth Alexandra N; Irwin Richard D; Dai Shuangshuang; Lin Rongheng; Boorman Gary A; Li Leping

    2008-01-01

    Abstract Background Recently, microarray data analyses using functional pathway information, e.g., gene set enrichment analysis (GSEA) and significance analysis of function and expression (SAFE), have gained recognition as a way to identify biological pathways/processes associated with a phenotypic endpoint. In these analyses, a local statistic is used to assess the association between the expression level of a gene and the value of a phenotypic endpoint. Then these gene-specific local statis...

  8. Determination of Rayleigh wave ellipticity across the Earthscope Transportable Array using single-station and array-based processing of ambient seismic noise

    Science.gov (United States)

    Workman, Eli; Lin, Fan-Chi; Koper, Keith D.

    2017-01-01

    We present a single station method for the determination of Rayleigh wave ellipticity, or Rayleigh wave horizontal to vertical amplitude ratio (H/V) using Frequency Dependent Polarization Analysis (FDPA). This procedure uses singular value decomposition of 3-by-3 spectral covariance matrices over 1-hr time windows to determine properties of the ambient seismic noise field such as particle motion and dominant wave-type. In FPDA, if the noise is mostly dominated by a primary singular value and the phase difference is roughly 90° between the major horizontal axis and the vertical axis of the corresponding singular vector, we infer that Rayleigh waves are dominant and measure an H/V ratio for that hour and frequency bin. We perform this analysis for all available data from the Earthscope Transportable Array between 2004 and 2014. We compare the observed Rayleigh wave H/V ratios with those previously measured by multicomponent, multistation noise cross-correlation (NCC), as well as classical noise spectrum H/V ratio analysis (NSHV). At 8 s the results from all three methods agree, suggesting that the ambient seismic noise field is Rayleigh wave dominated. Between 10 and 30 s, while the general pattern agrees well, the results from FDPA and NSHV are persistently slightly higher (˜2 per cent) and significantly higher (>20 per cent), respectively, than results from the array-based NCC. This is likely caused by contamination from other wave types (i.e. Love waves, body waves, and tilt noise) in the single station methods, but it could also reflect a small, persistent error in NCC. Additionally, we find that the single station method has difficulty retrieving robust Rayleigh wave H/V ratios within major sedimentary basins, such as the Williston Basin and Mississippi Embayment, where the noise field is likely dominated by reverberating Love waves and tilt noise.

  9. Bayesian hierarchical error model for analysis of gene expression data

    National Research Council Canada - National Science Library

    Cho, HyungJun; Lee, Jae K

    2004-01-01

    .... Moreover, the same gene often shows quite heterogeneous error variability under different biological and experimental conditions, which must be estimated separately for evaluating the statistical...

  10. Coupled Two-Way Clustering Analysis of Breast Cancer and Colon Cancer Gene Expression Data

    CERN Document Server

    Getz, G; Kela, I; Domany, E; Notterman, D A; Getz, Gad; Gal, Hilah; Kela, Itai; Domany, Eytan; Notterman, Dan A.

    2003-01-01

    We present and review Coupled Two Way Clustering, a method designed to mine gene expression data. The method identifies submatrices of the total expression matrix, whose clustering analysis reveals partitions of samples (and genes) into biologically relevant classes. We demonstrate, on data from colon and breast cancer, that we are able to identify partitions that elude standard clustering analysis.

  11. Expression Pattern Analysis of Zinc Finger Protein Genes in Wheat (Triticum aestivum L.) Under Phosphorus Deprivation

    Institute of Scientific and Technical Information of China (English)

    LI Xiao-juan; GUO Cheng-jin; LU Wen-jing; DUAN Wei-wei; ZHAO Miao; MA Chun-ying; GU Jun-tao; XIAO Kai

    2014-01-01

    Zinc ifnger protein (ZFP) genes comprise a large and diverse gene family, and are involved in biotic and abiotic stress responses in plants. In this study, a total of 126 ZFP genes classiifed into various types in wheat were characterized and subjected to expression pattern analysis under inorganic phosphate (Pi) deprivation. The wheat ZFP genes and their corresponding GenBank numbers were obtained from the information of a 4×44K wheat gene expression microarray chip. They were conifrmed by sequence similarity analysis and named based on their homologs in Brachypodium distachyon or Oriza sativa. Expression analysis based on the microarray chip revealed that these ZFP genes are categorized into 11 classes according to their gene expression patterns in a 24-h of Pi deprivation regime. Among them, ten genes were differentially up-regulated, ten genes differentially down-regulated, and two genes both differentially up-and down-regulated by Pi deprivation. The differentially up-or down-regulated genes exhibited signiifcantly more or less transcripts at one, two, or all of the checking time points (1, 6, and 24 h) of Pi stress in comparison with those of normal growth, respectively. The both differentially up-and down-regulated genes exhibited contrasting expression patterns, of these, TaWRKY70;5 showed significantly up-regulated at 1 and 6 h and down-regulated at 24 h whereas TaAN1AN20-8;2 displayed signiifcantly upregulated at 1 h and downregulated at 6 h under deprivation Pi condition. Real time PCR analysis conifrmed the expression patterns of the differentially expressed genes obtained by the microarray chip. Our results indicate that numerous ZFP genes in wheat respond to Pi deprivation and have provided further insight into the molecular basis that plants respond to Pi deprivation mediated by the ZFP gene family.

  12. Integrated analysis of DNA copy number and gene expression microarray data using gene sets

    NARCIS (Netherlands)

    R.X. de Menezes (Renee); M. Boetzer (Marten); M. Sieswerda (Melle); G.J.B. van Ommen; J.M. Boer (Judith)

    2009-01-01

    textabstractBackground: Genes that play an important role in tumorigenesis are expected to show association between DNA copy number and RNA expression. Optimal power to find such associations can only be achieved if analysing copy number and gene expression jointly. Furthermore, some copy number

  13. Gene promoter analysis and molecular diagnostics – Application to erythroid-specific gene expression

    NARCIS (Netherlands)

    de Vooght, K.M.K.

    2008-01-01

    Gene expression is regulated at many levels. Most regulation, however, is believed to occur at the level of transcription initiation. Transcription factors, chromatin-modifying enzymes, and basal transcription factors unite to activate genes and are recruited in a precise order to promoters. This th

  14. Sequencing, characterization, and gene expression analysis of the histidine decarboxylase gene cluster of Morganella morganii.

    Science.gov (United States)

    Ferrario, Chiara; Borgo, Francesca; de Las Rivas, Blanca; Muñoz, Rosario; Ricci, Giovanni; Fortina, Maria Grazia

    2014-03-01

    The histidine decarboxylase gene cluster of Morganella morganii DSM30146(T) was sequenced, and four open reading frames, named hdcT1, hdc, hdcT2, and hisRS were identified. Two putative histidine/histamine antiporters (hdcT1 and hdcT2) were located upstream and downstream the hdc gene, codifying a pyridoxal-P dependent histidine decarboxylase, and followed by hisRS gene encoding a histidyl-tRNA synthetase. This organization was comparable with the gene cluster of other known Gram negative bacteria, particularly with that of Klebsiella oxytoca. Recombinant Escherichia coli strains harboring plasmids carrying the M. morganii hdc gene were shown to overproduce histidine decarboxylase, after IPTG induction at 37 °C for 4 h. Quantitative RT-PCR experiments revealed the hdc and hisRS genes were highly induced under acidic and histidine-rich conditions. This work represents the first description and identification of the hdc-related genes in M. morganii. Results support the hypothesis that the histidine decarboxylation reaction in this prolific histamine producing species may play a role in acid survival. The knowledge of the role and the regulation of genes involved in histidine decarboxylation should improve the design of rational strategies to avoid toxic histamine production in foods.

  15. Array-Based Ultrawideband through-Wall Radar: Prediction and Assessment of Real Radar Abilities

    Directory of Open Access Journals (Sweden)

    Nadia Maaref

    2013-01-01

    Full Text Available This paper deals with a new through-the-wall (TTW radar demonstrator for the detection and the localisation of people in a room (in a noncooperative way with the radar situated outside but in the vicinity of the first wall. After modelling the propagation through various walls and quantifying the backscattering by the human body, an analysis of the technical considerations which aims at defining the radar design is presented. Finally, an ultrawideband (UWB frequency modulated continuous wave (FMCW radar is proposed, designed, and implemented. Some representative trials show that this radar is able to localise and track moving people behind a wall in real time.

  16. Discovery of putative capsaicin biosynthetic genes by RNA-Seq and digital gene expression analysis of pepper

    Science.gov (United States)

    Zhang, Zi-Xin; Zhao, Shu-Niu; Liu, Gao-Feng; Huang, Zu-Mei; Cao, Zhen-Mu; Cheng, Shan-Han; Lin, Shi-Sen

    2016-01-01

    The Indian pepper ‘Guijiangwang’ (Capsicum frutescens L.), one of the world’s hottest chili peppers, is rich in capsaicinoids. The accumulation of the alkaloid capsaicin and its analogs in the epidermal cells of the placenta contribute to the pungency of Capsicum fruits. To identify putative genes involved in capsaicin biosynthesis, RNA-Seq was used to analyze the pepper’s expression profiles over five developmental stages. Five cDNA libraries were constructed from the total RNA of placental tissue and sequenced using an Illumina HiSeq 2000. More than 19 million clean reads were obtained from each library, and greater than 50% of the reads were assignable to reference genes. Digital gene expression (DGE) profile analysis using Solexa sequencing was performed at five fruit developmental stages and resulted in the identification of 135 genes of known function; their expression patterns were compared to the capsaicin accumulation pattern. Ten genes of known function were identified as most likely to be involved in regulating capsaicin synthesis. Additionally, 20 new candidate genes were identified related to capsaicin synthesis. We use a combination of RNA-Seq and DGE analyses to contribute to the understanding of the biosynthetic regulatory mechanism(s) of secondary metabolites in a nonmodel plant and to identify candidate enzyme-encoding genes. PMID:27756914

  17. Expression QTL analysis of top loci from GWAS meta-analysis highlights additional schizophrenia candidate genes.

    Science.gov (United States)

    de Jong, Simone; van Eijk, Kristel R; Zeegers, Dave W L H; Strengman, Eric; Janson, Esther; Veldink, Jan H; van den Berg, Leonard H; Cahn, Wiepke; Kahn, René S; Boks, Marco P M; Ophoff, Roel A

    2012-09-01

    There is genetic evidence that schizophrenia is a polygenic disorder with a large number of loci of small effect on disease susceptibility. Genome-wide association studies (GWASs) of schizophrenia have had limited success, with the best finding at the MHC locus at chromosome 6p. A recent effort of the Psychiatric GWAS consortium (PGC) yielded five novel loci for schizophrenia. In this study, we aim to highlight additional schizophrenia susceptibility loci from the PGC study by combining the top association findings from the discovery stage (9394 schizophrenia cases and 12 462 controls) with expression QTLs (eQTLs) and differential gene expression in whole blood of schizophrenia patients and controls. We examined the 6192 single-nucleotide polymorphisms (SNPs) with significance threshold at Pschizophrenia cases and controls (n=202). After correction for multiple testing, the eQTL analysis yielded 40 significant cis-acting effects of the SNPs. Seven of these transcripts show differential expression between cases and controls. Of these, the effect of three genes (RNF5, TRIM26 and HLA-DRB3) coincided with the direction expected from meta-analysis findings and were all located within the MHC region. Our results identify new genes of interest and highlight again the involvement of the MHC region in schizophrenia susceptibility.

  18. Cognitive analysis of schizophrenia risk genes that function as epigenetic regulators of gene expression.

    Science.gov (United States)

    Whitton, Laura; Cosgrove, Donna; Clarkson, Christopher; Harold, Denise; Kendall, Kimberley; Richards, Alex; Mantripragada, Kiran; Owen, Michael J; O'Donovan, Michael C; Walters, James; Hartmann, Annette; Konte, Betina; Rujescu, Dan; Gill, Michael; Corvin, Aiden; Rea, Stephen; Donohoe, Gary; Morris, Derek W

    2016-12-01

    Epigenetic mechanisms are an important heritable and dynamic means of regulating various genomic functions, including gene expression, to orchestrate brain development, adult neurogenesis, and synaptic plasticity. These processes when perturbed are thought to contribute to schizophrenia pathophysiology. A core feature of schizophrenia is cognitive dysfunction. For genetic disorders where cognitive impairment is more severe such as intellectual disability, there are a disproportionally high number of genes involved in the epigenetic regulation of gene transcription. Evidence now supports some shared genetic aetiology between schizophrenia and intellectual disability. GWAS have identified 108 chromosomal regions associated with schizophrenia risk that span 350 genes. This study identified genes mapping to those loci that have epigenetic functions, and tested the risk alleles defining those loci for association with cognitive deficits. We developed a list of 350 genes with epigenetic functions and cross-referenced this with the GWAS loci. This identified eight candidate genes: BCL11B, CHD7, EP300, EPC2, GATAD2A, KDM3B, RERE, SATB2. Using a dataset of Irish psychosis cases and controls (n = 1235), the schizophrenia risk SNPs at these loci were tested for effects on IQ, working memory, episodic memory, and attention. Strongest associations were for rs6984242 with both measures of IQ (P = 0.001) and episodic memory (P = 0.007). We link rs6984242 to CHD7 via a long range eQTL. These associations were not replicated in independent samples. Our study highlights that a number of genes mapping to risk loci for schizophrenia may function as epigenetic regulators of gene expression but further studies are required to establish a role for these genes in cognition. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

  19. Gene-Based Analysis of Regionally Enriched Cortical Genes in GWAS Data Sets of Cognitive Traits and Psychiatric Disorders

    DEFF Research Database (Denmark)

    Ersland, Kari M; Christoforou, Andrea; Stansberg, Christine;

    2012-01-01

    the regionally enriched cortical genes to mine a genome-wide association study (GWAS) of the Norwegian Cognitive NeuroGenetics (NCNG) sample of healthy adults for association to nine psychometric tests measures. In addition, we explored GWAS data sets for the serious psychiatric disorders schizophrenia (SCZ) (n......Despite its estimated high heritability, the genetic architecture leading to differences in cognitive performance remains poorly understood. Different cortical regions play important roles in normal cognitive functioning and impairment. Recently, we reported on sets of regionally enriched genes...... in three different cortical areas (frontomedial, temporal and occipital cortices) of the adult rat brain. It has been suggested that genes preferentially, or specifically, expressed in one region or organ reflect functional specialisation. Employing a gene-based approach to the analysis, we used...

  20. Transcriptomic analysis in the developing zebrafish embryo after compound exposure: Individual gene expression and pathway regulation

    Energy Technology Data Exchange (ETDEWEB)

    Hermsen, Sanne A.B., E-mail: Sanne.Hermsen@rivm.nl [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Department of Toxicogenomics, Maastricht University, P.O. Box 616, 6200 MD, Maastricht (Netherlands); Institute for Risk Assessment Sciences (IRAS), Utrecht University, P.O. Box 80.178, 3508 TD, Utrecht (Netherlands); Pronk, Tessa E. [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Department of Toxicogenomics, Maastricht University, P.O. Box 616, 6200 MD, Maastricht (Netherlands); Brandhof, Evert-Jan van den [Centre for Environmental Quality, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Ven, Leo T.M. van der [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Piersma, Aldert H. [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Institute for Risk Assessment Sciences (IRAS), Utrecht University, P.O. Box 80.178, 3508 TD, Utrecht (Netherlands)

    2013-10-01

    The zebrafish embryotoxicity test is a promising alternative assay for developmental toxicity. Classically, morphological assessment of the embryos is applied to evaluate the effects of compound exposure. However, by applying differential gene expression analysis the sensitivity and predictability of the test may be increased. For defining gene expression signatures of developmental toxicity, we explored the possibility of using gene expression signatures of compound exposures based on commonly expressed individual genes as well as based on regulated gene pathways. Four developmental toxic compounds were tested in concentration-response design, caffeine, carbamazepine, retinoic acid and valproic acid, and two non-embryotoxic compounds, D-mannitol and saccharin, were included. With transcriptomic analyses we were able to identify commonly expressed genes, which were mostly development related, after exposure to the embryotoxicants. We also identified gene pathways regulated by the embryotoxicants, suggestive of their modes of action. Furthermore, whereas pathways may be regulated by all compounds, individual gene expression within these pathways can differ for each compound. Overall, the present study suggests that the use of individual gene expression signatures as well as pathway regulation may be useful starting points for defining gene biomarkers for predicting embryotoxicity. - Highlights: • The zebrafish embryotoxicity test in combination with transcriptomics was used. • We explored two approaches of defining gene biomarkers for developmental toxicity. • Four compounds in concentration-response design were tested. • We identified commonly expressed individual genes as well as regulated gene pathways. • Both approaches seem suitable starting points for defining gene biomarkers.

  1. Molecular responses and expression analysis of genes in a ...

    African Journals Online (AJOL)

    STORAGESEVER

    2009-06-17

    Jun 17, 2009 ... regulatory genes (HaDR5, HaDR9); senescence-associated genes. (HaDR22, HaDR26); ..... The shortened replicative life span of prohibitin mutants of yeast appears to be ... Aging Cell 1: 149-157. Pnueli L, Hallak-Herr E, ...

  2. Application of Euclidean distance measurement and principal component analysis for gene identification.

    Science.gov (United States)

    Ghosh, Antara; Barman, Soma

    2016-06-01

    Gene systems are extremely complex, heterogeneous, and noisy in nature. Many statistical tools which are used to extract relevant feature from genes provide fuzzy and ambiguous information. High-dimensional gene expression database available in public domain usually contains thousands of genes. Efficient prediction method is demanding nowadays for accurate identification of such database. Euclidean distance measurement and principal component analysis methods are applied on such databases to identify the genes. In both methods, prediction algorithm is based on homology search approach. Digital Signal Processing technique along with statistical method is used for analysis of genes in both cases. A two-level decision logic is used for gene classification as healthy or cancerous. This binary logic minimizes the prediction error and improves prediction accuracy. Superiority of the method is judged by receiver operating characteristic curve.

  3. Identification of the key regulating genes of diminished ovarian reserve (DOR) by network and gene ontology analysis.

    Science.gov (United States)

    Pashaiasl, Maryam; Ebrahimi, Mansour; Ebrahimie, Esmaeil

    2016-09-01

    Diminished ovarian reserve (DOR) is one of the reasons for infertility that not only affects both older and young women. Ovarian reserve assessment can be used as a new prognostic tool for infertility treatment decision making. Here, up- and down-regulated gene expression profiles of granulosa cells were analysed to generate a putative interaction map of the involved genes. In addition, gene ontology (GO) analysis was used to get insight intol the biological processes and molecular functions of involved proteins in DOR. Eleven up-regulated genes and nine down-regulated genes were identified and assessed by constructing interaction networks based on their biological processes. PTGS2, CTGF, LHCGR, CITED, SOCS2, STAR and FSTL3 were the key nodes in the up-regulated networks, while the IGF2, AMH, GREM, and FOXC1 proteins were key in the down-regulated networks. MIRN101-1, MIRN153-1 and MIRN194-1 inhibited the expression of SOCS2, while CSH1 and BMP2 positively regulated IGF1 and IGF2. Ossification, ovarian follicle development, vasculogenesis, sequence-specific DNA binding transcription factor activity, and golgi apparatus are the major differential groups between up-regulated and down-regulated genes in DOR. Meta-analysis of publicly available transcriptomic data highlighted the high coexpression of CTGF, connective tissue growth factor, with the other key regulators of DOR. CTGF is involved in organ senescence and focal adhesion pathway according to GO analysis. These findings provide a comprehensive system biology based insight into the aetiology of DOR through network and gene ontology analyses.

  4. Diplotype Trend Regression Analysis of the ADH Gene Cluster and the ALDH2 Gene: Multiple Significant Associations with Alcohol Dependence

    Science.gov (United States)

    Luo, Xingguang; Kranzler, Henry R.; Zuo, Lingjun; Wang, Shuang; Schork, Nicholas J.; Gelernter, Joel

    2006-01-01

    The set of alcohol-metabolizing enzymes has considerable genetic and functional complexity. The relationships between some alcohol dehydrogenase (ADH) and aldehyde dehydrogenase (ALDH) genes and alcohol dependence (AD) have long been studied in many populations, but not comprehensively. In the present study, we genotyped 16 markers within the ADH gene cluster (including the ADH1A, ADH1B, ADH1C, ADH5, ADH6, and ADH7 genes), 4 markers within the ALDH2 gene, and 38 unlinked ancestry-informative markers in a case-control sample of 801 individuals. Associations between markers and disease were analyzed by a Hardy-Weinberg equilibrium (HWE) test, a conventional case-control comparison, a structured association analysis, and a novel diplotype trend regression (DTR) analysis. Finally, the disease alleles were fine mapped by a Hardy-Weinberg disequilibrium (HWD) measure (J). All markers were found to be in HWE in controls, but some markers showed HWD in cases. Genotypes of many markers were associated with AD. DTR analysis showed that ADH5 genotypes and diplotypes of ADH1A, ADH1B, ADH7, and ALDH2 were associated with AD in European Americans and/or African Americans. The risk-influencing alleles were fine mapped from among the markers studied and were found to coincide with some well-known functional variants. We demonstrated that DTR was more powerful than many other conventional association methods. We also found that several ADH genes and the ALDH2 gene were susceptibility loci for AD, and the associations were best explained by several independent risk genes. PMID:16685648

  5. Integrated exon level expression analysis of driver genes explain their role in colorectal cancer.

    Directory of Open Access Journals (Sweden)

    Mohammad Azhar Aziz

    Full Text Available Integrated analysis of genomic and transcriptomic level changes holds promise for a better understanding of colorectal cancer (CRC biology. There is a pertinent need to explain the functional effect of genome level changes by integrating the information at the transcript level. Using high resolution cytogenetics array, we had earlier identified driver genes by 'Genomic Identification of Significant Targets In Cancer (GISTIC' analysis of paired tumour-normal samples from colorectal cancer patients. In this study, we analyze these driver genes at three levels using exon array data--gene, exon and network. Gene level analysis revealed a small subset to experience differential expression. These results were reinforced by carrying out separate differential expression analyses (SAM and LIMMA. ATP8B1 was found to be the novel gene associated with CRC that shows changes at cytogenetic, gene and exon levels. Splice index of 29 exons corresponding to 13 genes was found to be significantly altered in tumour samples. Driver genes were used to construct regulatory networks for tumour and normal groups. There were rearrangements in transcription factor genes suggesting the presence of regulatory switching. The regulatory pattern of AHR gene was found to have the most significant alteration. Our results integrate data with focus on driver genes resulting in highly enriched novel molecules that need further studies to establish their role in CRC.

  6. Genome-Wide Analysis of the RNA Helicase Gene Family in Gossypium raimondii

    Directory of Open Access Journals (Sweden)

    Jie Chen

    2014-03-01

    Full Text Available The RNA helicases, which help to unwind stable RNA duplexes, and have important roles in RNA metabolism, belong to a class of motor proteins that play important roles in plant development and responses to stress. Although this family of genes has been the subject of systematic investigation in Arabidopsis, rice, and tomato, it has not yet been characterized in cotton. In this study, we identified 161 putative RNA helicase genes in the genome of the diploid cotton species Gossypium raimondii. We classified these genes into three subfamilies, based on the presence of either a DEAD-box (51 genes, DEAH-box (52 genes, or DExD/H-box (58 genes in their coding regions. Chromosome location analysis showed that the genes that encode RNA helicases are distributed across all 13 chromosomes of G. raimondii. Syntenic analysis revealed that 62 of the 161 G. raimondii helicase genes (38.5% are within the identified syntenic blocks. Sixty-six (40.99% helicase genes from G. raimondii have one or several putative orthologs in tomato. Additionally, GrDEADs have more conserved gene structures and more simple domains than GrDEAHs and GrDExD/Hs. Transcriptome sequencing data demonstrated that many of these helicases, especially GrDEADs, are highly expressed at the fiber initiation stage and in mature leaves. To our knowledge, this is the first report of a genome-wide analysis of the RNA helicase gene family in cotton.

  7. A Multiple Beamforming Network for Unequally Spaced Linear Array Based on CORPS

    Directory of Open Access Journals (Sweden)

    Armando Arce

    2015-01-01

    Full Text Available This paper proposes an alternative and innovative way to design a simpler beamforming network (BFN based on balancing alternated power combiners and dividers, to feed a nonuniformly spaced linear array with Gaussian amplitude and coherent (in-phase signals. Thus, a two-beam design configuration of the feeding network for a nonuniform array with beam steering capability is proposed and analyzed. The nonuniform aperture and the complex inputs of the feeding network are optimized by means of a differential evolution algorithm. In addition, a comparative analysis between a uniform and nonuniform linear array with the proposed feeding network is performed. Simulation results show the advantages and effectiveness of the proposed feeding network exploiting the nonuniformity of the antenna elements, in terms of side lobe level and directivity. Furthermore, research results show an inherent reduction in hardware complexity of the network.

  8. Investigation of Reference Genes in Vibrio parahaemolyticus for Gene Expression Analysis Using Quantitative RT-PCR.

    Directory of Open Access Journals (Sweden)

    Yue-Jiao Ma

    Full Text Available Vibrio parahaemolyticus is a significant human pathogen capable of causing foodborne gastroenteritis associated with the consumption of contaminated raw or undercooked seafood. Quantitative RT-PCR (qRT-PCR is a useful tool for studying gene expression in V. parahaemolyticus to characterize its virulence factors and understand the effect of environmental conditions on its pathogenicity. However, there is not a stable gene in V. parahaemolyticus that has been identified for use as a reference gene for qRT-PCR. This study evaluated the stability of 6 reference genes (16S rRNA, recA, rpoS, pvsA, pvuA, and gapdh in 5 V. parahaemolyticus strains (O3:K6-clinical strain-tdh+, ATCC33846-tdh+, ATCC33847-tdh+, ATCC17802-trh+, and F13-environmental strain-tdh+ cultured at 4 different temperatures (15, 25, 37 and 42°C. Stability values were calculated using GeNorm, NormFinder, BestKeeper, and Delta CT algorithms. The results indicated that recA was the most stably expressed gene in the V. parahaemolyticus strains cultured at different temperatures. This study examined multiple V. parahaemolyticus strains and growth temperatures, hence the finding provided stronger evidence that recA can be used as a reference gene for gene expression studies in V. parahaemolyticus.

  9. Investigation of Reference Genes in Vibrio parahaemolyticus for Gene Expression Analysis Using Quantitative RT-PCR.

    Science.gov (United States)

    Ma, Yue-Jiao; Sun, Xiao-Hong; Xu, Xiao-Yan; Zhao, Yong; Pan, Ying-Jie; Hwang, Cheng-An; Wu, Vivian C H

    2015-01-01

    Vibrio parahaemolyticus is a significant human pathogen capable of causing foodborne gastroenteritis associated with the consumption of contaminated raw or undercooked seafood. Quantitative RT-PCR (qRT-PCR) is a useful tool for studying gene expression in V. parahaemolyticus to characterize its virulence factors and understand the effect of environmental conditions on its pathogenicity. However, there is not a stable gene in V. parahaemolyticus that has been identified for use as a reference gene for qRT-PCR. This study evaluated the stability of 6 reference genes (16S rRNA, recA, rpoS, pvsA, pvuA, and gapdh) in 5 V. parahaemolyticus strains (O3:K6-clinical strain-tdh+, ATCC33846-tdh+, ATCC33847-tdh+, ATCC17802-trh+, and F13-environmental strain-tdh+) cultured at 4 different temperatures (15, 25, 37 and 42°C). Stability values were calculated using GeNorm, NormFinder, BestKeeper, and Delta CT algorithms. The results indicated that recA was the most stably expressed gene in the V. parahaemolyticus strains cultured at different temperatures. This study examined multiple V. parahaemolyticus strains and growth temperatures, hence the finding provided stronger evidence that recA can be used as a reference gene for gene expression studies in V. parahaemolyticus.

  10. Analysis of Stage-Specific Gene Expression Profiles in the Uterine Endometrium during Pregnancy in Pigs.

    Science.gov (United States)

    Kim, Mingoo; Seo, Heewon; Choi, Yohan; Yoo, Inkyu; Seo, Minseok; Lee, Chang-Kyu; Kim, Heebal; Ka, Hakhyun

    2015-01-01

    The uterine endometrium plays a critical role in regulating the estrous cycle and the establishment and maintenance of pregnancy in mammalian species. Many studies have investigated the expression and function of genes in the uterine endometrium, but the global expression pattern of genes and relationships among genes differentially expressed in the uterine endometrium during gestation in pigs remain unclear. Thus, this study investigated global gene expression profiles using microarray in pigs. Diverse transcriptome analyses including clustering, network, and differentially expressed gene (DEG) analyses were performed to detect endometrial gene expression changes during the different gestation stages. In total, 6,991 genes were found to be differentially expressed by comparing genes expressed on day (D) 12 of pregnancy with those on D15, D30, D60, D90 and D114 of pregnancy, and clustering analysis of detected DEGs distinguished 8 clusters. Furthermore, several pregnancy-related hub genes such as ALPPL2, RANBP17, NF1B, SPP1, and CST6 were discovered through network analysis. Finally, detected hub genes were technically validated by quantitative RT-PCR. These results suggest the complex network characteristics involved in uterine endometrial gene expression during pregnancy and indicate that diverse patterns of stage-specific gene expression and network connections may play a critical role in endometrial remodeling and in placental and fetal development to establish and maintenance of pregnancy in pigs.

  11. Analysis of Stage-Specific Gene Expression Profiles in the Uterine Endometrium during Pregnancy in Pigs.

    Directory of Open Access Journals (Sweden)

    Mingoo Kim

    Full Text Available The uterine endometrium plays a critical role in regulating the estrous cycle and the establishment and maintenance of pregnancy in mammalian species. Many studies have investigated the expression and function of genes in the uterine endometrium, but the global expression pattern of genes and relationships among genes differentially expressed in the uterine endometrium during gestation in pigs remain unclear. Thus, this study investigated global gene expression profiles using microarray in pigs. Diverse transcriptome analyses including clustering, network, and differentially expressed gene (DEG analyses were performed to detect endometrial gene expression changes during the different gestation stages. In total, 6,991 genes were found to be differentially expressed by comparing genes expressed on day (D 12 of pregnancy with those on D15, D30, D60, D90 and D114 of pregnancy, and clustering analysis of detected DEGs distinguished 8 clusters. Furthermore, several pregnancy-related hub genes such as ALPPL2, RANBP17, NF1B, SPP1, and CST6 were discovered through network analysis. Finally, detected hub genes were technically validated by quantitative RT-PCR. These results suggest the complex network characteristics involved in uterine endometrial gene expression during pregnancy and indicate that diverse patterns of stage-specific gene expression and network connections may play a critical role in endometrial remodeling and in placental and fetal development to establish and maintenance of pregnancy in pigs.

  12. Gene expression analysis uncovers novel hedgehog interacting protein (HHIP) effects in human bronchial epithelial cells.

    Science.gov (United States)

    Zhou, Xiaobo; Qiu, Weiliang; Sathirapongsasuti, J Fah; Cho, Michael H; Mancini, John D; Lao, Taotao; Thibault, Derek M; Litonjua, Augusto A; Bakke, Per S; Gulsvik, Amund; Lomas, David A; Beaty, Terri H; Hersh, Craig P; Anderson, Christopher; Geigenmuller, Ute; Raby, Benjamin A; Rennard, Stephen I; Perrella, Mark A; Choi, Augustine M K; Quackenbush, John; Silverman, Edwin K

    2013-05-01

    Hedgehog interacting protein (HHIP) was implicated in chronic obstructive pulmonary disease (COPD) by genome-wide association studies (GWAS). However, it remains unclear how HHIP contributes to COPD pathogenesis. To identify genes regulated by HHIP, we performed gene expression microarray analysis in a human bronchial epithelial cell line (Beas-2B) stably infected with HHIP shRNAs. HHIP silencing led to differential expression of 296 genes; enrichment for variants nominally associated with COPD was found. Eighteen of the differentially expressed genes were validated by real-time PCR in Beas-2B cells. Seven of 11 validated genes tested in human COPD and control lung tissues demonstrated significant gene expression differences. Functional annotation indicated enrichment for extracellular matrix and cell growth genes. Network modeling demonstrated that the extracellular matrix and cell proliferation genes influenced by HHIP tended to be interconnected. Thus, we identified potential HHIP targets in human bronchial epithelial cells that may contribute to COPD pathogenesis.

  13. Topology association analysis in weighted protein interaction network for gene prioritization

    Science.gov (United States)

    Wu, Shunyao; Shao, Fengjing; Zhang, Qi; Ji, Jun; Xu, Shaojie; Sun, Rencheng; Sun, Gengxin; Du, Xiangjun; Sui, Yi

    2016-11-01

    Although lots of algorithms for disease gene prediction have been proposed, the weights of edges are rarely taken into account. In this paper, the strengths of topology associations between disease and essential genes are analyzed in weighted protein interaction network. Empirical analysis demonstrates that compared to other genes, disease genes are weakly connected with essential genes in protein interaction network. Based on this finding, a novel global distance measurement for gene prioritization with weighted protein interaction network is proposed in this paper. Positive and negative flow is allocated to disease and essential genes, respectively. Additionally network propagation model is extended for weighted network. Experimental results on 110 diseases verify the effectiveness and potential of the proposed measurement. Moreover, weak links play more important role than strong links for gene prioritization, which is meaningful to deeply understand protein interaction network.

  14. ErmineJ: Tool for functional analysis of gene expression data sets

    Directory of Open Access Journals (Sweden)

    Braynen William

    2005-11-01

    Full Text Available Abstract Background It is common for the results of a microarray study to be analyzed in the context of biologically-motivated groups of genes such as pathways or Gene Ontology categories. The most common method for such analysis uses the hypergeometric distribution (or a related technique to look for "over-representation" of groups among genes selected as being differentially expressed or otherwise of interest based on a gene-by-gene analysis. However, this method suffers from some limitations, and biologist-friendly tools that implement alternatives have not been reported. Results We introduce ErmineJ, a multiplatform user-friendly stand-alone software tool for the analysis of functionally-relevant sets of genes in the context of microarray gene expression data. ErmineJ implements multiple algorithms for gene set analysis, including over-representation and resampling-based methods that focus on gene scores or correlation of gene expression profiles. In addition to a graphical user interface, ErmineJ has a command line interface and an application programming interface that can be used to automate analyses. The graphical user interface includes tools for creating and modifying gene sets, visualizing the Gene Ontology as a table or tree, and visualizing gene expression data. ErmineJ comes with a complete user manual, and is open-source software licensed under the Gnu Public License. Conclusion The availability of multiple analysis algorithms, together with a rich feature set and simple graphical interface, should make ErmineJ a useful addition to the biologist's informatics toolbox. ErmineJ is available from http://microarray.cu.genome.org.

  15. Global gene expression analysis of apple fruit development from the floral bud to ripe fruit

    Directory of Open Access Journals (Sweden)

    McArtney Steve

    2008-02-01

    Full Text Available Abstract Background Apple fruit develop over a period of 150 days from anthesis to fully ripe. An array representing approximately 13000 genes (15726 oligonucleotides of 45–55 bases designed from apple ESTs has been used to study gene expression over eight time points during fruit development. This analysis of gene expression lays the groundwork for a molecular understanding of fruit growth and development in apple. Results Using ANOVA analysis of the microarray data, 1955 genes showed significant changes in expression over this time course. Expression of genes is coordinated with four major patterns of expression observed: high in floral buds; high during cell division; high when starch levels and cell expansion rates peak; and high during ripening. Functional analysis associated cell cycle genes with early fruit development and three core cell cycle genes are significantly up-regulated in the early stages of fruit development. Starch metabolic genes were associated with changes in starch levels during fruit development. Comparison with microarrays of ethylene-treated apple fruit identified a group of ethylene induced genes also induced in normal fruit ripening. Comparison with fruit development microarrays in tomato has been used to identify 16 genes for which expression patterns are similar in apple and tomato and these genes may play fundamental roles in fruit development. The early phase of cell division and tissue specification that occurs in the first 35 days after pollination has been associated with up-regulation of a cluster of genes that includes core cell cycle genes. Conclusion Gene expression in apple fruit is coordinated with specific developmental stages. The array results are reproducible and comparisons with experiments in other species has been used to identify genes that may play a fundamental role in fruit development.

  16. In silico Analysis of the Potential Infection Mechanisms of Magnaporthe grisea from Horizontal Gene Transfer Hypothesis

    Institute of Scientific and Technical Information of China (English)

    Chunyang Li; Ying Wang; Hao Peng; Hejiao Bian; Mingwei Min; Longfei Chen; Qian Liu; Jinku Bao

    2009-01-01

    Horizontal gene transfer(HGT)has long been considered as a principal force for an organism to gain novel genes in genome evolution. Homology search, phylogenetic analysis and nucleotide composition analysis are three major objective approaches to arguably determine the occurrence and directionality of HGT. Here, 21 genes that possess the potential to horizontal transfer were acquired from the whole genome of Magnaporthe grisea according to annotation, among which three can-didate genes(corresponding protein accession numbers are EAA55123, EAA47200 and EAA52136)were selected for further analysis. According to BLAST homology results, we subsequently conducted phylogenetic analysis of the three candidate HGT genes. Moreover, nucleotide composition analysis was conducted to further validate these HGTs. In addition, the functions of the three candidate genes were searched in COG database. Consequently, we conclude that the gene encoding protein EAA55123 is transferred from Clostridium perfringens. Another HGT event is between EAA52136 and a certain metazoan's corresponding gene, but the direction remains uncertain. Yet, EAA47200 is not a transferred gene.

  17. Gene expression analysis of precision cut human liver slices indicate stable expression of ADME-Tox related genes

    NARCIS (Netherlands)

    Elferink, M.; Olinga, P.; Van Leeuwen, E.; Bauerschmidt, S.; Polman, J.; Schoonen, W.; Heisterkamp, S.; Groothuis, G.

    2010-01-01

    In the process of drug development it is of high importance to test the safety of new drugs with predictive value for human toxicity. Currently, for toxicity studies toxicogenomic analysis of changes in gene expression profile of the liver is increasingly applied. Toxicity screening based on animal

  18. Comparative analysis of Phytophthora genes encoding secreted proteins reveals conserved synteny and lineage-specific gene duplications and deletions

    NARCIS (Netherlands)

    Jiang, R.H.Y.; Tyler, B.M.; Govers, F.

    2006-01-01

    Comparative analysis of two Phytophthora genomes revealed overall colinearity in four genomic regions consisting of a 1.5-Mb sequence of Phytophthora sojae and a 0.9-Mb sequence of R ramorum. In these regions with conserved synteny, the gene order is largely similar; however, genome rearrangements a

  19. Disease gene characterization through large-scale co-expression analysis.

    Directory of Open Access Journals (Sweden)

    Allen Day

    Full Text Available BACKGROUND: In the post genome era, a major goal of biology is the identification of specific roles for individual genes. We report a new genomic tool for gene characterization, the UCLA Gene Expression Tool (UGET. RESULTS: Celsius, the largest co-normalized microarray dataset of Affymetrix based gene expression, was used to calculate the correlation between all possible gene pairs on all platforms, and generate stored indexes in a web searchable format. The size of Celsius makes UGET a powerful gene characterization tool. Using a small seed list of known cartilage-selective genes, UGET extended the list of known genes by identifying 32 new highly cartilage-selective genes. Of these, 7 of 10 tested were validated by qPCR including the novel cartilage-specific genes SDK2 and FLJ41170. In addition, we retrospectively tested UGET and other gene expression based prioritization tools to identify disease-causing genes within known linkage intervals. We first demonstrated this utility with UGET using genetically heterogeneous disorders such as Joubert syndrome, microcephaly, neuropsychiatric disorders and type 2 limb girdle muscular dystrophy (LGMD2 and then compared UGET to other gene expression based prioritization programs which use small but discrete and well annotated datasets. Finally, we observed a significantly higher gene correlation shared between genes in disease networks associated with similar complex or Mendelian disorders. DISCUSSION: UGET is an invaluable resource for a geneticist that permits the rapid inclusion of expression criteria from one to hundreds of genes in genomic intervals linked to disease. By using thousands of arrays UGET annotates and prioritizes genes better than other tools especially with rare tissue disorders or complex multi-tissue biological processes. This information can be critical in prioritization of candidate genes for sequence analysis.

  20. When is hub gene selection better than standard meta-analysis?

    Directory of Open Access Journals (Sweden)

    Peter Langfelder

    Full Text Available Since hub nodes have been found to play important roles in many networks, highly connected hub genes are expected to play an important role in biology as well. However, the empirical evidence remains ambiguous. An open question is whether (or when hub gene selection leads to more meaningful gene lists than a standard statistical analysis based on significance testing when analyzing genomic data sets (e.g., gene expression or DNA methylation data. Here we address this question for the special case when multiple genomic data sets are available. This is of great practical importance since for many research questions multiple data sets are publicly available. In this case, the data analyst can decide between a standard statistical approach (e.g., based on meta-analysis and a co-expression network analysis approach that selects intramodular hubs in consensus modules. We assess the performance of these two types of approaches according to two criteria. The first criterion evaluates the biological insights gained and is relevant in basic research. The second criterion evaluates the validation success (reproducibility in independent data sets and often applies in clinical diagnostic or prognostic applications. We compare meta-analysis with consensus network analysis based on weighted correlation network analysis (WGCNA in three comprehensive and unbiased empirical studies: (1 Finding genes predictive of lung cancer survival, (2 finding methylation markers related to age, and (3 finding mouse genes related to total cholesterol. The results demonstrate that intramodular hub gene status with respect to consensus modules is more useful than a meta-analysis p-value when identifying biologically meaningful gene lists (reflecting criterion 1. However, standard meta-analysis methods perform as good as (if not better than a consensus network approach in terms of validation success (criterion 2. The article also reports a comparison of meta-analysis techniques

  1. Analysis of Androgen Receptor Gene Mutations in female with infertility

    Directory of Open Access Journals (Sweden)

    Soyar Sari

    2017-06-01

    Full Text Available Background : Infertility is a multifactorial disease. Hormonal disorders and genetic factors are important in female infertility. Development and maturation of ovulation are depending on the molecular signaling pathways in response to androgens. Over hundreds of mutations leading to resistance gene function in androgen receptor (AR has been recorded. One of them is polymorphic region 5'UTR. Thus regarding to the role of androgen receptor in infertility, the aim of the present study was to investigate the association between gene mutations AR and infertility in Iranian women Materials and Methods: In this study of 50 infertile women and 80 healthy women as a control, blood samples were taken. After extraction of DNA, PCR method was used to determine the AR gene mutations. Results: In the present study in '5UTR area at position +25 androgen receptor gene a T nucleotide deletion was observed. , therefore single nucleotide mutations did not change in the androgen receptor gene expression, so indicates the lack of communication between the AR gene mutations in the promoter region of 23 to 214+ in women with infertility. According to the results of this study are significant differences between the two groups of patients and healthy women was not found (P=0.5. Conclusion: Results indicated no correlation between mutations in the promoter region of 23 to 214+ AR genes in the population studied women with infertility

  2. Apoptotic gene analysis in idiopathic talipes equinovarus (clubfoot).

    Science.gov (United States)

    Ester, Audrey R; Tyerman, Gayle; Wise, Carol A; Blanton, Susan H; Hecht, Jacqueline T

    2007-09-01

    Idiopathic talipes equinovarus, also known as clubfoot, is a common birth defect occurring in one of 1000 live births. It is a complex disorder in which multiple genes and environmental factors may play an etiologic role. Several chromosomal deletion regions, including 2q31-33, are associated with talipes equinovarus and may harbor genes that contribute to the idiopathic talipes equinovarus phenotype. Previously, two STRs in the 2q31-33, GATA149B10 and D2S1371, showed linkage with association to idiopathic talipes equinovarus. Single nucleotide polymorphisms (SNPs) in three apoptotic genes (Casp8, Casp10, and CFLAR) near GATA149B10 were genotyped in idiopathic talipes equinovarus families. rs3731714 in Casp10 showed linkage with association, suggesting variation in the apoptotic gene pathway, which is important in limb morphogenesis, and may play a role in the development of idiopathic talipes equinovarus. We genotyped SNPs spanning seven apoptotic genes-Casp3, Casp8, Casp9, Casp10, Bid, Bcl-2 and Apaf1-in 210 simplex trios and 139 multiplex families and tested for link-age and association to idiopathic talipes equinovarus. One SNP in each of the genes provided suggestive evidence of association with idiopathic talipes equinovarus. Several haplotypes constructed from these SNPs displayed altered transmission. These data suggest genetic variation in apoptotic genes may play a role in development of idiopathic talipes equinovarus.

  3. Weighted near-field focusing in an array-based GPR

    Science.gov (United States)

    Savelyev, Timofey G.; Yarovoy, Alexander G.; Ligthart, Leo P.

    2008-08-01

    This paper presents a 3-D imaging technique for an ultra-wideband (UWB) ground penetrating radar (GPR) with a single transmit antenna and a linear receive array. The video impulse GPR working in the frequency band of 0.3-3 GHz has been designed in IRCTR for landmine detection, i.e., for a near-field application. Installed on a vehicle it can image in one mechanical scan a strip of 84 cm width due to the length of array aperture. The imaging is done by software means only. The developed imaging technique combines a real aperture focusing in the array plane with a synthetic aperture focusing in the mechanical scan direction. To compensate for parasitic time delays in the array channels, a calibration procedure is also described. Owing to directional properties of transmit antenna, the distribution of signal strength over the array is nonuniform that requires an amplitude correction when focusing the real aperture. The authors analyzed how this affects the footprint of the focused array, its cross-range resolution capability and the image quality of antipersonnel plastic landmines which were buried under different array channels. The analysis bases on experimental data sets acquired in the facilities of IRCTR and TNO-DSS. As a result, the authors propose a weighted array focusing that improves the cross-range resolution and provides proper imaging of typical buried landmines.

  4. Pulsar timing array based search for supermassive black hole binaries in the SKA era

    CERN Document Server

    Wang, Yan

    2016-01-01

    The advent of next generation radio telescope facilities, such as the Square Kilometer Array (SKA), will usher in an era where a Pulsar Timing Array (PTA) based search for gravitational waves (GWs) will be able to use hundreds of well timed millisecond pulsars rather than the few dozens in existing PTAs. A realistic assessment of the performance of such an extremely large PTA must take into account the data analysis challenge posed by an exponential increase in the parameter space volume due to the large number of so-called pulsar phase parameters. We address this problem and present such an assessment for isolated supermassive black hole binary (SMBHB) searches using a SKA era PTA containing $10^3$ pulsars. We find that an all-sky search will be able to confidently detect non-evolving sources with redshifted chirp mass of $10^{10}$ $M_\\odot$ out to a redshift of about $28$. The detection of GW signals from optically identified SMBHB candidates similar to PSO J334+01 is assured. If no SMBHB detections occur, ...

  5. Identification of plasma biomarker candidates in glioblastoma using an antibody-array-based proteomic approach.

    Science.gov (United States)

    Zupancic, Klemen; Blejec, Andrej; Herman, Ana; Veber, Matija; Verbovsek, Urska; Korsic, Marjan; Knezevic, Miomir; Rozman, Primoz; Turnsek, Tamara Lah; Gruden, Kristina; Motaln, Helena

    2014-09-01

    Glioblastoma multiforme (GBM) is a brain tumour with a very high patient mortality rate, with a median survival of 47 weeks. This might be improved by the identification of novel diagnostic, prognostic and predictive therapy-response biomarkers, preferentially through the monitoring of the patient blood. The aim of this study was to define the impact of GBM in terms of alterations of the plasma protein levels in these patients. We used a commercially available antibody array that includes 656 antibodies to analyse blood plasma samples from 17 healthy volunteers in comparison with 17 blood plasma samples from patients with GBM. We identified 11 plasma proteins that are statistically most strongly associated with the presence of GBM. These proteins belong to three functional signalling pathways: T-cell signalling and immune responses; cell adhesion and migration; and cell-cycle control and apoptosis. Thus, we can consider this identified set of proteins as potential diagnostic biomarker candidates for GBM. In addition, a set of 16 plasma proteins were significantly associated with the overall survival of these patients with GBM. Guanine nucleotide binding protein alpha (GNAO1) was associated with both GBM presence and survival of patients with GBM. Antibody array analysis represents a useful tool for the screening of plasma samples for potential cancer biomarker candidates in small-scale exploratory experiments; however, clinical validation of these candidates requires their further evaluation in a larger study on an independent cohort of patients.

  6. Pulsar Timing Array Based Search for Supermassive Black Hole Binaries in the Square Kilometer Array Era.

    Science.gov (United States)

    Wang, Yan; Mohanty, Soumya D

    2017-04-14

    The advent of next generation radio telescope facilities, such as the Square Kilometer Array (SKA), will usher in an era where a pulsar timing array (PTA) based search for gravitational waves (GWs) will be able to use hundreds of well timed millisecond pulsars rather than the few dozens in existing PTAs. A realistic assessment of the performance of such an extremely large PTA must take into account the data analysis challenge posed by an exponential increase in the parameter space volume due to the large number of so-called pulsar phase parameters. We address this problem and present such an assessment for isolated supermassive black hole binary (SMBHB) searches using a SKA era PTA containing 10^{3} pulsars. We find that an all-sky search will be able to confidently detect nonevolving sources with a redshifted chirp mass of 10^{10}  M_{⊙} out to a redshift of about 28 (corresponding to a rest-frame chirp mass of 3.4×10^{8}  M_{⊙}). We discuss the important implications that the large distance reach of a SKA era PTA has on GW observations from optically identified SMBHB candidates. If no SMBHB detections occur, a highly unlikely scenario in the light of our results, the sky-averaged upper limit on strain amplitude will be improved by about 3 orders of magnitude over existing limits.

  7. Intelligent Monitoring and Predicting Output Power Losses of Solar Arrays Based on Particle Filtering

    Directory of Open Access Journals (Sweden)

    Hongzheng Fang

    2013-01-01

    Full Text Available Solar arrays are the main source of energy to the on-orbit satellite, whose output power largely determines the life cycle of on-orbit satellites. Monitoring and further forecasting the output power of solar arrays by using the real-time observational data are very important for the study of satellite design and on-orbit satellite control. In this paper, we firstly describe the dynamical model of output power with summarizing the influencing factors of attenuation for solar arrays and elaborating the evolution trend of influencing factors which change with time. Based on the empirical model, a particle filtering algorithm is formulated to predict the output power of solar arrays and update the model parameters, simultaneously. Finally, using eight-year observational data of voltage and current from a synchronous on-orbit satellite, an experiment is carried out to illustrate the reliability and accuracy of the particle filtering method. Comparative results with classical curve fitting also are presented with statistical root mean square error and mean relative error analysis.

  8. Genomewide analysis of the lateral organ boundaries domain gene family in Vitis vinifera.

    Science.gov (United States)

    Cao, Hui; Liu, Cai-Yun; Liu, Chun-Xiang; Zhao, Yue-Ling; Xu, Rui-Rui

    2016-09-01

    In plants, the transcription factor families have been implicated in many important biological processes. These processes include morphogenesis, signal transduction and environmental stress responses. Proteins containing the lateral organ boundaries domain (LBD), which encodes a zinc finger-like domain are only found in plants. This finding indicates that this unique gene family regulates only plant-specific biological processes. LBD genes play crucial roles in the growth and development of plants such as Arabidopsis, Oryza sativa, Zea mays, poplar, apple and tomato. However, relatively little is known about the LBD genes in grape (Vitis vinifera). In this study, we identified 40 LBD genes in the grape genome. A complete overview of the chromosomal locations, phylogenetic relationships, structures and expression profiles of this gene family during development in grape is presented here. Phylogenetic analysis showed that the LBD genes could be divided into classes I and II, together with LBDs from Arabidopsis. We mapped the 40 LBD genes on the grape chromosomes (chr1-chr19) and found that 37 of the predicted grape LBD genes were distributed in different densities across 12 chromosomes. Grape LBDs were found to share a similar intron/exon structure and gene length within the same class. The expression profiles of grape LBD genes at different developmental stages were analysed using microarray data. Results showed that 21 grape LBD genes may be involved in grape developmental processes, including preveraison, veraison and ripening. Finally, we analysed the expression patterns of six LBD genes through quantitative real-time polymerase chain reation analysis. The six LBD genes showed differential expression patterns among the three representative grape tissues, and five of these genes were found to be involved in responses to mannitol, sodium chloride, heat stress and low temperature treatments. To our knowledge, this is the first study to analyse the LBD gene family in

  9. Genomewide analysis of the lateral organ boundaries domain gene family in Vitis vinifera

    Indian Academy of Sciences (India)

    HUI CAO; CAI-YUN LIU; HUN-XIANG LIU; YUE-LING ZHAO; RUI-RUI XU

    2016-09-01

    In plants, the transcription factor families have been implicated in many important biological processes. These processes include morphogenesis, signal transduction and environmental stress responses. Proteins containing the lateral organ bound-aries domain (LBD), which encodes a zinc finger-like domain are only found in plants. This finding indicates that this unique gene family regulates only plant-specific biological processes. LBD genes play crucial roles in the growth and development of plants such as Arabidopsis, Oryza sativa, Zea mays , poplar, apple and tomato. However, relatively little is known about the LBD genes in grape ( Vitis vinifera). In this study, we identified 40 LBD genes in the grape genome. A complete overview of the chromosomal locations, phylogenetic relationships, structures and expression profiles of this gene family during development in grape is presented here. Phylogenetic analysis showed that the LBD genes could be divided into classes I and II, together with LBDs from Arabidopsis. We mapped the 40 LBD genes on the grape chromosomes (chr1–chr19) and found that 37 of the predicted grape LBD genes were distributed in different densities across 12 chromosomes. Grape LBDs were found to share a similar intron/exon structure and gene length within the same class. The expression profiles of grape LBD genes at different developmental stages were analysed using microarray data. Results showed that 21 grape LBD genes may be involved in grape developmental processes, including preveraison, veraison and ripening. Finally, we analysed the expres-sion patterns of six LBD genes through quantitative real-time polymerase chain reation analysis. The six LBD genes showed differential expression patterns among the three representative grape tissues, and five of these genes were found to be involved in responses to mannitol, sodium chloride, heat stress and low temperature treatments. To our knowledge, this is the first study to analyse the LBD gene

  10. Genome-Wide Analysis of the Aquaporin Gene Family in Chickpea (Cicer arietinum L.).

    Science.gov (United States)

    Deokar, Amit A; Tar'an, Bunyamin

    2016-01-01

    Aquaporins (AQPs) are essential membrane proteins that play critical role in the transport of water and many other solutes across cell membranes. In this study, a comprehensive genome-wide analysis identified 40 AQP genes in chickpea (Cicer arietinum L.). A complete overview of the chickpea AQP (CaAQP) gene family is presented, including their chromosomal locations, gene structure, phylogeny, gene duplication, conserved functional motifs, gene expression, and conserved promoter motifs. To understand AQP's evolution, a comparative analysis of chickpea AQPs with AQP orthologs from soybean, Medicago, common bean, and Arabidopsis was performed. The chickpea AQP genes were found on all of the chickpea chromosomes, except chromosome 7, with a maximum of six genes on chromosome 6, and a minimum of one gene on chromosome 5. Gene duplication analysis indicated that the expansion of chickpea AQP gene family might have been due to segmental and tandem duplications. CaAQPs were grouped into four subfamilies including 15 NOD26-like intrinsic proteins (NIPs), 13 tonoplast intrinsic proteins (TIPs), eight plasma membrane intrinsic proteins (PIPs), and four small basic intrinsic proteins (SIPs) based on sequence similarities and phylogenetic position. Gene structure analysis revealed a highly conserved exon-intron pattern within CaAQP subfamilies supporting the CaAQP family classification. Functional prediction based on conserved Ar/R selectivity filters, Froger's residues, and specificity-determining positions suggested wide differences in substrate specificity among the subfamilies of CaAQPs. Expression analysis of the AQP genes indicated that some of the genes are tissue-specific, whereas few other AQP genes showed differential expression in response to biotic and abiotic stresses. Promoter profiling of CaAQP genes for conserved cis-acting regulatory elements revealed enrichment of cis-elements involved in circadian control, light response, defense and stress responsiveness

  11. Metabolic flux balance analysis and the in silico analysis of Escherichia coli K-12 gene deletions

    Directory of Open Access Journals (Sweden)

    Edwards Jeremy S

    2000-07-01

    Full Text Available Abstract Background Genome sequencing and bioinformatics are producing detailed lists of the molecular components contained in many prokaryotic organisms. From this 'parts catalogue' of a microbial cell, in silico representations of integrated metabolic functions can be constructed and analyzed using flux balance analysis (FBA. FBA is particularly well-suited to study metabolic networks based on genomic, biochemical, and strain specific information. Results Herein, we have utilized FBA to interpret and analyze the metabolic capabilities of Escherichia coli. We have computationally mapped the metabolic capabilities of E. coli using FBA and examined the optimal utilization of the E. coli metabolic pathways as a function of environmental variables. We have used an in silico analysis to identify seven gene products of central metabolism (glycolysis, pentose phosphate pathway, TCA cycle, electron transport system essential for aerobic growth of E. coli on glucose minimal media, and 15 gene products essential for anaerobic growth on glucose minimal media. The in silico tpi-, zwf, and pta- mutant strains were examined in more detail by mapping the capabilities of these in silico isogenic strains. Conclusions We found that computational models of E. coli metabolism based on physicochemical constraints can be used to interpret mutant behavior. These in silica results lead to a further understanding of the complex genotype-phenotype relation. Supplementary information: http://gcrg.ucsd.edu/supplementary_data/DeletionAnalysis/main.htm

  12. Phase analysis of circadian-related genes in two tissues

    Directory of Open Access Journals (Sweden)

    Li Leping

    2006-02-01

    Full Text Available Abstract Background Recent circadian clock studies using gene expression microarray in two different tissues of mouse have revealed not all circadian-related genes are synchronized in phase or peak expression times across tissues in vivo. Instead, some circadian-related genes may be delayed by 4–8 hrs in peak expression in one tissue relative to the other. These interesting biological observations prompt a statistical question regarding how to distinguish the synchronized genes from genes that are systematically lagged in phase/peak expression time across two tissues. Results We propose a set of techniques from circular statistics to analyze phase angles of circadian-related genes in two tissues. We first estimate the phases of a cycling gene separately in each tissue, which are then used to estimate the paired angular difference of the phase angles of the gene in the two tissues. These differences are modeled as a mixture of two von Mises distributions which enables us to cluster genes into two groups; one group having synchronized transcripts with the same phase in the two tissues, the other containing transcripts with a discrepancy in phase between the two tissues. For each cluster of genes we assess the association of phases across the tissue types using circular-circular regression. We also develop a bootstrap methodology based on a circular-circular regression model to evaluate the improvement in fit provided by allowing two components versus a one-component von-Mises model. Conclusion We applied our proposed methodologies to the circadian-related genes common to heart and liver tissues in Storch et al. 2, and found that an estimated 80% of circadian-related transcripts common to heart and liver tissues were synchronized in phase, and the other 20% of transcripts were lagged about 8 hours in liver relative to heart. The bootstrap p-value for being one cluster is 0.063, which suggests the possibility of two clusters. Our methodologies can

  13. Mutational Analysis of the Wilms' Tumor (WTI) Gene.

    Science.gov (United States)

    King-Underwood, L; Pritchard-Jones, K

    1996-01-01

    Mutations of the Wilms' tumor (WT1) gene have been shown to underlie a proportion of cases of Wilms' tumor, an embryonal kidney cancer occurring mainly in childhood. The WTl gene comprtses ten exons spanning approx 50 kb of genomrc DNA. The messenger RNA is approx 3 kb in length and encodes a zinc finger protein. The four zinc fingers, which he at the C-terminal end of the protein, are encoded by separate exons 7-10. The 5' end of the gene is extremely GC-rich, with areas approaching a 70% GC content. This makes this region difficult to amplify in polymerase chain reactions.

  14. A complex network analysis of hypertension-related genes

    Science.gov (United States)

    Wang, Huan; Xu, Chuan-Yun; Hu, Jing-Bo; Cao, Ke-Fei

    2014-01-01

    In this paper, a network of hypertension-related genes is constructed by analyzing the correlations of gene expression data among the Dahl salt-sensitive rat and two consomic rat strains. The numerical calculations show that this sparse and assortative network has small-world and scale-free properties. Further, 16 key hub genes (Col4a1, Lcn2, Cdk4, etc.) are determined by introducing an integrated centrality and have been confirmed by biological/medical research to play important roles in hypertension.

  15. Detection and sequence analysis of accessory gene regulator genes of Staphylococcus pseudintermedius isolates

    OpenAIRE

    M. Ananda Chitra; Jayanthy, C.; Nagarajan, B.

    2015-01-01

    Background: Staphylococcus pseudintermedius (SP) is the major pathogenic species of dogs involved in a wide variety of skin and soft tissue infections. The accessory gene regulator (agr) locus of Staphylococcus aureus has been extensively studied, and it influences the expression of many virulence genes. It encodes a two-component signal transduction system that leads to down-regulation of surface proteins and up-regulation of secreted proteins during in vitro growth of S. aureus. The objecti...

  16. Investigation of Reference Genes in Vibrio parahaemolyticus for Gene Expression Analysis Using Quantitative RT-PCR

    OpenAIRE

    Yue-Jiao Ma; Xiao-Hong Sun; Xiao-Yan Xu; Yong Zhao; Ying-Jie Pan; Cheng-An Hwang; Wu, Vivian C. H.

    2015-01-01

    Vibrio parahaemolyticus is a significant human pathogen capable of causing foodborne gastroenteritis associated with the consumption of contaminated raw or undercooked seafood. Quantitative RT-PCR (qRT-PCR) is a useful tool for studying gene expression in V. parahaemolyticus to characterize its virulence factors and understand the effect of environmental conditions on its pathogenicity. However, there is not a stable gene in V. parahaemolyticus that has been identified for use as a reference ...

  17. Analysis of the retinal gene expression profile after hypoxic preconditioning identifies candidate genes for neuroprotection

    Directory of Open Access Journals (Sweden)

    Wenzel Andreas

    2008-02-01

    Full Text Available Abstract Background Retinal degeneration is a main cause of blindness in humans. Neuroprotective therapies may be used to rescue retinal cells and preserve vision. Hypoxic preconditioning stabilizes the transcription factor HIF-1α in the retina and strongly protects photoreceptors in an animal model of light-induced retinal degeneration. To address the molecular mechanisms of the protection, we analyzed the transcriptome of the hypoxic retina using microarrays and real-time PCR. Results Hypoxic exposure induced a marked alteration in the retinal transcriptome with significantly different expression levels of 431 genes immediately after hypoxic exposure. The normal expression profile was restored within 16 hours of reoxygenation. Among the differentially regulated genes, several candidates for neuroprotection were identified like metallothionein-1 and -2, the HIF-1 target gene adrenomedullin and the gene encoding the antioxidative and cytoprotective enzyme paraoxonase 1 which was previously not known to be a hypoxia responsive gene in the retina. The strongly upregulated cyclin dependent kinase inhibitor p21 was excluded from being essential for neuroprotection. Conclusion Our data suggest that neuroprotection after hypoxic preconditioning is the result of the differential expression of a multitude of genes which may act in concert to protect visual cells against a toxic insult.

  18. Large Scale Association Analysis for Drug Addiction: Results from SNP to Gene

    Directory of Open Access Journals (Sweden)

    Xiaobo Guo

    2012-01-01

    Full Text Available Many genetic association studies used single nucleotide polymorphisms (SNPs data to identify genetic variants for complex diseases. Although SNP-based associations are most common in genome-wide association studies (GWAS, gene-based association analysis has received increasing attention in understanding genetic etiologies for complex diseases. While both methods have been used to analyze the same data, few genome-wide association studies compare the results or observe the connection between them. We performed a comprehensive analysis of the data from the Study of Addiction: Genetics and Environment (SAGE and compared the results from the SNP-based and gene-based analyses. Our results suggest that the gene-based method complements the individual SNP-based analysis, and conceptually they are closely related. In terms of gene findings, our results validate many genes that were either reported from the analysis of the same dataset or based on animal studies for substance dependence.

  19. Global Gene-Expression Analysis to Identify Differentially Expressed Genes Critical for the Heat Stress Response in Brassica rapa.

    Science.gov (United States)

    Dong, Xiangshu; Yi, Hankuil; Lee, Jeongyeo; Nou, Ill-Sup; Han, Ching-Tack; Hur, Yoonkang

    2015-01-01

    Genome-wide dissection of the heat stress response (HSR) is necessary to overcome problems in crop production caused by global warming. To identify HSR genes, we profiled gene expression in two Chinese cabbage inbred lines with different thermotolerances, Chiifu and Kenshin. Many genes exhibited >2-fold changes in expression upon exposure to 0.5- 4 h at 45°C (high temperature, HT): 5.2% (2,142 genes) in Chiifu and 3.7% (1,535 genes) in Kenshin. The most enriched GO (Gene Ontology) items included 'response to heat', 'response to reactive oxygen species (ROS)', 'response to temperature stimulus', 'response to abiotic stimulus', and 'MAPKKK cascade'. In both lines, the genes most highly induced by HT encoded small heat shock proteins (Hsps) and heat shock factor (Hsf)-like proteins such as HsfB2A (Bra029292), whereas high-molecular weight Hsps were constitutively expressed. Other upstream HSR components were also up-regulated: ROS-scavenging genes like glutathione peroxidase 2 (BrGPX2, Bra022853), protein kinases, and phosphatases. Among heat stress (HS) marker genes in Arabidopsis, only exportin 1A (XPO1A) (Bra008580, Bra006382) can be applied to B. rapa for basal thermotolerance (BT) and short-term acquired thermotolerance (SAT) gene. CYP707A3 (Bra025083, Bra021965), which is involved in the dehydration response in Arabidopsis, was associated with membrane leakage in both lines following HS. Although many transcription factors (TF) genes, including DREB2A (Bra005852), were involved in HS tolerance in both lines, Bra024224 (MYB41) and Bra021735 (a bZIP/AIR1 [Anthocyanin-Impaired-Response-1]) were specific to Kenshin. Several candidate TFs involved in thermotolerance were confirmed as HSR genes by real-time PCR, and these assignments were further supported by promoter analysis. Although some of our findings are similar to those obtained using other plant species, clear differences in Brassica rapa reveal a distinct HSR in this species. Our data could also provide a

  20. [Screening of Bacillus thuringiensis strains containing vip3A genes and analysis of gene conservation].

    Science.gov (United States)

    Chen, Jian-Wu; Tang, Li-Xia; Song, Shao-Yun; Yuan, Mei-Jin; Pang, Yi

    2003-09-01

    Vip3A, a novel insecticidal protein, is secreted by Bacillus thuringiensis (Bt) during vegetative growth. Vip3A protein possesses insecticidal activity against a wild spectrum of lepidopteran insect larvae. Since the first cloning of vip3A gene from Bt, many other vip3A genes have been isolated. To investigate vip3A genes contribution to Bt and reflect the revolution relationships, the strains containing vip3A genes were screened and gene similarity was analyzed. 114 wild-type Bacillus thuringiensis (Bt) strains isolated from different regions and 41 standard Bt strains from the Institute of Pasteur were screened for the vip3A genes using PCR amplification. 39 strains including B. thuringiensis subsp. kurstaki (Btk) HD-1 were found to contain the vip3A genes. Because acrystallerous strain Cry- B derived from Btk HD-1 was proved not to contain vip3A gene, it suppose that the vip3A gene may be located at the plasmids. Vip3A proteins expressed in these strains were detected with polyclonal antibody by Western blot and 4 strains among them were shown not to express the Vip3A proteins. The vip3A genes amplified from wild-type Bacillus thuringiensis strains S101 and 611 with different levels of activity against lepidopteran insect larvae were cloned into pGEM-T Easy vector. Alignment of these 2 putative Vip3A proteins with 6 others (Vip3A (a), Vip3A(b), Vip3A-S, Vip3A-S184, Vip83 and Vip3V) in the GenBank data base and 2 reported Vip3A proteins (Vip14 and Vip15) showed that vip3A genes are highly conservative. The plasmids pOTP-S101 and pOTP-611 were constructed by in- serting 2 vip3A genes (vip3A-S101 and vip3A-611) into the expression vector pQE30 respectively and were transformed into E. coli M15. E. coli M15 cells harboring the pOTP plasmids were induced with 1 mmol/L IPTG to express 89 kDa protein. Experiments showed that the level of soluble proteins of Vip3A-S101 in E. coli M15[pOTP-S101] and Vip3A-611 in E. coli M15 [pOTP-611] were about 48% and 35% respectively

  1. [Transcriptional analysis of the Grp gene, a genomic homolog of the retrotransposon gypsy gag gene, in Drosophila melanogaster].

    Science.gov (United States)

    Nefedova, L N; Kuz'min, I V; Burmistrova, D A; Rezazadekh, S; Kim, A I

    2011-08-01

    In the present work, we studied the Grp gene (CG4680, Gag related protein) expression at the transcriptional level. It was found that at the embryonic and larval stages of D. melanogaster development the Grp expression proceeds at a low level, but it significantly increases at the adult stage. Adult individuals display a tissue-specific expression: an eleveated level of transcription is observed in the gut tissues, but not in the chitin carcass, head, and gonads. Since the gut may potentially be a primary barrier for the penetration of a viral infection, we conducted a comparative analysis of Grp gene transcription in D. melanogaster strains differing in the presence of active copies of the gypsy errantivirus and in the status of the flamenco gene controlling sensitivity to errantiviral infections. No noticeable differences in the level of Grp gene transcription were revealed. Thus, the Grp gene is not a pseudogene, but it is a functional gene of the D. melanogaster genome whose role remains to be elucidated.

  2. Transcriptomic Analysis of Differentially Expressed Genes During Larval Development of Rapana venosa by Digital Gene Expression Profiling

    Directory of Open Access Journals (Sweden)

    Hao Song

    2016-07-01

    Full Text Available During the life cycle of shellfish, larval development, especially metamorphosis, has a vital influence on the dynamics, distribution, and recruitment of natural populations, as well as seed breeding. Rapana venosa, a carnivorous gastropod, is an important commercial shellfish in China, and is an ecological invader in the United States, Argentina, and France. However, information about the mechanism of its early development is still limited, because research in this area has long suffered from a lack of genomic resources. In this study, 15 digital gene expression (DGE libraries from five developmental stages of R. venosa were constructed and sequenced on the IIIumina Hi-Sequation 2500 platform. Bioinformaticsanalysis identified numerous differentially and specifically expressed genes, which revealed that genes associated with growth, nervous system, digestive system, immune system, and apoptosis participate in important developmental processes. The functional analysis of differentially expressed genes was further implemented by gene ontology, and Kyoto encyclopedia of genes and genomes enrichment. DGE profiling provided a general picture of the transcriptomic activities during the early development of R. venosa, which may provide interesting hints for further study. Our data represent the first comparative transcriptomic information available for the early development of R. venosa, which is a prerequisit