Jespersen, Jakob S.; Petersen, Bent; Seguin-Orlando, Andaine
at identifying PfEMP1 features associated with high virulence. Here we present the first effective method for sequence analysis of var genes expressed in field samples: a sequential PCR and next generation sequencing based technique applied on expressed var sequence tags and subsequently on long range PCR......, encoded by ~60 highly variable 'var' genes per haploid genome. PfEMP1 is exported to the surface of infected erythrocytes and is thought to be fundamental to immune evasion by adhesion to host and parasite factors. The highly variable nature has constituted a roadblock in var expression studies aimed...
Kielpinski, Lukasz Jan; Sidiropoulos, Nikos; Vinther, Jeppe
time also made analysis of the data challenging for scientists without formal training in computational biology. Here, we discuss different strategies for data analysis of massive parallel sequencing-based structure-probing data. To facilitate reproducible and standardized analysis of this type of data...
Marsh, Alan J; O'Sullivan, Orla; Hill, Colin; Ross, R Paul; Cotter, Paul D
Water kefir is a water-sucrose-based beverage, fermented by a symbiosis of bacteria and yeast to produce a final product that is lightly carbonated, acidic and that has a low alcohol percentage. The microorganisms present in water kefir are introduced via water kefir grains, which consist of a polysaccharide matrix in which the microorganisms are embedded. We aimed to provide a comprehensive sequencing-based analysis of the bacterial population of water kefir beverages and grains, while providing an initial insight into the corresponding fungal population. To facilitate this objective, four water kefirs were sourced from the UK, Canada and the United States. Culture-independent, high-throughput, sequencing-based analyses revealed that the bacterial fraction of each water kefir and grain was dominated by Zymomonas, an ethanol-producing bacterium, which has not previously been detected at such a scale. The other genera detected were representatives of the lactic acid bacteria and acetic acid bacteria. Our analysis of the fungal component established that it was comprised of the genera Dekkera, Hanseniaspora, Saccharomyces, Zygosaccharomyces, Torulaspora and Lachancea. This information will assist in the ultimate identification of the microorganisms responsible for the potentially health-promoting attributes of these beverages. © 2013 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
Sousa-Nunes, Rita; Rana, Amer Ahmed; Kettleborough, Ross
This article investigates the expression patterns of 160 genes that are expressed during early mouse development. The cDNAs were isolated from 7.5 d postcoitum (dpc) endoderm, a region that comprises visceral endoderm (VE), definitive endoderm, and the node-tissues that are required for the initi...
Macas, Jiří; Kejnovský, Eduard; Neumann, Pavel; Novák, Petr; Koblížková, Andrea; Vyskot, Boris
Roč. 6, č. 11 (2011), e27335 E-ISSN 1932-6203 R&D Projects: GA MŠk(CZ) OC10037; GA MŠk(CZ) LC06004; GA MŠk(CZ) LH11058; GA ČR(CZ) GAP501/10/0102; GA ČR(CZ) GAP305/10/0930 Institutional research plan: CEZ:AV0Z50510513; CEZ:AV0Z50040702 Keywords : Plant genome * Sequencing-Based Analyses * Repetitive DNA * Silene latifolia Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 4.092, year: 2011
Full Text Available This paper proposes novel framework for facial expressions analysis using dynamic and static information in video sequences. First, based on incremental formulation, discriminative deformable face alignment method is adapted to locate facial points to correct in-plane head rotation and break up facial region from background. Then, spatial-temporal motion local binary pattern (LBP feature is extracted and integrated with Gabor multiorientation fusion histogram to give descriptors, which reflect static and dynamic texture information of facial expressions. Finally, a one-versus-one strategy based multiclass support vector machine (SVM classifier is applied to classify facial expressions. Experiments on Cohn-Kanade (CK + facial expression dataset illustrate that integrated framework outperforms methods using single descriptors. Compared with other state-of-the-art methods on CK+, MMI, and Oulu-CASIA VIS datasets, our proposed framework performs better.
Background Parasitoid insects manipulate their hosts' physiology by injecting various factors into their host upon parasitization. Transcriptomic approaches provide a powerful approach to study insect host-parasitoid interactions at the molecular level. In order to investigate the effects of parasitization by an ichneumonid wasp (Diadegma semiclausum) on the host (Plutella xylostella), the larval transcriptome profile was analyzed using a short-read deep sequencing method (Illumina). Symbiotic polydnaviruses (PDVs) associated with ichneumonid parasitoids, known as ichnoviruses, play significant roles in host immune suppression and developmental regulation. In the current study, D. semiclausum ichnovirus (DsIV) genes expressed in P. xylostella were identified and their sequences compared with other reported PDVs. Five of these genes encode proteins of unknown identity, that have not previously been reported. Results De novo assembly of cDNA sequence data generated 172,660 contigs between 100 and 10000 bp in length; with 35% of > 200 bp in length. Parasitization had significant impacts on expression levels of 928 identified insect host transcripts. Gene ontology data illustrated that the majority of the differentially expressed genes are involved in binding, catalytic activity, and metabolic and cellular processes. In addition, the results show that transcription levels of antimicrobial peptides, such as gloverin, cecropin E and lysozyme, were up-regulated after parasitism. Expression of ichnovirus genes were detected in parasitized larvae with 19 unique sequences identified from five PDV gene families including vankyrin, viral innexin, repeat elements, a cysteine-rich motif, and polar residue rich protein. Vankyrin 1 and repeat element 1 genes showed the highest transcription levels among the DsIV genes. Conclusion This study provides detailed information on differential expression of P. xylostella larval genes following parasitization, DsIV genes expressed in the
Clark Virginia L
Full Text Available Abstract Background Maintenance of an anaerobic denitrification system in the obligate human pathogen, Neisseria gonorrhoeae, suggests that an anaerobic lifestyle may be important during the course of infection. Furthermore, mounting evidence suggests that reduction of host-produced nitric oxide has several immunomodulary effects on the host. However, at this point there have been no studies analyzing the complete gonococcal transcriptome response to anaerobiosis. Here we performed deep sequencing to compare the gonococcal transcriptomes of aerobically and anaerobically grown cells. Using the information derived from this sequencing, we discuss the implications of the robust transcriptional response to anaerobic growth. Results We determined that 198 chromosomal genes were differentially expressed (~10% of the genome in response to anaerobic conditions. We also observed a large induction of genes encoded within the cryptic plasmid, pJD1. Validation of RNA-seq data using translational-lacZ fusions or RT-PCR demonstrated the RNA-seq results to be very reproducible. Surprisingly, many genes of prophage origin were induced anaerobically, as well as several transcriptional regulators previously unknown to be involved in anaerobic growth. We also confirmed expression and regulation of a small RNA, likely a functional equivalent of fnrS in the Enterobacteriaceae family. We also determined that many genes found to be responsive to anaerobiosis have also been shown to be responsive to iron and/or oxidative stress. Conclusions Gonococci will be subject to many forms of environmental stress, including oxygen-limitation, during the course of infection. Here we determined that the anaerobic stimulon in gonococci was larger than previous studies would suggest. Many new targets for future research have been uncovered, and the results derived from this study may have helped to elucidate factors or mechanisms of virulence that may have otherwise been overlooked.
Singh, Angad Pal; Zafer, Samreen; Pe'er, Itsik
Human genetics recently transitioned from GWAS to studies based on NGS data. For GWAS, small effects dictated large sample sizes, typically made possible through meta-analysis by exchanging summary statistics across consortia. NGS studies groupwise-test for association of multiple potentially-causal alleles along each gene. They are subject to similar power constraints and therefore likely to resort to meta-analysis as well. The problem arises when considering privacy of the genetic information during the data-exchange process. Many scoring schemes for NGS association rely on the frequency of each variant thus requiring the exchange of identity of the sequenced variant. As such variants are often rare, potentially revealing the identity of their carriers and jeopardizing privacy. We have thus developed MetaSeq, a protocol for meta-analysis of genome-wide sequencing data by multiple collaborating parties, scoring association for rare variants pooled per gene across all parties. We tackle the challenge of tallying frequency counts of rare, sequenced alleles, for metaanalysis of sequencing data without disclosing the allele identity and counts, thereby protecting sample identity. This apparent paradoxical exchange of information is achieved through cryptographic means. The key idea is that parties encrypt identity of genes and variants. When they transfer information about frequency counts in cases and controls, the exchanged data does not convey the identity of a mutation and therefore does not expose carrier identity. The exchange relies on a 3rd party, trusted to follow the protocol although not trusted to learn about the raw data. We show applicability of this method to publicly available exome-sequencing data from multiple studies, simulating phenotypic information for powerful meta-analysis. The MetaSeq software is publicly available as open source.
Marsh, Alan J; O'Sullivan, Orla; Hill, Colin; Ross, R Paul; Cotter, Paul D
Kombucha is a sweetened tea beverage that, as a consequence of fermentation, contains ethanol, carbon dioxide, a high concentration of acid (gluconic, acetic and lactic) as well as a number of other metabolites and is thought to contain a number of health-promoting components. The sucrose-tea solution is fermented by a symbiosis of bacteria and yeast embedded within a cellulosic pellicle, which forms a floating mat in the tea, and generates a new layer with each successful fermentation. The specific identity of the microbial populations present has been the focus of attention but, to date, the majority of studies have relied on culture-based analyses. To gain a more comprehensive insight into the kombucha microbiota we have carried out the first culture-independent, high-throughput sequencing analysis of the bacterial and fungal populations of 5 distinct pellicles as well as the resultant fermented kombucha at two time points. Following the analysis it was established that the major bacterial genus present was Gluconacetobacter, present at >85% in most samples, with only trace populations of Acetobacter detected (kombucha, also being revealed. The yeast populations were found to be dominated by Zygosaccharomyces at >95% in the fermented beverage, with a greater fungal diversity present in the cellulosic pellicle, including numerous species not identified in kombucha previously. Ultimately, this study represents the most accurate description of the microbiology of kombucha to date. Copyright © 2013 Elsevier Ltd. All rights reserved.
Cavanagh, Jorunn Pauline; Klingenberg, Claus; Hanssen, Anne-Merethe; Fredheim, Elizabeth Aarag; Francois, Patrice; Schrenzel, Jacques; Flægstad, Trond; Sollid, Johanna Ericson
The notoriously multi-resistant Staphylococcus haemolyticus is an emerging pathogen causing serious infections in immunocompromised patients. Defining the population structure is important to detect outbreaks and spread of antimicrobial resistant clones. Currently, the standard typing technique is pulsed-field gel electrophoresis (PFGE). In this study we describe novel molecular typing schemes for S. haemolyticus using multi locus sequence typing (MLST) and multi locus variable number of tandem repeats (VNTR) analysis. Seven housekeeping genes (MLST) and five VNTR loci (MLVF) were selected for the novel typing schemes. A panel of 45 human and veterinary S. haemolyticus isolates was investigated. The collection had diverse PFGE patterns (38 PFGE types) and was sampled over a 20 year-period from eight countries. MLST resolved 17 sequence types (Simpsons index of diversity [SID]=0.877) and MLVF resolved 14 repeat types (SID=0.831). We found a low sequence diversity. Phylogenetic analysis clustered the isolates in three (MLST) and one (MLVF) clonal complexes, respectively. Taken together, neither the MLST nor the MLVF scheme was suitable to resolve the population structure of this S. haemolyticus collection. Future MLVF and MLST schemes will benefit from addition of more variable core genome sequences identified by comparing different fully sequenced S. haemolyticus genomes. Copyright © 2012 Elsevier B.V. All rights reserved.
Liu, Dong; Cheng, Chen; Fu, Qiang; Liu, Chunlei; Li, Mo; Faiz, Muhammad Abrar; Li, Tianxiao; Khan, Muhammad Imran; Cui, Song
In this paper, the complete ensemble empirical mode decomposition with the adaptive noise (CEEMDAN) algorithm is introduced into the complexity research of precipitation systems to improve the traditional complexity measure method specific to the mode mixing of the Empirical Mode Decomposition (EMD) and incomplete decomposition of the ensemble empirical mode decomposition (EEMD). We combined the CEEMDAN with the wavelet packet transform (WPT) and multifractal detrended fluctuation analysis (MF-DFA) to create the CEEMDAN-WPT-MFDFA, and used it to measure the complexity of the monthly precipitation sequence of 12 sub-regions in Harbin, Heilongjiang Province, China. The results show that there are significant differences in the monthly precipitation complexity of each sub-region in Harbin. The complexity of the northwest area of Harbin is the lowest and its predictability is the best. The complexity and predictability of the middle and Midwest areas of Harbin are about average. The complexity of the southeast area of Harbin is higher than that of the northwest, middle, and Midwest areas of Harbin and its predictability is worse. The complexity of Shuangcheng is the highest and its predictability is the worst of all the studied sub-regions. We used terrain and human activity as factors to analyze the causes of the complexity of the local precipitation. The results showed that the correlations between the precipitation complexity and terrain are obvious, and the correlations between the precipitation complexity and human influence factors vary. The distribution of the precipitation complexity in this area may be generated by the superposition effect of human activities and natural factors such as terrain, general atmospheric circulation, land and sea location, and ocean currents. To evaluate the stability of the algorithm, the CEEMDAN-WPT-MFDFA was compared with the equal probability coarse graining LZC algorithm, fuzzy entropy, and wavelet entropy. The results show
Full Text Available Beetles (Coleoptera are the most diverse animal group on earth and interact with numerous symbiotic or pathogenic microbes in their environments. The red flour beetle Tribolium castaneum is a genetically tractable model beetle species and its whole genome sequence has recently been determined. To advance our understanding of the molecular basis of beetle immunity here we analyzed the whole transcriptome of T. castaneum by high-throughput next generation sequencing technology. Here, we demonstrate that the Illumina/Solexa sequencing approach of cDNA samples from T. castaneum including over 9.7 million reads with 72 base pairs (bp length (approximately 700 million bp sequence information with about 30× transcriptome coverage confirms the expression of most predicted genes and enabled subsequent qualitative and quantitative transcriptome analysis. This approach recapitulates our recent quantitative real-time PCR studies of immune-challenged and naïve T. castaneum beetles, validating our approach. Furthermore, this sequencing analysis resulted in the identification of 73 differentially expressed genes upon immune-challenge with statistical significance by comparing expression data to calculated values derived by fitting to generalized linear models. We identified up regulation of diverse immune-related genes (e.g. Toll receptor, serine proteinases, DOPA decarboxylase and thaumatin and of numerous genes encoding proteins with yet unknown functions. Of note, septic-injury resulted also in the elevated expression of genes encoding heat-shock proteins or cytochrome P450s supporting the view that there is crosstalk between immune and stress responses in T. castaneum. The present study provides a first comprehensive overview of septic-injury responsive genes in T. castaneum beetles. Identified genes advance our understanding of T. castaneum specific gene expression alteration upon immune-challenge in particular and may help to understand beetle immunity
Full Text Available Although invertebrates are incapable of adaptive immunity, immunal reactions which are functionally similar to the adaptive immunity of vertebrates have been described in many studies of invertebrates including insects. The phenomenon was termed immune priming. In order to understand the molecular mechanism of immune priming, we employed Illumina/Solexa platform to investigate the transcriptional changes of the hemocytes and fat body of Helicoverpa armigera larvae immune-primed with the pathogenic bacteria Photorhabdus luminescens TT01. A total of 43.6 and 65.1 million clean reads with 4.4 and 6.5 gigabase sequence data were obtained from the TT01 (the immune-primed and PBS (non-primed cDNA libraries and assembled into 35,707 all-unigenes (non-redundant transcripts, which has a length varied from 201 to 16,947 bp and a N50 length of 1,997 bp. For 35,707 all-unigenes, 20,438 were functionally annotated and 2,494 were differentially expressed after immune priming. The differentially expressed genes (DEGs are mainly related to immunity, detoxification, development and metabolism of the host insect. Analysis on the annotated immune related DEGs supported a hypothesis that we proposed previously: the immune priming phenomenon observed in H. armigera larvae was achieved by regulation of key innate immune elements. The transcriptome profiling data sets (especially the sequences of 1,022 unannotated DEGs and the clues (such as those on immune-related signal and regulatory pathways obtained from this study will facilitate immune-related novel gene discovery and provide valuable information for further exploring the molecular mechanism of immune priming of invertebrates. All these will increase our understanding of invertebrate immunity which may provide new approaches to control insect pests or prevent epidemic of infectious diseases in economic invertebrates in the future.
Full Text Available Sunflower is an important oilseed crop, as well as a model system for evolutionary studies, but its 3.6 gigabase genome has proven difficult to assemble, in part because of the high repeat content of its genome. Here we report on the sequencing, assembly, and analyses of 96 randomly chosen BACs from sunflower to provide additional information on the repeat content of the sunflower genome, assess how repetitive elements in the sunflower genome are organized relative to genes, and compare the genomic distribution of these repeats to that found in other food crops and model species. We also examine the expression of transposable element-related transcripts in EST databases for sunflower to determine the representation of repeats in the transcriptome and to measure their transcriptional activity. Our data confirm previous reports in suggesting that the sunflower genome is >78% repetitive. Sunflower repeats share very little similarity to other plant repeats such as those of Arabidopsis, rice, maize and wheat; overall 28% of repeats are “novel” to sunflower. The repetitive sequences appear to be randomly distributed within the sequenced BACs. Assuming the 96 BACs are representative of the genome as a whole, then approximately 5.2% of the sunflower genome comprises non TE-related genic sequence, with an average gene density of 18kbp/gene. Expression levels of these transposable elements indicate tissue specificity and differential expression in vegetative and reproductive tissues, suggesting that expressed TEs might contribute to sunflower development. The assembled BACs will also be useful for assessing the quality of several different draft assemblies of the sunflower genome and for annotating the reference sequence.
Gill, Navdeep; Buti, Matteo; Kane, Nolan; Bellec, Arnaud; Helmstetter, Nicolas; Berges, Hélène; Rieseberg, Loren H.
Sunflower is an important oilseed crop, as well as a model system for evolutionary studies, but its 3.6 gigabase genome has proven difficult to assemble, in part because of the high repeat content of its genome. Here we report on the sequencing, assembly, and analyses of 96 randomly chosen BACs from sunflower to provide additional information on the repeat content of the sunflower genome, assess how repetitive elements in the sunflower genome are organized relative to genes, and compare the genomic distribution of these repeats to that found in other food crops and model species. We also examine the expression of transposable element-related transcripts in EST databases for sunflower to determine the representation of repeats in the transcriptome and to measure their transcriptional activity. Our data confirm previous reports in suggesting that the sunflower genome is >78% repetitive. Sunflower repeats share very little similarity to other plant repeats such as those of Arabidopsis, rice, maize and wheat; overall 28% of repeats are “novel” to sunflower. The repetitive sequences appear to be randomly distributed within the sequenced BACs. Assuming the 96 BACs are representative of the genome as a whole, then approximately 5.2% of the sunflower genome comprises non TE-related genic sequence, with an average gene density of 18kbp/gene. Expression levels of these transposable elements indicate tissue specificity and differential expression in vegetative and reproductive tissues, suggesting that expressed TEs might contribute to sunflower development. The assembled BACs will also be useful for assessing the quality of several different draft assemblies of the sunflower genome and for annotating the reference sequence. PMID:24833511
Full Text Available Avian pathogenic Escherichia coli (APEC leads to economic losses in poultry production and is also a threat to human health. The goal of this study was to characterize the chicken spleen transcriptome and to identify candidate genes for response and resistance to APEC infection using Solexa sequencing. We obtained 14422935, 14104324, and 14954692 Solexa read pairs for non-challenged (NC, challenged-mild pathology (MD, and challenged-severe pathology (SV, respectively. A total of 148197 contigs and 98461 unigenes were assembled, of which 134949 contigs and 91890 unigenes match the chicken genome. In total, 12272 annotated unigenes take part in biological processes (11664, cellular components (11927, and molecular functions (11963. Summing three specific contrasts, 13650 significantly differentially expressed unigenes were found in NC Vs. MD (6844, NC Vs. SV (7764, and MD Vs. SV (2320. Some unigenes (e.g. CD148, CD45 and LCK were involved in crucial pathways, such as the T cell receptor (TCR signaling pathway and microbial metabolism in diverse environments. This study facilitates understanding of the genetic architecture of the chicken spleen transcriptome, and has identified candidate genes for host response to APEC infection.
Marsh, Alan J.; O’Sullivan, Orla; Hill, Colin; Ross, R. Paul; Cotter, Paul D.
Kefir is a fermented milk-based beverage to which a number of health-promoting properties have been attributed. The microbes responsible for the fermentation of milk to produce kefir consist of a complex association of bacteria and yeasts, bound within a polysaccharide matrix, known as the kefir grain. The consistency of this microbial population, and that present in the resultant beverage, has been the subject of a number of previous, almost exclusively culture-based, studies which have indicated differences depending on geographical location and culture conditions. However, culture-based identification studies are limited by virtue of only detecting species with the ability to grow on the specific medium used and thus culture-independent, molecular-based techniques offer the potential for a more comprehensive analysis of such communities. Here we describe a detailed investigation of the microbial population, both bacterial and fungal, of kefir, using high-throughput sequencing to analyse 25 kefir milks and associated grains sourced from 8 geographically distinct regions. This is the first occasion that this technology has been employed to investigate the fungal component of these populations or to reveal the microbial composition of such an extensive number of kefir grains or milks. As a result several genera and species not previously identified in kefir were revealed. Our analysis shows that the bacterial populations in kefir are dominated by 2 phyla, the Firmicutes and the Proteobacteria. It was also established that the fungal populations of kefir were dominated by the genera Kazachstania, Kluyveromyces and Naumovozyma, but that a variable sub-dominant population also exists. PMID:23894461
Alan J Marsh
Full Text Available Kefir is a fermented milk-based beverage to which a number of health-promoting properties have been attributed. The microbes responsible for the fermentation of milk to produce kefir consist of a complex association of bacteria and yeasts, bound within a polysaccharide matrix, known as the kefir grain. The consistency of this microbial population, and that present in the resultant beverage, has been the subject of a number of previous, almost exclusively culture-based, studies which have indicated differences depending on geographical location and culture conditions. However, culture-based identification studies are limited by virtue of only detecting species with the ability to grow on the specific medium used and thus culture-independent, molecular-based techniques offer the potential for a more comprehensive analysis of such communities. Here we describe a detailed investigation of the microbial population, both bacterial and fungal, of kefir, using high-throughput sequencing to analyse 25 kefir milks and associated grains sourced from 8 geographically distinct regions. This is the first occasion that this technology has been employed to investigate the fungal component of these populations or to reveal the microbial composition of such an extensive number of kefir grains or milks. As a result several genera and species not previously identified in kefir were revealed. Our analysis shows that the bacterial populations in kefir are dominated by 2 phyla, the Firmicutes and the Proteobacteria. It was also established that the fungal populations of kefir were dominated by the genera Kazachstania, Kluyveromyces and Naumovozyma, but that a variable sub-dominant population also exists.
Marsh, Alan J; O'Sullivan, Orla; Hill, Colin; Ross, R Paul; Cotter, Paul D
Kefir is a fermented milk-based beverage to which a number of health-promoting properties have been attributed. The microbes responsible for the fermentation of milk to produce kefir consist of a complex association of bacteria and yeasts, bound within a polysaccharide matrix, known as the kefir grain. The consistency of this microbial population, and that present in the resultant beverage, has been the subject of a number of previous, almost exclusively culture-based, studies which have indicated differences depending on geographical location and culture conditions. However, culture-based identification studies are limited by virtue of only detecting species with the ability to grow on the specific medium used and thus culture-independent, molecular-based techniques offer the potential for a more comprehensive analysis of such communities. Here we describe a detailed investigation of the microbial population, both bacterial and fungal, of kefir, using high-throughput sequencing to analyse 25 kefir milks and associated grains sourced from 8 geographically distinct regions. This is the first occasion that this technology has been employed to investigate the fungal component of these populations or to reveal the microbial composition of such an extensive number of kefir grains or milks. As a result several genera and species not previously identified in kefir were revealed. Our analysis shows that the bacterial populations in kefir are dominated by 2 phyla, the Firmicutes and the Proteobacteria. It was also established that the fungal populations of kefir were dominated by the genera Kazachstania, Kluyveromyces and Naumovozyma, but that a variable sub-dominant population also exists.
Full Text Available A diverse antibody repertoire is primarily generated by the rearrangement of V, D, and J genes and subsequent somatic hypermutation (SHM. Class-switch recombination (CSR produces various isotypes and subclasses with different functional properties. Although antibody isotypes and subclasses are considered to be produced by both direct and sequential CSR, it is still not fully understood how SHMs accumulate during the process in which antibody subclasses are generated. Here, we developed a new next-generation sequencing (NGS-based antibody repertoire analysis capable of identifying all antibody isotype and subclass genes and used it to examine the peripheral blood mononuclear cells of 12 healthy individuals. Using a total of 5,480,040 sequences, we compared percentage frequency of variable (V, junctional (J sequence, and a combination of V and J, diversity, length, and amino acid compositions of CDR3, SHM, and shared clones in the IgM, IgD, IgG3, IgG1, IgG2, IgG4, IgA1, IgE, and IgA2 genes. The usage and diversity were similar among the immunoglobulin (Ig subclasses. Clonally related sequences sharing identical V, D, J, and CDR3 amino acid sequences were frequently found within multiple Ig subclasses, especially between IgG1 and IgG2 or IgA1 and IgA2. SHM occurred most frequently in IgG4, while IgG3 genes were the least mutated among all IgG subclasses. The shared clones had almost the same SHM levels among Ig subclasses, while subclass-specific clones had different levels of SHM dependent on the genomic location. Given the sequential CSR, these results suggest that CSR occurs sequentially over multiple subclasses in the order corresponding to the genomic location of IGHCs, but CSR is likely to occur more quickly than SHMs accumulate within Ig genes under physiological conditions. NGS-based antibody repertoire analysis should provide critical information on how various antibodies are generated in the immune system.
Bolívar, Julio; Hehl, Reinhard; Bülow, Lorenz
Information on the specificity of cis-sequences enables the design of functional synthetic plant promoters that are responsive to specific stresses. Potential cis-sequences may be experimentally tested, however, correlation of genomic sequence with gene expression data enables an in silico expression analysis approach to bioinformatically assess the stress specificity of candidate cis-sequences prior to experimental verification. The present chapter demonstrates an example for the in silico validation of a potential cis-regulatory sequence responsive to cold stress. The described online tool can be applied for the bioinformatic assessment of cis-sequences responsive to most abiotic and biotic stresses of plants. Furthermore, a method is presented based on a reverted in silico expression analysis approach that predicts highly specific potentially functional cis-regulatory elements for a given stress.
Antonio, May A. D.; Hillier, Sharon L.
Lactobacillus crispatus is one of the predominant hydrogen peroxide (H2O2)-producing species found in the vagina and is under development as a probiotic for the treatment of bacterial vaginosis. In this study, we assessed whether DNA fingerprinting by repetitive element sequence-based PCR (rep-PCR) can be used to distinguish the capsule strain of L. crispatus (CTV-05) from other endogenous strains as well as other species of vaginal lactobacilli. Vaginal and rectal lactobacilli were identifie...
Full Text Available Abstract Background Systematic research on fish immunogenetics is indispensable in understanding the origin and evolution of immune systems. This has long been a challenging task because of the limited number of deep sequencing technologies and genome backgrounds of non-model fish available. The newly developed Solexa/Illumina RNA-seq and Digital gene expression (DGE are high-throughput sequencing approaches and are powerful tools for genomic studies at the transcriptome level. This study reports the transcriptome profiling analysis of bacteria-challenged Lateolabrax japonicus using RNA-seq and DGE in an attempt to gain insights into the immunogenetics of marine fish. Results RNA-seq analysis generated 169,950 non-redundant consensus sequences, among which 48,987 functional transcripts with complete or various length encoding regions were identified. More than 52% of these transcripts are possibly involved in approximately 219 known metabolic or signalling pathways, while 2,673 transcripts were associated with immune-relevant genes. In addition, approximately 8% of the transcripts appeared to be fish-specific genes that have never been described before. DGE analysis revealed that the host transcriptome profile of Vibrio harveyi-challenged L. japonicus is considerably altered, as indicated by the significant up- or down-regulation of 1,224 strong infection-responsive transcripts. Results indicated an overall conservation of the components and transcriptome alterations underlying innate and adaptive immunity in fish and other vertebrate models. Analysis suggested the acquisition of numerous fish-specific immune system components during early vertebrate evolution. Conclusion This study provided a global survey of host defence gene activities against bacterial challenge in a non-model marine fish. Results can contribute to the in-depth study of candidate genes in marine fish immunity, and help improve current understanding of host
Christensen, Aske Simon; Møller, Anders; Schwartzbach, Michael Ignatieff
We perform static analysis of Java programs to answer a simple question: which values may occur as results of string expressions? The answers are summarized for each expression by a regular language that is guaranteed to contain all possible values. We present several applications of this analysis...... are automatically produced. We present extensive benchmarks demonstrating that the analysis is efficient and produces results of useful precision......., including statically checking the syntax of dynamically generated expressions, such as SQL queries. Our analysis constructs flow graphs from class files and generates a context-free grammar with a nonterminal for each string expression. The language of this grammar is then widened into a regular language...
He, Zihuai; Xu, Bin; Lee, Seunggeun; Ionita-Laza, Iuliana
Substantial progress has been made in the functional annotation of genetic variation in the human genome. Integrative analysis that incorporates such functional annotations into sequencing studies can aid the discovery of disease-associated genetic variants, especially those with unknown function and located outside protein-coding regions. Direct incorporation of one functional annotation as weight in existing dispersion and burden tests can suffer substantial loss of power when the functional annotation is not predictive of the risk status of a variant. Here, we have developed unified tests that can utilize multiple functional annotations simultaneously for integrative association analysis with efficient computational techniques. We show that the proposed tests significantly improve power when variant risk status can be predicted by functional annotations. Importantly, when functional annotations are not predictive of risk status, the proposed tests incur only minimal loss of power in relation to existing dispersion and burden tests, and under certain circumstances they can even have improved power by learning a weight that better approximates the underlying disease model in a data-adaptive manner. The tests can be constructed with summary statistics of existing dispersion and burden tests for sequencing data, therefore allowing meta-analysis of multiple studies without sharing individual-level data. We applied the proposed tests to a meta-analysis of noncoding rare variants in Metabochip data on 12,281 individuals from eight studies for lipid traits. By incorporating the Eigen functional score, we detected significant associations between noncoding rare variants in SLC22A3 and low-density lipoprotein and total cholesterol, associations that are missed by standard dispersion and burden tests. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Illumina sequencing-based analysis of a microbial community enriched under anaerobic methane oxidation condition coupled to denitrification revealed coexistence of aerobic and anaerobic methanotrophs.
Siniscalchi, Luciene Alves Batista; Leite, Laura Rabelo; Oliveira, Guilherme; Chernicharo, Carlos Augusto Lemos; de Araújo, Juliana Calabria
Methane is produced in anaerobic environments, such as reactors used to treat wastewaters, and can be consumed by methanotrophs. The composition and structure of a microbial community enriched from anaerobic sewage sludge under methane-oxidation condition coupled to denitrification were investigated. Denaturing gradient gel electrophoresis (DGGE) analysis retrieved sequences of Methylocaldum and Chloroflexi. Deep sequencing analysis revealed a complex community that changed over time and was affected by methane concentration. Methylocaldum (8.2%), Methylosinus (2.3%), Methylomonas (0.02%), Methylacidiphilales (0.45%), Nitrospirales (0.18%), and Methanosarcinales (0.3%) were detected. Despite denitrifying conditions provided, Nitrospirales and Methanosarcinales, known to perform anaerobic methane oxidation coupled to denitrification (DAMO) process, were in very low abundance. Results demonstrated that aerobic and anaerobic methanotrophs coexisted in the reactor together with heterotrophic microorganisms, suggesting that a diverse microbial community was important to sustain methanotrophic activity. The methanogenic sludge was a good inoculum to enrich methanotrophs, and cultivation conditions play a selective role in determining community composition.
Full Text Available RuBisCO is an important enzyme for plants to photosynthesize and balance carbon dioxide in the atmosphere. This study aimed to perform sequence, physicochemical, phylogenetic and 3D (three-dimensional comparative analyses of RuBisCO proteins in the Carthamus ssp. using various bioinformatics tools. The sequence lengths of the RuBisCO proteins were between 166 and 477 amino acids, with an average length of 411.8 amino acids. Their molecular weights (Mw ranged from 18711.47 to 52843.09 Da; the most acidic and basic protein sequences were detected in C. tinctorius (pI = 5.99 and in C. tenuis (pI = 6.92, respectively. The extinction coefficients of RuBisCO proteins at 280 nm ranged from 17,670 to 69,830 M-1 cm-1, the instability index (II values for RuBisCO proteins ranged from 33.31 to 39.39, while the GRAVY values of RuBisCO proteins ranged from -0.313 to -0.250. The most abundant amino acid in the RuBisCO protein was Gly (9.7%, while the least amino acid ratio was Trp (1.6 %. The putative phosphorylation sites of RuBisCO proteins were determined by NetPhos 2.0. Phylogenetic analysis revealed that RuBisCO proteins formed two main clades. A RAMPAGE analysis revealed that 96.3%-97.6% of residues were located in the favoured region of RuBisCO proteins. To predict the three dimensional (3D structure of the RuBisCO proteins PyMOL was used. The results of the current study provide insights into fundamental characteristic of RuBisCO proteins in Carthamus ssp.
Xue, Gang [Iowa State Univ., Ames, IA (United States)
The purpose of this research was to improve the fluorescence detection for the multiplexed capillary array electrophoresis, extend its use beyond the genomic analysis, and to develop an integrated micro-sample preparation system for high-throughput DNA sequencing. The authors first demonstrated multiplexed capillary zone electrophoresis (CZE) and micellar electrokinetic chromatography (MEKC) separations in a 96-capillary array system with laser-induced fluorescence detection. Migration times of four kinds of fluoresceins and six polyaromatic hydrocarbons (PAHs) are normalized to one of the capillaries using two internal standards. The relative standard deviations (RSD) after normalization are 0.6-1.4% for the fluoresceins and 0.1-1.5% for the PAHs. Quantitative calibration of the separations based on peak areas is also performed, again with substantial improvement over the raw data. This opens up the possibility of performing massively parallel separations for high-throughput chemical analysis for process monitoring, combinatorial synthesis, and clinical diagnosis. The authors further improved the fluorescence detection by step laser scanning. A computer-controlled galvanometer scanner is adapted for scanning a focused laser beam across a 96-capillary array for laser-induced fluorescence detection. The signal at a single photomultiplier tube is temporally sorted to distinguish among the capillaries. The limit of detection for fluorescein is 3 x 10-11 M (S/N = 3) for 5-mW of total laser power scanned at 4 Hz. The observed cross-talk among capillaries is 0.2%. Advantages include the efficient utilization of light due to the high duty-cycle of step scan, good detection performance due to the reduction of stray light, ruggedness due to the small mass of the galvanometer mirror, low cost due to the simplicity of components, and flexibility due to the independent paths for excitation and emission.
Pausch, Hubert; Emmerling, Reiner; Gredler-Grandl, Birgit; Fries, Ruedi; Daetwyler, Hans D; Goddard, Michael E
Genotyping and whole-genome sequencing data have been generated for hundreds of thousands of cattle. International consortia used these data to compile imputation reference panels that facilitate the imputation of sequence variant genotypes for animals that have been genotyped using dense microarrays. Association studies with imputed sequence variant genotypes allow for the characterization of quantitative trait loci (QTL) at nucleotide resolution particularly when individuals from several breeds are included in the mapping populations. We imputed genotypes for 28 million sequence variants in 17,229 cattle of the Braunvieh, Fleckvieh and Holstein breeds in order to compile large mapping populations that provide high power to identify QTL for milk production traits. Association tests between imputed sequence variant genotypes and fat and protein percentages in milk uncovered between six and thirteen QTL (P < 1e-8) per breed. Eight of the detected QTL were significant in more than one breed. We combined the results across breeds using meta-analysis and identified a total of 25 QTL including six that were not significant in the within-breed association studies. Two missense mutations in the ABCG2 (p.Y581S, rs43702337, P = 4.3e-34) and GHR (p.F279Y, rs385640152, P = 1.6e-74) genes were the top variants at QTL on chromosomes 6 and 20. Another known causal missense mutation in the DGAT1 gene (p.A232K, rs109326954, P = 8.4e-1436) was the second top variant at a QTL on chromosome 14 but its allelic substitution effects were inconsistent across breeds. It turned out that the conflicting allelic substitution effects resulted from flaws in the imputed genotypes due to the use of a multi-breed reference population for genotype imputation. Many QTL for milk production traits segregate across breeds and across-breed meta-analysis has greater power to detect such QTL than within-breed association testing. Association testing between imputed sequence variant genotypes and
Full Text Available A variant in a transcription factor gene, POU4F3, is responsible for autosomal dominant nonsyndromic hereditary hearing loss, DFNA15. To date, 14 variants, including a whole deletion of POU4F3, have been reported to cause HL in various ethnic groups. In the present study, genetic screening for POU4F3 variants was carried out for a large series of Japanese hearing loss (HL patients to clarify the prevalence and clinical characteristics of DFNA15 in the Japanese population. Massively parallel DNA sequencing of 68 target candidate genes was utilized in 2,549 unrelated Japanese HL patients (probands to identify genomic variations responsible for HL. The detailed clinical features in patients with POU4F3 variants were collected from medical charts and analyzed. Novel 12 POU4F3 likely pathogenic variants (six missense variants, three frameshift variants, and three nonsense variants were successfully identified in 15 probands (2.5% among 602 families exhibiting autosomal dominant HL, whereas no variants were detected in the other 1,947 probands with autosomal recessive or inheritance pattern unknown HL. To obtain the audiovestibular configuration of the patients harboring POU4F3 variants, we collected audiograms and vestibular symptoms of the probands and their affected family members. Audiovestibular phenotypes in a total of 24 individuals from the 15 families possessing variants were characterized by progressive HL, with a large variation in the onset age and severity with or without vestibular symptoms observed. Pure-tone audiograms indicated the most prevalent configuration as mid-frequency HL type followed by high-frequency HL type, with asymmetry observed in approximately 20% of affected individuals. Analysis of the relationship between age and pure-tone average suggested that individuals with truncating variants showed earlier onset and slower progression of HL than did those with non-truncating variants. The present study showed that variants
Þórarinsson, Elfar; Yao, Zizhen; Wiklund, Eric D.
Recent computational scans for non-coding RNAs (ncRNAs) in multiple organisms have relied on existing multiple sequence alignments. However, as sequence similarity drops, a key signal of RNA structure--frequent compensating base changes--is increasingly likely to cause sequence-based alignment me...
Tobitani, Kensuke; Kato, Kunihito; Yamamoto, Kazuhiko
In this study, we focused on the basic taste stimulation for the analysis of real facial expressions. We considered that the expressions caused by taste stimulation were unaffected by individuality or emotion, that is, such expressions were involuntary. We analyzed the movement of facial muscles by taste stimulation and compared real expressions with artificial expressions. From the result, we identified an obvious difference between real and artificial expressions. Thus, our method would be a new approach for facial expression recognition.
The purpose of Emerald Express was to bring together senior representatives from military, relief, political, and diplomatic communities to address issues that arise during Humanitarian Assistance and Peace Operations (HA/POs...
Rustici, Gabriella; Kolesnikov, Nikolay; Brandizi, Marco; Burdett, Tony; Dylag, Miroslaw; Emam, Ibrahim; Farne, Anna; Hastings, Emma; Ison, Jon; Keays, Maria; Kurbatova, Natalja; Malone, James; Mani, Roby; Mupo, Annalisa; Pedro Pereira, Rui; Pilicheva, Ekaterina; Rung, Johan; Sharma, Anjan; Tang, Y Amy; Ternent, Tobias; Tikhonov, Andrew; Welter, Danielle; Williams, Eleanor; Brazma, Alvis; Parkinson, Helen; Sarkans, Ugis
The ArrayExpress Archive of Functional Genomics Data (http://www.ebi.ac.uk/arrayexpress) is one of three international functional genomics public data repositories, alongside the Gene Expression Omnibus at NCBI and the DDBJ Omics Archive, supporting peer-reviewed publications. It accepts data generated by sequencing or array-based technologies and currently contains data from almost a million assays, from over 30 000 experiments. The proportion of sequencing-based submissions has grown significantly over the last 2 years and has reached, in 2012, 15% of all new data. All data are available from ArrayExpress in MAGE-TAB format, which allows robust linking to data analysis and visualization tools, including Bioconductor and GenomeSpace. Additionally, R objects, for microarray data, and binary alignment format files, for sequencing data, have been generated for a significant proportion of ArrayExpress data.
Akkoç, Betül; Arslan, Ahmet
Eyes play an important role in expressing emotions in nonverbal communication. In the present study, emotional expression classification was performed based on the features that were automatically extracted from the eye area. Fırst, the face area and the eye area were automatically extracted from the captured image. Afterwards, the parameters to be used for the analysis through discrete wavelet transformation were obtained from the eye area. Using these parameters, emotional expression analysis was performed through artificial intelligence techniques. As the result of the experimental studies, 6 universal emotions consisting of expressions of happiness, sadness, surprise, disgust, anger and fear were classified at a success rate of 84% using artificial neural networks.
van Ruissen, Fred; Baas, Frank
In 1995, serial analysis of gene expression (SAGE) was developed as a versatile tool for gene expression studies. SAGE technology does not require pre-existing knowledge of the genome that is being examined and therefore SAGE can be applied to many different model systems. In this chapter, the SAGE
cophaga ranges from 0.037–0.106 and 0.049–0.207 for COI and ND5 genes, respectively (tables 2 and 3). Analysis of genetic distance on the basis of sequence difference for both the mitochondrial genes shows very little genetic difference. The discrepancy in the phylogenetic trees based on individ- ual genes may be due ...
Repositioning of Memantine as a Potential Novel Therapeutic Agent against Meningitic E. coli-Induced Pathogenicities through Disease-Associated Alpha7 Cholinergic Pathway and RNA Sequencing-Based Transcriptome Analysis of Host Inflammatory Responses.
Full Text Available Neonatal sepsis and meningitis (NSM remains a leading cause worldwide of mortality and morbidity in newborn infants despite the availability of antibiotics over the last several decades. E. coli is the most common gram-negative pathogen causing NSM. Our previous studies show that α7 nicotinic receptor (α7 nAChR, an essential regulator of inflammation, plays a detrimental role in the host defense against NSM. Despite notable successes, there still exists an unmet need for new effective therapeutic approaches to treat this disease. Using the in vitro/in vivo models of the blood-brain barrier (BBB and RNA-seq, we undertook a drug repositioning study to identify unknown antimicrobial activities for known drugs. We have demonstrated for the first time that memantine (MEM, a FDA-approved drug for treatment of Alzheimer's disease, could very efficiently block E. coli-caused bacteremia and meningitis in a mouse model of NSM in a manner dependent on α7 nAChR. MEM was able to synergistically enhance the antibacterial activity of ampicillin in HBMEC infected with E. coli K1 (E44 and in neonatal mice with E44-caused bacteremia and meningitis. Differential gene expression analysis of RNA-Seq data from mouse BMEC infected with E. coli K1 showed that several E44-increased inflammatory factors, including IL33, IL18rap, MMP10 and Irs1, were significantly reduced by MEM compared to the infected cells without drug treatment. MEM could also significantly up-regulate anti-inflammatory factors, including Tnfaip3, CISH, Ptgds and Zfp36. Most interestingly, these factors may positively and negatively contribute to regulation of NF-κB, which is a hallmark feature of bacterial meningitis. Furthermore, we have demonstrated that circulating BMEC (cBMEC are the potential novel biomarkers for NSM. MEM could significantly reduce E44-increased blood level of cBMEC in mice. Taken together, our data suggest that memantine can efficiently block host inflammatory responses to
Nueda, Maria José; Carbonell, José; Medina, Ignacio; Dopazo, Joaquín; Conesa, Ana
Serial transcriptomics experiments investigate the dynamics of gene expression changes associated with a quantitative variable such as time or dosage. The statistical analysis of these data implies the study of global and gene-specific expression trends, the identification of significant serial changes, the comparison of expression profiles and the assessment of transcriptional changes in terms of cellular processes. We have created the SEA (Serial Expression Analysis) suite to provide a complete web-based resource for the analysis of serial transcriptomics data. SEA offers five different algorithms based on univariate, multivariate and functional profiling strategies framed within a user-friendly interface and a project-oriented architecture to facilitate the analysis of serial gene expression data sets from different perspectives. SEA is available at sea.bioinfo.cipf.es. PMID:20525784
Full Text Available Phalaenopsis is one of the most interesting genera of orchids due to the members are often used as parents to produce hybrids. The establishment and development of highly reliable and discriminatory methods for identifying species and cultivars has become increasingly more important to plant breeders and members of the nursery industry. The aim of this research was to develop sequence-based microsatellite (eSSR markers for the Phalaenopsis orchid designed from the sequence of GenBank NCBI. Seventeen primers were designed and thirteen primers pairs could amplify the DNA giving the expected PCR product with polymorphism. A total of 51 alleles, with an average of 3 alleles per locus and polymorphism information content (PIC values at 0.674, were detected at the 16 SSR loci. Therefore, these markers could be used for identification of the Phalaenopsis orchid used in this study. Genetic similarity and principle coordinate analysis identified five major groups of Phalaenopsis sp. the first group consisted of P. amabilis, P. fuscata, P. javanica, and P. zebrine. The second group consisted of P. amabilis, P. amboinensis, P. bellina, P. floresens, and P. mannii. The third group consisted of P. bellina, P. cornucervi, P. cornucervi, P. violaceae sumatra, P. modesta. The forth group consisted of P. cornucervi and P. lueddemanniana, and the fifth group was P. amboinensis.
Background Flax, Linum usitatissimum L., is an important crop whose seed oil and stem fiber have multiple industrial applications. Flax seeds are also well-known for their nutritional attributes, viz., omega-3 fatty acids in the oil and lignans and mucilage from the seed coat. In spite of the importance of this crop, there are few molecular resources that can be utilized toward improving seed traits. Here, we describe flax embryo and seed development and generation of comprehensive genomic resources for the flax seed. Results We describe a large-scale generation and analysis of expressed sequences in various tissues. Collectively, the 13 libraries we have used provide a broad representation of genes active in developing embryos (globular, heart, torpedo, cotyledon and mature stages) seed coats (globular and torpedo stages) and endosperm (pooled globular to torpedo stages) and genes expressed in flowers, etiolated seedlings, leaves, and stem tissue. A total of 261,272 expressed sequence tags (EST) (GenBank accessions LIBEST_026995 to LIBEST_027011) were generated. These EST libraries included transcription factor genes that are typically expressed at low levels, indicating that the depth is adequate for in silico expression analysis. Assembly of the ESTs resulted in 30,640 unigenes and 82% of these could be identified on the basis of homology to known and hypothetical genes from other plants. When compared with fully sequenced plant genomes, the flax unigenes resembled poplar and castor bean more than grape, sorghum, rice or Arabidopsis. Nearly one-fifth of these (5,152) had no homologs in sequences reported for any organism, suggesting that this category represents genes that are likely unique to flax. Digital analyses revealed gene expression dynamics for the biosynthesis of a number of important seed constituents during seed development. Conclusions We have developed a foundational database of expressed sequences and collection of plasmid clones that comprise
Full Text Available Abstract Background Flax, Linum usitatissimum L., is an important crop whose seed oil and stem fiber have multiple industrial applications. Flax seeds are also well-known for their nutritional attributes, viz., omega-3 fatty acids in the oil and lignans and mucilage from the seed coat. In spite of the importance of this crop, there are few molecular resources that can be utilized toward improving seed traits. Here, we describe flax embryo and seed development and generation of comprehensive genomic resources for the flax seed. Results We describe a large-scale generation and analysis of expressed sequences in various tissues. Collectively, the 13 libraries we have used provide a broad representation of genes active in developing embryos (globular, heart, torpedo, cotyledon and mature stages seed coats (globular and torpedo stages and endosperm (pooled globular to torpedo stages and genes expressed in flowers, etiolated seedlings, leaves, and stem tissue. A total of 261,272 expressed sequence tags (EST (GenBank accessions LIBEST_026995 to LIBEST_027011 were generated. These EST libraries included transcription factor genes that are typically expressed at low levels, indicating that the depth is adequate for in silico expression analysis. Assembly of the ESTs resulted in 30,640 unigenes and 82% of these could be identified on the basis of homology to known and hypothetical genes from other plants. When compared with fully sequenced plant genomes, the flax unigenes resembled poplar and castor bean more than grape, sorghum, rice or Arabidopsis. Nearly one-fifth of these (5,152 had no homologs in sequences reported for any organism, suggesting that this category represents genes that are likely unique to flax. Digital analyses revealed gene expression dynamics for the biosynthesis of a number of important seed constituents during seed development. Conclusions We have developed a foundational database of expressed sequences and collection of plasmid
Sara B Collins
Full Text Available Metabolic flux is frequently rerouted through cellular metabolism in response to dynamic changes in the intra- and extra-cellular environment. Capturing the mechanisms underlying these metabolic transitions in quantitative and predictive models is a prominent challenge in systems biology. Progress in this regard has been made by integrating high-throughput gene expression data into genome-scale stoichiometric models of metabolism. Here, we extend previous approaches to perform a Temporal Expression-based Analysis of Metabolism (TEAM. We apply TEAM to understanding the complex metabolic dynamics of the respiratorily versatile bacterium Shewanella oneidensis grown under aerobic, lactate-limited conditions. TEAM predicts temporal metabolic flux distributions using time-series gene expression data. Increased predictive power is achieved by supplementing these data with a large reference compendium of gene expression, which allows us to take into account the unique character of the distribution of expression of each individual gene. We further propose a straightforward method for studying the sensitivity of TEAM to changes in its fundamental free threshold parameter θ, and reveal that discrete zones of distinct metabolic behavior arise as this parameter is changed. By comparing the qualitative characteristics of these zones to additional experimental data, we are able to constrain the range of θ to a small, well-defined interval. In parallel, the sensitivity analysis reveals the inherently difficult nature of dynamic metabolic flux modeling: small errors early in the simulation propagate to relatively large changes later in the simulation. We expect that handling such "history-dependent" sensitivities will be a major challenge in the future development of dynamic metabolic-modeling techniques.
Bennett, David S; Bendersky, Margaret; Lewis, Michael
The specificity predicted by differential emotions theory (DET) for early facial expressions in response to 5 different eliciting situations was studied in a sample of 4-month-old infants (n = 150). Infants were videotaped during tickle, sour taste, jack-in-the-box, arm restraint, and masked-stranger situations and their expressions were coded second by second. Infants showed a variety of facial expressions in each situation; however, more infants exhibited positive (joy and surprise) than negative expressions (anger, disgust, fear, and sadness) across all situations except sour taste. Consistent with DET-predicted specificity, joy expressions were the most common in response to tickling, and were less common in response to other situations. Surprise expressions were the most common in response to the jack-in-the-box, as predicted, but also were the most common in response to the arm restraint and masked-stranger situations, indicating a lack of specificity. No evidence of predicted specificity was found for anger, disgust, fear, and sadness expressions. Evidence of individual differences in expressivity within situations, as well as stability in the pattern across situations, underscores the need to examine both child and contextual factors in studying emotional development. The results provide little support for the DET postulate of situational specificity and suggest that a synthesis of differential emotions and dynamic systems theories of emotional expression should be considered.
The use of gene expression profiling to predict chemical mode of action would be enhanced by better characterization of variance due to individual, environmental, and technical factors. Meta-analysis of microarray data from untreated or vehicle-treated animals within the control arm of toxicogenomics studies has yielded useful information on baseline fluctuations in gene expression. A dataset of control animal microarray expression data was assembled by a working group of the Health and Environmental Sciences Institute's Technical Committee on the Application of Genomics in Mechanism Based Risk Assessment in order to provide a public resource for assessments of variability in baseline gene expression. Data from over 500 Affymetrix microarrays from control rat liver and kidney were collected from 16 different institutions. Thirty-five biological and technical factors were obtained for each animal, describing a wide range of study characteristics, and a subset were evaluated in detail for their contribution to total variability using multivariate statistical and graphical techniques. The study factors that emerged as key sources of variability included gender, organ section, strain, and fasting state. These and other study factors were identified as key descriptors that should be included in the minimal information about a toxicogenomics study needed for interpretation of results by an independent source. Genes that are the most and least variable, gender-selectiv
In western art music, composers communicate their work to performers via a standard notation which specificies the musical pitches and relative timings of notes. This notation may also include some higher level information such as variations in the dynamics, tempo and timing. Famous performers are characterised by their expressive interpretation, the ability to convey structural and emotive information within the given framework. The majority of work on audio content analysis focusses on retrieving score-level information; this paper reports on the extraction of parameters describing the performance, a task which requires a much higher degree of accuracy. Two systems are presented: BeatRoot, an off-line beat tracking system which finds the times of musical beats and tracks changes in tempo throughout a performance, and the Performance Worm, a system which provides a real-time visualisation of the two most important expressive dimensions, tempo and dynamics. Both of these systems are being used to process data for a large-scale study of musical expression in classical and romantic piano performance, which uses artificial intelligence (machine learning) techniques to discover fundamental patterns or principles governing expressive performance.
At present, the sports dance has entered every stage of the people’s life, has become the public’s favorite sport. Sports dance has been well developed. This article mainly uses the literature material law to carry on the detailed analysis to the sports dance constitution, elaborated in detail the sports dance artistic expression. The composition of sports dance elements; sports dance is a form of dance art show; sports dance through the dance art can be divided into three aspects, namely, fo...
Will, Sebastian; Otto, Christina; Miladi, Milad; Möhl, Mathias; Backofen, Rolf
RNA-Seq experiments have revealed a multitude of novel ncRNAs. The gold standard for their analysis based on simultaneous alignment and folding suffers from extreme time complexity of [Formula: see text]. Subsequently, numerous faster 'Sankoff-style' approaches have been suggested. Commonly, the performance of such methods relies on sequence-based heuristics that restrict the search space to optimal or near-optimal sequence alignments; however, the accuracy of sequence-based methods breaks down for RNAs with sequence identities below 60%. Alignment approaches like LocARNA that do not require sequence-based heuristics, have been limited to high complexity ([Formula: see text] quartic time). Breaking this barrier, we introduce the novel Sankoff-style algorithm 'sparsified prediction and alignment of RNAs based on their structure ensembles (SPARSE)', which runs in quadratic time without sequence-based heuristics. To achieve this low complexity, on par with sequence alignment algorithms, SPARSE features strong sparsification based on structural properties of the RNA ensembles. Following PMcomp, SPARSE gains further speed-up from lightweight energy computation. Although all existing lightweight Sankoff-style methods restrict Sankoff's original model by disallowing loop deletions and insertions, SPARSE transfers the Sankoff algorithm to the lightweight energy model completely for the first time. Compared with LocARNA, SPARSE achieves similar alignment and better folding quality in significantly less time (speedup: 3.7). At similar run-time, it aligns low sequence identity instances substantially more accurate than RAF, which uses sequence-based heuristics. © The Author 2015. Published by Oxford University Press.
Will, Sebastian; Otto, Christina; Miladi, Milad; Möhl, Mathias; Backofen, Rolf
Motivation: RNA-Seq experiments have revealed a multitude of novel ncRNAs. The gold standard for their analysis based on simultaneous alignment and folding suffers from extreme time complexity of O(n6). Subsequently, numerous faster ‘Sankoff-style’ approaches have been suggested. Commonly, the performance of such methods relies on sequence-based heuristics that restrict the search space to optimal or near-optimal sequence alignments; however, the accuracy of sequence-based methods breaks down for RNAs with sequence identities below 60%. Alignment approaches like LocARNA that do not require sequence-based heuristics, have been limited to high complexity (≥ quartic time). Results: Breaking this barrier, we introduce the novel Sankoff-style algorithm ‘sparsified prediction and alignment of RNAs based on their structure ensembles (SPARSE)’, which runs in quadratic time without sequence-based heuristics. To achieve this low complexity, on par with sequence alignment algorithms, SPARSE features strong sparsification based on structural properties of the RNA ensembles. Following PMcomp, SPARSE gains further speed-up from lightweight energy computation. Although all existing lightweight Sankoff-style methods restrict Sankoff’s original model by disallowing loop deletions and insertions, SPARSE transfers the Sankoff algorithm to the lightweight energy model completely for the first time. Compared with LocARNA, SPARSE achieves similar alignment and better folding quality in significantly less time (speedup: 3.7). At similar run-time, it aligns low sequence identity instances substantially more accurate than RAF, which uses sequence-based heuristics. Availability and implementation: SPARSE is freely available at http://www.bioinf.uni-freiburg.de/Software/SPARSE. Contact: email@example.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25838465
Sousa, MA de; Boye, Kit; Lencastre, H de
Current DNA amplification-based typing methods for bacterial pathogens often lack interlaboratory reproducibility. In this international study, DNA sequence-based typing of the Staphylococcus aureus protein A gene (spa, 110 to 422 bp) showed 100% intra- and interlaboratory reproducibility without...... extensive harmonization of protocols for 30 blind-coded S. aureus DNA samples sent to 10 laboratories. Specialized software for automated sequence analysis ensured a common typing nomenclature....
Full Text Available At present, the sports dance has entered every stage of the people’s life, has become the public’s favorite sport. Sports dance has been well developed. This article mainly uses the literature material law to carry on the detailed analysis to the sports dance constitution, elaborated in detail the sports dance artistic expression. The composition of sports dance elements; sports dance is a form of dance art show; sports dance through the dance art can be divided into three aspects, namely, form, music, shape of the expressive force. In this paper, the study will be more in-depth excavation of the cultural connotation of sports dance, and promote the development of sports dance can be more comprehensive. In 20s of last century, Chinese Sports Dance Association officially joined the International Sports Dance Association, which also makes our country’s sports dance and international exchange more frequent. However, due to China’s sports dance sports dance learning time is not long, while learning is influenced by Chinese traditional culture, the sports dance movements are too conservative, there is a very large gap and international enthusiasm, bold and unrestrained, the pursuit of individual sports dance in the dance style, music and performance hand. Sports dance originated from abroad, it is produced in the daily life of people in foreign countries. China’s domestic sports dance players in learning dance at the same time, the production and the connotation of dance is not very understanding, therefore, it is difficult to better reflect the emotional expression of sports dance. Although the sports dance is a kind of similar to the competitive projects, but it is also a kind of dance culture, and to constitute a force from the dance art show a detailed study, detailed mining playing officer of sports dance performance further, reducing China’s sports dance and international sports dance gap.
Taylor, Peter N.; Porcu, Eleonora; Chew, Shelby
Normal thyroid function is essential for health, but its genetic architecture remains poorly understood. Here, for the heritable thyroid traits thyrotropin (TSH) and free thyroxine (FT4), we analyse whole-genome sequence data from the UK10K project (N = 2,287). Using additional whole-genome seque...
Porteous David J
Full Text Available Abstract Background Regions of interest identified through genetic linkage studies regularly exceed 30 centimorgans in size and can contain hundreds of genes. Traditionally this number is reduced by matching functional annotation to knowledge of the disease or phenotype in question. However, here we show that disease genes share patterns of sequence-based features that can provide a good basis for automatic prioritization of candidates by machine learning. Results We examined a variety of sequence-based features and found that for many of them there are significant differences between the sets of genes known to be involved in human hereditary disease and those not known to be involved in disease. We have created an automatic classifier called PROSPECTR based on those features using the alternating decision tree algorithm which ranks genes in the order of likelihood of involvement in disease. On average, PROSPECTR enriches lists for disease genes two-fold 77% of the time, five-fold 37% of the time and twenty-fold 11% of the time. Conclusion PROSPECTR is a simple and effective way to identify genes involved in Mendelian and oligogenic disorders. It performs markedly better than the single existing sequence-based classifier on novel data. PROSPECTR could save investigators looking at large regions of interest time and effort by prioritizing positional candidate genes for mutation detection and case-control association studies.
Hibbett, David; Abarenkov, Kessy; Kõljalg, Urmas; Öpik, Maarja; Chai, Benli; Cole, James; Wang, Qiong; Crous, Pedro; Robert, Vincent; Helgason, Thorunn; Herr, Joshua R; Kirk, Paul; Lueschow, Shiloh; O'Donnell, Kerry; Nilsson, R Henrik; Oono, Ryoko; Schoch, Conrad; Smyth, Christopher; Walker, Donald M; Porras-Alfaro, Andrea; Taylor, John W; Geiser, David M
Fungal taxonomy and ecology have been revolutionized by the application of molecular methods and both have increasing connections to genomics and functional biology. However, data streams from traditional specimen- and culture-based systematics are not yet fully integrated with those from metagenomic and metatranscriptomic studies, which limits understanding of the taxonomic diversity and metabolic properties of fungal communities. This article reviews current resources, needs, and opportunities for sequence-based classification and identification (SBCI) in fungi as well as related efforts in prokaryotes. To realize the full potential of fungal SBCI it will be necessary to make advances in multiple areas. Improvements in sequencing methods, including long-read and single-cell technologies, will empower fungal molecular ecologists to look beyond ITS and current shotgun metagenomics approaches. Data quality and accessibility will be enhanced by attention to data and metadata standards and rigorous enforcement of policies for deposition of data and workflows. Taxonomic communities will need to develop best practices for molecular characterization in their focal clades, while also contributing to globally useful datasets including ITS. Changes to nomenclatural rules are needed to enable validPUBLICation of sequence-based taxon descriptions. Finally, cultural shifts are necessary to promote adoption of SBCI and to accord professional credit to individuals who contribute to community resources.
mass spectrometry (GC-MS) were performed to determine the phytochemicals in the active fraction. Results: Five differentially expressed bacterial proteins (four from Escherichia coli and one from Staphylococcus aureus), were identified via ...
Persis, M; Chandra Sekhar Reddy, A; Rao, L M; Khedkar, G D; Ravinder, K; Nasruddin, K
Mitochondrial DNA, cytochrome oxidase-1 gene sequences were analyzed for species identification and phylogenetic relationship among the very high food value and commercially important Indian carangid fish species. Sequence analysis of COI gene very clearly indicated that all the 28 fish species fell into five distinct groups, which are genetically distant from each other and exhibited identical phylogenetic reservation. All the COI gene sequences from 28 fishes provide sufficient phylogenetic information and evolutionary relationship to distinguish the carangid species unambiguously. This study proves the utility of mtDNA COI gene sequence based approach in identifying fish species at a faster pace.
This paper reports a follow-up study of 5-, 7-, and 9-year-old subjects who had participated in an investigation of the nature of children's and adults' ability to graphically represent expressive qualities (i.e., happy, sad, angry, loud, quiet, hard). In the original study, the use of literal representation (such as a smiling face on a tree) and…
Haman, Jiří; Valenta, Zdeněk; Kalina, Jan
Roč. 1, č. 1 (2013), s. 65-65 ISSN 1805-8698. [EFMI 2013 Special Topic Conference. 17.04.2013-19.04.2013, Prague] Institutional support: RVO:67985807 Keywords : shrinkage estimation * covariance matrix * high dimensional data * gene expression Subject RIV: IN - Informatics, Computer Science
Kozlov, Konstantin; Gursky, Vitaly; Kulakovskiy, Ivan; Samsonova, Maria
The detailed analysis of transcriptional regulation is crucially important for understanding biological processes. The gap gene network in Drosophila attracts large interest among researches studying mechanisms of transcriptional regulation. It implements the most upstream regulatory layer of the segmentation gene network. The knowledge of molecular mechanisms involved in gap gene regulation is far less complete than that of genetics of the system. Mathematical modeling goes beyond insights gained by genetics and molecular approaches. It allows us to reconstruct wild-type gene expression patterns in silico, infer underlying regulatory mechanism and prove its sufficiency. We developed a new model that provides a dynamical description of gap gene regulatory systems, using detailed DNA-based information, as well as spatial transcription factor concentration data at varying time points. We showed that this model correctly reproduces gap gene expression patterns in wild type embryos and is able to predict gap expression patterns in Kr mutants and four reporter constructs. We used four-fold cross validation test and fitting to random dataset to validate the model and proof its sufficiency in data description. The identifiability analysis showed that most model parameters are well identifiable. We reconstructed the gap gene network topology and studied the impact of individual transcription factor binding sites on the model output. We measured this impact by calculating the site regulatory weight as a normalized difference between the residual sum of squares error for the set of all annotated sites and for the set with the site of interest excluded. The reconstructed topology of the gap gene network is in agreement with previous modeling results and data from literature. We showed that 1) the regulatory weights of transcription factor binding sites show very weak correlation with their PWM score; 2) sites with low regulatory weight are important for the model output; 3
Francisco Alexandre P
Full Text Available Abstract Background With the decrease of DNA sequencing costs, sequence-based typing methods are rapidly becoming the gold standard for epidemiological surveillance. These methods provide reproducible and comparable results needed for a global scale bacterial population analysis, while retaining their usefulness for local epidemiological surveys. Online databases that collect the generated allelic profiles and associated epidemiological data are available but this wealth of data remains underused and are frequently poorly annotated since no user-friendly tool exists to analyze and explore it. Results PHYLOViZ is platform independent Java software that allows the integrated analysis of sequence-based typing methods, including SNP data generated from whole genome sequence approaches, and associated epidemiological data. goeBURST and its Minimum Spanning Tree expansion are used for visualizing the possible evolutionary relationships between isolates. The results can be displayed as an annotated graph overlaying the query results of any other epidemiological data available. Conclusions PHYLOViZ is a user-friendly software that allows the combined analysis of multiple data sources for microbial epidemiological and population studies. It is freely available at http://www.phyloviz.net.
Full Text Available Identifying recent HIV-1 infections is crucial for monitoring HIV-1 incidence and optimizing public health prevention efforts. To identify recent HIV-1 infections, we evaluated and compared the performance of 4 sequence-based diversity measures including percent diversity, percent complexity, Shannon entropy and number of haplotypes targeting 13 genetic segments within the env gene of HIV-1. A total of 597 diagnostic samples obtained in 2013 and 2015 from recently and chronically HIV-1 infected individuals were selected. From the selected samples, 249 (134 from recent versus 115 from chronic infections env coding regions, including V1-C5 of gp120 and the gp41 ectodomain of HIV-1, were successfully amplified and sequenced by next generation sequencing (NGS using the Illumina MiSeq platform. The ability of the four sequence-based diversity measures to correctly identify recent HIV infections was evaluated using the frequency distribution curves, median and interquartile range and area under the curve (AUC of the receiver operating characteristic (ROC. Comparing the median and interquartile range and evaluating the frequency distribution curves associated with the 4 sequence-based diversity measures, we observed that the percent diversity, number of haplotypes and Shannon entropy demonstrated significant potential to discriminate recent from chronic infections (p<0.0001. Using the AUC of ROC analysis, only the Shannon entropy measure within three HIV-1 env segments could accurately identify recent infections at a satisfactory level. The env segments were gp120 C2_1 (AUC = 0.806, gp120 C2_3 (AUC = 0.805 and gp120 V3 (AUC = 0.812. Our results clearly indicate that the Shannon entropy measure represents a useful tool for predicting HIV-1 infection recency.
Jiang, Changbing; Bai, Lijun; Tong, Xiaoqing
China's express delivery market has become the arena in which each express enterprise struggles to chase due to the huge potential demand and high profitable prospects. So certain qualitative and quantitative forecast for the future changes of China's express delivery market will help enterprises understand various types of market conditions and social changes in demand and adjust business activities to enhance their competitiveness timely. The development of China's express delivery industry is first introduced in this chapter. Then the theoretical basis of the regression model is overviewed. We also predict the demand trends of China's express delivery market by using Pearson correlation analysis and regression analysis from qualitative and quantitative aspects, respectively. Finally, we draw some conclusions and recommendations for China's express delivery industry.
Full Text Available Now more than 12 years in orbit, Mars Express battery telemetry during some of the deepest discharge cycles has been analysed with the help of the ESTEC lithium ion cell model. The best-fitting model parameter sets were then used to predict the energy that is expected to be available before the battery voltage drops below the minimum value that can support the power bus. This allows mission planners to determine what future power profiles could be supported without risk of entering safe mode. It also gives some more insights into the ageing properties of these batteries.
Pirooznia, Mehdi; Gong, Ping; Guan, Xin; Inouye, Laura S; Yang, Kuan; Perkins, Edward J; Deng, Youping
Background Eisenia fetida, commonly known as red wiggler or compost worm, belongs to the Lumbricidae family of the Annelida phylum. Little is known about its genome sequence although it has been extensively used as a test organism in terrestrial ecotoxicology. In order to understand its gene expression response to environmental contaminants, we cloned 4032 cDNAs or expressed sequence tags (ESTs) from two E. fetida libraries enriched with genes responsive to ten ordnance related compounds using suppressive subtractive hybridization-PCR. Results A total of 3144 good quality ESTs (GenBank dbEST accession number EH669363–EH672369 and EL515444–EL515580) were obtained from the raw clone sequences after cleaning. Clustering analysis yielded 2231 unique sequences including 448 contigs (from 1361 ESTs) and 1783 singletons. Comparative genomic analysis showed that 743 or 33% of the unique sequences shared high similarity with existing genes in the GenBank nr database. Provisional function annotation assigned 830 Gene Ontology terms to 517 unique sequences based on their homology with the annotated genomes of four model organisms Drosophila melanogaster, Mus musculus, Saccharomyces cerevisiae, and Caenorhabditis elegans. Seven percent of the unique sequences were further mapped to 99 Kyoto Encyclopedia of Genes and Genomes pathways based on their matching Enzyme Commission numbers. All the information is stored and retrievable at a highly performed, web-based and user-friendly relational database called EST model database or ESTMD version 2. Conclusion The ESTMD containing the sequence and annotation information of 4032 E. fetida ESTs is publicly accessible at . PMID:18047730
Keywords. fat mass and obesity-associated gene (FTO); rabbit; mRNA expression patterns; sequence analysis; Oryctolagus cuniculus. ... In this work, the molecular characterization and expression features of rabbit (Oryctolagus cuniculus) FTO cDNA were analysed. The rabbit FTO cDNA with a size of 2158 bp was cloned, ...
Microarray analysis of the gene expression profile in triethylene glycol dimethacrylate-treated human dental pulp cells. ... Conclusions: Our results suggest that TEGDMA can change the many functions of hDPCs through large changes in gene expression levels and complex interactions with different signaling pathways.
Long SAGE analysis of genes differentially expressed in the midgut and silk gland between the sexes of the silkwormBombyx mori. Liping Gan, Ying Wang, Jian Xi, Yanshan Niu, Hongyou Qin, Yanghu Sima, Shiqing Xu ...
Full Text Available Abstract Background Temporal gene expression profiles characterize the time-dynamics of expression of specific genes and are increasingly collected in current gene expression experiments. In the analysis of experiments where gene expression is obtained over the life cycle, it is of interest to relate temporal patterns of gene expression associated with different developmental stages to each other to study patterns of long-term developmental gene regulation. We use tools from functional data analysis to study dynamic changes by relating temporal gene expression profiles of different developmental stages to each other. Results We demonstrate that functional regression methodology can pinpoint relationships that exist between temporary gene expression profiles for different life cycle phases and incorporates dimension reduction as needed for these high-dimensional data. By applying these tools, gene expression profiles for pupa and adult phases are found to be strongly related to the profiles of the same genes obtained during the embryo phase. Moreover, one can distinguish between gene groups that exhibit relationships with positive and others with negative associations between later life and embryonal expression profiles. Specifically, we find a positive relationship in expression for muscle development related genes, and a negative relationship for strictly maternal genes for Drosophila, using temporal gene expression profiles. Conclusion Our findings point to specific reactivation patterns of gene expression during the Drosophila life cycle which differ in characteristic ways between various gene groups. Functional regression emerges as a useful tool for relating gene expression patterns from different developmental stages, and avoids the problems with large numbers of parameters and multiple testing that affect alternative approaches.
Choe, Jae Young; Han, Hyung Soo; Lee, Seon Duk; Lee, Hanna; Lee, Dong Eun; Ahn, Jae Yun; Ryoo, Hyun Wook; Seo, Kang Suk; Kim, Jong Kun
TNF-α regulates immune cells and acts as an endogenous pyrogen. Reverse transcription polymerase chain reaction (RT-PCR) is one of the most commonly used methods for gene expression analysis. Among the alternatives to PCR, loop-mediated isothermal amplification (LAMP) shows good potential in terms of specificity and sensitivity. However, few studies have compared RT-PCR and LAMP for human gene expression analysis. Therefore, in the present study, we compared one-step RT-PCR, two-step RT-LAMP and one-step RT-LAMP for human gene expression analysis. We compared three gene expression analysis methods using the human TNF-α gene as a biomarker from peripheral blood cells. Total RNA from the three selected febrile patients were subjected to the three different methods of gene expression analysis. In the comparison of three gene expression analysis methods, the detection limit of both one-step RT-PCR and one-step RT-LAMP were the same, while that of two-step RT-LAMP was inferior. One-step RT-LAMP takes less time, and the experimental result is easy to determine. One-step RT-LAMP is a potentially useful and complementary tool that is fast and reasonably sensitive. In addition, one-step RT-LAMP could be useful in environments lacking specialized equipment or expertise.
Beauchamp, Nicholas J.; van Achterberg, Tanja A. E.; Engelse, Marten A.; Pannekoek, Hans; de Vries, Carlie J. M.
Migration and proliferation of vascular smooth muscle cells (SMCs) are key events in atherosclerosis. However, little is known about alterations in gene expression upon transition of the quiescent, contractile SMC to the proliferative SMC. We performed serial analysis of gene expression (SAGE) of
Full Text Available Most existing methods for sequence-based classification use exhaustive feature generation, employing, for example, all k-mer patterns. The motivation behind such (enumerative approaches is to minimize the potential for overlooking important features. However, there are shortcomings to this strategy. First, practical constraints limit the scope of exhaustive feature generation to patterns of length ≤ k, such that potentially important, longer (> k predictors are not considered. Second, features so generated exhibit strong dependencies, which can complicate understanding of derived classification rules. Third, and most importantly, numerous irrelevant features are created. These concerns can compromise prediction and interpretation. While remedies have been proposed, they tend to be problem-specific and not broadly applicable. Here, we develop a generally applicable methodology, and an attendant software pipeline, that is predicated on discriminatory motif finding. In addition to the traditional training and validation partitions, our framework entails a third level of data partitioning, a discovery partition. A discriminatory motif finder is used on sequences and associated class labels in the discovery partition to yield a (small set of features. These features are then used as inputs to a classifier in the training partition. Finally, performance assessment occurs on the validation partition. Important attributes of our approach are its modularity (any discriminatory motif finder and any classifier can be deployed and its universality (all data, including sequences that are unaligned and/or of unequal length, can be accommodated. We illustrate our approach on two nucleosome occupancy datasets and a protein solubility dataset, previously analyzed using enumerative feature generation. Our method achieves excellent performance results, with and without optimization of classifier tuning parameters. A Python pipeline implementing the approach is
Full Text Available Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3, the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA by: i introducing quality control of co-expression similarities, ii parallelizing embedded network construction, and iii developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs. We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA. MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.
Song, Won-Min; Zhang, Bin
Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.
Jun 17, 2009 ... Molecular responses and expression analysis of genes in a xerophytic desert shrub Haloxylon ammodendron .... physiological determination and cDNA-AFLP analysis, three groups of seeds were sowed in pots with sand and .... HaDR27. U. 234. PDR-like ABC transporter. AT1G59870. HaDR28. U. 135.
Nobutaka Hanagata, Taro Takemura and Takashi Minowa
Full Text Available Comprehensive gene expression analysis using DNA microarrays has become a widespread technique in molecular biological research. In the biomaterials field, it is used to evaluate the biocompatibility or cellular toxicity of metals, polymers and ceramics. Studies in this field have extracted differentially expressed genes in the context of differences in cellular responses among multiple materials. Based on these genes, the effects of materials on cells at the molecular level have been examined. Expression data ranging from several to tens of thousands of genes can be obtained from DNA microarrays. For this reason, several tens or hundreds of differentially expressed genes are often present in different materials. In this review, we outline the principles of DNA microarrays, and provide an introduction to methods of extracting information which is useful for evaluating and designing biomaterials from comprehensive gene expression data.
Hanagata, Nobutaka; Takemura, Taro; Minowa, Takashi
Comprehensive gene expression analysis using DNA microarrays has become a widespread technique in molecular biological research. In the biomaterials field, it is used to evaluate the biocompatibility or cellular toxicity of metals, polymers and ceramics. Studies in this field have extracted differentially expressed genes in the context of differences in cellular responses among multiple materials. Based on these genes, the effects of materials on cells at the molecular level have been examined. Expression data ranging from several to tens of thousands of genes can be obtained from DNA microarrays. For this reason, several tens or hundreds of differentially expressed genes are often present in different materials. In this review, we outline the principles of DNA microarrays, and provide an introduction to methods of extracting information which is useful for evaluating and designing biomaterials from comprehensive gene expression data. (topical review)
Greijer, AE; Verschuuren, EAM; Harmsen, MC; Dekkers, CAJ; Adriaanse, HMA; The, TH; Middeldorp, JM
The dynamics of active human cytomegalovirus (HCMV) infection was monitored by competitive nucleic acid sequence-based amplification (NASBA) assays for quantification of IE1 (UL123) and pp67 (UL65) mRNA expression levels In the blood of patients after lung transplantation. RNA was isolated from 339
Wang, Yunli; Pan, Youlian
Background Simple clustering methods such as hierarchical clustering and k-means are widely used for gene expression data analysis; but they are unable to deal with noise and high dimensionality associated with the microarray gene expression data. Consensus clustering appears to improve the robustness and quality of clustering results. Incorporating prior knowledge in clustering process (semi-supervised clustering) has been shown to improve the consistency between the data partitioning and do...
Chen, Shu-Chuan; Tsai, Tsung-Hsien; Chung, Cheng-Han; Li, Wen-Hsiung
The purpose of gene expression analysis is to look for the association between regulation of gene expression levels and phenotypic variations. This association based on gene expression profile has been used to determine whether the induction/repression of genes correspond to phenotypic variations including cell regulations, clinical diagnoses and drug development. Statistical analyses on microarray data have been developed to resolve gene selection issue. However, these methods do not inform us of causality between genes and phenotypes. In this paper, we propose the dynamic association rule algorithm (DAR algorithm) which helps ones to efficiently select a subset of significant genes for subsequent analysis. The DAR algorithm is based on association rules from market basket analysis in marketing. We first propose a statistical way, based on constructing a one-sided confidence interval and hypothesis testing, to determine if an association rule is meaningful. Based on the proposed statistical method, we then developed the DAR algorithm for gene expression data analysis. The method was applied to analyze four microarray datasets and one Next Generation Sequencing (NGS) dataset: the Mice Apo A1 dataset, the whole genome expression dataset of mouse embryonic stem cells, expression profiling of the bone marrow of Leukemia patients, Microarray Quality Control (MAQC) data set and the RNA-seq dataset of a mouse genomic imprinting study. A comparison of the proposed method with the t-test on the expression profiling of the bone marrow of Leukemia patients was conducted. We developed a statistical way, based on the concept of confidence interval, to determine the minimum support and minimum confidence for mining association relationships among items. With the minimum support and minimum confidence, one can find significant rules in one single step. The DAR algorithm was then developed for gene expression data analysis. Four gene expression datasets showed that the proposed
Michael G. Surette
Full Text Available The identification of transcription factor binding sites is essential to the understanding of the regulation of gene expression and the reconstruction of genetic regulatory networks. The in silico identification of cis-regulatory motifs is challenging due to sequence variability and lack of sufficient data to generate consensus motifs that are of quantitative or even qualitative predictive value. To determine functional motifs in gene expression, we propose a strategy to adopt false discovery rate (FDR and estimate motif effects to evaluate combinatorial analysis of motif candidates and temporal gene expression data. The method decreases the number of predicted motifs, which can then be confirmed by genetic analysis. To assess the method we used simulated motif/expression data to evaluate parameters. We applied this approach to experimental data for a group of iron responsive genes in Salmonella typhimurium 14028S. The method identified known and potentially new ferric-uptake regulator (Fur binding sites. In addition, we identified uncharacterized functional motif candidates that correlated with specific patterns of expression. A SAS code for the simulation and analysis gene expression data is available from the first author upon request.
Chang Jeffrey T
Full Text Available Abstract Background The biological phenotype of a cell, such as a characteristic visual image or behavior, reflects activities derived from the expression of collections of genes. As such, an ability to measure the expression of these genes provides an opportunity to develop more precise and varied sets of phenotypes. However, to use this approach requires computational methods that are difficult to implement and apply, and thus there is a critical need for intelligent software tools that can reduce the technical burden of the analysis. Tools for gene expression analyses are unusually difficult to implement in a user-friendly way because their application requires a combination of biological data curation, statistical computational methods, and database expertise. Results We have developed SIGNATURE, a web-based resource that simplifies gene expression signature analysis by providing software, data, and protocols to perform the analysis successfully. This resource uses Bayesian methods for processing gene expression data coupled with a curated database of gene expression signatures, all carried out within a GenePattern web interface for easy use and access. Conclusions SIGNATURE is available for public use at http://genepattern.genome.duke.edu/signature/.
Background Problems with social-emotional processing are known to be an important contributor to the development and maintenance of eating disorders (EDs). Diminished facial communication of emotion has been frequently reported in individuals with anorexia nervosa (AN). Less is known about facial expressivity in bulimia nervosa (BN) and in people who have recovered from AN (RecAN). This study aimed to pilot the use of computerised facial expression analysis software to investigate emotion expression across the ED spectrum and recovery in a large sample of participants. Method 297 participants with AN, BN, RecAN, and healthy controls were recruited. Participants watched film clips designed to elicit happy or sad emotions, and facial expressions were then analysed using FaceReader. Results The finding mirrored those from previous work showing that healthy control and RecAN participants expressed significantly more positive emotions during the positive clip compared to the AN group. There were no differences in emotion expression during the sad film clip. Discussion These findings support the use of computerised methods to analyse emotion expression in EDs. The findings also demonstrate that reduced positive emotion expression is likely to be associated with the acute stage of AN illness, with individuals with BN showing an intermediate profile. PMID:28575109
Leppanen, Jenni; Dapelo, Marcela Marin; Davies, Helen; Lang, Katie; Treasure, Janet; Tchanturia, Kate
Problems with social-emotional processing are known to be an important contributor to the development and maintenance of eating disorders (EDs). Diminished facial communication of emotion has been frequently reported in individuals with anorexia nervosa (AN). Less is known about facial expressivity in bulimia nervosa (BN) and in people who have recovered from AN (RecAN). This study aimed to pilot the use of computerised facial expression analysis software to investigate emotion expression across the ED spectrum and recovery in a large sample of participants. 297 participants with AN, BN, RecAN, and healthy controls were recruited. Participants watched film clips designed to elicit happy or sad emotions, and facial expressions were then analysed using FaceReader. The finding mirrored those from previous work showing that healthy control and RecAN participants expressed significantly more positive emotions during the positive clip compared to the AN group. There were no differences in emotion expression during the sad film clip. These findings support the use of computerised methods to analyse emotion expression in EDs. The findings also demonstrate that reduced positive emotion expression is likely to be associated with the acute stage of AN illness, with individuals with BN showing an intermediate profile.
Wu, Kun-Feng; Sasidharan, Lekshmi; Thor, Craig P; Chen, Sheng-Yin
Considerable research has been conducted related to motorcycle and other powered-two-wheeler (PTW) crashes; however, it always has been controversial among practitioners concerning with types of crashes should be first targeted and how to prioritize resources for the implementation of mitigating actions. Therefore, there is a need to identify types of motorcycle crashes that constitute the greatest safety risk to riders - most frequent and most severe crashes. This pilot study seeks exhibit the efficacy of a new approach for prioritizing PTW crash causation sequences as they relate to injury severity to better inform the application of mitigating countermeasures. To accomplish this, the present study constructed a crash sequence-based risk matrix to identify most frequent and most severe motorcycle crashes in an attempt to better connect causes and countermeasures of PTW crashes. Although the frequency of each crash sequence can be computed from crash data, a crash severity model is needed to compare the levels of crash severity among different crash sequences, while controlling for other factors that also have effects on crash severity such drivers' age, use of helmet, etc. The construction of risk matrix based on crash sequences involve two tasks: formulation of crash sequence and the estimation of a mixed-effects (ME) model to adjust the levels of severities for each crash sequence to account for other crash contributing factors that would have an effect on the maximum level of crash severity in a crash. Three data elements from the National Automotive Sampling System - General Estimating System (NASS-GES) data were utilized to form a crash sequence: critical event, crash types, and sequence of events. A mixed-effects model was constructed to model the severity levels for each crash sequence while accounting for the effects of those crash contributing factors on crash severity. A total of 8039 crashes involving 8208 motorcycles occurred during 2011 and 2013 were
Jiang, Jinjin; Wang, Yue; Zhu, Bao; Fang, Tingting; Fang, Yujie; Wang, Youping
Brassica includes many successfully cultivated crop species of polyploid origin, either by ancestral genome triplication or by hybridization between two diploid progenitors, displaying complex repetitive sequences and transposons. The U's triangle, which consists of three diploids and three amphidiploids, is optimal for the analysis of complicated genomes after polyploidization. Next-generation sequencing enables the transcriptome profiling of polyploids on a global scale. We examined the gene expression patterns of three diploids (Brassica rapa, B. nigra, and B. oleracea) and three amphidiploids (B. napus, B. juncea, and B. carinata) via digital gene expression analysis. In total, the libraries generated between 5.7 and 6.1 million raw reads, and the clean tags of each library were mapped to 18547-21995 genes of B. rapa genome. The unambiguous tag-mapped genes in the libraries were compared. Moreover, the majority of differentially expressed genes (DEGs) were explored among diploids as well as between diploids and amphidiploids. Gene ontological analysis was performed to functionally categorize these DEGs into different classes. The Kyoto Encyclopedia of Genes and Genomes analysis was performed to assign these DEGs into approximately 120 pathways, among which the metabolic pathway, biosynthesis of secondary metabolites, and peroxisomal pathway were enriched. The non-additive genes in Brassica amphidiploids were analyzed, and the results indicated that orthologous genes in polyploids are frequently expressed in a non-additive pattern. Methyltransferase genes showed differential expression pattern in Brassica species. Our results provided an understanding of the transcriptome complexity of natural Brassica species. The gene expression changes in diploids and allopolyploids may help elucidate the morphological and physiological differences among Brassica species.
Cotton, Laura A; Abdur Rahman, Manal; Ng, Carmond; Le, Anh Q; Milloy, M-J; Mo, Theresa; Brumme, Zabrina L
We describe a rapid, reliable and cost-effective method for intermediate-to-high-resolution sequence-based HLA class I typing using frozen plasma as a source of genomic DNA. The plasma samples investigated had a median age of 8.5 years. Total nucleic acids were isolated from matched frozen PBMC (~2.5 million) and plasma (500 μl) samples from a panel of 25 individuals using commercial silica-based kits. Extractions yielded median [IQR] nucleic acid concentrations of 85.7 [47.0-130.0]ng/μl and 2.2 [1.7-2.6]ng/μl from PBMC and plasma, respectively. Following extraction, ~1000 base pair regions spanning exons 2 and 3 of HLA-A, -B and -C were amplified independently via nested PCR using universal, locus-specific primers and sequenced directly. Chromatogram analysis was performed using commercial DNA sequence analysis software and allele interpretation was performed using a free web-based tool. HLA-A, -B and -C amplification rates were 100% and chromatograms were of uniformly high quality with clearly distinguishable mixed bases regardless of DNA source. Concordance between PBMC and plasma-derived HLA types was 100% at the allele and protein levels. At the nucleotide level, a single partially discordant base (resulting from a failure to call both peaks in a mixed base) was observed out of >46,975 bases sequenced (>99.9% concordance). This protocol has previously been used to perform HLA class I typing from a variety of genomic DNA sources including PBMC, whole blood, granulocyte pellets and serum, from specimens up to 30 years old. This method provides comparable specificity to conventional sequence-based approaches and could be applied in situations where cell samples are unavailable or DNA quantities are limiting. Copyright © 2012 Elsevier B.V. All rights reserved.
Fierro, Ana C; Vandenbussche, Filip; Engelen, Kristof; Van de Peer, Yves; Marchal, Kathleen
Since the second half of the 1990s, a large number of genome-wide analyses have been described that study gene expression at the transcript level. To this end, two major strategies have been adopted, a first one relying on hybridization techniques such as microarrays, and a second one based on sequencing techniques such as serial analysis of gene expression (SAGE), cDNA-AFLP, and analysis based on expressed sequence tags (ESTs). Despite both types of profiling experiments becoming routine techniques in many research groups, their application remains costly and laborious. As a result, the number of conditions profiled in individual studies is still relatively small and usually varies from only two to few hundreds of samples for the largest experiments. More and more, scientific journals require the deposit of these high throughput experiments in public databases upon publication. Mining the information present in these databases offers molecular biologists the possibility to view their own small-scale analysis in the light of what is already available. However, so far, the richness of the public information remains largely unexploited. Several obstacles such as the correct association between ESTs and microarray probes with the corresponding gene transcript, the incompleteness and inconsistency in the annotation of experimental conditions, and the lack of standardized experimental protocols to generate gene expression data, all impede the successful mining of these data. Here, we review the potential and difficulties of combining publicly available expression data from respectively EST analyses and microarray experiments. With examples from literature, we show how meta-analysis of expression profiling experiments can be used to study expression behavior in a single organism or between organisms, across a wide range of experimental conditions. We also provide an overview of the methods and tools that can aid molecular biologists in exploiting these public data.
Wanke, Dierk; Kilian, Joachim; Bloss, Ulrich; Mangelsen, Elke; Supper, Jochen; Harter, Klaus; Berendzen, Kenneth W.
Biologists and bioinformatic scientists cope with the analysis of transcript abundance and the extraction of meaningful information from microarray expression data. By exploiting biological information accessible in public databases, we try to extend our current knowledge over the plant model organism Arabidopsis thaliana. Here, we give two examples of increasing the quality of information gained from large scale expression experiments by the integration of microarray-unrelated biological information: First, we utilize Arabidopsis microarray data to demonstrate that expression profiles are usually conserved between orthologous genes of different organisms. In an initial step of the analysis, orthology has to be inferred unambiguously, which then allows comparison of expression profiles between orthologs. We make use of the publicly available microarray expression data of Arabidopsis and barley, Hordeum vulgare. We found a generally positive correlation in expression trajectories between true orthologs although both organisms are only distantly related in evolutionary time scale. Second, extracting clusters of co-regulated genes implies similarities in transcriptional regulation via similar cis-regulatory elements (CREs). Vice versa approaches, where co-regulated gene clusters are found by investigating on CREs were not successful in general. Nonetheless, in some cases the presence of CREs in a defined position, orientation or CRE-combinations is positively correlated with co-regulated gene clusters. Here, we make use of genes involved in the phenylpropanoid biosynthetic pathway, to give one positive example for this approach.
Jeffrey T Leek
Full Text Available It has unambiguously been shown that genetic, environmental, demographic, and technical factors may have substantial effects on gene expression levels. In addition to the measured variable(s of interest, there will tend to be sources of signal due to factors that are unknown, unmeasured, or too complicated to capture through simple models. We show that failing to incorporate these sources of heterogeneity into an analysis can have widespread and detrimental effects on the study. Not only can this reduce power or induce unwanted dependence across genes, but it can also introduce sources of spurious signal to many genes. This phenomenon is true even for well-designed, randomized studies. We introduce "surrogate variable analysis" (SVA to overcome the problems caused by heterogeneity in expression studies. SVA can be applied in conjunction with standard analysis techniques to accurately capture the relationship between expression and any modeled variables of interest. We apply SVA to disease class, time course, and genetics of gene expression studies. We show that SVA increases the biological accuracy and reproducibility of analyses in genome-wide expression studies.
Kurgan, Lukasz; Disfani, Fatemeh Miri
The last few decades observed an increasing interest in development and application of 1-dimensional (1D) descriptors of protein structure. These descriptors project 3D structural features onto 1D strings of residue-wise structural assignments. They cover a wide-range of structural aspects including conformation of the backbone, burying depth/solvent exposure and flexibility of residues, and inter-chain residue-residue contacts. We perform first-of-its-kind comprehensive comparative review of the existing 1D structural descriptors. We define, review and categorize ten structural descriptors and we also describe, summarize and contrast over eighty computational models that are used to predict these descriptors from the protein sequences. We show that the majority of the recent sequence-based predictors utilize machine learning models, with the most popular being neural networks, support vector machines, hidden Markov models, and support vector and linear regressions. These methods provide high-throughput predictions and most of them are accessible to a non-expert user via web servers and/or stand-alone software packages. We empirically evaluate several recent sequence-based predictors of secondary structure, disorder, and solvent accessibility descriptors using a benchmark set based on CASP8 targets. Our analysis shows that the secondary structure can be predicted with over 80% accuracy and segment overlap (SOV), disorder with over 0.9 AUC, 0.6 Matthews Correlation Coefficient (MCC), and 75% SOV, and relative solvent accessibility with PCC of 0.7 and MCC of 0.6 (0.86 when homology is used). We demonstrate that the secondary structure predicted from sequence without the use of homology modeling is as good as the structure extracted from the 3D folds predicted by top-performing template-based methods.
Jul 27, 2011 ... sugar-signalling pathway (Chi et al., 2010). All the earlier mentioned ... real-time qPCR analysis was the ABI PRISM7500 real-time PCR system. ... Construction of prokaryotic expression vector of Sc-GST gene. pET29a (+) ...
Expression analysis of CfCHS in different tissues and elicitor treatments showed that methyl jasmonate ... Journal of Genetics, DOI 10.1007/s12041-016-0680-8, Vol. 95, No. ... leaf of C. forskohlii. Quantitative real time RT-PCR was used ..... SGG acknowledges the financial support for this work from CSIR. 12th FYP project ...
Valstar, Michel F.; Jiang, Bihan; Mehu, Marc; Pantic, Maja; Scherer, Klaus
Automatic Facial Expression Recognition and Analysis, in particular FACS Action Unit (AU) detection and discrete emotion detection, has been an active topic in computer science for over two decades. Standardisation and comparability has come some way; for instance, there exist a number of commonly
Evolution and expression analysis of the soybean glutamate decarboxylase gene family. TAE KYUNG HYUN, SEUNG HEE EOM, XIAO HAN and JU-SUNG KIM http://www.ias.ac.in/jbiosci. J. Biosci. 39(5), December 2014, 899–907, © Indian Academy of Sciences. Supplementary material. Supplementary figure 1.
An, Li; Xie, Hongbo; Chin, Mark H; Obradovic, Zoran; Smith, Desmond J; Megalooikonomou, Vasileios
Gene expression signatures in the mammalian brain hold the key to understanding neural development and neurological disease. Researchers have previously used voxelation in combination with microarrays for acquisition of genome-wide atlases of expression patterns in the mouse brain. On the other hand, some work has been performed on studying gene functions, without taking into account the location information of a gene's expression in a mouse brain. In this paper, we present an approach for identifying the relation between gene expression maps obtained by voxelation and gene functions. To analyze the dataset, we chose typical genes as queries and aimed at discovering similar gene groups. Gene similarity was determined by using the wavelet features extracted from the left and right hemispheres averaged gene expression maps, and by the Euclidean distance between each pair of feature vectors. We also performed a multiple clustering approach on the gene expression maps, combined with hierarchical clustering. Among each group of similar genes and clusters, the gene function similarity was measured by calculating the average gene function distances in the gene ontology structure. By applying our methodology to find similar genes to certain target genes we were able to improve our understanding of gene expression patterns and gene functions. By applying the clustering analysis method, we obtained significant clusters, which have both very similar gene expression maps and very similar gene functions respectively to their corresponding gene ontologies. The cellular component ontology resulted in prominent clusters expressed in cortex and corpus callosum. The molecular function ontology gave prominent clusters in cortex, corpus callosum and hypothalamus. The biological process ontology resulted in clusters in cortex, hypothalamus and choroid plexus. Clusters from all three ontologies combined were most prominently expressed in cortex and corpus callosum. The experimental
Smith Desmond J
Full Text Available Abstract Background Gene expression signatures in the mammalian brain hold the key to understanding neural development and neurological disease. Researchers have previously used voxelation in combination with microarrays for acquisition of genome-wide atlases of expression patterns in the mouse brain. On the other hand, some work has been performed on studying gene functions, without taking into account the location information of a gene's expression in a mouse brain. In this paper, we present an approach for identifying the relation between gene expression maps obtained by voxelation and gene functions. Results To analyze the dataset, we chose typical genes as queries and aimed at discovering similar gene groups. Gene similarity was determined by using the wavelet features extracted from the left and right hemispheres averaged gene expression maps, and by the Euclidean distance between each pair of feature vectors. We also performed a multiple clustering approach on the gene expression maps, combined with hierarchical clustering. Among each group of similar genes and clusters, the gene function similarity was measured by calculating the average gene function distances in the gene ontology structure. By applying our methodology to find similar genes to certain target genes we were able to improve our understanding of gene expression patterns and gene functions. By applying the clustering analysis method, we obtained significant clusters, which have both very similar gene expression maps and very similar gene functions respectively to their corresponding gene ontologies. The cellular component ontology resulted in prominent clusters expressed in cortex and corpus callosum. The molecular function ontology gave prominent clusters in cortex, corpus callosum and hypothalamus. The biological process ontology resulted in clusters in cortex, hypothalamus and choroid plexus. Clusters from all three ontologies combined were most prominently expressed in
phylogenetic tree construction methods, has been considered as an equivalent of .... Further detailed analysis described is restricted to the first two groups only. ..... Aspartate-ammonia ligase. Plant virus ..... enzymatic activities?; Trends ...
Full Text Available Abstract Background Several mutations have been described as responsible for rifampicin resistance in Neisseria meningitidis. However, the intriguing question on why these strains are so rare remains open. The aim of this study was to investigate the protein content and to identify differential expression in specific proteins in two rifampicin resistant and one susceptible meningococci using two-dimensional electrophoresis (2-DE combined with mass spectrometry. Results In our experimental conditions, able to resolve soluble proteins with an isoelectric point between 4 and 7, twenty-three proteins have been found differentially expressed in the two resistant strains compared to the susceptible. Some of them, involved in the main metabolic pathways, showed an increased expression, mainly in the catabolism of pyruvate and in the tricarboxylic acid cycle. A decreased expression of proteins belonging to gene regulation and to those involved in the folding of polypeptides has also been observed. 2-DE analysis showed the presence of four proteins displaying a shift in their isoelectric point in both resistant strains, confirmed by the presence of amino acid changes in the sequence analysis, absent in the susceptible. Conclusions The analysis of differentially expressed proteins suggests that an intricate series of events occurs in N. meningitidis rifampicin resistant strains and the results here reported may be considered a starting point in understanding their decreased invasion capacity. In fact, they support the hypothesis that the presence of more than one protein differentially expressed, having a role in the metabolism of the meningococcus, influences its ability to infect and to spread in the population. Different reports have described and discussed how a drug resistant pathogen shows a high biological cost for survival and that may also explain why, for some pathogens, the rate of resistant organisms is relatively low considering the
Full Text Available Abstract Background MicroRNAs (miRNAs are small ~22-nt regulatory RNAs that can silence target genes, by blocking their protein production or degrading the mRNAs. Pig is an important animal in the agriculture industry because of its utility in the meat production. Besides, pig has tremendous biomedical importance as a model organism because of its closer proximity to humans than the mouse model. Several hundreds of miRNAs have been identified from mammals, humans, mice and rats, but little is known about the miRNA component in the pig genome. Here, we adopted an experimental approach to identify conserved and unique miRNAs and characterize their expression patterns in diverse tissues of pig. Results By sequencing a small RNA library generated using pooled RNA from the pig heart, liver and thymus; we identified a total of 120 conserved miRNA homologs in pig. Expression analysis of conserved miRNAs in 14 different tissue types revealed heart-specific expression of miR-499 and miR-208 and liver-specific expression of miR-122. Additionally, miR-1 and miR-133 in the heart, miR-181a and miR-142-3p in the thymus, miR-194 in the liver, and miR-143 in the stomach showed the highest levels of expression. miR-22, miR-26b, miR-29c and miR-30c showed ubiquitous expression in diverse tissues. The expression patterns of pig-specific miRNAs also varied among the tissues examined. Conclusion Identification of 120 miRNAs and determination of the spatial expression patterns of a sub-set of these in the pig is a valuable resource for molecular biologists, breeders, and biomedical investigators interested in post-transcriptional gene regulation in pig and in related mammals, including humans.
Bijnens, Luc J.M.; Lewi, Paul J.; Göhlmann, Hinrich W.; Molenberghs, Geert; Wouters, Luc
bioinformatics; biplot; correspondence factor analysis; data mining; data visualization; gene expression data; microarray data; multivariate exploratory data analysis; principal component analysis; Spectral map analysis
Full Text Available Abstract Background Systems biology aims to understand biological systems on a comprehensive scale, such that the components that make up the whole are connected to one another and work through dependent interactions. Molecular correlations and comparative studies of molecular expression are crucial to establishing interdependent connections in systems biology. The existing software packages provide limited data mining capability. The user must first generate visualization data with a preferred data mining algorithm and then upload the resulting data into the visualization package for graphic visualization of molecular relations. Results Presented is a novel interactive visual data mining application, SysNet that provides an interactive environment for the analysis of high data volume molecular expression information of most any type from biological systems. It integrates interactive graphic visualization and statistical data mining into a single package. SysNet interactively presents intermolecular correlation information with circular and heatmap layouts. It is also applicable to comparative analysis of molecular expression data, such as time course data. Conclusion The SysNet program has been utilized to analyze elemental profile changes in response to an increasing concentration of iron (Fe in growth media (an ionomics dataset. This study case demonstrates that the SysNet software is an effective platform for interactive analysis of molecular expression information in systems biology.
Midtgaard, Jan; Nielson, Flemming; Nielson, Hanne Riis
We present an iterated approach to statically analyze programs of two processes communicating by message passing. Our analysis operates over a domain of lattice-valued regular expressions, and computes increasingly better approximations of each process's communication behavior. Overall the work e...... extends traditional semantics-based program analysis techniques to automatically reason about message passing in a manner that can simultaneously analyze both values of variables as well as message order, message content, and their interdependencies.......We present an iterated approach to statically analyze programs of two processes communicating by message passing. Our analysis operates over a domain of lattice-valued regular expressions, and computes increasingly better approximations of each process's communication behavior. Overall the work...
Liu, Mengque; Fan, Xinyan; Fang, Kuangnan; Zhang, Qingzhao; Ma, Shuangge
In the analysis of gene expression data, dimension reduction techniques have been extensively adopted. The most popular one is perhaps the PCA (principal component analysis). To generate more reliable and more interpretable results, the SPCA (sparse PCA) technique has been developed. With the "small sample size, high dimensionality" characteristic of gene expression data, the analysis results generated from a single dataset are often unsatisfactory. Under contexts other than dimension reduction, integrative analysis techniques, which jointly analyze the raw data of multiple independent datasets, have been developed and shown to outperform "classic" meta-analysis and other multidatasets techniques and single-dataset analysis. In this study, we conduct integrative analysis by developing the iSPCA (integrative SPCA) method. iSPCA achieves the selection and estimation of sparse loadings using a group penalty. To take advantage of the similarity across datasets and generate more accurate results, we further impose contrasted penalties. Different penalties are proposed to accommodate different data conditions. Extensive simulations show that iSPCA outperforms the alternatives under a wide spectrum of settings. The analysis of breast cancer and pancreatic cancer data further shows iSPCA's satisfactory performance. © 2017 WILEY PERIODICALS, INC.
Zhang, Qing; Fan, Xiaodan; Wang, Yejun; Sun, Ming-An; Shao, Jianlin; Guo, Dianjing
Although high-throughput sequencing methods have been proposed to identify splicing branch points in the human genome, these methods can only detect a small fraction of the branch points subject to the sequencing depth, experimental cost and the expression level of the mRNA. An accurate computational model for branch point prediction is therefore an ongoing objective in human genome research. We here propose a novel branch point prediction algorithm that utilizes information on the branch point sequence and the polypyrimidine tract. Using experimentally validated data, we demonstrate that our proposed method outperforms existing methods. Availability and implementation: https://github.com/zhqingit/BPP. firstname.lastname@example.org. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: email@example.com
Full Text Available Abstract Background Renal cell carcinoma (RCC is the most common cancer in adult kidney. The accuracy of current diagnosis and prognosis of the disease and the effectiveness of the treatment for the disease are limited by the poor understanding of the disease at the molecular level. To better understand the genetics and biology of RCC, we profiled the expression of 7,129 genes in both clear cell RCC tissue and cell lines using oligonucleotide arrays. Methods Total RNAs isolated from renal cell tumors, adjacent normal tissue and metastatic RCC cell lines were hybridized to affymatrix HuFL oligonucleotide arrays. Genes were categorized into different functional groups based on the description of the Gene Ontology Consortium and analyzed based on the gene expression levels. Gene expression profiles of the tissue and cell line samples were visualized and classified by singular value decomposition. Reverse transcription polymerase chain reaction was performed to confirm the expression alterations of selected genes in RCC. Results Selected genes were annotated based on biological processes and clustered into functional groups. The expression levels of genes in each group were also analyzed. Seventy-four commonly differentially expressed genes with more than five-fold changes in RCC tissues were identified. The expression alterations of selected genes from these seventy-four genes were further verified using reverse transcription polymerase chain reaction (RT-PCR. Detailed comparison of gene expression patterns in RCC tissue and RCC cell lines shows significant differences between the two types of samples, but many important expression patterns were preserved. Conclusions This is one of the initial studies that examine the functional ontology of a large number of genes in RCC. Extensive annotation, clustering and analysis of a large number of genes based on the gene functional ontology revealed many interesting gene expression patterns in RCC. Most
Full Text Available Abstract Background Gecko (Gene Expression: Computation and Knowledge Organization is a complete, high-capacity centralized gene expression analysis system, developed in response to the needs of a distributed user community. Results Based on a client-server architecture, with a centralized repository of typically many tens of thousands of Affymetrix scans, Gecko includes automatic processing pipelines for uploading data from remote sites, a data base, a computational engine implementing ~ 50 different analysis tools, and a client application. Among available analysis tools are clustering methods, principal component analysis, supervised classification including feature selection and cross-validation, multi-factorial ANOVA, statistical contrast calculations, and various post-processing tools for extracting data at given error rates or significance levels. On account of its open architecture, Gecko also allows for the integration of new algorithms. The Gecko framework is very general: non-Affymetrix and non-gene expression data can be analyzed as well. A unique feature of the Gecko architecture is the concept of the Analysis Tree (actually, a directed acyclic graph, in which all successive results in ongoing analyses are saved. This approach has proven invaluable in allowing a large (~ 100 users and distributed community to share results, and to repeatedly return over a span of years to older and potentially very complex analyses of gene expression data. Conclusions The Gecko system is being made publicly available as free software http://sourceforge.net/projects/geckoe. In totality or in parts, the Gecko framework should prove useful to users and system developers with a broad range of analysis needs.
U.S. Department of Health & Human Services — Gene Expression Omnibus is a public functional genomics data repository supporting MIAME-compliant submissions of array- and sequence-based data. Tools are provided...
Li, Shu-Fen; Zhang, Guo-Jun; Zhang, Xue-Jin; Yuan, Jin-Hong; Deng, Chuan-Liang; Gao, Wu-Jun
Garden asparagus (Asparagus officinalis) is a highly valuable vegetable crop of commercial and nutritional interest. It is also commonly used to investigate the mechanisms of sex determination and differentiation in plants. However, the sex expression mechanisms in asparagus remain poorly understood. De novo transcriptome sequencing via Illumina paired-end sequencing revealed more than 26 billion bases of high-quality sequence data from male and female asparagus flower buds. A total of 72,626 unigenes with an average length of 979 bp were assembled. In comparative transcriptome analysis, 4876 differentially expressed genes (DEGs) were identified in the possible sex-determining stage of female and male/supermale flower buds. Of these DEGs, 433, including 285 male/supermale-biased and 149 female-biased genes, were annotated as flower related. Of the male/supermale-biased flower-related genes, 102 were probably involved in anther development. In addition, 43 DEGs implicated in hormone response and biosynthesis putatively associated with sex expression and reproduction were discovered. Moreover, 128 transcription factor (TF)-related genes belonging to various families were found to be differentially expressed, and this finding implied the essential roles of TF in sex determination or differentiation in asparagus. Correlation analysis indicated that miRNA-DEG pairs were also implicated in asparagus sexual development. Our study identified a large number of DEGs involved in the sex expression and reproduction of asparagus, including known genes participating in plant reproduction, plant hormone signaling, TF encoding, and genes with unclear functions. We also found that miRNAs might be involved in the sex differentiation process. Our study could provide a valuable basis for further investigations on the regulatory networks of sex determination and differentiation in asparagus and facilitate further genetic and genomic studies on this dioecious species.
Juncker, Agnieszka; Jensen, Lars J.; Pierleoni, Andrea
A recent trend in computational methods for annotation of protein function is that many prediction tools are combined in complex workflows and pipelines to facilitate the analysis of feature combinations, for example, the entire repertoire of kinase-binding motifs in the human proteome....
Inoue, Tohru; Hirabayashi, Yoko
Authors explain that the radiation effect on biological system is stochastic along the law of physics, differing from chemical effect, using instances of Cs-137 gamma-ray (GR) and benzene (BZ) exposures to mice and of resultant comprehensive analyses of gene expression. Single GR irradiation is done with Gamma Cell 40 (CSR) to C57BL/6 or C3H/He mouse at 0, 0.6 and 3 Gy. BE is given orally at 150 mg/kg/day for 5 days x 2 weeks. Bone marrow cells are sampled 1 month after the exposure. Comprehensive gene expression is analyzed by Gene Chip Mouse Genome 430 2.0 Array (Affymetrix) and data are processed by programs like case normalization, statistics, network generation, functional analysis etc. GR irradiation brings about changes of gene expression, which are classifiable in common genes variable commonly on the dose change and stochastic genes variable stochastically within each dose: e.g., with Welch-t-test, significant differences are between 0/3 Gy (dose-specific difference, 455 pbs (probe set), in stochastic 2113 pbs), 0/0.6 Gy (267 in 1284 pbs) and 0.6/3 Gy (532 pbs); and with one-way analysis of variation (ANOVA) and hierarchial/dendrographic analyses, 520 pbs are shown to involve the dose-dependent 226 and dose-specific 294 pbs. It is also shown that at 3 Gy, expression of common genes are rather suppressed, including those related to the proliferation/apoptosis of B/T cells, and of stochastic genes, related to cell division/signaling. Ven diagram of the common genes of above 520 pbs, stochastic 2113 pbs at 3 Gy and 1284 pbs at 0.6 Gy shows the overlapping genes 29, 2 and 4, respectively, indicating only 35 pbs are overlapping in total. Network analysis of changes by GR shows the rather high expression of genes around hub of cAMP response element binding protein (CREB) at 0.6 Gy, and rather variable expression around CREB hub/suppressed expression of kinesin hub at 3 Gy; in the network by BZ exposure, unchanged or low expression around p53 hub and suppression
Full Text Available Abstract Background The degree of conservation of gene expression between homologous organs largely remains an open question. Several recent studies reported some evidence in favor of such conservation. Most studies compute organs' similarity across all orthologous genes, whereas the expression level of many genes are not informative about organ specificity. Results Here, we use a modularization algorithm to overcome this limitation through the identification of inter-species co-modules of organs and genes. We identify such co-modules using mouse and human microarray expression data. They are functionally coherent both in terms of genes and of organs from both organisms. We show that a large proportion of genes belonging to the same co-module are orthologous between mouse and human. Moreover, their zebrafish orthologs also tend to be expressed in the corresponding homologous organs. Notable exceptions to the general pattern of conservation are the testis and the olfactory bulb. Interestingly, some co-modules consist of single organs, while others combine several functionally related organs. For instance, amygdala, cerebral cortex, hypothalamus and spinal cord form a clearly discernible unit of expression, both in mouse and human. Conclusions Our study provides a new framework for comparative analysis which will be applicable also to other sets of large-scale phenotypic data collected across different species.
Piasecka, Barbara; Kutalik, Zoltán; Roux, Julien; Bergmann, Sven; Robinson-Rechavi, Marc
The degree of conservation of gene expression between homologous organs largely remains an open question. Several recent studies reported some evidence in favor of such conservation. Most studies compute organs' similarity across all orthologous genes, whereas the expression level of many genes are not informative about organ specificity. Here, we use a modularization algorithm to overcome this limitation through the identification of inter-species co-modules of organs and genes. We identify such co-modules using mouse and human microarray expression data. They are functionally coherent both in terms of genes and of organs from both organisms. We show that a large proportion of genes belonging to the same co-module are orthologous between mouse and human. Moreover, their zebrafish orthologs also tend to be expressed in the corresponding homologous organs. Notable exceptions to the general pattern of conservation are the testis and the olfactory bulb. Interestingly, some co-modules consist of single organs, while others combine several functionally related organs. For instance, amygdala, cerebral cortex, hypothalamus and spinal cord form a clearly discernible unit of expression, both in mouse and human. Our study provides a new framework for comparative analysis which will be applicable also to other sets of large-scale phenotypic data collected across different species.
Chu, Y X; Chen, H R; Wu, A Z; Cai, R; Pan, J S
Dihydroflavonol 4-reductase (DFR) genes from Rosa chinensis (Asn type) and Calibrachoa hybrida (Asp type), driven by a CaMV 35S promoter, were integrated into the petunia (Petunia hybrida) cultivar 9702. Exogenous DFR gene expression characteristics were similar to flower-color changes, and effects on anthocyanin concentration were observed in both types of DFR gene transformants. Expression analysis showed that exogenous DFR genes were expressed in all of the tissues, but the expression levels were significantly different. However, both of them exhibited a high expression level in petals that were starting to open. The introgression of DFR genes may significantly change DFR enzyme activity. Anthocyanin ultra-performance liquid chromatography results showed that anthocyanin concentrations changed according to DFR enzyme activity. Therefore, the change in flower color was probably the result of a DFR enzyme change. Pelargonidin 3-O-glucoside was found in two different transgenic petunias, indicating that both CaDFR and RoDFR could catalyze dihydrokaempferol. Our results also suggest that transgenic petunias with DFR gene of Asp type could biosynthesize pelargonidin 3-O-glucoside.
Full Text Available The objective of this study was to determine the molecular characteristics of the horse vascular endothelial growth factor alpha gene (VEGFα by constructing a phylogenetic tree, and to investigate gene expression profiles in tissues and blood leukocytes after exercise for development of suitable biomarkers. Using published amino acid sequences of other vertebrate species (human, chimpanzee, mouse, rat, cow, pig, chicken and dog, we constructed a phylogenetic tree which showed that equine VEGFα belonged to the same clade of the pig VEGFα. Analysis for synonymous (Ks and non-synonymous substitution ratios (Ka revealed that the horse VEGFα underwent positive selection. RNA was extracted from blood samples before and after exercise and different tissue samples of three horses. Expression analyses using reverse transcription-polymerase chain reaction (RT-PCR and quantitative-polymerase chain reaction (qPCR showed ubiquitous expression of VEGFα mRNA in skeletal muscle, kidney, thyroid, lung, appendix, colon, spinal cord, and heart tissues. Analysis of differential expression of VEGFα gene in blood leukocytes after exercise indicated a unimodal pattern. These results will be useful in developing biomarkers that can predict the recovery capacity of racing horses.
Cimica, Velasco; Batusic, Danko; Haralanova-Ilieva, Borislava; Chen, Yonglong; Hollemann, Thomas; Pieler, Tomas; Ramadori, Giuliano
We have applied serial analysis of gene expression for studying the molecular mechanism of the rat liver regeneration in the model of 70% partial hepatectomy. We generated three SAGE libraries from a normal control liver (NL library: 52,343 tags), from a sham control operated liver (Sham library: 51,028 tags), and from a regenerating liver (PH library: 53,061 tags). By SAGE bioinformatics analysis we identified 40 induced genes and 20 repressed genes during the liver regeneration. We verified temporal expression of such genes by real time PCR during the regeneration process and we characterized 13 induced genes and 3 repressed genes. We found connective tissue growth factor transcript and protein induced very early at 4 h after PH operation before hepatocytes proliferation is triggered. Our study suggests CTGF as a growth factor signaling mediator that could be involved directly in the mechanism of liver regeneration induction
Full Text Available Abstract Background Endothelial differentiation occurs during normal vascular development in the developing embryo. This process is recapitulated in the adult when endothelial progenitor cells are generated in the bone marrow and can contribute to vascular repair or angiogenesis at sites of vascular injury or ischemia. The molecular mechanisms of endothelial differentiation remain incompletely understood. Novel approaches are needed to identify the factors that regulate endothelial differentiation. Methods Mouse embryonic stem (ES cells were used to further define the molecular mechanisms of endothelial differentiation. By flow cytometry a population of VEGF-R2 positive cells was identified as early as 2.5 days after differentiation of ES cells, and a subset of VEGF-R2+ cells, that were CD41 positive at 3.5 days. A separate population of VEGF-R2+ stem cells expressing the endothelial-specific marker CD144 (VE-cadherin was also identified at this same time point. Channels lined by VE-cadherin positive cells developed within the embryoid bodies (EBs formed by differentiating ES cells. VE-cadherin and CD41 expressing cells differentiate in close proximity to each other within the EBs, supporting the concept of a common origin for cells of hematopoietic and endothelial lineages. Results Microarray analysis of >45,000 transcripts was performed on RNA obtained from cells expressing VEGF-R2+, CD41+, and CD144+ and VEGF-R2-, CD41-, and CD144-. All microarray experiments were performed in duplicate using RNA obtained from independent experiments, for each subset of cells. Expression profiling confirmed the role of several genes involved in hematopoiesis, and identified several putative genes involved in endothelial differentiation. Conclusion The isolation of CD144+ cells during ES cell differentiation from embryoid bodies provides an excellent model system and method for identifying genes that are expressed during endothelial differentiation and that
Wang, Bing; Zhang, Jun; Chen, Peng; Ji, Zhiwei; Deng, Shuping; Li, Chi
Background: Ion mobility-mass spectrometry (IMMS), an analytical technique which combines the features of ion mobility spectrometry (IMS) and mass spectrometry (MS), can rapidly separates ions on a millisecond time-scale. IMMS becomes a powerful tool to analyzing complex mixtures, especially for the analysis of peptides in proteomics. The high-throughput nature of this technique provides a challenge for the identification of peptides in complex biological samples. As an important parameter, peptide drift time can be used for enhancing downstream data analysis in IMMS-based proteomics.Results: In this paper, a model is presented based on least square support vectors regression (LS-SVR) method to predict peptide ion drift time in IMMS from the sequence-based features of peptide. Four descriptors were extracted from peptide sequence to represent peptide ions by a 34-component vector. The parameters of LS-SVR were selected by a grid searching strategy, and a 10-fold cross-validation approach was employed for the model training and testing. Our proposed method was tested on three datasets with different charge states. The high prediction performance achieve demonstrate the effectiveness and efficiency of the prediction model.Conclusions: Our proposed LS-SVR model can predict peptide drift time from sequence information in relative high prediction accuracy by a test on a dataset of 595 peptides. This work can enhance the confidence of protein identification by combining with current protein searching techniques. 2013 Wang et al.; licensee BioMed Central Ltd.
Background: Ion mobility-mass spectrometry (IMMS), an analytical technique which combines the features of ion mobility spectrometry (IMS) and mass spectrometry (MS), can rapidly separates ions on a millisecond time-scale. IMMS becomes a powerful tool to analyzing complex mixtures, especially for the analysis of peptides in proteomics. The high-throughput nature of this technique provides a challenge for the identification of peptides in complex biological samples. As an important parameter, peptide drift time can be used for enhancing downstream data analysis in IMMS-based proteomics.Results: In this paper, a model is presented based on least square support vectors regression (LS-SVR) method to predict peptide ion drift time in IMMS from the sequence-based features of peptide. Four descriptors were extracted from peptide sequence to represent peptide ions by a 34-component vector. The parameters of LS-SVR were selected by a grid searching strategy, and a 10-fold cross-validation approach was employed for the model training and testing. Our proposed method was tested on three datasets with different charge states. The high prediction performance achieve demonstrate the effectiveness and efficiency of the prediction model.Conclusions: Our proposed LS-SVR model can predict peptide drift time from sequence information in relative high prediction accuracy by a test on a dataset of 595 peptides. This work can enhance the confidence of protein identification by combining with current protein searching techniques. 2013 Wang et al.; licensee BioMed Central Ltd.
Rimas J. Orentas
Full Text Available Adoptive immunotherapy with antibody-based therapy or with T cells transduced to express chimeric antigen receptors (CARs is useful to the extent that the cell surface membrane protein being targeted is not expressed on normal tissues. The most successful CAR-based (anti-CD19 or antibody-based therapy (anti-CD20 in hematologic malignancies has the side effect of eliminating the normal B cell compartment. Targeting solid tumors may not provide a similar expendable marker. Beyond antibody to Her2/NEU and EGFR, very few antibody-based and no CAR-based therapies have seen broad clinical application for solid tumors. To expand the way in which the surfaceome of solid tumors can be analyzed, we created an algorithm that defines the pairwise relative overexpression of surface antigens. This enables the development of specific immunotherapies that require the expression of two discrete antigens on the surface of the tumor target. This dyad analysis was facilitated by employing the Hotelling’s T-squared test (Hotelling–Lawley multivariate analysis of variance for two independent variables in comparison to a third constant entity (i.e., gene expression levels in normal tissues. We also present a unique consensus scoring mechanism for identifying transcripts that encode cell surface proteins. The unique application of our bioinformatics processing pipeline and statistical tools allowed us to compare the expression of two membrane protein targets as a pair, and to propose a new strategy based on implementing immunotherapies that require both antigens to be expressed on the tumor cell surface to trigger therapeutic effector mechanisms. Specifically, we found that, for MYCN amplified neuroblastoma, pairwise expression of ACVR2B or anaplastic lymphoma kinase (ALK with GFRA3, GFRA2, Cadherin 24, or with one another provided the strongest hits. For MYCN, non-amplified stage 4 neuroblastoma, neurotrophic tyrosine kinase 1, or ALK paired with GFRA2, GFRA3, SSK
Kiiveri, Harri T
Typical analysis of microarray data ignores the correlation between gene expression values. In this paper we present a model for microarray data which specifically allows for correlation between genes. As a result we combine gene network ideas with linear models and differential expression. We use sparse inverse covariance matrices and their associated graphical representation to capture the notion of gene networks. An important issue in using these models is the identification of the pattern of zeroes in the inverse covariance matrix. The limitations of existing methods for doing this are discussed and we provide a workable solution for determining the zero pattern. We then consider a method for estimating the parameters in the inverse covariance matrix which is suitable for very high dimensional matrices. We also show how to construct multivariate tests of hypotheses. These overall multivariate tests can be broken down into two components, the first one being similar to tests for differential expression and the second involving the connections between genes. The methods in this paper enable the extraction of a wealth of information concerning the relationships between genes which can be conveniently represented in graphical form. Differentially expressed genes can be placed in the context of the gene network and places in the gene network where unusual or interesting patterns have emerged can be identified, leading to the formulation of hypotheses for future experimentation.
Kiiveri Harri T
Full Text Available Abstract Background Typical analysis of microarray data ignores the correlation between gene expression values. In this paper we present a model for microarray data which specifically allows for correlation between genes. As a result we combine gene network ideas with linear models and differential expression. Results We use sparse inverse covariance matrices and their associated graphical representation to capture the notion of gene networks. An important issue in using these models is the identification of the pattern of zeroes in the inverse covariance matrix. The limitations of existing methods for doing this are discussed and we provide a workable solution for determining the zero pattern. We then consider a method for estimating the parameters in the inverse covariance matrix which is suitable for very high dimensional matrices. We also show how to construct multivariate tests of hypotheses. These overall multivariate tests can be broken down into two components, the first one being similar to tests for differential expression and the second involving the connections between genes. Conclusion The methods in this paper enable the extraction of a wealth of information concerning the relationships between genes which can be conveniently represented in graphical form. Differentially expressed genes can be placed in the context of the gene network and places in the gene network where unusual or interesting patterns have emerged can be identified, leading to the formulation of hypotheses for future experimentation.
Regiane F. Travensolo
Full Text Available Xylella fastidiosa genome sequencing has generated valuable data by identifying genes acting either on metabolic pathways or in associated pathogenicity and virulence. Based on available information on these genes, new strategies for studying their expression patterns, such as microarray technology, were employed. A total of 2,600 primer pairs were synthesized and then used to generate fragments using the PCR technique. The arrays were hybridized against cDNAs labeled during reverse transcription reactions and which were obtained from bacteria grown under two different conditions (liquid XDM2 and liquid BCYE. All data were statistically analyzed to verify which genes were differentially expressed. In addition to exploring conditions for X. fastidiosa genome-wide transcriptome analysis, the present work observed the differential expression of several classes of genes (energy, protein, amino acid and nucleotide metabolism, transport, degradation of substances, toxins and hypothetical proteins, among others. The understanding of expressed genes in these two different media will be useful in comprehending the metabolic characteristics of X. fastidiosa, and in evaluating how important certain genes are for the functioning and survival of these bacteria in plants.
Full Text Available Kashin-Beck Disease (KBD is an endemic osteochondropathy with an unknown pathogenesis. Diagnosis of KBD is effective only in advanced cases, which eliminates the possibility of early treatment and leads to an inevitable exacerbation of symptoms. Therefore, we aim to identify an accurate blood-based gene signature for the detection of KBD. Previously published gene expression profile data on cartilage and peripheral blood mononuclear cells (PBMCs from adults with KBD were compared to select potential target genes. Microarray analysis was conducted to evaluate the expression of the target genes in a cohort of 100 KBD patients and 100 healthy controls. A gene expression signature was identified using a training set, which was subsequently validated using an independent test set with a minimum redundancy maximum relevance (mRMR algorithm and support vector machine (SVM algorithm. Fifty unique genes were differentially expressed between KBD patients and healthy controls. A 20-gene signature was identified that distinguished between KBD patients and controls with 90% accuracy, 85% sensitivity, and 95% specificity. This study identified a 20-gene signature that accurately distinguishes between patients with KBD and controls using peripheral blood samples. These results promote the further development of blood-based genetic biomarkers for detection of KBD.
Full Text Available Serial analysis of gene expression (SAGE is a powerful tool, which provides quantitative and comprehensive expression profile of genes in a given cell population. It works by isolating short fragments of genetic information from the expressed genes that are present in the cell being studied. These short sequences, called SAGE tags, are linked together for efficient sequencing. The frequency of each SAGE tag in the cloned multimers directly reflects the transcript abundance. Therefore, SAGE results in an accurate picture of gene expression at both the qualitative and the quantitative levels. It does not require a hybridization probe for each transcript and allows new genes to be discovered. This technique has been applied widely in human studies and various SAGE tags/SAGE libraries have been generated from different cells/tissues such as dendritic cells, lung fibroblast cells, oocytes, thyroid tissue, B-cell lymphoma, cultured keratinocytes, muscles, brain tissues, sciatic nerve, cultured Schwann cells, cord blood-derived mast cells, retina, macula, retinal pigment epithelial cells, skin cells, and so forth. In this review we present the updated information on the applications of SAGE technology mainly to human studies.
Abdulkarim Yasin Karim
Full Text Available Purpose: Gastric cancer has high incidence and mortality rate in several countries and is still one of the most frequent and lethal disease. In this study, we aimed to determine diagnostic markers in gastric cancer by molecular techniques; include mRNA expression analysis of FABP4 gene. Fatty acid binding protein 4 (FABP4 gene encodes the fatty acid binding protein found in adipocytes. The protein encoded by FABP4 are a family of small, highly conserved, cytoplasmic proteins that bind long-chain fatty acids and other hydrophobic ligands. It is thought that FABPs roles include fatty acid uptake, transport, and metabolism. Material and Methods: Total RNA were extracted from paired tumor and normal tissues of 47 gastric cancer. The mRNA expression level of FABP4 was measured employing semi- quantitative reverse transcription- polymerase chain reaction (RT- PCR. Results: The mRNA expression level of FABP4 was significantly decreased (down- regulated. Conclusion: Down-regulation of FABP4 gene seems to occur at the initial steps of gastric cancer development. In order to confirm the relationship between the gastric tumor and FABP4 gene, further analysis like immunohistochemistry and epigenetc techniques are necessary. [Cukurova Med J 2016; 41(2.000: 248-252
Huang, Jianhua; Miao, Xuexia; Jin, Weirong; Couble, Pierre; Mita, Kasuei; Zhang, Yong; Liu, Wenbin; Zhuang, Leijun; Shen, Yan; Keime, Celine; Gandrillon, Olivier; Brouilly, Patrick; Briolay, Jerome; Zhao, Guoping; Huang, Yongping
The silkworm Bombyx mori is one of the most economically important insects and serves as a model for Lepidoptera insects. We used serial analysis of gene expression (SAGE) to derive profiles of expressed genes during the developmental life cycle of the silkworm and to create a reference for understanding silkworm metamorphosis. We generated four SAGE libraries, one from each of the four developmental stages of the silkworm. In total we obtained 257,964 SAGE tags, of which 39,485 were unique tags. Sorted by copy number, 14.1% of the unique tags were detected at a median to high level (five or more copies), 24.2% at lower levels (two to four copies), and 61.7% as single copies. Using a basic local alignment search tool on the EST database, 35% of the tags matched known silkworm expressed sequence tags. SAGE demonstrated that a number of the genes were up- or down-regulated during the four developmental phases of the egg, larva, pupa, and adult. Furthermore, we found that the generation of longer cDNA fragments from SAGE tags constituted the most efficient method of gene identification, which facilitated the analysis of a large number of unknown genes.
Full Text Available Drosophila segmentation as a model organism is one of the most highly studied. Among many maternal segmentation coordinate genes, bicoid protein pattern plays a significant role during Drosophila embryogenesis, since this gradient determines most aspects of head and thorax development. Despite the fact that several models have been proposed to describe the bicoid gradient, due to its association with considerable error, each can only partially explain bicoid characteristics. In this paper, a modified version of singular spectrum analysis is examined for filtering and extracting the bicoid gene expression signal. The results with strong evidence indicate that the proposed technique is able to remove noise more effectively and can be considered as a promising method for filtering gene expression measurements for other applications.
Cebrián, Rubén; Rodríguez-Ruano, Sonia; Martínez-Bueno, Manuel; Valdivia, Eva; Maqueda, Mercedes; Montalbán-López, Manuel
The enterocin AS-48 is the best characterized antibacterial circular protein in prokaryotes. It is a hydrophobic and cationic bacteriocin, which is ribosomally synthesized by enterococcal cells and post-translationally cyclized by a head-to-tail peptide bond. The production of and immunity towards AS-48 depend upon the coordinated expression of ten genes organized in two operons, as-48ABC (where genes encoding enzymes with processing, secretion, and immunity functions are adjacent to the structural as-48A gene) and as-48C1DD1EFGH. The current study describes the identification of the promoters involved in AS-48 expression. Seven putative promoters have been here amplified, and separately inserted into the promoter-probe vector pTLR1, to create transcriptional fusions with the mCherry gene used as a reporter. The activity of these promoter regions was assessed measuring the expression of the fluorescent mCherry protein using the constitutive pneumococcal promoter PX as a reference. Our results revealed that only three promoters PA, P2(2) and PD1 were recognized in Enterococcus faecalis, Lactococcus lactis and Escherichia coli, in the conditions tested. The maximal fluorescence was obtained with PX in all the strains, followed by the P2(2) promoter, which level of fluorescence was 2-fold compared to PA and 4-fold compared to PD1. Analysis of putative factors influencing the promoter activity in single and double transformants in E. faecalis JH2-2 demonstrated that, in general, a better expression was achieved in presence of pAM401-81. In addition, the P2(2) promoter could be regulated in a negative fashion by genes existing in the native pMB-2 plasmid other than those of the as-48 cluster, while the pH seems to affect differently the as-48 promoter expression.
Full Text Available The enterocin AS-48 is the best characterized antibacterial circular protein in prokaryotes. It is a hydrophobic and cationic bacteriocin, which is ribosomally synthesized by enterococcal cells and post-translationally cyclized by a head-to-tail peptide bond. The production of and immunity towards AS-48 depend upon the coordinated expression of ten genes organized in two operons, as-48ABC (where genes encoding enzymes with processing, secretion, and immunity functions are adjacent to the structural as-48A gene and as-48C1DD1EFGH. The current study describes the identification of the promoters involved in AS-48 expression. Seven putative promoters have been here amplified, and separately inserted into the promoter-probe vector pTLR1, to create transcriptional fusions with the mCherry gene used as a reporter. The activity of these promoter regions was assessed measuring the expression of the fluorescent mCherry protein using the constitutive pneumococcal promoter PX as a reference. Our results revealed that only three promoters PA, P2(2 and PD1 were recognized in Enterococcus faecalis, Lactococcus lactis and Escherichia coli, in the conditions tested. The maximal fluorescence was obtained with PX in all the strains, followed by the P2(2 promoter, which level of fluorescence was 2-fold compared to PA and 4-fold compared to PD1. Analysis of putative factors influencing the promoter activity in single and double transformants in E. faecalis JH2-2 demonstrated that, in general, a better expression was achieved in presence of pAM401-81. In addition, the P2(2 promoter could be regulated in a negative fashion by genes existing in the native pMB-2 plasmid other than those of the as-48 cluster, while the pH seems to affect differently the as-48 promoter expression.
Koia, Jonni H; Moyle, Richard L; Botella, Jose R
Pineapple (Ananas comosus) is a tropical fruit crop of significant commercial importance. Although the physiological changes that occur during pineapple fruit development have been well characterized, little is known about the molecular events that occur during the fruit ripening process. Understanding the molecular basis of pineapple fruit ripening will aid the development of new varieties via molecular breeding or genetic modification. In this study we developed a 9277 element pineapple microarray and used it to profile gene expression changes that occur during pineapple fruit ripening. Microarray analyses identified 271 unique cDNAs differentially expressed at least 1.5-fold between the mature green and mature yellow stages of pineapple fruit ripening. Among these 271 sequences, 184 share significant homology with genes encoding proteins of known function, 53 share homology with genes encoding proteins of unknown function and 34 share no significant homology with any database accession. Of the 237 pineapple sequences with homologs, 160 were up-regulated and 77 were down-regulated during pineapple fruit ripening. DAVID Functional Annotation Cluster (FAC) analysis of all 237 sequences with homologs revealed confident enrichment scores for redox activity, organic acid metabolism, metalloenzyme activity, glycolysis, vitamin C biosynthesis, antioxidant activity and cysteine peptidase activity, indicating the functional significance and importance of these processes and pathways during pineapple fruit development. Quantitative real-time PCR analysis validated the microarray expression results for nine out of ten genes tested. This is the first report of a microarray based gene expression study undertaken in pineapple. Our bioinformatic analyses of the transcript profiles have identified a number of genes, processes and pathways with putative involvement in the pineapple fruit ripening process. This study extends our knowledge of the molecular basis of pineapple fruit
Weng, Li; Rubin, Edward M.; Bristow, James
Ecologists studying microbial life in the environment have recognized the enormous complexity of microbial diversity for many years, and the development of a variety of culture-independent methods, many of them coupled with high-throughput DNA sequencing, has allowed this diversity to be explored in ever greater detail. Despite the widespread application of these new techniques to the characterization of uncultivated microbes and microbial communities in the environment, their application to human health and disease has lagged behind. Because DNA based-techniques for defining uncultured microbes allow not only cataloging of microbial diversity, but also insight into microbial functions, investigators are beginning to apply these tools to the microbial communities that abound on and within us, in what has aptly been called the second Human Genome Project. In this review we discuss the sequence-based methods for microbial analysis that are currently available and their application to identify novel human pathogens, improve diagnosis of known infectious diseases, and to advance understanding of our relationship with microbial communities that normally reside in and on the human body.
Milius, Robert P; Heuer, Michael; Valiga, Daniel; Doroschak, Kathryn J; Kennedy, Caleb J; Bolon, Yung-Tsi; Schneider, Joel; Pollack, Jane; Kim, Hwa Ran; Cereb, Nezih; Hollenbach, Jill A; Mack, Steven J; Maiers, Martin
We present an electronic format for exchanging data for HLA and KIR genotyping with extensions for next-generation sequencing (NGS). This format addresses NGS data exchange by refining the Histoimmunogenetics Markup Language (HML) to conform to the proposed Minimum Information for Reporting Immunogenomic NGS Genotyping (MIRING) reporting guidelines (miring.immunogenomics.org). Our refinements of HML include two major additions. First, NGS is supported by new XML structures to capture additional NGS data and metadata required to produce a genotyping result, including analysis-dependent (dynamic) and method-dependent (static) components. A full genotype, consensus sequence, and the surrounding metadata are included directly, while the raw sequence reads and platform documentation are externally referenced. Second, genotype ambiguity is fully represented by integrating Genotype List Strings, which use a hierarchical set of delimiters to represent allele and genotype ambiguity in a complete and accurate fashion. HML also continues to enable the transmission of legacy methods (e.g. site-specific oligonucleotide, sequence-specific priming, and Sequence Based Typing (SBT)), adding features such as allowing multiple group-specific sequencing primers, and fully leveraging techniques that combine multiple methods to obtain a single result, such as SBT integrated with NGS. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Zhang, Ying; Qu, Pinghua; Zhang, Jian; Chen, Shouyi
To characterize the genes of Legionella pneumophila isolated from different water source in Guangzhou from 2006 to 2009. To genotype the strains by using sequence-based typing (SBT) scheme. In total 44 L. pneumophila strains were identified by SBT with 7 diversifying genes of flaA, asd, mip, pilE, mompS, proA and neuA. Analysis of the amplicons sequence was taken in the European Working Group for Legionella Infections (EWGLI) international SBT database to obtain the allelic profiles and sequence types (STs). Serogroups were typed by latex agglutination test. Data from SBT revealed a high diversity among the strains and ST01 accounts for 30% (13/ 44). Fifteen new STs were discovered from 20 STs and 2 of them were newly assigned (ST887 and ST888) by EWGLI. SBT Phylogenetic tree was generated by SplitsTree and BURST programs. High diversity and specificity were observed of the L. pneumophila strains in Guangzhou. SBT is useful for L. pneumophila genomic study and epidemiological surveillance.
Full Text Available Background: Chronic inflammation is a risk factor for colorectal cancer (CRC development. The aim of this study was to determine the differences in protein expression between CRC and the surrounding nontumorous colonic tissues in the mice that received azoxymethane (AOM and dextran sodium sulfate (DSS using a proteomic analysis. Materials and Methods: Male ICR mice were given a single intraperitoneal injection of AOM (10 mg/kg body weight, followed by 2% (w/v DSS in their drinking water for seven days, starting one week after the AOM injection. Colonic adenocarcinoma developed after 20 weeks and a proteomics analysis based on two-dimensional gel electrophoresis and ultraflex TOF/TOF mass spectrometry was conducted in the cancerous and nontumorous tissue specimens. Results: The proteomic analysis revealed 21 differentially expressed proteins in the cancerous tissues in comparison to the nontumorous tissues. There were five markedly increased proteins (beta-tropomyosin, tropomyosin 1 alpha isoform b, S100 calcium binding protein A9, and an unknown protein and 16 markedly decreased proteins (Car1 proteins, selenium-binding protein 1, HMG-CoA synthase, thioredoxin 1, 1 Cys peroxiredoxin protein 2, Fcgbp protein, Cytochrome c oxidase, subunit Va, ETHE1 protein, and 7 unknown proteins. Conclusions: There were 21 differentially expressed proteins in the cancerous tissues of the mice that received AOM and DSS. Their functions include metabolism, the antioxidant system, oxidative stress, mucin production, and inflammation. These findings may provide new insights into the mechanisms of inflammation-related colon carcinogenesis and the establishment of novel therapies and preventative strategies to treat carcinogenesis in the inflamed colon.
Larsen, Knud; Madsen, Lone Bruhn; Bendixen, Christian
to and protection from Parkinson’s disease. Here we report cloning, characterization, expression analysis and mapping of porcine UCHL1. The UCHL1 cDNA was amplified by reverse transcriptase polymerase chain reaction (RT-PCR) using oligonucleotide primers derived from in silico sequences. The porcine cDNA codes...... in developing porcine embryos. UCHL1 transcript was detected as early as 40 days of gestation. A significant decrease in UCHL1 transcript was detected in basal ganglia from day 60 to day 115 of gestation...
Full Text Available In saffron, the cleavage of zeaxanthin by means of CCD2 generates crocetin dialdehyde, which is then converted by an unknown aldehyde dehydrogenase to crocetin. A proteome from saffron stigma was released recently and, based on the expression pattern and correlation analyses, five aldehyde dehydrogenases (ALDHs were suggested as possible candidates to generate crocetin from crocetin dialdehydes. We selected four of the suggested ALDHs and analyzed their expression in different tissues, determined their activity over crocetin dialdehyde, and performed structure modeling and docking calculation to find their specificity. All the ALDHs were able to convert crocetin dialdehyde to crocetin, but two of them were stigma tissue-specific. Structure modeling and docking analyses revealed that, in all cases, there was a high coverage of residues in the models. All of them showed a very close conformation, indicated by the low root-mean-square deviation (RMSD values of backbone atoms, which indicate a high similarity among them. However, low affinity between the enzymes and the crocetin dialdehyde were observed. Phylogenetic analysis and binding affinities calculations, including some ALDHs from Gardenia jasmonoides, Crocus sieberi, and Buddleja species that accumulate crocetin and Bixa orellana synthetizing the apocarotenoid bixin selected on their expression pattern matching with the accumulation of either crocins or bixin, pointed out that family 2 C4 members might be involved in the conversion of crocetin dialdehyde to crocetin with high specificity.
Nagarajan, Rakesh; Bartley, Angela N; Bridge, Julia A; Jennings, Lawrence J; Kamel-Reid, Suzanne; Kim, Annette; Lazar, Alexander J; Lindeman, Neal I; Moncur, Joel; Rai, Alex J; Routbort, Mark J; Vasalos, Patricia; Merker, Jason D
- Detection of acquired variants in cancer is a paradigm of precision medicine, yet little has been reported about clinical laboratory practices across a broad range of laboratories. - To use College of American Pathologists proficiency testing survey results to report on the results from surveys on next-generation sequencing-based oncology testing practices. - College of American Pathologists proficiency testing survey results from more than 250 laboratories currently performing molecular oncology testing were used to determine laboratory trends in next-generation sequencing-based oncology testing. - These presented data provide key information about the number of laboratories that currently offer or are planning to offer next-generation sequencing-based oncology testing. Furthermore, we present data from 60 laboratories performing next-generation sequencing-based oncology testing regarding specimen requirements and assay characteristics. The findings indicate that most laboratories are performing tumor-only targeted sequencing to detect single-nucleotide variants and small insertions and deletions, using desktop sequencers and predesigned commercial kits. Despite these trends, a diversity of approaches to testing exists. - This information should be useful to further inform a variety of topics, including national discussions involving clinical laboratory quality systems, regulation and oversight of next-generation sequencing-based oncology testing, and precision oncology efforts in a data-driven manner.
Full Text Available Diabetes is among the most common causes of end-stage renal disease, although its pathophysiology is incompletely understood. We performed next-generation sequencing-based transcriptome analysis of renal gene expression changes in the OVE26 murine model of diabetes (age 15 weeks, relative to non-diabetic control, in the presence and absence of short-term (seven-day treatment with the angiotensin receptor blocker, losartan (n = 3-6 biological replicates per condition. We detected 1438 statistically significant changes in gene expression across conditions. Of the 638 genes dysregulated in diabetes relative to the non-diabetic state, >70% were downregulation events. Unbiased functional annotation of genes up- and down-regulated by diabetes strongly associated (p52-fold, encoded by the cationic amino acid transporter Slc7a12, and the gene product most highly downregulated by diabetes (>99%--encoded by the "pseudogene" Gm6300--are adjacent in the murine genome, are members of the SLC7 gene family, and are likely paralogous. Therefore, diabetes activates a near-total genetic switch between these two paralogs. Other individual-level changes in gene expression are potentially relevant to diabetic pathophysiology, and novel pathways are suggested. Genes unaffected by diabetes alone but exhibiting increased renal expression with losartan produced a signature consistent with malignant potential.
Full Text Available BACKGROUND: Real-time quantitative PCR (qPCR is still the gold-standard technique for gene-expression quantification. Recent technological advances of this method allow for the high-throughput gene-expression analysis, without the limitations of sample space and reagent used. However, non-commercial and user-friendly software for the management and analysis of these data is not available. RESULTS: The recently developed commercial microarrays allow for the drawing of standard curves of multiple assays using the same n-fold diluted samples. Data Analysis Gene (DAG Expression software has been developed to perform high-throughput gene-expression data analysis using standard curves for relative quantification and one or multiple reference genes for sample normalization. We discuss the application of DAG Expression in the analysis of data from an experiment performed with Fluidigm technology, in which 48 genes and 115 samples were measured. Furthermore, the quality of our analysis was tested and compared with other available methods. CONCLUSIONS: DAG Expression is a freely available software that permits the automated analysis and visualization of high-throughput qPCR. A detailed manual and a demo-experiment are provided within the DAG Expression software at http://www.dagexpression.com/dage.zip.
Full Text Available DNA microarray technologies are used extensively to profile the expression levels of thousands of genes under various conditions, yielding extremely large data-matrices. Thus, analyzing this information and extracting biologically relevant knowledge becomes a considerable challenge. A classical approach for tackling this challenge is to use clustering (also known as one-way clustering methods where genes (or respectively samples are grouped together based on the similarity of their expression profiles across the set of all samples (or respectively genes. An alternative approach is to develop biclustering methods to identify local patterns in the data. These methods extract subgroups of genes that are co-expressed across only a subset of samples and may feature important biological or medical implications. In this study we evaluate 13 biclustering and 2 clustering (k-means and hierarchical methods. We use several approaches to compare their performance on two real gene expression data sets. For this purpose we apply four evaluation measures in our analysis: (1 we examine how well the considered (biclustering methods differentiate various sample types; (2 we evaluate how well the groups of genes discovered by the (biclustering methods are annotated with similar Gene Ontology categories; (3 we evaluate the capability of the methods to differentiate genes that are known to be specific to the particular sample types we study and (4 we compare the running time of the algorithms. In the end, we conclude that as long as the samples are well defined and annotated, the contamination of the samples is limited, and the samples are well replicated, biclustering methods such as Plaid and SAMBA are useful for discovering relevant subsets of genes and samples.
Eren, Kemal; Deveci, Mehmet; Küçüktunç, Onur; Çatalyürek, Ümit V.
The need to analyze high-dimension biological data is driving the development of new data mining methods. Biclustering algorithms have been successfully applied to gene expression data to discover local patterns, in which a subset of genes exhibit similar expression levels over a subset of conditions. However, it is not clear which algorithms are best suited for this task. Many algorithms have been published in the past decade, most of which have been compared only to a small number of algorithms. Surveys and comparisons exist in the literature, but because of the large number and variety of biclustering algorithms, they are quickly outdated. In this article we partially address this problem of evaluating the strengths and weaknesses of existing biclustering methods. We used the BiBench package to compare 12 algorithms, many of which were recently published or have not been extensively studied. The algorithms were tested on a suite of synthetic data sets to measure their performance on data with varying conditions, such as different bicluster models, varying noise, varying numbers of biclusters and overlapping biclusters. The algorithms were also tested on eight large gene expression data sets obtained from the Gene Expression Omnibus. Gene Ontology enrichment analysis was performed on the resulting biclusters, and the best enrichment terms are reported. Our analyses show that the biclustering method and its parameters should be selected based on the desired model, whether that model allows overlapping biclusters, and its robustness to noise. In addition, we observe that the biclustering algorithms capable of finding more than one model are more successful at capturing biologically relevant clusters. PMID:22772837
Full Text Available Abstract Background Spontaneous tumors in dog have been demonstrated to share many features with their human counterparts, including relevant molecular targets, histological appearance, genetics, biological behavior and response to conventional treatments. Mammary tumors in dog therefore provide an attractive alternative to more classical mouse models, such as transgenics or xenografts, where the tumour is artificially induced. To assess the extent to which dog tumors represent clinically significant human phenotypes, we performed the first genome-wide comparative analysis of transcriptional changes occurring in mammary tumors of the two species, with particular focus on the molecular pathways involved. Results We analyzed human and dog gene expression data derived from both tumor and normal mammary samples. By analyzing the expression levels of about ten thousand dog/human orthologous genes we observed a significant overlap of genes deregulated in the mammary tumor samples, as compared to their normal counterparts. Pathway analysis of gene expression data revealed a great degree of similarity in the perturbation of many cancer-related pathways, including the 'PI3K/AKT', 'KRAS', 'PTEN', 'WNT-beta catenin' and 'MAPK cascade'. Moreover, we show that the transcriptional relationships between different gene signatures observed in human breast cancer are largely maintained in the canine model, suggesting a close interspecies similarity in the network of cancer signalling circuitries. Conclusion Our data confirm and further strengthen the value of the canine mammary cancer model and open up new perspectives for the evaluation of novel cancer therapeutics and the development of prognostic and diagnostic biomarkers to be used in clinical studies.
Remez, V.P.; Belyakova, E.G.
To determine traces of radiocaesium in water solution, the sorbent on the base of ferric potassium hexacyanoferrate on cellulose carrier ANFEZH was worked out. The sorbent is capable to extract effectively the isotopes of caesium from various natural solutions (fresh and sea water, milk, juices and so on). The usage of sorbent allows practically completely concentrate the isotopes of caesium from water samples with the volume of tens and hundreds litres. The sorbent in quantity of 50-500 grams allows to extract 98±1% of caesium from natural water samples with the volume up to 1000 litres during 1-5 hours. The usage of this sorbent allowed to conduct the express analysis of multiple bore holes within the area of 30 km of Chernobyl Skaya NPP , drinking water and milk in the regions of Belorussia, Ukraine and Russia, hit by Chernobyl disaster and around NPP in Russia and America. The use of this express analysis reduced the time and required labour as compared with to precipitation methods
Spínola, Hélder; Bruges-Armas, Jácome; Middleton, Derek; Brehm, António
Human leukocyte antigen (HLA)-A, -B, and -DRB1 polymorphisms were examined in the Cabo Verde and Guiné-Bissau populations. The data were obtained at high-resolution level, using sequence-based typing. The most frequent alleles in each locus was: A*020101 (16.7% in Guiné-Bissau and 13.5% in Cabo Verde), B*350101 (14.4% in Guiné-Bissau and 13.2% in Cabo Verde), DRB1*1304 (19.6% in Guiné-Bissau), and DRB1*1101 (10.1% in Cabo Verde). The predominant three loci haplotype in Guiné-Bissau was A*2301-B*1503-DRB1*1101 (4.6%) and in Cabo Verde was A*3002-B*350101-DRB1*1001 (2.8%), exclusive to northwestern islands (5.6%) and absent in Guiné-Bissau. The present study corroborates historic sources and other genetic studies that say Cabo Verde were populated not only by Africans but also by Europeans. Haplotypes and dendrogram analysis shows a Caucasian genetic influence in today's gene pool of Cabo Verdeans. Haplotypes and allele frequencies present a differential distribution between southeastern and northwestern Cabo Verde islands, which could be the result of different genetic influences, founder effect, or bottlenecks. Dendrograms and principal coordinates analysis show that Guineans are more similar to North Africans than other HLA-studied sub-Saharans, probably from ancient and recent genetic contacts with other peoples, namely East Africans.
Martins, Natacha; Picão, Renata Cristina; Cerqueira-Alves, Morgana; Uehara, Aline; Barbosa, Lívia Carvalho; Riley, Lee W; Moreira, Beatriz Meurer
A collection of 163 Acinetobacter baumannii isolates detected in a large Brazilian hospital, was potentially related with the dissemination of four clonal complexes (CC): 113/79, 103/15, 109/1 and 110/25, defined by University of Oxford/Institut Pasteur multilocus sequence typing (MLST) schemes. The urge of a simple multiplex-PCR scheme to specify these clones has motivated the present study. The established trilocus sequence-based typing (3LST, for ompA, csuE and blaOXA-51-like genes) multiplex-PCR rapidly identifies international clones I (CC109/1), II (CC118/2) and III (CC187/3). Thus, the system detects only one (CC109/1) out of four main CC in Brazil. We aimed to develop an alternative multiplex-PCR scheme to detect these clones, known to be present additionally in Africa, Asia, Europe, USA and South America. MLST, performed in the present study to complement typing our whole collection of isolates, confirmed that all isolates belonged to the same four CC detected previously. When typed by 3LST-based multiplex-PCR, only 12% of the 163 isolates were classified into groups. By comparative sequence analysis of ompA, csuE and blaOXA-51-like genes, a set of eight primers was designed for an alternative multiplex-PCR to distinguish the five CC 113/79, 103/15, 109/1, 110/25 and 118/2. Study isolates and one CC118/2 isolate were blind-tested with the new alternative PCR scheme; all were correctly clustered in groups of the corresponding CC. The new multiplex-PCR, with the advantage of fitting in a single reaction, detects five leading A. baumannii clones and could help preventing the spread in healthcare settings. Copyright © 2016 Elsevier B.V. All rights reserved.
Gustavo S. Fernandes
Full Text Available OBJECTIVES: With the development of next-generation sequencing (NGS technologies, DNA sequencing has been increasingly utilized in clinical practice. Our goal was to investigate the impact of genomic evaluation on treatment decisions for heavily pretreated patients with metastatic cancer. METHODS: We analyzed metastatic cancer patients from a single institution whose cancers had progressed after all available standard-of-care therapies and whose tumors underwent next-generation sequencing analysis. We determined the percentage of patients who received any therapy directed by the test, and its efficacy. RESULTS: From July 2013 to December 2015, 185 consecutive patients were tested using a commercially available next-generation sequencing-based test, and 157 patients were eligible. Sixty-six patients (42.0% were female, and 91 (58.0% were male. The mean age at diagnosis was 52.2 years, and the mean number of pre-test lines of systemic treatment was 2.7. One hundred and seventy-seven patients (95.6% had at least one identified gene alteration. Twenty-four patients (15.2% underwent systemic treatment directed by the test result. Of these, one patient had a complete response, four (16.7% had partial responses, two (8.3% had stable disease, and 17 (70.8% had disease progression as the best result. The median progression-free survival time with matched therapy was 1.6 months, and the median overall survival was 10 months. CONCLUSION: We identified a high prevalence of gene alterations using an next-generation sequencing test. Although some benefit was associated with the matched therapy, most of the patients had disease progression as the best response, indicating the limited biological potential and unclear clinical relevance of this practice.
The nuclear matrices of plant cell nuclei display intrinsic nuclease activity which consists in nicking supercoiled DNA. A cDNA encoding a 32 kDa endonuclease has been cloned and sequenced. The nucleotide and deduced amino-acid sequences show high homology to known 14-3-3-protein sequences from other sources. The amino-acid sequence shows agreement with consensus sequences for potential phosphorylation by protein kinase A and C and for calcium, lipid and membrane-binding sites. The nucleotide-binding site is also present within the conserved part of the sequence. By Northern blot analysis, the differential expression of the corresponding mRNA was detected; it was the strongest in sink tissues. The endonuclease activity found on DNA-polyacrylamide gel electrophoresis coincided with mRNA content and was the highest in tuber. (author). 22 refs, 6 figs
Takenaka, Yasuhiro; Noda-Ogura, Akiko; Imanishi, Tadashi; Yamaguchi, Atsushi; Gojobori, Takashi; Shigeri, Yasushi
We recently reported the cDNA sequences of 11 copepod luciferases from the superfamily Augaptiloidea in the order Calanoida. They were classified into two groups, Metridinidae and Heterorhabdidae/Lucicutiidae families, by phylogenetic analyses. To elucidate the evolutionary processes, we have now further isolated 12 copepod luciferases from Augaptiloidea species (Metridia asymmetrica, Metridia curticauda, Pleuromamma scutullata, Pleuromamma xiphias, Lucicutia ovaliformis and Heterorhabdus tanneri). Codon-based synonymous/nonsynonymous tests of positive selection for 25 identified copepod luciferases suggested that positive Darwinian selection operated in the evolution of Heterorhabdidae luciferases, whereas two types of Metridinidae luciferases had diversified via neutral mechanism. By in silico analysis of the decoded amino acid sequences of 25 copepod luciferases, we inferred two protein sequences as ancestral copepod luciferases. They were expressed in HEK293 cells where they exhibited notable luciferase activity both in intracellular lysates and cultured media, indicating that the luciferase activity was established before evolutionary diversification of these copepod species. © 2013.
Liu, Wei; Li, Li; Ye, Hua; Tu, Wei
High-throughput biological technologies are now widely applied in biology and medicine, allowing scientists to monitor thousands of parameters simultaneously in a specific sample. However, it is still an enormous challenge to mine useful information from high-throughput data. The emergence of network biology provides deeper insights into complex bio-system and reveals the modularity in tissue/cellular networks. Correlation networks are increasingly used in bioinformatics applications. Weighted gene co-expression network analysis (WGCNA) tool can detect clusters of highly correlated genes. Therefore, we systematically reviewed the application of WGCNA in the study of disease diagnosis, pathogenesis and other related fields. First, we introduced principle, workflow, advantages and disadvantages of WGCNA. Second, we presented the application of WGCNA in disease, physiology, drug, evolution and genome annotation. Then, we indicated the application of WGCNA in newly developed high-throughput methods. We hope this review will help to promote the application of WGCNA in biomedicine research.
Presson, Angela P; Horvath, Steve; Yoon, Nam K; Bagryanova, Lora; Mah, Vei; Alavi, Mohammad; Maresh, Erin L; Rajasekaran, Ayyappan K; Goodglick, Lee; Chia, David
Tissue microarray (TMA) data are commonly used to validate the prognostic accuracy of tumor markers. For example, breast cancer TMA data have led to the identification of several promising prognostic markers of survival time. Several studies have shown that TMA data can also be used to cluster patients into clinically distinct groups. Here we use breast cancer TMA data to cluster patients into distinct prognostic groups. We apply weighted correlation network analysis (WGCNA) to TMA data consisting of 26 putative tumor biomarkers measured on 82 breast cancer patients. Based on this analysis we identify three groups of patients with low (5.4%), moderate (22%) and high (50%) mortality rates, respectively. We then develop a simple threshold rule using a subset of three markers (p53, Na-KATPase-β1, and TGF β receptor II) that can approximately define these mortality groups. We compare the results of this correlation network analysis with results from a standard Cox regression analysis. We find that the rule-based grouping variable (referred to as WGCNA*) is an independent predictor of survival time. While WGCNA* is based on protein measurements (TMA data), it validated in two independent Affymetrix microarray gene expression data (which measure mRNA abundance). We find that the WGCNA patient groups differed by 35% from mortality groups defined by a more conventional stepwise Cox regression analysis approach. We show that correlation network methods, which are primarily used to analyze the relationships between gene products, are also useful for analyzing the relationships between patients and for defining distinct patient groups based on TMA data. We identify a rule based on three tumor markers for predicting breast cancer survival outcomes
Lees Jonathan G
Full Text Available Abstract Background A number of sequence-based methods exist for protein secondary structure prediction. Protein secondary structures can also be determined experimentally from circular dichroism, and infrared spectroscopic data using empirical analysis methods. It has been proposed that comparable accuracy can be obtained from sequence-based predictions as from these biophysical measurements. Here we have examined the secondary structure determination accuracies of sequence prediction methods with the empirically determined values from the spectroscopic data on datasets of proteins for which both crystal structures and spectroscopic data are available. Results In this study we show that the sequence prediction methods have accuracies nearly comparable to those of spectroscopic methods. However, we also demonstrate that combining the spectroscopic and sequences techniques produces significant overall improvements in secondary structure determinations. In addition, combining the extra information content available from synchrotron radiation circular dichroism data with sequence methods also shows improvements. Conclusion Combining sequence prediction with experimentally determined spectroscopic methods for protein secondary structure content significantly enhances the accuracy of the overall results obtained.
In this study, comparison of the outer membrane protein P5 gene (ompP5) sequence-based typing with pulsed-field gel electrophoresis (PFGE) for the genotyping of Haemophilus parasuis, the 15 serovar reference strains and 43 isolates were investigated. When comparing the two methods, 31 ompP5 sequence types ...
Lundegaard, Claus; Hoof, Ilka; Lund, Ole
Sequence based T-cell epitope predictions have improved immensely in the last decade. From predictions of peptide binding to major histocompatibility complex molecules with moderate accuracy, limited allele coverage, and no good estimates of the other events in the antigen-processing pathway, the...
Yang, Xiaoxia; Wang, Jia; Sun, Jun; Liu, Rong
Protein-nucleic acid interactions are central to various fundamental biological processes. Automated methods capable of reliably identifying DNA- and RNA-binding residues in protein sequence are assuming ever-increasing importance. The majority of current algorithms rely on feature-based prediction, but their accuracy remains to be further improved. Here we propose a sequence-based hybrid algorithm SNBRFinder (Sequence-based Nucleic acid-Binding Residue Finder) by merging a feature predictor SNBRFinderF and a template predictor SNBRFinderT. SNBRFinderF was established using the support vector machine whose inputs include sequence profile and other complementary sequence descriptors, while SNBRFinderT was implemented with the sequence alignment algorithm based on profile hidden Markov models to capture the weakly homologous template of query sequence. Experimental results show that SNBRFinderF was clearly superior to the commonly used sequence profile-based predictor and SNBRFinderT can achieve comparable performance to the structure-based template methods. Leveraging the complementary relationship between these two predictors, SNBRFinder reasonably improved the performance of both DNA- and RNA-binding residue predictions. More importantly, the sequence-based hybrid prediction reached competitive performance relative to our previous structure-based counterpart. Our extensive and stringent comparisons show that SNBRFinder has obvious advantages over the existing sequence-based prediction algorithms. The value of our algorithm is highlighted by establishing an easy-to-use web server that is freely accessible at http://ibi.hzau.edu.cn/SNBRFinder.
Narjaikaew, Pattawan; Emarat, Narumon; Arayathanitkul, Kwan; Cowie, Bronwen
The study investigated the impact on student motivation and understanding of magnetism of teaching sequences based on an inductive approach. The study was conducted in large lecture classes. A pre- and post-Conceptual Survey of Electricity and Magnetism was conducted with just fewer than 700 Thai undergraduate science students, before and after…
Taft, A S; Vermeire, J J; Bernier, J; Birkeland, S R; Cipriano, M J; Papa, A R; McArthur, A G; Yoshino, T P
Infection of the snail, Biomphalaria glabrata, by the free-swimming miracidial stage of the human blood fluke, Schistosoma mansoni, and its subsequent development to the parasitic sporocyst stage is critical to establishment of viable infections and continued human transmission. We performed a genome-wide expression analysis of the S. mansoni miracidia and developing sporocyst using Long Serial Analysis of Gene Expression (LongSAGE). Five cDNA libraries were constructed from miracidia and in vitro cultured 6- and 20-day-old sporocysts maintained in sporocyst medium (SM) or in SM conditioned by previous cultivation with cells of the B. glabrata embryonic (Bge) cell line. We generated 21 440 SAGE tags and mapped 13 381 to the S. mansoni gene predictions (v4.0e) either by estimating theoretical 3' UTR lengths or using existing 3' EST sequence data. Overall, 432 transcripts were found to be differentially expressed amongst all 5 libraries. In total, 172 tags were differentially expressed between miracidia and 6-day conditioned sporocysts and 152 were differentially expressed between miracidia and 6-day unconditioned sporocysts. In addition, 53 and 45 tags, respectively, were differentially expressed in 6-day and 20-day cultured sporocysts, due to the effects of exposure to Bge cell-conditioned medium.
Full Text Available The miR-15/107 family comprises a group of 10 paralogous microRNAs (miRNAs, sharing a 5′ AGCAGC sequence. These miRNAs have overlapping targets. In order to characterize the expression of miR-15/107 family miRNAs, we employed customized TaqMan Low-Density micro-fluid PCR-array to investigate the expression of miR-15/107 family members, and other selected miRNAs, in 11 human tissues obtained at autopsy including the cerebral cortex, frontal cortex, primary visual cortex, thalamus, heart, lung, liver, kidney, spleen, stomach and skeletal muscle. miR-103, miR-195 and miR-497 were expressed at similar levels across various tissues, whereas miR-107 is enriched in brain samples. We also examined the expression patterns of evolutionarily conserved miR-15/107 miRNAs in three distinct primary rat brain cell preparations (enriched for cortical neurons, astrocytes and microglia, respectively. In primary cultures of rat brain cells, several members of the miR-15/107 family are enriched in neurons compared to other cell types in the central nervous system (CNS. In addition to mature miRNAs, we also examined the expression of precursors (pri-miRNAs. Our data suggested a generally poor correlation between the expression of mature miRNAs and their precursors. In summary, we provide a detailed study of the tissue and cell type-specific expression profile of this highly expressed and phylogenetically conserved family of miRNA genes.
Katherine E. Harris
Full Text Available We created a novel transgenic rat that expresses human antibodies comprising a diverse repertoire of heavy chains with a single common rearranged kappa light chain (IgKV3-15-JK1. This fixed light chain animal, called OmniFlic, presents a unique system for human therapeutic antibody discovery and a model to study heavy chain repertoire diversity in the context of a constant light chain. The purpose of this study was to analyze heavy chain variable gene usage, clonotype diversity, and to describe the sequence characteristics of antigen-specific monoclonal antibodies (mAbs isolated from immunized OmniFlic animals. Using next-generation sequencing antibody repertoire analysis, we measured heavy chain variable gene usage and the diversity of clonotypes present in the lymph node germinal centers of 75 OmniFlic rats immunized with 9 different protein antigens. Furthermore, we expressed 2,560 unique heavy chain sequences sampled from a diverse set of clonotypes as fixed light chain antibody proteins and measured their binding to antigen by ELISA. Finally, we measured patterns and overall levels of somatic hypermutation in the full B-cell repertoire and in the 2,560 mAbs tested for binding. The results demonstrate that OmniFlic animals produce an abundance of antigen-specific antibodies with heavy chain clonotype diversity that is similar to what has been described with unrestricted light chain use in mammals. In addition, we show that sequence-based discovery is a highly effective and efficient way to identify a large number of diverse monoclonal antibodies to a protein target of interest.
Stein, Wilfred D; Litman, Thomas; Fojo, Tito
are their corresponding solid tumors. We used the Serial Analysis of Gene Expression (SAGE) database to identify differences between solid tumors and cell lines, hoping to detect genes that could potentially explain differences in drug sensitivity. SAGE libraries were available for both solid tumors and cell lines from...
Rot, Gregor; Parikh, Anup; Curk, Tomaz; Kuspa, Adam; Shaulsky, Gad; Zupan, Blaz
Background Bioinformatics often leverages on recent advancements in computer science to support biologists in their scientific discovery process. Such efforts include the development of easy-to-use web interfaces to biomedical databases. Recent advancements in interactive web technologies require us to rethink the standard submit-and-wait paradigm, and craft bioinformatics web applications that share analytical and interactive power with their desktop relatives, while retaining simplicity and availability. Results We have developed dictyExpress, a web application that features a graphical, highly interactive explorative interface to our database that consists of more than 1000 Dictyostelium discoideum gene expression experiments. In dictyExpress, the user can select experiments and genes, perform gene clustering, view gene expression profiles across time, view gene co-expression networks, perform analyses of Gene Ontology term enrichment, and simultaneously display expression profiles for a selected gene in various experiments. Most importantly, these tasks are achieved through web applications whose components are seamlessly interlinked and immediately respond to events triggered by the user, thus providing a powerful explorative data analysis environment. Conclusion dictyExpress is a precursor for a new generation of web-based bioinformatics applications with simple but powerful interactive interfaces that resemble that of the modern desktop. While dictyExpress serves mainly the Dictyostelium research community, it is relatively easy to adapt it to other datasets. We propose that the design ideas behind dictyExpress will influence the development of similar applications for other model organisms. PMID:19706156
Guan, Lihong; Chen, Liping; Chen, Yongsen; Zhang, Nu; Han, Yawei
The fructosyltransferase gene was isolated and cloned from Aspergillus oryzae. The gene was 1368 bp, which encoded a protein of 455 amino acids. To analyze the activity of the expressed fructosyltransferase, the pET32a-fructosyltransferase recombined plasmid was transformed into Escherichia coli BL21. The fructosyltransferase gene was successfully expressed by Isopropyl-β-d-thiogalactoside (IPTG) induction. The molecular weight of the expression protein was about 45 kDa. The optimal conditions of protein expression were 25 °C, 0.1 mM IPTG, and 8 h of inducing time. The optimal concentration of urea dealing with inclusion body was 2.5 M. The expressed protein exhibited a strong fructosyl transfer activity. These results showed that the expressed fructosyltransferas owned transferase activity, and could catalyze the synthesis of sucrose-6-acetate.
Garcia-Fernàndez, J; Baguñà, J; Saló, E
Freshwater planarians (Platyhelminthes, Turbellaria, and Tricladida) are acoelomate, triploblastic, unsegmented, and bilaterally symmetrical organisms that are mainly known for their ample power to regenerate a complete organism from a small piece of their body. To identify potential pattern-control genes in planarian regeneration, we have isolated two homeobox-containing genes, Dth-1 and Dth-2 [Dugesia (Girardia) tigrina homeobox], by using degenerate oligonucleotides corresponding to the most conserved amino acid sequence from helix-3 of the homeodomain. Dth-1 and Dth-2 homeodomains are closely related (68% at the nucleotide level and 78% at the protein level) and show the conserved residues characteristic of the homeodomains identified to data. Similarity with most homeobox sequences is low (30-50%), except with Drosophila NK homeodomains (80-82% with NK-2) and the rodent TTF-1 homeodomain (77-87%). Some unusual amino acid residues specific to NK-2, TTF-1, Dth-1, and Dth-2 can be observed in the recognition helix (helix-3) and may define a family of homeodomains. The deduced amino acid sequences from the cDNAs contain, in addition to the homeodomain, other domains also present in various homeobox-containing genes. The expression of both genes, detected by Northern blot analysis, appear slightly higher in cephalic regions than in the rest of the intact organism, while a slight increase is detected in the central period (5 days) or regeneration. Images PMID:1714599
Sun, Shiquan; Hood, Michelle; Scott, Laura; Peng, Qinke; Mukherjee, Sayan; Tung, Jenny; Zhou, Xiang
Identifying differentially expressed (DE) genes from RNA sequencing (RNAseq) studies is among the most common analyses in genomics. However, RNAseq DE analysis presents several statistical and computational challenges, including over-dispersed read counts and, in some settings, sample non-independence. Previous count-based methods rely on simple hierarchical Poisson models (e.g. negative binomial) to model independent over-dispersion, but do not account for sample non-independence due to relatedness, population structure and/or hidden confounders. Here, we present a Poisson mixed model with two random effects terms that account for both independent over-dispersion and sample non-independence. We also develop a scalable sampling-based inference algorithm using a latent variable representation of the Poisson distribution. With simulations, we show that our method properly controls for type I error and is generally more powerful than other widely used approaches, except in small samples (n <15) with other unfavorable properties (e.g. small effect sizes). We also apply our method to three real datasets that contain related individuals, population stratification or hidden confounders. Our results show that our method increases power in all three data compared to other approaches, though the power gain is smallest in the smallest sample (n = 6). Our method is implemented in MACAU, freely available at www.xzlab.org/software.html. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Niu, Jianfeng; Hu, Haiyan; Hu, Songnian; Wang, Guangce; Peng, Guang; Sun, Song
In 2008, a green tide broke out before the sailing competition of the 29th Olympic Games in Qingdao. The causative species was determined to be Enteromorpha prolifera ( Ulva prolifera O. F. Müller), a familiar green macroalga along the coastline of China. Rapid accumulation of a large biomass of floating U. prolifera prompted research on different aspects of this species. In this study, we constructed a nonnormalized cDNA library from the thalli of U. prolifera and acquired 10 072 high-quality expressed sequence tags (ESTs). These ESTs were assembled into 3 519 nonredundant gene groups, including 1 446 clusters and 2 073 singletons. After annotation with the nr database, a large number of genes were found to be related with chloroplast and ribosomal protein, GO functional classification showed 1 418 ESTs participated in photosynthesis and 1 359 ESTs were responsible for the generation of precursor metabolites and energy. In addition, rather comprehensive carbon fixation pathways were found in U. prolifera using KEGG. Some stress-related and signal transduction-related genes were also found in this study. All the evidences displayed that U. prolifera had substance and energy foundation for the intense photosynthesis and the rapid proliferation. Phylogenetic analysis of cytochrome c oxidase subunit I revealed that this green-tide causative species is most closely affiliated to Pseudendoclonium akinetum (Ulvophyceae).
An, L; Xie, H; Chin, MH; Obradovic, Z; Smith, DJ; Megalooikonomou, V
Abstract Background Gene expression signatures in the mammalian brain hold the key to understanding neural development and neurological disease. Researchers have previously used voxelation in combination with microarrays for acquisition of genome-wide atlases of expression patterns in the mouse brain. On the other hand, some work has been performed on studying gene functions, without taking into account the location information of a gene's expression in a mouse brain. In this paper, we presen...
Wan, Cen; Lees, Jonathan G; Minneci, Federico; Orengo, Christine A; Jones, David T
Accurate gene or protein function prediction is a key challenge in the post-genome era. Most current methods perform well on molecular function prediction, but struggle to provide useful annotations relating to biological process functions due to the limited power of sequence-based features in that functional domain. In this work, we systematically evaluate the predictive power of temporal transcription expression profiles for protein function prediction in Drosophila melanogaster. Our results show significantly better performance on predicting protein function when transcription expression profile-based features are integrated with sequence-derived features, compared with the sequence-derived features alone. We also observe that the combination of expression-based and sequence-based features leads to further improvement of accuracy on predicting all three domains of gene function. Based on the optimal feature combinations, we then propose a novel multi-classifier-based function prediction method for Drosophila melanogaster proteins, FFPred-fly+. Interpreting our machine learning models also allows us to identify some of the underlying links between biological processes and developmental stages of Drosophila melanogaster.
Full Text Available Accurate gene or protein function prediction is a key challenge in the post-genome era. Most current methods perform well on molecular function prediction, but struggle to provide useful annotations relating to biological process functions due to the limited power of sequence-based features in that functional domain. In this work, we systematically evaluate the predictive power of temporal transcription expression profiles for protein function prediction in Drosophila melanogaster. Our results show significantly better performance on predicting protein function when transcription expression profile-based features are integrated with sequence-derived features, compared with the sequence-derived features alone. We also observe that the combination of expression-based and sequence-based features leads to further improvement of accuracy on predicting all three domains of gene function. Based on the optimal feature combinations, we then propose a novel multi-classifier-based function prediction method for Drosophila melanogaster proteins, FFPred-fly+. Interpreting our machine learning models also allows us to identify some of the underlying links between biological processes and developmental stages of Drosophila melanogaster.
Rantalainen, Mattias; Klevebring, Daniel; Lindberg, Johan; Ivansson, Emma; Rosin, Gustaf; Kis, Lorand; Celebioglu, Fuat; Fredriksson, Irma; Czene, Kamila; Frisell, Jan; Hartman, Johan; Bergh, Jonas; Grönberg, Henrik
Sequencing-based breast cancer diagnostics have the potential to replace routine biomarkers and provide molecular characterization that enable personalized precision medicine. Here we investigate the concordance between sequencing-based and routine diagnostic biomarkers and to what extent tumor sequencing contributes clinically actionable information. We applied DNA- and RNA-sequencing to characterize tumors from 307 breast cancer patients with replication in up to 739 patients. We developed models to predict status of routine biomarkers (ER, HER2,Ki-67, histological grade) from sequencing data. Non-routine biomarkers, including mutations in BRCA1, BRCA2 and ERBB2(HER2), and additional clinically actionable somatic alterations were also investigated. Concordance with routine diagnostic biomarkers was high for ER status (AUC = 0.95;AUC(replication) = 0.97) and HER2 status (AUC = 0.97;AUC(replication) = 0.92). The transcriptomic grade model enabled classification of histological grade 1 and histological grade 3 tumors with high accuracy (AUC = 0.98;AUC(replication) = 0.94). Clinically actionable mutations in BRCA1, BRCA2 and ERBB2(HER2) were detected in 5.5% of patients, while 53% had genomic alterations matching ongoing or concluded breast cancer studies. Sequencing-based molecular profiling can be applied as an alternative to histopathology to determine ER and HER2 status, in addition to providing improved tumor grading and clinically actionable mutations and molecular subtypes. Our results suggest that sequencing-based breast cancer diagnostics in a near future can replace routine biomarkers.
Full Text Available Protein-nucleic acid interactions are central to various fundamental biological processes. Automated methods capable of reliably identifying DNA- and RNA-binding residues in protein sequence are assuming ever-increasing importance. The majority of current algorithms rely on feature-based prediction, but their accuracy remains to be further improved. Here we propose a sequence-based hybrid algorithm SNBRFinder (Sequence-based Nucleic acid-Binding Residue Finder by merging a feature predictor SNBRFinderF and a template predictor SNBRFinderT. SNBRFinderF was established using the support vector machine whose inputs include sequence profile and other complementary sequence descriptors, while SNBRFinderT was implemented with the sequence alignment algorithm based on profile hidden Markov models to capture the weakly homologous template of query sequence. Experimental results show that SNBRFinderF was clearly superior to the commonly used sequence profile-based predictor and SNBRFinderT can achieve comparable performance to the structure-based template methods. Leveraging the complementary relationship between these two predictors, SNBRFinder reasonably improved the performance of both DNA- and RNA-binding residue predictions. More importantly, the sequence-based hybrid prediction reached competitive performance relative to our previous structure-based counterpart. Our extensive and stringent comparisons show that SNBRFinder has obvious advantages over the existing sequence-based prediction algorithms. The value of our algorithm is highlighted by establishing an easy-to-use web server that is freely accessible at http://ibi.hzau.edu.cn/SNBRFinder.
Pak, Yu.; Ponomaryova, M
Full text: Sulphur content is an important qualitative coal parameter. The problem of coal sulphur content determining remains one of the most important both in Kazakhstan and in other coal-mining countries. The traditional method of sampling, the final stage of which is chemical analysis of coal for sulphur, is characterized by high labour intensity and low productivity. That's why it is ineffective for mass express analytical quality control and technological schemes of coal processing control. In this connection it is very urgent to develop a method of coal sulphur content on the base of a series nuclear-geophysical equipment with an isotope source of primary radiation, allowing to increase analysis representativity and maximally take into account coal real composition inconstancy. To solve the problem set it is necessary to study the main laws of X-ray-radiometric method applied to the coal quality analysis for working out instrumental methods of speed determining of coal sulphur content with satisfactory accuracy for technological tasks, to determine laws of changing the flows of characteristic X-ray and scattered radiation from coal sulphur content of various real composition and to optimize methodical and hardware parameters, providing minimal error of sulphur content control. On the base of studying laws of real composition coal components and their interconnections with sulphur content there has been substantiated the expediency of using hardware functions of calcium and iron to control coal sulphur contents; there has been suggested a model to estimate the methodical error of coal sulphur content determining on the base of the data about sensitivity to sulphur and effecting factors using ultimate methods of coal components substitution methods allowing to optimize sulphur control parameters; there has been worked out an algorithm of X-ray-radiometric control of sulphur content based on the sequential radiating the analyzed coal with gamma-radiation of
Huang, Jianzi; Lu, Xiang; Yan, Hao; Chen, Shouyi; Zhang, Wanke; Huang, Rongfeng; Zheng, Yizhi
Semi-mangroves form a group of transitional species between glycophytes and halophytes, and hold unique potential for learning molecular mechanisms underlying plant salt tolerance. Millettia pinnata is a semi-mangrove plant that can survive a wide range of saline conditions in the absence of specialized morphological and physiological traits. By employing the Illumina sequencing platform, we generated ~192 million short reads from four cDNA libraries of M. pinnata and processed them into 108,598 unisequences with a high depth of coverage. The mean length and total length of these unisequences were 606 bp and 65.8 Mb, respectively. A total of 54,596 (50.3%) unisequences were assigned Nr annotations. Functional classification revealed the involvement of unisequences in various biological processes related to metabolism and environmental adaptation. We identified 23,815 candidate salt-responsive genes with significantly differential expression under seawater and freshwater treatments. Based on the reverse transcription-polymerase chain reaction (RT-PCR) and real-time PCR analyses, we verified the changes in expression levels for a number of candidate genes. The functional enrichment analyses for the candidate genes showed tissue-specific patterns of transcriptome remodelling upon salt stress in the roots and the leaves. The transcriptome of M. pinnata will provide valuable gene resources for future application in crop improvement. In addition, this study sets a good example for large-scale identification of salt-responsive genes in non-model organisms using the sequencing-based approach.
Atarbashi Moghadam, Saede; Atarbashi Moghadam, Fazele; Eini, Ebrahim
P63 may have a role in tumorigenesis and cytodifferentiation of odontogenic lesions. We investigated the immunohistochemical expression of P63 in a total of 30 cases of odontogenic cysts and tumors. The percentage of positive cells was calculated in the lining of odontogenic cysts and islands of ameloblastoma. P63 expression was evident in all types of odontogenic lesions. P63 was expressed throughout the lining epithelium of odontogenic keratocyst except surface parakeratinized layer. In addition, calcifying odontogenic cyst showed P63 expression in all layers. In almost all radicular and dentigerous cysts, the basal and parabasal layers were immunoreactive. Peripheral cells of ameloblastoma expressed P63; however, stellate reticulum had weaker immunostaining. No significant difference in P63 expression was observed between studied lesions (P = 0.86). Expression of P63 in odontogenic lesions suggests that this protein is important in differentiation and proliferation of odontogenic epithelial cells. However, it seems that it could not be a useful marker to differentiate between aggressive and nonaggressive lesions. P63 also represents a progenitor or basal cell marker, and it is not expressed in mature differentiated cells. PMID:24350278
Valstar, M.F.; Mehu, M.; Jiang, Bihan; Pantic, Maja; Scherer, K.
Automatic facial expression recognition has been an active topic in computer science for over two decades, in particular facial action coding system action unit (AU) detection and classification of a number of discrete emotion states from facial expressive imagery. Standardization and comparability
Saede Atarbashi Moghadam
Full Text Available P63 may have a role in tumorigenesis and cytodifferentiation of odontogenic lesions. We investigated the immunohistochemical expression of P63 in a total of 30 cases of odontogenic cysts and tumors. The percentage of positive cells was calculated in the lining of odontogenic cysts and islands of ameloblastoma. P63 expression was evident in all types of odontogenic lesions. P63 was expressed throughout the lining epithelium of odontogenic keratocyst except surface parakeratinized layer. In addition, calcifying odontogenic cyst showed P63 expression in all layers. In almost all radicular and dentigerous cysts, the basal and parabasal layers were immunoreactive. Peripheral cells of ameloblastoma expressed P63; however, stellate reticulum had weaker immunostaining. No significant difference in P63 expression was observed between studied lesions (. Expression of P63 in odontogenic lesions suggests that this protein is important in differentiation and proliferation of odontogenic epithelial cells. However, it seems that it could not be a useful marker to differentiate between aggressive and nonaggressive lesions. P63 also represents a progenitor or basal cell marker, and it is not expressed in mature differentiated cells.
Schmöle, Anne-Caroline; Lundt, Ramona; Gennequin, Benjamin; Schrage, Hanna; Beins, Eva; Krämer, Alexandra; Zimmer, Till; Limmer, Andreas; Zimmer, Andreas; Otte, David-Marian
The endocannabinoid system (ECS) is a retrograde messenger system, consisting of lipid signaling molecules that bind to at least two G-protein-coupled receptors, Cannabinoid receptor 1 and 2 (CB1 and 2). As CB2 is primarily expressed on immune cells such as B cells, T cells, macrophages, dendritic cells, and microglia, it is of great interest how CB2 contributes to immune cell development and function in health and disease. Here, understanding the mechanisms of CB2 involvement in immune-cell function as well as the trafficking and regulation of CB2 expressing cells are crucial issues. Up to now, CB2 antibodies produce unclear results, especially those targeting the murine protein. Therefore, we have generated BAC transgenic GFP reporter mice (CB2-GFPTg) to trace CB2 expression in vitro and in situ. Those mice express GFP under the CB2 promoter and display GFP expression paralleling CB2 expression on the transcript level in spleen, thymus and brain tissue. Furthermore, by using fluorescence techniques we show that the major sources for GFP-CB2 expression are B cells in spleen and blood and microglia in the brain. This novel CB2-GFP transgenic reporter mouse line represents a powerful resource to study CB2 expression in different cell types. Furthermore, it could be used for analyzing CB2-mediated mobilization and trafficking of immune cells as well as studying the fate of recruited immune cells in models of acute and chronic inflammation.
Full Text Available The endocannabinoid system (ECS is a retrograde messenger system, consisting of lipid signaling molecules that bind to at least two G-protein-coupled receptors, Cannabinoid receptor 1 and 2 (CB1 and 2. As CB2 is primarily expressed on immune cells such as B cells, T cells, macrophages, dendritic cells, and microglia, it is of great interest how CB2 contributes to immune cell development and function in health and disease. Here, understanding the mechanisms of CB2 involvement in immune-cell function as well as the trafficking and regulation of CB2 expressing cells are crucial issues. Up to now, CB2 antibodies produce unclear results, especially those targeting the murine protein. Therefore, we have generated BAC transgenic GFP reporter mice (CB2-GFPTg to trace CB2 expression in vitro and in situ. Those mice express GFP under the CB2 promoter and display GFP expression paralleling CB2 expression on the transcript level in spleen, thymus and brain tissue. Furthermore, by using fluorescence techniques we show that the major sources for GFP-CB2 expression are B cells in spleen and blood and microglia in the brain. This novel CB2-GFP transgenic reporter mouse line represents a powerful resource to study CB2 expression in different cell types. Furthermore, it could be used for analyzing CB2-mediated mobilization and trafficking of immune cells as well as studying the fate of recruited immune cells in models of acute and chronic inflammation.
Vega, Ana Isabel; Pérez-Cerdá, Celia; Abia, David; Gámez, Alejandra; Briones, Paz; Artuch, Rafael; Desviat, Lourdes R; Ugarte, Magdalena; Pérez, Belén
Deficiency of phosphomannomutase (PMM2, MIM#601785) is the most common congenital disorder of glycosylation. Herein we report the genetic analysis of 22 Spanish PMM2 deficient patients and the functional analysis of 14 nucleotide changes in a prokaryotic expression system in order to elucidate their molecular pathogenesis. PMM2 activity assay revealed the presence of six protein changes with no enzymatic activities (p.R123Q, p.R141H, p.F157S, p.P184T, p.F207S and p.D209G) and seven mild protein changes with residual activities ranging from 16 to 54% (p.L32R, p.V44A p.D65Y, p.P113L p.T118S, p.T237M and p.C241S) and also one variant change with normal activity (p.E197A). The results obtained from Western blot analysis, degradation time courses of 11 protein changes and structural analysis of the PMM2 protein, suggest that the loss-of-function of most mutant proteins is based on their increased susceptibility to degradation or aggregation compared to the wild type protein, considering PMM2 deficiency as a conformational disease. We have identified exclusively catalytic protein change (p.D209G), catalytic protein changes affecting protein stability (p.R123Q and p.R141H), two protein changes disrupting the dimer interface (p.P113L and p.T118S) and several misfolding changes (p.L32R, p.V44A, p.D65Y, p.F157S, p.P184T, p.F207S, p.T237M and p.C241S). Our current work opens a promising therapeutic option using pharmacological chaperones to revert the effect of the characterized misfolding mutations identified in a wide range of PMM2 deficient patients.
Full Text Available Abstract Motivation Detecting differentially expressed (DE genes between disease and normal control group is one of the most common analyses in genome-wide transcriptomic data. Since most studies don’t have a lot of samples, researchers have used meta-analysis to group different datasets for the same disease. Even then, in many cases the statistical power is still not enough. Taking into account the fact that many diseases share the same disease genes, it is desirable to design a statistical framework that can identify diseases’ common and specific DE genes simultaneously to improve the identification power. Results We developed a novel empirical Bayes based mixture model to identify DE genes in specific study by leveraging the shared information across multiple different disease expression data sets. The effectiveness of joint analysis was demonstrated through comprehensive simulation studies and two real data applications. The simulation results showed that our method consistently outperformed single data set analysis and two other meta-analysis methods in identification power. In real data analysis, overall our method demonstrated better identification power in detecting DE genes and prioritized more disease related genes and disease related pathways than single data set analysis. Over 150% more disease related genes are identified by our method in application to Huntington’s disease. We expect that our method would provide researchers a new way of utilizing available data sets from different diseases when sample size of the focused disease is limited.
Dylan T Jones
Full Text Available Angiogenesis is essential for solid tumour growth, whilst the molecular profiles of tumour blood vessels have been reported to be different between cancer types. Although presently available anti-angiogenic strategies are providing some promise for the treatment of some cancers it is perhaps not surprisingly that, none of the anti-angiogenic agents available work on all tumours. Thus, the discovery of novel anti-angiogenic targets, relevant to individual cancer types, is required. Using Affymetrix microarray analysis of laser-captured, CD31-positive blood vessels we have identified 63 genes that are upregulated significantly (5-72 fold in angiogenic blood vessels associated with human invasive ductal carcinoma (IDC of the breast as compared with blood vessels in normal human breast. We tested the angiogenic capacity of a subset of these genes. Genes were selected based on either their known cellular functions, their enriched expression in endothelial cells and/or their sensitivity to anti-VEGF treatment; all features implicating their involvement in angiogenesis. For example, RRM2, a ribonucleotide reductase involved in DNA synthesis, was upregulated 32-fold in IDC-associated blood vessels; ATF1, a nuclear activating transcription factor involved in cellular growth and survival was upregulated 23-fold in IDC-associated blood vessels and HEX-B, a hexosaminidase involved in the breakdown of GM2 gangliosides, was upregulated 8-fold in IDC-associated blood vessels. Furthermore, in silico analysis confirmed that AFT1 and HEX-B also were enriched in endothelial cells when compared with non-endothelial cells. None of these genes have been reported previously to be involved in neovascularisation. However, our data establish that siRNA depletion of Rrm2, Atf1 or Hex-B had significant anti-angiogenic effects in VEGF-stimulated ex vivo mouse aortic ring assays. Overall, our results provide proof-of-principle that our approach can identify a cohort of
Xu, Zong-Chang; Kong, Yingzhen
Cellulose-synthase proteins (CESAs) are membrane localized proteins and they form protein complexes to produce cellulose in the plasma membrane. CESA proteins play very important roles in cell wall construction during plant growth and development. In this study, a total of 21 NtCESA gene sequences were identified by using PF03552 conserved protein sequence and 10 AtCESA protein sequences of Arabidopsis thaliana to blast against the common tobacco (Nicotiana tabacum L.) genome database with TBLASTN protocol. We analyzed the physical and chemical properties of protein sequences based on some software or on-line analysis tools. The results showed that there were no significant variances in terms of the physical and chemical properties of the 21 NtCESA proteins. First, phylogenetic tree analysis showed that 21 NtCESA genes and 10 AtCESA genes were clustered into five groups, and the gene structures were similar among the genes that are clustered into the same group. Second, in all of the 21 NtCESA proteins the conserved zinc finger domain was identified in the N-terminus, transmembrane domains were identified in the C-terminus and the DDD-QXXRW conserved domains were also identified. Third, gene expression analysis results indicated that most NtCESA genes were expressed in roots and leaves of seedling or mature tissues of tobacco, seeds and callus tissues. The genes that clustered into the same group share similar expression patterns. Importantly, NtCESA proteins that are involved in secondary cell wall cellulose synthesis have two extra transmembrane domains compared with that involved in primary cell wall cellulose biosynthesis. In addition, subcellular localization results showed that NtCESA9 and NtCESA14 were two plasma membrane anchored proteins. This study will lay a foundation for further functional characterization of these NtCESA genes.
Abstract. Background: Human cytomegalovirus (HCMV) is a virus which has the potential to alter cellular gene expression through .... and (reverse: 5'-CAG CAC CAT CCT CCT CTT. CCT CT ..... acute respiratory syndrome (SARS) coronavirus.
1á (eukaryotic elongation factor 1-alpha) using SYBER-Green. ... expression in the developing seed tissues and can be targeted using the dsRNA induced sequence specific RNA degradation mechanism for reduction of phytate levels without ...
Methods: Changes in mRNA expression levels of human endothelial-like ... recognized as a risk factor for vascular diseases, like ..... and JUN kinase signaling pathways and transform ... protein accumulates at the G1-S phase boundary and.
Gao, Na; Ma, Bin-Guang; Zhang, Yu-Sheng; Song, Qin; Chen, Ling-Ling; Zhang, Hong-Yu
To investigate the general radiation-resistant mechanisms of bacteria, bioinformatic method was employed to predict highly expressed genes for four radiation-resistant bacteria, i.e. Deinococcus geothermalis (D. geo), Deinococcus radiodurans (D. rad), Kineococcus radiotolerans (K. rad) and Rubrobacter xylanophilus (R. xyl). It is revealed that most of the three reference gene sets, i.e. ribosomal proteins, transcription factors and major chaperones, are generally highly expressed in the four ...
CCL5 Chemokine (C-C motif) ligand 5 /RANTES. IFNγ Interferon gamma TNFα Tumor necrosis factor alpha HMGB1 High mobility group box 1 protein /high...aim of this study was to analyze gene expression levels of human host factors in melioidosis patients and establish useful correlation with disease...PBMC’s) of study subjects. Gene expression profiles of 25 gene targets including 19 immune response genes and 6 epigenetic factors were analyzed by
Ancelin, C.; Le, P.; DeSaint-Quentin, S.; Villatte, N.
This paper presents EXPRESS, an expert system developed for the automation of reliability studies. The first part consists in the description of the method for static thermohydraulic systems. In this step, the authors define the knowledge representation based on the two inference engines - ALOUETTE and LCR developed by EDF. They explain all the process to construct a fault tree from a topological and functional description of the system. Numerous examples are exhibited in illustration of the method. This is followed by the lessons derived from the studies performed on some safety systems of the PALUEL nuclear plant. The development of the same approach for electric power systems is described, insisting on the difference resulting from the sequential nature of these systems. Finally, they show the main advantages identified during the studies
Salagacka-Kubiak, Aleksandra; Żebrowska, Marta; Wosiak, Agnieszka; Balcerczak, Mariusz; Mirowski, Marek; Balcerczak, Ewa
The aim of this study was to evaluate the participation of polymorphism at position C421A and mRNA expression of the ABCG2 gene in the development of peptic ulcers, which is a very common and severe disease. ABCG2, encoded by the ABCG2 gene, has been found inter alia in the gastrointestinal tract, where it plays a protective role eliminating xenobiotics from cells into the extracellular environment. The materials for the study were biopsies of gastric mucosa taken during a routine endoscopy. For genotyping by polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) at position C421A, DNA was isolated from 201 samples, while for the mRNA expression level by real-time PCR, RNA was isolated from 60 patients. The control group of healthy individuals consisted of 97 blood donors. The dominant genotype in the group of peptic ulcer patients and healthy individuals was homozygous CC. No statistically significant differences between healthy individuals and the whole group of peptic ulcer patients and, likewise, between the subgroups of peptic ulcer patients (infected and uninfected with Helicobacter pylori) were found. ABCG2 expression relative to GAPDH expression was found in 38 of the 60 gastric mucosa samples. The expression level of the gene varies greatly among cases. The statistically significant differences between the intensity (p = 0.0375) of H. pylori infection and ABCG2 gene expression have been shown. It was observed that the more intense the infection, the higher the level of ABCG2 expression.
Full Text Available We evaluated the expression of several genes involved in tissue remodelling and bone development in patients with calcific tendinopathy of the rotator cuff. Biopsies from calcified and non-calcified areas were obtained from 10 patients (8 women and 2 men; average age: 55 years; range: 40-68 with calcific tendinopathy of the rotator cuff. To evaluate the expression of selected genes, RNA extraction, cDNA synthesis and quantitative polymerase chain reaction (PCR were performed. A significantly increased expression of tissue transglutaminase (tTG2 and its substrate, osteopontin, was detected in the calcific areas compared to the levels observed in the normal tissue from the same subject with calcific tendinopathy, whereas a modest increase was observed for catepsin K. There was also a significant decrease in mRNA expression of Bone Morphogenetic Protein (BMP4 and BMP6 in the calcific area. BMP-2, collagen V and vascular endothelial growth factor (VEGF did not show significant differences. Collagen X and matrix metalloproteinase (MMP-9 were not detectable. A variation in expression of these genes could be characteristic of this form tendinopathy, since an increased level of these genes has not been detected in other forms of tendon lesions.
Norton James H
Full Text Available Abstract Background Menisci play a vital role in load transmission, shock absorption and joint stability. There is increasing evidence suggesting that OA menisci may not merely be bystanders in the disease process of OA. This study sought: 1 to determine the prevalence of meniscal degeneration in OA patients, and 2 to examine gene expression in OA meniscal cells compared to normal meniscal cells. Methods Studies were approved by our human subjects Institutional Review Board. Menisci and articular cartilage were collected during joint replacement surgery for OA patients and lower limb amputation surgery for osteosarcoma patients (normal control specimens, and graded. Meniscal cells were prepared from these meniscal tissues and expanded in monolayer culture. Differential gene expression in OA meniscal cells and normal meniscal cells was examined using Affymetrix microarray and real time RT-PCR. Results The grades of meniscal degeneration correlated with the grades of articular cartilage degeneration (r = 0.672; P HLA-DPA1, integrin, beta 2 (ITGB2, ectonucleotide pyrophosphatase/phosphodiesterase 1 (ENPP1, ankylosis, progressive homolog (ANKH and fibroblast growth factor 7 (FGF7, were expressed at significantly higher levels in OA meniscal cells compared to normal meniscal cells. Importantly, many of the genes that have been shown to be differentially expressed in other OA cell types/tissues, including ADAM metallopeptidase with thrombospondin type 1 motif 5 (ADAMTS5 and prostaglandin E synthase (PTGES, were found to be expressed at significantly higher levels in OA meniscal cells. This consistency suggests that many of the genes detected in our study are disease-specific. Conclusion Our findings suggest that OA is a whole joint disease. Meniscal cells may play an active role in the development of OA. Investigation of the gene expression profiles of OA meniscal cells may reveal new therapeutic targets for OA therapy and also may uncover novel
Tan, Jie; Huyck, Matthew; Hu, Dongbo; Zelaya, René A; Hogan, Deborah A; Greene, Casey S
Gene set enrichment analysis and overrepresentation analyses are commonly used methods to determine the biological processes affected by a differential expression experiment. This approach requires biologically relevant gene sets, which are currently curated manually, limiting their availability and accuracy in many organisms without extensively curated resources. New feature learning approaches can now be paired with existing data collections to directly extract functional gene sets from big data. Here we introduce a method to identify perturbed processes. In contrast with methods that use curated gene sets, this approach uses signatures extracted from public expression data. We first extract expression signatures from public data using ADAGE, a neural network-based feature extraction approach. We next identify signatures that are differentially active under a given treatment. Our results demonstrate that these signatures represent biological processes that are perturbed by the experiment. Because these signatures are directly learned from data without supervision, they can identify uncurated or novel biological processes. We implemented ADAGE signature analysis for the bacterial pathogen Pseudomonas aeruginosa. For the convenience of different user groups, we implemented both an R package (ADAGEpath) and a web server ( http://adage.greenelab.com ) to run these analyses. Both are open-source to allow easy expansion to other organisms or signature generation methods. We applied ADAGE signature analysis to an example dataset in which wild-type and ∆anr mutant cells were grown as biofilms on the Cystic Fibrosis genotype bronchial epithelial cells. We mapped active signatures in the dataset to KEGG pathways and compared with pathways identified using GSEA. The two approaches generally return consistent results; however, ADAGE signature analysis also identified a signature that revealed the molecularly supported link between the MexT regulon and Anr. We designed
The results of prokaryotic expression of ZnBP and overexpression of the ZnBP gene in A. thaliana improve our understanding of the function of this gene. Future studies should investigate the molecular mechanisms involved in gland morphogenesis in cotton. Key words: Gossypium hirsutum, pigment gland, zinc binding ...
Full Text Available Heparan sulfate Proteoglycans (HSPG are ubiquitous molecules with indispensable functions in various biological processes. Glypicans are a family of HSPG's, characterized by a Gpi-anchor which directs them to the cell surface and/or extracellular matrix where they regulate growth factor signaling during development and disease. We report the identification and expression pattern of glypican genes from zebrafish. The zebrafish genome contains 10 glypican homologs, as opposed to six in mammals, which are highly conserved and are phylogenetically related to the mammalian genes. Some of the fish glypicans like Gpc1a, Gpc3, Gpc4, Gpc6a and Gpc6b show conserved synteny with their mammalian cognate genes. Many glypicans are expressed during the gastrulation stage, but their expression becomes more tissue specific and defined during somitogenesis stages, particularly in the developing central nervous system. Existence of multiple glypican orthologs in fish with diverse expression pattern suggests highly specialized and/or redundant function of these genes during embryonic development.
Aug 21, 2013 ... Plant basic leucine zipper (bZIP) proteins play an essential role in the genes expression and regulation in higher plants. They have been shown to regulate diverse plant specific phenomena, including germination, floral induction and development, seed maturation, photomorphogenesis, biotic and.
Aug 26, 2016 ... Late embryogenesis abundant (LEA) protein family is a large protein family that includes proteins accumulated at late stages of seed development or in vegetative tissues in response to drought, salinity, cold stress and exogenous application of abscisic acid. In order to isolate peanut genes, an expressed ...
Full Text Available Schistosome worms of the genus Schistosoma are the causative agents of schistosomiasis, a devastating parasitic disease affecting more than 240 million people worldwide. Schistosomes have complex life cycles, and have been challenging to manipulate genetically due to the dearth of molecular tools. Although the use of gene overexpression, gene knockouts or knockdowns are straight-forward genetic tools applied in many model systems, gene misexpression and genetic manipulation of schistosome genes in vivo has been exceptionally challenging, and plasmid based transfection inducing gene expression is limited. We recently reported the use of polyethyleneimine (PEI as a simple and effective method for schistosome transfection and gene expression. Here, we use PEI-mediated schistosome plasmid transgenesis to define and compare gene expression profiles from endogenous and nonendogenous promoters in the schistosomula stage of schistosomes that are potentially useful to misexpress (underexpress or overexpress gene product levels. In addition, we overexpress schistosome genes in vivo using a strong promoter and show plasmid-based misregulation of genes in schistosomes, producing a clear and distinct phenotype--death. These data focus on the schistosomula stage, but they foreshadow strong potential for genetic characterization of schistosome molecular pathways, and potential for use in overexpression screens and drug resistance studies in schistosomes using plasmid-based gene expression.
Dec 7, 2011 ... gene was cloned and expressed and the induction of the recombinant MAP30 protein on .... RNA reverse transcription was carried out by RevertAidTM First ... volume of Premix Ex Taq™ (Takara Bio Inc, Japan), PCR cycling.
Adifferentially expressed fragment EST145 was isolated by suppression subtractive hybridization (SSH) method. Using EST145 as the probe, a blue copper-binding protein gene designated as DvBCB was screened from Dasypyrum villosum cDNA Library. The DvBCB gene was 845 bp in length with an open reading frame ...
In this study, the DNA sequence of vitellogenin from Antheraea pernyi (Ap-Vg) was identified and its functional domain (30-740 aa, Ap-Vg-1) was expressed in Escherichia coli BL21 (DE3) cells. The recombinant Ap-Vg-1 proteins were purified and used for antibody preparation. The results showed that the intact DNA ...
Fehrmann, Rudolf S. N.; Karjalainen, Juha M.; Krajewska, Malgorzata
Many cancer-associated somatic copy number alterations (SCNAs) are known. Currently, one of the challenges is to identify the molecular downstream effects of these variants. Although several SCNAs are known to change gene expression levels, it is not clear whether each individual SCNA affects gen...
After challenged with Aeromonas hydrophila infection or iron-dextran stimulation, the hepcidin transcript levels were analyzed by RT-PCR. The results revealed that the expression of hepcidin dramatically increased at 24 h post-infection of the pathogen injection. Moreover, hepcidin mRNAs in the liver, intestine and brain ...
Several clustering and biclustering methods have been introduced to analyze the gene expression data by identifying the similar patterns and grouping genes into subsets that share biological significance. However, it is not clear how the different methods compare with each other with respect to the biological relevance of ...
Background Molecular characterization has contributed to the understanding of the inception, progression, treatment and prognosis of cancer. Nucleic acid array-based technologies extend molecular characterization of tumors to thousands of gene products. To effectively discriminate between tumor sub-types, reliable laboratory techniques and analytic methods are required. Results We derived mRNA expression profiles from 21 human tissue samples (eight normal kidneys and 13 kidney tumors) and two pooled samples using the Affymetrix GeneChip platform. A panel of ten clustering algorithms combined with four data pre-processing methods identified a consensus cluster dendrogram in 18 of 40 analyses and of these 16 used a logarithmic transformation. Within the consensus dendrogram the expression profiles of the samples grouped according to tissue type; clear cell and chromophobe carcinomas displayed distinctly different gene expression patterns. By using a rigorous statistical selection based method we identified 355 genes that showed significant (p Matrix Organization and Adhesion. Conclusions Affymetrix GeneChip profiling differentiated clear cell and chromophobe carcinomas from one another and from normal kidney cortex. Clustering methods that used logarithmic transformation of data sets produced dendrograms consistent with the sample biology. Functional taxonomy provided a practical approach to the interpretation of gene expression data. PMID:12356337
Madore Steven J
Full Text Available Abstract Background Molecular characterization has contributed to the understanding of the inception, progression, treatment and prognosis of cancer. Nucleic acid array-based technologies extend molecular characterization of tumors to thousands of gene products. To effectively discriminate between tumor sub-types, reliable laboratory techniques and analytic methods are required. Results We derived mRNA expression profiles from 21 human tissue samples (eight normal kidneys and 13 kidney tumors and two pooled samples using the Affymetrix GeneChip platform. A panel of ten clustering algorithms combined with four data pre-processing methods identified a consensus cluster dendrogram in 18 of 40 analyses and of these 16 used a logarithmic transformation. Within the consensus dendrogram the expression profiles of the samples grouped according to tissue type; clear cell and chromophobe carcinomas displayed distinctly different gene expression patterns. By using a rigorous statistical selection based method we identified 355 genes that showed significant (p Conclusions Affymetrix GeneChip profiling differentiated clear cell and chromophobe carcinomas from one another and from normal kidney cortex. Clustering methods that used logarithmic transformation of data sets produced dendrograms consistent with the sample biology. Functional taxonomy provided a practical approach to the interpretation of gene expression data.
This research cloned endochitinase-antifreeze protein precursor (EAPP) gene of Dong-mu 70 rye (Secale cereale) by designing special primers according to Genbank's EAPP gene sequence, and analyzing the influence of low temperature stress on the expression of mRNA with RT-PCR. The results indicated that the ...
Oct 20, 2008 ... isoform of the myocyte enhancer factor 2 gene from the silkworm, Bombyx mori. Qing-zhi Ling1, 2, ... BMEF2B mRNA content in the brain was measured using the combined method of quantitative RT-PCR and Southern ... specific cofactors to control gene expression in pheno- typically different muscles.
Oct 1, 2017 ... p-distance model for amino acid substitutions. A bootstrap .... These were a thymine/cytosine (T/C) SNP and a thymine/adenine (T/A) SNP. ..... Two rat homologues of Drosophila achaete–scute specifically expressed in ...
Haloxylon ammodendron (C.A Mey.) Bunge is a xero-halophytic desert shrub with excellent drought resistance and salt tolerance. To decipher the molecular responses involved in its drought resistance, the cDNA-AFLP (amplified fragment length polymorphism) technique was employed to identify genes expressed ...
Hal, van N.L.W.; Vorst, O.; Houwelingen, van A.M.M.L.; Kok, E.J.; Peijnenburg, A.A.C.M.; Aharoni, A.; Tunen, van A.J.; Keijer, J.
DNA microarray technology is a new and powerful technology that will substantially increase the speed of molecular biological research. This paper gives a survey of DNA microarray technology and its use in gene expression studies. The technical aspects and their potential improvements are discussed.
Wang, Xi; Gardiner, Erin J; Cairns, Murray J
Normalization of high-throughput molecular expression profiles secures differential expression analysis between samples of different phenotypes or biological conditions, and facilitates comparison between experimental batches. While the same general principles apply to microRNA (miRNA) normalization, there is mounting evidence that global shifts in their expression patterns occur in specific circumstances, which pose a challenge for normalizing miRNA expression data. As an alternative to global normalization, which has the propensity to flatten large trends, normalization against constitutively expressed reference genes presents an advantage through their relative independence. Here we investigated the performance of reference-gene-based (RGB) normalization for differential miRNA expression analysis of microarray expression data, and compared the results with other normalization methods, including: quantile, variance stabilization, robust spline, simple scaling, rank invariant, and Loess regression. The comparative analyses were executed using miRNA expression in tissue samples derived from subjects with schizophrenia and non-psychiatric controls. We proposed a consistency criterion for evaluating methods by examining the overlapping of differentially expressed miRNAs detected using different partitions of the whole data. Based on this criterion, we found that RGB normalization generally outperformed global normalization methods. Thus we recommend the application of RGB normalization for miRNA expression data sets, and believe that this will yield a more consistent and useful readout of differentially expressed miRNAs, particularly in biological conditions characterized by large shifts in miRNA expression.
Full Text Available The severity and prevalence of many diseases are known to differ between the sexes. Organ specific sex-biased gene expression may underpin these and other sexually dimorphic traits. To further our understanding of sex differences in transcriptional regulation, we performed meta-analyses of sex biased gene expression in multiple human tissues. We analysed 22 publicly available human gene expression microarray data sets including over 2500 samples from 15 different tissues and 9 different organs. Briefly, by using an inverse-variance method we determined the effect size difference of gene expression between males and females. We found the greatest sex differences in gene expression in the brain, specifically in the anterior cingulate cortex, (1818 genes, followed by the heart (375 genes, kidney (224 genes, colon (218 genes and thyroid (163 genes. More interestingly, we found different parts of the brain with varying numbers and identity of sex-biased genes, indicating that specific cortical regions may influence sexually dimorphic traits. The majority of sex-biased genes in other tissues such as the bladder, liver, lungs and pancreas were on the sex chromosomes or involved in sex hormone production. On average in each tissue, 32% of autosomal genes that were expressed in a sex-biased fashion contained androgen or estrogen hormone response elements. Interestingly, across all tissues, we found approximately two-thirds of autosomal genes that were sex-biased were not under direct influence of sex hormones. To our knowledge this is the largest analysis of sex-biased gene expression in human tissues to date. We identified many sex-biased genes that were not under the direct influence of sex chromosome genes or sex hormones. These may provide targets for future development of sex-specific treatments for diseases.
Nov 6, 2012 ... analysis of a water stress inducible copper-containing ... Although, in slico analysis of the protein have indicated its probable structure and functions, further ..... based on protein data bank (PDB) template c1ksiA which.
M J Pont
Full Text Available Cellular immunotherapy has proven to be effective in the treatment of hematological cancers by donor lymphocyte infusion after allogeneic hematopoietic stem cell transplantation and more recently by targeted therapy with chimeric antigen or T-cell receptor-engineered T cells. However, dependent on the tissue distribution of the antigens that are targeted, anti-tumor responses can be accompanied by undesired side effects. Therefore, detailed tissue distribution analysis is essential to estimate potential efficacy and toxicity of candidate targets for immunotherapy of hematological malignancies. We performed microarray gene expression analysis of hematological malignancies of different origins, healthy hematopoietic cells and various non-hematopoietic cell types from organs that are often targeted in detrimental immune responses after allogeneic stem cell transplantation leading to graft-versus-host disease. Non-hematopoietic cells were also cultured in the presence of IFN-γ to analyze gene expression under inflammatory circumstances. Gene expression was investigated by Illumina HT12.0 microarrays and quality control analysis was performed to confirm the cell-type origin and exclude contamination of non-hematopoietic cell samples with peripheral blood cells. Microarray data were validated by quantitative RT-PCR showing strong correlations between both platforms. Detailed gene expression profiles were generated for various minor histocompatibility antigens and B-cell surface antigens to illustrate the value of the microarray dataset to estimate efficacy and toxicity of candidate targets for immunotherapy. In conclusion, our microarray database provides a relevant platform to analyze and select candidate antigens with hematopoietic (lineage-restricted expression as potential targets for immunotherapy of hematological cancers.
Gao, Wu-Jun; Li, Shu-Fen; Zhang, Guo-Jun; Wang, Ning-Na; Deng, Chuan-Liang; Lu, Long-Dou
To identify rapidly a number of genes probably involved in sex determination and differentiation of the dioecious plant Asparagus officinalis, gene expression profiles in early flower development for male and female plants were investigated by microarray assay with 8,665 probes. In total, 638 male-biased and 543 female-biased genes were identified. These genes with biased-expression for male and female were involved in a variety of processes associated with molecular functions, cellular components, and biological processes, suggesting that a complex mechanism underlies the sex development of asparagus. Among the differentially expressed genes involved in the reproductive process, a number of genes associated with floral development were identified. Reverse transcription-PCR was performed for validation, and the results were largely consistent with those obtained by microarray analysis. The findings of this study might contribute to understanding of the molecular mechanisms of sex determination and differentiation in dioecious asparagus and provide a foundation for further studies of this plant.
Zhang, Zhang; Liu, Jingxing; Wu, Jiayan; Yu, Jun
The regulation of gene expression is essential for eukaryotes, as it drives the processes of cellular differentiation and morphogenesis, leading to the creation of different cell types in multicellular organisms. RNA-Sequencing (RNA-Seq) provides researchers with a powerful toolbox for characterization and quantification of transcriptome. Many different human tissue/cell transcriptome datasets coming from RNA-Seq technology are available on public data resource. The fundamental issue here is how to develop an effective analysis method to estimate expression pattern similarities between different tumor tissues and their corresponding normal tissues. We define the gene expression pattern from three directions: 1) expression breadth, which reflects gene expression on/off status, and mainly concerns ubiquitously expressed genes; 2) low/high or constant/variable expression genes, based on gene expression level and variation; and 3) the regulation of gene expression at the gene structure level. The cluster analysis indicates that gene expression pattern is higher related to physiological condition rather than tissue spatial distance. Two sets of human housekeeping (HK) genes are defined according to cell/tissue types, respectively. To characterize the gene expression pattern in gene expression level and variation, we firstly apply improved K-means algorithm and a gene expression variance model. We find that cancer-associated HK genes (a HK gene is specific in cancer group, while not in normal group) are expressed higher and more variable in cancer condition than in normal condition. Cancer-associated HK genes prefer to AT-rich genes, and they are enriched in cell cycle regulation related functions and constitute some cancer signatures. The expression of large genes is also avoided in cancer group. These studies will help us understand which cell type-specific patterns of gene expression differ among different cell types, and particularly for cancer. PMID:23382867
Mer, Arvind Singh; Klevebring, Daniel; Grönberg, Henrik; Rantalainen, Mattias
Sequencing-based molecular characterization of tumors provides information required for individualized cancer treatment. There are well-defined molecular subtypes of breast cancer that provide improved prognostication compared to routine biomarkers. However, molecular subtyping is not yet implemented in routine breast cancer care. Clinical translation is dependent on subtype prediction models providing high sensitivity and specificity. In this study we evaluate sample size and RNA-sequencing read requirements for breast cancer subtyping to facilitate rational design of translational studies. We applied subsampling to ascertain the effect of training sample size and the number of RNA sequencing reads on classification accuracy of molecular subtype and routine biomarker prediction models (unsupervised and supervised). Subtype classification accuracy improved with increasing sample size up to N = 750 (accuracy = 0.93), although with a modest improvement beyond N = 350 (accuracy = 0.92). Prediction of routine biomarkers achieved accuracy of 0.94 (ER) and 0.92 (Her2) at N = 200. Subtype classification improved with RNA-sequencing library size up to 5 million reads. Development of molecular subtyping models for cancer diagnostics requires well-designed studies. Sample size and the number of RNA sequencing reads directly influence accuracy of molecular subtyping. Results in this study provide key information for rational design of translational studies aiming to bring sequencing-based diagnostics to the clinic.
Sun, Tanlin; Zhou, Bo; Lai, Luhua; Pei, Jianfeng
Protein-protein interactions (PPIs) are critical for many biological processes. It is therefore important to develop accurate high-throughput methods for identifying PPI to better understand protein function, disease occurrence, and therapy design. Though various computational methods for predicting PPI have been developed, their robustness for prediction with external datasets is unknown. Deep-learning algorithms have achieved successful results in diverse areas, but their effectiveness for PPI prediction has not been tested. We used a stacked autoencoder, a type of deep-learning algorithm, to study the sequence-based PPI prediction. The best model achieved an average accuracy of 97.19% with 10-fold cross-validation. The prediction accuracies for various external datasets ranged from 87.99% to 99.21%, which are superior to those achieved with previous methods. To our knowledge, this research is the first to apply a deep-learning algorithm to sequence-based PPI prediction, and the results demonstrate its potential in this field.
Ma, J.; Stiller, J.; Berkman, P.J.; Wei, Y.M.; Rogers, J.; Feuillet, C.; Doležel, Jaroslav; Mayer, K. F.; Eversole, K.; Zheng, Y.L.; Liu, C.L.
Roč. 8, č. 11 (2013) E-ISSN 1932-6203 Institutional research plan: CEZ:AV0Z50380511 Keywords : CHROMOSOMAL TRANSLOCATIONS * HOMOEOLOGOUS GROUP-4 * EVOLUTION Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 3.534, year: 2013
Ping, Zhu; Xu-Qing, Tang; Zhen-Yuan, Xu
Proteomics is the study of proteins and their interactions in a cell. With the successful completion of the Human Genome Project, it comes the postgenome era when the proteomics technology is emerging. This paper studies protein molecule from the algebraic point of view. The algebraic system (Σ, +, *) is introduced, where Σ is the set of 64 codons. According to the characteristics of (Σ, +, *), a novel quasi-amino acids code classification method is introduced and the corresponding algebraic operation table over the set ZU of the 16 kinds of quasi-amino acids is established. The internal relation is revealed about quasi-amino acids. The results show that there exist some very close correlations between the properties of the quasi-amino acids and the codon. All these correlation relationships may play an important part in establishing the logic relationship between codons and the quasi-amino acids during the course of life origination. According to Ma F et al (2003 J. Anhui Agricultural University 30 439), the corresponding relation and the excellent properties about amino acids code are very difficult to observe. The present paper shows that (ZU, ⊕, ) is a field. Furthermore, the operational results display that the codon tga has different property from other stop codons. In fact, in the mitochondrion from human and ox genomic codon, tga is just tryptophane, is not the stop codon like in other genetic code, it is the case of the Chen W C et al (2002 Acta Biophysica Sinica 18(1) 87). The present theory avoids some inexplicable events of the 20 kinds of amino acids code, in other words it solves the problem of 'the 64 codon assignments of mRNA to amino acids is probably completely wrong' proposed by Yang (2006 Progress in Modern Biomedicine 6 3). (cross-disciplinary physics and related areas of science and technology)
Zemla, Adam T; Zhou, Carol E; Lam, Marisa W; Smith, Jason R; Pardes, Elizabeth
Disclosed are computational methods, and associated hardware and software products for scoring conservation in a protein structure based on a computationally identified family or cluster of protein structures. A method of computationally identifying a family or cluster of protein structures in also disclosed herein.
Naidu, Hariprasad; Subramanian, B Mohana; Chinchkar, Shankar Ramchandra; Sriraman, Rajan; Rana, Samir Kumar; Srinivasan, V A
The antigenic types of canine parvovirus (CPV) are defined based on differences in the amino acids of the major capsid protein VP2. Type specificity is conferred by a limited number of amino acid changes and in particular by few nucleotide substitutions. PCR based methods are not particularly suitable for typing circulating variants which differ in a few specific nucleotide substitutions. Assays for determining SNPs can detect efficiently nucleotide substitutions and can thus be adapted to identify CPV types. In the present study, CPV typing was performed by single nucleotide extension using the mini-sequencing technique. A mini-sequencing signature was established for all the four CPV types (CPV2, 2a, 2b and 2c) and feline panleukopenia virus. The CPV typing using the mini-sequencing reaction was performed for 13 CPV field isolates and the two vaccine strains available in our repository. All the isolates had been typed earlier by full-length sequencing of the VP2 gene. The typing results obtained from mini-sequencing matched completely with that of sequencing. Typing could be achieved with less than 100 copies of standard plasmid DNA constructs or ≤10¹ FAID₅₀ of virus by mini-sequencing technique. The technique was also efficient for detecting multiple types in mixed infections. Copyright © 2012 Elsevier B.V. All rights reserved.
Full Text Available Cymbidium ensifolium is a Chinese Cymbidium with an elegant shape, beautiful appearance, and a fragrant aroma. C. ensifolium has a long history of cultivation in China and it has excellent commercial value as a potted plant and cut flower. The development of C. ensifolium genomic resources has been delayed because of its large genome size. Taking advantage of technical and cost improvement of RNA-Seq, we extracted total mRNA from flower buds and mature flowers and obtained a total of 9.52 Gb of filtered nucleotides comprising 98,819,349 filtered reads. The filtered reads were assembled into 101,423 isotigs, representing 51,696 genes. Of the 101,423 isotigs, 41,873 were putative homologs of annotated sequences in the public databases, of which 158 were associated with floral development and 119 were associated with flowering. The isotigs were categorized according to their putative functions. In total, 10,212 of the isotigs were assigned into 25 eukaryotic orthologous groups (KOGs, 41,690 into 58 gene ontology (GO terms, and 9,830 into 126 Arabidopsis Kyoto Encyclopedia of Genes and Genomes (KEGG pathways, and 9,539 isotigs into 123 rice pathways. Comparison of the isotigs with those of the two related orchid species P. equestris and C. sinense showed that 17,906 isotigs are unique to C. ensifolium. In addition, a total of 7,936 SSRs and 16,676 putative SNPs were identified. To our knowledge, this transcriptome database is the first major genomic resource for C. ensifolium and the most comprehensive transcriptomic resource for genus Cymbidium. These sequences provide valuable information for understanding the molecular mechanisms of floral development and flowering. Sequences predicted to be unique to C. ensifolium would provide more insights into C. ensifolium gene diversity. The numerous SNPs and SSRs identified in the present study will contribute to marker development for C. ensifolium.
Travensolo,Regiane F.; Carareto-Alves,Lucia M.; Costa,Maria V.C.G.; Lopes,Tiago J.S.; Carrilho,Emanuel; Lemos,Eliana G.M.
Xylella fastidiosa genome sequencing has generated valuable data by identifying genes acting either on metabolic pathways or in associated pathogenicity and virulence. Based on available information on these genes, new strategies for studying their expression patterns, such as microarray technology, were employed. A total of 2,600 primer pairs were synthesized and then used to generate fragments using the PCR technique. The arrays were hybridized against cDNAs labeled during reverse transcrip...
Xu, Hong-Wu; Huang, Hai-Hua; Wei, Xiao-Long; Man, Kwan; Zhang, Guo-Jun; Huang, Yue-Jun; Xie, Ze-Yu; Lin, Lan; Guo, Yan-Chun; Zhuang, Ze-Rui; Lin, Xin-Peng; Zhou, Wen; Li, Mu
Evidence suggests that cytoglobin (Cygb) may function as a tumor suppressor gene. We immunohistochemically evaluated the expression of Cygb, phosphatidylinositol-3 kinase (PI-3K), phosphorylated (p)-Akt, Interleukin-6 (IL-6), tumor necrosis factor-α (TNFα) and vascular endothelial growth factor (VEGF) in 88 patients with 41 high-grade gliomas and 47 low-grade gliomas. Intratumoral microvessel density (IMD) was also determined and associated with clinicopathological factors. Low expression of Cygb was significantly associated with the higher histological grading and tumor recurrence. A significant negative correlation emerged between Cygb expression and PI3K, p-Akt, IL-6, TNFα or VEGF expression. Cygb expression was negatively correlated with IMD. There was a positive correlation between PI3K, p-Akt, IL-6, TNFα and VEGF expression with IMD.High histologic grade, tumor recurrence, decreased Cygb expression, increased PI3K expression, increased p-Akt expression and increased VEGF expression correlated with patients’ overall survival in univariate analysis. However, only histological grading and Cygb expression exhibited a relationship with survival of patients as independent prognostic factors of glioma by multivariate analysis. Cygb loss may contribute to tumor recurrence and a worse prognosis in gliomas. Cygb may serve as an independent predictive factor for prognosis of glioma patients
Gregg, Jennifer L; Brown, Kathleen E; Mintz, Eric M; Piontkivska, Helen; Fraizer, Gail C
The prostate gland represents a multifaceted system in which prostate epithelia and stroma have distinct physiological roles. To understand the interaction between stroma and glandular epithelia, it is essential to delineate the gene expression profiles of these two tissue types in prostate cancer. Most studies have compared tumor and normal samples by performing global expression analysis using a mixture of cell populations. This report presents the first study of prostate tumor tissue that examines patterns of differential expression between specific cell types using laser capture microdissection (LCM). LCM was used to isolate distinct cell-type populations and identify their gene expression differences using oligonucleotide microarrays. Ten differentially expressed genes were then analyzed in paired tumor and non-neoplastic prostate tissues by quantitative real-time PCR. Expression patterns of the transcription factors, WT1 and EGR1, were further compared in established prostate cell lines. WT1 protein expression was also examined in prostate tissue microarrays using immunohistochemistry. The two-step method of laser capture and microarray analysis identified nearly 500 genes whose expression levels were significantly different in prostate epithelial versus stromal tissues. Several genes expressed in epithelial cells (WT1, GATA2, and FGFR-3) were more highly expressed in neoplastic than in non-neoplastic tissues; conversely several genes expressed in stromal cells (CCL5, CXCL13, IGF-1, FGF-2, and IGFBP3) were more highly expressed in non-neoplastic than in neoplastic tissues. Notably, EGR1 was also differentially expressed between epithelial and stromal tissues. Expression of WT1 and EGR1 in cell lines was consistent with these patterns of differential expression. Importantly, WT1 protein expression was demonstrated in tumor tissues and was absent in normal and benign tissues. The prostate represents a complex mix of cell types and there is a need to analyze
Ling, Jing; Wu, Xiaoli; Fu, Ziyi; Tan, Jie; Xu, Qing
Our previous study showed that the expression of miR-197 in leiomyoma was down-regulated compared with myometrium. Further, miR-197 has been identified to affect uterine leiomyoma cell proliferation, apoptosis, and metastasis ability, though the responsible molecular mechanism has not been well elucidated. In this study, we sought to determine the expression patterns of miR-197 targeted genes and to explore their potential functions, participating Pathways and the networks that are involved in the biological behavior of human uterine leiomyoma. After transfection of human uterine leiomyoma cells with miR-197, we confirmed the expression level of miR-197 using quantitative real-time PCR (qRT-PCR), and we detected the gene expression profiles after miR-197 over-expression through DNA microarray analysis. Further, we performed GO and Pathway analysis. The dominantly dys-regulated genes, which were up- or down-regulated by more than 10-fold, compared with parental cells, were confirmed using qRT-PCR technology. Compared with the control group, miR-197 was up-regulated by 30-fold after miR-197 lentiviral transfection. The microarray data showed that 872 genes were dys-regulated by more than 2-fold in human uterine leiomyoma cells after miR-197 overexpression, including 537 up-regulated and 335 down-regulated genes. The GO analysis indicated that the dys-regulated genes were primarily involved in response to stimuli, multicellular organ processes, and the signaling of biological progression. Further, Pathway analysis data showed that these genes participated in regulating several signaling Pathways, including the JAK/STAT signaling Pathway, the Toll-like receptor signaling Pathway, and cytokine-cytokine receptor interaction. The qRT-PCR results confirmed that 17 of the 66 selected genes, which were up- or down-regulated more than 10-fold by miR-197, were consistent with the microarray results, including tumorigenesis-related genes, such as DRT7, SLC549, SFMBT2, FLJ37956
van Hal, N L; Vorst, O; van Houwelingen, A M; Kok, E J; Peijnenburg, A; Aharoni, A; van Tunen, A J; Keijer, J
DNA microarray technology is a new and powerful technology that will substantially increase the speed of molecular biological research. This paper gives a survey of DNA microarray technology and its use in gene expression studies. The technical aspects and their potential improvements are discussed. These comprise array manufacturing and design, array hybridisation, scanning, and data handling. Furthermore, it is discussed how DNA microarrays can be applied in the working fields of: safety, functionality and health of food and gene discovery and pathway engineering in plants.
Full Text Available Abstract Background Co-regulation of genes may imply involvement in similar biological processes or related function. Many clusters of co-regulated genes have been identified using microarray experiments. In this study, we examined co-regulated gene families using large-scale cDNA microarray experiments on the human transcriptome. Results We present a simple model, which, for each probe pair, distills expression changes into binary digits and summarizes the expression of multiple members of a gene family as the Family Regulation Ratio. The set of Family Regulation Ratios for each protein family across multiple experiments is called a Family Regulation Profile. We analyzed these Family Regulation Profiles using Pearson Correlation Coefficients and derived a network diagram portraying relationships between the Family Regulation Profiles of gene families that are well represented on the microarrays. Our strategy was cross-validated with two randomly chosen data subsets and was proven to be a reliable approach. Conclusion This work will help us to understand and identify the functional relationships between gene families and the regulatory pathways in which each family is involved. Concepts presented here may be useful for objective clustering of protein functions and deriving a comprehensive protein interaction map. Functional genomic approaches such as this may also be applicable to the elucidation of complex genetic regulatory networks.
Hill, Jonathon T; Demarest, Bradley; Gorsi, Bushra; Smith, Megan; Yost, H Joseph
During embryogenesis the heart forms as a linear tube that then undergoes multiple simultaneous morphogenetic events to obtain its mature shape. To understand the gene regulatory networks (GRNs) driving this phase of heart development, during which many congenital heart disease malformations likely arise, we conducted an RNA-seq timecourse in zebrafish from 30 hpf to 72 hpf and identified 5861 genes with altered expression. We clustered the genes by temporal expression pattern, identified transcription factor binding motifs enriched in each cluster, and generated a model GRN for the major gene batteries in heart morphogenesis. This approach predicted hundreds of regulatory interactions and found batteries enriched in specific cell and tissue types, indicating that the approach can be used to narrow the search for novel genetic markers and regulatory interactions. Subsequent analyses confirmed the GRN using two mutants, Tbx5 and nkx2-5 , and identified sets of duplicated zebrafish genes that do not show temporal subfunctionalization. This dataset provides an essential resource for future studies on the genetic/epigenetic pathways implicated in congenital heart defects and the mechanisms of cardiac transcriptional regulation. © 2017. Published by The Company of Biologists Ltd.
Qiao, Weiqiang; Liu, Heyang; Liu, Ruidong; Liu, Qipeng; Zhang, Ting; Guo, Wanying; Li, Peng; Deng, Miao
There are conflicting reports about the role of histone deacetylase 1 (HDAC1) in breast cancer prognosis. Here, we conducted a meta-analysis to investigate the prognostic significance of HDAC1 in breast cancer. We searched different databases to identify studies evaluating the association between HDAC1 expression and its prognostic value in breast cancer. The pooled hazard ratios (HRs) and odds radios (ORs) with 95% confidence intervals (95% CIs) were calculated from these studies to assess specific correlation. Our meta-analysis of four databases identified 7 eligible studies with 1429 total patients. We found that HDAC1 over-expression did not correlate with disease-free survival (DFS) and overall survival (OS) in breast cancer. Subgroup analysis indicated an association between up-regulated HDAC1 expression and better OS (HR = 0.47, 95% CI: 0.23-0.97; P = 0.04) in Asian breast cancer patients. However, false-positive report probability (FPRP) analysis and trial sequential analysis (TSA) indicated that the results need further validation. Furthermore, HDAC1 over-expression was associated with positive estrogen receptor (ER) expression (OR, 3.30; 95% CI, 1.11-9.83; P = 0.03) and negative human epidermal growth factor receptor 2 (HER2) expression (OR, 1.79; 95% CI, 1.22-2.61; P = 0.003), but there were no significant differences between patients based on age, tumor size, lymph node metastasis, nuclear grade, or progesterone receptor (PR) expression. Overall, our meta-analysis demonstrated an association between increased HDAC1 expression and better OS in Asian breast cancer patients. In addition, HDAC1 over-expression correlated with positive ER and negative HER2 expression in breast cancer. However, researches in large patients' randomised controlled trials (RCTs) are needed to confirm the results. Copyright © 2018 Elsevier B.V. All rights reserved.
Full Text Available Abstract Background Serial Analysis of Gene Expression (SAGE is a new technique that allows a detailed and profound quantitative and qualitative knowledge of gene expression profile, without previous knowledge of sequence of analyzed genes. We carried out a modification of SAGE methodology (microSAGE, useful for the analysis of limited quantities of tissue samples, on normal human cervical tissue obtained from a donor without histopathological lesions. Cervical epithelium is constituted mainly by cervical keratinocytes which are the targets of human papilloma virus (HPV, where persistent HPV infection of cervical epithelium is associated with an increase risk for developing cervical carcinomas (CC. Results We report here a transcriptome analysis of cervical tissue by SAGE, derived from 30,418 sequenced tags that provide a wealth of information about the gene products involved in normal cervical epithelium physiology, as well as genes not previously found in uterine cervix tissue involved in the process of epidermal differentiation. Conclusion This first comprehensive and profound analysis of uterine cervix transcriptome, should be useful for the identification of genes involved in normal cervix uterine function, and candidate genes associated with cervical carcinoma.
Fan, Xing [Capital Medical University, Department of Neurosurgery, Beijing Tiantan Hospital, Beijing (China); Wang, Yinyan [Capital Medical University, Department of Neurosurgery, Beijing Tiantan Hospital, Beijing (China); Capital Medical University, Department of Neuropathology, Beijing Neurosurgical Institute, Beijing (China); Wang, Kai; Ma, Jun; Li, Shaowu [Capital Medical University, Department of Neuroradiology, Beijing Tiantan Hospital, Beijing (China); Liu, Shuai [Chinese Academy of Medical Sciences and Peking Union Medical College, Departments of Neurosurgery, Peking Union Medical College Hospital, Beijing (China); Liu, Yong [Chinese Academy of Sciences, Brainnetome Center, Institute of Automation, Beijing (China); Jiang, Tao [Capital Medical University, Department of Neurosurgery, Beijing Tiantan Hospital, Beijing (China); Beijing Academy of Critical Illness in Brain, Department of Clinical Oncology, Beijing (China)
The expression of vascular endothelial growth factor (VEGF) is a common genetic alteration in malignant gliomas and contributes to the angiogenesis of tumors. This study aimed to investigate the anatomical specificity of VEGF expression levels in glioblastomas using voxel-based neuroimaging analysis. Clinical information, MR scans, and immunohistochemistry stains of 209 patients with glioblastomas were reviewed. All tumor lesions were segmented manually and subsequently registered to standard brain space. Voxel-based regression analysis was performed to correlate the brain regions of tumor involvement with the level of VEGF expression. Brain regions identified as significantly associated with high or low VEGF expression were preserved following permutation correction. High VEGF expression was detected in 123 (58.9 %) of the 209 patients. Voxel-based statistical analysis demonstrated that high VEGF expression was more likely in tumors located in the left frontal lobe and the right caudate and low VEGF expression was more likely in tumors that occurred in the posterior region of the right lateral ventricle. Voxel-based neuroimaging analysis revealed the anatomic specificity of VEGF expression in glioblastoma, which may further our understanding of genetic heterogeneity during tumor origination. This finding provides primary theoretical support for potential future application of customized antiangiogenic therapy. (orig.)
Fan, Xing; Wang, Yinyan; Wang, Kai; Ma, Jun; Li, Shaowu; Liu, Shuai; Liu, Yong; Jiang, Tao
The expression of vascular endothelial growth factor (VEGF) is a common genetic alteration in malignant gliomas and contributes to the angiogenesis of tumors. This study aimed to investigate the anatomical specificity of VEGF expression levels in glioblastomas using voxel-based neuroimaging analysis. Clinical information, MR scans, and immunohistochemistry stains of 209 patients with glioblastomas were reviewed. All tumor lesions were segmented manually and subsequently registered to standard brain space. Voxel-based regression analysis was performed to correlate the brain regions of tumor involvement with the level of VEGF expression. Brain regions identified as significantly associated with high or low VEGF expression were preserved following permutation correction. High VEGF expression was detected in 123 (58.9 %) of the 209 patients. Voxel-based statistical analysis demonstrated that high VEGF expression was more likely in tumors located in the left frontal lobe and the right caudate and low VEGF expression was more likely in tumors that occurred in the posterior region of the right lateral ventricle. Voxel-based neuroimaging analysis revealed the anatomic specificity of VEGF expression in glioblastoma, which may further our understanding of genetic heterogeneity during tumor origination. This finding provides primary theoretical support for potential future application of customized antiangiogenic therapy. (orig.)
Zhang, Chongfu; Qiu, Kun; Ma, Chunli
In this paper, we utilize a new study method that is under independent case of multiple optical orthogonal codes to derive the probability function of MOOCS-OPS networks, discuss the performance characteristics for a variety of parameters, and compare some characteristics of the system employed by single optical orthogonal code or multiple optical orthogonal codes sequences-based optical labels. The performance of the system is also calculated, and our results verify that the method is effective. Additionally it is found that performance of MOOCS-OPS networks would, negatively, be worsened, compared with single optical orthogonal code-based optical label for optical packet switching (SOOC-OPS); however, MOOCS-OPS networks can greatly enlarge the scalability of optical packet switching networks.
Wu, Hongle; Kato, Takafumi; Yamada, Tomomi; Numao, Masayuki; Fukui, Ken-Ichi
We propose a method to discover sleep patterns via clustering of sound events recorded during sleep. The proposed method extends the conventional self-organizing map algorithm by kernelization and sequence-based technologies to obtain a fine-grained map that visualizes the distribution and changes of sleep-related events. We introduced features widely applied in sound processing and popular kernel functions to the proposed method to evaluate and compare performance. The proposed method provides a new aspect of sleep monitoring because the results demonstrate that sound events can be directly correlated to an individual's sleep patterns. In addition, by visualizing the transition of cluster dynamics, sleep-related sound events were found to relate to the various stages of sleep. Therefore, these results empirically warrant future study into the assessment of personal sleep quality using sound data. Copyright © 2017 Elsevier B.V. All rights reserved.
Quero, Sara; García-Núñez, Marian; Párraga-Niño, Noemí; Barrabeig, Irene; Pedro-Botet, Maria L; de Simon, Mercè; Sopena, Nieves; Sabrià, Miquel
To compare the discriminatory power of pulsed-field gel electrophoresis (PFGE) and sequence-based typing (SBT) in Legionella outbreaks for determining the infection source. Twenty-five investigations of Legionnaires' disease were analyzed by PFGE, SBT and Dresden monoclonal antibody. The results suggested that monoclonal antibody could reduce the number of Legionella isolates to be characterized by molecular methods. The epidemiological concordance PFGE-SBT was 100%, while the molecular concordance was 64%. Adjusted Wallace index (AW) showed that PFGE has better discriminatory power than SBT (AWSBT→PFGE = 0.767; AWPFGE→SBT = 1). The discrepancies appeared mostly in sequence type (ST) 1, a worldwide distributed ST for which PFGE discriminated different profiles. SBT discriminatory power was not sufficient verifying the infection source, especially in worldwide distributed STs, which were classified into different PFGE patterns.
Kelmansky, Diana M; Martínez, Elena J; Leiva, Víctor
In this paper, we introduce a new family of power transformations, which has the generalized logarithm as one of its members, in the same manner as the usual logarithm belongs to the family of Box-Cox power transformations. Although the new family has been developed for analyzing gene expression data, it allows a wider scope of mean-variance related data to be reached. We study the analytical properties of the new family of transformations, as well as the mean-variance relationships that are stabilized by using its members. We propose a methodology based on this new family, which includes a simple strategy for selecting the family member adequate for a data set. We evaluate the finite sample behavior of different classical and robust estimators based on this strategy by Monte Carlo simulations. We analyze real genomic data by using the proposed transformation to empirically show how the new methodology allows the variance of these data to be stabilized.
Minocherhomji, Sheroy; Seemann, Stefan; Mang, Yuan
/or overlap disease-associated loci, including the DLGAP4 locus. In this study, we sequenced ~99% of all three unfinished gaps on human chr 20, determined their complete genomic sizes and assessed epigenetic profiles using a combination of Sanger sequencing, mate pair paired-end high-throughput sequencing......The finished human genome-assemblies comprise several hundred un-sequenced euchromatic gaps, which may be rich in long polypurine/polypyrimidine stretches. Human chromosome 20 (chr 20) currently has three unfinished gaps remaining on its q-arm. All three gaps are within gene-dense regions and...... and chromatin, methylation and expression analyses. We found histone 3 trimethylated at Lysine 27 to be distributed across all three gaps in immortalized B-lymphocytes. In one gap, five novel CpG islands were predominantly hypermethylated in genomic DNA from peripheral blood lymphocytes and human cerebellum...
Ichikawa, Manabu; Okamura-Oho, Yuko; Shimokawa, Kazuro; Kondo, Shinji; Nakamura, Sakiko; Yokota, Hideo; Himeno, Ryutaro; Lesch, Klaus-Peter; Hayashizaki, Yoshihide
Inactivation of serotonin transporter (HTT) by pharmacologically in the neonate or genetically increases risk for depression in adulthood, whereas pharmacological inhibition of HTT ameliorates symptoms in depressed patients. The differing role of HTT function during early development and in adult brain plasticity in causing or reversing depression remains an unexplained paradox. To address this we profiled the gene expression of adult Htt knockout (Htt KO) mice and HTT inhibitor-treated mice. Inverted profile changes between the two experimental conditions were seen in 30 genes. Consistent results of the upstream regulatory element search and the co-localization search of these genes indicated that the regulation may be executed by Pax5, Pax7 and Gata3, known to be involved in the survival, proliferation, and migration of serotonergic neurons in the developing brain, and these factors are supposed to keep functioning to regulate downstream genes related to serotonin system in the adult brain
Full Text Available White mold, caused by the necrotrophic fungus (Lib. de Bary, is a major disease of common bean ( L.. WM7.1 and WM8.3 are two quantitative trait loci (QTL with major effects on tolerance to the pathogen. Advanced backcross populations segregating individually for either of the two QTL, and a recombinant inbred (RI population segregating for both QTL were used to fine map and confirm the genetic location of the QTL. The QTL intervals were physically mapped using the reference common bean genome sequence, and the physical intervals for each QTL were further confirmed by sequence-based introgression mapping. Using whole-genome sequence data from susceptible and tolerant DNA pools, introgressed regions were identified as those with significantly higher numbers of single-nucleotide polymorphisms (SNPs relative to the whole genome. By combining the QTL and SNP data, WM7.1 was located to a 660-kb region that contained 41 gene models on the proximal end of chromosome Pv07, while the WM8.3 introgression was narrowed to a 1.36-Mb region containing 70 gene models. The most polymorphic candidate gene in the WM7.1 region encodes a BEACH-domain protein associated with apoptosis. Within the WM8.3 interval, a receptor-like protein with the potential to recognize pathogen effectors was the most polymorphic gene. The use of gene and sequence-based mapping identified two candidate genes whose putative functions are consistent with the current model of pathogenicity.
Kaya, Koray D; Karakülah, Gökhan; Yakicier, Cengiz M; Acar, Aybar C; Konu, Ozlen
Waaijenborg, S.; Zwinderman, A.H.
ABSTRACT: BACKGROUND: We generalized penalized canonical correlation analysis for analyzing microarray gene-expression measurements for checking completeness of known metabolic pathways and identifying candidate genes for incorporation in the pathway. We used Wold's method for calculation of the
Crampton, Michael C
Full Text Available This presentation focused on the transcriptional analysis of heterologous gene expression using the endogenous sD promoter from Bacillus halodurans. It concludes to a successful implementation of a high throughput mRNA sandwich hybridisation...
sequencing gene differential expression analysis (Chen et al. ... DNase digestion (Takara, Shiga, Japan), so that any remain- ..... ing the early moments of pollen germination (Guyon et al. 2000). The steady-state transcript level of PGPS/D3 ...
Wu, Hong; Zheng, Xiaohong; Araki, Yoshio; Sahara, Hiroshi; Takagi, Hiroshi; Shimoi, Hitoshi
During the brewing of Japanese sake, Saccharomyces cerevisiae cells produce a high concentration of ethanol compared with other ethanol fermentation methods. We analyzed the gene expression profiles of yeast cells during sake brewing using DNA microarray analysis. This analysis revealed some characteristics of yeast gene expression during sake brewing and provided a scaffold for a molecular level understanding of the sake brewing process. PMID:16997994
Background The study and analysis of gene expression measurements is the primary focus of functional genomics. Once expression data is available, biologists are faced with the task of extracting (new) knowledge associated to the underlying biological phenomenon. Most often, in order to perform this task, biologists execute a number of analysis activities on the available gene expression dataset rather than a single analysis activity. The integration of heteregeneous tools and data sources to create an integrated analysis environment represents a challenging and error-prone task. Semantic integration enables the assignment of unambiguous meanings to data shared among different applications in an integrated environment, allowing the exchange of data in a semantically consistent and meaningful way. This work aims at developing an ontology-based methodology for the semantic integration of gene expression analysis tools and data sources. The proposed methodology relies on software connectors to support not only the access to heterogeneous data sources but also the definition of transformation rules on exchanged data. Results We have studied the different challenges involved in the integration of computer systems and the role software connectors play in this task. We have also studied a number of gene expression technologies, analysis tools and related ontologies in order to devise basic integration scenarios and propose a reference ontology for the gene expression domain. Then, we have defined a number of activities and associated guidelines to prescribe how the development of connectors should be carried out. Finally, we have applied the proposed methodology in the construction of three different integration scenarios involving the use of different tools for the analysis of different types of gene expression data. Conclusions The proposed methodology facilitates the development of connectors capable of semantically integrating different gene expression analysis tools
Gustavo Coelho Correa
Full Text Available Cysteine proteases are peptidyl hydrolyses dependent on a cysteine residue at the active center. The physical and chemical properties of cysteine proteases have been extensively characterized, but their precise biological functions have not yet been completely understood, although it is known that they are involved in a number of events such as protein turnover, cancer, germination, programmed cell death and senescence. Protein sequences from different cysteine proteinases, classified as members of the E.C.3.4.22 sub-sub-class, were used to perform a T-BLAST-n search on the Brazilian Sugarcane Expressed Sequence Tags project (SUCEST data bank. Sequence homology was found with 76 cluster sequences that corresponded to possible cysteine proteinases. The alignments of these SUCEST clusters with the sequence of cysteine proteinases of known origins provided important information about the classification and possible function of these sugarcane enzymes. Inferences about the expression pattern of each gene were made by direct correlation with the SUCEST cDNA libraries from which each cluster was derived. Since no previous reports of sugarcane cysteine proteinases genes exists, this study represents a first step in the study of new biochemical, physiological and biotechnological aspects of sugarcane cysteine proteases.Proteinases cisteínicas são peptidil-hidrolases dependentes de um resíduo de cisteína em seu sítio ativo. As propriedades físico-químicas destas proteinases têm sido amplamente caracterizadas, entretanto suas funções biológicas ainda não foram completamente elucidadas. Elas estão envolvidas em um grande número de eventos, tais como: processamento e degradação protéica, câncer, germinação, morte celular programada e processos de senescência. Diferentes proteinases cisteínicas, classificadas pelo Comitê de Nomenclatura da União Internacional de Bioquímica e Biologia Molecular (IUBMB como pertencentes à sub
Weng, Li; Rubin, Edward M.; Bristow, James
Ecologists studying microbial life in the environment have recognized the enormous complexity of microbial diversity for more than a decade (Whitman et al. 1998). The development of a variety of culture-independent methods, many of them coupled with high-throughput DNA sequencing, has allowed this diversity to be explored in ever greater detail (Handelsman 2004; Harris et al. 2004; Hugenholtz et al. 1998; Moreira and Lopez-Garcia 2002; Rappe and Giovannoni 2003). Despite the widespread application of these new techniques to the characterization of uncultivated microbes and microbial communities in the environment, their application to human health and disease has lagged behind. Because these techniques now allow not only cataloging of microbial diversity, but also insight into microbial functions, it is time for clinical microbiologists to apply these tools to the microbial communities that abound on and within us, in what has been aptly called ''the second Human Genome Project'' (Relman and Falkow 2001). In this review we will discuss the sequence-based methods for microbial analysis that are currently available and their application to identify novel human pathogens, improve diagnosis and treatment of known infectious diseases, and finally to advance understanding of our relationship with microbial communities that normally reside in and on the human body.
Hybrid promoters are created by shuffling of DNA fragments while keeping intact regulatory regions crucial of promoter activity. Two fragments of alcohol dehydrogenase (Adh) promoter from Zea mays were selected to generate hybrid promoter. Sequence analysis of both alcohol dehydrogenase promoter fragments through ...
Dec 9, 2013 ... Previous research has demonstrated that auxin induces and regulates the .... three main-stem nodes stage seedlings were prepared for the. IAA treatment ... For semi- quantitative RT-PCR analysis, first-strand cDNA was syn-.
Figure 1. Phylogenetic relation of apple ARF genes. The phylogenetic tree was constructed based on a complete protein sequence align- ment of MdARFs by the neighbour-joining method with bootstrapping analysis (1000 replicates). The scale bar represents 0.05 amino acid substitutions per site. Paralogous gene pairs ...
CCAAT/enhancer-binding protein beta as an essential transcriptional factor, regulates the differentiation of adipocytes and the deposition of fat. Herein, we cloned the whole open reading frame (ORF) of bovine C/EBPβ gene and analyzed its putative protein structures via DNA cloning and sequence analysis. Then, the ...
Jul 24, 2013 ... Key words: ABC transporter, potato, pleiotropic drug resistance (PDR), RNA-seq. INTRODUCTION ..... of relative transcript accumulation of each of 55 PDR genes as determined by RNA-seq analysis are presented as a heatmap, with ... specificities provide clues to the endogenous function of the individual ...
The plant pleiotropic drug resistance (PDR) family of ATP-binding cassette (ABC) transporters has comprehensively been researched in relation to transport of antifungal agents and resistant pathogens. In our study, analyses of the whole family of PDR genes present in the potato genome were provided. This analysis ...
Oct 19, 2011 ... conducted a molecular cloning and functional analysis to study a specific silkworm gene BmICAD related to apoptosis. .... blocking with 5% non-fat milk for 1 h at room temperature, the .... requirements for all next experiments.
Wu, Liang; Zhang, Xiaolong; Zhao, Zhikun; Wang, Ling; Li, Bo; Li, Guibo; Dean, Michael; Yu, Qichao; Wang, Yanhui; Lin, Xinxin; Rao, Weijian; Mei, Zhanlong; Li, Yang; Jiang, Runze; Yang, Huan; Li, Fuqiang; Xie, Guoyun; Xu, Liqin; Wu, Kui; Zhang, Jie; Chen, Jianghao; Wang, Ting; Kristiansen, Karsten; Zhang, Xiuqing; Li, Yingrui; Yang, Huanming; Wang, Jian; Hou, Yong; Xu, Xun
Viral infection causes multiple forms of human cancer, and HPV infection is the primary factor in cervical carcinomas. Recent single-cell RNA-seq studies highlight the tumor heterogeneity present in most cancers, but virally induced tumors have not been studied. HeLa is a well characterized HPV+ cervical cancer cell line. We developed a new high throughput platform to prepare single-cell RNA on a nanoliter scale based on a customized microwell chip. Using this method, we successfully amplified full-length transcripts of 669 single HeLa S3 cells and 40 of them were randomly selected to perform single-cell RNA sequencing. Based on these data, we obtained a comprehensive understanding of the heterogeneity of HeLa S3 cells in gene expression, alternative splicing and fusions. Furthermore, we identified a high diversity of HPV-18 expression and splicing at the single-cell level. By co-expression analysis we identified 283 E6, E7 co-regulated genes, including CDC25, PCNA, PLK4, BUB1B and IRF1 known to interact with HPV viral proteins. Our results reveal the heterogeneity of a virus-infected cell line. It not only provides a transcriptome characterization of HeLa S3 cells at the single cell level, but is a demonstration of the power of single cell RNA-seq analysis of virally infected cells and cancers.
Full Text Available Abstract Background High-throughput protein structure analysis of individual protein domains requires analysis of large numbers of expression clones to identify suitable constructs for structure determination. For this purpose, methods need to be implemented for fast and reliable screening of the expressed proteins as early as possible in the overall process from cloning to structure determination. Results 88 different E. coli expression constructs for 17 human protein domains were analysed using high-throughput cloning, purification and folding analysis to obtain candidates suitable for structural analysis. After 96 deep-well microplate expression and automated protein purification, protein domains were directly analysed using 1D 1H-NMR spectroscopy. In addition, analytical hydrophobic interaction chromatography (HIC was used to detect natively folded protein. With these two analytical methods, six constructs (representing two domains were quickly identified as being well folded and suitable for structural analysis. Conclusion The described approach facilitates high-throughput structural analysis. Clones expressing natively folded proteins suitable for NMR structure determination were quickly identified upon small scale expression screening using 1D 1H-NMR and/or analytical HIC. This procedure is especially effective as a fast and inexpensive screen for the 'low hanging fruits' in structural genomics.
Bonnefont, J.P.; Cepanec, C.; Leroux, J.P. [Unite INSERM, Paris (France)] [and others
Carnitine palmitoyltransferase (CPT) II deficiency, an inherited disorder of mitochondrial long-chain fatty-acid (LCFA) oxidation, results in two distinct clinical act phenotypes, namely, an adult (muscular) form and an infantile (hepatocardiomuscular) form. The rationale of this phenotypic heterogeneity is poorly understood. The adult form of the disease is commonly ascribed to the Ser-113-Leu substitution in CPT II. Only few data are available regarding the molecular basis of the infantile form of the disease. We report herein a homozygous A-2399-C transversion predicting a Tyr-628-Ser substitution in a CPT II-deficient infant. In vitro expression of mutant cDNA in COS-1 cells demonstrated the responsibility of this mutation for the disease. Metabolic consequences of the Ser-113-Leu and Tyr-628-Ser substitutions were studied in fibroblasts. The Tyr-628-Ser substitution (infantile form) resulted in a 10% CPT II residual activity, markedly impairing LCFA oxidation, whereas the Ser-113-Leu substitution (adult form) resulted in a 20% CPT II residual activity, without consequence on LCFA oxidation. These data show that CPT II activity has to be reduced below a critical threshold in order for LCFA oxidation in fibroblasts to be impaired. The hypothesis that this critical threshold differs among tissues could provide a basis to explain phenotypic heterogeneity of CPT II deficiency. 32 refs., 5 figs.
Full Text Available Botrytis cinerea is a filamentous plant pathogen of a wide range of plant species, and its infection may cause enormous damage both during plant growth and in the post-harvest phase. We have constructed a cDNA library from an isolate of B. cinerea and have sequenced 11,482 expressed sequence tags that were assembled into 1,003 contigs sequences and 3,032 singletons. Approximately 81% of the unigenes showed significant similarity to genes coding for proteins with known functions: more than 50% of the sequences code for genes involved in cellular metabolism, 12% for transport of metabolites, and approximately 10% for cellular organization. Other functional categories include responses to biotic and abiotic stimuli, cell communication, cell homeostasis, and cell development. We carried out pair-wise comparisons with fungal databases to determine the B. cinerea unisequence set with relevant similarity to genes in other fungal pathogenic counterparts. Among the 4,035 non-redundant B. cinerea unigenes, 1,338 (23% have significant homology with Fusarium verticillioides unigenes. Similar values were obtained for Saccharomyces cerevisiae and Aspergillus nidulans (22% and 24%, respectively. The lower percentages of homology were with Magnaporthe grisae and Neurospora crassa (13% and 19%, respectively. Several genes involved in putative and known fungal virulence and general pathogenicity were identified. The results provide important information for future research on this fungal pathogen
Galetzka, D; Weis, E; Rittner, G; Schindler, D; Haaf, T
Fanconi anemia (FA) cells are generally hypersensitive to DNA cross-linking agents, implying that mutations in the different FANC genes cause a similar DNA repair defect(s). By using a customized cDNA microarray chip for DNA repair- and cell cycle-associated genes, we identified three genes, cathepsin B (CTSB), glutaredoxin (GLRX), and polo-like kinase 2 (PLK2), that were misregulated in untreated primary fibroblasts from three unrelated FA-D2 patients, compared to six controls. Quantitative real-time RT PCR was used to validate these results and to study possible molecular links between FA-D2 and other FA subtypes. GLRX was misregulated to opposite directions in a variety of different FA subtypes. Increased CTSB and decreased PLK2 expression was found in all or almost all of the analyzed complementation groups and, therefore, may be related to the defective FA pathway. Transcriptional upregulation of the CTSB proteinase appears to be a secondary phenomenon due to proliferation differences between FA and normal fibroblast cultures. In contrast, PLK2 is known to play a pivotal role in processes that are linked to FA defects and may contribute in multiple ways to the FA phenotype: PLK2 is a target gene for TP53, is likely to function as a tumor suppressor gene in hematologic neoplasia, and Plk2(-/-) mice are small because of defective embryonal development. (c) 2008 S. Karger AG, Basel.
Jen, C. H.; Manfield, I. W.; Michalopoulos, D. W.
be examined using the novel clique finder tool to determine the sets of genes most likely to be regulated in a similar manner. In combination, these tools offer three levels of analysis: creation of correlation lists of co-expressed genes, refinement of these lists using two-dimensional scatter plots......We present a new WWW-based tool for plant gene analysis, the Arabidopsis Co-Expression Tool (act) , based on a large Arabidopsis thaliana microarray data set obtained from the Nottingham Arabidopsis Stock Centre. The co-expression analysis tool allows users to identify genes whose expression...
Church, Philip C; Goscinski, Andrzej; Lefèvre, Christophe
Microarrays and more recently RNA sequencing has led to an increase in available gene expression data. How to manage and store this data is becoming a key issue. In response we have developed EXP-PAC, a web based software package for storage, management and analysis of gene expression and sequence data. Unique to this package is SQL based querying of gene expression data sets, distributed normalization of raw gene expression data and analysis of gene expression data across experiments and species. This package has been populated with lactation data in the international milk genomic consortium web portal (http://milkgenomics.org/). Source code is also available which can be hosted on a Windows, Linux or Mac APACHE server connected to a private or public network (http://mamsap.it.deakin.edu.au/~pcc/Release/EXP_PAC.html). Copyright © 2012 Elsevier Inc. All rights reserved.
Full Text Available Abstract In spite of only a 1-2 per cent genomic DNA sequence difference, humans and chimpanzees differ considerably in behaviour and cognition. Affymetrix microarray technology provides a novel approach to addressing a long-term debate on whether the difference between humans and chimpanzees results from the alteration of gene expressions. Here, we used several statistical methods (distance method, two-sample t-tests, regularised t-tests, ANOVA and bootstrapping to detect the differential expression pattern between humans and great apes. Our analysis shows that the pattern we observed before is robust against various statistical methods; that is, the pronounced expression changes occurred on the human lineage after the split from chimpanzees, and that the dramatic brain expression alterations in humans may be mainly driven by a set of genes with increased expression (up-regulated rather than decreased expression (down-regulated.
Superoxide dismutases (SODs) play an important role in stress-tolerance in plants. In this study, for the first time, a full-length cDNA sequence of MnSOD gene, termed as Sc-MnSOD (GenBank accession number: GQ246460), was obtained in sugarcane. Sequence analysis revealed that Sc-MnSOD gene was 919 bp long, ...
Joshua C Kwekel
Full Text Available Age is a predisposing condition for susceptibility to chronic kidney disease and progression as well as acute kidney injury that may arise due to the adverse effects of some drugs. Age-related differences in kidney biology, therefore, are a key concern in understanding drug safety and disease progression. We hypothesize that the underlying suite of genes expressed in the kidney at various life cycle stages will impact susceptibility to adverse drug reactions. Therefore, establishing changes in baseline expression data between these life stages is the first and necessary step in evaluating this hypothesis. Untreated male F344 rats were sacrificed at 2, 5, 6, 8, 15, 21, 78, and 104 weeks of age. Kidneys were collected for histology and gene expression analysis. Agilent whole-genome rat microarrays were used to query global expression profiles. An ANOVA (p1.5 in relative mRNA expression, was used to identify 3,724 unique differentially expressed genes (DEGs. Principal component analyses of these DEGs revealed three major divisions in life-cycle renal gene expression. K-means cluster analysis identified several groups of genes that shared age-specific patterns of expression. Pathway analysis of these gene groups revealed age-specific gene networks and functions related to renal function and aging, including extracellular matrix turnover, immune cell response, and renal tubular injury. Large age-related changes in expression were also demonstrated for the genes that code for qualified renal injury biomarkers KIM-1, Clu, and Tff3. These results suggest specific groups of genes that may underlie age-specific susceptibilities to adverse drug reactions and disease. This analysis of the basal gene expression patterns of renal genes throughout the life cycle of the rat will improve the use of current and future renal biomarkers and inform our assessments of kidney injury and disease.
Iqbal, N.; Khatoon, A.; Asif, M.; Bashir, A.
Cotton fibers are unicellular seed trichomes and the largest known plant cells. Fiber morphogenesis in cotton is a complex process involving a large number of genes expressed throughout fiber development process. The expression profiling of five gene families in various cotton tissues was carried out through real time PCR. Expression analysis revealed that transcripts of expansin, tubulin and E6 were elevated from 5 to 20 days post anthesis (DPA) fibers. Three Lipid transfer proteins (LTPs) including LTP1, LTP3, LTP7 exhibited highest expression in 10 - 20 DPA fibers. Transcripts of LTP3 were detected in fibers and non fiber tissues that of LTP7 were almost negligible in non fiber tissues. Sucrose phosphate synthase gene showed highest expression in 10 DPA fibers while sucrose synthse (susy) expressed at higher rate in 5-20 DPA fibers as well as roots. The results reveal that most of fiber related genes showed high expression in 5-20 DPA fibers. Comprehensive expression study may help to determine tissue and stage specificity of genes under study. The study may also help to explore complex process of fiber development and understand the role of these genes in fiber development process. Highly expressed genes in fibers may be transformed in cotton for improvement of fiber quality traits. Genes that were expressed specifically in fibers or other tissues could be used for isolation of upstream regulatory sequences. (author)
Full Text Available Large numbers of quantitative trait loci (QTL affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.
Large numbers of quantitative trait loci (QTL) affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs) for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.
Ma, W; Zhang, T-F; Lu, P; Lu, S H
Breast cancer is categorized into two broad groups: estrogen receptor positive (ER+) and ER negative (ER-) groups. Previous study proposed that under trastuzumab-based neoadjuvant chemotherapy, tumor initiating cell (TIC) featured ER- tumors response better than ER+ tumors. Exploration of the molecular difference of these two groups may help developing new therapeutic strategies, especially for ER- patients. With gene expression profile from the Gene Expression Omnibus (GEO) database, we performed partial least squares (PLS) based analysis, which is more sensitive than common variance/regression analysis. We acquired 512 differentially expressed genes. Four pathways were found to be enriched with differentially expressed genes, involving immune system, metabolism and genetic information processing process. Network analysis identified five hub genes with degrees higher than 10, including APP, ESR1, SMAD3, HDAC2, and PRKAA1. Our findings provide new understanding for the molecular difference between TIC featured ER- and ER+ breast tumors with the hope offer supports for therapeutic studies.
Jespersen, Jakob S.; Petersen, Bent; Seguin-Orlando, Andaine
To prevent the spread of resistance among gastro-intestinal nematode populations, the use of bioactive tannin-rich plants is currently investigated as an alternative to the exclusive use of anthelmintic (AH) synthetic drugs. Studies of AH effects on cattle nematodes using tannin-rich legumes...
Full Text Available Wheat seed development is an important physiological process of seed maturation and directly affects wheat yield and quality. In this study, we performed dynamic transcriptome microarray analysis of an elite Chinese bread wheat cultivar (Jimai 20 during grain development using the GeneChip Wheat Genome Array. Grain morphology and scanning electron microscope observations showed that the period of 11–15 days post-anthesis (DPA was a key stage for the synthesis and accumulation of seed starch. Genome-wide transcriptional profiling and significance analysis of microarrays revealed that the period from 11 to 15 DPA was more important than the 15–20 DPA stage for the synthesis and accumulation of nutritive reserves. Series test of cluster analysis of differential genes revealed five statistically significant gene expression profiles. Gene ontology annotation and enrichment analysis gave further information about differentially expressed genes, and MapMan analysis revealed expression changes within functional groups during seed development. Metabolic pathway network analysis showed that major and minor metabolic pathways regulate one another to ensure regular seed development and nutritive reserve accumulation. We performed gene co-expression network analysis to identify genes that play vital roles in seed development and identified several key genes involved in important metabolic pathways. The transcriptional expression of eight key genes involved in starch and protein synthesis and stress defense was further validated by qRT-PCR. Our results provide new insight into the molecular mechanisms of wheat seed development and the determinants of yield and quality.
Guardia, Gabriela D A; Pires, Luís Ferreira; Vêncio, Ricardo Z N; Malmegrim, Kelen C R; de Farias, Cléver R G
Gene expression studies are generally performed through multi-step analysis processes, which require the integrated use of a number of analysis tools. In order to facilitate tool/data integration, an increasing number of analysis tools have been developed as or adapted to semantic web services. In recent years, some approaches have been defined for the development and semantic annotation of web services created from legacy software tools, but these approaches still present many limitations. In addition, to the best of our knowledge, no suitable approach has been defined for the functional genomics domain. Therefore, this paper aims at defining an integrated methodology for the implementation of RESTful semantic web services created from gene expression analysis tools and the semantic annotation of such services. We have applied our methodology to the development of a number of services to support the analysis of different types of gene expression data, including microarray and RNASeq. All developed services are publicly available in the Gene Expression Analysis Services (GEAS) Repository at http://dcm.ffclrp.usp.br/lssb/geas. Additionally, we have used a number of the developed services to create different integrated analysis scenarios to reproduce parts of two gene expression studies documented in the literature. The first study involves the analysis of one-color microarray data obtained from multiple sclerosis patients and healthy donors. The second study comprises the analysis of RNA-Seq data obtained from melanoma cells to investigate the role of the remodeller BRG1 in the proliferation and morphology of these cells. Our methodology provides concrete guidelines and technical details in order to facilitate the systematic development of semantic web services. Moreover, it encourages the development and reuse of these services for the creation of semantically integrated solutions for gene expression analysis.
Gabriela D A Guardia
Full Text Available Gene expression studies are generally performed through multi-step analysis processes, which require the integrated use of a number of analysis tools. In order to facilitate tool/data integration, an increasing number of analysis tools have been developed as or adapted to semantic web services. In recent years, some approaches have been defined for the development and semantic annotation of web services created from legacy software tools, but these approaches still present many limitations. In addition, to the best of our knowledge, no suitable approach has been defined for the functional genomics domain. Therefore, this paper aims at defining an integrated methodology for the implementation of RESTful semantic web services created from gene expression analysis tools and the semantic annotation of such services. We have applied our methodology to the development of a number of services to support the analysis of different types of gene expression data, including microarray and RNASeq. All developed services are publicly available in the Gene Expression Analysis Services (GEAS Repository at http://dcm.ffclrp.usp.br/lssb/geas. Additionally, we have used a number of the developed services to create different integrated analysis scenarios to reproduce parts of two gene expression studies documented in the literature. The first study involves the analysis of one-color microarray data obtained from multiple sclerosis patients and healthy donors. The second study comprises the analysis of RNA-Seq data obtained from melanoma cells to investigate the role of the remodeller BRG1 in the proliferation and morphology of these cells. Our methodology provides concrete guidelines and technical details in order to facilitate the systematic development of semantic web services. Moreover, it encourages the development and reuse of these services for the creation of semantically integrated solutions for gene expression analysis.
Alvarez, Hector; Corvalan, Alejandro; Roa, Juan C; Argani, Pedram; Murillo, Francisco; Edwards, Jennifer; Beaty, Robert; Feldmann, Georg; Hong, Seung-Mo; Mullendore, Michael; Roa, Ivan; Ibañez, Luis; Pimentel, Fernando; Diaz, Alfonso; Riggins, Gregory J; Maitra, Anirban
Gallbladder cancer (GBC) is an uncommon neoplasm in the United States, but one with high mortality rates. This malignancy remains largely understudied at the molecular level such that few targeted therapies or predictive biomarkers exist. We built the first series of serial analysis of gene expression (SAGE) libraries from GBC and nonneoplastic gallbladder mucosa, composed of 21-bp long-SAGE tags. SAGE libraries were generated from three stage-matched GBC patients (representing Hispanic/Latino, Native American, and Caucasian ethnicities, respectively) and one histologically alithiasic gallbladder. Real-time quantitative PCR was done on microdissected epithelium from five matched GBC and corresponding nonneoplastic gallbladder mucosa. Immunohistochemical analysis was done on a panel of 182 archival GBC in high-throughput tissue microarray format. SAGE tags corresponding to connective tissue growth factor (CTGF) transcripts were identified as differentially overexpressed in all pairwise comparisons of GBC (P Cancer Genome Anatomy Project web site and should facilitate much needed research into this lethal neoplasm.
Sun, Xicai; Guo, Limin; Wang, Jingjing; Wang, Huan; Liu, Zhuofu; Liu, Juan; Yu, Huapeng; Hu, Li; Li, Han; Wang, Dehui
Although JNA is a benign neoplasm histopathologically, it has a propensity for locally destructive growth and remains a higher postoperative recurrence rate. The aim of this study was to analyze the expression and localization of MMP-9 in JNA using tissue microarray to elucidate its correlation with clinicopathological features and recurrence. The expression of MMP-9 was assessed by immunohistochemistry in a tissue microarray from 70 patients with JNA and 10 control subjects. Correlation between the levels of MMP-9 expression and clinicopathologic variables, as well as tumor recurrence, were analyzed. MMP-9 was detected in perivascular and extravascular less differentiated cells and stromal cells of patients with JNA but not in the matured vascular endothelial cells of these patients. The presence of MMP-9 expression in JNA was correlated with patient's age (p=0.001). Spearman correlation analysis suggested that high expression of MMP-9 in JNA had negative correlation with patient's age (r=-0.412, p<0.001). The recurrence rate in JNA patients with high MMP-9 expression was significantly higher than those with low MMP-9 expression (p=0.002). In multivariate and ROC curve analysis, MMP-9 was a good prognostic factor for tumor recurrence of JNA. Higher MMP-9 expression is a poor prognostic factor for patients with JNA who have been surgically treated. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Halabi, Najeeb M.; Martinez, Alejandra; Al-Farsi, Halema; Mery, Eliane; Puydenus, Laurence; Pujol, Pascal; Khalak, Hanif G.; McLurcan, Cameron; Ferron, Gwenael; Querleu, Denis; Al-Azwani, Iman; Al-Dous, Eman; Mohamoud, Yasmin A.; Malek, Joel A.; Rafii, Arash
Identifying genes where a variant allele is preferentially expressed in tumors could lead to a better understanding of cancer biology and optimization of targeted therapy. However, tumor sample heterogeneity complicates standard approaches for detecting preferential allele expression. We therefore developed a novel approach combining genome and transcriptome sequencing data from the same sample that corrects for sample heterogeneity and identifies significant preferentially expressed alleles. We applied this analysis to epithelial ovarian cancer samples consisting of matched primary ovary and peritoneum and lymph node metastasis. We find that preferentially expressed variant alleles include germline and somatic variants, are shared at a relatively high frequency between patients, and are in gene networks known to be involved in cancer processes. Analysis at a patient level identifies patient-specific preferentially expressed alleles in genes that are targets for known drugs. Analysis at a site level identifies patterns of site specific preferential allele expression with similar pathways being impacted in the primary and metastasis sites. We conclude that genes with preferentially expressed variant alleles can act as cancer drivers and that targeting those genes could lead to new therapeutic strategies. PMID:26735499
Xiao, Xiaolin; Moreno-Moral, Aida; Rotival, Maxime; Bottolo, Leonardo; Petretto, Enrico
Recent high-throughput efforts such as ENCODE have generated a large body of genome-scale transcriptional data in multiple conditions (e.g., cell-types and disease states). Leveraging these data is especially important for network-based approaches to human disease, for instance to identify coherent transcriptional modules (subnetworks) that can inform functional disease mechanisms and pathological pathways. Yet, genome-scale network analysis across conditions is significantly hampered by the paucity of robust and computationally-efficient methods. Building on the Higher-Order Generalized Singular Value Decomposition, we introduce a new algorithmic approach for efficient, parameter-free and reproducible identification of network-modules simultaneously across multiple conditions. Our method can accommodate weighted (and unweighted) networks of any size and can similarly use co-expression or raw gene expression input data, without hinging upon the definition and stability of the correlation used to assess gene co-expression. In simulation studies, we demonstrated distinctive advantages of our method over existing methods, which was able to recover accurately both common and condition-specific network-modules without entailing ad-hoc input parameters as required by other approaches. We applied our method to genome-scale and multi-tissue transcriptomic datasets from rats (microarray-based) and humans (mRNA-sequencing-based) and identified several common and tissue-specific subnetworks with functional significance, which were not detected by other methods. In humans we recapitulated the crosstalk between cell-cycle progression and cell-extracellular matrix interactions processes in ventricular zones during neocortex expansion and further, we uncovered pathways related to development of later cognitive functions in the cortical plate of the developing brain which were previously unappreciated. Analyses of seven rat tissues identified a multi-tissue subnetwork of co-expressed
Full Text Available Recent high-throughput efforts such as ENCODE have generated a large body of genome-scale transcriptional data in multiple conditions (e.g., cell-types and disease states. Leveraging these data is especially important for network-based approaches to human disease, for instance to identify coherent transcriptional modules (subnetworks that can inform functional disease mechanisms and pathological pathways. Yet, genome-scale network analysis across conditions is significantly hampered by the paucity of robust and computationally-efficient methods. Building on the Higher-Order Generalized Singular Value Decomposition, we introduce a new algorithmic approach for efficient, parameter-free and reproducible identification of network-modules simultaneously across multiple conditions. Our method can accommodate weighted (and unweighted networks of any size and can similarly use co-expression or raw gene expression input data, without hinging upon the definition and stability of the correlation used to assess gene co-expression. In simulation studies, we demonstrated distinctive advantages of our method over existing methods, which was able to recover accurately both common and condition-specific network-modules without entailing ad-hoc input parameters as required by other approaches. We applied our method to genome-scale and multi-tissue transcriptomic datasets from rats (microarray-based and humans (mRNA-sequencing-based and identified several common and tissue-specific subnetworks with functional significance, which were not detected by other methods. In humans we recapitulated the crosstalk between cell-cycle progression and cell-extracellular matrix interactions processes in ventricular zones during neocortex expansion and further, we uncovered pathways related to development of later cognitive functions in the cortical plate of the developing brain which were previously unappreciated. Analyses of seven rat tissues identified a multi
Liu, Ji; Li, Xue; Dong, Guang-Long; Zhang, Hong-Wei; Chen, Dong-Li; Du, Jian-Jun; Zheng, Jian-Yong; Li, Ji-Peng; Wang, Wei-Zhong
The S100 protein family comprises 22 members whose protein sequences encompass at least one EF-hand Ca 2+ binding motif. They were involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. However, the expression status of S100 family members in gastric cancer was not known yet. Combined with analysis of series analysis of gene expression, virtual Northern blot and microarray data, the expression levels of S100 family members in normal and malignant stomach tissues were systematically investigated. The expression of S100A3 was further evaluated by quantitative RT-PCR. At least 5 S100 genes were found to be upregulated in gastric cance by in silico analysis. Among them, four genes, including S100A2, S100A4, S100A7 and S100A10, were reported to overexpressed in gastric cancer previously. The expression of S100A3 in eighty patients of gastric cancer was further examined. The results showed that the mean expression levels of S100A3 in gastric cancer tissues were 2.5 times as high as in adjacent non-tumorous tissues. S100A3 expression was correlated with tumor differentiation and TNM (Tumor-Node-Metastasis) stage of gastric cancer, which was relatively highly expressed in poorly differentiated and advanced gastric cancer tissues (P < 0.05). To our knowledge this is the first report of systematic evaluation of S100 gene expressions in gastric cancers by multiple in silico analysis. The results indicated that overexpression of S100 gene family members were characteristics of gastric cancers and S100A3 might play important roles in differentiation and progression of gastric cancer
van Kampen, A. H.; van Schaik, B. D.; Pauws, E.; Michiels, E. M.; Ruijter, J. M.; Caron, H. N.; Versteeg, R.; Heisterkamp, S. H.; Leunissen, J. A.; Baas, F.; van der Mee, M.
MOTIVATION: SAGE enables the determination of genome-wide mRNA expression profiles. A comprehensive analysis of SAGE data requires software, which integrates (statistical) data analysis methods with a database system. Furthermore, to facilitate data sharing between users, the application should
Full Text Available Tumorigenesis is a complex dynamic biological process that includes multiple steps of genetic and epigenetic alterations, aberrant expression of noncoding RNA, and changes in the expression profiles of coding genes. We call the collection of those perturbations in genome space the “cancer initiatome.” Long noncoding RNAs (lncRNAs are pervasively transcribed in the genome and they have key regulatory functions in chromatin remodeling and gene expression. Spatiotemporal variation in the expression of lncRNAs has been observed in development and disease states, including cancer. A few dysregulated lncRNAs have been studied in cancers, but the role of lncRNAs in the cancer initiatome remains largely unknown, especially in esophageal squamous cell carcinoma (ESCC. We conducted a genome-wide screen of the expression of lncRNAs and coding RNAs from ESCC and matched adjacent nonneoplastic normal tissues. We identified differentially expressed lncRNAs and coding RNAs in ESCC relative to their matched normal tissue counterparts and validated the result using polymerase chain reaction analysis. Furthermore, we identified differentially expressed lncRNAs that are co-located and co-expressed with differentially expressed coding RNAs in ESCC and the results point to a potential interaction between lncRNAs and neighboring coding genes that affect ether lipid metabolism, and the interaction may contribute to the development of ESCC. These data provide compelling evidence for a potential novel genomic biomarker of esophageal squamous cell cancer.
Full Text Available Abstract Background Many Automatic Function Prediction (AFP methods were developed to cope with an increasing growth of the number of gene sequences that are available from high throughput sequencing experiments. To support the development of AFP methods, it is essential to have community wide experiments for evaluating performance of existing AFP methods. Critical Assessment of Function Annotation (CAFA is one such community experiment. The meeting of CAFA was held as a Special Interest Group (SIG meeting at the Intelligent Systems in Molecular Biology (ISMB conference in 2011. Here, we perform a detailed analysis of two sequence-based function prediction methods, PFP and ESG, which were developed in our lab, using the predictions submitted to CAFA. Results We evaluate PFP and ESG using four different measures in comparison with BLAST, Prior, and GOtcha. In addition to the predictions submitted to CAFA, we further investigate performance of a different scoring function to rank order predictions by PFP as well as PFP/ESG predictions enriched with Priors that simply adds frequently occurring Gene Ontology terms as a part of predictions. Prediction accuracies of each method were also evaluated separately for different functional categories. Successful and unsuccessful predictions by PFP and ESG are also discussed in comparison with BLAST. Conclusion The in-depth analysis discussed here will complement the overall assessment by the CAFA organizers. Since PFP and ESG are based on sequence database search results, our analyses are not only useful for PFP and ESG users but will also shed light on the relationship of the sequence similarity space and functions that can be inferred from the sequences.
Bandini, Andrea; Orlandi, Silvia; Escalante, Hugo Jair; Giovannelli, Fabio; Cincotta, Massimo; Reyes-Garcia, Carlos A; Vanni, Paola; Zaccara, Gaetano; Manfredi, Claudia
The automatic analysis of facial expressions is an evolving field that finds several clinical applications. One of these applications is the study of facial bradykinesia in Parkinson's disease (PD), which is a major motor sign of this neurodegenerative illness. Facial bradykinesia consists in the reduction/loss of facial movements and emotional facial expressions called hypomimia. In this work we propose an automatic method for studying facial expressions in PD patients relying on video-based METHODS: 17 Parkinsonian patients and 17 healthy control subjects were asked to show basic facial expressions, upon request of the clinician and after the imitation of a visual cue on a screen. Through an existing face tracker, the Euclidean distance of the facial model from a neutral baseline was computed in order to quantify the changes in facial expressivity during the tasks. Moreover, an automatic facial expressions recognition algorithm was trained in order to study how PD expressions differed from the standard expressions. Results show that control subjects reported on average higher distances than PD patients along the tasks. This confirms that control subjects show larger movements during both posed and imitated facial expressions. Moreover, our results demonstrate that anger and disgust are the two most impaired expressions in PD patients. Contactless video-based systems can be important techniques for analyzing facial expressions also in rehabilitation, in particular speech therapy, where patients could get a definite advantage from a real-time feedback about the proper facial expressions/movements to perform. Copyright © 2017 Elsevier B.V. All rights reserved.
Spínola, Hélder; Bruges-Armas, Jácome; Mora, Marian Gantes; Middleton, Derek; Brehm, António
Human leukocyte antigen (HLA)-A, HLA-B, and HLA-DRB1 polymorphisms were examined in Madeira Island populations. The data was obtained at high-resolution level, using sequence-based typing (SBT). The most frequent alleles at each loci were: A*020101 (24.6%), B*5101 (9.7%), B*440201 (9.2%), and DRB1*070101 (15.7%). The predominant three-loci haplotypes in Madeira were A*020101-B*510101-DRB1*130101 (2.7%) and A*010101-B*0801-DRB1*030101 (2.4%), previously found in north and central Portugal. The present study corroborates historical sources and other genetic studies that say Madeira were populated not only by Europeans, mostly Portuguese, but also sub-Saharan Africans due to slave trade. Comparison with other populations shows that Madeira experienced a stronger African influence due to slave trade than Portugal mainland and even the Azores archipelago. Despite this African genetic input, haplotype and allele frequencies were predominantly from European origin, mostly common to mainland Portugal.
Pantazes, Robert J; Saraf, Manish C; Maranas, Costas D
In this paper, we introduce and test two new sequence-based protein scoring systems (i.e. S1, S2) for assessing the likelihood that a given protein hybrid will be functional. By binning together amino acids with similar properties (i.e. volume, hydrophobicity and charge) the scoring systems S1 and S2 allow for the quantification of the severity of mismatched interactions in the hybrids. The S2 scoring system is found to be able to significantly functionally enrich a cytochrome P450 library over other scoring methods. Given this scoring base, we subsequently constructed two separate optimization formulations (i.e. OPTCOMB and OPTOLIGO) for optimally designing protein combinatorial libraries involving recombination or mutations, respectively. Notably, two separate versions of OPTCOMB are generated (i.e. model M1, M2) with the latter allowing for position-dependent parental fragment skipping. Computational benchmarking results demonstrate the efficacy of models OPTCOMB and OPTOLIGO to generate high scoring libraries of a prespecified size.
Full Text Available Identifying facial expressions is crucial for social interactions. Functional neuroimaging studies show that a set of brain areas, such as the fusiform gyrus and amygdala, become active when viewing emotional facial expressions. The majority of functional magnetic resonance imaging (fMRI studies investigating face perception typically employ static images of faces. However, studies that use dynamic facial expressions (e.g., videos are accumulating and suggest that a dynamic presentation may be more sensitive and ecologically valid for investigating faces. By using quantitative fMRI meta-analysis the present study examined concordance of brain regions associated with viewing dynamic facial expressions. We analyzed data from 216 participants that participated in 14 studies, which reported coordinates for 28 experiments. Our analysis revealed bilateral fusiform and middle temporal gyri, left amygdala, left declive of the cerebellum and the right inferior frontal gyrus. These regions are discussed in terms of their relation to models of face processing.
Zinchenko, Oksana; Yaple, Zachary A; Arsalidou, Marie
Identifying facial expressions is crucial for social interactions. Functional neuroimaging studies show that a set of brain areas, such as the fusiform gyrus and amygdala, become active when viewing emotional facial expressions. The majority of functional magnetic resonance imaging (fMRI) studies investigating face perception typically employ static images of faces. However, studies that use dynamic facial expressions (e.g., videos) are accumulating and suggest that a dynamic presentation may be more sensitive and ecologically valid for investigating faces. By using quantitative fMRI meta-analysis the present study examined concordance of brain regions associated with viewing dynamic facial expressions. We analyzed data from 216 participants that participated in 14 studies, which reported coordinates for 28 experiments. Our analysis revealed bilateral fusiform and middle temporal gyri, left amygdala, left declive of the cerebellum and the right inferior frontal gyrus. These regions are discussed in terms of their relation to models of face processing.
Full Text Available Abstract Background This paper addresses key biological problems and statistical issues in the analysis of large gene expression data sets that describe systemic temporal response cascades to therapeutic doses in multiple tissues such as liver, skeletal muscle, and kidney from the same animals. Affymetrix time course gene expression data U34A are obtained from three different tissues including kidney, liver and muscle. Our goal is not only to find the concordance of gene in different tissues, identify the common differentially expressed genes over time and also examine the reproducibility of the findings by integrating the results through meta analysis from multiple tissues in order to gain a significant increase in the power of detecting differentially expressed genes over time and to find the differential differences of three tissues responding to the drug. Results and conclusion Bayesian categorical model for estimating the proportion of the 'call' are used for pre-screening genes. Hierarchical Bayesian Mixture Model is further developed for the identifications of differentially expressed genes across time and dynamic clusters. Deviance information criterion is applied to determine the number of components for model comparisons and selections. Bayesian mixture model produces the gene-specific posterior probability of differential/non-differential expression and the 95% credible interval, which is the basis for our further Bayesian meta-inference. Meta-analysis is performed in order to identify commonly expressed genes from multiple tissues that may serve as ideal targets for novel treatment strategies and to integrate the results across separate studies. We have found the common expressed genes in the three tissues. However, the up/down/no regulations of these common genes are different at different time points. Moreover, the most differentially expressed genes were found in the liver, then in kidney, and then in muscle.
Warnat, Patrick; Oberthuer, André; Fischer, Matthias; Westermann, Frank; Eils, Roland; Brors, Benedikt
Neuroblastoma patients show heterogeneous clinical courses ranging from life-threatening progression to spontaneous regression. Recently, gene expression profiles of neuroblastoma tumours were associated with clinically different phenotypes. However, such data is still rare for important patient subgroups, such as patients with MYCN non-amplified advanced stage disease. Prediction of the individual course of disease and optimal therapy selection in this cohort is challenging. Additional research effort is needed to describe the patterns of gene expression in this cohort and to identify reliable prognostic markers for this subset of patients. We combined gene expression data from two studies in a meta-analysis in order to investigate differences in gene expression of advanced stage (3 or 4) tumours without MYCN amplification that show contrasting outcomes (alive or dead) at five years after initial diagnosis. In addition, a predictive model for outcome was generated. Gene expression profiles from 66 patients were included from two studies using different microarray platforms. In the combined data set, 72 genes were identified as differentially expressed by meta-analysis at a false discovery rate (FDR) of 8.33%. Meta-analysis detected 34 differentially expressed genes that were not found as significant in either single study. Outcome prediction based on data of both studies resulted in a predictive accuracy of 77%. Moreover, the genes that were differentially expressed in subgroups of advanced stage patients without MYCN amplification accurately separated MYCN amplified tumours from low stage tumours without MYCN amplification. Our findings support the hypothesis that neuroblastoma consists of two biologically distinct subgroups that differ by characteristic gene expression patterns, which are associated with divergent clinical outcome
Pandi, Narayanan Sathiya; Suganya, Sivagurunathan; Rajendran, Suriliyandi
Highlights: •Identified stomach lineage specific gene set (SLSGS) was found to be under expressed in gastric tumors. •Elevated expression of SLSGS in gastric tumor is a molecular predictor of metabolic type gastric cancer. •In silico pathway scanning identified estrogen-α signaling is a putative regulator of SLSGS in gastric cancer. •Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. -- Abstract: Stomach lineage specific gene products act as a protective barrier in the normal stomach and their expression maintains the normal physiological processes, cellular integrity and morphology of the gastric wall. However, the regulation of stomach lineage specific genes in gastric cancer (GC) is far less clear. In the present study, we sought to investigate the role and regulation of stomach lineage specific gene set (SLSGS) in GC. SLSGS was identified by comparing the mRNA expression profiles of normal stomach tissue with other organ tissue. The obtained SLSGS was found to be under expressed in gastric tumors. Functional annotation analysis revealed that the SLSGS was enriched for digestive function and gastric epithelial maintenance. Employing a single sample prediction method across GC mRNA expression profiles identified the under expression of SLSGS in proliferative type and invasive type gastric tumors compared to the metabolic type gastric tumors. Integrative pathway activation prediction analysis revealed a close association between estrogen-α signaling and SLSGS expression pattern in GC. Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. In conclusion, our results highlight that estrogen mediated regulation of SLSGS in gastric tumor is a molecular predictor of metabolic type GC and prognostic factor in GC
Pandi, Narayanan Sathiya, E-mail: firstname.lastname@example.org; Suganya, Sivagurunathan; Rajendran, Suriliyandi
Highlights: •Identified stomach lineage specific gene set (SLSGS) was found to be under expressed in gastric tumors. •Elevated expression of SLSGS in gastric tumor is a molecular predictor of metabolic type gastric cancer. •In silico pathway scanning identified estrogen-α signaling is a putative regulator of SLSGS in gastric cancer. •Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. -- Abstract: Stomach lineage specific gene products act as a protective barrier in the normal stomach and their expression maintains the normal physiological processes, cellular integrity and morphology of the gastric wall. However, the regulation of stomach lineage specific genes in gastric cancer (GC) is far less clear. In the present study, we sought to investigate the role and regulation of stomach lineage specific gene set (SLSGS) in GC. SLSGS was identified by comparing the mRNA expression profiles of normal stomach tissue with other organ tissue. The obtained SLSGS was found to be under expressed in gastric tumors. Functional annotation analysis revealed that the SLSGS was enriched for digestive function and gastric epithelial maintenance. Employing a single sample prediction method across GC mRNA expression profiles identified the under expression of SLSGS in proliferative type and invasive type gastric tumors compared to the metabolic type gastric tumors. Integrative pathway activation prediction analysis revealed a close association between estrogen-α signaling and SLSGS expression pattern in GC. Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. In conclusion, our results highlight that estrogen mediated regulation of SLSGS in gastric tumor is a molecular predictor of metabolic type GC and prognostic factor in GC.
Qu, Hua; Wang, Liu-Pu; Liang, Yan-Chun; Wu, Chun-Guo
Cheng and Church algorithm is an important approach in biclustering algorithms. In this paper, the process of the extended space in the second stage of Cheng and Church algorithm is improved and the selections of two important parameters are discussed. The results of the improved algorithm used in the gene expression spectrum analysis show that, compared with Cheng and Church algorithm, the quality of clustering results is enhanced obviously, the mining expression models are better, and the d...
Wang, Wei; Lin, Tianxin; Huang, Jian; Hu, Weilie; Xu, Kewei; Liu, Jun
Mel-18 is a member of the polycomb group (PcG) of proteins, which are chromatin regulatory factors that play an important role in development and oncogenesis. This study was designed to investigate the clinical and prognostic significance of Mel-18 in the patients with prostate cancer. Immunostaining with Mel-18 specific antibodies was performed on paraffin sections from 202 patients. Correlations between Mel-18 and the Gleason grading system, clinical stage, serum prostate-specific antigen (PSA) levels, and age were evaluated. PSA recurrence in 76 patients who underwent radical prostatectomy and survival in 59 patients with metastases at diagnosis were analyzed to evaluate the influence of Mel-18 expression in cancer progression using Kaplan-Meier analysis and multivariate Cox regression analysis. Staining was seen in all prostatic tissues. Mel-18 expression was significantly reduced in the prostate cancer patients with PSA levels over 100 ng/ml (P=0.009), advanced clinical stage (>T4, N1, or M1 disease, P=0.029), higher Gleason grade or with a higher Gleason score (P=0.018) than in those with other clinicopathologic features. Negative expression of Mel-18 was associated with significantly higher rates of PSA recurrence after radical prostatectomy than with positive expression of Mel-18 (P = 0.029), and was an independent predictor of PSA recurrence (P=0.034, HR=2.143) in multivariate analysis. Similarly, metastatic prostate cancer patients with negative expression of Mel-18 showed significantly worse survival compared with the positive expression of Mel-18 (P=0.025). In multivariate analysis, negative expression of Mel-18 was an independent predictor of cancer-specific survival (P=0.024, HR=2.365). Our study provides important evidence for the recognition of Mel-18 as a tumor suppressor. The expression of Mel-18 showed potential as a prognostic marker for human prostate cancer. Copyright © 2011 Elsevier Inc. All rights reserved.
Rieu, Ivo; Bots, Marc; Mariani, Celestina; Weterings, Koen A P
The Arabidopsis AINTEGUMENTA (ANT) protein is essential for proper ovule development, but functions in cell proliferation and organ growth throughout the plant. Here we report the isolation of a full-length cDNA clone from tobacco (Nicotiana tabacum L.) that encodes a protein with high similarity to ANT and is preferentially expressed in the pistil. In situ hybridization analysis on the tobacco ovary shows that the expression pattern of the corresponding gene is different from that of ANT in Arabidopsis.
Pandi, Narayanan Sathiya; Suganya, Sivagurunathan; Rajendran, Suriliyandi
Stomach lineage specific gene products act as a protective barrier in the normal stomach and their expression maintains the normal physiological processes, cellular integrity and morphology of the gastric wall. However, the regulation of stomach lineage specific genes in gastric cancer (GC) is far less clear. In the present study, we sought to investigate the role and regulation of stomach lineage specific gene set (SLSGS) in GC. SLSGS was identified by comparing the mRNA expression profiles of normal stomach tissue with other organ tissue. The obtained SLSGS was found to be under expressed in gastric tumors. Functional annotation analysis revealed that the SLSGS was enriched for digestive function and gastric epithelial maintenance. Employing a single sample prediction method across GC mRNA expression profiles identified the under expression of SLSGS in proliferative type and invasive type gastric tumors compared to the metabolic type gastric tumors. Integrative pathway activation prediction analysis revealed a close association between estrogen-α signaling and SLSGS expression pattern in GC. Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. In conclusion, our results highlight that estrogen mediated regulation of SLSGS in gastric tumor is a molecular predictor of metabolic type GC and prognostic factor in GC. Copyright © 2013 Elsevier Inc. All rights reserved.
Borsos, Zsófia; Gyori, Miklos
Exploratory analyses of emotional expressions using a commercially available facial expression recognition software are reported, from the context of a serious game for screening purposes. Our results are based on a comparative analysis of two matched groups of kindergarten-age children (high-functioning children with autism spectrum condition: n=13; typically developing children: n=13). Results indicate that this technology has the potential to identify autism-specific emotion expression features, and may play a role in affective diagnostic and assistive technologies.
Rode, Tone Mari; Berget, Ingunn; Langsrud, Solveig; Møretrø, Trond; Holck, Askild
Microorganisms are constantly exposed to new and altered growth conditions, and respond by changing gene expression patterns. Several methods for studying gene expression exist. During the last decade, the analysis of microarrays has been one of the most common approaches applied for large scale gene expression studies. A relatively new method for gene expression analysis is MassARRAY, which combines real competitive-PCR and MALDI-TOF (matrix-assisted laser desorption/ionization time-of-flight) mass spectrometry. In contrast to microarray methods, MassARRAY technology is suitable for analysing a larger number of samples, though for a smaller set of genes. In this study we compare the results from MassARRAY with microarrays on gene expression responses of Staphylococcus aureus exposed to acid stress at pH 4.5. RNA isolated from the same stress experiments was analysed using both the MassARRAY and the microarray methods. The MassARRAY and microarray methods showed good correlation. Both MassARRAY and microarray estimated somewhat lower fold changes compared with quantitative real-time PCR (qRT-PCR). The results confirmed the up-regulation of the urease genes in acidic environments, and also indicated the importance of metal ion regulation. This study shows that the MassARRAY technology is suitable for gene expression analysis in prokaryotes, and has advantages when a set of genes is being analysed for an organism exposed to many different environmental conditions.
Tintle Nathan L
Full Text Available Abstract Background Statistical analyses of whole genome expression data require functional information about genes in order to yield meaningful biological conclusions. The Gene Ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG are common sources of functionally grouped gene sets. For bacteria, the SEED and MicrobesOnline provide alternative, complementary sources of gene sets. To date, no comprehensive evaluation of the data obtained from these resources has been performed. Results We define a series of gene set consistency metrics directly related to the most common classes of statistical analyses for gene expression data, and then perform a comprehensive analysis of 3581 Affymetrix® gene expression arrays across 17 diverse bacteria. We find that gene sets obtained from GO and KEGG demonstrate lower consistency than those obtained from the SEED and MicrobesOnline, regardless of gene set size. Conclusions Despite the widespread use of GO and KEGG gene sets in bacterial gene expression data analysis, the SEED and MicrobesOnline provide more consistent sets for a wide variety of statistical analyses. Increased use of the SEED and MicrobesOnline gene sets in the analysis of bacterial gene expression data may improve statistical power and utility of expression data.
Hawkins, Shannon M.; Loomans, Holli A.; Wan, Ying-Wooi; Ghosh-Choudhury, Triparna; Coffey, Donna; Xiao, Weimin; Liu, Zhandong; Sangi-Haghpeykar, Haleh
Context: Recent evidence implicates the orphan nuclear receptor, nuclear receptor subfamily 2, group F, member 2 (NR2F2; chicken ovalbumin upstream promoter-transcription factor II) as both a master regulator of angiogenesis and an oncogene in prostate and other human cancers. Objective: The objective of the study was to determine whether NR2F2 plays a role in ovarian cancer and dissect its potential mechanisms of action. Design, Setting, and Patients: We examined NR2F2 expression in healthy ovary and ovarian cancers using quantitative PCR and immunohistochemistry. NR2F2 expression was targeted in established ovarian cancer cell lines to assess the impact of dysregulated NR2F2 expression in the epithelial compartment of ovarian cancers. Results: Our results indicate that NR2F2 is robustly expressed in the stroma of healthy ovary with little or no expression in epithelia lining the ovarian surface, clefts, or crypts. This pattern of NR2F2 expression was markedly disrupted in ovarian cancers, in which decreased levels of stromal expression and ectopic epithelial expression were frequently observed. Ovarian cancers with the most disrupted patterns of NR2F2 were associated with significantly shorter disease-free interval by Kaplan-Meier analysis. Targeting NR2F2 expression in established ovarian cancer cell lines enhanced apoptosis and increased proliferation. In addition, we found that NR2F2 regulates the expression of NEK2, RAI14, and multiple other genes involved in the cell cycle, suggesting potential pathways by which dysregulated expression of NR2F2 impacts ovarian cancer. Conclusions: These results uncover novel roles for NR2F2 in ovarian cancer and point to a unique scenario in which a single nuclear receptor plays potentially distinct roles in the stromal and epithelial compartments of the same tissue. PMID:23690307
Lattin, Jane E; Schroder, Kate; Su, Andrew I; Walker, John R; Zhang, Jie; Wiltshire, Tim; Saijo, Kaoru; Glass, Christopher K; Hume, David A; Kellie, Stuart; Sweet, Matthew J
Monocytes and macrophages express an extensive repertoire of G Protein-Coupled Receptors (GPCRs) that regulate inflammation and immunity. In this study we performed a systematic micro-array analysis of GPCR expression in primary mouse macrophages to identify family members that are either enriched in macrophages compared to a panel of other cell types, or are regulated by an inflammatory stimulus, the bacterial product lipopolysaccharide (LPS). Several members of the P2RY family had striking expression patterns in macrophages; P2ry6 mRNA was essentially expressed in a macrophage-specific fashion, whilst P2ry1 and P2ry5 mRNA levels were strongly down-regulated by LPS. Expression of several other GPCRs was either restricted to macrophages (e.g. Gpr84) or to both macrophages and neural tissues (e.g. P2ry12, Gpr85). The GPCR repertoire expressed by bone marrow-derived macrophages and thioglycollate-elicited peritoneal macrophages had some commonality, but there were also several GPCRs preferentially expressed by either cell population. The constitutive or regulated expression in macrophages of several GPCRs identified in this study has not previously been described. Future studies on such GPCRs and their agonists are likely to provide important insights into macrophage biology, as well as novel inflammatory pathways that could be future targets for drug discovery.
Hermsen, Sanne A.B., E-mail: Sanne.Hermsen@rivm.nl [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Department of Toxicogenomics, Maastricht University, P.O. Box 616, 6200 MD, Maastricht (Netherlands); Institute for Risk Assessment Sciences (IRAS), Utrecht University, P.O. Box 80.178, 3508 TD, Utrecht (Netherlands); Pronk, Tessa E. [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Department of Toxicogenomics, Maastricht University, P.O. Box 616, 6200 MD, Maastricht (Netherlands); Brandhof, Evert-Jan van den [Centre for Environmental Quality, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Ven, Leo T.M. van der [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Piersma, Aldert H. [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Institute for Risk Assessment Sciences (IRAS), Utrecht University, P.O. Box 80.178, 3508 TD, Utrecht (Netherlands)
The zebrafish embryotoxicity test is a promising alternative assay for developmental toxicity. Classically, morphological assessment of the embryos is applied to evaluate the effects of compound exposure. However, by applying differential gene expression analysis the sensitivity and predictability of the test may be increased. For defining gene expression signatures of developmental toxicity, we explored the possibility of using gene expression signatures of compound exposures based on commonly expressed individual genes as well as based on regulated gene pathways. Four developmental toxic compounds were tested in concentration-response design, caffeine, carbamazepine, retinoic acid and valproic acid, and two non-embryotoxic compounds, D-mannitol and saccharin, were included. With transcriptomic analyses we were able to identify commonly expressed genes, which were mostly development related, after exposure to the embryotoxicants. We also identified gene pathways regulated by the embryotoxicants, suggestive of their modes of action. Furthermore, whereas pathways may be regulated by all compounds, individual gene expression within these pathways can differ for each compound. Overall, the present study suggests that the use of individual gene expression signatures as well as pathway regulation may be useful starting points for defining gene biomarkers for predicting embryotoxicity. - Highlights: • The zebrafish embryotoxicity test in combination with transcriptomics was used. • We explored two approaches of defining gene biomarkers for developmental toxicity. • Four compounds in concentration-response design were tested. • We identified commonly expressed individual genes as well as regulated gene pathways. • Both approaches seem suitable starting points for defining gene biomarkers.
Zhu, Yizhang; Wang, Likun; Yin, Yuxin; Yang, Ence
Postmortem mRNA degradation is considered to be the major concern in gene expression research utilizing human postmortem tissues. A key factor in this process is the postmortem interval (PMI), which is defined as the interval between death and sample collection. However, global patterns of postmortem mRNA degradation at individual gene levels across diverse human tissues remain largely unknown. In this study, we performed a systematic analysis of alteration of gene expression associated with PMI in human tissues. From the Genotype-Tissue Expression (GTEx) database, we evaluated gene expression levels of 2,016 high-quality postmortem samples from 316 donors of European descent, with PMI ranging from 1 to 27 hours. We found that PMI-related mRNA degradation is tissue-specific, gene-specific, and even genotype-dependent, thus drawing a more comprehensive picture of PMI-associated gene expression across diverse human tissues. Additionally, we also identified 266 differentially variable (DV) genes, such as DEFB4B and IFNG, whose expression is significantly dispersed between short PMI (S-PMI) and long PMI (L-PMI) groups. In summary, our analyses provide a comprehensive profile of PMI-associated gene expression, which will help interpret gene expression patterns in the evaluation of postmortem tissues.
Ballouz, S; Verleyen, W; Gillis, J
RNA-seq co-expression analysis is in its infancy and reasonable practices remain poorly defined. We assessed a variety of RNA-seq expression data to determine factors affecting functional connectivity and topology in co-expression networks. We examine RNA-seq co-expression data generated from 1970 RNA-seq samples using a Guilt-By-Association framework, in which genes are assessed for the tendency of co-expression to reflect shared function. Minimal experimental criteria to obtain performance on par with microarrays were >20 samples with read depth >10 M per sample. While the aggregate network constructed shows good performance (area under the receiver operator characteristic curve ∼0.71), the dependency on number of experiments used is nearly identical to that present in microarrays, suggesting thousands of samples are required to obtain 'gold-standard' co-expression. We find a major topological difference between RNA-seq and microarray co-expression in the form of low overlaps between hub-like genes from each network due to changes in the correlation of expression noise within each technology. email@example.com or firstname.lastname@example.org Networks are available at: http://gillislab.labsites.cshl.edu/supplements/rna-seq-networks/ and supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: email@example.com.
Ruebel, Oliver; Keranen, Soile V.E.; Biggin, Mark; Knowles, David W.; Weber, Gunther H.; Hagen, Hans; Hamann, Bernd; Bethel, E. Wes
Three-dimensional gene expression PointCloud data generated by the Berkeley Drosophila Transcription Network Project (BDTNP) provides quantitative information about the spatial and temporal expression of genes in early Drosophila embryos at cellular resolution. The BDTNP team visualizes and analyzes Point-Cloud data using the software application PointCloudXplore (PCX). To maximize the impact of novel, complex data sets, such as PointClouds, the data needs to be accessible to biologists and comprehensible to developers of analysis functions. We address this challenge by linking PCX and Matlab via a dedicated interface, thereby providing biologists seamless access to advanced data analysis functions and giving bioinformatics researchers the opportunity to integrate their analysis directly into the visualization application. To demonstrate the usefulness of this approach, we computationally model parts of the expression pattern of the gene even skipped using a genetic algorithm implemented in Matlab and integrated into PCX via our Matlab interface.
El-Sherry, S; Ogedengbe, M E; Hafeez, M A; Sayf-Al-Din, M; Gad, N; Barta, J R
Unlike with Eimeria species infecting chickens, specific identification and nomenclature of Eimeria species infecting turkeys is complicated, and in the absence of molecular data, imprecise. In an attempt to reconcile contradictory data reported on oocyst morphometrics and biological descriptions of various Eimeria species infecting turkey, we established single oocyst derived lines of 5 important Eimeria species infecting turkeys, Eimeria meleagrimitis (USMN08-01 strain), Eimeria adenoeides (Guelph strain), Eimeria gallopavonis (Weybridge strain), Eimeria meleagridis (USAR97-01 strain), and Eimeria dispersa (Briston strain). Short portions (514 bp) of mitochondrial cytochrome c oxidase subunit I gene (mt COI) from each were amplified and sequenced. Comparison of these sequences showed sufficient species-specific sequence variation to recommend these short mt COI sequences as species-specific markers. Uniformity of oocyst features (dimensions and oocyst structure) of each pure line was observed. Additional morphological features of the oocysts of these species are described as useful for the microscopic differentiation of these Eimeria species. Combined molecular and morphometric data on these single species lines compared with the original species descriptions and more recent data have helped to clarify some confusing, and sometimes conflicting, features associated with these Eimeria spp. For example, these new data suggest that the KCH and KR strains of E. adenoeides reported previously represent 2 distinct species, E. adenoeides and E. meleagridis, respectively. Likewise, analysis of the Weybridge strain of E. adenoeides, which has long been used as a reference strain in various studies conducted on the pathogenicity of E. adenoeides, indicates that this coccidium is actually a strain of E. gallopavonis. We highly recommend mt COI sequence-based genotyping be incorporated into all studies using Eimeria spp. of turkeys to confirm species identifications and so
Robert T Gaeta
. Furthermore, our microarray analysis did not provide strong evidence that homoeologous rearrangements were a determinant of genome-wide nonadditive gene expression. In light of the inherent limitations of the Arabidopsis microarray to measure gene expression in polyploid Brassicas, further studies are warranted.
D. S. Mikhaylenko
Full Text Available Introduction. Prostate cancer (PCa is one of the common oncological diseases in men. Expression of the PCA3 gene in urine is currently used as a molecular genetic marker of PCa.Objective: to comparative analysis of the PCA3 expression in urine sediments and exosomes for the determination of the biomaterial, which allows detecting the PCA3 expression in more efficient manner.Materials and methods. The 12 patients with different stages of PCa and 8 control samples were examined.Results. The diagnostic accuracy of the PCA3 gene expression analysis in this cohort exceeded 90 %. We had not obtained significant differences in the sensitivity and specificity of the PCA3 hyperexpression in the urine sediments compared with exosomes. This result indicates in favor to using urine sediment for the PCA3 analysis as a biomaterial with less time-consuming sample preparation, although the possible advantage of exosomes for the analysis of the expression marker panels requires further studies.
Repin Mikhail V
Full Text Available Abstract Background The objective of this work is to obtain the correct relative DNA contents of chromosomes in the normal male and female human diploid genomes for the use at FISH analysis of radiation-induced chromosome aberrations. Results The relative DNA contents of chromosomes in the male and female human diploid genomes have been calculated from the publicly available international Human Genome Project data. New sequence-based data on the relative DNA contents of human chromosomes were compared with the data recommended by the International Atomic Energy Agency in 2001. The differences in the values of the relative DNA contents of chromosomes obtained by using different approaches for 15 human chromosomes, mainly for large chromosomes, were below 2%. For the chromosomes 13, 17, 20 and 22 the differences were above 5%. Conclusion New sequence-based data on the relative DNA contents of chromosomes in the normal male and female human diploid genomes were obtained. This approach, based on the genome sequence, can be recommended for the use in radiation molecular cytogenetics.
Adachi, Ryota; Sasaki, Yuko; Morita, Hiromi; Komai, Michio; Shirakawa, Hitoshi; Goto, Tomoko; Furuyama, Akira; Isono, Kunio
Transgenic Drosophila expressing human T2R4 and T2R38 bitter-taste receptors or PKD2L1 sour-taste receptor in the fly gustatory receptor neurons and other tissues were prepared using conventional Gal4/UAS binary system. Molecular analysis showed that the transgene mRNAs are expressed according to the tissue specificity of the Gal4 drivers. Transformants expressing the transgene taste receptors in the fly taste neurons were then studied by a behavioral assay to analyze whether transgene chemoreceptors are functional and coupled to the cell response. Since wild-type flies show strong aversion against the T2R ligands as in mammals, the authors analyzed the transformants where the transgenes are expressed in the fly sugar receptor neurons so that they promote feeding ligand-dependently if they are functional and activate the neurons. Although the feeding preference varied considerably among different strains and individuals, statistical analysis using large numbers of transformants indicated that transformants expressing T2R4 showed a small but significant increase in the preference for denatonium and quinine, the T2R4 ligands, as compared to the control flies, whereas transformants expressing T2R38 did not. Similarly, transformants expressing T2R38 and PKD2L1 also showed a similar preference increase for T2R38-specific ligand phenylthiocarbamide (PTC) and a sour-taste ligand, citric acid, respectively. Taken together, the transformants expressing mammalian taste receptors showed a small but significant increase in the feeding preference that is taste receptor and also ligand dependent. Although future improvements are required to attain performance comparable to the endogenous robust response, Drosophila taste neurons may serve as a potential in vivo heterologous expression system for analyzing chemoreceptor function.
Full Text Available The intracellular protozoan parasite Theileria parva transforms bovine lymphocytes inducing uncontrolled proliferation. Proteins released from the parasite are assumed to contribute to phenotypic changes of the host cell and parasite persistence. With 85 members, genes encoding subtelomeric variable secreted proteins (SVSPs form the largest gene family in T. parva. The majority of SVSPs contain predicted signal peptides, suggesting secretion into the host cell cytoplasm.We analysed SVSP expression in T. parva-transformed cell lines established in vitro by infection of T or B lymphocytes with cloned T. parva parasites. Microarray and quantitative real-time PCR analysis revealed mRNA expression for a wide range of SVSP genes. The pattern of mRNA expression was largely defined by the parasite genotype and not by host background or cell type, and found to be relatively stable in vitro over a period of two months. Interestingly, immunofluorescence analysis carried out on cell lines established from a cloned parasite showed that expression of a single SVSP encoded by TP03_0882 is limited to only a small percentage of parasites. Epitope-tagged TP03_0882 expressed in mammalian cells was found to translocate into the nucleus, a process that could be attributed to two different nuclear localisation signals.Our analysis reveals a complex pattern of Theileria SVSP mRNA expression, which depends on the parasite genotype. Whereas in cell lines established from a cloned parasite transcripts can be found corresponding to a wide range of SVSP genes, only a minority of parasites appear to express a particular SVSP protein. The fact that a number of SVSPs contain functional nuclear localisation signals suggests that proteins released from the parasite could contribute to phenotypic changes of the host cell. This initial characterisation will facilitate future studies on the regulation of SVSP gene expression and the potential biological role of these enigmatic
Li, Fupeng; Wu, Baoduo; Qin, Xiaowei; Yan, Lin; Hao, Chaoyun; Tan, Lehe; Lai, Jianxiong
In this study, we performed cloning and expression analysis of six putative sucrose transporter genes, designated TcSUT1, TcSUT2, TcSUT3, TcSUT4, TcSUT5 and TcSUT6, from the cacao genotype 'TAS-R8'. The combination of cDNA and genomic DNA sequences revealed that the cacao SUT genes contained exon numbers ranging from 1 to 14. The average molecular mass of all six deduced proteins was approximately 56 kDa (range 52 to 66 kDa). All six proteins were predicted to exhibit typical features of sucrose transporters with 12 trans-membrane spanning domains. Phylogenetic analysis revealed that TcSUT2 and TcSUT4 belonged to Group 2 SUT and Group 4 SUT, respectively, and the other TcSUT proteins were belonging to Group 1 SUT. Real-time PCR was conducted to investigate the expression pattern of each member of the SUT family in cacao. Our experiment showed that TcSUT1 was expressed dominantly in pods and that, TcSUT3 and TcSUT4 were highly expressed in both pods and in bark with phloem. Within pods, TcSUT1 and TcSUT4 were expressed more in the seed coat and seed from the pod enlargement stage to the ripening stage. TcSUT5 expression sharply increased to its highest expression level in the seed coat during the ripening stage. Expression pattern analysis indicated that TcSUT genes may be associated with photoassimilate transport into developing seeds and may, therefore, have an impact on seed production. Copyright © 2014 Elsevier B.V. All rights reserved.
Full Text Available Abstract Background In ovo electroporation is a widely used technique to study gene function in developmental biology. Despite the widespread acceptance of this technique, no genome-wide analysis of the effects of in ovo electroporation, principally the current applied across the tissue and exogenous vector DNA introduced, on endogenous gene expression has been undertaken. Here, the effects of electric current and expression of a GFP-containing construct, via electroporation into the midbrain of Hamburger-Hamilton stage 10 chicken embryos, are analysed by microarray. Results Both current alone and in combination with exogenous DNA expression have a small but reproducible effect on endogenous gene expression, changing the expression of the genes represented on the array by less than 0.1% (current and less than 0.5% (current + DNA, respectively. The subset of genes regulated by electric current and exogenous DNA span a disparate set of cellular functions. However, no genes involved in the regional identity were affected. In sharp contrast to this, electroporation of a known transcription factor, Dmrt5, caused a much greater change in gene expression. Conclusions These findings represent the first systematic genome-wide analysis of the effects of in ovo electroporation on gene expression during embryonic development. The analysis reveals that this process has minimal impact on the genetic basis of cell fate specification. Thus, the study demonstrates the validity of the in ovo electroporation technique to study gene function and expression during development. Furthermore, the data presented here can be used as a resource to refine the set of transcriptional responders in future in ovo electroporation studies of specific gene function.
Van Zeveren Alex
Full Text Available Abstract Background Normal preimplantation embryo development encompasses a series of events including first cleavage division, activation of the embryonic genome, compaction and blastocyst formation. First lineage differentiation starts at the blastocyst stage with the formation of the trophectoderm and the inner cell mass. The main objective of this study was the detection, identification and expression analysis of genes associated with blastocyst formation in order to help us better understand this process. This information could lead to improvements of in vitro embryo production procedures. Results A subtractive cDNA library was constructed enriched for transcripts preferentially expressed at the blastocyst stage compared to the 2-cell and 8-cell stage. Sequence information was obtained for 65 randomly selected clones. The RNA expression levels of 12 candidate genes were determined throughout 3 stages of preimplantation embryo development (2-cell, 8-cell and blastocyst and compared with the RNA expression levels of in vivo "golden standard" embryos using real-time PCR. The RNA expression profiles of 9 (75% transcripts (KRT18, FN1, MYL6, ATP1B3, FTH1, HINT1, SLC25A5, ATP6V0B, RPL10 were in agreement with the subtractive cDNA cloning approach, whereas for the remaining 3 (25% (ACTN1, COPE, EEF1A1 the RNA expression level was equal or even higher at the earlier developmental stages compared to the blastocyst stage. Moreover, significant differences in RNA expression levels were observed between in vitro and in vivo produced embryos. By immunofluorescent labelling, the protein expression of KRT18, FN1 and MYL6 was determined throughout bovine preimplantation embryo development and showed the same pattern as the RNA expression analyses. Conclusion By subtractive cDNA cloning, candidate genes involved in blastocyst formation were identified. For several candidate genes, important differences in gene expression were observed between in vivo and in
Schallig Henk DFH
Full Text Available Abstract Background Malaria is one of the most important infectious diseases in the world. Although most cases are found distributed in the tropical regions of Africa, Asia, Central and South Americas, there is in Europe a significant increase in the number of imported cases in non-endemic countries, in particular due to the higher mobility in today's society. Methods The prevalence of a possible asymptomatic infection with Plasmodium species was assessed using Nucleic Acid Sequence Based Amplification (NASBA assays on clinical samples collected from 195 study cases with no clinical signs related to malaria and coming from sub-Saharan African regions to Southern Italy. In addition, base-line demographic, clinical and socio-economic information was collected from study participants who also underwent a full clinical examination. Results Sixty-two study subjects (31.8% were found positive for Plasmodium using a pan Plasmodium specific NASBA which can detect all four Plasmodium species causing human disease, based on the small subunit 18S rRNA gene (18S NASBA. Twenty-four samples (38% of the 62 18S NASBA positive study cases were found positive with a Pfs25 mRNA NASBA, which is specific for the detection of gametocytes of Plasmodium falciparum. A statistically significant association was observed between 18S NASBA positivity and splenomegaly, hepatomegaly and leukopaenia and country of origin. Conclusion This study showed that a substantial proportion of people originating from malaria endemic countries harbor malaria parasites in their blood. If transmission conditions are available, they could potentially be a reservoir. Thefore, health authorities should pay special attention to the health of this potential risk group and aim to improve their health conditions.
Schoone, G. J.; Oskam, L.; Kroon, N. C.; Schallig, H. D.; Omar, S. A.
A quantitative nucleic acid sequence-based amplification (QT-NASBA) assay for the detection of Plasmodium parasites has been developed. Primers and probes were selected on the basis of the sequence of the small-subunit rRNA gene. Quantification was achieved by coamplification of the RNA in the
Ivan G. Costa
Full Text Available This work performs a data driven comparative study of clustering methods used in the analysis of gene expression time courses (or time series. Five clustering methods found in the literature of gene expression analysis are compared: agglomerative hierarchical clustering, CLICK, dynamical clustering, k-means and self-organizing maps. In order to evaluate the methods, a k-fold cross-validation procedure adapted to unsupervised methods is applied. The accuracy of the results is assessed by the comparison of the partitions obtained in these experiments with gene annotation, such as protein function and series classification.
Full Text Available A polymerase chain reaction (PCR assay was developed to test for tumor cell specific expression of the BCEI gene. This new marker gene, reported at first for human breast cancer, was found specifically active in various gastrointestinal carcinomas by previously applying immunohistochemistry and RNA (Northern blot analysis. Presently, by using reverse transcription -PCR analysis, a series of primary tumor tissues and established tumor cell lines were testcd for BCEI transcription. This approach was compared to immunostaining achieved by an antibody directed against the BCEI gene’s product. The result demonstrate the superior sensitivity of PCR by indicating the gene’ s expression in cases where immunohistochemical testing remained negative.
Full Text Available Abstract Background Uncharacterized proteases naturally expressed by bacterial pathogens represents important topic in infectious disease research, because these enzymes may have critical roles in pathogenicity and cell physiology. It has been observed that cloning, expression and purification of proteases often fail due to their catalytic functions which, in turn, cause toxicity in the E. coli heterologous host. Results In order to address this problem systematically, a modified pipeline of our high-throughput protein expression and purification platform was developed. This included the use of a specific E. coli strain, BL21(DE3 pLysS to tightly control the expression of recombinant proteins and various expression vectors encoding fusion proteins to enhance recombinant protein solubility. Proteases fused to large fusion protein domains, maltosebinding protein (MBP, SP-MBP which contains signal peptide at the N-terminus of MBP, disulfide oxidoreductase (DsbA and Glutathione S-transferase (GST improved expression and solubility of proteases. Overall, 86.1% of selected protease genes including hypothetical proteins were expressed and purified using a combination of five different expression vectors. To detect novel proteolytic activities, zymography and fluorescence-based assays were performed and the protease activities of more than 46% of purified proteases and 40% of hypothetical proteins that were predicted to be proteases were confirmed. Conclusions Multiple expression vectors, employing distinct fusion tags in a high throughput pipeline increased overall success rates in expression, solubility and purification of proteases. The combinatorial functional analysis of the purified proteases using fluorescence assays and zymography confirmed their function.
Full Text Available Abstract Background In a previous screen to identify differentially expressed genes associated with embryonic development, the porcine PNAS-4 gene had been found. Considering differentially expressed genes in early stages of muscle development are potential candidate genes to improve meat quality and production efficiency, we determined how porcine PNAS-4 gene regulates meat production. Therefore, this gene has been sequenced, expression analyzed and associated with meat production traits. Results We cloned the full-length cDNA of porcine PNAS-4 gene encoding a protein of 194 amino acids which was expressed in the Golgi complex. This gene was mapped to chromosome 10, q11–16, in a region of conserved synteny with human chromosome 1 where the human homologous gene was localized. Real-time PCR revealed that PNAS-4 mRNA was widely expressed with highest expression levels in skeletal muscle followed by lymph, liver and other tissues, and showed a down-regulated expression pattern during prenatal development while a up-regulated expression pattern after weaning. Association analysis revealed that allele C of SNP A1813C was prevalent in Chinese indigenous breeds whereas A was dominant allele in Landrace and Large White, and the pigs with homozygous CC had a higher fat content than those of the pigs with other genotypes (P Conclusion Porcine PNAS-4 protein tagged with green fluorescent protein accumulated in the Golgi complex, and its mRNA showed a widespread expression across many tissues and organs in pigs. It may be an important factor affecting the meat production efficiency, because its down-regulated expression pattern during early embryogenesis suggests involvement in increase of muscle fiber number. In addition, the SNP A1813C associated with fat traits might be a genetic marker for molecular-assisted selection in animal breeding.
Full Text Available Efforts to unravel the mechanisms underlying taste sensation (gustation have largely focused on rodents. Here we present the first comprehensive characterization of gene expression in primate taste buds. Our findings reveal unique new insights into the biology of taste buds. We generated a taste bud gene expression database using laser capture microdissection (LCM procured fungiform (FG and circumvallate (CV taste buds from primates. We also used LCM to collect the top and bottom portions of CV taste buds. Affymetrix genome wide arrays were used to analyze gene expression in all samples. Known taste receptors are preferentially expressed in the top portion of taste buds. Genes associated with the cell cycle and stem cells are preferentially expressed in the bottom portion of taste buds, suggesting that precursor cells are located there. Several chemokines including CXCL14 and CXCL8 are among the highest expressed genes in taste buds, indicating that immune system related processes are active in taste buds. Several genes expressed specifically in endocrine glands including growth hormone releasing hormone and its receptor are also strongly expressed in taste buds, suggesting a link between metabolism and taste. Cell type-specific expression of transcription factors and signaling molecules involved in cell fate, including KIT, reveals the taste bud as an active site of cell regeneration, differentiation, and development. IKBKAP, a gene mutated in familial dysautonomia, a disease that results in loss of taste buds, is expressed in taste cells that communicate with afferent nerve fibers via synaptic transmission. This database highlights the power of LCM coupled with transcriptional profiling to dissect the molecular composition of normal tissues, represents the most comprehensive molecular analysis of primate taste buds to date, and provides a foundation for further studies in diverse aspects of taste biology.
GERLSMA, C; VANDERLUBBE, PM; VANNIEUWENHUIZEN, C
When the factor structure and psychometric qualities of the Level of Expressed Emotion scale, an instrument intended to assess patient's perceptions of expressed emotion, were evaluated, three moderately intercorrelated factors emerged, with good internal consistency; these were lack of emotional
Dhavala, Soma S.
Massively Parallel Signature Sequencing (MPSS) is a high-throughput, counting-based technology available for gene expression profiling. It produces output that is similar to Serial Analysis of Gene Expression and is ideal for building complex relational databases for gene expression. Our goal is to compare the in vivo global gene expression profiles of tissues infected with different strains of Salmonella obtained using the MPSS technology. In this article, we develop an exact ANOVA type model for this count data using a zero-inflatedPoisson distribution, different from existing methods that assume continuous densities. We adopt two Bayesian hierarchical models-one parametric and the other semiparametric with a Dirichlet process prior that has the ability to "borrow strength" across related signatures, where a signature is a specific arrangement of the nucleotides, usually 16-21 base pairs long. We utilize the discreteness of Dirichlet process prior to cluster signatures that exhibit similar differential expression profiles. Tests for differential expression are carried out using nonparametric approaches, while controlling the false discovery rate. We identify several differentially expressed genes that have important biological significance and conclude with a summary of the biological discoveries. This article has supplementary materials online. © 2010 American Statistical Association.
Full Text Available To further understand the potential expression relationships of miRNAs in miRNA gene clusters and gene families, a global analysis was performed in 4 paired tumor (breast cancer and adjacent normal tissue samples using deep sequencing datasets. The compositions of miRNA gene clusters and families are not random, and clustered and homologous miRNAs may have close relationships with overlapped miRNA species. Members in the miRNA group always had various expression levels, and even some showed larger expression divergence. Despite the dynamic expression as well as individual difference, these miRNAs always indicated consistent or similar deregulation patterns. The consistent deregulation expression may contribute to dynamic and coordinated interaction between different miRNAs in regulatory network. Further, we found that those clustered or homologous miRNAs that were also identified as sense and antisense miRNAs showed larger expression divergence. miRNA gene clusters and families indicated important biological roles, and the specific distribution and expression further enrich and ensure the flexible and robust regulatory network.
Darias, M J; Zambonino-Infante, J L; Hugot, K; Cahu, C L; Mazurais, D
During the larval period, marine teleosts undergo very fast growth and dramatic changes in morphology, metabolism, and behavior to accomplish their metamorphosis into juvenile fish. Regulation of gene expression is widely thought to be a key mechanism underlying the management of the biological processes required for harmonious development over this phase of life. To provide an overall analysis of gene expression in the whole body during sea bass larval development, we monitored the expression of 6,626 distinct genes at 10 different points in time between 7 and 43 days post-hatching (dph) by using heterologous hybridization of a rainbow trout cDNA microarray. The differentially expressed genes (n = 485) could be grouped into two categories: genes that were generally up-expressed early, between 7 and 23 dph, and genes up-expressed between 25 and 43 dph. Interestingly, among the genes regulated during the larval period, those related to organogenesis, energy pathways, biosynthesis, and digestion were over-represented compared with total set of analyzed genes. We discuss the quantitative regulation of whole-body contents of these specific transcripts with regard to the ontogenesis and maturation of essential functions that take place over larval development. Our study is the first utilization of a transcriptomic approach in sea bass and reveals dynamic changes in gene expression patterns in relation to marine finfish larval development.
Li, Wen-Xing; Dai, Shao-Xing; Liu, Jia-Qian; Wang, Qian; Li, Gong-Hua; Huang, Jing-Fei
Alzheimer's disease (AD) and schizophrenia (SZ) are both accompanied by impaired learning and memory functions. This study aims to explore the expression profiles of learning or memory genes between AD and SZ. We downloaded 10 AD and 10 SZ datasets from GEO-NCBI for integrated analysis. These datasets were processed using RMA algorithm and a global renormalization for all studies. Then Empirical Bayes algorithm was used to find the differentially expressed genes between patients and controls. The results showed that most of the differentially expressed genes were related to AD whereas the gene expression profile was little affected in the SZ. Furthermore, in the aspects of the number of differentially expressed genes, the fold change and the brain region, there was a great difference in the expression of learning or memory related genes between AD and SZ. In AD, the CALB1, GABRA5, and TAC1 were significantly downregulated in whole brain, frontal lobe, temporal lobe, and hippocampus. However, in SZ, only two genes CRHBP and CX3CR1 were downregulated in hippocampus, and other brain regions were not affected. The effect of these genes on learning or memory impairment has been widely studied. It was suggested that these genes may play a crucial role in AD or SZ pathogenesis. The different gene expression patterns between AD and SZ on learning and memory functions in different brain regions revealed in our study may help to understand the different mechanism between two diseases.
Chervonsky, Elizabeth; Hunt, Caroline
Emotion expression is critical for the communication of important social information, such as emotional states and behavioral intentions. However, people tend to vary in their level of emotional expression. This meta-analysis investigated the relationships between levels of emotion expression and suppression, and social and interpersonal outcomes. PsycINFO databases, as well as reference lists were searched. Forty-three papers from a total of 3,200 papers met inclusion criteria, allowing for 105 effect sizes to be calculated. Meta-analyses revealed that greater suppression of emotion was significantly associated with poorer social wellbeing, including more negative first impressions, lower social support, lower social satisfaction and quality, and poorer romantic relationship quality. Furthermore, the expression of positive and general/nonspecific emotion was related to better social outcomes, while the expression of anger was associated with poorer social wellbeing. Expression of negative emotion generally was also associated with poorer social outcomes, although this effect size was very small and consisted of mixed results. These findings highlight the importance of considering the role that regulation of emotional expression can play in the development of social dysfunction and interpersonal problems. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Babenko Vladimir N.
Full Text Available ApoE expression status was proved to be a highly specific marker of energy metabolism rate in the brain. Along with its neighbor, Translocase of Outer Mitochondrial Membrane 40 kDa (TOMM40 which is involved in mitochondrial metabolism, the corresponding genomic region constitutes the neuroenergetic hotspot. Using RNA-Seq data from a murine model of chronic stress a significant positive expression coordination of seven neighboring genes in ApoE locus in five brain regions was observed. ApoE maintains one of the highest absolute expression values genome-wide, implying that ApoE can be the driver of the neighboring gene expression alteration observed under stressful loads. Notably, we revealed the highly statistically significant increase of ApoE expression in the hypothalamus of chronically aggressive (FDR < 0.007 and defeated (FDR < 0.001 mice compared to the control. Correlation analysis revealed a close association of ApoE and proopiomelanocortin (Pomc gene expression profiles implying the putative neuroendocrine stress response background of ApoE expression elevation therein.
Burland, Timothy G.; Schedl, Tim; Gull, Keith; Dove, William F.
Physarum displays two vegetative cell types, uninucleate myxamoebae and multinucleate plasmodia. Mutant myxamoebae of Physarum resistant to the antitubulin drug methylbenzimidazole-2-yl-carbamate (MBC) were isolated. All mutants tested were cross-resistant to other benzimidazoles but not to cycloheximide or emetine. Genetic analysis showed that mutation to MBC resistance can occur at any one of four unlinked loci, benA, benB, benC or benD. MBC resistance of benB and benD mutants was expressed in plasmodia, but benA and benC mutant plasmodia were MBC sensitive, suggesting that benA and benC encode myxamoeba-specific products. Myxamoebae carrying the recessive benD210 mutation express a β-tubulin with noval electrophoretic mobility, in addition to a β-tubulin with wild-type mobility. This and other evidence indicates that benD is a structural gene for β-tubulin, and that at least two β-tubulin genes are expressed in myxamoebae. Comparisons of the β-tubulins of wildtype and benD210 strains by gel electrophoresis revealed that, of the three (or more) β-tubulin genes expressed in Physarum, one, benD, is expressed in both myxamoebae and plasmodia, one is expressed specifically in myxamoebae and one is expressed specifically in plasmodia. However, mutation in only one gene, benD, is sufficient to confer MBC resistance on both myxamoebae and plasmodia. PMID:6479584
Bedeloglu, Merve; Topcu, Çagdas; Akgul, Arzu; Döger, Ela Naz; Sever, Refik; Ozkan, Ozlenen; Ozkan, Omer; Uysal, Hilmi; Polat, Ovunc; Çolak, Omer Halil
In this study, it is aimed to determine the degree of the development in emotional expression of full face transplant patients from photographs. Hence, a rehabilitation process can be planned according to the determination of degrees as a later work. As envisaged, in full face transplant cases, the determination of expressions can be confused or cannot be achieved as the healthy control group. In order to perform image-based analysis, a control group consist of 9 healthy males and 2 full-face transplant patients participated in the study. Appearance-based Gabor Wavelet Transform (GWT) and Local Binary Pattern (LBP) methods are adopted for recognizing neutral and 6 emotional expressions which consist of angry, scared, happy, hate, confused and sad. Feature extraction was carried out by using both methods and combination of these methods serially. In the performed expressions, the extracted features of the most distinct zones in the facial area where the eye and mouth region, have been used to classify the emotions. Also, the combination of these region features has been used to improve classifier performance. Control subjects and transplant patients' ability to perform emotional expressions have been determined with K-nearest neighbor (KNN) classifier with region-specific and method-specific decision stages. The results have been compared with healthy group. It has been observed that transplant patients don't reflect some emotional expressions. Also, there were confusions among expressions.
Dhavala, Soma S.; Datta, Sujay; Mallick, Bani K.; Carroll, Raymond J.; Khare, Sangeeta; Lawhon, Sara D.; Adams, L. Garry
Massively Parallel Signature Sequencing (MPSS) is a high-throughput, counting-based technology available for gene expression profiling. It produces output that is similar to Serial Analysis of Gene Expression and is ideal for building complex relational databases for gene expression. Our goal is to compare the in vivo global gene expression profiles of tissues infected with different strains of Salmonella obtained using the MPSS technology. In this article, we develop an exact ANOVA type model for this count data using a zero-inflatedPoisson distribution, different from existing methods that assume continuous densities. We adopt two Bayesian hierarchical models-one parametric and the other semiparametric with a Dirichlet process prior that has the ability to "borrow strength" across related signatures, where a signature is a specific arrangement of the nucleotides, usually 16-21 base pairs long. We utilize the discreteness of Dirichlet process prior to cluster signatures that exhibit similar differential expression profiles. Tests for differential expression are carried out using nonparametric approaches, while controlling the false discovery rate. We identify several differentially expressed genes that have important biological significance and conclude with a summary of the biological discoveries. This article has supplementary materials online. © 2010 American Statistical Association.
Madeira Sara C
Full Text Available Abstract Background The ability to monitor changes in expression patterns over time, and to observe the emergence of coherent temporal responses using expression time series, is critical to advance our understanding of complex biological processes. Biclustering has been recognized as an effective method for discovering local temporal expression patterns and unraveling potential regulatory mechanisms. The general biclustering problem is NP-hard. In the case of time series this problem is tractable, and efficient algorithms can be used. However, there is still a need for specialized applications able to take advantage of the temporal properties inherent to expression time series, both from a computational and a biological perspective. Findings BiGGEsTS makes available state-of-the-art biclustering algorithms for analyzing expression time series. Gene Ontology (GO annotations are used to assess the biological relevance of the biclusters. Methods for preprocessing expression time series and post-processing results are also included. The analysis is additionally supported by a visualization module capable of displaying informative representations of the data, including heatmaps, dendrograms, expression charts and graphs of enriched GO terms. Conclusion BiGGEsTS is a free open source graphical software tool for revealing local coexpression of genes in specific intervals of time, while integrating meaningful information on gene annotations. It is freely available at: http://kdbio.inesc-id.pt/software/biggests. We present a case study on the discovery of transcriptional regulatory modules in the response of Saccharomyces cerevisiae to heat stress.
Gur-Dedeoglu, Bala; Konu, Ozlen; Kir, Serkan; Ozturk, Ahmet Rasit; Bozkurt, Betul; Ergul, Gulusan; Yulug, Isik G
Accuracy in the diagnosis of breast cancer and classification of cancer subtypes has improved over the years with the development of well-established immunohistopathological criteria. More recently, diagnostic gene-sets at the mRNA expression level have been tested as better predictors of disease state. However, breast cancer is heterogeneous in nature; thus extraction of differentially expressed gene-sets that stably distinguish normal tissue from various pathologies poses challenges. Meta-analysis of high-throughput expression data using a collection of statistical methodologies leads to the identification of robust tumor gene expression signatures. A resampling-based meta-analysis strategy, which involves the use of resampling and application of distribution statistics in combination to assess the degree of significance in differential expression between sample classes, was developed. Two independent microarray datasets that contain normal breast, invasive ductal carcinoma (IDC), and invasive lobular carcinoma (ILC) samples were used for the meta-analysis. Expression of the genes, selected from the gene list for classification of normal breast samples and breast tumors encompassing both the ILC and IDC subtypes were tested on 10 independent primary IDC samples and matched non-tumor controls by real-time qRT-PCR. Other existing breast cancer microarray datasets were used in support of the resampling-based meta-analysis. The two independent microarray studies were found to be comparable, although differing in their experimental methodologies (Pearson correlation coefficient, R = 0.9389 and R = 0.8465 for ductal and lobular samples, respectively). The resampling-based meta-analysis has led to the identification of a highly stable set of genes for classification of normal breast samples and breast tumors encompassing both the ILC and IDC subtypes. The expression results of the selected genes obtained through real-time qRT-PCR supported the meta-analysis results. The
Full Text Available Abstract Background A major challenge in the interpretation of genomic profiling data generated from breast cancer samples is the identification of driver genes as distinct from bystander genes which do not impact tumorigenesis. One way to assess the relative importance of alterations in the transcriptome profile is to combine parallel analyses that assess changes in the copy number alterations (CNAs. This integrated analysis permits the identification of genes with altered expression that map within specific chromosomal regions which demonstrate copy number alterations, providing a mechanistic approach to identify the 'driver genes'. Methods We have performed whole genome analysis of CNAs using the Affymetrix 250K Mapping array on 22 infiltrating ductal carcinoma samples (IDCs. Analysis of transcript expression alterations was performed using the Affymetrix U133 Plus2.0 array on 16 IDC samples. Fourteen IDC samples were analyzed using both platforms and the data integrated. We also incorporated data from loss of heterozygosity (LOH analysis to identify genes showing altered expression in LOH regions. Results Common chromosome gains and amplifications were identified at 1q21.3, 6p21.3, 7p11.2-p12.1, 8q21.11 and 8q24.3. A novel amplicon was identified at 5p15.33. Frequent losses were found at 1p36.22, 8q23.3, 11p13, 11q23, and 22q13. Over 130 genes were identified with concurrent increases or decreases in expression that mapped to these regions of copy number alterations. LOH analysis revealed three tumors with whole chromosome or p arm allelic loss of chromosome 17. Genes were identified that mapped to copy neutral LOH regions. LOH with accompanying copy loss was detected on Xp24 and Xp25 and genes mapping to these regions with decreased expression were identified. Gene expression data highlighted the PPARα/RXRα Activation Pathway as down-regulated in the tumor samples. Conclusion We have demonstrated the utility of the application of
Full Text Available Abstract Background Accuracy in the diagnosis of breast cancer and classification of cancer subtypes has improved over the years with the development of well-established immunohistopathological criteria. More recently, diagnostic gene-sets at the mRNA expression level have been tested as better predictors of disease state. However, breast cancer is heterogeneous in nature; thus extraction of differentially expressed gene-sets that stably distinguish normal tissue from various pathologies poses challenges. Meta-analysis of high-throughput expression data using a collection of statistical methodologies leads to the identification of robust tumor gene expression signatures. Methods A resampling-based meta-analysis strategy, which involves the use of resampling and application of distribution statistics in combination to assess the degree of significance in differential expression between sample classes, was developed. Two independent microarray datasets that contain normal breast, invasive ductal carcinoma (IDC, and invasive lobular carcinoma (ILC samples were used for the meta-analysis. Expression of the genes, selected from the gene list for classification of normal breast samples and breast tumors encompassing both the ILC and IDC subtypes were tested on 10 independent primary IDC samples and matched non-tumor controls by real-time qRT-PCR. Other existing breast cancer microarray datasets were used in support of the resampling-based meta-analysis. Results The two independent microarray studies were found to be comparable, although differing in their experimental methodologies (Pearson correlation coefficient, R = 0.9389 and R = 0.8465 for ductal and lobular samples, respectively. The resampling-based meta-analysis has led to the identification of a highly stable set of genes for classification of normal breast samples and breast tumors encompassing both the ILC and IDC subtypes. The expression results of the selected genes obtained through real
Bazou, Despina; Kearney, Roisin; Mansergh, Fiona; Bourdon, Celine; Farrar, Jane; Wride, Michael
In the present paper, gene expression analysis of mouse embryonic stem (ES) cells levitated in a novel ultrasound standing wave trap (USWT) (Bazou et al. 2005a) at variable acoustic pressures (0.08-0.85 MPa) and times (5-60 min) was performed. Our results showed that levitation of ES cells at the highest employed acoustic pressure for 60 min does not modify gene expression and cells maintain their pluripotency. Embryoid bodies (EBs) also expressed the early and late neural differentiation markers, which were also unaffected by the acoustic field. Our results suggest that the ultrasound trap microenvironment is minimally invasive as the biologic consequences of ES cell replication and EB differentiation proceed without significantly affecting gene expression. The technique holds great promise in safe cell manipulation techniques for a variety of applications including tissue engineering and regenerative medicine. Copyright © 2011 World Federation for Ultrasound in Medicine & Biology. Published by Elsevier Inc. All rights reserved.
Zhang Linbi; Rong Tingzhao; Pan Guangtang; Cao Moju
The differential expression of male sterility induced by space flight with male fertility was studied using cDNA-AFLP technology. Total RNA was isolated from anther of male sterility and male fertility. Nine differential expression cDNA fragments were obtained with 16 primer combinations. The differential cDNA fragments were eluted, cloned and sequenced. Then half-quantitative RT-PCR was used to stuy the differential expressions of 4 development stages between sterility and fertility. Sequencing analysis shown 2 fragments from male sterility might be novel genes. Four fragments from male fertility were homology as chalcone and stilbene synthases, putative acyl CoA dehydrogenase, putative protein kinases and putative glycine decarboxylase. All these proteins might participate in the energy metabolisms, substance metabolisms or signal pollen development, Z8 took on increasing expression during the middle period of pollen development. These results just met the demand of more energy and more substance during the pollen development. (authors)
Zhou, Xiaobo; Qiu, Weiliang; Sathirapongsasuti, J. Fah.; Cho, Michael H.; Mancini, John D.; Lao, Taotao; Thibault, Derek M.; Litonjua, Gus; Bakke, Per S.; Gulsvik, Amund; Lomas, David A.; Beaty, Terri H.; Hersh, Craig P.; Anderson, Christopher; Geigenmuller, Ute; Raby, Benjamin A.; Rennard, Stephen I.; Perrella, Mark A.; Choi, Augustine M.K.; Quackenbush, John; Silverman, Edwin K.
Hedgehog Interacting Protein (HHIP) was implicated in chronic obstructive pulmonary disease (COPD) by genome-wide association studies (GWAS). However, it remains unclear how HHIP contributes to COPD pathogenesis. To identify genes regulated by HHIP, we performed gene expression microarray analysis in a human bronchial epithelial cell line (Beas-2B) stably infected with HHIP shRNAs. HHIP silencing led to differential expression of 296 genes; enrichment for variants nominally associated with COPD was found. Eighteen of the differentially expressed genes were validated by real-time PCR in Beas-2B cells. Seven of 11 validated genes tested in human COPD and control lung tissues demonstrated significant gene expression differences. Functional annotation indicated enrichment for extracellular matrix and cell growth genes. Network modeling demonstrated that the extracellular matrix and cell proliferation genes influenced by HHIP tended to be interconnected. Thus, we identified potential HHIP targets in human bronchial epithelial cells that may contribute to COPD pathogenesis. PMID:23459001
Fomekong-Nanfack, Y.; Postma, M.; Kaandorp, J.A.
Abstract Background Inference of gene regulatory networks (GRNs) requires accurate data, a method to simulate the expression patterns and an efficient optimization algorithm to estimate the unknown parameters. Using this approach it is possible to obtain alternative circuits without making any a priori assumptions about the interactions, which all simulate the observed patterns. It is important to analyze the properties of the circuits. Findings We have analyzed the simulated gene expression ...
Zhi, Ruicong; Cao, Lianyu; Cao, Gang
Growing evidence shows that consumer choices in real life are mostly driven by unconscious mechanisms rather than conscious. The unconscious process could be measured by behavioral measurements. This study aims to apply automatic facial expression analysis technique for consumers' emotion representation, and explore the relationships between sensory perception and facial responses. Basic taste solutions (sourness, sweetness, bitterness, umami, and saltiness) with 6 levels plus water were used, which could cover most of the tastes found in food and drink. The other contribution of this study is to analyze the characteristics of facial expressions and correlation between facial expressions and perceptive hedonic liking for Asian consumers. Up until now, the facial expression application researches only reported for western consumers, while few related researches investigated the facial responses during food consuming for Asian consumers. Experimental results indicated that facial expressions could identify different stimuli with various concentrations and different hedonic levels. The perceived liking increased at lower concentrations and decreased at higher concentrations, while samples with medium concentrations were perceived as the most pleasant except sweetness and bitterness. High correlations were founded between perceived intensities of bitterness, umami, saltiness, and facial reactions of disgust and fear. Facial expression disgust and anger could characterize emotion "dislike," and happiness could characterize emotion "like," while neutral could represent "neither like nor dislike." The identified facial expressions agree with the perceived sensory emotions elicited by basic taste solutions. The correlation analysis between hedonic levels and facial expression intensities obtained in this study are in accordance with that discussed for western consumers. © 2017 Institute of Food Technologists®.
Full Text Available Abstract Background Apple fruit develop over a period of 150 days from anthesis to fully ripe. An array representing approximately 13000 genes (15726 oligonucleotides of 45–55 bases designed from apple ESTs has been used to study gene expression over eight time points during fruit development. This analysis of gene expression lays the groundwork for a molecular understanding of fruit growth and development in apple. Results Using ANOVA analysis of the microarray data, 1955 genes showed significant changes in expression over this time course. Expression of genes is coordinated with four major patterns of expression observed: high in floral buds; high during cell division; high when starch levels and cell expansion rates peak; and high during ripening. Functional analysis associated cell cycle genes with early fruit development and three core cell cycle genes are significantly up-regulated in the early stages of fruit development. Starch metabolic genes were associated with changes in starch levels during fruit development. Comparison with microarrays of ethylene-treated apple fruit identified a group of ethylene induced genes also induced in normal fruit ripening. Comparison with fruit development microarrays in tomato has been used to identify 16 genes for which expression patterns are similar in apple and tomato and these genes may play fundamental roles in fruit development. The early phase of cell division and tissue specification that occurs in the first 35 days after pollination has been associated with up-regulation of a cluster of genes that includes core cell cycle genes. Conclusion Gene expression in apple fruit is coordinated with specific developmental stages. The array results are reproducible and comparisons with experiments in other species has been used to identify genes that may play a fundamental role in fruit development.
Full Text Available Genome-wide dissection of the heat stress response (HSR is necessary to overcome problems in crop production caused by global warming. To identify HSR genes, we profiled gene expression in two Chinese cabbage inbred lines with different thermotolerances, Chiifu and Kenshin. Many genes exhibited >2-fold changes in expression upon exposure to 0.5- 4 h at 45°C (high temperature, HT: 5.2% (2,142 genes in Chiifu and 3.7% (1,535 genes in Kenshin. The most enriched GO (Gene Ontology items included 'response to heat', 'response to reactive oxygen species (ROS', 'response to temperature stimulus', 'response to abiotic stimulus', and 'MAPKKK cascade'. In both lines, the genes most highly induced by HT encoded small heat shock proteins (Hsps and heat shock factor (Hsf-like proteins such as HsfB2A (Bra029292, whereas high-molecular weight Hsps were constitutively expressed. Other upstream HSR components were also up-regulated: ROS-scavenging genes like glutathione peroxidase 2 (BrGPX2, Bra022853, protein kinases, and phosphatases. Among heat stress (HS marker genes in Arabidopsis, only exportin 1A (XPO1A (Bra008580, Bra006382 can be applied to B. rapa for basal thermotolerance (BT and short-term acquired thermotolerance (SAT gene. CYP707A3 (Bra025083, Bra021965, which is involved in the dehydration response in Arabidopsis, was associated with membrane leakage in both lines following HS. Although many transcription factors (TF genes, including DREB2A (Bra005852, were involved in HS tolerance in both lines, Bra024224 (MYB41 and Bra021735 (a bZIP/AIR1 [Anthocyanin-Impaired-Response-1] were specific to Kenshin. Several candidate TFs involved in thermotolerance were confirmed as HSR genes by real-time PCR, and these assignments were further supported by promoter analysis. Although some of our findings are similar to those obtained using other plant species, clear differences in Brassica rapa reveal a distinct HSR in this species. Our data could also provide a
Crist, Courtney Alissa
Sensory and consumer sciences aim to understand the influences of product acceptability and purchase decisions. The food industry measures product acceptability through hedonic testing but often does not assess implicit or qualitative response. Incorporation of qualitative research and automated facial expression analysis (AFEA) may supplement hedonic acceptability testing to provide product insights. The purpose of this research was to assess the application of AFEA and qualitative analysis ...
Boris P Hejblum
Full Text Available Gene set analysis methods, which consider predefined groups of genes in the analysis of genomic data, have been successfully applied for analyzing gene expression data in cross-sectional studies. The time-course gene set analysis (TcGSA introduced here is an extension of gene set analysis to longitudinal data. The proposed method relies on random effects modeling with maximum likelihood estimates. It allows to use all available repeated measurements while dealing with unbalanced data due to missing at random (MAR measurements. TcGSA is a hypothesis driven method that identifies a priori defined gene sets with significant expression variations over time, taking into account the potential heterogeneity of expression within gene sets. When biological conditions are compared, the method indicates if the time patterns of gene sets significantly differ according to these conditions. The interest of the method is illustrated by its application to two real life datasets: an HIV therapeutic vaccine trial (DALIA-1 trial, and data from a recent study on influenza and pneumococcal vaccines. In the DALIA-1 trial TcGSA revealed a significant change in gene expression over time within 69 gene sets during vaccination, while a standard univariate individual gene analysis corrected for multiple testing as well as a standard a Gene Set Enrichment Analysis (GSEA for time series both failed to detect any significant pattern change over time. When applied to the second illustrative data set, TcGSA allowed the identification of 4 gene sets finally found to be linked with the influenza vaccine too although they were found to be associated to the pneumococcal vaccine only in previous analyses. In our simulation study TcGSA exhibits good statistical properties, and an increased power compared to other approaches for analyzing time-course expression patterns of gene sets. The method is made available for the community through an R package.
Provart Nicholas J
Full Text Available Abstract Background Sequencing of the first plant genomes has revealed that cytochromes P450 have evolved to become the largest family of enzymes in secondary metabolism. The proportion of P450 enzymes with characterized biochemical function(s is however very small. If P450 diversification mirrors evolution of chemical diversity, this points to an unexpectedly poor understanding of plant metabolism. We assumed that extensive analysis of gene expression might guide towards the function of P450 enzymes, and highlight overlooked aspects of plant metabolism. Results We have created a comprehensive database, 'CYPedia', describing P450 gene expression in four data sets: organs and tissues, stress response, hormone response, and mutants of Arabidopsis thaliana, based on public Affymetrix ATH1 microarray expression data. P450 expression was then combined with the expression of 4,130 re-annotated genes, predicted to act in plant metabolism, for co-expression analyses. Based on the annotation of co-expressed genes from diverse pathway annotation databases, co-expressed pathways were identified. Predictions were validated for most P450s with known functions. As examples, co-expression results for P450s related to plastidial functions/photosynthesis, and to phenylpropanoid, triterpenoid and jasmonate metabolism are highlighted here. Conclusion The large scale hypothesis generation tools presented here provide leads to new pathways, unexpected functions, and regulatory networks for many P450s in plant metabolism. These can now be exploited by the community to validate the proposed functions experimentally using reverse genetics, biochemistry, and metabolic profiling.
Full Text Available Tao Wu,1 Min Jiao,1 Li Jing,1 Min-Cong Wang,1 Hai-Feng Sun,2 Qing Li,1 Yi-Yang Bai,1 Yong-Chang Wei,1 Ke-Jun Nan,1 Hui Guo1 1Department of Medical Oncology, The First Affiliated Hospital of Xi’an Jiaotong University, 2Department of Oncology, Shaanxi Cancer Hospital, Xi’an, People’s Republic of China Abstract: Association of Notch-1 expression with prognosis of patients with hepatocellular carcinoma (HCC remains controversial. We conducted a meta-analysis to reevaluate the association of Notch-1 expression with clinicopathological characteristics and prognosis of HCC. PubMed, Embase, Web of Science, and China National Knowledge Infrastructure were searched to look for relevant studies. The association between Notch-1 expression and clinicopathological parameters and overall survival (OS was then reassessed using the meta-analysis for odds ratio (OR or hazard ratio (HR and 95% confidence interval (CI. A total of seven studies, including 810 HCC patients, were eligible for the meta-analysis. Our data showed that high Notch-1 expression was able to predict poor OS (HR 1.50, 95% CI 1.17–1.83, P=0.0001. The pooled OR showed that high Notch-1 expression was significantly associated with tumor metastasis (OR 0.37, 95% CI 0.16–0.86, P=0.02 and tumor size >5 cm (OR 0.48, 95% CI 0.26–0.88, P=0.02. In contrast, there was no association between high Notch-1 expression and tumor differentiation, late TNM stage, tumor number, and portal vein invasion of HCC. In conclusion, Notch-1 overexpression might predict poorer survival and more aggressive behavior in patients with HCC. Keywords: hepatocellular carcinoma, Notch-1, prognosis, clinicopathological features, meta-analysis
Sheng, Yue; Zhao, Wei; Song, Ying; Li, Zhigang; Luo, Majing; Lei, Quan; Cheng, Hanhua; Zhou, Rongjia
A variety of mechanisms are engaged in sex determination in vertebrates. The teleost fish swamp eel undergoes sex reversal naturally and is an ideal model for vertebrate sexual development. However, the importance of proteome-wide scanning for gonad reversal was not previously determined. We report a 2-D electrophoresis analysis of three gonad types of proteomes during sex reversal. MS/MS analysis revealed a group of differentially expressed proteins during ovary to ovotestis to testis transf...
Thomassen, Mads; Jochumsen, Kirsten M; Mogensen, Ole
the relation of gene expression and chromosomal position to identify chromosomal regions of importance for early recurrence of ovarian cancer. By use of *Gene Set Enrichment Analysis*, we have ranked chromosomal regions according to their association to survival. Over-representation analysis including 1...... using death (P = 0.015) and recurrence (P = 0.002) as outcome. The combined mutation score is strongly associated to upregulation of several growth factor pathways....
Thomassen, Mads; Tan, Qihua; Kruse, Torben
ABSTRACT: BACKGROUND: Metastasis is believed to progress in several steps including different pathways but the determination and understanding of these mechanisms is still fragmentary. Microarray analysis of gene expression patterns in breast tumors has been used to predict outcome in recent stud...
the first report of expression analysis of all CsDCL, CsAGO and CsRDR family genes in cucumber under .... tissues and organs were detected using online data and semi- ...... Wheeler B. S. 2013 Small RNAs, big impact: small RNA pathways.
Without proper linguistic competence in English language, academic writing is one of the most challenging tasks, especially, in various genre specific disciplines by L2 novice writers. This paper examines the role of diction and expression through error analysis in English language of L2 novice writers' academic writing in interdisciplinary texts…
Full Text Available Aging is closely connected with death, progressive physiological decline, and increased risk of diseases, such as cancer, arteriosclerosis, heart disease, hypertension, and neurodegenerative diseases. It is reported that moxibustion can treat more than 300 kinds of diseases including aging related problems and can improve immune function and physiological functions. The digital gene expression profiling of aged mice with or without moxibustion treatment was investigated and the mechanisms of moxibustion in aged mice were speculated by gene ontology and pathway analysis in the study. Almost 145 million raw reads were obtained by digital gene expression analysis and about 140 million (96.55% were clean reads. Five differentially expressed genes with an adjusted P value 1 were identified between the control and moxibustion groups. They were Gm6563, Gm8116, Rps26-ps1, Nat8f4, and Igkv3-12. Gene ontology analysis was carried out by the GOseq R package and functional annotations of the differentially expressed genes related to translation, mRNA export from nucleus, mRNA transport, nuclear body, acetyltransferase activity, and so on. Kyoto Encyclopedia of Genes and Genomes database was used for pathway analysis and ribosome was the most significantly enriched pathway term.
Obel, G.; Farinha, P.; Lam, W.
American patients with transformed FL. Methods: High-resolution BAC-array comparative genomic hybridisation (CGH) was used to detect genomic imbalances. Gene expression profiling was performed using cDNA microarrays (Affymetrix). Results: Of 9 biopsy pairs identified so far, analysis results of the first 4...
CLONING, EXPRESSION, AND MUTATIONAL ANALYSIS OF RAT S-ADENOSYL-L-METHIONINE: ARSENIC(III) METHYLTRANSFERASEStephen B. Waters, Ph.D., Miroslav Styblo, Ph.D., Melinda A. Beck, Ph.D., University of North Carolina at Chapel Hill; David J. Thomas, Ph.D., U.S. Environmental...
Full Text Available Background: Microarray technology has been previously used to identify genes that are differentially expressed between tumour and normal samples in a single study, as well as in syntheses involving multiple studies. When integrating results from several Affymetrix microarray datasets, previous studies summarized probeset-level data, which may potentially lead to a loss of information available at the probe-level. In this paper, we present an approach for integrating results across studies while taking probe-level data into account. Additionally, we follow a new direction in the analysis of microarray expression data, namely to focus on the variation of expression phenotypes in predefined gene sets, such as pathways. This targeted approach can be helpful for revealing information that is not easily visible from the changes in the individual genes. Results: We used a recently developed method to integrate Affymetrix expression data across studies. The idea is based on a probe-level based test statistic developed for testing for differentially expressed genes in individual studies. We incorporated this test statistic into a classic random-effects model for integrating data across studies. Subsequently, we used a gene set enrichment test to evaluate the significance of enriched biological pathways in the differentially expressed genes identified from the integrative analysis. We compared statistical and biological significance of the prognostic gene expression signatures and pathways identified in the probe-level model (PLM with those in the probeset-level model (PSLM. Our integrative analysis of Affymetrix microarray data from 110 prostate cancer samples obtained from three studies reveals thousands of genes significantly correlated with tumour cell differentiation. The bioinformatics analysis, mapping these genes to the publicly available KEGG database, reveals evidence that tumour cell differentiation is significantly associated with many
Chen, Chun; Xie, Tingna; Ye, Sudan; Jensen, Annette Bruun; Eilenberg, Jørgen
The selection of suitable reference genes is crucial for accurate quantification of gene expression and can add to our understanding of host-pathogen interactions. To identify suitable reference genes in Pandora neoaphidis, an obligate aphid pathogenic fungus, the expression of three traditional candidate genes including 18S rRNA(18S), 28S rRNA(28S) and elongation factor 1 alpha-like protein (EF1), were measured by quantitative polymerase chain reaction at different developmental stages (conidia, conidia with germ tubes, short hyphae and elongated hyphae), and under different nutritional conditions. We calculated the expression stability of candidate reference genes using four algorithms including geNorm, NormFinder, BestKeeper and Delta Ct. The analysis results revealed that the comprehensive ranking of candidate reference genes from the most stable to the least stable was 18S (1.189), 28S (1.414) and EF1 (3). The 18S was, therefore, the most suitable reference gene for real-time RT-PCR analysis of gene expression under all conditions. These results will support further studies on gene expression in P. neoaphidis. Copyright © 2015 Sociedade Brasileira de Microbiologia. Published by Elsevier Editora Ltda. All rights reserved.
Full Text Available Abstract The selection of suitable reference genes is crucial for accurate quantification of gene expression and can add to our understanding of host–pathogen interactions. To identify suitable reference genes in Pandora neoaphidis, an obligate aphid pathogenic fungus, the expression of three traditional candidate genes including 18S rRNA(18S, 28S rRNA(28S and elongation factor 1 alpha-like protein (EF1, were measured by quantitative polymerase chain reaction at different developmental stages (conidia, conidia with germ tubes, short hyphae and elongated hyphae, and under different nutritional conditions. We calculated the expression stability of candidate reference genes using four algorithms including geNorm, NormFinder, BestKeeper and Delta Ct. The analysis results revealed that the comprehensive ranking of candidate reference genes from the most stable to the least stable was 18S (1.189, 28S (1.414 and EF1 (3. The 18S was, therefore, the most suitable reference gene for real-time RT-PCR analysis of gene expression under all conditions. These results will support further studies on gene expression in P. neoaphidis.
Full Text Available Abstract Background Maturation of spermatozoa, including development of motility and the ability to fertilize the oocyte, occurs during transit through the microenvironment of the epididymis. Comprehensive understanding of sperm maturation requires identification and characterization of unique genes expressed in the epididymis. Results We systematically identified 32 novel genes with epididymis-specific or -predominant expression in the mouse epididymis UniGene library, containing 1505 gene-oriented transcript clusters, by in silico and in vitro analyses. The Northern blot analysis revealed various characteristics of the genes at the transcript level, such as expression level, size and the presence of isoform. We found that expression of the half of the genes is regulated by androgens. Further expression analyses demonstrated that the novel genes are region-specific and developmentally regulated. Computational analysis showed that 15 of the genes lack human orthologues, suggesting their implication in male reproduction unique to the mouse. A number of the novel genes are putative epididymal protease inhibitors or β-defensins. We also found that six of the genes have secretory activity, indicating that they may interact with sperm and have functional roles in sperm maturation. Conclusion We identified and characterized 32 novel epididymis-specific or -predominant genes by an integrative approach. Our study is unique in the aspect of systematic identification of novel epididymal genes and should be a firm basis for future investigation into molecular mechanisms underlying sperm maturation in the epididymis.
Davin, Nicolas; Edger, Patrick P; Hefer, Charles A; Mizrachi, Eshchar; Schuetz, Mathias; Smets, Erik; Myburg, Alexander A; Douglas, Carl J; Schranz, Michael E; Lens, Frederic
Many plant genes are known to be involved in the development of cambium and wood, but how the expression and functional interaction of these genes determine the unique biology of wood remains largely unknown. We used the soc1ful loss of function mutant - the woodiest genotype known in the otherwise herbaceous model plant Arabidopsis - to investigate the expression and interactions of genes involved in secondary growth (wood formation). Detailed anatomical observations of the stem in combination with mRNA sequencing were used to assess transcriptome remodeling during xylogenesis in wild-type and woody soc1ful plants. To interpret the transcriptome changes, we constructed functional gene association networks of differentially expressed genes using the STRING database. This analysis revealed functionally enriched gene association hubs that are differentially expressed in herbaceous and woody tissues. In particular, we observed the differential expression of genes related to mechanical stress and jasmonate biosynthesis/signaling during wood formation in soc1ful plants that may be an effect of greater tension within woody tissues. Our results suggest that habit shifts from herbaceous to woody life forms observed in many angiosperm lineages could have evolved convergently by genetic changes that modulate the gene expression and interaction network, and thereby redeploy the conserved wood developmental program. © 2016 The Authors. The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.
Ahmed, Nasar Uddin; Jung, Hee-Jeong; Park, Jong-In; Cho, Yong-Gu; Hur, Yoonkang; Nou, Ill-Sup
Cold and freezing stress is a major environmental constraint to the production of Brassica crops. Enhancement of tolerance by exploiting cold and freezing tolerance related genes offers the most efficient approach to address this problem. Cold-induced transcriptional profiling is a promising approach to the identification of potential genes related to cold and freezing stress tolerance. In this study, 99 highly expressed genes were identified from a whole genome microarray dataset of Brassica rapa. Blast search analysis of the Brassica oleracea database revealed the corresponding homologous genes. To validate their expression, pre-selected cold tolerant and susceptible cabbage lines were analyzed. Out of 99 BoCRGs, 43 were differentially expressed in response to varying degrees of cold and freezing stress in the contrasting cabbage lines. Among the differentially expressed genes, 18 were highly up-regulated in the tolerant lines, which is consistent with their microarray expression. Additionally, 12 BoCRGs were expressed differentially after cold stress treatment in two contrasting cabbage lines, and BoCRG54, 56, 59, 62, 70, 72 and 99 were predicted to be involved in cold regulatory pathways. Taken together, the cold-responsive genes identified in this study provide additional direction for elucidating the regulatory network of low temperature stress tolerance and developing cold and freezing stress resistant Brassica crops. Copyright © 2014 Elsevier B.V. All rights reserved.
Zhao Guohua; Shi Lingfang; Qiu Daoming; Hu Hong; Kao, Peter N.
NF45/ILF2 associates with NF90/ILF3 in the nucleus and regulates IL-2 gene transcription at the antigen receptor response element (ARRE)/NF-AT DNA target sequence (P.N. Kao, L. Chen, G. Brock, J. Ng, A.J. Smith, B. Corthesy, J. Biol. Chem. 269 (1994) 20691-20699). NF45 is widely expressed in normal tissues, especially testis, brain, and kidney, with a predominantly nuclear distribution. NF45 mRNA expression is increased in lymphoma and leukemia cell lines. The human and murine NF45 proteins differ only by substitution of valine by isoleucine at amino acid 142. Fluorescence in situ hybridization localized the human NF45 gene to chromosome 1q21.3, and mouse NF45 gene to chromosome 3F1. Promoter analysis of 2.5 kB of the murine NF45 gene reveals that significant activation is conferred by factors, possible including NF-Y, that bind to the CCAAT-box sequence. The function of human NF45 in regulating IL-2 gene expression was characterized in Jurkat T-cells stably transfected with plasmids directing expression of NF45 cDNA in sense or antisense orientations. NF45 sense expression increased IL-2 luciferase reporter gene activity 120-fold, and IL-2 protein expression 2-fold compared to control cells. NF45 is a highly conserved, regulated transcriptional activator, and one target gene is IL-2
Meng, X.R. [Oncology Department, The First Affiliated Hospital of Zhengzhou University, Zhengzhou (China); Lu, P. [Gastrointestinal Surgery Department, People' s Hospital of Zhengzhou, Zhengzhou (China); Mei, J.Z.; Liu, G.J. [Medical Oncology Department, People' s Hospital of Zhengzhou, Zhengzhou (China); Fan, Q.X. [Oncology Department, The First Affiliated Hospital of Zhengzhou University, Zhengzhou (China)
We aimed to investigate miRNAs and related mRNAs through a network-based approach in order to learn the crucial role that they play in the biological processes of esophageal cancer. Esophageal squamous-cell carcinoma (ESCC) and adenocarcinoma (EAC)-related miRNA and gene expression data were downloaded from the Gene Expression Omnibus database, and differentially expressed miRNAs and genes were selected. Target genes of differentially expressed miRNAs were predicted and their regulatory networks were constructed. Differentially expressed miRNA analysis selected four miRNAs associated with EAC and ESCC, among which hsa-miR-21 and hsa-miR-202 were shared by both diseases. hsa-miR-202 was reported for the first time to be associated with esophageal cancer in the present study. Differentially expressed miRNA target genes were mainly involved in cancer-related and signal-transduction pathways. Functional categories of these target genes were related to transcriptional regulation. The results may indicate potential target miRNAs and genes for future investigations of esophageal cancer.
Malek Joel A
Full Text Available Abstract Background Ovarian cancer is the most deadly gynecological cancer due to late diagnosis at advanced stage with major peritoneal involvement. To date most research has focused on primary tumor. However the prognosis is directly related to residual disease at the end of the treatment. Therefore it is mandatory to focus and study the biology of meatastatic disease that is most frequently localized to the peritoneal caivty in ovarian cancer. Methods We used high-density gene expression arrays to investigate gene expression changes between matched primary and metastatic (peritoneal lesions. Results Here we show that gene expression profiles in peritoneal metastasis are significantly different than their matched primary tumor and these changes are affected by underlying copy number variation differences among other causes. We show that differentially expressed genes are enriched in specific pathways including JAK/STAT pathway, cytokine signaling and other immune related pathways. We show that underlying copy number variations significantly affect gene expression. Indeed patients with important differences in copy number variation displayed greater gene expression differences between their primary and matched metastatic lesions. Conclusions Our analysis shows a very specific targeting at both the genomic and transcriptomic level to upregulate certain pathways in the peritoneal metastasis of ovarian cancer. Moreover, while primary tumors use certain pathways we identify distinct differences with metastatic lesions. The variation between primary and metastatic lesions should be considered in personalized treatment of ovarian cancer.
Zhao, Tian-Tian; Zhang, Jin; Liang, Li-Song; Ma, Qing-Hua; Chen, Xin; Zong, Jian-Wei; Wang, Gui-Xi
Plant WRKY transcription factors are known to regulate various biotic and abiotic stress responses. In this study we identified a total of 30 putative WRKY unigenes in a transcriptome dataset of the Chinese wild Hazel, Corylus heterophylla, a species that is noted for its cold tolerance. Thirteen full-length of these ChWRKY genes were cloned and found to encode complete protein sequences, and they were divided into three groups, based on the number of WRKY domains and the pattern of zinc finger structures. Representatives of each of the groups, Unigene25835 (group I), Unigene37641 (group II) and Unigene20441 (group III), were transiently expressed as fusion proteins with yellow fluorescent fusion protein in Nicotiana benthamiana, where they were observed to accumulate in the nucleus, in accordance with their predicted roles as transcriptional activators. An analysis of the expression patterns of all 30 WRKY genes revealed differences in transcript abundance profiles following exposure to cold, drought and high salinity conditions. Among the stress-inducible genes, 23 were up-regulated by all three abiotic stresses and the WRKY genes collectively exhibited four different patterns of expression in flower buds during the overwintering period from November to April. The organ/tissue related expression analysis showed that 18 WRKY genes were highly expressed in stem but only 2 (Unigene9262 and Unigene43101) were greatest in male anthotaxies. The expression of Unigene37641, a member of the group II WRKY genes, was substantially up-regulated by cold, drought and salinity treatments, and its overexpression in Arabidopsis thaliana resulted in better seedling growth, compared with wild type plants, under cold treatment conditions. The transgenic lines also had exhibited higher soluble protein content, superoxide dismutase and peroxidase activiety and lower levels of malondialdehyde, which collectively suggets that Unigene37641 expression promotes cold tolerance.
Wang, Jinglu; Qu, Susu; Wang, Weixiao; Guo, Liyuan; Zhang, Kunlin; Chang, Suhua; Wang, Jing
Numbers of gene expression profiling studies of bipolar disorder have been published. Besides different array chips and tissues, variety of the data processes in different cohorts aggravated the inconsistency of results of these genome-wide gene expression profiling studies. By searching the gene expression databases, we obtained six data sets for prefrontal cortex (PFC) of bipolar disorder with raw data and combinable platforms. We used standardized pre-processing and quality control procedures to analyze each data set separately and then combined them into a large gene expression matrix with 101 bipolar disorder subjects and 106 controls. A standard linear mixed-effects model was used to calculate the differentially expressed genes (DEGs). Multiple levels of sensitivity analyses and cross validation with genetic data were conducted. Functional and network analyses were carried out on basis of the DEGs. In the result, we identified 198 unique differentially expressed genes in the PFC of bipolar disorder and control. Among them, 115 DEGs were robust to at least three leave-one-out tests or different pre-processing methods; 51 DEGs were validated with genetic association signals. Pathway enrichment analysis showed these DEGs were related with regulation of neurological system, cell death and apoptosis, and several basic binding processes. Protein-protein interaction network further identified one key hub gene. We have contributed the most comprehensive integrated analysis of bipolar disorder expression profiling studies in PFC to date. The DEGs, especially those with multiple validations, may denote a common signature of bipolar disorder and contribute to the pathogenesis of disease. Copyright © 2016 Elsevier Ltd. All rights reserved.
Lo, Miranda; Cordwell, Stuart J; Bulach, Dieter M; Adler, Ben
Leptospirosis is a global zoonosis affecting millions of people annually. Transcriptional changes in response to temperature were previously investigated using microarrays to identify genes potentially expressed upon host entry. Past studies found that various leptospiral outer membrane proteins are differentially expressed at different temperatures. However, our microarray studies highlighted a divergence between protein abundance and transcript levels for some proteins. Given the abundance of post-transcriptional expression control mechanisms, this finding highlighted the importance of global protein analysis systems. To complement our previous transcription study, we evaluated differences in the proteins of the leptospiral outer membrane fraction in response to temperature upshift. Outer membrane protein-enriched fractions from Leptospira interrogans grown at 30 degrees C or overnight upshift to 37 degrees C were isolated and the relative abundance of each protein was determined by iTRAQ analysis coupled with two-dimensional liquid chromatography and tandem mass spectrometry (2-DLC/MS-MS). We identified 1026 proteins with 99% confidence; 27 and 66 were present at elevated and reduced abundance respectively. Protein abundance changes were compared with transcriptional differences determined from the microarray studies. While there was some correlation between the microarray and iTRAQ data, a subset of genes that showed no differential expression by microarray was found to encode temperature-regulated proteins. This set of genes is of particular interest as it is likely that regulation of their expression occurs post-transcriptionally, providing an opportunity to develop hypotheses about the molecular dynamics of the outer membrane of Leptospira in response to changing environments. This is the first study to compare transcriptional and translational responses to temperature shift in L. interrogans. The results thus provide an insight into the mechanisms used by L
Turner, Helen C; Budak, Murat T; Akinci, M A Murat; Wolosin, J Mario
To determine global mRNA expression levels in corneal and conjunctival epithelia and identify transcripts that exhibit preferential tissue expression. cDNA samples derived from human conjunctival and corneal epithelia were hybridized in three independent experiments to a commercial oligonucleotide array representing more than 22,000 transcripts. The resultant signal intensities and microarray software transcript present/absent calls were used in conjunction with the local pooled error (LPE) statistical method to identify transcripts that are preferentially or exclusively expressed in one of the two tissues at significant levels (expression >1% of the beta-actin level). EASE (Expression Analysis Systematic Explorer software) was used to identify biological systems comparatively overrepresented in either epithelium. Immuno-, and cytohistochemistry was performed to validate or expand on selected results of interest. The analysis identified 332 preferential and 93 exclusive significant corneal epithelial transcripts. The corresponding numbers of conjunctival epithelium transcripts were 592 and 211, respectively. The overrepresented biological processes in the cornea were related to cell adhesion and oxiredox equilibria and cytoprotection activities. In the conjunctiva, the biological processes that were most prominent were related to innate immunity and melanogenesis. Immunohistochemistry for antigen-presenting cells and melanocytes was consistent with these gene signatures. The transcript comparison identified a substantial number of genes that have either not been identified previously or are not known to be highly expressed in these two epithelia, including testican-1, ECM1, formin, CRTAC1, and NQO1 in the cornea and, in the conjunctiva, sPLA(2)-IIA, lipocalin 2, IGFBP3, multiple MCH class II proteins, and the Na-Pi cotransporter type IIb. Comparative gene expression profiling leads to the identification of many biological processes and previously unknown genes that
Shen, Po-Chih; Hour, Ai-Ling; Liu, Li-Yu Daisy
Abiotic stresses are the major limiting factors that affect plant growth, development, yield and final quality. Deciphering the underlying mechanisms of plants' adaptations to stresses using few datasets might overlook the different aspects of stress tolerance in plants, which might be simultaneously and consequently operated in the system. Fortunately, the accumulated microarray expression data offer an opportunity to infer abiotic stress-specific gene expression patterns through meta-analysis. In this study, we propose to combine microarray gene expression data under control, cold, drought, heat, and salt conditions and determined modules (gene sets) of genes highly associated with each other according to the observed expression data. By analyzing the expression variations of the Eigen genes from different conditions, we had identified two, three, and five gene modules as cold-, heat-, and salt-specific modules, respectively. Most of the cold- or heat-specific modules were differentially expressed to a particular degree in shoot samples, while most of the salt-specific modules were differentially expressed to a particular degree in root samples. A gene ontology (GO) analysis on the stress-specific modules suggested that the gene modules exclusively enriched stress-related GO terms and that different genes under the same GO terms may be alternatively disturbed in different conditions. The gene regulatory events for two genes, DREB1A and DEAR1, in the cold-specific gene module had also been validated, as evidenced through the literature search. Our protocols study the specificity of the gene modules that were specifically activated under a particular type of abiotic stress. The biplot can also assist to visualize the stress-specific gene modules. In conclusion, our approach has the potential to further elucidate mechanisms in plants and beneficial for future experiments design under different abiotic stresses.
Full Text Available Background: Colorectal cancer (CRC is one of the most frequently occurring cancers in Japan, and thus a wide range of methods have been deployed to study the molecular mechanisms of CRC. In this study, we performed a comprehensive analysis of CRC, incorporating copy number aberration (CRC and gene expression data. For the last four years, we have been collecting data from CRC cases and organizing the information as an “omics” study by integrating many kinds of analysis into a single comprehensive investigation. In our previous studies, we had experienced difficulty in finding genes related to CRC, as we observed higher noise levels in the expression data than in the data for other cancers. Because chromosomal aberrations are often observed in CRC, here, we have performed a combination of CNA analysis and expression analysis in order to identify some new genes responsible for CRC. This study was performed as part of the Clinical Omics Database Project at Tokyo Medical and Dental University. The purpose of this study was to investigate the mechanism of genetic instability in CRC by this combination of expression analysis and CNA, and to establish a new method for the diagnosis and treatment of CRC. Materials and methods: Comprehensive gene expression analysis was performed on 79 CRC cases using an Affymetrix Gene Chip, and comprehensive CNA analysis was performed using an Affymetrix DNA Sty array. To avoid the contamination of cancer tissue with normal cells, laser micro-dissection was performed before DNA/RNA extraction. Data analysis was performed using original software written in the R language. Result: We observed a high percentage of CNA in colorectal cancer, including copy number gains at 7, 8q, 13 and 20q, and copy number losses at 8p, 17p and 18. Gene expression analysis provided many candidates for CRC-related genes, but their association with CRC did not reach the level of statistical significance. The combination of CNA and gene
Monavar Feshani, Aboozar; Mohammadi, Saeed; Frazier, Taylor P; Abbasi, Abbas; Abedini, Raha; Karimi Farsad, Laleh; Ehya, Farveh; Salekdeh, Ghasem Hosseini; Mardi, Mohsen
MicroRNAs (miRNAs) are small non-coding RNA molecules that play a vital role in the regulation of gene expression. Despite their identification in hundreds of plant species, few miRNAs have been identified in the Asteraceae, a large family that comprises approximately one tenth of all flowering plants. In this study, we used the expressed sequence tag (EST) analysis to identify potential conserved miRNAs and their putative target genes in the Asteraceae. We applied quantitative Real-Time PCR (qRT-PCR) to confirm the expression of eight potential miRNAs in Carthamus tinctorius and Helianthus annuus. We also performed qRT-PCR analysis to investigate the differential expression pattern of five newly identified miRNAs during five different cotyledon growth stages in safflower. Using these methods, we successfully identified and characterized 151 potentially conserved miRNAs, belonging to 26 miRNA families, in 11 genus of Asteraceae. EST analysis predicted that the newly identified conserved Asteraceae miRNAs target 130 total protein-coding ESTs in sunflower and safflower, as well as 433 additional target genes in other plant species. We experimentally confirmed the existence of seven predicted miRNAs, (miR156, miR159, miR160, miR162, miR166, miR396, and miR398) in safflower and sunflower seedlings. We also observed that five out of eight miRNAs are differentially expressed during cotyledon development. Our results indicate that miRNAs may be involved in the regulation of gene expression during seed germination and the formation of the cotyledons in the Asteraceae. The findings of this study might ultimately help in the understanding of miRNA-mediated gene regulation in important crop species. Copyright © 2011 Elsevier B.V. All rights reserved.
Ahmed, Farid E; Gouda, Mostafa M; Hussein, Laila A; Ahmed, Nancy C; Vos, Paul W; Mohammad, Mahmoud A
This article illustrates the importance of melt curve analysis (MCA) in interpretation of mild nutrogenomic micro(mi)RNA expression data, by measuring the magnitude of the expression of key miRNA molecules in stool of healthy human adults as molecular markers, following the intake of Pomegranate juice (PGJ), functional fermented sobya (FS), rich in potential probiotic lactobacilli, or their combination. Total small RNA was isolated from stool of 25 volunteers before and following a three-week dietary intervention trial. Expression of 88 miRNA genes was evaluated using Qiagen's 96 well plate RT 2 miRNA qPCR arrays. Employing parallel coordinates plots, there was no observed significant separation for the gene expression (Cq) values, using Roche 480® PCR LightCycler instrument used in this study, and none of the miRNAs showed significant statistical expression after controlling for the false discovery rate. On the other hand, melting temperature profiles produced during PCR amplification run, found seven significant genes (miR-184, miR-203, miR-373, miR-124, miR-96, miR-373 and miR-301a), which separated candidate miRNAs that could function as novel molecular markers of relevance to oxidative stress and immunoglobulin function, for the intake of polyphenol (PP)-rich, functional fermented foods rich in lactobacilli (FS), or their combination. We elaborate on these data, and present a detailed review on use of melt curves for analyzing nutigenomic miRNA expression data, which initially appear to show no significant expressions, but are actually more subtle than this simplistic view, necessitating the understanding of the role of MCA for a comprehensive understanding of what the collective expression and MCA data collectively imply. Copyright© 2017, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.
Yan, Lulu; Su, Jiaqi; Wang, Zhaoping; Yan, Xiwu; Yu, Ruihai
Quantitative real-time polymerase chain reaction (qRT-PCR) is a rapid and reliable technique which has been widely used to quantifying gene transcripts (expression analysis). It is also employed for studying heterosis, hybridization breeding and hybrid tolerability of oysters, an ecologically and economically important taxonomic group. For these studies, selection of a suitable set of housekeeping genes as references is crucial for correct interpretation of qRT-PCR data. To identify suitable reference genes for oysters during low temperature and low salinity stresses, we analyzed twelve genes from the gill tissue of Crassostrea sikamea (SS), Crassostrea angulata (AA) and their hybrid (SA), which included three ribosomal genes, 28S ribosomal protein S5 ( RPS5), ribosomal protein L35 ( RPL35), and 60S ribosomal protein L29 ( RPL29); three structural genes, tubulin gamma ( TUBγ), annexin A6 and A7 ( AA6 and AA7); three metabolic pathway genes, ornithine decarboxylase ( OD), glyceraldehyde-3-phosphate dehydrogenase ( GAPDH) and glutathione S-transferase P1 ( GSP); two transcription factors, elongation factor 1 alpha and beta ( EF1α and EF1β); and one protein synthesis gene (ubiquitin ( UBQ). Primers specific for these genes were successfully developed for the three groups of oysters. Three different algorithms, geNorm, NormFinder and BestKeeper, were used to evaluate the expression stability of these candidate genes. BestKeeper program was found to be the most reliable. Based on our analysis, we found that the expression of RPL35 and EF1α was stable under low salinity stress, and the expression of OD, GAPDH and EF1α was stable under low temperature stress in hybrid (SA) oyster; the expression of RPS5 and GAPDH was stable under low salinity stress, and the expression of RPS5, UBQ, GAPDH was stable under low temperature stress in SS oyster; the expression of RPS5, GAPDH, EF1β and AA7 was stable under low salinity stress, and the expression of RPL35, EF1α, GAPDH
Liu, Yutao; Munro, Drew; Layfield, David; Dellinger, Andrew; Walter, Jeffrey; Peterson, Katherine; Rickman, Catherine Bowes; Allingham, R Rand; Hauser, Michael A
To identify the genes expressed in normal human trabecular meshwork tissue, a tissue critical to the pathogenesis of glaucoma. Total RNA was extracted from human trabecular meshwork (HTM) harvested from 3 different donors. Extracted RNA was used to synthesize individual SAGE (serial analysis of gene expression) libraries using the I-SAGE Long kit from Invitrogen. Libraries were analyzed using SAGE 2000 software to extract the 17 base pair sequence tags. The extracted sequence tags were mapped to the genome using SAGE Genie map. A total of 298,834 SAGE tags were identified from all HTM libraries (96,842, 88,126, and 113,866 tags, respectively). Collectively, there were 107,325 unique tags. There were 10,329 unique tags with a minimum of 2 counts from a single library. These tags were mapped to known unique Unigene clusters. Approximately 29% of the tags (orphan tags) did not map to a known Unigene cluster. Thirteen percent of the tags mapped to at least 2 Unigene clusters. Sequence tags from many glaucoma-related genes, including myocilin, optineurin, and WD repeat domain 36, were identified. This is the first time SAGE analysis has been used to characterize the gene expression profile in normal HTM. SAGE analysis provides an unbiased sampling of gene expression of the target tissue. These data will provide new and valuable information to improve understanding of the biology of human aqueous outflow.
Han, Xinxin; Yin, Linlin; Xue, Hongwei
Fatty acids (FAs) play crucial rules in signal transduction and plant development, however, the regulation of FA metabolism is still poorly understood. To study the relevant regulatory network, fifty-eight FA biosynthesis genes including de novo synthases, desaturases and elongases were selected as "guide genes" to construct the co-expression network. Calculation of the correlation between all Arabidopsis thaliana (L.) genes with each guide gene by Arabidopsis co-expression dating mining tools (ACT) identifies 797 candidate FA-correlated genes. Gene ontology (GO) analysis of these co-expressed genes showed they are tightly correlated to photosynthesis and carbohydrate metabolism, and function in many processes. Interestingly, 63 transcription factors (TFs) were identified as candidate FA biosynthesis regulators and 8 TF families are enriched. Two TF genes, CRC and AP1, both correlating with 8 FA guide genes, were further characterized. Analyses of the ap1 and crc mutant showed the altered total FA composition of mature seeds. The contents of palmitoleic acid, stearic acid, arachidic acid and eicosadienoic acid are decreased, whereas that of oleic acid is increased in ap1 and crc seeds, which is consistent with the qRT-PCR analysis revealing the suppressed expression of the corresponding guide genes. In addition, yeast one-hybrid analysis and electrophoretic mobility shift assay (EMSA) revealed that CRC can bind to the promoter regions of KCS7 and KCS15, indicating that CRC may directly regulate FA biosynthesis. © 2012 Institute of Botany, Chinese Academy of Sciences.
Jia, Cheng; Hu, Yu; Kelly, Derek; Kim, Junhyong; Li, Mingyao; Zhang, Nancy R
Recent technological breakthroughs have made it possible to measure RNA expression at the single-cell level, thus paving the way for exploring expression heterogeneity among individual cells. Current single-cell RNA sequencing (scRNA-seq) protocols are complex and introduce technical biases that vary across cells, which can bias downstream analysis without proper adjustment. To account for cell-to-cell technical differences, we propose a statistical framework, TASC (Toolkit for Analysis of Single Cell RNA-seq), an empirical Bayes approach to reliably model the cell-specific dropout rates and amplification bias by use of external RNA spike-ins. TASC incorporates the technical parameters, which reflect cell-to-cell batch effects, into a hierarchical mixture model to estimate the biological variance of a gene and detect differentially expressed genes. More importantly, TASC is able to adjust for covariates to further eliminate confounding that may originate from cell size and cell cycle differences. In simulation and real scRNA-seq data, TASC achieves accurate Type I error control and displays competitive sensitivity and improved robustness to batch effects in differential expression analysis, compared to existing methods. TASC is programmed to be computationally efficient, taking advantage of multi-threaded parallelization. We believe that TASC will provide a robust platform for researchers to leverage the power of scRNA-seq. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Sung, Chang Ohk; Choi, Chel Hun; Ko, Young-Hyeh; Ju, Hyunjeong; Choi, Yoon-La; Kim, Nyunsu; Kang, So Young; Ha, Sang Yun; Choi, Kyusam; Bae, Duk-Soo; Lee, Jeong-Won; Kim, Tae-Joong; Song, Sang Yong; Kim, Byoung-Gie
Ovarian clear cell adenocarcinoma (Ov-CCA) is a distinctive subtype of ovarian epithelial carcinoma. In this study, we performed array comparative genomic hybridization (aCGH) and paired gene expression microarray of 19 fresh-frozen samples and conducted integrative analysis. For the copy number alterations, significantly amplified regions (false discovery rate [FDR] q genes demonstrating frequent copy number alterations (>25% of samples) that correlated with gene expression (FDR genes were mainly located on 8p11.21, 8p21.2-p21.3, 8q22.1, 8q24.3, 17q23.2-q23.3, 19p13.3, and 19p13.11. Among the regions, 8q24.3 was found to contain the most genes (30 of 94 genes) including PTK2. The 8q24.3 region was indicated as the most significant region, as supported by copy number, GISTIC, and integrative analysis. Pathway analysis using differentially expressed genes on 8q24.3 revealed several major nodes, including PTK2. In conclusion, we identified a set of 94 candidate genes with frequent copy number alterations that correlated with gene expression. Specific chromosomal alterations, such as the 8q24.3 gain containing PTK2, could be a therapeutic target in a subset of Ov-CCAs. Copyright © 2013. Published by Elsevier Inc.
Wang, Lianghai; Yu, Xiaodan; Li, Jing; Zhang, Zhiyu; Hou, Jun; Li, Feng
The prognostic value of p53 protein expression in esophageal cancer has been evaluated, but the results remain inconclusive and no consensus has yet been achieved. This meta-analysis was conducted to quantitatively assess the prognostic significance of p53 expression in esophageal cancer. Publications that assessed the clinical or prognostic significance of p53 expression in esophageal cancer and were published before July 1, 2015 were identified by searching the PubMed and EMBASE databases. A meta-analysis was performed to clarify the association between p53 expression and the clinical outcomes. A total of 36 publications met the criteria and included 4577 cases. Analysis of these data showed that p53 expression in esophageal cancer was significantly associated with poorer 5-year survival (RR = 1.30, 95 % CI: 1.11–1.51, P = 0.0008). Subgroup analyses according to histological type, continent of the patients, and cut-off value revealed the similar results. The results also indicated that p53 expression was highly associated with advanced TNM stages (I/II vs. III/IV, OR = 0.74, 95 % CI: 0.55–0.99, P = 0.04), lymph node metastasis (OR = 0.77, 95 % CI: 0.66–0.90, P = 0.001), and distant metastasis (OR = 0.46, 95 % CI: 0.26–0.80, P = 0.006). However, p53 expression in the included studies was not significantly associated with tumor size (≤ 5 cm vs. > 5 cm, OR = 1.13, 95 % CI: 0.92–1.40, P = 0.24), tumor location (upper + middle vs. lower, OR = 0.91, 95 % CI: 0.70–1.17, P = 0.45), grade of differentiation (well + moderate vs. poor, OR = 1.10, 95 % CI: 0.90–1.34, P = 0.35), and the depth of invasion (T1/T2 vs. T3/T4, OR = 0.86, 95 % CI: 0.71–1.03, P = 0.09). This meta-analysis showed that p53 expression may be a useful biomarker for predicting poorer prognosis in patients with esophageal cancer
Fomekong-Nanfack, Y.; Postma, M.; Kaandorp, J.A.
Background: Inference of gene regulatory networks (GRNs) requires accurate data, a method to simulate the expression patterns and an efficient optimization algorithm to estimate the unknown parameters. Using this approach it is possible to obtain alternative circuits without making any a priori
de Jong, Simone; van Eijk, Kristel R; Zeegers, Dave W L H
of the Psychiatric GWAS consortium (PGC) yielded five novel loci for schizophrenia. In this study, we aim to highlight additional schizophrenia susceptibility loci from the PGC study by combining the top association findings from the discovery stage (9394 schizophrenia cases and 12 462 controls) with expression QTLs...
Li, Shuang-Jiang; Chen, Da-Li; Zhang, Wen-Biao; Shen, Cheng; Che, Guo-Wei
Numbers of studies have investigated the biological functions of decorin (DCN) in oncogenesis, tumor progression, angiogenesis and metastasis. Although many of them aim to highlight the prognostic value of stromal DCN expression in breast cancer, some controversial results still exist and a consensus has not been reached until now. Therefore, our meta-analysis aims to determine the prognostic significance of stromal DCN expression in breast cancer patients. PubMed, EMBASE, the Web of Science and China National Knowledge Infrastructure (CNKI) databases were searched for full-text literatures met out inclusion criteria. We applied the hazard ratio (HR) with 95% confidence interval (CI) as the appropriate summarized statistics. Q-test and I(2) statistic were employed to estimate the level of heterogeneity across the included studies. Sensitivity analysis was conducted to further identify the possible origins of heterogeneity. The publication bias was detected by Begg's test and Egger's test. There were three English literatures (involving 6 studies) included into our meta-analysis. On the one hand, both the summarized outcomes based on univariate analysis (HR: 0.513; 95% CI: 0.406-0.648; Panalysis (HR: 0.544; 95% CI: 0.388-0.763; Panalysis (HR: 0.504; 95% CI: 0.389-0.651; Panalysis (HR: 0.568; 95% CI: 0.400-0.806; P=0.002) also indicated that stromal DCN expression was positively associated with high disease-free survival (DFS) of breast cancer patients. No significant heterogeneity or publication bias was observed within this meta-analysis. The present evidences indicate that high stromal DCN expression can significantly predict the good prognosis in patients with breast cancer. The discoveries from our meta-analysis have better be confirmed in the updated review pooling more relevant investigations in the future.
Full Text Available Many cells experience hypoxia, or low oxygen, and respond by dramatically altering gene expression. In the yeast Saccharomyces cerevisiae, genes that respond are required for many oxygen-dependent cellular processes, such as respiration, biosynthesis, and redox regulation. To more fully characterize the global response to hypoxia, we exposed yeast to hypoxic conditions, extracted RNA at different times, and performed RNA sequencing (RNA-seq analysis. Time-course statistical analysis revealed hundreds of genes that changed expression by up to 550-fold. The genes responded with varying kinetics suggesting that multiple regulatory pathways are involved. We identified most known oxygen-regulated genes and also uncovered new regulated genes. Reverse transcription-quantitative PCR (RT-qPCR analysis confirmed that the lysine methyltransferase EFM6 and the recombinase DMC1, both conserved in humans, are indeed oxygen-responsive. Looking more broadly, oxygen-regulated genes participate in expected processes like respiration and lipid metabolism, but also in unexpected processes like amino acid and vitamin metabolism. Using principle component analysis, we discovered that the hypoxic response largely occurs during the first 2 hr and then a new steady-state expression state is achieved. Moreover, we show that the oxygen-dependent genes are not part of the previously described environmental stress response (ESR consisting of genes that respond to diverse types of stress. While hypoxia appears to cause a transient stress, the hypoxic response is mostly characterized by a transition to a new state of gene expression. In summary, our results reveal that hypoxia causes widespread and complex changes in gene expression to prepare the cell to function with little or no oxygen.
Wang, Haibo; Zou, Zhurong; Wang, Shasha; Gong, Ming
Jatropha curcas L., also called the Physic nut, is an oil-rich shrub with multiple uses, including biodiesel production, and is currently exploited as a renewable energy resource in many countries. Nevertheless, because of its origin from the tropical MidAmerican zone, J. curcas confers an inherent but undesirable characteristic (low cold resistance) that may seriously restrict its large-scale popularization. This adaptive flaw can be genetically improved by elucidating the mechanisms underlying plant tolerance to cold temperatures. The newly developed Illumina Hiseq™ 2000 RNA-seq and Digital Gene Expression (DGE) are deep high-throughput approaches for gene expression analysis at the transcriptome level, using which we carefully investigated the gene expression profiles in response to cold stress to gain insight into the molecular mechanisms of cold response in J. curcas. In total, 45,251 unigenes were obtained by assembly of clean data generated by RNA-seq analysis of the J. curcas transcriptome. A total of 33,363 and 912 complete or partial coding sequences (CDSs) were determined by protein database alignments and ESTScan prediction, respectively. Among these unigenes, more than 41.52% were involved in approximately 128 known metabolic or signaling pathways, and 4,185 were possibly associated with cold resistance. DGE analysis was used to assess the changes in gene expression when exposed to cold condition (12°C) for 12, 24, and 48 h. The results showed that 3,178 genes were significantly upregulated and 1,244 were downregulated under cold stress. These genes were then functionally annotated based on the transcriptome data from RNA-seq analysis. This study provides a global view of transcriptome response and gene expression profiling of J. curcas in response to cold stress. The results can help improve our current understanding of the mechanisms underlying plant cold resistance and favor the screening of crucial genes for genetically enhancing cold resistance
Full Text Available BACKGROUND: Jatropha curcas L., also called the Physic nut, is an oil-rich shrub with multiple uses, including biodiesel production, and is currently exploited as a renewable energy resource in many countries. Nevertheless, because of its origin from the tropical MidAmerican zone, J. curcas confers an inherent but undesirable characteristic (low cold resistance that may seriously restrict its large-scale popularization. This adaptive flaw can be genetically improved by elucidating the mechanisms underlying plant tolerance to cold temperatures. The newly developed Illumina Hiseq™ 2000 RNA-seq and Digital Gene Expression (DGE are deep high-throughput approaches for gene expression analysis at the transcriptome level, using which we carefully investigated the gene expression profiles in response to cold stress to gain insight into the molecular mechanisms of cold response in J. curcas. RESULTS: In total, 45,251 unigenes were obtained by assembly of clean data generated by RNA-seq analysis of the J. curcas transcriptome. A total of 33,363 and 912 complete or partial coding sequences (CDSs were determined by protein database alignments and ESTScan prediction, respectively. Among these unigenes, more than 41.52% were involved in approximately 128 known metabolic or signaling pathways, and 4,185 were possibly associated with cold resistance. DGE analysis was used to assess the changes in gene expression when exposed to cold condition (12°C for 12, 24, and 48 h. The results showed that 3,178 genes were significantly upregulated and 1,244 were downregulated under cold stress. These genes were then functionally annotated based on the transcriptome data from RNA-seq analysis. CONCLUSIONS: This study provides a global view of transcriptome response and gene expression profiling of J. curcas in response to cold stress. The results can help improve our current understanding of the mechanisms underlying plant cold resistance and favor the screening of
Bendjilali, Nasrine; MacLeon, Samuel; Kalra, Gurmannat; Willis, Stephen D; Hossian, A K M Nawshad; Avery, Erica; Wojtowicz, Olivia; Hickman, Mark J
Many cells experience hypoxia, or low oxygen, and respond by dramatically altering gene expression. In the yeast Saccharomyces cerevisiae, genes that respond are required for many oxygen-dependent cellular processes, such as respiration, biosynthesis, and redox regulation. To more fully characterize the global response to hypoxia, we exposed yeast to hypoxic conditions, extracted RNA at different times, and performed RNA sequencing (RNA-seq) analysis. Time-course statistical analysis revealed hundreds of genes that changed expression by up to 550-fold. The genes responded with varying kinetics suggesting that multiple regulatory pathways are involved. We identified most known oxygen-regulated genes and also uncovered new regulated genes. Reverse transcription-quantitative PCR (RT-qPCR) analysis confirmed that the lysine methyltransferase EFM6 and the recombinase DMC1, both conserved in humans, are indeed oxygen-responsive. Looking more broadly, oxygen-regulated genes participate in expected processes like respiration and lipid metabolism, but also in unexpected processes like amino acid and vitamin metabolism. Using principle component analysis, we discovered that the hypoxic response largely occurs during the first 2 hr and then a new steady-state expression state is achieved. Moreover, we show that the oxygen-dependent genes are not part of the previously described environmental stress response (ESR) consisting of genes that respond to diverse types of stress. While hypoxia appears to cause a transient stress, the hypoxic response is mostly characterized by a transition to a new state of gene expression. In summary, our results reveal that hypoxia causes widespread and complex changes in gene expression to prepare the cell to function with little or no oxygen. Copyright © 2017 Bendjilali et al.
Full Text Available Gastric cancer is one of the most severe complex diseases with high morbidity and mortality in the world. The molecular mechanisms and risk factors for this disease are still not clear since the cancer heterogeneity caused by different genetic and environmental factors. With more and more expression data accumulated nowadays, we can perform integrative analysis for these data to understand the complexity of gastric cancer and to identify consensus players for the heterogeneous cancer. In the present work, we screened the published gene expression data and analyzed them with integrative tool, combined with pathway and gene ontology enrichment investigation. We identified several consensus differentially expressed genes and these genes were further confirmed with literature mining; at last, two genes, that is, immunoglobulin J chain and C-X-C motif chemokine ligand 17, were screened as novel gastric cancer associated genes. Experimental validation is proposed to further confirm this finding.
Affrida Abu Hassan; Ahmad Syazni Kamarudin; Nurul Nadia Aminuddin; Mohd Nazir Basiran
In vitro mutagenesis on Dendrobium Sonia in MINT has produced mutants with wide range of flower form and colour variations. Among the mutants are plants with different flower size and shape. These changes could be caused by alterations to the expression level of the genes responsible for the characteristics. In this studies, Differential Display technique was used to identify and analyse altered gene expression at the mRNA level. Total RNA of the control and mutants were reversed transcribed using three anchored oligo-d T primers. Subsequently, these cDNAs were Pcr amplified in combination with 16 arbitrary primers. The amplified products were electrophoresed side by side on agarose gel. Differentially expressed bands are isolated for further analysis. (Author)
Full Text Available Abstract Background The extensive use of DNA microarray technology in the characterization of the cell transcriptome is leading to an ever increasing amount of microarray data from cancer studies. Although similar questions for the same type of cancer are addressed in these different studies, a comparative analysis of their results is hampered by the use of heterogeneous microarray platforms and analysis methods. Results In contrast to a meta-analysis approach where results of different studies are combined on an interpretative level, we investigate here how to directly integrate raw microarray data from different studies for the purpose of supervised classification analysis. We use median rank scores and quantile discretization to derive numerically comparable measures of gene expression from different platforms. These transformed data are then used for training of classifiers based on support vector machines. We apply this approach to six publicly available cancer microarray gene expression data sets, which consist of three pairs of studies, each examining the same type of cancer, i.e. breast cancer, prostate cancer or acute myeloid leukemia. For each pair, one study was performed by means of cDNA microarrays and the other by means of oligonucleotide microarrays. In each pair, high classification accuracies (> 85% were achieved with training and testing on data instances randomly chosen from both data sets in a cross-validation analysis. To exemplify the potential of this cross-platform classification analysis, we use two leukemia microarray data sets to show that important genes with regard to the biology of leukemia are selected in an integrated analysis, which are missed in either single-set analysis. Conclusion Cross-platform classification of multiple cancer microarray data sets yields discriminative gene expression signatures that are found and validated on a large number of microarray samples, generated by different laboratories and
Full Text Available Abstract Background The Caenorhabditis elegans genome encodes ten proteins that share sequence similarity with the Hedgehog signaling molecule through their C-terminal autoprocessing Hint/Hog domain. These proteins contain novel N-terminal domains, and C. elegans encodes dozens of additional proteins containing only these N-terminal domains. These gene families are called warthog, groundhog, ground-like and quahog, collectively called hedgehog (hh-related genes. Previously, the expression pattern of seventeen genes was examined, which showed that they are primarily expressed in the ectoderm. Results With the completion of the C. elegans genome sequence in November 2002, we reexamined and identified 61 hh-related ORFs. Further, we identified 49 hh-related ORFs in C. briggsae. ORF analysis revealed that 30% of the genes still had errors in their predictions and we improved these predictions here. We performed a comprehensive expression analysis using GFP fusions of the putative intergenic regulatory sequence with one or two transgenic lines for most genes. The hh-related genes are expressed in one or a few of the following tissues: hypodermis, seam cells, excretory duct and pore cells, vulval epithelial cells, rectal epithelial cells, pharyngeal muscle or marginal cells, arcade cells, support cells of sensory organs, and neuronal cells. Using time-lapse recordings, we discovered that some hh-related genes are expressed in a cyclical fashion in phase with molting during larval development. We also generated several translational GFP fusions, but they did not show any subcellular localization. In addition, we also studied the expression patterns of two genes with similarity to Drosophila frizzled, T23D8.1 and F27E11.3A, and the ortholog of the Drosophila gene dally-like, gpn-1, which is a heparan sulfate proteoglycan. The two frizzled homologs are expressed in a few neurons in the head, and gpn-1 is expressed in the pharynx. Finally, we compare the
Song, Qiuming; Li, Dayong; Dai, Yi; Liu, Shixia; Huang, Lei; Hong, Yongbo; Zhang, Huijuan; Song, Fengming
Mitogen-activated protein kinase (MAPK) cascades, which consist of three functionally associated protein kinases, namely MEKKs, MKKs and MPKs, are universal signaling modules in all eukaryotes and have been shown to play critical roles in many physiological and biochemical processes in plants. However, little or nothing is known about the MPK and MKK families in watermelon. In the present study, we performed a systematic characterization of the ClMPK and ClMKK families including the identification and nomenclature, chromosomal localization, phylogenetic relationships, ClMPK-ClMKK interactions, expression patterns in different tissues and in response to abiotic and biotic stress and transient expression-based functional analysis for their roles in disease resistance. Genome-wide survey identified fifteen ClMPK and six ClMKK genes in watermelon genome and phylogenetic analysis revealed that both of the ClMPK and ClMKK families can be classified into four distinct groups. Yeast two-hybrid assays demonstrated significant interactions between members of the ClMPK and ClMKK families, defining putative ClMKK2-1/ClMKK6-ClMPK4-1/ClMPK4-2/ClMPK13 and ClMKK5-ClMPK6 cascades. Most of the members in the ClMPK and ClMKK families showed differential expression patterns in different tissues and in response to abiotic (e.g. drought, salt, cold and heat treatments) and biotic (e.g. infection of Fusarium oxysporum f. sp. niveum) stresses. Transient expression of ClMPK1, ClMPK4-2 and ClMPK7 in Nicotiana benthamiana resulted in enhanced resistance to Botrytis cinerea and upregulated expression of defense genes while transient expression of ClMPK6 and ClMKK2-2 led to increased susceptibility to B. cinerea. Furthermore, transient expression of ClMPK7 also led to hypersensitive response (HR)-like cell death and significant accumulation of H2O2 in N. benthamiana. We identified fifteen ClMPK and six ClMKK genes from watermelon and analyzed their phylogenetic relationships, expression
Lenburg, Marc E; Liou, Louis S; Gerry, Norman P; Frampton, Garrett M; Cohen, Herbert T; Christman, Michael F
Renal cell carcinoma is a common malignancy that often presents as a metastatic-disease for which there are no effective treatments. To gain insights into the mechanism of renal cell carcinogenesis, a number of genome-wide expression profiling studies have been performed. Surprisingly, there is very poor agreement among these studies as to which genes are differentially regulated. To better understand this lack of agreement we profiled renal cell tumor gene expression using genome-wide microarrays (45,000 probe sets) and compare our analysis to previous microarray studies. We hybridized total RNA isolated from renal cell tumors and adjacent normal tissue to Affymetrix U133A and U133B arrays. We removed samples with technical defects and removed probesets that failed to exhibit sequence-specific hybridization in any of the samples. We detected differential gene expression in the resulting dataset with parametric methods and identified keywords that are overrepresented in the differentially expressed genes with the Fisher-exact test. We identify 1,234 genes that are more than three-fold changed in renal tumors by t-test, 800 of which have not been previously reported to be altered in renal cell tumors. Of the only 37 genes that have been identified as being differentially expressed in three or more of five previous microarray studies of renal tumor gene expression, our analysis finds 33 of these genes (89%). A key to the sensitivity and power of our analysis is filtering out defective samples and genes that are not reliably detected. The widespread use of sample-wise voting schemes for detecting differential expression that do not control for false positives likely account for the poor overlap among previous studies. Among the many genes we identified using parametric methods that were not previously reported as being differentially expressed in renal cell tumors are several oncogenes and tumor suppressor genes that likely play important roles in renal cell
Full Text Available Williams-Beuren Syndrome (WBS is a neurodevelopmental disorder caused by a hemizygous deletion of a 1.5 Mb region on chromosome 7q11.23 encompassing 26 genes. One of these genes, GTF2IRD1, codes for a putative transcription factor that is expressed throughout the brain during development. Genotype-phenotype studies in patients with atypical deletions of 7q11.23 implicate this gene in the neurological features of WBS, and Gtf2ird1 knockout mice show reduced innate fear and increased sociability, consistent with features of WBS. Multiple studies have identified in vitro target genes of GTF2IRD1, but we sought to identify in vivo targets in the mouse brain.We performed the first in vivo microarray screen for transcriptional targets of Gtf2ird1 in brain tissue from Gtf2ird1 knockout and wildtype mice at embryonic day 15.5 and at birth. Changes in gene expression in the mutant mice were moderate (0.5 to 2.5 fold and of candidate genes with altered expression verified using real-time PCR, most were located on chromosome 5, within 10 Mb of Gtf2ird1. siRNA knock-down of Gtf2ird1 in two mouse neuronal cell lines failed to identify changes in expression of any of the genes identified from the microarray and subsequent analysis showed that differences in expression of genes on chromosome 5 were the result of retention of that chromosome region from the targeted embryonic stem cell line, and so were dependent upon strain rather than Gtf2ird1 genotype. In addition, specific analysis of genes previously identified as direct in vitro targets of GTF2IRD1 failed to show altered expression.We have been unable to identify any in vivo neuronal targets of GTF2IRD1 through genome-wide expression analysis, despite widespread and robust expression of this protein in the developing rodent brain.
Blenk, Steffen; Engelmann, Julia C; Pinkert, Stefan; Weniger, Markus; Schultz, Jörg; Rosenwald, Andreas; Müller-Hermelink, Hans K; Müller, Tobias; Dandekar, Thomas
Mantle cell lymphoma (MCL) is an incurable B cell lymphoma and accounts for 6% of all non-Hodgkin's lymphomas. On the genetic level, MCL is characterized by the hallmark translocation t(11;14) that is present in most cases with few exceptions. Both gene expression and comparative genomic hybridization (CGH) data vary considerably between patients with implications for their prognosis. We compare patients over and below the median of survival. Exploratory principal component analysis of gene expression data showed that the second principal component correlates well with patient survival. Explorative analysis of CGH data shows the same correlation. On chromosome 7 and 9 specific genes and bands are delineated which improve prognosis prediction independent of the previously described proliferation signature. We identify a compact survival predictor of seven genes for MCL patients. After extensive re-annotation using GEPAT, we established protein networks correlating with prognosis. Well known genes (CDC2, CCND1) and further proliferation markers (WEE1, CDC25, aurora kinases, BUB1, PCNA, E2F1) form a tight interaction network, but also non-proliferative genes (SOCS1, TUBA1B CEBPB) are shown to be associated with prognosis. Furthermore we show that aggressive MCL implicates a gene network shift to higher expressed genes in late cell cycle states and refine the set of non-proliferative genes implicated with bad prognosis in MCL. The results from explorative data analysis of gene expression and CGH data are complementary to each other. Including further tests such as Wilcoxon rank test we point both to proliferative and non-proliferative gene networks implicated in inferior prognosis of MCL and identify suitable markers both in gene expression and CGH data
Full Text Available Reverse transcription-qPCR (RT-qPCR has become a popular method for gene expression studies. Its results require data normalization by housekeeping genes. No single gene is proved to be stably expressed under all experimental conditions. Therefore, systematic evaluation of reference genes is necessary. With the aim to identify optimum reference genes for RT-qPCR analysis of gene expression in different tissues of Panax ginseng and the seedlings grown under heat stress, we investigated the expression stability of eight candidate reference genes, including elongation factor 1-beta (EF1-β, elongation factor 1-gamma (EF1-γ, eukaryotic translation initiation factor 3G (IF3G, eukaryotic translation initiation factor 3B (IF3B, actin (ACT, actin11 (ACT11, glyceraldehyde-3-phosphate dehydrogenase (GAPDH and cyclophilin ABH-like protein (CYC, using four widely used computational programs: geNorm, Normfinder, BestKeeper, and the comparative ΔCt method. The results were then integrated using the web-based tool RefFinder. As a result, EF1-γ, IF3G and EF1-β were the three most stable genes in different tissues of P. ginseng, while IF3G, ACT11 and GAPDH were the top three-ranked genes in seedlings treated with heat. Using three better reference genes alone or in combination as internal control, we examined the expression profiles of MAR, a multiple function-associated mRNA-like non-coding RNA (mlncRNA in P. ginseng. Taken together, we recommended EF1-γ/IF3G and IF3G/ACT11 as the suitable pair of reference genes for RT-qPCR analysis of gene expression in different tissues of P. ginseng and the seedlings grown under heat stress, respectively. The results serve as a foundation for future studies on P. ginseng functional genomics.
Full Text Available Background and objective It has been proven that ornithine aminotransferase (OAT might play an important role in the oncogenesis and progression of numerous malignant tumors. The aim of this study is to detect the mRNA and protein expression of OAT in non-small cell lung cancer (NSCLC, as well as to analyze the bioinformatic features and binary interactions. Methods OAT mRNA expression was detected in A549 and 16HBE cell lines by reverse transcription-polymerase chain reaction. OAT protein expression was determined in 55 cases of NSCLC and 17 cases of adjacent non-tumor lung tissues by immunohistochemical staining. The bioinformatic features and binary interactions of OAT were analyzed. Gene ontology annotation and signal pathway analysis were performed. Results OAT mRNA expression in A549 cells was 2.85-fold lower than that in 16HBE cells. OAT protein expression was significantly higher in NSCLC tissues than that in adjacent non-tumor lung tissues. A significant difference of OAT protein expression was existed between squamous cell lung cancer and adenocarcinoma (P<0.05, but was not correlated with the gender, age, lymph node metastasis, tumor size, and TNM stages. Bioinformatic analysis suggested that OAT was a highly homologous and stable protein located in the mitochondria. An aminotran-3 domain and several sites of phosphorylation, which may function in signal transduction, gene transcription, and molecular transit, were found. In the 54 selected binary interactions of OAT, TNF and TRAF6 play roles in the NF-κB pathway. Conclusion OAT may play an important role in the oncogenesis and progression of NSCLC. Thus, OAT may be a novel biomarker for the diagnosis of NSCLC or a new target for its treatment.
Ray, Sumanta; Hossain, Sk Md Mosaddek; Khatun, Lutfunnesa; Mukhopadhyay, Anirban
Alzheimer's disease (AD) is a chronic neuro-degenerative disruption of the brain which involves in large scale transcriptomic variation. The disease does not impact every regions of the brain at the same time, instead it progresses slowly involving somewhat sequential interaction with different regions. Analysis of the expression patterns of the genes in different regions of the brain influenced in AD surely contribute for a enhanced comprehension of AD pathogenesis and shed light on the early characterization of the disease. Here, we have proposed a framework to identify perturbation and preservation characteristics of gene expression patterns across six distinct regions of the brain ("EC", "HIP", "PC", "MTG", "SFG", and "VCX") affected in AD. Co-expression modules were discovered considering a couple of regions at once. These are then analyzed to know the preservation and perturbation characteristics. Different module preservation statistics and a rank aggregation mechanism have been adopted to detect the changes of expression patterns across brain regions. Gene ontology (GO) and pathway based analysis were also carried out to know the biological meaning of preserved and perturbed modules. In this article, we have extensively studied the preservation patterns of co-expressed modules in six distinct brain regions affected in AD. Some modules are emerged as the most preserved while some others are detected as perturbed between a pair of brain regions. Further investigation on the topological properties of preserved and non-preserved modules reveals a substantial association amongst "betweenness centrality" and "degree" of the involved genes. Our findings may render a deeper realization of the preservation characteristics of gene expression patterns in discrete brain regions affected by AD.
Full Text Available Genome-wide association studies (GWAS have uncovered numerous genetic variants (SNPs that are associated with blood pressure (BP. Genetic variants may lead to BP changes by acting on intermediate molecular phenotypes such as coded protein sequence or gene expression, which in turn affect BP variability. Therefore, characterizing genes whose expression is associated with BP may reveal cellular processes involved in BP regulation and uncover how transcripts mediate genetic and environmental effects on BP variability. A meta-analysis of results from six studies of global gene expression profiles of BP and hypertension in whole blood was performed in 7017 individuals who were not receiving antihypertensive drug treatment. We identified 34 genes that were differentially expressed in relation to BP (Bonferroni-corrected p<0.05. Among these genes, FOS and PTGS2 have been previously reported to be involved in BP-related processes; the others are novel. The top BP signature genes in aggregate explain 5%-9% of inter-individual variance in BP. Of note, rs3184504 in SH2B3, which was also reported in GWAS to be associated with BP, was found to be a trans regulator of the expression of 6 of the transcripts we found to be associated with BP (FOS, MYADM, PP1R15A, TAGAP, S100A10, and FGBP2. Gene set enrichment analysis suggested that the BP-related global gene expression changes include genes involved in inflammatory response and apoptosis pathways. Our study provides new insights into molecular mechanisms underlying BP regulation, and suggests novel transcriptomic markers for the treatment and prevention of hypertension.
Full Text Available Abstract Background Amyotrophic Lateral Sclerosis (ALS is a lethal disorder characterized by progressive degeneration of motor neurons in the brain and spinal cord. Diagnosis is mainly based on clinical symptoms, and there is currently no therapy to stop the disease or slow its progression. Since access to spinal cord tissue is not possible at disease onset, we investigated changes in gene expression profiles in whole blood of ALS patients. Results Our transcriptional study showed dramatic changes in blood of ALS patients; 2,300 probes (9.4% showed significant differential expression in a discovery dataset consisting of 30 ALS patients and 30 healthy controls. Weighted gene co-expression network analysis (WGCNA was used to find disease-related networks (modules and disease related hub genes. Two large co-expression modules were found to be associated with ALS. Our findings were replicated in a second (30 patients and 30 controls and third dataset (63 patients and 63 controls, thereby demonstrating a highly significant and consistent association of two large co-expression modules with ALS disease status. Ingenuity Pathway Analysis of the ALS related module genes implicates enrichment of functional categories related to genetic disorders, neurodegeneration of the nervous system and inflammatory disease. The ALS related modules contain a number of candidate genes possibly involved in pathogenesis of ALS. Conclusion This first large-scale blood gene expression study in ALS observed distinct patterns between cases and controls which may provide opportunities for biomarker development as well as new insights into the molecular mechanisms of the disease.
Nummenmaa, Lauri; Calvo, Manuel G
Happy facial expressions are recognized faster and more accurately than other expressions in categorization tasks, whereas detection in visual search tasks is widely believed to be faster for angry than happy faces. We used meta-analytic techniques for resolving this categorization versus detection advantage discrepancy for positive versus negative facial expressions. Effect sizes were computed on the basis of the r statistic for a total of 34 recognition studies with 3,561 participants and 37 visual search studies with 2,455 participants, yielding a total of 41 effect sizes for recognition accuracy, 25 for recognition speed, and 125 for visual search speed. Random effects meta-analysis was conducted to estimate effect sizes at population level. For recognition tasks, an advantage in recognition accuracy and speed for happy expressions was found for all stimulus types. In contrast, for visual search tasks, moderator analysis revealed that a happy face detection advantage was restricted to photographic faces, whereas a clear angry face advantage was found for schematic and "smiley" faces. Robust detection advantage for nonhappy faces was observed even when stimulus emotionality was distorted by inversion or rearrangement of the facial features, suggesting that visual features primarily drive the search. We conclude that the recognition advantage for happy faces is a genuine phenomenon related to processing of facial expression category and affective valence. In contrast, detection advantages toward either happy (photographic stimuli) or nonhappy (schematic) faces is contingent on visual stimulus features rather than facial expression, and may not involve categorical or affective processing. (c) 2015 APA, all rights reserved).
Wei, Kai-Fa; Chen, Juan; Chen, Yan-Feng; Wu, Ling-Juan; Xie, Dao-Xin
The WRKY transcription factors function in plant growth and development, and response to the biotic and abiotic stresses. Although many studies have focused on the functional identification of the WRKY transcription factors, much less is known about molecular phylogenetic and global expression analysis of the complete WRKY family in maize. In this study, we identified 136 WRKY proteins coded by 119 genes in the B73 inbred line from the complete genome and named them in an orderly manner. Then, a comprehensive phylogenetic analysis of five species was performed to explore the origin and evolutionary patterns of these WRKY genes, and the result showed that gene duplication is the major driving force for the origin of new groups and subgroups and functional divergence during evolution. Chromosomal location analysis of maize WRKY genes indicated that 20 gene clusters are distributed unevenly in the genome. Microarray-based expression analysis has revealed that 131 WRKY transcripts encoded by 116 genes may participate in the regulation of maize growth and development. Among them, 102 transcripts are stably expressed with a coefficient of variation (CV) value of WRKY genes with the CV value of >15% are further analysed to discover new organ- or tissue-specific genes. In addition, microarray analyses of transcriptional responses to drought stress and fungal infection showed that maize WRKY proteins are involved in stress responses. All these results contribute to a deep probing into the roles of WRKY transcription factors in maize growth and development and stress tolerance.
Full Text Available The full-length cDNA sequence of a porcine gene, MOSPD2, was amplified using the rapid amplification of cDNA ends method based on a pig expressed sequence tag sequence which was highly homologous to the coding sequence of the human MOSPD2 gene. Sequence prediction analysis revealed that the open reading frame of this gene encodes a protein of 491 amino acids that has high homology with the motile sperm domain-containing protein 2 (MOSPD2 of five species: horse (89%, human (90%, chimpanzee (89%, rhesus monkey (89% and mouse (85%; thus, it could be defined as a porcine MOSPD2 gene. This novel porcine gene was assigned GeneID: 100153601. This gene is structured in 15 exons and 14 introns as revealed by computer-assisted analysis. The phylogenetic analysis revealed that the porcine MOSPD2 gene has a closer genetic relationship with the MOSPD2 gene of horse. Tissue expression analysis indicated that the porcine MOSPD2 gene is generally and differentially expressed in the spleen, muscle, skin, kidney, lung, liver, fat and heart. Our experiment is the first to establish the primary foundation for further research on the porcine MOSPD2 gene.
Thomassen, Mads; Tan, Qihua; Kruse, Torben A
Metastasis is believed to progress in several steps including different pathways but the determination and understanding of these mechanisms is still fragmentary. Microarray analysis of gene expression patterns in breast tumors has been used to predict outcome in recent studies. Besides classification of outcome, these global expression patterns may reflect biological mechanisms involved in metastasis of breast cancer. Our purpose has been to investigate pathways and transcription factors involved in metastasis by use of gene expression data sets. We have analyzed 8 publicly available gene expression data sets. A global approach, 'gene set enrichment analysis' as well as an approach focusing on a subset of significantly differently regulated genes, GenMAPP, has been applied to rank pathway gene sets according to differential regulation in metastasizing tumors compared to non-metastasizing tumors. Meta-analysis has been used to determine overrepresentation of pathways and transcription factors targets, concordant deregulated in metastasizing breast tumors, in several data sets. The major findings are up-regulation of cell cycle pathways and a metabolic shift towards glucose metabolism reflected in several pathways in metastasizing tumors. Growth factor pathways seem to play dual roles; EGF and PDGF pathways are decreased, while VEGF and sex-hormone pathways are increased in tumors that metastasize. Furthermore, migration, proteasome, immune system, angiogenesis, DNA repair and several signal transduction pathways are associated to metastasis. Finally several transcription factors e.g. E2F, NFY, and YY1 are identified as being involved in metastasis. By pathway meta-analysis many biological mechanisms beyond major characteristics such as proliferation are identified. Transcription factor analysis identifies a number of key factors that support central pathways. Several previously proposed treatment targets are identified and several new pathways that may
Zhang, Shizhong; Xu, Ruirui; Luo, Xiaocui; Jiang, Zesheng; Shu, Huairui
MAPK signal transduction modules play crucial roles in regulating many biological processes in plants, which are composed of three classes of hierarchically organized protein kinases, namely MAPKKKs, MAPKKs, and MAPKs. Although genome-wide analysis of this family has been carried out in some species, little is known about MAPK and MAPKK genes in apple (Malus domestica). In this study, a total of 26 putative apple MAPK genes (MdMPKs) and 9 putative apple MAPKK genes (MdMKKs) have been identified and located within the apple genome. Phylogenetic analysis revealed that MdMAPKs and MdMAPKKs could be divided into 4 subfamilies (groups A, B, C and D), respectively. The predicted MdMAPKs and MdMAPKKs were distributed across 13 out of 17 chromosomes with different densities. In addition, analysis of exon-intron junctions and of intron phase inside the predicted coding region of each candidate gene has revealed high levels of conservation within and between phylogenetic groups. According to the microarray and expressed sequence tag (EST) analysis, the different expression patterns indicate that they may play different roles during fruit development and rootstock-scion interaction process. Moreover, MAPK and MAPKK genes were performed expression profile analyses in different tissues (root, stem, leaf, flower and fruit), and all of the selected genes were expressed in at least one of the tissues tested, indicating that the MAPKs and MAPKKs are involved in various aspects of physiological and developmental processes of apple. To our knowledge, this is the first report of a genome-wide analysis of the apple MAPK and MAPKK gene family. This study provides valuable information for understanding the classification and putative functions of the MAPK signal in apple. © 2013.
Tan, Wulin; Song, Yiyan; Mo, Chengqiang; Jiang, Shuangjian; Wang, Zhongxing
The aim of the present study was to predict key genes and proteins associated with complex regional pain syndrome (CRPS) using bioinformatics analysis. The gene expression profiling microarray data, GSE47603, which included peripheral blood samples from 4 patients with CRPS and 5 healthy controls, was obtained from the Gene Expression Omnibus (GEO) database. The differentially expressed genes (DEGs) in CRPS patients compared with healthy controls were identified using the GEO2R online tool. Functional enrichment analysis was then performed using The Database for Annotation Visualization and Integrated Discovery online tool. Protein‑protein interaction (PPI) network analysis was subsequently performed using Search Tool for the Retrieval of Interaction Genes database and analyzed with Cytoscape software. A total of 257 DEGs were identified, including 243 upregulated genes and 14 downregulated ones. Genes in the human leukocyte antigen (HLA) family were most significantly differentially expressed. Enrichment analysis demonstrated that signaling pathways, including immune response, cell motion, adhesion and angiogenesis were associated with CRPS. PPI network analysis revealed that key genes, including early region 1A binding protein p300 (EP300), CREB‑binding protein (CREBBP), signal transducer and activator of transcription (STAT)3, STAT5A and integrin α M were associated with CRPS. The results suggest that the immune response may therefore serve an important role in CRPS development. In addition, genes in the HLA family, such as HLA‑DQB1 and HLA‑DRB1, may present potential biomarkers for the diagnosis of CRPS. Furthermore, EP300, its paralog CREBBP, and the STAT family genes, STAT3 and STAT5 may be important in the development of CRPS.
Chai, Wenbo; Jiang, Pengfei; Huang, Guoyu; Jiang, Haiyang; Li, Xiaoyu
The TCP family is a group of plant-specific transcription factors. TCP genes encode proteins harboring bHLH structure, which is implicated in DNA binding and protein-protein interactions and known as the TCP domain. TCP genes play important roles in plant development and have been evolutionarily and functionally elaborated in various plants, however, no overall phylogenetic analysis or expression profiling of TCP genes in Zea mays has been reported. In the present study, a systematic analysis of molecular evolution and functional prediction of TCP family genes in maize ( Z . mays L.) has been conducted. We performed a genome-wide survey of TCP genes in maize, revealing the gene structure, chromosomal location and phylogenetic relationship of family members. Microsynteny between grass species and tissue-specific expression profiles were also investigated. In total, 29 TCP genes were identified in the maize genome, unevenly distributed on the 10 maize chromosomes. Additionally, ZmTCP genes were categorized into nine classes based on phylogeny and purifying selection may largely be responsible for maintaining the functions of maize TCP genes. What's more, microsynteny analysis suggested that TCP genes have been conserved during evolution. Finally, expression analysis revealed that most TCP genes are expressed in the stem and ear, which suggests that ZmTCP genes influence stem and ear growth. This result is consistent with the previous finding that maize TCP genes represses the growth of axillary organs and enables the formation of female inflorescences. Altogether, this study presents a thorough overview of TCP family in maize and provides a new perspective on the evolution of this gene family. The results also indicate that TCP family genes may be involved in development stage in plant growing conditions. Additionally, our results will be useful for further functional analysis of the TCP gene family in maize.
Xie, P; Wan, X P; Bu, Z; Zou, X T
Ghrelin and cholecystokinin (CCK) are multifunctional peptides. In the current study, complete sequences of ghrelin (800 bp) and CCK (739 bp) were firstly cloned in Columba livia by using rapid amplification of cDNA ends (RACE) method. The open reading frames of ghrelin (351bp) and CCK (393bp) encoded 116 amino acids and 130 amino acids, respectively. Sequence comparison indicated that pigeon ghrelin and CCK shared high identity with those reported in other avian species. Quantitative real-time PCR analysis found that ghrelin and CCK mRNAs expressed in three intestinal segments of pigeon during development. Both ghrelin and CCK showed generally higher expressions at days posthatch than embryonic periods regardless of intestinal segments. In duodenum and ileum, the expressions of ghrelin and CCK mRNA reached the peak values at 8 d posthatch. Jejunum CCK mRNA level increased linearly after hatching, and reached the highest point at posthatch 28 d. Based on documented effects of long chain fatty acids (LCFAs) on pigeon ghrelin and CCK expression were also investigated in vitro. Higher concentrations (50 μM or 250 μM) of linoleic acid, α-linolenic acid or arachidonic acid can significantly increase ghrelin mRNA level in pigeon jejunum. However, for oleic acid, the induction of ghrelin gene expressions needed a lower concentration (5 μM). 5 μM of linoleic acid, α-linolenic acid or arachidonic acid and 250 μM palmitic acid repressed CCK expression significantly. A higher concentration (250 μM) of oleic acid or α-linolenic acid can up-regulate CCK mRNA level significantly. Our results indicated that ghrelin and CCK may act key functions in pigeon intestine development and their expressions could be regulated by LCFAs. © 2016 Poultry Science Association Inc.
Anna A. Igolkina
Full Text Available Schizophrenia (SCZ is a psychiatric disorder of unknown etiology. There is evidence suggesting that aberrations in neurodevelopment are a significant attribute of schizophrenia pathogenesis and progression. To identify biologically relevant molecular abnormalities affecting neurodevelopment in SCZ we used cultured neural progenitor cells derived from olfactory neuroepithelium (CNON cells. Here, we tested the hypothesis that variance in gene expression differs between individuals from SCZ and control groups. In CNON cells, variance in gene expression was significantly higher in SCZ samples in comparison with control samples. Variance in gene expression was enriched in five molecular pathways: serine biosynthesis, PI3K-Akt, MAPK, neurotrophin and focal adhesion. More than 14% of variance in disease status was explained within the logistic regression model (C-value = 0.70 by predictors accounting for gene expression in 69 genes from these five pathways. Structural equation modeling (SEM was applied to explore how the structure of these five pathways was altered between SCZ patients and controls. Four out of five pathways showed differences in the estimated relationships among genes: between KRAS and NF1, and KRAS and SOS1 in the MAPK pathway; between PSPH and SHMT2 in serine biosynthesis; between AKT3 and TSC2 in the PI3K-Akt signaling pathway; and between CRK and RAPGEF1 in the focal adhesion pathway. Our analysis provides evidence that variance in gene expression is an important characteristic of SCZ, and SEM is a promising method for uncovering altered relationships between specific genes thus suggesting affected gene regulation associated with the disease. We identified altered gene-gene interactions in pathways enriched for genes with increased variance in expression in SCZ. These pathways and loci were previously implicated in SCZ, providing further support for the hypothesis that gene expression variance plays important role in the etiology
Rivka C Stone
Full Text Available Polymorphisms in the interferon regulatory factor 5 (IRF5 gene have been consistently replicated and shown to confer risk for or protection from the development of systemic lupus erythematosus (SLE. IRF5 expression is significantly upregulated in SLE patients and upregulation associates with IRF5-SLE risk haplotypes. IRF5 alternative splicing has also been shown to be elevated in SLE patients. Given that human IRF5 exists as multiple alternatively spliced transcripts with distinct function(s, it is important to determine whether the IRF5 transcript profile expressed in healthy donor immune cells is different from that expressed in SLE patients. Moreover, it is not currently known whether an IRF5-SLE risk haplotype defines the profile of IRF5 transcripts expressed. Using standard molecular cloning techniques, we identified and isolated 14 new differentially spliced IRF5 transcript variants from purified monocytes of healthy donors and SLE patients to generate an IRF5 variant transcriptome. Next-generation sequencing was then used to perform in-depth and quantitative analysis of full-length IRF5 transcript expression in primary immune cells of SLE patients and healthy donors by next-generation sequencing. Evidence for additional alternatively spliced transcripts was obtained from de novo junction discovery. Data from these studies support the overall complexity of IRF5 alternative splicing in SLE. Results from next-generation sequencing correlated with cloning and gave similar abundance rankings in SLE patients thus supporting the use of this new technology for in-depth single gene transcript profiling. Results from this study provide the first proof that 1 SLE patients express an IRF5 transcript signature that is distinct from healthy donors, 2 an IRF5-SLE risk haplotype defines the top four most abundant IRF5 transcripts expressed in SLE patients, and 3 an IRF5 transcript signature enables clustering of SLE patients with the H2 risk haplotype.
Ali, Hina; Liu, Yanhui; Azam, Syed Muhammad; Rahman, Zia Ur; Priyadarshani, S V G N; Li, Weimin; Huang, Xinyu; Hu, Bingyan; Xiong, Junjie; Ali, Umair; Qin, Yuan
Gene expression is regulated by transcription factors, which play many significant developmental processes. SQUAMOSA promoter-binding proteins (SBP) perform a variety of regulatory functions in leaf, flower, and fruit development, plant architecture, and sporogenesis. 16 SBP genes were identified in pineapple and were divided into four groups on basis of phylogenetic analysis. Five paralogs in pineapple for SBP genes were identified with Ka/Ks ratio varied from 0.20 for AcSBP14 and AcSBP15 to 0.36 for AcSBP6 and AcSBP16 , respectively. 16 SBP genes were located on 12 chromosomes out of 25 pineapple chromosomes with highly conserved protein sequence structures. The isoionic points of SBP ranged from 6.05 to 9.57, while molecular weight varied from 22.7 to 121.9 kD. Expression profiles of SBP genes revealed that AcSBP7 and AcSBP15 (leaf), AcSBP13 , AcSBP12 , AcSBP8 , AcSBP16 , AcSBP9 , and AcSBP11 (sepal), AcSBP6 , AcSBP4 , and AcSBP10 (stamen), AcSBP14 , AcSBP1 , and AcSBP5 (fruit) while the rest of genes showed low expression in studied tissues. Four genes, that is, AcSBP11 , AcSBP6 , AcSBP4 , and AcSBP12 , were highly expressed at 4°C, while AcSBP16 were upregulated at 45°C. RNA-Seq was validated through qRT-PCR for some genes. Salt stress-induced expression of two genes, that is, AcSBP7 and AcSBP14 , while in drought stress, AcSBP12 and AcSBP15 were highly expressed. Our study lays a foundation for further gene function and expression studies of SBP genes in pineapple.
Daryl G.S. Smith
Full Text Available Human serum albumin (HSA is a versatile and important protein for the pharmaceutical industry (Fanali et al., Mol. Aspects Med. 33(3 (2012 209–290. Due to the potential transmission of pathogens from plasma sourced albumin, numerous expression systems have been developed to produce recombinant HSA (rHSA (Chen et al., Biochim. Biophys. Acta (BBA—Gen. Subj. 1830(12 (2013 5515–5525; Kobayashi, Biologicals 34(1 (2006 55–59. Based on our previous study showing increased glycation of rHSA expressed in Asian rice (Frahm et al., J. Phys. Chem. B 116(15 (2012 4661–4670, both supplier-to-supplier and lot-to-lot variability of rHSAs from a number of expression systems were evaluated using reversed phase liquid chromatography linked with MS and MS/MS analyses. The data are associated with the research article ‘Determination of Supplier-to-Supplier and Lot-to-Lot Variability in Glycation of Recombinant Human Serum Albumin Expressed in Oryza sativa’ where further analysis of rHSA samples with additional biophysical methods can be found (Frahm et al., PLoS ONE 10(9 (2014 e109893. We determined that all rHSA samples expressed in rice showed elevated levels of arginine and lysine hexose glycation compared to rHSA expressed in yeast, suggesting that the extensive glycation of the recombinant proteins is a by-product of either the expression system or purification process and not a random occurrence.
Full Text Available Background/Aims: Osteosarcoma (OS is the most common primary malignant bone tumor in children and adolescents. However, the molecular mechanisms regulating osteosarcoma tumorigenesis and progression are still poorly understood. Circular RNAs (circRNAs have been identified as microRNA sponges and are involved in many important biological processes. This study aims to investigate the global changes in the expression pattern of circRNAs in osteosarcoma and provide a comprehensive understanding of differentially expressed circRNAs. Methods: Microarray based circRNA expression was determined in osteosarcoma cell lines and compared with hFOB1.19, which was used as the normal control. We confirmed the microarray data by real time-qPCR in both osteosarcoma cell lines and tissues. The circRNA/microRNA/mRNA interaction network was predicted using bioinformatics. Gene Ontology analysis and 4 annotation tools for pathway analysis (KEGG, Biocarta, PANTHER and Reactome were used to predict the functions of differentially expressed circRNAs. Results: We revealed a number of differentially expressed circRNAs and 12 of them were confirmed, which suggests a potential role of circRNAs in OS. Among these differentially expressed circRNAs, hsa_circRNA_103801 was up-regulated in both osteosarcoma cell lines and tissues, while hsa_circRNA_104980 was down-regulated. The most likely potential target miRNAs for hsa_circRNA_103801 include hsa-miR-370-3p, hsa-miR-338-3p and hsa-miR-877-3p, while the most potential target miRNAs of hsa_circRNA_104980 consist of hsa-miR-1298-3p and hsa-miR-660-3p. Functional analysis found that hsa_circRNA_103801 was involved in pathways in cancer, such as the HIF-1, VEGF and angiogenesis pathway, the Rap1 signaling pathway and the PI3K-Akt signaling pathway, while hsa_circRNA_104980 was related to some pathways such as the tight junction pathway. Conclusions: This study has identified the comprehensive expression profile of circRNAs in
Full Text Available The biological function of human ovaries declines with age. To identify the potential molecular changes in ovarian aging, we performed genome-wide gene expression analysis by microarray of ovaries from young, middle-aged, and old rhesus monkeys. Microarray data was validated by quantitative real-time PCR. Results showed that a total of 503 (60 upregulated, 443 downregulated and 84 (downregulated genes were differentially expressed in old ovaries compared to young and middle-aged groups, respectively. No difference in gene expression was found between middle-aged and young groups. Differentially expressed genes were mainly enriched in cell and organelle, cellular and physiological process, binding, and catalytic activity. These genes were primarily associated with KEGG pathways of cell cycle, DNA replication and repair, oocyte meiosis and maturation, MAPK, TGF-beta, and p53 signaling pathway. Genes upregulated were involved in aging, defense response, oxidation reduction, and negative regulation of cellular process; genes downregulated have functions in reproduction, cell cycle, DNA and RNA process, macromolecular complex assembly, and positive regulation of macromolecule metabolic process. These findings show that monkey ovary undergoes substantial change in global transcription with age. Gene expression profiles are useful in understanding the mechanisms underlying ovarian aging and age-associated infertility in primates.
Guo, Chunlei; Guo, Rongrong; Xu, Xiaozhao; Gao, Min; Li, Xiaoqin; Song, Junyang; Zheng, Yi; Wang, Xiping
WRKY proteins comprise a large family of transcription factors that play important roles in plant defence regulatory networks, including responses to various biotic and abiotic stresses. To date, no large-scale study of WRKY genes has been undertaken in grape (Vitis vinifera L.). In this study, a total of 59 putative grape WRKY genes (VvWRKY) were identified and renamed on the basis of their respective chromosome distribution. A multiple sequence alignment analysis using all predicted grape WRKY genes coding sequences, together with those from Arabidopsis thaliana and tomato (Solanum lycopersicum), indicated that the 59 VvWRKY genes can be classified into three main groups (I-III). An evaluation of the duplication events suggested that several WRKY genes arose before the divergence of the grape and Arabidopsis lineages. Moreover, expression profiles derived from semiquantitative PCR and real-time quantitative PCR analyses showed distinct expression patterns in various tissues and in response to different treatments. Four VvWRKY genes showed a significantly higher expression in roots or leaves, 55 responded to varying degrees to at least one abiotic stress treatment, and the expression of 38 were altered following powdery mildew (Erysiphe necator) infection. Most VvWRKY genes were downregulated in response to abscisic acid or salicylic acid treatments, while the expression of a subset was upregulated by methyl jasmonate or ethylene treatments.
Gesing, Stefan; Schindler, Daniel; Nowrousian, Minou
Ascomycetes differentiate four major morphological types of fruiting bodies (apothecia, perithecia, pseudothecia and cleistothecia) that are derived from an ancestral fruiting body. Thus, fruiting body differentiation is most likely controlled by a set of common core genes. One way to identify such genes is to search for genes with evolutionary conserved expression patterns. Using suppression subtractive hybridization (SSH), we selected differentially expressed transcripts in Pyronema confluens (Pezizales) by comparing two cDNA libraries specific for sexual and for vegetative development, respectively. The expression patterns of selected genes from both libraries were verified by quantitative real time PCR. Expression of several corresponding homologous genes was found to be conserved in two members of the Sordariales (Sordaria macrospora and Neurospora crassa), a derived group of ascomycetes that is only distantly related to the Pezizales. Knockout studies with N. crassa orthologues of differentially regulated genes revealed a functional role during fruiting body development for the gene NCU05079, encoding a putative MFS peptide transporter. These data indicate conserved gene expression patterns and a functional role of the corresponding genes during fruiting body development; such genes are candidates of choice for further functional analysis. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Full Text Available Background/Aims: Our aim was to explore the molecular mechanism underlying development of IgA nephropathy and discover candidate agents for IgA nephropathy. Methods: The differentially expressed genes (DEGs between patients with IgA nephropathy and normal controls were identified by the data of GSE35488 downloaded from GEO (Gene Expression Omnibus database. The co-expressed gene pairs among DEGs were screened to construct the gene-gene interaction network. Gene Ontology (GO enrichment analysis was performed to analyze the functions of DEGs. The biologically active small molecules capable of targeting IgA nephropathy were identified using the Connectivity Map (cMap database. Results: A total of 55 genes involved in response to organic substance, transcription factor activity and response to steroid hormone stimulus were identified to be differentially expressed in IgA nephropathy patients compared to healthy individuals. A network with 45 co-expressed gene pairs was constructed. DEGs in the network were significantly enriched in response to organic substance. Additionally, a group of small molecules were identified, such as doxorubicin and thapsigargin. Conclusion: Our work provided a systematic insight in understanding the mechanism of IgA nephropathy. Small molecules such as thapsigargin might be potential candidate agents for the treatment of IgA nephropathy.
Kwak, In Hae; Son, Minjun [Physics Department, University of Florida, P.O. Box 118440, Gainesville, FL 32611-8440 (United States); Hagen, Stephen J., E-mail: firstname.lastname@example.org [Physics Department, University of Florida, P.O. Box 118440, Gainesville, FL 32611-8440 (United States)
Highlights: Black-Right-Pointing-Pointer We present a method for extracting gene expression data from images of bacterial cells. Black-Right-Pointing-Pointer The method does not employ cell segmentation and does not require high magnification. Black-Right-Pointing-Pointer Fluorescence and phase contrast images of the cells are correlated through the physics of phase contrast. Black-Right-Pointing-Pointer We demonstrate the method by characterizing noisy expression of comX in Streptococcus mutans. -- Abstract: Studies of stochasticity in gene expression typically make use of fluorescent protein reporters, which permit the measurement of expression levels within individual cells by fluorescence microscopy. Analysis of such microscopy images is almost invariably based on a segmentation algorithm, where the image of a cell or cluster is analyzed mathematically to delineate individual cell boundaries. However segmentation can be ineffective for studying bacterial cells or clusters, especially at lower magnification, where outlines of individual cells are poorly resolved. Here we demonstrate an alternative method for analyzing such images without segmentation. The method employs a comparison between the pixel brightness in phase contrast vs fluorescence microscopy images. By fitting the correlation between phase contrast and fluorescence intensity to a physical model, we obtain well-defined estimates for the different levels of gene expression that are present in the cell or cluster. The method reveals the boundaries of the individual cells, even if the source images lack the resolution to show these boundaries clearly.
Kwak, In Hae; Son, Minjun; Hagen, Stephen J.
Highlights: ► We present a method for extracting gene expression data from images of bacterial cells. ► The method does not employ cell segmentation and does not require high magnification. ► Fluorescence and phase contrast images of the cells are correlated through the physics of phase contrast. ► We demonstrate the method by characterizing noisy expression of comX in Streptococcus mutans. -- Abstract: Studies of stochasticity in gene expression typically make use of fluorescent protein reporters, which permit the measurement of expression levels within individual cells by fluorescence microscopy. Analysis of such microscopy images is almost invariably based on a segmentation algorithm, where the image of a cell or cluster is analyzed mathematically to delineate individual cell boundaries. However segmentation can be ineffective for studying bacterial cells or clusters, especially at lower magnification, where outlines of individual cells are poorly resolved. Here we demonstrate an alternative method for analyzing such images without segmentation. The method employs a comparison between the pixel brightness in phase contrast vs fluorescence microscopy images. By fitting the correlation between phase contrast and fluorescence intensity to a physical model, we obtain well-defined estimates for the different levels of gene expression that are present in the cell or cluster. The method reveals the boundaries of the individual cells, even if the source images lack the resolution to show these boundaries clearly.
Daniel L Roden
Full Text Available Complex human diseases can show significant heterogeneity between patients with the same phenotypic disorder. An outlier detection strategy was developed to identify variants at the level of gene transcription that are of potential biological and phenotypic importance. Here we describe a graphical software package (z-score outlier detection (ZODET that enables identification and visualisation of gross abnormalities in gene expression (outliers in individuals, using whole genome microarray data. Mean and standard deviation of expression in a healthy control cohort is used to detect both over and under-expressed probes in individual test subjects. We compared the potential of ZODET to detect outlier genes in gene expression datasets with a previously described statistical method, gene tissue index (GTI, using a simulated expression dataset and a publicly available monocyte-derived macrophage microarray dataset. Taken together, these results support ZODET as a novel approach to identify outlier genes of potential pathogenic relevance in complex human diseases. The algorithm is implemented using R packages and Java.The software is freely available from http://www.ucl.ac.uk/medicine/molecular-medicine/publications/microarray-outlier-analysis.
Walton, Thomas J; Li, Geng; McCulloch, Thomas A; Seth, Rashmi; Powe, Desmond G; Bishop, Michael C; Rees, Robert C
Real-time quantitative RT-PCR analysis of laser microdissected tissue is considered the most accurate technique for determining tissue gene expression. The discovery of estrogen receptor beta (ERbeta) has focussed renewed interest on the role of estrogen receptors in prostate cancer, yet few studies have utilized the technique to analyze estrogen receptor gene expression in prostate cancer. Fresh tissue was obtained from 11 radical prostatectomy specimens and from 6 patients with benign prostate hyperplasia. Pure populations of benign and malignant prostate epithelium were laser microdissected, followed by RNA isolation and electrophoresis. Quantitative RT-PCR was performed using primers for androgen receptor (AR), estrogen receptor beta (ERbeta), estrogen receptor alpha (ERalpha), progesterone receptor (PGR) and prostate specific antigen (PSA), with normalization to two housekeeping genes. Differences in gene expression were analyzed using the Mann-Whitney U-test. Correlation coefficients were analyzed using Spearman's test. Significant positive correlations were seen when AR and AR-dependent PSA, and ERalpha and ERalpha-dependent PGR were compared, indicating a representative population of RNA transcripts. ERbeta gene expression was significantly over-expressed in the cancer group compared with benign controls (P cancer group (P prostate cancer specimens. In concert with recent studies the findings suggest differential production of ERbeta splice variants, which may play important roles in the genesis of prostate cancer. (c) 2009 Wiley-Liss, Inc.
Bi, Dongbin; Ning, Hao; Liu, Shuai; Que, Xinxiang; Ding, Kejia
To explore molecular mechanisms of bladder cancer (BC), network strategy was used to find biomarkers for early detection and diagnosis. The differentially expressed genes (DEGs) between bladder carcinoma patients and normal subjects were screened using empirical Bayes method of the linear models for microarray data package. Co-expression networks were constructed by differentially co-expressed genes and links. Regulatory impact factors (RIF) metric was used to identify critical transcription factors (TFs). The protein-protein interaction (PPI) networks were constructed by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and clusters were obtained through molecular complex detection (MCODE) algorithm. Centralities analyses for complex networks were performed based on degree, stress and betweenness. Enrichment analyses were performed based on Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Co-expression networks and TFs (based on expression data of global DEGs and DEGs in different stages and grades) were identified. Hub genes of complex networks, such as UBE2C, ACTA2, FABP4, CKS2, FN1 and TOP2A, were also obtained according to analysis of degree. In gene enrichment analyses of global DEGs, cell adhesion, proteinaceous extracellular matrix and extracellular matrix structural constituent were top three GO terms. ECM-receptor interaction, focal adhesion, and cell cycle were significant pathways. Our results provide some potential underlying biomarkers of BC. However, further validation is required and deep studies are needed to elucidate the pathogenesis of BC. Copyright © 2015 Elsevier Ltd. All rights reserved.
Full Text Available Background: Microarray technology has become highly valuable for identifying complex global changes in gene expression patterns. The assignment of functional information to these complex patterns remains a challenging task in effectively interpreting data and correlating results from across experiments, projects and laboratories. Methods which allow the rapid and robust evaluation of multiple functional hypotheses increase the power of individual researchers to data mine gene expression data more efficiently.Results: We have developed (gene set matrix analysis GSMA as a useful method for the rapid testing of group-wise up- or downregulation of gene expression simultaneously for multiple lists of genes (gene sets against entire distributions of gene expression changes (datasets for single or multiple experiments. The utility of GSMA lies in its flexibility to rapidly poll gene sets related by known biological function or as designated solely by the end-user against large numbers of datasets simultaneously.Conclusions: GSMA provides a simple and straightforward method for hypothesis testing in which genes are tested by groups across multiple datasets for patterns of expression enrichment.
Do, Kyong-Tak; Cho, Hyun-Woo; Badrinath, Narayanasamy; Park, Jeong-Woong; Choi, Jae-Young; Chung, Young-Hwa; Lee, Hak-Kyo; Song, Ki-Duk; Cho, Byung-Wook
Since ancient days, domestic horses have been closely associated with human civilization. Today, horse racing is an important industry. Various genes involved in energy production and muscle contraction are differentially regulated during a race. Among them, creatine kinase (CK) is well known for its regulation of energy preservation in animal cells. CK is an iso-enzyme, encoded by different genes and expressed in skeletal muscle, heart, brain and leucocytes. We confirmed that the expression of CK-M significantly increased in the blood after a 30 minute exercise period, while no considerable change was observed in skeletal muscle. Analysis of various tissues showed an ubiquitous expression of the CK-M gene in the horse; CK-M mRNA expression was predominant in the skeletal muscle and the cardiac muscle compared to other tissues. An evolutionary study by synonymous and non-synonymous single nucleotide polymorphism ratio of CK-M gene revealed a positive selection that was conserved in the horse. More studies are warranted in order to develop the expression of CK-M gene as a biomarker in blood of thoroughbred horses.
Jianxin, L.; Huaqiao, D.; Weiyong, W.; Danqing, T.
ACC oxidase is the last key enzyme of ethylene synthesis pathway, while ethylene is a key factor affecting flowering in ornamental bromeliad. To understand ACC oxidase gene's characteristics and its effect on ornamental bromeliad flowering, we cloned 1504bp full-length cDNA sequence (GenBank: JX972145) and 2546bp corresponding genomic sequence (GenBank: JX972146)of GoACO1 (ACC oxidase gene) from Guzmania variety: Ostara. Prokaryotic expression study showed that expression of GoACO1 can produced a 41 KD protein precipitation in Escherichia coli DE3(BL-21); Real-time quantitative analysis showed that GoACO1 can express in all tested tissues including floral organ, bract, leaf and scape, and expression quantity in bract was the highest. Through constructing plant overexpression vector, transforming into Arabidopsis thaliana, and investigating blossom character of T2 generation seeds, we found that first flowering time of the goal Arabidopsis thaliana was 1.5 days earlier, and their peak flowering time(the number of flowering more than 50%) was 1.8 days earlier, compared with wild type one. Taken together, our results suggested that GoACO1can express in all kinds of tissues and seems to promote Arabidopsis thaliana flowering earlier. (author)
Zai, W S; Miao, L X; Xiong, Z L; Zhang, H L; Ma, Y R; Li, Y L; Chen, Y B; Ye, S G
Heat shock protein 90 (Hsp90) is a protein produced by plants in response to adverse environmental stresses. In this study, we identified and analyzed Hsp90 gene family members using a bioinformatic method based on genomic data from tomato (Solanum lycopersicum L.). The results illustrated that tomato contains at least 7 Hsp90 genes distributed on 6 chromosomes; protein lengths ranged from 267-794 amino acids. Intron numbers ranged from 2-19 in the genes. The phylogenetic tree revealed that Hsp90 genes in tomato (Solanum lycopersicum L.), rice (Oryza sativa L.), and Arabidopsis (Arabidopsis thaliana L.) could be divided into 5 groups, which included 3 pairs of orthologous genes and 4 pairs of paralogous genes. Expression analysis of RNA-sequence data showed that the Hsp90-1 gene was specifically expressed in mature fruits, while Hsp90-5 and Hsp90-6 showed opposite expression patterns in various tissues of cultivated and wild tomatoes. The expression levels of the Hsp90-1, Hsp90-2, and Hsp90- 3 genes in various tissues of cultivated tomatoes were high, while both the expression levels of genes Hsp90-3 and Hsp90-4 were low. Additionally, quantitative real-time polymerase chain reaction showed that these genes were involved in the responses to yellow leaf curl virus in tomato plant leaves. Our results provide a foundation for identifying the function of the Hsp90 gene in tomato.
Long, Ling-Li; Han, Ying-Li; Sheng, Zhang; Du, Chen; Wang, You-Fa; Zhu, Jun-Quan
The gene encoding heat shock protein 70 (HSP70) was identified in Octopus tankahkeei by homologous cloning and rapid amplification of cDNA ends (RACE). The full-length cDNA (2471 bp) consists of a 5'-untranslated region (UTR) (89 bp), a 3'-UTR (426 bp), and an open reading frame (1956 bp) that encodes 651 amino acid residues with a predicted molecular mass of 71.8 kDa and an isoelectric point of 5.34. Based on the amino acid sequence analysis and multiple sequence alignment, this cDNA is a member of cytoplasmic hsp70 subfamily of the hsp70 family and was designated as ot-hsp70. Tissue expression analysis showed that HSP70 expression is highest in the testes when all examined organs were compared. Immunohistochemistry analysis, together with hematoxylin-eosin staining, revealed that the HSP70 protein was expressed in all spermatogenic cells, but not in fibroblasts. In addition, O. tankahkeei were heat challenged by exposure to 32 °C seawater for 2 h, then returned to 13 °C for various recovery time (0-24 h). Relative expression of ot-hsp70 mRNA in the testes was measured at different time points post-challenge by quantitative real-time PCR. A clear time-dependent mRNA expression of ot-hsp70 after thermal stress indicates that the HSP70 gene is inducible. Ultrastructural changes of the heat-stressed testis were observed by transmission electron microscopy. We suggest that HSP70 plays an important role in spermatogenesis and testis protection against thermal stress in O. tankahkeei. Copyright © 2015 Elsevier Inc. All rights reserved.
Je Seon Song
Full Text Available There are histological and functional differences between human deciduous and permanent periodontal ligament (PDL tissues. The aim of this study was to determine the differences between these two types of tissue at the molecular level by comparing their gene expression patterns. PDL samples were obtained from permanent premolars (n = 38 and anterior deciduous teeth (n = 31 extracted from 40 healthy persons. Comparative cDNA microarray analysis revealed several differences in gene expression between the deciduous and permanent PDL tissues. These findings were verified by qRT-PCR (quantitative reverse-transcription-polymerase chain reaction analysis, and the areas where genes are expressed were revealed by immunohistochemical staining. The expressions of 21 genes were up-regulated in deciduous relative to PDL tissues, and those of 30 genes were up-regulated in permanent relative to deciduous PDL tissues. The genes that were up-regulated in deciduous PDL tissues were those involved in the formation of the extracellular matrix (LAMC2, LAMB3, and COMP, tissue development (IGF2BP, MAB21L2, and PAX3, and inflammatory or immune reactions leading to tissue degradation (IL1A, CCL21, and CCL18. The up-regulated genes in permanent PDL tissues were related to tissue degradation (IL6 and ADAMTS18, myocontraction (PDE3B, CASQ2, and MYH10, and neurological responses (FOS, NCAM2, SYT1, SLC22A3, DOCK3, LRRTM1, LRRTM3, PRSS12, and ARPP21. The analysis of differential gene expressions between deciduous and permanent PDL tissues aids our understanding of histological and functional differences between them at the molecular level.
Song, Je Seon; Hwang, Dong Hwan; Kim, Seong-Oh; Jeon, Mijeong; Choi, Byung-Jai; Jung, Han-Sung; Moon, Seok Jun; Park, Wonse; Choi, Hyung-Jun
There are histological and functional differences between human deciduous and permanent periodontal ligament (PDL) tissues. The aim of this study was to determine the differences between these two types of tissue at the molecular level by comparing their gene expression patterns. PDL samples were obtained from permanent premolars (n = 38) and anterior deciduous teeth (n = 31) extracted from 40 healthy persons. Comparative cDNA microarray analysis revealed several differences in gene expression between the deciduous and permanent PDL tissues. These findings were verified by qRT-PCR (quantitative reverse-transcription-polymerase chain reaction) analysis, and the areas where genes are expressed were revealed by immunohistochemical staining. The expressions of 21 genes were up-regulated in deciduous relative to PDL tissues, and those of 30 genes were up-regulated in permanent relative to deciduous PDL tissues. The genes that were up-regulated in deciduous PDL tissues were those involved in the formation of the extracellular matrix (LAMC2, LAMB3, and COMP), tissue development (IGF2BP, MAB21L2, and PAX3), and inflammatory or immune reactions leading to tissue degradation (IL1A, CCL21, and CCL18). The up-regulated genes in permanent PDL tissues were related to tissue degradation (IL6 and ADAMTS18), myocontraction (PDE3B, CASQ2, and MYH10), and neurological responses (FOS, NCAM2, SYT1, SLC22A3, DOCK3, LRRTM1, LRRTM3, PRSS12, and ARPP21). The analysis of differential gene expressions between deciduous and permanent PDL tissues aids our understanding of histological and functional differences between them at the molecular level.
Full Text Available Gastric cancer continues to be one of the deadliest cancers in the world and therefore identification of new drugs targeting this type of cancer is thus of significant importance. The purpose of this study was to identify and validate a therapeutic agent which might improve the outcomes for gastric cancer patients in the future.Using microarray technology, we generated a gene expression profile of human gastric cancer-specific genes from human gastric cancer tissue samples. We used this profile in the Broad Institute's Connectivity Map analysis to identify candidate therapeutic compounds for gastric cancer. We found the histone deacetylase inhibitor vorinostat as the lead compound and thus a potential therapeutic drug for gastric cancer. Vorinostat induced both apoptosis and autophagy in gastric cancer cell lines. Pharmacological and genetic inhibition of autophagy however, increased the therapeutic efficacy of vorinostat, indicating that a combination of vorinostat with autophagy inhibitors may therapeutically be more beneficial. Moreover, gene expression analysis of gastric cancer identified a collection of genes (ITGB5, TYMS, MYB, APOC1, CBX5, PLA2G2A, and KIF20A whose expression was elevated in gastric tumor tissue and downregulated more than 2-fold by vorinostat treatment in gastric cancer cell lines. In contrast, SCGB2A1, TCN1, CFD, APLP1, and NQO1 manifested a reversed pattern.We showed that analysis of gene expression signature may represent an emerging approach to discover therapeutic agents for gastric cancer, such as vorinostat. The observation of altered gene expression after vorinostat treatment may provide the clue to identify the molecular mechanism of vorinostat and those patients likely to benefit from vorinostat treatment.
Luo, Jun; Zhou, Linlin; Wang, Hongren; Qin, Zhen; Xiang, Li; Zhu, Jie; Huang, Xiaojun; Yang, Yuan; Li, Wanyi; Wang, Baoning; Li, Mingyuan
Influenza A virus (IAV) and Streptococcus pneumoniae (SP) are two major upper respiratory tract pathogens that can also cause infection in polarized bronchial epithelial cells to exacerbate disease in coinfected individuals which may result in significant morbidity. However, the underlying molecular mechanism is poorly understood. Here, we employed BALB/c ByJ mice inflected with SP, IAV, IAV followed by SP (IAV+SP) and PBS (Control) as models to survey the global gene expression using digital gene expression (DGE) profiling. We attempt to gain insights into the underlying genetic basis of this synergy at the expression level. Gene expression profiles were obtain using the Illimina/Hisseq sequencing technique, and further analyzed by enrichment analysis of Gene Ontology (GO) and Pathway function. The hematoxylin-eosin (HE) staining revealed different tissue changes in groups during which IAV+SP group showed the most severe cell apoptosis. Compared with Control, a total of 2731, 3221 and 3946 differentially expressed genes (DEGs) were detected in SP, IAV and IAV+SP respectively. Besides, sixty-two GO terms were identified by Gene Ontology functional enrichment analysis, such as cell killing, biological regulation, response to stimulus, signaling, biological adhesion, enzyme regulator activity, receptor regulator activity and translation regulator activity. Pathway significant enrichment analysis indicated the dysregulation of multiple pathways, including apoptosis pathway. Among these, five selected genes were further verified by quantitative reverse transcription-polymerase chain reaction (qRT-PCR). This study shows that infection with SP, IAV or IAV+SP induces apoptosis with different degrees which might provide insights into the molecular mechanisms to facilitate further research.
V. V. Volkomorov
Full Text Available Introduction. Searching for specific and sensitive molecular tumor markers is one of the important tasks of modern oncology. These markers can be used for early tumor diagnosis and prognosis as well as for prediction of therapeutic response, estimation of tumor volume or to assess disease recurrence through monitoring. Gene expression data base mining followed by experimental validation of results obtained is one of the promising approaches for searching of that kind.Objective: to identify several membrane proteins which can be used for serum diagnosis of intestinal type of gastric adenocarcinoma.Materials and methods. We used bioinformatic-driven search using Gene Ontology and The Cancer Genome Atlas (TCGA data to identify mRNA up-regulated in gastric cancer (GC. Then, the expression levels of the mRNAs in 55 pare clinical specimens were investigated using reverse transcription polymerase chain reaction.Results. Comparative analysis of the mRNA levels in normal and tumor tissues using a new bioinformatics algorithm allowed to identify 3 high-copy transcripts (SULF1, PMEPA1 and SPARC, intracellular content of which markedly increased in GC. Expression analysis of these genes in clinical specimens showed significantly higher mRNA levels of PMEPA1 and SPARC in tumor as compared to normal gastric tissue. Interestingly more than twofold increase in expression level of these genes was observed in 75 % of intestinal-type GC. The same results were found only in 25 and 38 % of diffuse-type GC respectively.Conclusions. As a result of original bioinforamtic analysis using TCGA data base two genes (PMEPA1 and SPARC were shown to be significantly upregulated in intestinal-type gastric adenocarcinoma. The findings show the importance of further investigation to clarify the clinical value of their expression level in stomach tumors as well as their role in carcinogenesis.
Choi, Woonyoung; Park, Yun-Yong; Kim, KyoungHyun; Kim, Sang-Bae; Lee, Ju-Seog; Mills, Gordon B.; Cho, Jae Yong
Background Gastric cancer continues to be one of the deadliest cancers in the world and therefore identification of new drugs targeting this type of cancer is thus of significant importance. The purpose of this study was to identify and validate a therapeutic agent which might improve the outcomes for gastric cancer patients in the future. Methodology/Principal Findings Using microarray technology, we generated a gene expression profile of human gastric cancer–specific genes from human gastric cancer tissue samples. We used this profile in the Broad Institute's Connectivity Map analysis to identify candidate therapeutic compounds for gastric cancer. We found the histone deacetylase inhibitor vorinostat as the lead compound and thus a potential therapeutic drug for gastric cancer. Vorinostat induced both apoptosis and autophagy in gastric cancer cell lines. Pharmacological and genetic inhibition of autophagy however, increased the therapeutic efficacy of vorinostat, indicating that a combination of vorinostat with autophagy inhibitors may therapeutically be more beneficial. Moreover, gene expression analysis of gastric cancer identified a collection of genes (ITGB5, TYMS, MYB, APOC1, CBX5, PLA2G2A, and KIF20A) whose expression was elevated in gastric tumor tissue and downregulated more than 2-fold by vorinostat treatment in gastric cancer cell lines. In contrast, SCGB2A1, TCN1, CFD, APLP1, and NQO1 manifested a reversed pattern. Conclusions/Significance We showed that analysis of gene expression signature may represent an emerging approach to discover therapeutic agents for gastric cancer, such as vorinostat. The observation of altered gene expression after vorinostat treatment may provide the clue to identify the molecular mechanism of vorinostat and those patients likely to benefit from vorinostat treatment. PMID:21931799
Zhou, Xiaohong; Wang, Ke; Lv, Dongwen; Wu, Chengjun; Li, Jiarui; Zhao, Pei; Lin, Zhishan; Du, Lipu; Yan, Yueming; Ye, Xingguo
Agrobacterium-mediated plant transformation is an extremely complex and evolved process involving genetic determinants of both the bacteria and the host plant cells. However, the mechanism of the determinants remains obscure, especially in some cereal crops such as wheat, which is recalcitrant for Agrobacterium-mediated transformation. In this study, differentially expressed genes (DEGs) and differentially expressed proteins (DEPs) were analyzed in wheat callus cells co-cultured with Agrobacterium by using RNA sequencing (RNA-seq) and two-dimensional electrophoresis (2-DE) in conjunction with mass spectrometry (MS). A set of 4,889 DEGs and 90 DEPs were identified, respectively. Most of them are related to metabolism, chromatin assembly or disassembly and immune defense. After comparative analysis, 24 of the 90 DEPs were detected in RNA-seq and proteomics datasets simultaneously. In addition, real-time RT-PCR experiments were performed to check the differential expression of the 24 genes, and the results were consistent with the RNA-seq data. According to gene ontology (GO) analysis, we found that a big part of these differentially expressed genes were related to the process of stress or immunity response. Several putative determinants and candidate effectors responsive to Agrobacterium mediated transformation of wheat cells were discussed. We speculate that some of these genes are possibly related to Agrobacterium infection. Our results will help to understand the interaction between Agrobacterium and host cells, and may facilitate developing efficient transformation strategies in cereal crops. PMID:24278131
Ye, Yun; Li, Su-Liang; Wang, Yao; Yao, Yang; Wang, Juan; Ma, Yue-Yun; Hao, Xiao-Ke
There are a number of studies which show that expression of CD147 is increased significantly in prostate cancer (PCa). However, conflicting conclusions have also been reported by other researchers lately. In order to arrive at a clear conclusion, a meta-analysis of eligible studies was conducted. We searched PubMed, MEDLINE, Cochrane Library, and the China National Knowledge Infrastructure databases to identify all the published case-control studies on the relationship between the expression of CD147 and PCa until February 2016. In the end, a total of 930 patients in eight studies were included in the meta-analysis. CD147 expression in the PCa patients increased significantly (odds ratio [OR], 4.65; 95% confidence interval [CI], 3.52-6.14; Z=10.79; PCD147 was associated with PCa among the Asian population (OR, 21.01; 95% CI, 12.88-34.28; Z=12.19; PCD147 was related to PCa, significant heterogeneity was not found between Asian studies, and the result became more significant. The positive expression of CD147 was significantly related to the clinicopathological characteristics of PCa. This suggests that CD147 plays an essential role in poor prognosis and recurrence prediction.
Full Text Available Agrobacterium-mediated plant transformation is an extremely complex and evolved process involving genetic determinants of both the bacteria and the host plant cells. However, the mechanism of the determinants remains obscure, especially in some cereal crops such as wheat, which is recalcitrant for Agrobacterium-mediated transformation. In this study, differentially expressed genes (DEGs and differentially expressed proteins (DEPs were analyzed in wheat callus cells co-cultured with Agrobacterium by using RNA sequencing (RNA-seq and two-dimensional electrophoresis (2-DE in conjunction with mass spectrometry (MS. A set of 4,889 DEGs and 90 DEPs were identified, respectively. Most of them are related to metabolism, chromatin assembly or disassembly and immune defense. After comparative analysis, 24 of the 90 DEPs were detected in RNA-seq and proteomics datasets simultaneously. In addition, real-time RT-PCR experiments were performed to check the differential expression of the 24 genes, and the results were consistent with the RNA-seq data. According to gene ontology (GO analysis, we found that a big part of these differentially expressed genes were related to the process of stress or immunity response. Several putative determinants and candidate effectors responsive to Agrobacterium mediated transformation of wheat cells were discussed. We speculate that some of these genes are possibly related to Agrobacterium infection. Our results will help to understand the interaction between Agrobacterium and host cells, and may facilitate developing efficient transformation strategies in cereal crops.
Wu, Shuang; Wu, Hulin
One of the fundamental problems in time course gene expression data analysis is to identify genes associated with a biological process or a particular stimulus of interest, like a treatment or virus infection. Most of the existing methods for this problem are designed for data with longitudinal replicates. But in reality, many time course gene experiments have no replicates or only have a small number of independent replicates. We focus on the case without replicates and propose a new method for identifying differentially expressed genes by incorporating the functional principal component analysis (FPCA) into a hypothesis testing framework. The data-driven eigenfunctions allow a flexible and parsimonious representation of time course gene expression trajectories, leaving more degrees of freedom for the inference compared to that using a prespecified basis. Moreover, the information of all genes is borrowed for individual gene inferences. The proposed approach turns out to be more powerful in identifying time course differentially expressed genes compared to the existing methods. The improved performance is demonstrated through simulation studies and a real data application to the Saccharomyces cerevisiae cell cycle data.
Full Text Available BACKGROUND: Skeletal muscle is a complex, versatile tissue composed of a variety of functionally diverse fiber types. Although the biochemical, structural and functional properties of myofibers have been the subject of intense investigation for the last decades, understanding molecular processes regulating fiber type diversity is still complicated by the heterogeneity of cell types present in the whole muscle organ. METHODOLOGY/PRINCIPAL FINDINGS: We have produced a first catalogue of genes expressed in mouse slow-oxidative (type 1 and fast-glycolytic (type 2B fibers through transcriptome analysis at the single fiber level (microgenomics. Individual fibers were obtained from murine soleus and EDL muscles and initially classified by myosin heavy chain isoform content. Gene expression profiling on high density DNA oligonucleotide microarrays showed that both qualitative and quantitative improvements were achieved, compared to results with standard muscle homogenate. First, myofiber profiles were virtually free from non-muscle transcriptional activity. Second, thousands of muscle-specific genes were identified, leading to a better definition of gene signatures in the two fiber types as well as the detection of metabolic and signaling pathways that are differentially activated in specific fiber types. Several regulatory proteins showed preferential expression in slow myofibers. Discriminant analysis revealed novel genes that could be useful for fiber type functional classification. CONCLUSIONS/SIGNIFICANCE: As gene expression analyses at the single fiber level significantly increased the resolution power, this innovative approach would allow a better understanding of the adaptive transcriptomic transitions occurring in myofibers under physiological and pathological conditions.
Lai, Yi-Pin; Wang, Liang-Bo; Wang, Wei-An; Lai, Liang-Chuan; Tsai, Mong-Hsun; Lu, Tzu-Pin; Chuang, Eric Y
With the advancement in high-throughput technologies, researchers can simultaneously investigate gene expression and copy number alteration (CNA) data from individual patients at a lower cost. Traditional analysis methods analyze each type of data individually and integrate their results using Venn diagrams. Challenges arise, however, when the results are irreproducible and inconsistent across multiple platforms. To address these issues, one possible approach is to concurrently analyze both gene expression profiling and CNAs in the same individual. We have developed an open-source R/Bioconductor package (iGC). Multiple input formats are supported and users can define their own criteria for identifying differentially expressed genes driven by CNAs. The analysis of two real microarray datasets demonstrated that the CNA-driven genes identified by the iGC package showed significantly higher Pearson correlation coefficients with their gene expression levels and copy numbers than those genes located in a genomic region with CNA. Compared with the Venn diagram approach, the iGC package showed better performance. The iGC package is effective and useful for identifying CNA-driven genes. By simultaneously considering both comparative genomic and transcriptomic data, it can provide better understanding of biological and medical questions. The iGC package's source code and manual are freely available at https://www.bioconductor.org/packages/release/bioc/html/iGC.html .
Johnstone, Daniel M.; Riveros, Carlos; Heidari, Moones; Graham, Ross M.; Trinder, Debbie; Berretta, Regina; Olynyk, John K.; Scott, Rodney J.; Moscato, Pablo; Milward, Elizabeth A.
While Illumina microarrays can be used successfully for detecting small gene expression changes due to their high degree of technical replicability, there is little information on how different normalization and differential expression analysis strategies affect outcomes. To evaluate this, we assessed concordance across gene lists generated by applying different combinations of normalization strategy and analytical approach to two Illumina datasets with modest expression changes. In addition to using traditional statistical approaches, we also tested an approach based on combinatorial optimization. We found that the choice of both normalization strategy and analytical approach considerably affected outcomes, in some cases leading to substantial differences in gene lists and subsequent pathway analysis results. Our findings suggest that important biological phenomena may be overlooked when there is a routine practice of using only one approach to investigate all microarray datasets. Analytical artefacts of this kind are likely to be especially relevant for datasets involving small fold changes, where inherent technical variation—if not adequately minimized by effective normalization—may overshadow true biological variation. This report provides some basic guidelines for optimizing outcomes when working with Illumina datasets involving small expression changes. PMID:27605185
Bobkowska, Katarzyna; Przyborski, Marek; Skorupka, Dariusz
This article shows how complex emotions are. This has been proven by the analysis of the changes that occur on the face. The authors present the problem of image analysis for the purpose of identifying emotions. In addition, they point out the importance of recording the phenomenon of the development of emotions on the human face with the use of high-speed cameras, which allows the detection of micro expression. The work that was prepared for this article was based on analyzing the parallax pair correlation coefficients for specific faces. In the article authors proposed to divide the facial image into 8 characteristic segments. With this approach, it was confirmed that at different moments of emotion the pace of expression and the maximum change characteristic of a particular emotion, for each part of the face is different.
Full Text Available This article shows how complex emotions are. This has been proven by the analysis of the changes that occur on the face. The authors present the problem of image analysis for the purpose of identifying emotions. In addition, they point out the importance of recording the phenomenon of the development of emotions on the human face with the use of high-speed cameras, which allows the detection of micro expression. The work that was prepared for this article was based on analyzing the parallax pair correlation coefficients for specific faces. In the article authors proposed to divide the facial image into 8 characteristic segments. With this approach, it was confirmed that at different moments of emotion the pace of expression and the maximum change characteristic of a particular emotion, for each part of the face is different.
Full Text Available Chemical inhibition of the proteasome has been previously found to effectively impair pollen germination and tube growth in vitro. However, the mediators of these effects at the molecular level are unknown. By performing 2DE proteomic analysis, 24 differentially expressed protein spots, representing 14 unique candidate proteins, were identified in the pollen of kiwifruit (Actinidia deliciosa germinated in the presence of the MG132 proteasome inhibitor. qPCR analysis revealed that 11 of these proteins are not up-regulated at the mRNA level, but are most likely stabilized by proteasome inhibition. These differentially expressed proteins are predicted to function in various pathways including energy and lipid metabolism, cell wall synthesis, protein synthesis/degradation and stress responses. In line with this evidence, the MG132-induced changes in the proteome were accompanied by an increase in ATP and ROS content and by an alteration in fatty acid composition.
Coll, Anna; Wilson, Mandy L; Gruden, Kristina; Peccoud, Jean
With the rapid advances in prediction tools for discovery of new promoters and their cis-elements, there is a need to improve plant expression methodologies in order to facilitate a high-throughput functional validation of these promoters in planta. The promoter-reporter analysis is an indispensible approach for characterization of plant promoters. It requires the design of complex plant expression vectors, which can be challenging. Here, we describe the use of a plant grammar implemented in GenoCAD that will allow the users to quickly design constructs for promoter analysis experiments but also for other in planta functional studies. The GenoCAD plant grammar includes a library of plant biological parts organized in structural categories to facilitate their use and management and a set of rules that guides the process of assembling these biological parts into large constructs.
Full Text Available Ying Wang,1,2,* Yuelong Huang,2,* Peng Xiang,3 Wei Tian2 1Department of Molecular Orthopaedics, Beijing Institute of Traumatology and Orthopaedics, 2Department of Spinal Surgery, Beijing Jishuitan Hospital, The Fourth Clinical Medical College of Peking University, 3Department of Urology, Peking University First Hospital, Beijing, People’s Republic of China *These authors contributed equally to this work Purpose: Osteosarcoma is the most prevalent primary bone tumor in children, adolescents, and older adults, typically presenting with poor survival outcomes. In recent years, ample evidence has shown that many long noncoding RNAs (lncRNAs have been aberrantly expressed in osteosarcoma, demonstrating their potential to serve as prognostic markers. In this study, we performed a meta-analysis on four lncRNAs (TUG1, UCA1, BCAR4, and HULC to systematically evaluate their prognostic value in osteosarcoma.Materials and methods: The eligible articles were systematically searched in PubMed, Web of Science, Embase, and Elsevier ScienceDirect (up to September 22, 2017, and one meta-analysis concerning the association between lncRNA expression and the overall survival (OS of osteosarcoma patients was performed. Survival outcomes were analyzed by OS. Subgroup analyses were performed.Results: A total of 1,361 patients with osteosarcoma and 12 lncRNAs from 16 articles were included in the study. Of the listed lncRNAs, the high expression of 10 lncRNAs indicated worse survival outcomes, while only two lncRNAs were shown to positively affect patients’ OS.Conclusion: This meta-analysis indicated that the abnormally expressed lncRNAs might significantly affect the survival of osteosarcoma patients. Combined use of these lncRNAs may serve as potential novel biomarkers for the indication of clinical outcomes of osteosarcoma patients as well as the selection of adjuvant chemotherapy strategies for clinical treatment of this disease. Keywords: lncRNAs, osteosarcoma
Arnade, Elizabeth Amalia
Emotions are thought to play a crucial role in food behavior. Non-rational emotional decision making may be credited as the reason why consumers select what, how, and when they choose to interact with a food product. In this research, three experiments were completed for the overall goal of understanding the usefulness and validity of selected emotional measurement tools, specifically emotion questionnaire ballots and facial expression analysis, as compared to conventional sensory methods in ...
Young, Gavin M.; Radhakrishnan, Vijayababu M.; Centuori, Sara M.; Gomes, Cecil J.; Martinez, Jesse D.
The 14-3-3 family is a group of intracellular proteins found in all eukaryotic organisms. Humans have seven isoforms that serve as scaffolds to promote interactions of regulatory phospho-proteins involved in many vital cellular processes and previous studies have shown that disturbances in native 14-3-3 levels can contribute significantly to the development of various cancers. DNA and RNA was extracted from frozen tissue samples collected by the Human Cooperative Tissue Network. RNA samples were reverse transcribed and subjected to qRT-PCR analysis using fluorescently labelled probes. Genomic DNA was treated with bisulfite and cloned into bacterial vectors for subsequent high-resolution sequencing. Mammalian NIH3T3 cells were transformed with 14-3-3 eta and Ras expression vectors synthesized from cDNA. Colonies were counted and transforming capability assessed after 21 days of growth. Cell lysates were analyzed by western blot to verify protein expression. Here we examined normal and cancerous 14-3-3 expression levels of all seven isoforms in a cohort of sporadic colorectal adenocarcinomas and in a group of tumors and their matched normals using qRT-PCR analysis. We found a statistically significant decrease in the levels of 14-3-3 sigma, eta, and zeta observed among adenocarcinomas compared to normal tissue. A parallel analysis of microarray data from the TCGA dataset confirmed that expression of sigma and eta were down-regulated in colon tumors. To explore the mechanisms behind 14-3-3 expression changes, we examined the methylation status of the sigma, eta, and zeta gene promoters in selected samples. Our data identified novel CpG methylation sites in the eta promoter consistent with epigenetic silencing of both 14-3-3 sigma and eta isoforms during colon tumorigenesis. Because epigenetic silencing is the hallmark of a tumor suppressor we tested eta in focus formation assays and found that it is capable of suppressing ras-induced transformation of NIH3T3 cells. To
Tulchinsky, Alexander Y; Johnson, Norman A; Watt, Ward B; Porter, Adam H
Postzygotic isolation between incipient species results from the accumulation of incompatibilities that arise as a consequence of genetic divergence. When phenotypes are determined by regulatory interactions, hybrid incompatibility can evolve even as a consequence of parallel adaptation in parental populations because interacting genes can produce the same phenotype through incompatible allelic combinations. We explore the evolutionary conditions that promote and constrain hybrid incompatibility in regulatory networks using a bioenergetic model (combining thermodynamics and kinetics) of transcriptional regulation, considering the bioenergetic basis of molecular interactions between transcription factors (TFs) and their binding sites. The bioenergetic parameters consider the free energy of formation of the bond between the TF and its binding site and the availability of TFs in the intracellular environment. Together these determine fractional occupancy of the TF on the promoter site, the degree of subsequent gene expression and in diploids, and the degree of dominance among allelic interactions. This results in a sigmoid genotype-phenotype map and fitness landscape, with the details of the shape determining the degree of bioenergetic evolutionary constraint on hybrid incompatibility. Using individual-based simulations, we subjected two allopatric populations to parallel directional or stabilizing selection. Misregulation of hybrid gene expression occurred under either type of selection, although it evolved faster under directional selection. Under directional selection, the extent of hybrid incompatibility increased with the slope of the genotype-phenotype map near the derived parental expression level. Under stabilizing selection, hybrid incompatibility arose from compensatory mutations and was greater when the bioenergetic properties of the interaction caused the space of nearly neutral genotypes around the stable expression level to be wide. F2's showed higher
Manijak, Mieszko P.; Nielsen, Henrik Bjørn
circumvented by instead matching gene expression signatures to signatures of other experiments. FINDINGS: To facilitate this we present the Functional Association Response by Overlap (FARO) server, that match input signatures to a compendium of 242 gene expression signatures, extracted from more than 1700...... Arabidopsis microarray experiments. CONCLUSIONS: Hereby we present a publicly available tool for robust characterization of Arabidopsis gene expression experiments which can point to similar experimental factors in other experiments. The server is available at http://www.cbs.dtu.dk/services/faro/....
Alahakoon, Thushari I; Zhang, Weiyi; Arbuckle, Susan; Zhang, Kewei; Lee, Vincent
To localize, quantify and compare angiogenic factors, vascular endothelial growth factor (VEGF), placental growth factor (PlGF), as well as their receptors fms-like tyrosine kinase receptor (Flt-1) and kinase insert domain receptor (KDR) in the placentas of normal pregnancy and complications of preeclampsia (PE), intrauterine fetal growth restriction (IUGR) and PE + IUGR. In a prospective cross-sectional case-control study, 30 pregnant women between 24-40 weeks of gestation, were recruited into four clinical groups. Representative placental samples were stained for VEGF, PlGF, Flt-1 and KDR. Analysis was performed using semiquantitative methods and digital image analysis. The overall VEGF and Flt-1 were strongly expressed and did not show any conclusive difference in the expression between study groups. PlGF and KDR were significantly reduced in expression in the placentas from pregnancies complicated by IUGR compared with normal and preeclamptic pregnancies. The lack of PlGF and KDR may be a cause for the development of IUGR and may explain the loss of vasculature and villous architecture in IUGR. Automated digital image analysis software is a viable alternative method to the manual reading of placental immunohistochemical staining. © 2018 Japan Society of Obstetrics and Gynecology.
Data Analysis and Visualization (IDAV) and the Department of Computer Science, University of California, Davis, One Shields Avenue, Davis CA 95616, USA,; nternational Research Training Group ``Visualization of Large and Unstructured Data Sets,' ' University of Kaiserslautern, Germany; Computational Research Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA; Genomics Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley CA 94720, USA; Life Sciences Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley CA 94720, USA,; Computer Science Division,University of California, Berkeley, CA, USA,; Computer Science Department, University of California, Irvine, CA, USA,; All authors are with the Berkeley Drosophila Transcription Network Project, Lawrence Berkeley National Laboratory,; Rubel, Oliver; Weber, Gunther H.; Huang, Min-Yu; Bethel, E. Wes; Biggin, Mark D.; Fowlkes, Charless C.; Hendriks, Cris L. Luengo; Keranen, Soile V. E.; Eisen, Michael B.; Knowles, David W.; Malik, Jitendra; Hagen, Hans; Hamann, Bernd
The recent development of methods for extracting precise measurements of spatial gene expression patterns from three-dimensional (3D) image data opens the way for new analyses of the complex gene regulatory networks controlling animal development. We present an integrated visualization and analysis framework that supports user-guided data clustering to aid exploration of these new complex datasets. The interplay of data visualization and clustering-based data classification leads to improved visualization and enables a more detailed analysis than previously possible. We discuss (i) integration of data clustering and visualization into one framework; (ii) application of data clustering to 3D gene expression data; (iii) evaluation of the number of clusters k in the context of 3D gene expression clustering; and (iv) improvement of overall analysis quality via dedicated post-processing of clustering results based on visualization. We discuss the use of this framework to objectively define spatial pattern boundaries and temporal profiles of genes and to analyze how mRNA patterns are controlled by their regulatory transcription factors.
Full Text Available Background. The objective of this study was to conduct a systematic review of literature evaluating human resistin expression as a diagnostic factor in osteoarthritis development and to quantify the overall diagnostic effect. Method. Relevant studies were identified and evaluated for quality through multiple search strategies. Studies analyzing resistin expression in the development of OA were eligible for inclusion. Data from eligible studies were extracted and included into the meta-analysis using a random-effects model. Results. Four case-control studies consisting of a total of 375 OA patients and 214 controls as well as three sex-stratified analyses composed of 53 males and 104 females were incorporated into our meta-analysis. Our results revealed that resistin levels were significantly higher in male OA subjects and OA patients overall. Country-stratified analysis yielded significantly different estimates in resistin levels between male OA subjects and female OA subjects in the Canadian subgroup but not among the French and USA subgroups. Based on the resistin levels in OA cases and controls, resistin levels were heightened in OA patients in the Dutch population. Conclusion. These results support the hypothesis that high expression of resistin represents a significant and reproducible marker of poor progression in OA patients, especially in males.
Full Text Available Objective Molecular cloning and bioinformatics analysis of annexin A2 (ANXA2 gene in sika deer antler tip were conducted. The role of ANXA2 gene in the growth and development of the antler were analyzed initially. Methods The reverse transcriptase polymerase chain reaction (RT-PCR was used to clone the cDNA sequence of the ANXA2 gene from antler tip of sika deer (Cervus Nippon hortulorum and the bioinformatics methods were applied to analyze the amino acid sequence of Anxa2 protein. The mRNA expression levels of the ANXA2 gene in different growth stages were examined by real time reverse transcriptase polymerase chain reaction (real time RT-PCR. Results The nucleotide sequence analysis revealed an open reading frame of 1,020 bp encoding 339 amino acids long protein of calculated molecular weight 38.6 kDa and isoelectric point 6.09. Homologous sequence alignment and phylogenetic analysis indicated that the Anxa2 mature protein of sika deer had the closest genetic distance with Cervus elaphus and Bos mutus. Real time RT-PCR results showed that the gene had differential expression levels in different growth stages, and the expression level of the ANXA2 gene was the highest at metaphase (rapid growing period. Conclusion ANXA2 gene may promote the cell proliferation, and the finding suggested Anxa2 as an important candidate for regulating the growth and development of deer antler.
Full Text Available Oryza meyeriana (O. meyeriana, with a GG genome type (2n = 24, accumulated plentiful excellent characteristics with respect to resistance to many diseases such as rice shade and blast, even immunity to bacterial blight. It is very important to know if the diseases-resistant genes exist and express in this wild rice under native conditions. However, limited genomic or transcriptomic data of O. meyeriana are currently available. In this study, we present the first comprehensive characterization of the O. meyeriana transcriptome using RNA-seq and obtained 185,323 contigs with an average length of 1,692 bp and an N50 of 2,391 bp. Through differential expression analysis, it was found that there were most tissue-specifically expressed genes in roots, and next to stems and leaves. By similarity search against protein databases, 146,450 had at least a significant alignment to existed gene models. Comparison with the Oryza sativa (japonica-type Nipponbare and indica-type 93-11 genomes revealed that 13% of the O. meyeriana contigs had not been detected in O. sativa. Many diseases-resistant genes, such as bacterial blight resistant, blast resistant, rust resistant, fusarium resistant, cyst nematode resistant and downy mildew gene, were mined from the transcriptomic database. There are two kinds of rice bacterial blight-resistant genes (Xa1 and Xa26 differentially or specifically expressed in O. meyeriana. The 4 Xa1 contigs were all only expressed in root, while three of Xa26 contigs have the highest expression level in leaves, two of Xa26 contigs have the highest expression profile in stems and one of Xa26 contigs was expressed dominantly in roots. The transcriptomic database of O. meyeriana has been constructed and many diseases-resistant genes were found to express under native condition, which provides a foundation for future discovery of a number of novel genes and provides a basis for studying the molecular mechanisms associated with disease
Bracht, Thilo; Schweinsberg, Vincent; Trippler, Martin; Kohl, Michael; Ahrens, Maike; Padden, Juliet; Naboulsi, Wael; Barkovits, Katalin; Megger, Dominik A; Eisenacher, Martin; Borchers, Christoph H; Schlaak, Jörg F; Hoffmann, Andreas-Claudius; Weber, Frank; Baba, Hideo A; Meyer, Helmut E; Sitek, Barbara
Hepatic fibrosis and cirrhosis are major health problems worldwide. Until now, highly invasive biopsy remains the diagnostic gold standard despite many disadvantages. To develop noninvasive diagnostic assays for the assessment of liver fibrosis, it is urgently necessary to identify molecules that are robustly expressed in association with the disease. We analyzed biopsied tissue samples from 95 patients with HBV/HCV-associated hepatic fibrosis using three different quantification methods. We performed a label-free proteomics discovery study to identify novel disease-associated proteins using a subset of the cohort (n = 27). Subsequently, gene expression data from all available clinical samples were analyzed (n = 77). Finally, we performed a targeted proteomics approach, multiple reaction monitoring (MRM), to verify the disease-associated expression in samples independent from the discovery approach (n = 68). We identified fibulin-5 (FBLN5) as a novel protein expressed in relation to hepatic fibrosis. Furthermore, we confirmed the altered expression of microfibril-associated glycoprotein 4 (MFAP4), lumican (LUM), and collagen alpha-1(XIV) chain (COL14A1) in association to hepatic fibrosis. To our knowledge, no tissue-based quantitative proteomics study for hepatic fibrosis has been performed using a cohort of comparable size. By this means, we add substantial evidence for the disease-related expression of the proteins examined in this study.
Chen, Hancai; Bodulovic, Greg; Hall, Prudence J; Moore, Andy; Higgins, Thomas J V; Djordjevic, Michael A; Rolfe, Barry G
Seeds of genetically modified (GM) peas (Pisum sativum L.) expressing the gene for alpha-amylase inhibitor-1 (alphaAI1) from the common bean (Phaseolus vulgaris L. cv. Tendergreen) exhibit resistance to the pea weevil (Bruchus pisorum). A proteomic analysis was carried out to compare seeds from GM pea lines expressing the bean alphaAI1 protein and the corresponding alphaAI1-free segregating lines and non-GM parental line to identify unintended alterations to the proteome of GM peas due to the introduction of the gene for alphaAI1. Proteomic analysis showed that in addition to the presence of alphaAI1, 33 other proteins were differentially accumulated in the alphaAI1-expressing GM lines compared with their non-GM parental line and these were grouped into five expression classes. Among these 33 proteins, only three were found to be associated with the expression of alphaAI1 in the GM pea lines. The accumulation of the remaining 30 proteins appears to be associated with Agrobacterium-mediated transformation events. Sixteen proteins were identified after MALDI-TOF-TOF analysis. About 56% of the identified proteins with altered accumulation in the GM pea were storage proteins including legumin, vicilin or convicilin, phaseolin, cupin and valosin-containing protein. Two proteins were uniquely expressed in the alphaAI1-expressing GM lines and one new protein was present in both the alphaAI1-expressing GM lines and their alphaAI1-free segregating lines, suggesting that both transgenesis and transformation events led to demonstrable changes in the proteomes of the GM lines tested.
Full Text Available Yun Ye,1,2,* Su-Liang Li,2,* Yao Wang,2 Yang Yao,2 Juan Wang,1 Yue-Yun Ma,1 Xiao-Ke Hao1 1Department of Laboratory Medicine, Xijing Hospital, Fourth Military Medical University, 2Department of Clinical Laboratory, The First Affiliated Hospital of Xi’an Medical University, Xi’an, Shaanxi, People’s Republic of China *These authors contributed equally to this work Background: There are a number of studies which show that expression of CD147 is increased significantly in prostate cancer (PCa. However, conflicting conclusions have also been reported by other researchers lately. In order to arrive at a clear conclusion, a meta-analysis of eligible studies was conducted.Materials and methods: We searched PubMed, MEDLINE, Cochrane Library, and the China National Knowledge Infrastructure databases to identify all the published case–control studies on the relationship between the expression of CD147 and PCa until February 2016. In the end, a total of 930 patients in eight studies were included in the meta-analysis.Results: CD147 expression in the PCa patients increased significantly (odds ratio [OR], 4.65; 95% confidence interval [CI], 3.52–6.14; Z=10.79; P<0.05, but there was obvious heterogeneity between studies (I2=92.9%, P<0.05. Subgroup analysis showed that positive expression of CD147 was associated with PCa among the Asian population (OR, 21.01; 95% CI, 12.88–34.28; Z=12.19; P<0.05. Furthermore, it was significantly related to TNM stage (OR, 0.24; 95% CI, 0.17–0.35; Z=7.74; P<0.05, Gleason score (OR, 0.41; 95% CI, 0.31–0.56; Z=5.62; P<0.05, differentiation grade (OR, 0.27; 95% CI, 0.13–0.56; Z=3.47; P<0.05, and pretreatment serum prostate-specific antigen level (OR, 0.07; 95% CI, 0.03–0.16; Z=6.47; P<0.05.Conclusion: Positive expression of CD147 was related to PCa, significant heterogeneity was not found between Asian studies, and the result became more significant. The positive expression of CD147 was significantly related to
Zhang, Fengbo; Ma, Xiumin; Zhu, Yuejie; Wang, Hongying; Liu, Xianfei; Zhu, Min; Ma, Haimei; Wen, Hao; Fan, Haining; Ding, Jianbing
Objective: This study was to clone, identify and analyze the characteristics of egG1Y162 gene from Echinococcus granulosus. Methods: Genomic DNA and total RNAs were extracted from four different developmental stages of protoscolex, germinal layer, adult and egg of Echinococcus granulosus, respectively. Fluorescent quantitative PCR was used for analyzing the expression of egG1Y162 gene. Prokaryotic expression plasmid of pET41a-EgG1Y162 was constructed to express recombinant His-EgG1Y162 antigen. Western blot analysis was performed to detect antigenicity of EgG1Y162 antigen. Gene sequence, amino acid alignment and phylogenetic tree of EgG1Y162 were analyzed by BLAST, online Spidey and MEGA4 software, respectively. Results: EgG1Y162 gene was expressed in four developmental stages of Echinococcus granulosus. And, egG1Y162 gene expression was the highest in the adult stage, with the relative value of 19.526, significantly higher than other three stages. Additionally, Western blot analysis revealed that EgG1Y162 recombinant protein had good reaction with serum samples from Echinococcus granulosus infected human and dog. Moreover, EgG1Y162 antigen was phylogenetically closest to EmY162 antigen, with the similarity over 90%. Conclusion: Our study identified EgG1Y162 antigen in Echinococcus granulosus for the first time. EgG1Y162 antigen had a high similarity with EmY162 antigen, with the genetic differences mainly existing in the intron region. And, EgG1Y162 recombinant protein showed good antigenicity. PMID:25337206
Zhang, Fengbo; Ma, Xiumin; Zhu, Yuejie; Wang, Hongying; Liu, Xianfei; Zhu, Min; Ma, Haimei; Wen, Hao; Fan, Haining; Ding, Jianbing
This study was to clone, identify and analyze the characteristics of egG1Y162 gene from Echinococcus granulosus. Genomic DNA and total RNAs were extracted from four different developmental stages of protoscolex, germinal layer, adult and egg of Echinococcus granulosus, respectively. Fluorescent quantitative PCR was used for analyzing the expression of egG1Y162 gene. Prokaryotic expression plasmid of pET41a-EgG1Y162 was constructed to express recombinant His-EgG1Y162 antigen. Western blot analysis was performed to detect antigenicity of EgG1Y162 antigen. Gene sequence, amino acid alignment and phylogenetic tree of EgG1Y162 were analyzed by BLAST, online Spidey and MEGA4 software, respectively. EgG1Y162 gene was expressed in four developmental stages of Echinococcus granulosus. And, egG1Y162 gene expression was the highest in the adult stage, with the relative value of 19.526, significantly higher than other three stages. Additionally, Western blot analysis revealed that EgG1Y162 recombinant protein had good reaction with serum samples from Echinococcus granulosus infected human and dog. Moreover, EgG1Y162 antigen was phylogenetically closest to EmY162 antigen, with the similarity over 90%. Our study identified EgG1Y162 antigen in Echinococcus granulosus for the first time. EgG1Y162 antigen had a high similarity with EmY162 antigen, with the genetic differences mainly existing in the intron region. And, EgG1Y162 recombinant protein showed good antigenicity.
Full Text Available Yan73, a teinturier (dyer grape variety in China, is one of the few Vitis vinifera cultivars with red-coloured berry flesh. To examine the tissue-specific expression of genes associated with berry colour in Yan73, we analysed the differential accumulation of anthocyanins in the skin and flesh tissues of two red-skinned grape varieties with either red (Yan73 or white flesh (Muscat Hamburg based on HPLC-MS analysis, as well as the differential expression of 18 anthocyanin biosynthesis genes in both varieties by quantitative RT-PCR. The results revealed that the transcripts of GST, OMT, AM3, CHS3, UFGT, MYBA1, F3′5′H, F3H1 and LDOX were barely detectable in the white flesh of Muscat Hamburg. In particular, GST, OMT, AM3, CHS3 and F3H1 showed approximately 50-fold downregulation in the white flesh of Muscat Hamburg compared to the red flesh of Yan73. A correlation analysis between the accumulation of different types of anthocyanins and gene expression indicated that the cumulative expression of GST, F3′5′H, LDOX and MYBA1 was more closely associated with the acylated anthocyanins and the 3′5′-OH anthocyanins, while OMT and AM3 were more closely associated with the total anthocyanins and methoxylated anthocyanins. Therefore, the transcripts of OMT, AM3, GST, F3′5′H, LDOX and MYBA1 explained most of the variation in the amount and composition of anthocyanins in skin and flesh of Yan73. The data suggest that the specific localization of anthocyanins in the flesh tissue of Yan73 is most likely due to the tissue-specific expression of OMT, AM3, GST, F3′5′H, LDOX and MYBA1 in the flesh.
Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin
This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls) were downloaded from Gene Expression Omnibus database. Characteristic genes were identified using metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance was defined as the average gene significance of all genes used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subject to functional annotation and pathway enrichment analysis using Database for Annotation Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in totally enriched in 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs sanguinarine and papaverine were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA. © 2017 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.
Wang, Yumei; Yin, Xiaoling; Yang, Fang
Sepsis is an inflammatory-related disease, and severe sepsis would induce multiorgan dysfunction, which is the most common cause of death of patients in noncoronary intensive care units. Progression of novel therapeutic strategies has proven to be of little impact on the mortality of severe sepsis, and unfortunately, its mechanisms still remain poorly understood. In this study, we analyzed gene expression profiles of severe sepsis with failure of lung, kidney, and liver for the identification of potential biomarkers. We first downloaded the gene expression profiles from the Gene Expression Omnibus and performed preprocessing of raw microarray data sets and identification of differential expression genes (DEGs) through the R programming software; then, significantly enriched functions of DEGs in lung, kidney, and liver failure sepsis samples were obtained from the Database for Annotation, Visualization, and Integrated Discovery; finally, protein-protein interaction network was constructed for DEGs based on the STRING database, and network modules were also obtained through the MCODE cluster method. As a result, lung failure sepsis has the highest number of DEGs of 859, whereas the number of DEGs in kidney and liver failure sepsis samples is 178 and 175, respectively. In addition, 17 overlaps were obtained among the three lists of DEGs. Biological processes related to immune and inflammatory response were found to be significantly enriched in DEGs. Network and module analysis identified four gene clusters in which all or most of genes were upregulated. The expression changes of Icam1 and Socs3 were further validated through quantitative PCR analysis. This study should shed light on the development of sepsis and provide potential therapeutic targets for sepsis-induced multiorgan failure.
Full Text Available Alcohol dehydrogenases (ADH, encoded by multigene family in plants, play a critical role in plant growth, development, adaptation, fruit ripening and aroma production. Thirteen ADH genes were identified in melon genome, including 12 ADHs and one formaldehyde dehydrogenease (FDH, designated CmADH1-12 and CmFDH1, in which CmADH1 and CmADH2 have been isolated in Cantaloupe. ADH genes shared a lower identity with each other at the protein level and had different intron-exon structure at nucleotide level. No typical signal peptides were found in all CmADHs, and CmADH proteins might locate in the cytoplasm. The phylogenetic tree revealed that 13 ADH genes were divided into 3 groups respectively, namely long-, medium- and short-chain ADH subfamily, and CmADH1,3-11, which belongs to the medium-chain ADH subfamily, fell into 6 medium-chain ADH subgroups. CmADH12 may belong to the long-chain ADH subfamily, while CmFDH1 may be a Class III ADH and serve as an ancestral ADH in melon. Expression profiling revealed that CmADH1, CmADH2, CmADH10 and CmFDH1 were moderately or strongly expressed in different vegetative tissues and fruit at medium and late developmental stages, while CmADH8 and CmADH12 were highly expressed in fruit after 20 days. CmADH3 showed preferential expression in young tissues. CmADH4 only had slight expression in root. Promoter analysis revealed several motifs of CmADH genes involved in the gene expression modulated by various hormones, and the response pattern of CmADH genes to ABA, IAA and ethylene were different. These CmADHs were divided into ethylene-sensitive and –insensitive groups, and the functions of CmADHs were discussed.
Yu, Ming-Jiun; Miller, R Lance; Uawithya, Panapat; Rinschen, Markus M; Khositseth, Sookkasem; Braucht, Drew W W; Chou, Chung-Lin; Pisitkun, Trairak; Nelson, Raoul D; Knepper, Mark A
We used a systems biology-based approach to investigate the basis of cell-specific expression of the water channel aquaporin-2 (AQP2) in the renal collecting duct. Computational analysis of the 5'-flanking region of the AQP2 gene (Genomatix) revealed 2 conserved clusters of putative transcriptional regulator (TR) binding elements (BEs) centered at -513 bp (corresponding to the SF1, NFAT, and FKHD TR families) and -224 bp (corresponding to the AP2, SRF, CREB, GATA, and HOX TR families). Three other conserved motifs corresponded to the ETS, EBOX, and RXR TR families. To identify TRs that potentially bind to these BEs, we carried out mRNA profiling (Affymetrix) in mouse mpkCCDc14 collecting duct cells, revealing expression of 25 TRs that are also expressed in native inner medullary collecting duct. One showed a significant positive correlation with AQP2 mRNA abundance among mpkCCD subclones (Ets1), and 2 showed a significant negative correlation (Elf1 and an orphan nuclear receptor Nr1h2). Transcriptomic profiling in native proximal tubules (PT), medullary thick ascending limbs (MTAL), and IMCDs from kidney identified 14 TRs (including Ets1 and HoxD3) expressed in the IMCD but not PT or MTAL (candidate AQP2 enhancer roles), and 5 TRs (including HoxA5, HoxA9 and HoxA10) expressed in PT and MTAL but not in IMCD (candidate AQP2 repressor roles). In luciferase reporter assays, overexpression of 3 ETS family TRs transactivated the mouse proximal AQP2 promoter. The results implicate ETS family TRs in cell-specific expression of AQP2 and point to HOX, RXR, CREB and GATA family TRs as playing likely additional roles.
Borlawsky Tara B
Full Text Available Abstract Background Chronic lymphocytic leukemia (CLL is the most common adult leukemia. It is a highly heterogeneous disease, and can be divided roughly into indolent and progressive stages based on classic clinical markers. Immunoglobin heavy chain variable region (IgVH mutational status was found to be associated with patient survival outcome, and biomarkers linked to the IgVH status has been a focus in the CLL prognosis research field. However, biomarkers highly correlated with IgVH mutational status which can accurately predict the survival outcome are yet to be discovered. Results In this paper, we investigate the use of gene co-expression network analysis to identify potential biomarkers for CLL. Specifically we focused on the co-expression network involving ZAP70, a well characterized biomarker for CLL. We selected 23 microarray datasets corresponding to multiple types of cancer from the Gene Expression Omnibus (GEO and used the frequent network mining algorithm CODENSE to identify highly connected gene co-expression networks spanning the entire genome, then evaluated the genes in the co-expression network in which ZAP70 is involved. We then applied a set of feature selection methods to further select genes which are capable of predicting IgVH mutation status from the ZAP70 co-expression network. Conclusions We have identified a set of genes that are potential CLL prognostic biomarkers IL2RB, CD8A, CD247, LAG3 and KLRK1, which can predict CLL patient IgVH mutational status with high accuracies. Their prognostic capabilities were cross-validated by applying these biomarker candidates to classify patients into different outcome groups using a CLL microarray datasets with clinical information.
Li, Jieqin; Fan, Feifei; Wang, Lihua; Zhan, Qiuwen; Wu, Peijin; Du, Junli; Yang, Xiaocui; Liu, Yanlong
Cinnamoyl-CoA reductase (CCR) is the first enzyme in the monolignol-specific branch of the lignin biosynthetic pathway. In this research, three sorghum CCR genes including SbCCR1, SbCCR2-1 and SbCCR2-2 were cloned and characterized. Analyses of the structure and phylogeny of the three CCR genes showed evolutionary conservation of the functional domains and divergence of function. Transient expression assays in Nicotiana benthamiana leaves demonstrated that the three CCR proteins were localized in the cytoplasm. The expression analysis showed that the three CCR genes were induced by drought. But in 48 h, the expression levels of SbCCR1 and SbCCR2-2 did not differ between CK and the drought treatment; while the expression level of SbCCR2-1 in the drought treatment was higher than in CK. The expression of the SbCCR1 and SbCCR2-1 genes was not induced by sorghum aphid [Melanaphis sacchari (Zehntner)] attack, but SbCCR2-2 was significantly induced by sorghum aphid attack. It is suggested that SbCCR2-2 is involved in the process of pest defense. Absolute quantitative real-time PCR revealed that the three CCR genes were mainly expressed in lignin deposition organs. The gene copy number of SbCCR1 was significantly higher than those of SbCCR2-1 and SbCCR2-2 in the tested tissues, especially in stem. The results provide new insight into the functions of the three CCR genes in sorghum.
Peng, Jinbiao; Han, Hongxiao; Hong, Yang; Wang, Yan; Guo, Fanji; Shi, Yaojun; Fu, Zhiqiang; Liu, Jinming; Cheng, Guofeng; Lin, Jiaojiao
The present study was intend to clone and express the cDNA encoding Cyclophilin B (CyPB) of Schistosoma japonicum, its preliminary biological function and further immunoprotective effect against schistosome infection in mice. RT-PCR technique was applied to amplify a full-length cDNA encoding protein Cyclophilin B (Sj CyPB) from schistosomula cDNA. The expression profiles of Sj CyPB were determined by Real-time PCR using the template cDNAs isolated from 7, 13, 18, 23, 32 and 42 days parasites. The cDNA containing the Open Reading Frame of CyPB was then subcloned into a pGEX-6P-1 vector and transformed into competent Escherichia coli BL21 for expressing. The recombinant protein was renaturated, purified and its antigenicity were detected by Western blotting, and the immunoprotective effect induced by recombinant Sj CyPB was evaluated in Balb/C mice. The cDNA containing the ORF of Sj CyPB was cloned with the length of 672 base pairs, encoding 223 amino acids. Real-time PCR analysis revealed that the gene had the highest expression in 18-day schistosomula, suggesting that Sj CyPB was schistosomula differentially expressed gene. The recombinant protein showed a good antigenicity detected by Western blotting. Animal experiment indicated that the vaccination of recombinant CyPB protein in mice led to 31.5% worm and 41.01% liver egg burden reduction, respectively, compared with those of the control. A full-length cDNA differentially expressed in schistosomula was obtained. The recombinant Sj CyPB protein could induce partial protection against schistosome infection.
Lenzner, Steffen; Prietz, Sandra; Feil, Silke; Nuber, Ulrike A; Ropers, H-Hilger; Berger, Wolfgang
Mutations in the NDP gene give rise to a variety of eye diseases, including classic Norrie disease (ND), X-linked exudative vitreoretinopathy (EVRX), retinal telangiectasis (Coats disease), and advanced retinopathy of prematurity (ROP). The gene product is a cystine-knot-containing extracellular signaling molecule of unknown function. In the current study, gene expression was determined in a mouse model of ND, to unravel di