gene group analysis: Topics by WorldWideScience.org

Sample records for gene group analysis

Structural analysis of the RH-like blood group gene products in nonhuman primates

Energy Technology Data Exchange (ETDEWEB)

Salvignol, I. [Centre Regional de Transfusion Sanguine, Toulouse (France)]; Calvas, P.; Blancher, A. [Universitaire d'Immunogenetique moleculaire, Toulouse (France)]; Socha, W.W. [University Medical Center, New York, NY (United States)]; Colin, Y.; Le Van Kim, C.; Bailly, P.; Cartron, J.P. [Institut National de la Transfusion Sanguine, Paris (France)]; Ruffie, J.; Blancher, A. [College de France, Paris (France)]

1995-03-01

Rh-related transcripts present in bone marrow samples from several species of nonhuman primates (chimpanzee, gorilla, gibbon, crab-eating macaque) have been amplified by RT-polymerase chain reaction using primers deduced from the sequence of human RH genes. Nucleotide sequence analysis of the nonhuman transcripts revealed a high degree of similarity to human blood group Rh sequences, suggesting strong conservation of the RH genes throughout evolution. Full-length transcripts, potentially encoding 417 amino acid long proteins homologous to Rh polypeptides, were characterized, as well as mRNA isoforms which harbored nucleotide deletions or insertions and potentially encode truncated proteins. Proteins of M_r 30,000-40,000, immunologically related to human Rh proteins, were detected by western blot analysis with antipeptide antibodies, indicating that Rh-like transcripts are translated into membrane proteins. Comparison of human and nonhuman protein sequences was pivotal in clarifying the molecular basis of the blood group C/c polymorphism, showing that only the Pro103Ser substitution was correlated with C/c polymorphism. In addition, it was shown that a proline residue at position 102 was critical in the expression of C and c epitopes, most likely by providing an appropriate conformation of Rh polypeptides. From these data a phylogenetic reconstruction of the RH locus evolution has been calculated, from which an unrooted phylogenetic tree could be proposed, indicating that African ape Rh-like genes would be closer to the human RhD gene than to the human RhCE gene. 55 refs., 4 figs., 1 tab.
Conserved genomic organisation of Group B Sox genes in insects.

Directory of Open Access Journals (Sweden)

Woerfel Gertrud

2005-05-01

Background: Sox domain containing genes are important metazoan transcriptional regulators implicated in a wide range of developmental processes. The vertebrate B subgroup contains the Sox1, Sox2 and Sox3 genes that have early functions in neural development. Previous studies show that Drosophila Group B genes have been functionally conserved since they play essential roles in early neural specification and mutations in the Drosophila Dichaete and SoxN genes can be rescued with mammalian Sox genes. Despite their importance, the extent and organisation of the Group B family in Drosophila has not been fully characterised, an important step in using Drosophila to examine conserved aspects of Group B Sox gene function. Results: We have used directed cDNA sequencing along with the output from the publicly-available genome sequencing projects to examine the structure of Group B Sox domain genes in Drosophila melanogaster, Drosophila pseudoobscura, Anopheles gambiae and Apis mellifera. All of the insect genomes contain four genes encoding Group B proteins, two of which are intronless, as is the case with vertebrate group B genes. As has been previously reported and unusually for Group B genes, two of the insect group B genes, Sox21a and Sox21b, contain introns within their DNA-binding domains. We find that the highly unusual multi-exon structure of the Sox21b gene is common to the insects. In addition, we find that three of the group B Sox genes are organised in a linked cluster in the insect genomes. By in situ hybridisation we show that the pattern of expression of each of the four group B genes during embryogenesis is conserved between D. melanogaster and D. pseudoobscura. Conclusion: The DNA-binding domain sequences and genomic organisation of the group B genes have been conserved over 300 My of evolution since the last common ancestor of the Hymenoptera and the Diptera. Our analysis suggests insects have two Group B1 genes, SoxN and
Screening strategies for a highly polymorphic gene: DHPLC analysis of the Fanconi anemia group A gene.

Science.gov (United States)

Rischewski, J; Schneppenheim, R

2001-01-30

Patients with Fanconi anemia (Fanc) are at risk of developing leukemia. Mutations of the group A gene (FancA) are most common. A multitude of polymorphisms and mutations within the 43 exons of the gene are described. To examine the role of heterozygosity as a risk factor for malignancies, a partially automatized screening method to identify aberrations was needed. We report on our experience with DHPLC (WAVE (Transgenomic)). PCR amplification of all 43 exons from one individual was performed on one microtiter plate on a gradient thermocycler. DHPLC analysis conditions were established via melting curves, prediction software, and test runs with aberrant samples. PCR products were analyzed twice: native, and after adding a WT-PCR product. Retention patterns were compared with previously identified polymorphic PCR products or mutants. We have defined the mutation screening conditions for all 43 exons of FancA using DHPLC. So far, 40 different sequence variations have been detected in more than 100 individuals. The native analysis identifies heterozygous individuals, and the second run detects homozygous aberrations. Retention patterns are specific for the underlying sequence aberration, thus reducing sequencing demand and costs. DHPLC is a valuable tool for reproducible recognition of known sequence aberrations and screening for unknown mutations in the highly polymorphic FancA gene.
Expansion and Functional Divergence of AP2 Group Genes in Spermatophytes Determined by Molecular Evolution and Arabidopsis Mutant Analysis

Directory of Open Access Journals (Sweden)

Pengkai Wang

2016-09-01

The APETALA2 (AP2) genes represent the AP2 group within a large group of DNA-binding proteins called AP2/EREBP. The AP2 gene is functional and necessary for flower development, stem cell maintenance, and seed development, whereas the other members of the AP2 group redundantly affect flowering time. Here we study the phylogeny of AP2 group genes in spermatophytes. Spermatophyte AP2 group genes can be classified into AP2 and TOE types and six clades. We found that the AP2 group homologs in gymnosperms belong to the AP2 type, whereas TOE types are absent, which indicates that the AP2 type genes are more ancient and that the TOE type split from the AP2 type and lost the major function. In Brassicaceae, the expansion of the AP2 and TOE types brought the number of AP2 group genes up to six. Purifying selection appears to have been the primary driving force of spermatophyte AP2 group evolution, although positive selection occurred in the AP2 clade. The transition from exon to intron of AtAP2 in an Arabidopsis mutant leads to the loss of gene function, and the same situation was found in AtTOE2. Combining this evolutionary analysis with published research, the results suggest that typical AP2 group genes may have first appeared in gymnosperms and diverged in angiosperms, followed by expansion of group members and functional differentiation. In angiosperms, AP2 genes (AP2 clade) inherited key functions from ancestors, whereas the other genes of the AP2 group lost most functions and retained only the control of flowering time. In this study, the phylogeny of AP2 group genes in spermatophytes was analyzed, providing evidence for research on the functional evolution of the AP2 group.
A comparative gene analysis with rice identified orthologous group II HKT genes and their association with Na(+) concentration in bread wheat.

Science.gov (United States)

Ariyarathna, H A Chandima K; Oldach, Klaus H; Francki, Michael G

2016-01-19

Although the HKT transporter genes ascertain some of the key determinants of crop salt tolerance mechanisms, the diversity and functional role of group II HKT genes are not clearly understood in bread wheat. The advanced knowledge on rice HKT and whole genome sequence was, therefore, used in comparative gene analysis to identify orthologous wheat group II HKT genes and their role in trait variation under different saline environments. The four group II HKTs in rice identified two orthologous gene families from bread wheat, including the known TaHKT2;1 gene family and a new distinctly different gene family designated as TaHKT2;2. A single copy of TaHKT2;2 was found on each homeologous chromosome arm 7AL, 7BL and 7DL and each gene was expressed in leaf blade, sheath and root tissues under non-stressed and at 200 mM salt stressed conditions. The proteins encoded by genes of the TaHKT2;2 family revealed more than 93% amino acid sequence identity but ≤52% amino acid identity compared to the proteins encoded by TaHKT2;1 family. Specifically, variations in known critical domains predicted functional differences between the two protein families. Similar to orthologous rice genes on chromosome 6L, TaHKT2;1 and TaHKT2;2 genes were located approximately 3 kb apart on wheat chromosomes 7AL, 7BL and 7DL, forming a static syntenic block in the two species. The chromosomal region on 7AL containing TaHKT2;1 7AL-1 co-located with QTL for shoot Na(+) concentration and yield in some saline environments. The differences in copy number, genes sequences and encoded proteins between TaHKT2;2 homeologous genes and other group II HKT gene families within and across species likely reflect functional diversity for ion selectivity and transport in plants. Evidence indicated that neither TaHKT2;2 nor TaHKT2;1 were associated with primary root Na(+) uptake but TaHKT2;1 may be associated with trait variation for Na(+) exclusion and yield in some but not all saline environments.
TXTGate: profiling gene groups with text-based information

DEFF Research Database (Denmark)

Glenisson, P.; Coessens, B.; Van Vooren, S.

2004-01-01

We implemented a framework called TXTGate that combines literature indices of selected public biological resources in a flexible text-mining system designed towards the analysis of groups of genes. By means of tailored vocabularies, term- as well as gene-centric views are offered on selected textual...
GeneLab Analysis Working Group Kick-Off Meeting

Science.gov (United States)

Costes, Sylvain V.

2018-01-01

Goals to achieve for GeneLab AWG - GL vision - Review of GeneLab AWG charter Timeline and milestones for 2018 Logistics - Monthly Meeting - Workshop - Internship - ASGSR Introduction of team leads and goals of each group Introduction of all members Q/A Three-tier Client Strategy to Democratize Data Physiological changes, pathway enrichment, differential expression, normalization, processing metadata, reproducibility, Data federation/integration with heterogeneous bioinformatics external databases The GLDS currently serves over 100 omics investigations to the biomedical community via open access. In order to expand the scope of metadata record searches via the GLDS, we designed a metadata warehouse that collects and updates metadata records from external systems housing similar data. To demonstrate the capabilities of federated search and retrieval of these data, we imported metadata records from three open-access data systems into the GLDS metadata warehouse: NCBI's Gene Expression Omnibus (GEO), EBI's PRoteomics IDEntifications (PRIDE) repository, and the Metagenomics Analysis server (MG-RAST). Each of these systems defines metadata for omics data sets differently. One solution to bridge such differences is to employ a common object model (COM) to which each system's representation of metadata can be mapped. Warehoused metadata records are then transformed at ETL to this single, common representation. Queries generated via the GLDS are then executed against the warehouse, and matching records are shown in the COM representation (Fig. 1). While this approach is relatively straightforward to implement, the volume of the data in the omics domain presents challenges in dealing with latency and currency of records. Furthermore, the lack of a coordinated has been federated data search for and retrieval of these kinds of data across other open-access systems, so that users are able to conduct biological meta-investigations using data from a variety of sources. Such meta
Genetic analysis of the porcine group B rotavirus NSP2 gene from wild-type Brazilian strains

Directory of Open Access Journals (Sweden)

K.C. Médici

2010-01-01

Group B rotaviruses (RV-B) were first identified in piglet feces and were later associated with diarrhea in humans, cattle, lambs, and rats. In human beings, the virus was only described in China, India, and Bangladesh, especially infecting adults. Only a few studies concerning molecular analysis of the RV-B NSP2 gene have been conducted, and porcine RV-B has not been characterized. In the present study, three porcine wild-type RV-B strains from piglet stool samples collected from Brazilian pig herds were used for analysis. PAGE results were inconclusive for those samples, but specific amplicons of the RV-B NSP2 gene (segment 8) were obtained in a semi-nested PCR assay. The three porcine RV-B strains showed the highest nucleotide identity with the human WH1 strain, and the alignments with other published sequences resulted in three groups of strains divided according to host species. The group of human strains showed 92.4 to 99.7% nucleotide identity, while the porcine strains of the Brazilian RV-B group showed 90.4 to 91.8% identity to each other. The identity of the Brazilian porcine RV-B strains with outer sequences consisting of group A and C rotaviruses was only 35.3 to 38.8%. A dendrogram was also constructed to group the strains into clusters according to host species: human, rat, and a distinct third cluster consisting exclusively of the Brazilian porcine RV-B strains. This is the first study of the porcine RV-B NSP2 gene that contributes to the partial characterization of this virus and demonstrates the relationship among RV-B strains from different host species.
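The identity ranges quoted above (for example 90.4 to 91.8% among the porcine strains) are pairwise percentages over aligned sequences. A minimal sketch of that arithmetic follows, assuming pre-aligned, equal-length sequences with "-" marking gaps and identity counted only over ungapped columns; the abstract does not say which convention or software was used, and the function names and toy sequences are hypothetical.

```python
from itertools import combinations


def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent identity over aligned columns where neither sequence has a gap.

    Assumption: both strings come from the same alignment, so they have
    equal length and '-' marks a gap.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # skip columns with a gap in either sequence
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def identity_range(aligned: dict) -> tuple:
    """Lowest and highest pairwise identity within a group of aligned strains."""
    values = [
        percent_identity(aligned[x], aligned[y])
        for x, y in combinations(sorted(aligned), 2)
    ]
    return min(values), max(values)


# Toy sequences (not real NSP2 data), only to show the call pattern.
toy_group = {"strain1": "ACGT", "strain2": "ACGA", "strain3": "ACGT"}
low, high = identity_range(toy_group)
```

Reporting the minimum and maximum over all pairs in a host group gives a range in the same form as the one in the abstract.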
WRKY domain-encoding genes of a crop legume chickpea (Cicer arietinum): comparative analysis with Medicago truncatula WRKY family and characterization of group-III gene(s).

Science.gov (United States)

Kumar, Kamal; Srivastava, Vikas; Purayannur, Savithri; Kaladhar, V Chandra; Cheruvu, Purnima Jaiswal; Verma, Praveen Kumar

2016-06-01

The WRKY genes have been identified as important transcriptional modulators predominantly during environmental stresses, but they also play a critical role at various stages of the plant life cycle. We report the identification of WRKY domain (WD)-encoding genes from galegoid clade legumes chickpea (Cicer arietinum L.) and barrel medic (Medicago truncatula). In total, 78 and 98 WD-encoding genes were found in chickpea and barrel medic, respectively. Comparative analysis suggests the presence of both conserved and unique WRKYs, and expansion of the WRKY family in M. truncatula primarily by tandem duplication. Exclusively found in galegoid legumes, CaWRKY16 and its orthologues encode a novel protein having transmembrane and partial Exo70 domains flanking a group-III WD. The genomic region of galegoids containing CaWRKY16 is more dynamic when compared with millettioids. In onion cells, fused CaWRKY16-EYFP showed punctate fluorescent signals in the cytoplasm. The chickpea WRKY group-III genes were further characterized for their transcript level modulation during pathogenic stress and treatments of abscisic acid, jasmonic acid, and salicylic acid (SA) by real-time PCR. Differential regulation of genes was observed during Ascochyta rabiei infection and SA treatment. Characterization of the A. rabiei and SA inducible gene CaWRKY50 showed that it localizes to the plant nucleus, binds to the W-box, and has a C-terminal transactivation domain. Overexpression of CaWRKY50 in tobacco plants resulted in early flowering and senescence. The in-depth comparative account presented here for two legume WRKY genes will be of great utility in hastening functional characterization of crop legume WRKYs and will also help in characterization of Exo70Js. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
PlantPAN: Plant promoter analysis navigator, for identifying combinatorial cis-regulatory elements with distance constraint in plant gene groups

Directory of Open Access Journals (Sweden)

Huang Hsien-Da

2008-11-01

Background: The elucidation of transcriptional regulation in plant genes is an important area of research for plant scientists, following the mapping of various plant genomes, such as A. thaliana, O. sativa and Z. mays. A variety of bioinformatic servers or databases of plant promoters have been established, although most have been focused only on annotating transcription factor binding sites in a single gene and have neglected some important regulatory elements (tandem repeats and CpG/CpNpG islands) in promoter regions. Additionally, the combinatorial interaction of transcription factors (TFs) is important in regulating the gene group that is associated with the same expression pattern. Therefore, a tool for detecting the co-regulation of transcription factors in a group of gene promoters is required. Results: This study develops a database-assisted system, PlantPAN (Plant Promoter Analysis Navigator), for recognizing combinatorial cis-regulatory elements with a distance constraint in sets of plant genes. The system collects the plant transcription factor binding profiles from PLACE, TRANSFAC (public release 7.0), AGRIS, and JASPAR databases and allows users to input a group of gene IDs or promoter sequences, enabling the co-occurrence of combinatorial transcription factor binding sites (TFBSs) within a defined distance (20 bp to 200 bp) to be identified. Furthermore, the new resource enables other regulatory features in a plant promoter, such as CpG/CpNpG islands and tandem repeats, to be displayed. The regulatory elements in the conserved regions of the promoters across homologous genes are detected and presented. Conclusion: In addition to providing a user-friendly input/output interface, PlantPAN has numerous advantages in the analysis of a plant promoter. Several case studies have established the effectiveness of PlantPAN. This novel analytical resource is now freely available at http://PlantPAN.mbc.nctu.edu.tw.
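The distance-constrained co-occurrence search described in this record can be illustrated with a short sketch. This is not PlantPAN's implementation: the `Site` record, the function name, and the choice to measure distance as the gap between the end of the upstream site and the start of the downstream site are all assumptions made for illustration. Only the 20 bp to 200 bp window comes from the abstract.

```python
from itertools import combinations
from typing import NamedTuple


class Site(NamedTuple):
    """A predicted binding site on one promoter (hypothetical record layout)."""
    factor: str
    start: int  # 0-based, inclusive
    end: int    # exclusive


def cooccurring_pairs(sites, min_gap=20, max_gap=200):
    """Return (upstream factor, downstream factor, gap) for every pair of
    sites from different factors whose gap lies within [min_gap, max_gap].

    Assumption: the gap is downstream.start - upstream.end, so overlapping
    sites have a negative gap and are never reported.
    """
    ordered = sorted(sites, key=lambda s: s.start)
    pairs = []
    for upstream, downstream in combinations(ordered, 2):
        if upstream.factor == downstream.factor:
            continue  # only combinations of different factors are of interest
        gap = downstream.start - upstream.end
        if min_gap <= gap <= max_gap:
            pairs.append((upstream.factor, downstream.factor, gap))
    return pairs


# Toy promoter with three predicted sites (made-up coordinates and names).
toy_sites = [Site("TF_A", 0, 10), Site("TF_B", 40, 50), Site("TF_C", 300, 310)]
hits = cooccurring_pairs(toy_sites)
```

Running the same function over every promoter in a gene group and counting how often each factor pair recurs would give a simple view of candidate combinatorial elements.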
Comparative Genomic Analysis of Soybean Flowering Genes

Science.gov (United States)

Jung, Chol-Hee; Wong, Chui E.; Singh, Mohan B.; Bhalla, Prem L.

2012-01-01

Flowering is an important agronomic trait that determines crop yield. Soybean is a major oilseed legume crop used for human and animal feed. Legumes have unique vegetative and floral complexities. Our understanding of the molecular basis of flower initiation and development in legumes is limited. Here, we address this by using a computational approach to examine flowering regulatory genes in the soybean genome in comparison to the most studied model plant, Arabidopsis. For this comparison, a genome-wide analysis of orthologue groups was performed, followed by an in silico gene expression analysis of the identified soybean flowering genes. Phylogenetic analyses of the gene families highlighted the evolutionary relationships among these candidates. Our study identified key flowering genes in soybean and indicates that the vernalisation and the ambient-temperature pathways seem to be the most variant in soybean. A comparison of the orthologue groups containing flowering genes indicated that, on average, each Arabidopsis flowering gene has 2-3 orthologous copies in soybean. Our analysis highlighted that the CDF3, VRN1, SVP, AP3 and PIF3 genes are paralogue-rich genes in soybean. Furthermore, the genome mapping of the soybean flowering genes showed that these genes are scattered randomly across the genome. A paralogue comparison indicated that the soybean genes comprising the largest orthologue group are clustered in a 1.4 Mb region on chromosome 16 of soybean. Furthermore, a comparison with the undomesticated soybean (Glycine soja) revealed that there are hundreds of SNPs that are associated with putative soybean flowering genes and that there are structural variants that may affect the genes of the light-signalling and ambient-temperature pathways in soybean. Our study provides a framework for the soybean flowering pathway and insights into the relationship and evolution of flowering genes between a short-day soybean and the long-day plant, Arabidopsis. PMID:22679494
Time-Course Gene Set Analysis for Longitudinal Gene Expression Data.

Directory of Open Access Journals (Sweden)

Boris P Hejblum

2015-06-01

Gene set analysis methods, which consider predefined groups of genes in the analysis of genomic data, have been successfully applied for analyzing gene expression data in cross-sectional studies. The time-course gene set analysis (TcGSA) introduced here is an extension of gene set analysis to longitudinal data. The proposed method relies on random effects modeling with maximum likelihood estimates. It allows the use of all available repeated measurements while dealing with unbalanced data due to missing at random (MAR) measurements. TcGSA is a hypothesis driven method that identifies a priori defined gene sets with significant expression variations over time, taking into account the potential heterogeneity of expression within gene sets. When biological conditions are compared, the method indicates if the time patterns of gene sets significantly differ according to these conditions. The interest of the method is illustrated by its application to two real life datasets: an HIV therapeutic vaccine trial (DALIA-1 trial), and data from a recent study on influenza and pneumococcal vaccines. In the DALIA-1 trial TcGSA revealed a significant change in gene expression over time within 69 gene sets during vaccination, while a standard univariate individual gene analysis corrected for multiple testing as well as a standard Gene Set Enrichment Analysis (GSEA) for time series both failed to detect any significant pattern change over time. When applied to the second illustrative data set, TcGSA allowed the identification of 4 gene sets finally found to be linked with the influenza vaccine too, although they were found to be associated with the pneumococcal vaccine only in previous analyses. In our simulation study TcGSA exhibits good statistical properties, and an increased power compared to other approaches for analyzing time-course expression patterns of gene sets. The method is made available for the community through an R package.
Identification of distinct genes associated with seawater aspiration-induced acute lung injury by gene expression profile analysis

Science.gov (United States)

Liu, Wei; Pan, Lei; Zhang, Minlong; Bo, Liyan; Li, Congcong; Liu, Qingqing; Wang, Li; Jin, Faguang

2016-01-01

Seawater aspiration-induced acute lung injury (ALI) is a syndrome associated with a high mortality rate, which is characterized by severe hypoxemia, pulmonary edema and inflammation. The present study is the first, to the best of our knowledge, to analyze gene expression profiles from a rat model of seawater aspiration-induced ALI. Adult male Sprague-Dawley rats were instilled with seawater (4 ml/kg) in the seawater aspiration-induced ALI group (S group) or with distilled water (4 ml/kg) in the distilled water negative control group (D group). In the blank control group (C group) the rats' tracheae were exposed without instillation. Subsequently, lung samples were examined by histopathology; total protein concentration was detected in bronchoalveolar lavage fluid (BALF); lung wet/dry weight ratios were determined; and transcript expression was detected by gene sequencing analysis. The results demonstrated that histopathological alterations, pulmonary edema and total protein concentrations in BALF were increased in the S group compared with in the D group. Analysis of differential gene expression identified up and downregulated genes in the S group compared with in the D and C groups. A gene ontology analysis of the differential gene expression revealed enrichment of genes in the functional pathways associated with neutrophil chemotaxis, immune and defense responses, and cytokine activity. Kyoto Encyclopedia of Genes and Genomes analysis revealed that the cytokine-cytokine receptor interaction pathway was one of the most important pathways involved in seawater aspiration-induced ALI. In conclusion, activation of the cytokine-cytokine receptor interaction pathway may have an essential role in the progression of seawater aspiration-induced ALI, and the downregulation of tumor necrosis factor superfamily member 10 may enhance inflammation. Furthermore, IL-6 may be considered a biomarker in seawater aspiration-induced ALI. PMID:27509884
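The abstract above reports gene ontology enrichment among differentially expressed genes but does not name the statistical test. One common choice for this kind of over-representation analysis is the one-sided hypergeometric test, sketched here as an assumption rather than as the authors' method; the function name and the toy numbers are hypothetical.

```python
from math import comb


def enrichment_p_value(population: int, annotated: int,
                       selected: int, overlap: int) -> float:
    """One-sided hypergeometric p-value, P(X >= overlap).

    population: number of genes measured in total
    annotated:  number of those genes carrying the term of interest
    selected:   number of differentially expressed genes
    overlap:    number of selected genes that carry the term
    """
    if not (0 <= annotated <= population and 0 <= selected <= population):
        raise ValueError("annotated and selected must lie within the population")
    lowest = max(overlap, 0)
    highest = min(annotated, selected)
    total = comb(population, selected)
    # Sum the probability mass for every overlap at least as large as observed.
    tail = sum(
        comb(annotated, k) * comb(population - annotated, selected - k)
        for k in range(lowest, highest + 1)
    )
    return tail / total


# Toy numbers: 10 genes measured, 5 annotated, 5 selected, all 5 overlapping.
p = enrichment_p_value(population=10, annotated=5, selected=5, overlap=5)
```

In practice the p-values for many terms would also need a multiple-testing correction, which this sketch leaves out.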
Identification of the Fanconi Anemia Complementation Group I Gene, FANCI

Directory of Open Access Journals (Sweden)

Josephine C. Dorsman

2007-01-01

To identify the gene underlying Fanconi anemia (FA) complementation group I, we studied informative FA-I families by a genome-wide linkage analysis, which resulted in 4 candidate regions together encompassing 351 genes. Candidates were selected via bioinformatics and data mining on the basis of their resemblance to other FA genes/proteins acting in the FA pathway, such as: degree of evolutionary conservation, presence of nuclear localization signals and pattern of tissue-dependent expression. We found a candidate, KIAA1794 on chromosome 15q25-26, to be mutated in 8 affected individuals previously assigned to complementation group I. Western blots of endogenous FANCI indicated that functionally active KIAA1794 protein is lacking in FA-I individuals. Knock-down of KIAA1794 expression by siRNA in HeLa cells caused excessive chromosomal breakage induced by mitomycin C, a hallmark of FA cells. Furthermore, phenotypic reversion of a patient-derived cell line was associated with a secondary genetic alteration at the KIAA1794 locus. These data add up to two conclusions. First, KIAA1794 is a FA gene. Second, this gene is identical to FANCI, since the patient cell lines found mutated in this study included the reference cell line for group I, EUFA592.
Gene coexpression network analysis as a source of functional annotation for rice genes.

Directory of Open Access Journals (Sweden)

Kevin L Childs

With the existence of large publicly available plant gene expression data sets, many groups have undertaken data analyses to construct gene coexpression networks and functionally annotate genes. Often, a large compendium of unrelated or condition-independent expression data is used to construct gene networks. Condition-dependent expression experiments consisting of well-defined conditions/treatments have also been used to create coexpression networks to help examine particular biological processes. Gene networks derived from either condition-dependent or condition-independent data can be difficult to interpret if a large number of genes and connections are present. However, algorithms exist to identify modules of highly connected and biologically relevant genes within coexpression networks. In this study, we have used publicly available rice (Oryza sativa) gene expression data to create gene coexpression networks using both condition-dependent and condition-independent data and have identified gene modules within these networks using the Weighted Gene Coexpression Network Analysis method. We compared the number of genes assigned to modules and the biological interpretability of gene coexpression modules to assess the utility of condition-dependent and condition-independent gene coexpression networks. For the purpose of providing functional annotation to rice genes, we found gene modules identified by coexpression analysis of condition-dependent gene expression experiments to be more useful than gene modules identified by analysis of a condition-independent data set. We have incorporated our results into the MSU Rice Genome Annotation Project database as additional expression-based annotation for 13,537 genes, 2,980 of which lack a functional annotation description. These results provide two new types of functional annotation for our database. Genes in modules are now associated with groups of genes that constitute a collective functional
Statistical assessment of crosstalk enrichment between gene groups in biological networks.

Science.gov (United States)

McCormack, Theodore; Frings, Oliver; Alexeyenko, Andrey; Sonnhammer, Erik L L

2013-01-01

Analyzing groups of functionally coupled genes or proteins in the context of global interaction networks has become an important aspect of bioinformatic investigations. Assessing the statistical significance of crosstalk enrichment between or within groups of genes can be a valuable tool for functional annotation of experimental gene sets. Here we present CrossTalkZ, a statistical method and software to assess the significance of crosstalk enrichment between pairs of gene or protein groups in large biological networks. We demonstrate that the standard z-score is generally an appropriate and unbiased statistic. We further evaluate the ability of four different methods to reliably recover crosstalk within known biological pathways. We conclude that the methods preserving the second-order topological network properties perform best. Finally, we show how CrossTalkZ can be used to annotate experimental gene sets using known pathway annotations and that its performance at this task is superior to gene enrichment analysis (GEA). CrossTalkZ (available at http://sonnhammer.sbc.su.se/download/software/CrossTalkZ/) is implemented in C++, easy to use, fast, accepts various input file formats, and produces a number of statistics. These include z-score, p-value, false discovery rate, and a test of normality for the null distributions.
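The z-score mentioned in this record compares the observed number of network links between two gene groups with the distribution of that count under a null model. A minimal sketch of the arithmetic follows, assuming the null counts have already been obtained from randomized networks; how CrossTalkZ itself generates those networks is not reproduced here, and the function names and toy data are hypothetical.

```python
from statistics import mean, stdev


def count_links(edges, group_a, group_b):
    """Count edges with one endpoint in group_a and the other in group_b."""
    total = 0
    for u, v in edges:
        if (u in group_a and v in group_b) or (v in group_a and u in group_b):
            total += 1
    return total


def crosstalk_z(observed: float, null_counts) -> float:
    """Standard z-score: (observed - mean of null) / sample s.d. of null."""
    sigma = stdev(null_counts)
    if sigma == 0:
        raise ValueError("null distribution has zero variance")
    return (observed - mean(null_counts)) / sigma


# Toy network and groups (made-up gene names).
toy_edges = [("g1", "g2"), ("g2", "g3"), ("g1", "g4")]
observed_links = count_links(toy_edges, {"g1"}, {"g2", "g4"})

# Toy null counts standing in for link counts from randomized networks.
z = crosstalk_z(16, [8, 10, 12])
```

A large positive z indicates more links between the two groups than the null model predicts; the p-value and false discovery rate listed in the abstract would be derived from such scores.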
Analysis of common deafness gene mutations in deaf people from unique ethnic groups in Gansu Province, China.

Science.gov (United States)

Xu, Bai-Cheng; Bian, Pan-Pan; Liu, Xiao-Wen; Zhu, Yi-Ming; Yang, Xiao-Long; Ma, Jian-Li; Chen, Xing-Jian; Wang, Yan-Li; Guo, Yu-Fen

2014-09-01

The GJB2 gene mutation characteristic of Dongxiang was the interaction result of ethnic background and geographical environment, and Yugur exhibited the typical founder effect. The SLC26A4 gene mutation characteristic of Dongxiang was related to caucasian backgrounds and selection of purpose exons, i.e. ethnic background and the penetrance of ethnic specificity caused the low mtDNA1555A>G mutation frequency in Dongxiang. To determine the prevalence of GJB2 and SLC26A4 genes and mtDNA1555A>G mutations and analyze the ethnic specificity in the non-syndromic sensorineural hearing loss (NSHL) of unique ethnic groups in Gansu Province. Peripheral blood samples were obtained from Dongxiang, Yugur, Bonan, and ethnic Han groups with moderately severe to profound NSHL in Gansu Province. Bidirectional sequencing (or enzyme digestion) was applied to identify the sequence variations. The pathogenic allele frequency of the three gene mutations was different. The frequency of the GJB2 gene among the Dongxiang, Yugur, Bonan, and ethnic Han groups was 9.03%, 12.5%, 5.88%, and 12.17%, respectively. No difference was found between the ethnic groups. The frequencies of the SLC26A4 genes were 3.23%, 8.33%, 0%, and 9.81%, respectively. The mutation frequency of mtDNA1555A>G was 0%, 0%, 0%, and 6.03%, respectively. No difference was found between the ethnic groups, except for the Dongxiang and ethnic Han groups, both in SLC26A4 gene and mtDNA1555A>G.
New Mutation Identified in the SRY Gene High Mobility Group (HMG

Directory of Open Access Journals (Sweden)

Feride İffet Şahin

2013-06-01

Full Text Available Mutations in the SRY gene prevent the differentiation of the fetal gonads into testes and cause development of a female phenotype; as a result, sex reversal and pure gonadal dysgenesis (Swyer syndrome) can develop. Different types of mutations identified in the SRY gene are responsible for 15% of gonadal dysgenesis cases. In this study, we report a new mutation (R132P) in the High Mobility Group (HMG) region of the SRY gene, detected in a patient with primary amenorrhea who has a 46,XY karyotype. This mutation leads to replacement of the polar and basic arginine with a nonpolar hydrophobic proline residue at amino acid 132 in the nuclear localization signal region of the protein. With this case report we want to emphasize the genetic approach to patients with gonadal dysgenesis. If a Y chromosome is detected during cytogenetic analysis, revealing the presence of the SRY gene and identifying mutations in this gene by sequencing analysis becomes important in.
Identification and expression analysis of four 14-3-3 genes during fruit ripening in banana (Musa acuminata L. AAA group, cv. Brazilian).

Science.gov (United States)

Li, Mei-Ying; Xu, Bi-Yu; Liu, Ju-Hua; Yang, Xiao-Liang; Zhang, Jian-Bin; Jia, Cai-Hong; Ren, Li-Cheng; Jin, Zhi-Qiang

2012-02-01

To investigate the regulation of 14-3-3 proteins in banana (Musa acuminata L. AAA group, cv. Brazilian) fruit postharvest ripening, four cDNAs encoding 14-3-3 proteins were isolated from banana and designated as Ma-14-3-3a, Ma-14-3-3c, Ma-14-3-3e, and Ma-14-3-3i, respectively. Amino acid sequence alignment showed that the four 14-3-3 proteins shared a highly conserved core structure and variable C-terminal as well as N-terminal regions with 14-3-3 proteins from other plant species. Phylogenetic analysis revealed that the four 14-3-3 genes belong to the non-ε groups. They were differentially and specifically expressed in various tissues. Real-time RT-PCR analysis indicated that these four genes function differentially during banana fruit postharvest ripening. Three genes, Ma-14-3-3a, Ma-14-3-3c, and Ma-14-3-3e, were significantly induced by exogenous ethylene treatment. However, gene function differed in naturally ripened fruits. Ethylene could induce Ma-14-3-3c expression during postharvest ripening, but expression patterns of Ma-14-3-3a and Ma-14-3-3e suggest that these two genes appear to be involved in regulating ethylene biosynthesis during fruit ripening. No obvious relationship emerged between Ma-14-3-3i expression in naturally ripened and 1-MCP (1-methylcyclopropene)-treated fruit groups during fruit ripening. These results indicate that the 14-3-3 proteins might be involved in various regulatory processes of banana fruit ripening. Further studies will mainly focus on revealing the detailed biological mechanisms of these four 14-3-3 genes in regulating banana fruit postharvest ripening.
Human methanogen diversity and incidence in healthy and diseased colonic groups using mcrA gene analysis

Directory of Open Access Journals (Sweden)

Scanlan Pauline D

2008-05-01

Full Text Available Abstract Background The incidence and diversity of human methanogens are insufficiently characterised in the gastrointestinal tract of both health and disease. A PCR and clone library methodology targeting the mcrA gene was adopted to facilitate the two-fold aim of surveying the relative incidence of methanogens in health and disease groups and also to provide an overview of methanogen diversity in the human gastrointestinal tract. Results DNA faecal extracts (207 in total) from a group of healthy controls and five gastrointestinal disease groups were investigated. Colorectal cancer, polypectomised, irritable bowel syndrome and the control group had largely equivalent numbers of individuals positive for methanogens (range 45–50%). Methanogen incidence in the inflammatory bowel disease groups was reduced, 24% for ulcerative colitis and 30% for Crohn's disease. Four unique mcrA gene restriction fragment length polymorphism profiles were identified and bioinformatic analyses revealed that the majority of all sequences (94%) retrieved from libraries were 100% identical to Methanobrevibacter smithii mcrA gene. In addition, mcrA gene sequences most closely related to Methanobrevibacter oralis and members of the order Methanosarcinales were also recovered. Conclusion The mcrA gene serves as a useful biomarker for methanogen detection in the human gut and the varying trends of methanogen incidence in the human gut could serve as important indicators of intestinal function. Although Methanobrevibacter smithii is the dominant methanogen in both the distal colon of individuals in health and disease, the diversity of methanogens is greater than previously reported. In conclusion, the low incidence of methanogens in Inflammatory Bowel Disease, the functionality of the methanogens and impact of methane production in addition to competitive interactions between methanogens and other microbial groups in the human gastrointestinal tract warrants further

DAVID Knowledgebase: a gene-centered database integrating heterogeneous gene annotation resources to facilitate high-throughput gene functional analysis

Directory of Open Access Journals (Sweden)

Baseler Michael W

2007-11-01

Full Text Available Abstract Background Due to the complex and distributed nature of biological research, our current biological knowledge is spread over many redundant annotation databases maintained by many independent groups. Analysts usually need to visit many of these bioinformatics databases in order to integrate comprehensive annotation information for their genes, which becomes one of the bottlenecks, particularly for the analytic task associated with a large gene list. Thus, a highly centralized and ready-to-use gene-annotation knowledgebase is in demand for high throughput gene functional analysis. Description The DAVID Knowledgebase is built around the DAVID Gene Concept, a single-linkage method to agglomerate tens of millions of gene/protein identifiers from a variety of public genomic resources into DAVID gene clusters. The grouping of such identifiers improves the cross-reference capability, particularly across NCBI and UniProt systems, enabling more than 40 publicly available functional annotation sources to be comprehensively integrated and centralized by the DAVID gene clusters. The simple, pair-wise, text format files which make up the DAVID Knowledgebase are freely downloadable for various data analysis uses. In addition, a well organized web interface allows users to query different types of heterogeneous annotations in a high-throughput manner. Conclusion The DAVID Knowledgebase is designed to facilitate high throughput gene functional analysis. For a given gene list, it not only provides the quick accessibility to a wide range of heterogeneous annotation data in a centralized location, but also enriches the level of biological information for an individual gene. Moreover, the entire DAVID Knowledgebase is freely downloadable or searchable at http://david.abcc.ncifcrf.gov/knowledgebase/.
Genome-wide analysis of the GRAS gene family in Prunus mume.

Science.gov (United States)

Lu, Jiuxing; Wang, Tao; Xu, Zongda; Sun, Lidan; Zhang, Qixiang

2015-02-01

Prunus mume is an ornamental flower and fruit tree in Rosaceae. We investigated the GRAS gene family to improve the breeding and cultivation of P. mume and other Rosaceae fruit trees. The GRAS gene family encodes transcriptional regulators that have diverse functions in plant growth and development, such as gibberellin and phytochrome A signal transduction, root radial patterning, and axillary meristem formation and gametogenesis in the P. mume genome. Despite the important roles of these genes in plant growth regulation, no findings on the GRAS genes of P. mume have been reported. In this study, we discerned phylogenetic relationships of P. mume GRAS genes, and their locations, structures in the genome and expression levels of different tissues. Out of 46 identified GRAS genes, 45 were located on the 8 P. mume chromosomes. Phylogenetic results showed that these genes could be classified into 11 groups. We found that Group X was P. mume-specific, and three genes of Group IX clustered with the rice-specific gene Os4. We speculated that these genes existed before the divergence of dicotyledons and monocotyledons and were lost in Arabidopsis. Tissue expression analysis indicated that 13 genes showed high expression levels in roots, stems, leaves, flowers and fruits, and were related to plant growth and development. Functional analysis of 24 GRAS genes and an orthologous relationship analysis indicated that many functioned during plant growth and flower and fruit development. Our bioinformatics analysis provides valuable information to improve the economic, agronomic and ecological benefits of P. mume and other Rosaceae fruit trees.
Genome-wide analysis of the WRKY gene family in cotton.

Science.gov (United States)

Dou, Lingling; Zhang, Xiaohong; Pang, Chaoyou; Song, Meizhen; Wei, Hengling; Fan, Shuli; Yu, Shuxun

2014-12-01

WRKY proteins are major transcription factors involved in regulating plant growth and development. Although many studies have focused on the functional identification of WRKY genes, our knowledge concerning many areas of WRKY gene biology is limited. For example, in cotton, the phylogenetic characteristics, global expression patterns, molecular mechanisms regulating expression, and target genes/pathways of WRKY genes are poorly characterized. Therefore, in this study, we present a genome-wide analysis of the WRKY gene family in cotton (Gossypium raimondii and Gossypium hirsutum). We identified 116 WRKY genes in G. raimondii from the completed genome sequence, and we cloned 102 WRKY genes in G. hirsutum. Chromosomal location analysis indicated that WRKY genes in G. raimondii evolved mainly from segmental duplication followed by tandem amplifications. Phylogenetic analysis of alga, bryophyte, lycophyta, monocot and eudicot WRKY domains revealed family member expansion with increasing complexity of the plant body. Microarray, expression profiling and qRT-PCR data revealed that WRKY genes in G. hirsutum may regulate the development of fibers, anthers, tissues (roots, stems, leaves and embryos), and are involved in the response to stresses. Expression analysis showed that most group II and III GhWRKY genes are highly expressed under diverse stresses. Group I members, representing the ancestral form, seem to be insensitive to abiotic stress, with low expression divergence. Our results indicate that cotton WRKY genes might have evolved by adaptive duplication, leading to sensitivity to diverse stresses. This study provides fundamental information to inform further analysis and understanding of WRKY gene functions in cotton species.
Expression of the Fanconi anemia group A gene (Fanca) during mouse embryogenesis.

Science.gov (United States)

Abu-Issa, R; Eichele, G; Youssoufian, H

1999-07-15

About 80% of all cases of Fanconi anemia (FA) can be accounted for by complementation groups A and C. To understand the relationship between these groups, we analyzed the expression pattern of the mouse FA group-A gene (Fanca) during embryogenesis and compared it with the known pattern of the group-C gene (Fancc). Northern analysis of RNA from mouse embryos at embryonic days 7, 11, 15, and 17 showed a predominant 4.5 kb band in all stages. By in situ hybridization, Fanca transcripts were found in the whisker follicles, teeth, brain, retina, kidney, liver, and limbs. There was also stage-specific variation in Fanca expression, particularly within the developing whiskers and the brain. Some tissues known to express Fancc (eg, gut) failed to show Fanca expression. These observations show that (1) Fanca is under both tissue- and stage-specific regulation in several tissues; (2) the expression pattern of Fanca is consistent with the phenotype of the human disease; and (3) Fanca expression is not necessarily coupled to that of Fancc. The presence of distinct tissue targets for FA genes suggests that some of the variability in the clinical phenotype can be attributed to the complementation group assignment.
[Variation of CAG repeats in coding region of ATXN2 gene in different ethnic groups].

Science.gov (United States)

Chen, Xiao-Chen; Sun, Hao; Mi, Dong-Qing; Huang, Xiao-Qin; Lin, Ke-Qin; Yi, Wen; Yu, Liang; Shi, Lei; Shi, Li; Yang, Zhao-Qing; Chu, Jia-You

2011-04-01

To investigate CAG repeat variation in the ATXN2 gene coding region in six ethnic groups that live in comparatively different environments, to evaluate whether these variations are under positive selection, and to find factors driving selection effects, 291 unrelated healthy individuals were collected from six ethnic groups and their STR genotyping was performed. The frequencies of alleles and genotypes were counted and Slatkin's linearized Fst values were calculated from them. A UPGMA tree for this gene was constructed. An MDS analysis among these groups was carried out as well. The linearized Fst values indicated that there were significant evolutionary differences in the ATXN2 STR between the Hui and Yi groups, but not among the other 4 groups. Further analysis was performed by combining our data with published data obtained from other groups. These results indicated that there were significant differences between Japanese and other groups including Hui, Hani, Yunnan Mongolian, and Inner Mongolian. Both Hui and Mongolian from Inner Mongolia were significantly different from Han. In conclusion, the six ethnic groups had their own distribution characteristics of ATXN2 STR allelic frequencies, and the frequency changes in rare alleles could be a consequence of positive selection.
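For readers unfamiliar with the statistic, Slatkin's linearized Fst is commonly computed as Fst / (1 - Fst), which makes the distance roughly proportional to divergence time. The sketch below applies that transformation to pairwise values; the population labels and Fst numbers are hypothetical and are not taken from this study.

```python
def slatkin_linearized_fst(fst):
    """Slatkin's linearization: Fst / (1 - Fst), defined for 0 <= Fst < 1."""
    if not 0.0 <= fst < 1.0:
        raise ValueError("Fst must lie in [0, 1)")
    return fst / (1.0 - fst)

# Hypothetical pairwise Fst values between three populations A, B, C.
pairwise_fst = {("A", "B"): 0.05, ("A", "C"): 0.20, ("B", "C"): 0.10}

# Linearized distances, e.g. as input to UPGMA clustering or MDS.
linearized = {pair: slatkin_linearized_fst(v) for pair, v in pairwise_fst.items()}
for pair, d in sorted(linearized.items()):
    print(pair, round(d, 4))
```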
Identification of Human HK Genes and Gene Expression Regulation Study in Cancer from Transcriptomics Data Analysis

Science.gov (United States)

Zhang, Zhang; Liu, Jingxing; Wu, Jiayan; Yu, Jun

2013-01-01

The regulation of gene expression is essential for eukaryotes, as it drives the processes of cellular differentiation and morphogenesis, leading to the creation of different cell types in multicellular organisms. RNA-Sequencing (RNA-Seq) provides researchers with a powerful toolbox for characterization and quantification of the transcriptome. Many different human tissue/cell transcriptome datasets from RNA-Seq technology are available in public data resources. The fundamental issue here is how to develop an effective analysis method to estimate expression pattern similarities between different tumor tissues and their corresponding normal tissues. We define the gene expression pattern from three directions: 1) expression breadth, which reflects gene expression on/off status, and mainly concerns ubiquitously expressed genes; 2) low/high or constant/variable expression genes, based on gene expression level and variation; and 3) the regulation of gene expression at the gene structure level. The cluster analysis indicates that gene expression pattern is more closely related to physiological condition than to tissue spatial distance. Two sets of human housekeeping (HK) genes are defined according to cell/tissue types, respectively. To characterize the gene expression pattern in terms of expression level and variation, we first apply an improved K-means algorithm and a gene expression variance model. We find that cancer-associated HK genes (HK genes that are specific to the cancer group but not the normal group) are expressed at higher levels and more variably in the cancer condition than in the normal condition. Cancer-associated HK genes tend to be AT-rich, are enriched in functions related to cell cycle regulation, and constitute some cancer signatures. The expression of large genes is also avoided in the cancer group. These studies will help us understand which cell type-specific patterns of gene expression differ among different cell types, and particularly for cancer. PMID:23382867
Inferring gene expression dynamics via functional regression analysis

Directory of Open Access Journals (Sweden)

Leng Xiaoyan

2008-01-01

Full Text Available Abstract Background Temporal gene expression profiles characterize the time-dynamics of expression of specific genes and are increasingly collected in current gene expression experiments. In the analysis of experiments where gene expression is obtained over the life cycle, it is of interest to relate temporal patterns of gene expression associated with different developmental stages to each other to study patterns of long-term developmental gene regulation. We use tools from functional data analysis to study dynamic changes by relating temporal gene expression profiles of different developmental stages to each other. Results We demonstrate that functional regression methodology can pinpoint relationships that exist between temporal gene expression profiles for different life cycle phases and incorporates dimension reduction as needed for these high-dimensional data. By applying these tools, gene expression profiles for pupa and adult phases are found to be strongly related to the profiles of the same genes obtained during the embryo phase. Moreover, one can distinguish between gene groups that exhibit relationships with positive and others with negative associations between later life and embryonal expression profiles. Specifically, we find a positive relationship in expression for muscle development related genes, and a negative relationship for strictly maternal genes for Drosophila, using temporal gene expression profiles. Conclusion Our findings point to specific reactivation patterns of gene expression during the Drosophila life cycle which differ in characteristic ways between various gene groups. Functional regression emerges as a useful tool for relating gene expression patterns from different developmental stages, and avoids the problems with large numbers of parameters and multiple testing that affect alternative approaches.
System Biology Approach: Gene Network Analysis for Muscular Dystrophy.

Science.gov (United States)

Censi, Federica; Calcagnini, Giovanni; Mattei, Eugenio; Giuliani, Alessandro

2018-01-01

Phenotypic changes at different organization levels, from cell to entire organism, are associated with changes in the pattern of gene expression. These changes involve the entire genome expression pattern and heavily rely upon correlation patterns among genes. The classical approach used to analyze gene expression data builds upon the application of supervised statistical techniques to detect genes differentially expressed among two or more phenotypes (e.g., normal vs. disease). The use of an a posteriori, unsupervised approach based on principal component analysis (PCA) and the subsequent construction of gene correlation networks can shed light on unexpected behaviour of the gene regulation system while maintaining a more naturalistic view of the studied system. In this chapter we applied an unsupervised method to discriminate DMD patients and controls. The genes having the highest absolute scores in the discrimination between the groups were then analyzed in terms of gene expression networks, on the basis of their mutual correlation in the two groups. The correlation network structures suggest two different modes of gene regulation in the two groups, reminiscent of important aspects of DMD pathogenesis.
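The network-building step described above can be sketched as follows: compute pairwise Pearson correlations among selected genes within one group of samples and keep an edge where the absolute correlation passes a cutoff. The gene names, expression values, and the 0.8 threshold are hypothetical, and the chapter's actual selection and thresholding choices are not given in the abstract.

```python
import math
from itertools import combinations

def pearson(x, y):
    """Pearson correlation between two equal-length expression vectors."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def correlation_network(expr, threshold=0.8):
    """Return the set of gene pairs whose |r| meets the threshold."""
    edges = set()
    for g1, g2 in combinations(sorted(expr), 2):
        if abs(pearson(expr[g1], expr[g2])) >= threshold:
            edges.add((g1, g2))
    return edges

# Hypothetical expression values for three genes across four samples
# of a single group (run once per group to compare network structures).
expr = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.0, 6.0, 8.5],
    "geneC": [4.0, 1.0, 3.0, 2.0],
}
edges = correlation_network(expr)
print(edges)
```

Building one such edge set for patients and one for controls, then comparing them, mirrors the two-group comparison the abstract describes.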
Integrative analysis of multiple diverse omics datasets by sparse group multitask regression

Directory of Open Access Journals (Sweden)

Dongdong eLin

2014-10-01

Full Text Available A variety of high throughput genome-wide assays enable the exploration of genetic risk factors underlying complex traits. Although these studies have had a remarkable impact on identifying susceptibility biomarkers, they suffer from issues such as limited sample size and low reproducibility. Combining individual studies of different genetic levels/platforms has the promise to improve the power and consistency of biomarker identification. In this paper, we propose a novel integrative method, namely sparse group multitask regression, for integrating diverse omics datasets, platforms and populations to identify risk genes/factors of complex diseases. This method combines multitask learning with sparse group regularization, which will: 1) treat the biomarker identification in each single study as a task and then combine them by multitask learning; 2) group variables from all studies for identifying significant genes; 3) enforce a sparse constraint on groups of variables to overcome the ‘small sample, but large variables’ problem. We introduce two sparse group penalties, sparse group lasso and sparse group ridge, in our multitask model, and provide an effective algorithm for each model. In addition, we propose a significance test for the identification of potential risk genes. Two simulation studies are performed to evaluate the performance of our integrative method by comparing it with the conventional meta-analysis method. The results show that our sparse group multitask method significantly outperforms the meta-analysis method. In an application to our osteoporosis studies, 7 genes are identified as significant genes by our method and are found to have significant effects in three other independent studies for validation. The most significant gene, SOD2, has been identified in our previous osteoporosis study involving the same expression dataset. Several other genes such as TREML2, HTR1E and GLO1 are shown to be novel susceptibility genes for osteoporosis, as confirmed
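To make the idea of a sparse group penalty concrete, the sketch below evaluates one common textbook form: a group term (sum of per-group Euclidean norms) plus an element-wise l1 term. The function name, the weights `lam_group` and `lam_l1`, and the coefficients are assumptions for illustration; the exact penalty and multitask structure used in the paper are not given in the abstract.

```python
import math

def sparse_group_lasso_penalty(groups, lam_group, lam_l1):
    """Penalty = lam_group * sum_g ||beta_g||_2 + lam_l1 * ||beta||_1.

    groups is a list of coefficient lists, one list per gene/group.
    """
    group_term = sum(math.sqrt(sum(b * b for b in g)) for g in groups)
    l1_term = sum(abs(b) for g in groups for b in g)
    return lam_group * group_term + lam_l1 * l1_term

# Hypothetical coefficients: the second group is entirely zero (group
# sparsity) and the third has one zero entry (within-group sparsity).
groups = [[3.0, 4.0], [0.0, 0.0], [0.0, -2.0]]
penalty = sparse_group_lasso_penalty(groups, lam_group=1.0, lam_l1=0.5)
print(penalty)
```

The group term encourages whole groups to vanish, while the l1 term encourages zeros inside the groups that remain.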
Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data

Directory of Open Access Journals (Sweden)

Tintle Nathan L

2012-08-01

Full Text Available Abstract Background Statistical analyses of whole genome expression data require functional information about genes in order to yield meaningful biological conclusions. The Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) are common sources of functionally grouped gene sets. For bacteria, the SEED and MicrobesOnline provide alternative, complementary sources of gene sets. To date, no comprehensive evaluation of the data obtained from these resources has been performed. Results We define a series of gene set consistency metrics directly related to the most common classes of statistical analyses for gene expression data, and then perform a comprehensive analysis of 3581 Affymetrix® gene expression arrays across 17 diverse bacteria. We find that gene sets obtained from GO and KEGG demonstrate lower consistency than those obtained from the SEED and MicrobesOnline, regardless of gene set size. Conclusions Despite the widespread use of GO and KEGG gene sets in bacterial gene expression data analysis, the SEED and MicrobesOnline provide more consistent sets for a wide variety of statistical analyses. Increased use of the SEED and MicrobesOnline gene sets in the analysis of bacterial gene expression data may improve statistical power and utility of expression data.
Supervised group Lasso with applications to microarray data analysis

Directory of Open Access Journals (Sweden)

Huang Jian

2007-02-01

Full Text Available Abstract Background A tremendous amount of effort has been devoted to identifying genes for diagnosis and prognosis of diseases using microarray gene expression data. It has been demonstrated that gene expression data have cluster structure, where the clusters consist of co-regulated genes which tend to have coordinated functions. However, most available statistical methods for gene selection do not take the cluster structure into consideration. Results We propose a supervised group Lasso approach that takes into account the cluster structure in gene expression data for gene selection and predictive model building. For gene expression data without biological cluster information, we first divide genes into clusters using the K-means approach and determine the optimal number of clusters using the Gap method. The supervised group Lasso consists of two steps. In the first step, we identify important genes within each cluster using the Lasso method. In the second step, we select important clusters using the group Lasso. Tuning parameters are determined using V-fold cross validation at both steps to allow for further flexibility. Prediction performance is evaluated using leave-one-out cross validation. We apply the proposed method to disease classification and survival analysis with microarray data. Conclusion We analyze four microarray data sets using the proposed approach: two cancer data sets with binary cancer occurrence as outcomes and two lymphoma data sets with survival outcomes. The results show that the proposed approach is capable of identifying a small number of influential gene clusters and important genes within those clusters, and has better prediction performance than existing methods.
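The cluster-selection behaviour of the group Lasso in the second step can be illustrated with the standard group soft-thresholding operator, which shrinks a cluster's coefficient vector toward zero and removes the whole cluster when its norm falls below the threshold. This is a generic sketch of that operator, not the authors' fitting procedure; the function name, coefficients, and threshold are hypothetical.

```python
import math

def group_soft_threshold(beta_g, t):
    """Shrink one group's coefficients: beta_g * max(0, 1 - t / ||beta_g||_2)."""
    norm = math.sqrt(sum(b * b for b in beta_g))
    if norm <= t:
        # The whole cluster is dropped from the model.
        return [0.0 for _ in beta_g]
    scale = 1.0 - t / norm
    return [scale * b for b in beta_g]

# Hypothetical per-cluster coefficients after a within-cluster Lasso step.
strong_cluster = [3.0, 4.0]   # norm 5.0, survives with shrinkage
weak_cluster = [0.3, 0.4]     # norm 0.5, removed entirely

shrunk = group_soft_threshold(strong_cluster, t=2.5)
dropped = group_soft_threshold(weak_cluster, t=2.5)
print(shrunk, dropped)
```

In practice the threshold would be tied to a tuning parameter chosen by cross validation, as the abstract describes for both steps.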
[Genome-wide identification and expression analysis of the WRKY gene family in peach].

Science.gov (United States)

Gu, Yan-bing; Ji, Zhi-rui; Chi, Fu-mei; Qiao, Zhuang; Xu, Cheng-nan; Zhang, Jun-xiang; Zhou, Zong-shan; Dong, Qing-long

2016-03-01

The WRKY transcription factors are one of the largest families of transcriptional regulators and play diverse regulatory roles in biotic and abiotic stresses, plant growth and development processes. In this study, the WRKY DNA-binding domain (Pfam Database number: PF03106) downloaded from the Pfam protein families database was used to identify WRKY genes in the peach (Prunus persica 'Lovell') genome using HMMER 3.0. The obtained amino acid sequences were analyzed with the DNAMAN 5.0, WebLogo 3, MEGA 5.1, MapInspect and MEME bioinformatics software. A total of 61 peach WRKY genes were found in the peach genome. Our phylogenetic analysis revealed that peach WRKY genes were classified into three Groups: Ⅰ, Ⅱ and Ⅲ. The WRKY N-terminal and C-terminal domains of Group Ⅰ (group I-N and group I-C) were monophyletic. Group Ⅱ was sub-divided into five distinct clades (group Ⅱ-a, Ⅱ-b, Ⅱ-c, Ⅱ-d and Ⅱ-e). Our domain analysis indicated that the WRKY regions contained a highly conserved heptapeptide stretch WRKYGQK at the N-terminus followed by a zinc-finger motif. The chromosome mapping analysis showed that peach WRKY genes were distributed with different densities over 8 chromosomes. The intron-exon structure analysis revealed that structures of the WRKY genes were highly conserved in the peach. The conserved motif analysis showed that the conserved motifs 1, 2 and 3, which specify the WRKY domain, were observed in all peach WRKY proteins; motif 5, an unknown domain, was observed in group Ⅱ-d; and two WRKY domains were assigned to Group Ⅰ. SqRT-PCR and qRT-PCR results indicated that 16 PpWRKY genes were expressed in roots, stems, leaves, flowers and fruits at various expression levels. Our analysis thus identified the PpWRKY gene family, and future functional studies are needed to reveal its specific roles.
Digital gene expression analysis of Microsporum canis exposed to berberine chloride.

Directory of Open Access Journals (Sweden)

Chen-Wen Xiao

Full Text Available Berberine, a natural isoquinoline alkaloid of many medicinal herbs, has an active function against a variety of microbial infections including Microsporum canis (M. canis). However, the underlying mechanisms are poorly understood. To study the effect of berberine chloride on M. canis infection, a Digital Gene Expression (DGE) tag profiling was constructed and a transcriptome analysis of the M. canis cellular responses upon berberine treatment was performed. The Illumina HiSeq sequencing technique was used to generate the gene expression profile data, and the subsequent enrichment analyses of Gene Ontology (GO) and Pathway function were conducted based on the transcriptome data. The results of DGE showed that there were 8476945, 14256722, 7708575, 5669955, 6565513 and 9303468 tags respectively, which were obtained from M. canis incubated with berberine or control DMSO. In total, 8,783 genes were mapped, and 1,890 genes showed significant changes between the two groups. 1,030 genes were up-regulated and 860 genes were down-regulated (P<0.05) in the berberine treated group compared to the control group. Besides, twenty-three GO terms were identified by Gene Ontology functional enrichment analysis, such as calcium-transporting ATPase activity, 2-oxoglutarate metabolic process, valine catabolic process, peroxisome and unfolded protein binding. Pathway significant enrichment analysis indicated 6 signaling pathways that are significant, including steroid biosynthesis, steroid hormone biosynthesis, Parkinson's disease, 2,4-Dichlorobenzoate degradation, and tropane, piperidine and isoquinoline alkaloid biosynthesis. Among these, eleven selected genes were further verified by qRT-PCR. Our findings provide a comprehensive view of the gene expression profile of M. canis upon berberine treatment, and shed light on its complicated effects on M. canis.
A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis

Directory of Open Access Journals (Sweden)

Wenyi Qin

2018-02-01

Full Text Available Abstract Motivation Detecting differentially expressed (DE) genes between disease and normal control groups is one of the most common analyses in genome-wide transcriptomic data. Since most studies don’t have a lot of samples, researchers have used meta-analysis to group different datasets for the same disease. Even then, in many cases the statistical power is still not enough. Taking into account the fact that many diseases share the same disease genes, it is desirable to design a statistical framework that can identify diseases’ common and specific DE genes simultaneously to improve the identification power. Results We developed a novel empirical Bayes based mixture model to identify DE genes in a specific study by leveraging the shared information across multiple different disease expression data sets. The effectiveness of joint analysis was demonstrated through comprehensive simulation studies and two real data applications. The simulation results showed that our method consistently outperformed single data set analysis and two other meta-analysis methods in identification power. In real data analysis, overall our method demonstrated better identification power in detecting DE genes and prioritized more disease related genes and disease related pathways than single data set analysis. Over 150% more disease related genes are identified by our method in application to Huntington’s disease. We expect that our method would provide researchers a new way of utilizing available data sets from different diseases when the sample size of the focused disease is limited.
Genome-wide analysis of WRKY gene family in Cucumis sativus.

Science.gov (United States)

Ling, Jian; Jiang, Weijie; Zhang, Ying; Yu, Hongjun; Mao, Zhenchuan; Gu, Xingfang; Huang, Sanwen; Xie, Bingyan

2011-09-28

WRKY proteins are a large family of transcriptional regulators in higher plants. They are involved in many biological processes, such as plant development, metabolism, and responses to biotic and abiotic stresses. Prior to the present study, only one full-length cucumber WRKY protein had been reported. The recent publication of the draft genome sequence of cucumber allowed us to conduct a genome-wide search for cucumber WRKY proteins, and to compare these positively identified proteins with their homologs in model plants, such as Arabidopsis. We identified a total of 55 WRKY genes in the cucumber genome. According to structural features of their encoded proteins, the cucumber WRKY (CsWRKY) genes were classified into three groups (group 1-3). Analysis of expression profiles of CsWRKY genes indicated that 48 WRKY genes display differential expression either in their transcript abundance or in their expression patterns under normal growth conditions, and 23 WRKY genes were differentially expressed in response to at least one abiotic stress (cold, drought or salinity). The expression profiles of stress-inducible CsWRKY genes were correlated with those of their putative Arabidopsis WRKY (AtWRKY) orthologs, except for the group 3 WRKY genes. Interestingly, duplicated group 3 AtWRKY genes appear to have been under positive selection pressure during evolution. In contrast, there was no evidence of recent gene duplication or positive selection pressure among CsWRKY group 3 genes, which may have led to the expressional divergence of group 3 orthologs. Fifty-five WRKY genes were identified in cucumber and the structure of their encoded proteins, their expression, and their evolution were examined. Considering that there has been extensive expansion of group 3 WRKY genes in angiosperms, the occurrence of different evolutionary events could explain the functional divergence of these genes.
Group sparse canonical correlation analysis for genomic data integration.

Science.gov (United States)

Lin, Dongdong; Zhang, Jigang; Li, Jingyao; Calhoun, Vince D; Deng, Hong-Wen; Wang, Yu-Ping

2013-08-12

The emergence of high-throughput genomic datasets from different sources and platforms (e.g., gene expression, single nucleotide polymorphisms (SNP), and copy number variation (CNV)) has greatly enhanced our understanding of the interplay of these genomic factors as well as their influences on complex diseases. It is challenging to explore the relationship between these different types of genomic data sets. In this paper, we focus on a multivariate statistical method, canonical correlation analysis (CCA), for this problem. The conventional CCA method does not work effectively if the number of data samples is significantly less than that of biomarkers, which is a typical case for genomic data (e.g., SNPs). Sparse CCA (sCCA) methods were introduced to overcome such difficulty, mostly using penalizations with the l-1 norm (CCA-l1) or the combination of l-1 and l-2 norms (CCA-elastic net). However, they overlook the structural or group effect within genomic data in the analysis, which often exists and is important (e.g., SNPs spanning a gene interact and work together as a group). We propose a new group sparse CCA method (CCA-sparse group) along with an effective numerical algorithm to study the mutual relationship between two different types of genomic data (i.e., SNP and gene expression). We then extend the model to a more general formulation that can include the existing sCCA models. We apply the model to feature/variable selection from two data sets and compare our group sparse CCA method with existing sCCA methods on both simulation and two real datasets (human gliomas data and NCI60 data). We use a graphical representation of the samples with a pair of canonical variates to demonstrate the discriminating characteristic of the selected features. Pathway analysis is further performed for biological interpretation of those features. The CCA-sparse group method incorporates group effects of features into the correlation analysis while performs individual feature
BOG: R-package for Bacterium and virus analysis of Orthologous Groups

Directory of Open Access Journals (Sweden)

Jincheol Park

2015-01-01

Full Text Available BOG (Bacterium and virus analysis of Orthologous Groups) is a package for identifying groups of differentially regulated genes in the light of gene functions for various virus and bacteria genomes. It is designed to identify Clusters of Orthologous Groups (COGs) that are enriched among genes that have gone through significant changes under different conditions. This would contribute to the detection of pathogens, an important scientific research area of relevance in uncovering bioterrorism, among others. Particular statistical analyses include hypergeometric, Mann–Whitney rank sum, and gene set enrichment. Results from the analyses are organized and presented in tabular and graphical forms for ease of understanding and dissemination of results. BOG is implemented as an R-package, which is available from CRAN or can be downloaded from http://www.stat.osu.edu/~statgen/SOFTWARE/BOG/.
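
The hypergeometric analysis named in this record can be illustrated with a minimal Python sketch. It is not BOG's own code (BOG is an R package); the function name and the toy numbers are assumptions made only for illustration. It computes the probability of seeing at least the observed number of genes from one COG among the significantly changed genes, if those genes had been drawn at random.

```python
from math import comb


def hypergeom_enrichment_pvalue(population, annotated, selected, overlap):
    """Upper-tail hypergeometric p-value, P(X >= overlap).

    population: total number of genes considered
    annotated:  genes in the population that belong to the group (e.g. one COG)
    selected:   genes flagged as significantly changed
    overlap:    flagged genes that also belong to the group
    """
    upper = min(annotated, selected)
    total = comb(population, selected)
    # Sum the probability mass for every overlap at least as large as observed.
    tail = sum(
        comb(annotated, k) * comb(population - annotated, selected - k)
        for k in range(overlap, upper + 1)
    )
    return tail / total


if __name__ == "__main__":
    # Hypothetical numbers: 10 genes, 4 in the group, 3 flagged, 2 of them in the group.
    print(hypergeom_enrichment_pvalue(10, 4, 3, 2))
```

A small p-value suggests the group is over-represented among the changed genes; correction for testing many groups is left out of this sketch.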
Orthoscape: a cytoscape application for grouping and visualization KEGG based gene networks by taxonomy and homology principles.

Science.gov (United States)

Mustafin, Zakhar Sergeevich; Lashin, Sergey Alexandrovich; Matushkin, Yury Georgievich; Gunbin, Konstantin Vladimirovich; Afonnikov, Dmitry Arkadievich

2017-01-27

There are many available software tools for visualization and analysis of biological networks. Among them, Cytoscape (http://cytoscape.org/) is one of the most comprehensive packages, with many plugins and applications which extend its functionality by providing analysis of protein-protein interaction, gene regulatory and gene co-expression networks, metabolic, signaling, neural as well as ecological-type networks including food webs, community networks, etc. Nevertheless, only three plugins tagged 'network evolution' are found in the official Cytoscape app store and in the literature. We have developed a new Cytoscape 3.0 application, Orthoscape, aimed at facilitating evolutionary analysis of gene networks and visualizing the results. Orthoscape aids in analysis of evolutionary information available for gene sets and networks by highlighting: (1) the orthology relationships between genes; (2) the evolutionary origin of gene network components; (3) the evolutionary pressure mode (diversifying or stabilizing, negative or positive selection) of orthologous groups in general and/or branch-oriented mode. The distinctive feature of Orthoscape is the ability to control all data analysis steps via a user-friendly interface. Orthoscape allows its users to analyze gene networks or separated gene sets in the context of evolution. At each step of data analysis, Orthoscape also provides for convenient visualization and data manipulation.
Life cycle analysis of kidney gene expression in male F344 rats.

Directory of Open Access Journals (Sweden)

Joshua C Kwekel

Full Text Available Age is a predisposing condition for susceptibility to chronic kidney disease and progression as well as acute kidney injury that may arise due to the adverse effects of some drugs. Age-related differences in kidney biology, therefore, are a key concern in understanding drug safety and disease progression. We hypothesize that the underlying suite of genes expressed in the kidney at various life cycle stages will impact susceptibility to adverse drug reactions. Therefore, establishing changes in baseline expression data between these life stages is the first and necessary step in evaluating this hypothesis. Untreated male F344 rats were sacrificed at 2, 5, 6, 8, 15, 21, 78, and 104 weeks of age. Kidneys were collected for histology and gene expression analysis. Agilent whole-genome rat microarrays were used to query global expression profiles. An ANOVA (p1.5 in relative mRNA expression, was used to identify 3,724 unique differentially expressed genes (DEGs). Principal component analyses of these DEGs revealed three major divisions in life-cycle renal gene expression. K-means cluster analysis identified several groups of genes that shared age-specific patterns of expression. Pathway analysis of these gene groups revealed age-specific gene networks and functions related to renal function and aging, including extracellular matrix turnover, immune cell response, and renal tubular injury. Large age-related changes in expression were also demonstrated for the genes that code for qualified renal injury biomarkers KIM-1, Clu, and Tff3. These results suggest specific groups of genes that may underlie age-specific susceptibilities to adverse drug reactions and disease. This analysis of the basal gene expression patterns of renal genes throughout the life cycle of the rat will improve the use of current and future renal biomarkers and inform our assessments of kidney injury and disease.
GSMA: Gene Set Matrix Analysis, An Automated Method for Rapid Hypothesis Testing of Gene Expression Data

Directory of Open Access Journals (Sweden)

Chris Cheadle

2007-01-01

Full Text Available Background: Microarray technology has become highly valuable for identifying complex global changes in gene expression patterns. The assignment of functional information to these complex patterns remains a challenging task in effectively interpreting data and correlating results from across experiments, projects and laboratories. Methods which allow the rapid and robust evaluation of multiple functional hypotheses increase the power of individual researchers to data mine gene expression data more efficiently. Results: We have developed gene set matrix analysis (GSMA) as a useful method for the rapid testing of group-wise up- or downregulation of gene expression simultaneously for multiple lists of genes (gene sets) against entire distributions of gene expression changes (datasets) for single or multiple experiments. The utility of GSMA lies in its flexibility to rapidly poll gene sets related by known biological function or as designated solely by the end-user against large numbers of datasets simultaneously. Conclusions: GSMA provides a simple and straightforward method for hypothesis testing in which genes are tested by groups across multiple datasets for patterns of expression enrichment.
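
The idea of testing many gene sets against many datasets at once can be sketched in Python. The abstract does not state which statistic GSMA uses, so the z-score below (gene-set mean against the mean and standard deviation of all genes in the dataset) is an assumption chosen for illustration, and all function and variable names are hypothetical.

```python
from math import sqrt
from statistics import mean, pstdev


def gene_set_z_score(all_changes, gene_set):
    """Z-score of a gene set's mean change against the whole dataset.

    all_changes: dict mapping gene -> expression change (e.g. a log ratio)
    gene_set:    iterable of gene names; genes absent from the dataset are ignored
    """
    values = list(all_changes.values())
    members = [all_changes[g] for g in gene_set if g in all_changes]
    if not members:
        raise ValueError("gene set has no genes in this dataset")
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        raise ValueError("dataset has no variation in expression change")
    return (mean(members) - mu) * sqrt(len(members)) / sigma


def gene_set_matrix(datasets, gene_sets):
    """Score every gene set (rows) against every dataset (columns)."""
    return {
        set_name: {
            ds_name: gene_set_z_score(changes, genes)
            for ds_name, changes in datasets.items()
        }
        for set_name, genes in gene_sets.items()
    }


if __name__ == "__main__":
    # Toy data only: four genes in one hypothetical experiment.
    datasets = {"exp1": {"a": 1.0, "b": -1.0, "c": 1.0, "d": -1.0}}
    gene_sets = {"up": ["a", "c"], "down": ["b", "d"]}
    print(gene_set_matrix(datasets, gene_sets))
```

Positive scores indicate group-wise upregulation and negative scores downregulation; the resulting matrix has one row per gene set and one column per dataset.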

Gene-Transformation-Induced Changes in Chemical Functional Group Features and Molecular Structure Conformation in Alfalfa Plants Co-Expressing Lc-bHLH and C1-MYB Transcriptive Flavanoid Regulatory Genes: Effects of Single-Gene and Two-Gene Insertion.

Science.gov (United States)

Heendeniya, Ravindra G; Yu, Peiqiang

2017-03-20

Alfalfa (Medicago sativa L.) genotypes transformed with Lc-bHLH and Lc transcription genes were developed with the intention of stimulating proanthocyanidin synthesis in the aerial parts of the plant. To our knowledge, there are no studies on the effect of single-gene and two-gene transformation on chemical functional groups and molecular structure changes in these plants. The objective of this study was to use advanced molecular spectroscopy with multivariate chemometrics to determine chemical functional group intensity and molecular structure changes in alfalfa plants when co-expressing Lc-bHLH and C1-MYB transcriptive flavanoid regulatory genes in comparison with non-transgenic (NT) and AC Grazeland (ACGL) genotypes. The results showed that compared to the NT genotype, the presence of double genes (Lc and C1) increased ratios of both the area and peak height of protein structural Amide I/II and the height ratio of α-helix to β-sheet. In carbohydrate-related spectral analysis, the double gene-transformed alfalfa genotypes exhibited lower peak heights at 1370, 1240, 1153, and 1020 cm⁻¹ compared to the NT genotype. Furthermore, the effect of double gene transformation on carbohydrate molecular structure was clearly revealed in the principal component analysis of the spectra. In conclusion, single or double transformation of Lc and C1 genes resulted in changing functional groups and molecular structure related to proteins and carbohydrates compared to the NT alfalfa genotype. The current study provided molecular structural information on the transgenic alfalfa plants and provided an insight into the impact of transgenes on protein and carbohydrate properties and their molecular structure changes.
A Bioinformatics Analysis Reveals a Group of MocR Bacterial Transcriptional Regulators Linked to a Family of Genes Coding for Membrane Proteins

Directory of Open Access Journals (Sweden)

Teresa Milano

2016-01-01

Full Text Available The MocR bacterial transcriptional regulators are characterized by an N-terminal domain, 60 residues long on average, possessing the winged-helix-turn-helix (wHTH) architecture responsible for DNA recognition and binding, linked to a large C-terminal domain (350 residues on average) that is homologous to fold type-I pyridoxal 5′-phosphate (PLP)-dependent enzymes like aspartate aminotransferase (AAT). These regulators are involved in the expression of genes taking part in several metabolic pathways directly or indirectly connected to PLP chemistry, many of which are still uncharacterized. A bioinformatics analysis is here reported that studied the features of a distinct group of MocR regulators predicted to be functionally linked to a family of homologous genes coding for integral membrane proteins of unknown function. This group occurs mainly in the Actinobacteria and Gammaproteobacteria phyla. An analysis of the multiple sequence alignments of their wHTH and AAT domains suggested the presence of specificity-determining positions (SDPs). Mapping of SDPs onto a homology model of the AAT domain hinted at possible structural/functional roles in effector recognition. Likewise, SDPs in the wHTH domain suggested the basis of specificity of Transcription Factor Binding Site recognition. The results reported represent a framework for rational design of experiments and for bioinformatics analysis of other MocR subgroups.
Sequence comparison and phylogenetic analysis of core gene of ...

African Journals Online (AJOL)

Phylogenetic analysis suggests that our sequences are clustered with sequences reported from Japan. This is the first phylogenetic analysis of HCV core gene from Pakistani population. Our sequences and sequences from Japan are grouped into same cluster in the phylogenetic tree. Sequence comparison and ...
Virulence Gene Pool Detected in Bovine Group C Streptococcus dysgalactiae subsp. dysgalactiae Isolates by Use of a Group A S. pyogenes Virulence Microarray

Science.gov (United States)

Rato, Márcia G.; Nerlich, Andreas; Bergmann, René; Bexiga, Ricardo; Nunes, Sandro F.; Vilela, Cristina L.; Santos-Sanches, Ilda; Chhatwal, Gursharan S.

2011-01-01

A custom-designed microarray containing 220 virulence genes of Streptococcus pyogenes (group A Streptococcus [GAS]) was used to test group C Streptococcus dysgalactiae subsp. dysgalactiae (GCS) field strains causing bovine mastitis and group C or group G Streptococcus dysgalactiae subsp. equisimilis (GCS/GGS) isolates from human infections, with the latter being used for comparative purposes, for the presence of virulence genes. All bovine and all human isolates carried a fraction of the 220 genes (23% and 39%, respectively). The virulence genes encoding streptolysin S, glyceraldehyde-3-phosphate dehydrogenase, the plasminogen-binding M-like protein PAM, and the collagen-like protein SclB were detected in the majority of both bovine and human isolates (94 to 100%). Virulence factors, usually carried by human beta-hemolytic streptococcal pathogens, such as streptokinase, laminin-binding protein, and the C5a peptidase precursor, were detected in all human isolates but not in bovine isolates. Additionally, GAS bacteriophage-associated virulence genes encoding superantigens, DNase, and/or streptodornase were detected in bovine isolates (72%) but not in the human isolates. Determinants located in non-bacteriophage-related mobile elements, such as the gene encoding R28, were detected in all bovine and human isolates. Several virulence genes, including genes of bacteriophage origin, were shown to be expressed by reverse transcriptase PCR (RT-PCR). Phylogenetic analysis of superantigen gene sequences revealed a high level (>98%) of identity among genes of bovine GCS, of the horse pathogen Streptococcus equi subsp. equi, and of the human pathogen GAS. Our findings indicate that alpha-hemolytic bovine GCS, an important mastitis pathogen and considered to be a nonhuman pathogen, carries important virulence factors responsible for virulence and pathogenesis in humans. PMID:21525223
Interactions of Polyhomeotic with Polycomb Group Genes of Drosophila Melanogaster

OpenAIRE

Cheng, N. N.; Sinclair, DAR.; Campbell, R. B.; Brock, H. W.

1994-01-01

The Polycomb (Pc) group genes of Drosophila are negative regulators of homeotic genes, but individual loci have pleiotropic phenotypes. It has been suggested that Pc group genes might form a regulatory hierarchy, or might be members of a multimeric complex that obeys the law of mass action. Recently, it was shown that polyhomeotic (ph) immunoprecipitates in a multimeric complex that includes Pc. Here, we show that duplications of ph suppress homeotic transformations of Pc and Pcl, supporting ...
Analysis of the Fanconi anaemia complementation group A gene in acute myeloid leukaemia.

Science.gov (United States)

Condie, Alison; Powles, Raymond L; Hudson, Chantelle D; Shepherd, Valerie; Bevan, Stephen; Yuille, Martin R; Houlston, Richard S

2002-09-01

Acute myeloid leukaemia (AML) is the most common acute leukaemia in adults. Around 10-15% of individuals with recessively inherited Fanconi anaemia (FA) develop AML. FA is one of a group of recessive syndromes characterized by excessive spontaneous chromosomal breakage in which heterozygote carriers appear to display an increased risk of cancer, and there is some indirect evidence that FA carriers may also be at increased risk of AML. This suggests that FA genes may play a role in the development of AML in the wider context. To examine this proposition further, we have screened samples from 79 AML patients for mutations in the major FA gene, FANCA. No truncating FANCA mutations were detected. One missense mutation previously designated as pathogenic and five novel missense mutations causing non-conservative amino acid substitutions were detected. The data suggest that while FANCA mutations are rare, they may contribute to the development of the disease in a subset of AML.
Abundance profiling of specific gene groups using precomputed gut metagenomes yields novel biological hypotheses.

Directory of Open Access Journals (Sweden)

Konstantin Yarygin

Full Text Available The gut microbiota is essentially a multifunctional bioreactor within a human being. The exploration of its enormous metabolic potential provides insights into the mechanisms underlying microbial ecology and interactions with the host. The data obtained using "shotgun" metagenomics capture information about the whole spectrum of microbial functions. However, each new study presenting new sequencing data tends to extract only a little of the information concerning the metabolic potential and often omits specific functions. A meta-analysis of the available data with an emphasis on biomedically relevant gene groups can unveil new global trends in the gut microbiota. As a step toward the reuse of metagenomic data, we developed a method for the quantitative profiling of user-defined groups of genes in human gut metagenomes. This method is based on the quick analysis of a gene coverage matrix obtained by pre-mapping the metagenomic reads to a global gut microbial catalogue. The method was applied to profile the abundance of several gene groups related to antibiotic resistance, phages, biosynthesis clusters and carbohydrate degradation in 784 metagenomes from healthy populations worldwide and patients with inflammatory bowel diseases and obesity. We discovered country-wise functional specifics in gut resistome and virome compositions. The most distinct features of the disease microbiota were found for Crohn's disease, followed by ulcerative colitis and obesity. Profiling of the genes belonging to crAssphage showed that its abundance varied across the world populations and was not associated with clinical status. We demonstrated temporal resilience of crAssphage and the influence of the sample preparation protocol on its detected abundance. Our approach offers a convenient method to add value to accumulated "shotgun" metagenomic data by helping researchers state and assess novel biological hypotheses.
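
The profiling step described in this record, summarizing a precomputed gene coverage matrix over user-defined gene groups, can be sketched in Python as follows. The data layout (nested dictionaries), the normalization by total sample coverage, and every name in the code are assumptions for illustration; the abstract does not specify how the authors' method stores or normalizes coverage.

```python
def profile_gene_groups(coverage, gene_groups):
    """Relative abundance of each gene group in each metagenome.

    coverage:    dict sample -> dict gene -> coverage, assumed to be precomputed
                 by mapping reads to a reference gene catalogue
    gene_groups: dict group name -> iterable of gene identifiers
    Returns dict sample -> dict group -> fraction of the sample's total coverage.
    """
    profile = {}
    for sample, gene_cov in coverage.items():
        total = sum(gene_cov.values())
        profile[sample] = {}
        for group, genes in gene_groups.items():
            # Genes missing from the matrix contribute zero coverage.
            group_cov = sum(gene_cov.get(g, 0.0) for g in set(genes))
            profile[sample][group] = group_cov / total if total else 0.0
    return profile


if __name__ == "__main__":
    # Toy coverage values for two hypothetical samples.
    coverage = {
        "s1": {"g1": 2.0, "g2": 6.0, "g3": 2.0},
        "s2": {"g1": 0.0, "g2": 0.0},
    }
    groups = {"resistance": ["g1", "g3"], "phage": ["g2", "g9"]}
    print(profile_gene_groups(coverage, groups))
```

Because the read mapping is done once in advance, profiling a new gene group only requires a pass over the matrix, which is what makes this kind of reuse of accumulated metagenomes quick.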
Comparative genome analysis of PHB gene family reveals deep evolutionary origins and diverse gene function.

Science.gov (United States)

Di, Chao; Xu, Wenying; Su, Zhen; Yuan, Joshua S

2010-10-07

The PHB (Prohibitin) gene family is involved in a variety of functions important for different biological processes. PHB genes are ubiquitously present in divergent species from prokaryotes to eukaryotes. Human PHB genes have been found to be associated with various diseases. Recent studies by our group and others have shown diverse functions of PHB genes in plants for development, senescence, defence, and others. Despite the importance of the PHB gene family, no comprehensive gene family analysis has been carried out to evaluate the relatedness of PHB genes across different species. In order to better guide the gene function analysis and understand the evolution of the PHB gene family, we therefore carried out a comparative genome analysis of the PHB genes across different kingdoms. The relatedness, motif distribution, and intron/exon distribution all indicated that the PHB genes are a relatively conserved gene family. The PHB genes can be classified into 5 classes and each class has a very deep evolutionary origin. The PHB genes within each class maintained the same motif patterns during evolution. With Arabidopsis as the model species, we found that PHB gene intron/exon structure and domains are also conserved during evolution. Despite being a conserved gene family, various gene duplication events led to the expansion of the PHB genes. Both segmental and tandem gene duplication were involved in Arabidopsis PHB gene family expansion. However, segmental duplication is predominant in Arabidopsis. Moreover, most of the duplicated genes experienced neofunctionalization. The results highlighted that PHB genes might be involved in important functions so that the duplicated genes are under evolutionary pressure to derive new functions. The PHB gene family is a conserved gene family and accounts for diverse but important biological functions based on similar molecular mechanisms. The highly diverse biological function indicated that more research needs to be carried out
Restoration using Azolla imbricata increases nitrogen functional bacterial groups and genes in soil.

Science.gov (United States)

Lu, Xiao-Ming; Lu, Peng-Zhen; Yang, Ke

2017-05-01

Microbial groups are major factors that influence soil function. Currently, there is a lack of studies on microbial functional groups. Although soil microorganisms play an important role in the nitrogen cycle, systematic studies of the effects of environmental factors on microbial populations in relation to key metabolic processes in the nitrogen cycle are seldom reported. In this study, we conducted a systematic analysis of the changes in nitrogen functional groups in mandarin orange garden soil treated with Azolla imbricata. The structures of the major functional bacterial groups and the functional gene abundances involved in key processes of the soil nitrogen cycle were analyzed using high-throughput sequencing (HTS) and quantitative real-time PCR, respectively. The results indicated that returning A. imbricata had an important influence on the composition of soil nitrogen functional bacterial communities. Treatment with A. imbricata increased the diversity of the nitrogen functional bacteria. The abundances of nitrogen functional genes were significantly higher in the treated soil compared with the control soil. Both the diversity of the major nitrogen functional bacteria (nifH bacteria, nirK bacteria, and narG bacteria) and the abundances of nitrogen functional genes in the soil showed significant positive correlations with the soil pH, the organic carbon content, available nitrogen, available phosphorus, and NH4+-N and NO3−-N contents. Treatment with 12.5 kg fresh A. imbricata per mandarin orange tree was effective in improving the quality of the mandarin orange garden soil. This study analyzed the mechanism of the changes in functional bacterial groups and genes involved in key metabolic processes of the nitrogen cycle in soil treated with A. imbricata.
Knowledge Enrichment Analysis for Human Tissue-Specific Genes Uncover New Biological Insights

Directory of Open Access Journals (Sweden)

Gong Xiu-Jun

2012-06-01

Full Text Available The expression and regulation of genes in different tissues are fundamental questions to be answered in biology. Knowledge enrichment analysis for tissue specific (TS) and housekeeping (HK) genes may help identify their roles in biological processes or diseases and gain new biological insights. In this paper, we performed the knowledge enrichment analysis for 17,343 genes in 84 human tissues using Gene Set Enrichment Analysis (GSEA) and Hypergeometric Analysis (HA) against three biological ontologies: Gene Ontology (GO), KEGG pathways and Disease Ontology (DO), respectively. The analysis results demonstrated that the functions of most gene groups are consistent with their tissue origins. Meanwhile, three interesting new associations for HK genes and the skeletal muscle tissue genes are found. Firstly, Hypergeometric analysis against the KEGG database for HK genes disclosed that three disease terms (Parkinson’s disease, Huntington’s disease, Alzheimer’s disease) are intensively enriched. Secondly, Hypergeometric analysis against the KEGG database for Skeletal Muscle tissue genes shows that two cardiac diseases, “Hypertrophic cardiomyopathy (HCM)” and “Arrhythmogenic right ventricular cardiomyopathy (ARVC)”, are heavily enriched, which are also considered to have no relationship with skeletal functions. Thirdly, “Prostate cancer” is intensively enriched in Hypergeometric analysis against the disease ontology (DO) for the Skeletal Muscle tissue genes, which is a highly unexpected phenomenon.
ADAGE signature analysis: differential expression analysis with data-defined gene sets.

Science.gov (United States)

Tan, Jie; Huyck, Matthew; Hu, Dongbo; Zelaya, René A; Hogan, Deborah A; Greene, Casey S

2017-11-22

Gene set enrichment analysis and overrepresentation analyses are commonly used methods to determine the biological processes affected by a differential expression experiment. This approach requires biologically relevant gene sets, which are currently curated manually, limiting their availability and accuracy in many organisms without extensively curated resources. New feature learning approaches can now be paired with existing data collections to directly extract functional gene sets from big data. Here we introduce a method to identify perturbed processes. In contrast with methods that use curated gene sets, this approach uses signatures extracted from public expression data. We first extract expression signatures from public data using ADAGE, a neural network-based feature extraction approach. We next identify signatures that are differentially active under a given treatment. Our results demonstrate that these signatures represent biological processes that are perturbed by the experiment. Because these signatures are directly learned from data without supervision, they can identify uncurated or novel biological processes. We implemented ADAGE signature analysis for the bacterial pathogen Pseudomonas aeruginosa. For the convenience of different user groups, we implemented both an R package (ADAGEpath) and a web server ( http://adage.greenelab.com ) to run these analyses. Both are open-source to allow easy expansion to other organisms or signature generation methods. We applied ADAGE signature analysis to an example dataset in which wild-type and ∆anr mutant cells were grown as biofilms on the Cystic Fibrosis genotype bronchial epithelial cells. We mapped active signatures in the dataset to KEGG pathways and compared with pathways identified using GSEA. The two approaches generally return consistent results; however, ADAGE signature analysis also identified a signature that revealed the molecularly supported link between the MexT regulon and Anr. We designed
Structured association analysis leads to insight into Saccharomyces cerevisiae gene regulation by finding multiple contributing eQTL hotspots associated with functional gene modules.

Science.gov (United States)

Curtis, Ross E; Kim, Seyoung; Woolford, John L; Xu, Wenjie; Xing, Eric P

2013-03-21

Association analysis using genome-wide expression quantitative trait locus (eQTL) data investigates the effect that genetic variation has on cellular pathways and leads to the discovery of candidate regulators. Traditional analysis of eQTL data via pairwise statistical significance tests or linear regression does not leverage the availability of the structural information of the transcriptome, such as presence of gene networks that reveal correlation and potentially regulatory relationships among the study genes. We employ a new eQTL mapping algorithm, GFlasso, which we have previously developed for sparse structured regression, to reanalyze a genome-wide yeast dataset. GFlasso fully takes into account the dependencies among expression traits to suppress false positives and to enhance the signal/noise ratio. Thus, GFlasso leverages the gene-interaction network to discover the pleiotropic effects of genetic loci that perturb the expression level of multiple (rather than individual) genes, which enables us to gain more power in detecting previously neglected signals that are marginally weak but pleiotropically significant. While eQTL hotspots in yeast have been reported previously as genomic regions controlling multiple genes, our analysis reveals additional novel eQTL hotspots and, more interestingly, uncovers groups of multiple contributing eQTL hotspots that affect the expression level of functional gene modules. To our knowledge, our study is the first to report this type of gene regulation stemming from multiple eQTL hotspots. Additionally, we report the results from in-depth bioinformatics analysis for three groups of these eQTL hotspots: ribosome biogenesis, telomere silencing, and retrotransposon biology. We suggest candidate regulators for the functional gene modules that map to each group of hotspots. Not only do we find that many of these candidate regulators contain mutations in the promoter and coding regions of the genes, in the case of the Ribi group
Partial least squares based gene expression analysis in estrogen receptor positive and negative breast tumors.

Science.gov (United States)

Ma, W; Zhang, T-F; Lu, P; Lu, S H

2014-01-01

Breast cancer is categorized into two broad groups: estrogen receptor positive (ER+) and ER negative (ER-) groups. A previous study proposed that under trastuzumab-based neoadjuvant chemotherapy, tumor initiating cell (TIC) featured ER- tumors respond better than ER+ tumors. Exploration of the molecular difference between these two groups may help in developing new therapeutic strategies, especially for ER- patients. With gene expression profiles from the Gene Expression Omnibus (GEO) database, we performed partial least squares (PLS) based analysis, which is more sensitive than common variance/regression analysis. We acquired 512 differentially expressed genes. Four pathways were found to be enriched with differentially expressed genes, involving the immune system, metabolism and genetic information processing. Network analysis identified five hub genes with degrees higher than 10, including APP, ESR1, SMAD3, HDAC2, and PRKAA1. Our findings provide new understanding of the molecular difference between TIC featured ER- and ER+ breast tumors, in the hope of offering support for therapeutic studies.
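
The hub-gene criterion in this record (degree higher than 10 in the interaction network) can be sketched in Python. The edge-list representation, the function name, and the toy network are assumptions for illustration; the abstract does not say which network source or software the authors used.

```python
from collections import defaultdict


def find_hub_genes(edges, min_degree=10):
    """Return genes whose degree is strictly greater than min_degree.

    edges: iterable of (gene_a, gene_b) pairs from an undirected interaction
           network; duplicate edges and self-loops are not counted.
    """
    neighbors = defaultdict(set)
    for a, b in edges:
        if a == b:
            continue  # a self-loop does not add a distinct interaction partner
        neighbors[a].add(b)
        neighbors[b].add(a)
    return {gene: len(n) for gene, n in neighbors.items() if len(n) > min_degree}


if __name__ == "__main__":
    # Toy network: one hypothetical gene linked to 11 partners, plus one extra edge.
    edges = [("HUB", f"g{i}") for i in range(11)] + [("g0", "g1")]
    print(find_hub_genes(edges))
```

Storing neighbors in sets means a pair reported twice, or in both directions, still counts as a single interaction.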
Bioinformatics Analysis of NBS-LRR Encoding Resistance Genes in Setaria italica.

Science.gov (United States)

Zhao, Yan; Weng, Qiaoyun; Song, Jinhui; Ma, Hailian; Yuan, Jincheng; Dong, Zhiping; Liu, Yinghui

2016-06-01

In plants, resistance (R) genes are involved in pathogen recognition and subsequent activation of innate immune responses. The nucleotide-binding site-leucine-rich repeat (NBS-LRR) gene family forms the largest R-gene family among plant genomes and plays an important role in plant disease resistance. In this paper, a comprehensive analysis of NBS-encoding genes is performed in the whole Setaria italica genome. A total of 96 NBS-LRR genes are identified, and a comprehensive overview of the NBS-LRR genes is undertaken, including phylogenetic analysis, chromosome locations, conserved motifs of proteins, and gene expression. Based on the domain, these genes are divided into two groups and distributed in all Setaria italica chromosomes. Most NBS-LRR genes are located at the distal tip of the long arms of the chromosomes. Setaria italica NBS-LRR proteins share at least one nucleotide-binding domain and one leucine-rich repeat domain. Our results also show the duplication of NBS-LRR genes in Setaria italica is related to their gene structure.
Molecular responses and expression analysis of genes in a ...

African Journals Online (AJOL)

2009-06-17

Jun 17, 2009 ... Molecular responses and expression analysis of genes in a xerophytic desert shrub Haloxylon ammodendron .... physiological determination and cDNA-AFLP analysis, three groups of seeds were sowed in pots with sand and .... HaDR27. U. 234. PDR-like ABC transporter. AT1G59870. HaDR28. U. 135.
Genome-Wide Analysis of the NAC Gene Family in Physic Nut (Jatropha curcas L.).

Science.gov (United States)

Wu, Zhenying; Xu, Xueqin; Xiong, Wangdan; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Wu, Guojiang; Jiang, Huawu

2015-01-01

The NAC proteins (NAM, ATAF1/2 and CUC2) are plant-specific transcriptional regulators that have a conserved NAM domain in the N-terminus. They are involved in various biological processes, including both biotic and abiotic stress responses. In the present study, a total of 100 NAC genes (JcNAC) were identified in physic nut (Jatropha curcas L.). Based on phylogenetic analysis and gene structures, 83 JcNAC genes were classified as members of, or proposed to be diverged from, 39 previously predicted orthologous groups (OGs) of NAC sequences. Physic nut has a single intron-containing NAC gene subfamily that has been lost in many plants. The JcNAC genes are non-randomly distributed across the 11 linkage groups of the physic nut genome, and appear to be preferentially retained duplicates that arose from both ancient and recent duplication events. Digital gene expression analysis indicates that some of the JcNAC genes have tissue-specific expression profiles (e.g. in leaves, roots, stem cortex or seeds), and 29 genes differentially respond to abiotic stresses (drought, salinity, phosphorus deficiency and nitrogen deficiency). Our results will be helpful for further functional analysis of the NAC genes in physic nut.
Occult HBV among Anti-HBc Alone: Mutation Analysis of an HBV Surface Gene and Pre-S Gene.

Science.gov (United States)

Kim, Myeong Hee; Kang, So Young; Lee, Woo In

2017-05-01

The aim of this study is to investigate the molecular characteristics of occult hepatitis B virus (HBV) infection in 'anti-HBc alone' subjects. Twenty-four patients with 'anti-HBc alone' and 20 control patients diagnosed with HBV were analyzed regarding S and pre-S gene mutations. All specimens were analyzed for HBsAg, anti-HBc, and anti-HBs. For specimens with anti-HBc alone, quantitative analysis of HBV DNA, as well as sequencing and mutation analysis of S and pre-S genes, were performed. A total of 24 were analyzed for the S gene, and 14 were analyzed for the pre-S gene through sequencing. A total of 20 control patients were analyzed for the S and pre-S genes simultaneously. Nineteen point mutations of the major hydrophilic region were found in six of 24 patients. Among them, three mutations, S114T, P127S/T, M133T, were detected in common. Only one mutation was found in five subjects of the control group; this mutation was not found in the occult HBV infection group, however. Pre-S mutations were detected in 10 patients, and mutations of site aa58-aa100 were detected in 9 patients. A mutation on D114E was simultaneously detected. Although five mutations from the control group were found at the same location (aa58-aa100), no mutations of occult HBV infection were detected. The prevalence of occult HBV infection is not low among 'anti-HBc alone' subjects. Variable mutations in the S gene and pre-S gene were associated with the occurrence of occult HBV infection. Further larger scale studies are required to determine the significance of newly detected mutations. © Copyright: Yonsei University College of Medicine 2017
Genome-wide comparative analysis reveals similar types of NBS genes in hybrid Citrus sinensis genome and original Citrus clementine genome and provides new insights into non-TIR NBS genes.

Directory of Open Access Journals (Sweden)

Yunsheng Wang

Full Text Available In this study, we identified and compared nucleotide-binding site (NBS) domain-containing genes from three Citrus genomes (C. clementina, C. sinensis from USA and C. sinensis from China). Phylogenetic analysis of all Citrus NBS genes across these three genomes revealed that there are three approximately evenly numbered groups: one group contains the Toll-Interleukin receptor (TIR) domain and two different Non-TIR groups in which most proteins contain the Coiled Coil (CC) domain. Motif analysis confirmed that the two groups of CC-containing NBS genes are from different evolutionary origins. We partitioned NBS genes into clades using NBS domain sequence distances and found most clades include NBS genes from all three Citrus genomes. This suggests that three Citrus genomes have similar numbers and types of NBS genes. We also mapped the re-sequenced reads of three pomelo and three mandarin genomes onto the C. sinensis genome. We found that most NBS genes of the hybrid C. sinensis genome have corresponding homologous genes in both pomelo and mandarin genomes. The homologous NBS genes in pomelo and mandarin suggest that the parental species of C. sinensis may contain similar types of NBS genes. This explains why the hybrid C. sinensis and original C. clementina have similar types of NBS genes in this study. Furthermore, we found that sequence variation amongst Citrus NBS genes was shaped by multiple independent and shared accelerated mutation accumulation events among different groups of NBS genes and in different Citrus genomes. Our comparative analyses yield valuable insight into the structure, organization and evolution of NBS genes in Citrus genomes. Furthermore, our comprehensive analysis showed that the non-TIR NBS genes can be divided into two groups that come from different evolutionary origins. This provides new insights into non-TIR genes, which have not received much attention.
Effect of the absolute statistic on gene-sampling gene-set analysis methods.

Science.gov (United States)

Nam, Dougu

2017-06-01

Gene-set enrichment analysis and its modified versions have commonly been used for identifying altered functions or pathways in disease from microarray data. In particular, the simple gene-sampling gene-set analysis methods have been heavily used for datasets with only a few sample replicates. The biggest problem with this approach is the highly inflated false-positive rate. In this paper, the effect of absolute gene statistic on gene-sampling gene-set analysis methods is systematically investigated. Thus far, the absolute gene statistic has merely been regarded as a supplementary method for capturing the bidirectional changes in each gene set. Here, it is shown that incorporating the absolute gene statistic in gene-sampling gene-set analysis substantially reduces the false-positive rate and improves the overall discriminatory ability. Its effect was investigated by power, false-positive rate, and receiver operating curve for a number of simulated and real datasets. The performances of gene-set analysis methods in one-tailed (genome-wide association study) and two-tailed (gene expression data) tests were also compared and discussed.
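The mechanics of a gene-sampling test with and without the absolute gene statistic can be sketched as follows. This is an illustration under stated assumptions, not the paper's procedure: the set statistic (mean gene score), the sampling scheme, and all names and toy data are assumptions, and the sketch does not reproduce the false-positive results reported above.

```python
import random

def gene_sampling_pvalue(scores, gene_set, use_absolute, n_samples=1000, seed=0):
    """Gene-sampling p-value for one gene set.

    scores: dict mapping gene -> gene-level statistic (e.g. a t-statistic).
    The set statistic is the mean of the (optionally absolute) gene scores;
    the null distribution comes from random gene sets of the same size.
    """
    rng = random.Random(seed)
    transform = abs if use_absolute else (lambda x: x)
    values = {g: transform(s) for g, s in scores.items()}
    genes = list(values)
    k = len(gene_set)

    def set_stat(members):
        # |mean| makes the signed version two-sided; it is a no-op for absolute scores.
        return abs(sum(values[g] for g in members) / k)

    observed = set_stat(gene_set)
    hits = sum(set_stat(rng.sample(genes, k)) >= observed for _ in range(n_samples))
    return (hits + 1) / (n_samples + 1)

# Toy data (invented): 190 background genes plus a set of 10 genes that
# change in both directions (five up, five down).
rng = random.Random(1)
scores = {f"bg{i}": rng.gauss(0.0, 1.0) for i in range(190)}
gene_set = [f"set{i}" for i in range(10)]
scores.update({g: (3.0 if i % 2 == 0 else -3.0) for i, g in enumerate(gene_set)})

p_signed = gene_sampling_pvalue(scores, gene_set, use_absolute=False)
p_absolute = gene_sampling_pvalue(scores, gene_set, use_absolute=True)
print(p_signed, p_absolute)
```

In this toy case the signed scores cancel within the set, so the signed test sees nothing, whereas the absolute statistic captures the bidirectional change.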
Genome-scale analysis of positional clustering of mouse testis-specific genes

Directory of Open Access Journals (Sweden)

Lee Bernett TK

2005-01-01

Full Text Available Abstract Background Genes are not randomly distributed on a chromosome, as was once thought, even after removal of tandem repeats. The positional clustering of co-expressed genes is known in prokaryotes and has recently been reported in several eukaryotic organisms such as Caenorhabditis elegans, Drosophila melanogaster, and Homo sapiens. In order to further investigate the mode of tissue-specific gene clustering in higher eukaryotes, we have performed a genome-scale analysis of positional clustering of the mouse testis-specific genes. Results Our computational analysis shows that a large proportion of testis-specific genes are clustered in groups of 2 to 5 genes in the mouse genome. The number of clusters is much higher than expected by chance even after removal of tandem repeats. Conclusion Our result suggests that testis-specific genes tend to cluster on the mouse chromosomes. This provides another piece of evidence for the hypothesis that clusters of tissue-specific genes do exist.
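One way to compare an observed number of clusters with the number expected by chance is a label-permutation test, sketched below. The abstract does not state how clusters were defined or how the chance expectation was computed; here a cluster is assumed to be a run of at least two adjacent flagged genes in chromosomal order, and the data are invented.

```python
import random

def count_clusters(flags, min_size=2):
    """Count runs of at least min_size consecutive flagged genes."""
    clusters, run = 0, 0
    for flagged in flags:
        if flagged:
            run += 1
        else:
            if run >= min_size:
                clusters += 1
            run = 0
    if run >= min_size:  # close a run that reaches the chromosome end
        clusters += 1
    return clusters

def clustering_pvalue(flags, n_perm=2000, seed=0):
    """Share of random label shuffles with at least as many clusters as observed."""
    rng = random.Random(seed)
    observed = count_clusters(flags)
    shuffled = list(flags)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if count_clusters(shuffled) >= observed:
            hits += 1
    return observed, (hits + 1) / (n_perm + 1)

# Toy chromosome (invented): 60 genes in order, 1 = testis-specific.
chromosome = ([1, 1, 1] + [0] * 12) * 4
observed, p_value = clustering_pvalue(chromosome)
print(observed, p_value)
```

Shuffling the labels keeps the number of flagged genes fixed, so the comparison isolates their arrangement along the chromosome.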

Identification of substituent groups and related genes involved in salecan biosynthesis in Agrobacterium sp. ZX09.

Science.gov (United States)

Xu, Linxiang; Cheng, Rui; Li, Jing; Wang, Yang; Zhu, Bin; Ma, Shihong; Zhang, Weiming; Dong, Wei; Wang, Shiming; Zhang, Jianfa

2017-01-01

Salecan, a soluble β-1,3-D-glucan produced by a salt-tolerant strain Agrobacterium sp. ZX09, has been the subject of considerable interest in recent years because of its multiple bioactivities and unusual rheological properties in solution. In this study, both succinyl and pyruvyl substituent groups on salecan were identified by an enzymatic hydrolysis following nuclear magnetic resonance (NMR), HPLC, and MS analysis. The putative succinyltransferase gene (sleA) and pyruvyltransferase gene (sleV) were determined and cloned. Disruption of the sleA gene resulted in the absence of succinyl substituent groups on salecan. This defect could be complemented by expressing the sleA cloned in a plasmid. Thus, the sleA and sleV genes located in a 19.6-kb gene cluster may be involved in salecan biosynthesis. Despite the lack of succinyl substituents, the molecular mass of salecan generated by the sleA mutant did not substantially differ from that generated by the wild-type strain. Loss of succinyl substituents on salecan changed its rheological characteristics, especially a decrease in intrinsic viscosity.
MAGMA: generalized gene-set analysis of GWAS data.

Science.gov (United States)

de Leeuw, Christiaan A; Mooij, Joris M; Heskes, Tom; Posthuma, Danielle

2015-04-01

By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn's Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn's Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn's Disease data was found to be considerably faster as well.
Group spike-and-slab lasso generalized linear models for disease prediction and associated genes detection by incorporating pathway information.

Science.gov (United States)

Tang, Zaixiang; Shen, Yueping; Li, Yan; Zhang, Xinyan; Wen, Jia; Qian, Chen'ao; Zhuang, Wenzhuo; Shi, Xinghua; Yi, Nengjun

2018-03-15

Large-scale molecular data have been increasingly used as an important resource for prognostic prediction of diseases and detection of associated genes. However, standard approaches for omics data analysis ignore the group structure among genes encoded in functional relationships or pathway information. We propose new Bayesian hierarchical generalized linear models, called group spike-and-slab lasso GLMs, for predicting disease outcomes and detecting associated genes by incorporating large-scale molecular data and group structures. The proposed model employs a mixture double-exponential prior for coefficients that induces self-adaptive shrinkage amount on different coefficients. The group information is incorporated into the model by setting group-specific parameters. We have developed a fast and stable deterministic algorithm to fit the proposed hierarchical GLMs, which can perform variable selection within groups. We assess the performance of the proposed method on several simulated scenarios, by varying the overlap among groups, group size, number of non-null groups, and the correlation within group. Compared with existing methods, the proposed method provides not only more accurate estimates of the parameters but also better prediction. We further demonstrate the application of the proposed procedure on three cancer datasets by utilizing pathway structures of genes. Our results show that the proposed method generates powerful models for predicting disease outcomes and detecting associated genes. The methods have been implemented in a freely available R package BhGLM (http://www.ssg.uab.edu/bhglm/). nyi@uab.edu. Supplementary data are available at Bioinformatics online.
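The abstract does not give the prior explicitly. A common way to write a spike-and-slab mixture double-exponential prior is with a scale that switches between a small "spike" value and a larger "slab" value according to an inclusion indicator; the sketch below assumes that form. The names `s0`, `s1` and `gamma` are assumptions for illustration and are not taken from the BhGLM package.

```python
import math

def ssl_scale(gamma, s0, s1):
    """Prior scale for one coefficient: s0 (spike) when gamma = 0, s1 (slab) when gamma = 1."""
    return (1 - gamma) * s0 + gamma * s1

def de_log_density(beta, scale):
    """Log density of a double-exponential (Laplace) prior centred at zero."""
    return -math.log(2 * scale) - abs(beta) / scale

# Assumed toy values: a narrow spike and a wide slab.
s0, s1 = 0.05, 1.0
for gamma in (0.0, 0.5, 1.0):
    scale = ssl_scale(gamma, s0, s1)
    # 1/scale acts as the lasso-type penalty weight on |beta|.
    print(gamma, scale, 1 / scale, de_log_density(0.5, scale))
```

Under this form, coefficients judged unlikely to be included (gamma near 0) get a heavy penalty and those likely to be included (gamma near 1) a light one, which is one reading of the "self-adaptive shrinkage" described above.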
Digital Gene Expression Profiling Analysis of Aged Mice under Moxibustion Treatment

Directory of Open Access Journals (Sweden)

Nan Liu

2018-01-01

Full Text Available Aging is closely connected with death, progressive physiological decline, and increased risk of diseases, such as cancer, arteriosclerosis, heart disease, hypertension, and neurodegenerative diseases. It is reported that moxibustion can treat more than 300 kinds of diseases including aging related problems and can improve immune function and physiological functions. The digital gene expression profiling of aged mice with or without moxibustion treatment was investigated and the mechanisms of moxibustion in aged mice were speculated by gene ontology and pathway analysis in the study. Almost 145 million raw reads were obtained by digital gene expression analysis and about 140 million (96.55%) were clean reads. Five differentially expressed genes with an adjusted P value 1 were identified between the control and moxibustion groups. They were Gm6563, Gm8116, Rps26-ps1, Nat8f4, and Igkv3-12. Gene ontology analysis was carried out by the GOseq R package, and functional annotations of the differentially expressed genes were related to translation, mRNA export from nucleus, mRNA transport, nuclear body, acetyltransferase activity, and so on. The Kyoto Encyclopedia of Genes and Genomes database was used for pathway analysis, and ribosome was the most significantly enriched pathway term.
Identification of species of viridans group streptococci in clinical blood culture isolates by sequence analysis of the RNase P RNA gene, rnpB.

Science.gov (United States)

Westling, Katarina; Julander, Inger; Ljungman, Per; Vondracek, Martin; Wretlind, Bengt; Jalal, Shah

2008-03-01

Viridans group streptococci (VGS) cause severe diseases such as infective endocarditis and septicaemia. Genetically, VGS species are very close to each other and it is difficult to identify them to species level with conventional methods. The aims of the present study were to use sequence analysis of the RNase P RNA gene (rnpB) to identify VGS species in clinical blood culture isolates, and to compare the results with the API 20 Strep system that is based on phenotypical characteristics. Strains from patients with septicaemia or endocarditis were analysed with PCR amplification and sequence analysis of the rnpB gene. Clinical data were registered as well. One hundred and thirty two VGS clinical blood culture isolates from patients with septicaemia (n=95) or infective endocarditis (n=36) were analysed; all but one were identified by rnpB. Streptococcus oralis, Streptococcus sanguinis and Streptococcus gordonii strains were most common in the patients with infective endocarditis. In the isolates from patients with haematological diseases, Streptococcus mitis and S. oralis dominated. In addition in 76 of the isolates it was possible to compare the results from rnpB analysis and the API 20 Strep system. In 39/76 (51%) of the isolates the results were concordant to species level; in 55 isolates there were no results from API 20 Strep. Sequence analysis of the RNase P RNA gene (rnpB) showed that almost all isolates could be identified. This could be of importance for evaluation of the portal of entry in patients with septicaemia or infective endocarditis.
Genome-wide identification, subcellular localization and gene expression analysis of the members of CESA gene family in common tobacco (Nicotiana tabacum L.).

Science.gov (United States)

Xu, Zong-Chang; Kong, Yingzhen

2017-06-20

Cellulose-synthase proteins (CESAs) are membrane localized proteins and they form protein complexes to produce cellulose in the plasma membrane. CESA proteins play very important roles in cell wall construction during plant growth and development. In this study, a total of 21 NtCESA gene sequences were identified by using the PF03552 conserved protein sequence and 10 AtCESA protein sequences of Arabidopsis thaliana to blast against the common tobacco (Nicotiana tabacum L.) genome database with the TBLASTN protocol. We analyzed the physical and chemical properties of protein sequences based on some software or on-line analysis tools. The results showed that there were no significant variances in terms of the physical and chemical properties of the 21 NtCESA proteins. First, phylogenetic tree analysis showed that 21 NtCESA genes and 10 AtCESA genes were clustered into five groups, and the gene structures were similar among the genes that are clustered into the same group. Second, in all of the 21 NtCESA proteins the conserved zinc finger domain was identified in the N-terminus, transmembrane domains were identified in the C-terminus and the DDD-QXXRW conserved domains were also identified. Third, gene expression analysis results indicated that most NtCESA genes were expressed in roots and leaves of seedling or mature tissues of tobacco, seeds and callus tissues. The genes that clustered into the same group share similar expression patterns. Importantly, NtCESA proteins that are involved in secondary cell wall cellulose synthesis have two extra transmembrane domains compared with those involved in primary cell wall cellulose biosynthesis. In addition, subcellular localization results showed that NtCESA9 and NtCESA14 were two plasma membrane anchored proteins. This study will lay a foundation for further functional characterization of these NtCESA genes.
Comparative genomic and transcriptomic analysis of selected fatty acid biosynthesis genes and CNL disease resistance genes in oil palm

Science.gov (United States)

Rosli, Rozana; Amiruddin, Nadzirah; Ab Halim, Mohd Amin; Chan, Pek-Lan; Chan, Kuang-Lim; Azizi, Norazah; Morris, Priscilla E.; Leslie Low, Eng-Ti; Ong-Abdullah, Meilina; Sambanthamurthi, Ravigadevi; Singh, Rajinder

2018-01-01

Comparative genomics and transcriptomic analyses were performed on two agronomically important groups of genes from oil palm versus other major crop species and the model organism, Arabidopsis thaliana. The first analysis was of two gene families with key roles in regulation of oil quality and in particular the accumulation of oleic acid, namely stearoyl ACP desaturases (SAD) and acyl-acyl carrier protein (ACP) thioesterases (FAT). In both cases, these were found to be large gene families with complex expression profiles across a wide range of tissue types and developmental stages. The detailed classification of the oil palm SAD and FAT genes has enabled the updating of the latest version of the oil palm gene model. The second analysis focused on disease resistance (R) genes in order to elucidate possible candidates for breeding of pathogen tolerance/resistance. Ortholog analysis showed that 141 out of the 210 putative oil palm R genes had homologs in banana and rice. These genes formed 37 clusters with 634 orthologous genes. Classification of the 141 oil palm R genes showed that the genes belong to the Kinase (7), CNL (95), MLO-like (8), RLK (3) and Others (28) categories. The CNL R genes formed eight clusters. Expression data for selected R genes also identified potential candidates for breeding of disease resistance traits. Furthermore, these findings can provide information about the species evolution as well as the identification of agronomically important genes in oil palm and other major crops. PMID:29672525
Joint mapping of genes and conditions via multidimensional unfolding analysis

Directory of Open Access Journals (Sweden)

Engelen Kristof

2007-06-01

Full Text Available Abstract Background Microarray compendia profile the expression of genes in a number of experimental conditions. Such data compendia are useful not only to group genes and conditions based on their similarity in overall expression over profiles but also to gain information on more subtle relations between genes and conditions. Getting a clear visual overview of all these patterns in a single easy-to-grasp representation is a useful preliminary analysis step: We propose to use for this purpose an advanced exploratory method, called multidimensional unfolding. Results We present a novel algorithm for multidimensional unfolding that overcomes both general problems and problems that are specific for the analysis of gene expression data sets. Applying the algorithm to two publicly available microarray compendia illustrates its power as a tool for exploratory data analysis: The unfolding analysis of a first data set resulted in a two-dimensional representation which clearly reveals temporal regulation patterns for the genes and a meaningful structure for the time points, while the analysis of a second data set showed the algorithm's ability to go beyond a mere identification of those genes that discriminate between different patient or tissue types. Conclusion Multidimensional unfolding offers a useful tool for preliminary explorations of microarray data: By relying on an easy-to-grasp low-dimensional geometric framework, relations among genes, among conditions and between genes and conditions are simultaneously represented in an accessible way which may reveal interesting patterns in the data. An additional advantage of the method is that it can be applied to the raw data without necessitating the choice of suitable genewise transformations of the data.
A human repair gene ERCC5 is involved in group G xeroderma pigmentosum

International Nuclear Information System (INIS)

Shiomi, Tadahiro

1994-01-01

In E. coli, ultraviolet-induced DNA damage is removed by the coordinated action of UVR A, B, C, and D proteins (1). In Saccharomyces cerevisiae, more than ten genes have been reported to be involved in excision repair (2). The nucleotide excision repair pathway has been extensively studied in these organisms. To facilitate studying nucleotide excision repair in mammalian cells, ultraviolet-sensitive rodent cell mutants have been isolated and classified into 11 complementation groups (9,10). The human nucleotide excision repair genes which complement the defects of the mutants have been designated as the ERCC (excision repair cross-complementing) genes; a number is added to refer to the particular rodent complementation group that is corrected by the gene. Recently, several human DNA repair genes have been cloned using rodent cell lines sensitive to ultraviolet. These include ERCC2 (3), ERCC3 (4), and ERCC6 (5), which correspond to the defective genes in the ultraviolet-sensitive human disorders xeroderma pigmentosum (XP) group D (6) and group B (4), and Cockayne's syndrome (CS) group B (7), respectively. The human excision repair gene ERCC5 was cloned after DNA-mediated gene transfer of human HeLa cell genomic DNA into the ultraviolet-sensitive mouse mutant XL216, a member of rodent complementation group 5 (11,12), and the gene was mapped on human chromosome 13q32.3-q33.1 by the replication R-banding fluorescence in situ hybridization method (13). The ERCC5 cDNA encodes a predicted 133 kDa nuclear protein that shares some homology with the product of the yeast DNA repair gene RAD 2. Transfection with mouse ERCC5 cDNA restored normal levels of ultraviolet-resistance to XL216 cells. Microinjection of ERCC5 cDNA specifically restored the defect of XP group G cells (XP-G) as measured by unscheduled DNA synthesis (UDS), and XP-G cells stably transformed with ERCC5 cDNA showed nearly normal ultraviolet resistance. (J.P.N.)
Function analysis of unknown genes

DEFF Research Database (Denmark)

Rogowska-Wrzesinska, A.

2002-01-01

This thesis entitled "Function analysis of unknown genes" presents the use of proteome analysis for the characterisation of yeast (Saccharomyces cerevisiae) genes and their products (proteins especially those of unknown function). This study illustrates that proteome analysis can be used...... to describe different aspects of molecular biology of the cell, to study changes that occur in the cell due to overexpression or deletion of a gene and to identify various protein modifications. The biological questions and the results of the described studies show the diversity of the information that can...... genes and proteins. It reports the first global proteome database collecting 36 yeast single gene deletion mutants and selecting over 650 differences between analysed mutants and the wild type strain. The obtained results show that two-dimensional gel electrophoresis and mass spectrometry based proteome...
Molecular cloning of a mouse DNA repair gene that complements the defect of group-A xeroderma pigmentosum

International Nuclear Information System (INIS)

Tanaka, K.; Satokata, I.; Ogita, Z.; Uchida, T.; Okada, Y.

1989-01-01

For isolation of the gene responsible for xeroderma pigmentosum (XP) complementation group A, plasmid pSV2gpt and genomic DNA from a mouse embryo were cotransfected into XP2OSSV cells, a group-A XP cell line. Two primary UV-resistant XP transfectants were isolated from about 1.6 × 10^5 pSV2gpt-transformed XP colonies. pSV2gpt and genomic DNA from the primary transfectants were again cotransfected into XP2OSSV cells and a secondary UV-resistant XP transfectant was obtained by screening about 4.8 × 10^5 pSV2gpt-transformed XP colonies. The secondary transfectant retained fewer mouse repetitive sequences. A mouse gene that complements the defect of XP2OSSV cells was cloned into an EMBL3 vector from the genome of a secondary transfectant. Transfections of the cloned DNA also conferred UV resistance on another group-A XP cell line but not on XP cell lines of group C, D, F, or G. Northern blot analysis of poly(A)+ RNA with a subfragment of the cloned mouse DNA repair gene as the probe revealed that an approximately 1.0-kilobase mRNA was transcribed in the donor mouse embryo and secondary transfectant, and approximately 1.0- and approximately 1.3-kilobase mRNAs were transcribed in normal human cells, but none of these mRNAs was detected in three strains of group-A XP cells. These results suggest that the cloned DNA repair gene is specific for group-A XP and may be the mouse homologue of the group-A XP human gene.
Genome-wide identification of WRKY family genes in peach and analysis of WRKY expression during bud dormancy.

Science.gov (United States)

Chen, Min; Tan, Qiuping; Sun, Mingyue; Li, Dongmei; Fu, Xiling; Chen, Xiude; Xiao, Wei; Li, Ling; Gao, Dongsheng

2016-06-01

Bud dormancy in deciduous fruit trees is an important adaptive mechanism for their survival in cold climates. The WRKY genes participate in several developmental and physiological processes, including dormancy. However, the dormancy mechanisms of WRKY genes have not been studied in detail. We conducted a genome-wide analysis and identified 58 WRKY genes in peach. These putative genes were located on all eight chromosomes. In bioinformatics analyses, we compared the sequences of WRKY genes from peach, rice, and Arabidopsis. In a cluster analysis, the gene sequences formed three groups, of which group II was further divided into five subgroups. Gene structure was highly conserved within each group, especially in groups IId and III. Gene expression analyses by qRT-PCR showed that WRKY genes showed different expression patterns in peach buds during dormancy. The mean expression levels of six WRKY genes (Prupe.6G286000, Prupe.1G393000, Prupe.1G114800, Prupe.1G071400, Prupe.2G185100, and Prupe.2G307400) increased during endodormancy and decreased during ecodormancy, indicating that these six WRKY genes may play a role in dormancy in a perennial fruit tree. This information will be useful for selecting fruit trees with desirable dormancy characteristics or for manipulating dormancy in genetic engineering programs.
The identification of functional motifs in temporal gene expression analysis

Directory of Open Access Journals (Sweden)

Michael G. Surette

2005-01-01

Full Text Available The identification of transcription factor binding sites is essential to the understanding of the regulation of gene expression and the reconstruction of genetic regulatory networks. The in silico identification of cis-regulatory motifs is challenging due to sequence variability and lack of sufficient data to generate consensus motifs that are of quantitative or even qualitative predictive value. To determine functional motifs in gene expression, we propose a strategy to adopt false discovery rate (FDR) and estimate motif effects to evaluate combinatorial analysis of motif candidates and temporal gene expression data. The method decreases the number of predicted motifs, which can then be confirmed by genetic analysis. To assess the method we used simulated motif/expression data to evaluate parameters. We applied this approach to experimental data for a group of iron responsive genes in Salmonella typhimurium 14028S. The method identified known and potentially new ferric-uptake regulator (Fur) binding sites. In addition, we identified uncharacterized functional motif candidates that correlated with specific patterns of expression. A SAS code for the simulation and analysis of gene expression data is available from the first author upon request.
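The abstract does not say which FDR procedure was used, and the authors' code is in SAS. As a generic illustration only, the widely used Benjamini-Hochberg step-up procedure can be sketched in Python; the p-values below are invented and stand in for per-motif tests.

```python
def benjamini_hochberg(pvalues, alpha=0.05):
    """Return a list of booleans: True where the hypothesis is rejected at FDR level alpha."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    # Find the largest rank k such that p_(k) <= (k / m) * alpha.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= rank / m * alpha:
            cutoff_rank = rank
    # Reject every hypothesis ranked at or below that cutoff.
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= cutoff_rank:
            rejected[idx] = True
    return rejected

# Invented p-values for four hypothetical motif candidates.
motif_pvalues = [0.001, 0.04, 0.03, 0.5]
print(benjamini_hochberg(motif_pvalues))
```

Here only the first candidate survives, although three of the four raw p-values are below 0.05, which shows how FDR control trims the list of predicted motifs.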
Integrative analysis of genome-wide gene copy number changes and gene expression in non-small cell lung cancer.

Directory of Open Access Journals (Sweden)

Verena Jabs

Full Text Available Non-small cell lung cancer (NSCLC) represents a genomically unstable cancer type with extensive copy number aberrations. The relationship of gene copy number alterations and subsequent mRNA levels has only fragmentarily been described. The aim of this study was to conduct a genome-wide analysis of gene copy number gains and corresponding gene expression levels in a clinically well annotated NSCLC patient cohort (n = 190) and their association with survival. While more than half of all analyzed gene copy number-gene expression pairs showed statistically significant correlations (10,296 of 18,756 genes), high correlations, with a correlation coefficient >0.7, were obtained only in a subset of 301 genes (1.6%), including KRAS, EGFR and MDM2. Higher correlation coefficients were associated with higher copy number and expression levels. Strong correlations were frequently based on few tumors with high copy number gains and correspondingly increased mRNA expression. Among the highly correlating genes, GO groups associated with posttranslational protein modifications were particularly frequent, including ubiquitination and neddylation. In a meta-analysis including 1,779 patients we found that survival associated genes were overrepresented among highly correlating genes (61 of the 301 highly correlating genes, FDR adjusted p<0.05). Among them are the chaperone CCT2, the core complex protein NUP107 and the ubiquitination and neddylation associated protein CAND1. In conclusion, in a comprehensive analysis we described a distinct set of highly correlating genes. These genes were found to be overrepresented among survival-associated genes based on gene expression in a large collection of publicly available datasets.
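The per-gene screen described above (correlate copy number with expression across tumors, then keep genes with a coefficient above 0.7) can be sketched as follows. This is a minimal illustration that assumes a Pearson coefficient, which the abstract does not specify. The gene names and values are hypothetical toy data, and only the counts in the final check come from the abstract.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical copy number and expression values across five tumors.
copy_number = {"GENE_A": [2, 2, 3, 6, 8], "GENE_B": [2, 3, 2, 3, 2]}
expression = {"GENE_A": [1.0, 1.2, 1.9, 4.1, 5.5],
              "GENE_B": [3.0, 1.0, 2.5, 2.0, 3.5]}

THRESHOLD = 0.7  # cut-off for "high correlation" quoted in the abstract
highly_correlating = [gene for gene in copy_number
                      if pearson(copy_number[gene], expression[gene]) > THRESHOLD]
print(highly_correlating)

# Consistency check on the reported proportion: 301 of 18,756 genes.
share_percent = round(100 * 301 / 18756, 1)
print(share_percent)
```

In the toy data, GENE_A is driven largely by the two tumors with high copy number gains, mirroring the abstract's remark that strong correlations were frequently based on a few such tumors.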
Transcriptome analysis reveals key differentially expressed genes involved in wheat grain development

Directory of Open Access Journals (Sweden)

Yonglong Yu

2016-04-01

Full Text Available Wheat seed development is an important physiological process of seed maturation and directly affects wheat yield and quality. In this study, we performed dynamic transcriptome microarray analysis of an elite Chinese bread wheat cultivar (Jimai 20) during grain development using the GeneChip Wheat Genome Array. Grain morphology and scanning electron microscope observations showed that the period of 11–15 days post-anthesis (DPA) was a key stage for the synthesis and accumulation of seed starch. Genome-wide transcriptional profiling and significance analysis of microarrays revealed that the period from 11 to 15 DPA was more important than the 15–20 DPA stage for the synthesis and accumulation of nutritive reserves. Series test of cluster analysis of differential genes revealed five statistically significant gene expression profiles. Gene ontology annotation and enrichment analysis gave further information about differentially expressed genes, and MapMan analysis revealed expression changes within functional groups during seed development. Metabolic pathway network analysis showed that major and minor metabolic pathways regulate one another to ensure regular seed development and nutritive reserve accumulation. We performed gene co-expression network analysis to identify genes that play vital roles in seed development and identified several key genes involved in important metabolic pathways. The transcriptional expression of eight key genes involved in starch and protein synthesis and stress defense was further validated by qRT-PCR. Our results provide new insight into the molecular mechanisms of wheat seed development and the determinants of yield and quality.
Diagnostic value of immunoglobulin κ light chain gene rearrangement analysis in B-cell lymphomas.

Science.gov (United States)

Kokovic, Ira; Jezersek Novakovic, Barbara; Novakovic, Srdjan

2015-03-01

Analysis of the immunoglobulin κ light chain (IGK) gene is an alternative method for B-cell clonality assessment in the diagnosis of mature B-cell proliferations in which the detection of clonal immunoglobulin heavy chain (IGH) gene rearrangements fails. The aim of the present study was to evaluate the added value of standardized BIOMED-2 assay for the detection of clonal IGK gene rearrangements in the diagnostic setting of suspected B-cell lymphomas. With this purpose, 92 specimens from 80 patients with the final diagnosis of mature B-cell lymphoma (37 specimens), mature T-cell lymphoma (26 specimens) and reactive lymphoid proliferation (29 specimens) were analyzed for B-cell clonality. B-cell clonality analysis was performed using the BIOMED-2 IGH and IGK gene clonality assays. The determined sensitivity of the IGK assay was 67.6%, while the determined sensitivity of the IGH assay was 75.7%. The sensitivity of combined IGH+IGK assay was 81.1%. The determined specificity of the IGK assay was 96.2% in the group of T-cell lymphomas and 96.6% in the group of reactive lesions. The determined specificity of the IGH assay was 84.6% in the group of lymphomas and 86.2% in the group of reactive lesions. The comparison of GeneScan (GS) and heteroduplex pretreatment-polyacrylamide gel electrophoresis (HD-PAGE) methods for the analysis of IGK gene rearrangements showed a higher efficacy of GS analysis in a series of 27 B-cell lymphomas analyzed by both methods. In the present study, we demonstrated that by applying the combined IGH+IGK clonality assay the overall detection rate of B-cell clonality was increased by 5.4%. Thus, we confirmed the added value of the standardized BIOMED-2 IGK assay for assessment of B-cell clonality in suspected B-cell lymphomas with inconclusive clinical and cyto/histological diagnosis.
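The percentages reported above can be cross-checked with simple arithmetic. In the sketch below, only the group sizes (37, 26 and 29 specimens) and the percentages come from the abstract. The positive and negative counts are back-calculated assumptions chosen because they reproduce the reported figures after rounding; they are not numbers stated by the authors.

```python
def percent(numerator, denominator):
    """Proportion expressed as a percentage, rounded to one decimal place."""
    return round(100 * numerator / denominator, 1)


# Group sizes reported in the abstract.
B_CELL, T_CELL, REACTIVE = 37, 26, 29

# Back-calculated (assumed) numbers of clonal results among B-cell lymphomas.
igk_pos, igh_pos, combined_pos = 25, 28, 30

sens_igk = percent(igk_pos, B_CELL)            # sensitivity = TP / (TP + FN)
sens_igh = percent(igh_pos, B_CELL)
sens_combined = percent(combined_pos, B_CELL)
gain = round(sens_combined - sens_igh, 1)      # added value of the IGK assay

# Back-calculated (assumed) numbers of non-clonal results in the other groups.
spec_igk_t = percent(25, T_CELL)               # specificity = TN / (TN + FP)
spec_igk_reactive = percent(28, REACTIVE)
spec_igh_t = percent(22, T_CELL)
spec_igh_reactive = percent(25, REACTIVE)

print(sens_igk, sens_igh, sens_combined, gain)
print(spec_igk_t, spec_igk_reactive, spec_igh_t, spec_igh_reactive)
```

Under these assumed counts, the 5.4% increase in detection rate corresponds to two additional specimens out of 37 that were clonal by IGK but not by IGH.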
QTL global meta-analysis: are trait determining genes clustered?

Directory of Open Access Journals (Sweden)

Adelson David L

2009-04-01

Full Text Available Background: A key open question in biology is whether genes are physically clustered with respect to their known functions or phenotypic effects. This is of particular interest for Quantitative Trait Loci (QTL), where a QTL region could contain a number of genes that contribute to the trait being measured. Results: We observed a significant increase in gene density within QTL regions compared to non-QTL regions and/or the entire bovine genome. By grouping QTL from the Bovine QTL Viewer database into 8 categories of non-redundant regions, we have been able to analyze gene density and gene function distribution, based on Gene Ontology (GO), with relation to their location within QTL regions, outside of QTL regions and across the entire bovine genome. We identified a number of GO terms that were significantly over-represented within particular QTL categories. Furthermore, select GO terms expected to be associated with the QTL category based on common biological knowledge have also proved to be significantly over-represented in QTL regions. Conclusion: Our analysis provides evidence of over-represented GO terms in QTL regions. This increased GO term density indicates possible clustering of gene functions within QTL regions of the bovine genome. Genes with similar functions may be grouped in specific locales and could be contributing to QTL traits. Moreover, we have identified over-represented GO terminology that, from a biological standpoint, makes sense with respect to QTL category type.
A gene pathway analysis highlights the role of cellular adhesion molecules in multiple sclerosis susceptibility

DEFF Research Database (Denmark)

Damotte, V; Guillot-Noel, L; Patsopoulos, N A

2014-01-01

adhesion molecule (CAMs) biological pathway using Cytoscape software. This network is a strong candidate, as it is involved in the crossing of the blood-brain barrier by the T cells, an early event in MS pathophysiology, and is used as an efficient therapeutic target. We drew up a list of 76 genes...... in interaction with other genes as a group. Pathway analysis is an alternative way to highlight such group of genes. Using SNP association P-values from eight multiple sclerosis (MS) GWAS data sets, we performed a candidate pathway analysis for MS susceptibility by considering genes interacting in the cell...... belonging to the CAM network. We highlighted 64 networks enriched with CAM genes with low P-values. Filtering by a percentage of CAM genes up to 50% and rejecting enriched signals mainly driven by transcription factors, we highlighted five networks associated with MS susceptibility. One of them, constituted...
Gene frequencies of ABO and rhesus blood groups and ...

African Journals Online (AJOL)

The distribution and gene frequencies of ABO and rhesus (Rh) blood groups and haemoglobin variants for samples of the Nigerian population at Ogbomoso was determined. Data consisting of records of blood groups and haemoglobin types of different ages ranging from infants to adults for a period of 4 to 6 years (1995 ...

Ranking metrics in gene set enrichment analysis: do they matter?

Science.gov (United States)

Zyla, Joanna; Marczyk, Michal; Weiner, January; Polanska, Joanna

2017-05-12

There exist many methods for describing the complex relation between changes of gene expression in molecular pathways or gene ontologies under different experimental conditions. Among them, Gene Set Enrichment Analysis seems to be one of the most commonly used (over 10,000 citations). An important parameter, which could affect the final result, is the choice of a metric for the ranking of genes. Applying a default ranking metric may lead to poor results. In this work 28 benchmark data sets were used to evaluate the sensitivity and false positive rate of gene set analysis for 16 different ranking metrics including new proposals. Furthermore, the robustness of the chosen methods to sample size was tested. Using k-means clustering algorithm a group of four metrics with the highest performance in terms of overall sensitivity, overall false positive rate and computational load was established i.e. absolute value of Moderated Welch Test statistic, Minimum Significant Difference, absolute value of Signal-To-Noise ratio and Baumgartner-Weiss-Schindler test statistic. In case of false positive rate estimation, all selected ranking metrics were robust with respect to sample size. In case of sensitivity, the absolute value of Moderated Welch Test statistic and absolute value of Signal-To-Noise ratio gave stable results, while Baumgartner-Weiss-Schindler and Minimum Significant Difference showed better results for larger sample size. Finally, the Gene Set Enrichment Analysis method with all tested ranking metrics was parallelised and implemented in MATLAB, and is available at https://github.com/ZAEDPolSl/MrGSEA . Choosing a ranking metric in Gene Set Enrichment Analysis has critical impact on results of pathway enrichment analysis. The absolute value of Moderated Welch Test has the best overall sensitivity and Minimum Significant Difference has the best overall specificity of gene set analysis. When the number of non-normally distributed genes is high, using Baumgartner
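One of the ranking metrics named above, the absolute signal-to-noise ratio, can be sketched in a few lines. The version below uses the usual definition, the difference of group means divided by the sum of group standard deviations, and omits the minimum-variance adjustments that some implementations add. The gene names and expression values are hypothetical and this is not the MrGSEA code.

```python
import statistics


def signal_to_noise(group_a, group_b):
    """Signal-to-noise ratio: (mean_a - mean_b) / (sd_a + sd_b)."""
    mean_diff = statistics.mean(group_a) - statistics.mean(group_b)
    return mean_diff / (statistics.stdev(group_a) + statistics.stdev(group_b))


# Hypothetical expression values per gene: (condition A, condition B).
expression = {
    "g1": ([5.0, 6.0, 7.0], [1.0, 2.0, 3.0]),
    "g2": ([2.0, 3.0, 4.0], [2.5, 3.0, 3.5]),
    "g3": ([1.0, 2.0, 3.0], [7.0, 9.0, 11.0]),
}

# Rank genes by the absolute value of the metric, strongest first.
scores = {gene: abs(signal_to_noise(a, b)) for gene, (a, b) in expression.items()}
ranked = sorted(scores, key=scores.get, reverse=True)
print(ranked)
```

Taking the absolute value places strongly down-regulated genes (g3 here) at the top of the list together with strongly up-regulated ones, which is the difference between the absolute and the signed form of the metric.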
Gene Co-expression Analysis to Characterize Genes Related to Marbling Trait in Hanwoo (Korean) Cattle.

Science.gov (United States)

Lim, Dajeong; Lee, Seung-Hwan; Kim, Nam-Kuk; Cho, Yong-Min; Chai, Han-Ha; Seong, Hwan-Hoo; Kim, Heebal

2013-01-01

Marbling (intramuscular fat) is an important trait that affects meat quality and is a causal factor determining the price of beef in the Korean beef market. It is a complex trait and has many biological pathways related to muscle and fat. There is a need to identify functional modules or genes related to marbling traits and investigate their relationships through a weighted gene co-expression network analysis based on the system level. Therefore, we investigated the co-expression relationships of genes related to the 'marbling score' trait and systematically analyzed the network topology in Hanwoo (Korean cattle). As a result, we determined 3 modules (gene groups) that showed statistically significant results for marbling score. In particular, one module (denoted as red) has a statistically significant result for marbling score (p = 0.008), intramuscular fat (p = 0.02) and water capacity (p = 0.006). From functional enrichment and relationship analysis of the red module, the pathway hub genes (IL6, CHRNE, RB1, INHBA and NPPA) have a direct interaction relationship and share the biological functions related to fat or muscle, such as adipogenesis or muscle growth. This is the first gene network study with m. longissimus in Hanwoo to observe co-expression patterns in divergent marbling phenotypes. It may provide insights into the functional mechanisms of the marbling trait.
Global Analysis of WRKY Genes and Their Response to Dehydration and Salt Stress in Soybean.

Science.gov (United States)

Song, Hui; Wang, Pengfei; Hou, Lei; Zhao, Shuzhen; Zhao, Chuanzhi; Xia, Han; Li, Pengcheng; Zhang, Ye; Bian, Xiaotong; Wang, Xingjun

2016-01-01

WRKY proteins are plant-specific transcription factors involved in various developmental and physiological processes, especially in biotic and abiotic stress resistance. Although previous studies suggested that WRKY proteins in soybean (Glycine max var. Williams 82) are involved in both abiotic and biotic stress responses, the global information on WRKY proteins in the latest version of the soybean genome (Wm82.a2v1) and their response to dehydration and salt stress have not been reported. In this study, we identified 176 GmWRKY proteins from the soybean Wm82.a2v1 genome. These proteins could be classified into three groups, namely group I (32 proteins), group II (120 proteins), and group III (24 proteins). Our results showed that most GmWRKY genes were located on chromosome 6, while chromosomes 11, 12, and 20 contained the fewest members of this gene family. More GmWRKY genes were distributed at the ends of chromosomes than in other regions. The cis-acting element analysis suggested that GmWRKY genes were transcriptionally regulated upon dehydration and salt stress. RNA-seq data analysis indicated that three GmWRKY genes responded negatively to dehydration, and 12 genes responded positively to salt stress at 1, 6, and 12 h, respectively. We confirmed by qRT-PCR that the expression of the GmWRKY47 and GmWRKY58 genes was decreased upon dehydration, and the expression of the GmWRKY92, 144 and 165 genes was increased under salt treatment.
Microarray gene expression profiling and analysis in renal cell carcinoma

Directory of Open Access Journals (Sweden)

Sadhukhan Provash

2004-06-01

Full Text Available Background: Renal cell carcinoma (RCC) is the most common cancer in adult kidney. The accuracy of current diagnosis and prognosis of the disease and the effectiveness of the treatment for the disease are limited by the poor understanding of the disease at the molecular level. To better understand the genetics and biology of RCC, we profiled the expression of 7,129 genes in both clear cell RCC tissue and cell lines using oligonucleotide arrays. Methods: Total RNAs isolated from renal cell tumors, adjacent normal tissue and metastatic RCC cell lines were hybridized to Affymetrix HuFL oligonucleotide arrays. Genes were categorized into different functional groups based on the description of the Gene Ontology Consortium and analyzed based on the gene expression levels. Gene expression profiles of the tissue and cell line samples were visualized and classified by singular value decomposition. Reverse transcription polymerase chain reaction was performed to confirm the expression alterations of selected genes in RCC. Results: Selected genes were annotated based on biological processes and clustered into functional groups. The expression levels of genes in each group were also analyzed. Seventy-four commonly differentially expressed genes with more than five-fold changes in RCC tissues were identified. The expression alterations of selected genes from these seventy-four genes were further verified using reverse transcription polymerase chain reaction (RT-PCR). Detailed comparison of gene expression patterns in RCC tissue and RCC cell lines shows significant differences between the two types of samples, but many important expression patterns were preserved. Conclusions: This is one of the initial studies that examine the functional ontology of a large number of genes in RCC. Extensive annotation, clustering and analysis of a large number of genes based on the gene functional ontology revealed many interesting gene expression patterns in RCC. Most
Association of duffy blood group gene polymorphisms with IL8 gene in chronic periodontitis.

Science.gov (United States)

Sippert, Emília Ângela; de Oliveira e Silva, Cléverson; Visentainer, Jeane Eliete Laguila; Sell, Ana Maria

2013-01-01

The antigens of the Duffy blood group system (DARC) act as a receptor for the interleukin IL-8. IL-8 plays an important role in the pathogenesis of chronic periodontitis due to its chemotactic properties on neutrophils. The aim of this study was to investigate a possible association of Duffy blood group gene polymorphisms with the -353T>A, -845T>C and -738T>A SNPs of the IL8 gene in chronic periodontitis. One hundred and twenty-four individuals with chronic periodontitis and 187 controls were enrolled. DNA was extracted using the salting-out method. The Duffy genotypes and IL8 gene promoter polymorphisms were investigated by PCR-RFLP. Statistical analyses were conducted using the Chi square test with Yates correction or Fisher's Exact Test, and the possibility of associations were evaluated by odds ratio with a 95% confidence interval. When analyzed separately, for the Duffy blood group system, differences in the genotype and allele frequencies were not observed between all the groups analyzed; and, in nonsmokers, the -845C allele (3.6% vs. 0.4%), -845TC genotype (7.3% vs. 0.7%) and the CTA haplotype (3.6% vs. 0.4%) were positively associated with chronic periodontitis. For the first time to our knowledge, the polymorphisms of erythroid DARC plus IL8 -353T>A SNPs were associated with chronic periodontitis in Brazilian individuals. In Afro-Brazilians patients, the FY*02N.01 with IL8 -353A SNP was associated with protection to chronic periodontitis.
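The abstract above evaluates associations by odds ratio with a 95% confidence interval but does not report the underlying counts. The sketch below shows the calculation on a purely hypothetical 2x2 table, using the standard log (Woolf) method for the interval. The abstract does not state which interval method the authors used, so that choice is an assumption.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and Woolf (log-method) confidence interval for a 2x2 table.

    a, b: cases with / without the allele
    c, d: controls with / without the allele
    z:    normal quantile (1.96 for a 95% interval)
    """
    odds_ratio = (a * d) / (b * c)
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper


# Hypothetical allele counts, NOT taken from the study.
or_value, ci_low, ci_high = odds_ratio_ci(a=12, b=88, c=5, d=95)
print(round(or_value, 2), round(ci_low, 2), round(ci_high, 2))
```

An interval that includes 1, as this toy table produces, would not support an association. With very small cell counts, an exact method such as Fisher's test, which the abstract also mentions, is usually preferred.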
Leaving out control groups: an internal contrast analysis of gene expression profiles in atrial fibrillation patients--a systems biology approach to clinical categorization.

Science.gov (United States)

Vanhoutte, Kurt; de Asmundis, Carlo; Francesconi, Anna; Figysl, Jurgen; Steurs, Griet; Boussy, Tim; Roos, Markus; Mueller, Andreas; Massimo, Lucio; Paparella, Gaetano; Van Caelenberg, Kristien; Chierchia, Gian Battista; Sarkozy, Andrea; Terradellas, Pedro Brugada Y; Zizi, Martin

2009-01-01

Atrial fibrillation (AF) is a frequent chronic dysrhythmia with an incidence that increases with age (>40). Because of its medical and socio-economic impacts it is expected to become an increasing burden on most health care systems. AF is a multi-factorial disease for which the identification of subtypes is warranted. Novel approaches based on the broad concepts of systems biology may overcome the blurred notion of normal and pathological phenotype, which is inherent to high throughput molecular arrays analysis. Here we apply an internal contrast algorithm on AF patient data with an analytical focus on potential entry pathways into the disease. We used an RMA (Robust Multichip Average) normalized Affymetrix micro-array data set from 10 AF patients (geo_accession #GSE2240). Four series of probes were selected based on physiopathogenic links with AF entryways: apoptosis (remodeling), MAP kinase (cell remodeling), OXPHOS (ability to sustain hemodynamic workload) and glycolysis (ischemia). Annotated probe lists were polled with Bioconductor packages in R (version 2.7.1). Genetic profile contrasts were analysed with hierarchical clustering and principal component analysis. The analysis revealed distinct patient groups for all probe sets. A substantial part (54% to 67%) of the variance is explained by the first 2 principal components. Genes in PC1/2 with high discriminatory value were selected and analyzed in detail. We aim for reliable molecular stratification of AF. We show that stratification is possible based on physiologically relevant gene sets. Genes with high contrast value are likely to give pathophysiological insight into permanent AF subtypes.
Analysis of multiplex gene expression maps obtained by voxelation.

Science.gov (United States)

An, Li; Xie, Hongbo; Chin, Mark H; Obradovic, Zoran; Smith, Desmond J; Megalooikonomou, Vasileios

2009-04-29

Gene expression signatures in the mammalian brain hold the key to understanding neural development and neurological disease. Researchers have previously used voxelation in combination with microarrays for acquisition of genome-wide atlases of expression patterns in the mouse brain. On the other hand, some work has been performed on studying gene functions, without taking into account the location information of a gene's expression in a mouse brain. In this paper, we present an approach for identifying the relation between gene expression maps obtained by voxelation and gene functions. To analyze the dataset, we chose typical genes as queries and aimed at discovering similar gene groups. Gene similarity was determined by using the wavelet features extracted from the left and right hemispheres averaged gene expression maps, and by the Euclidean distance between each pair of feature vectors. We also performed a multiple clustering approach on the gene expression maps, combined with hierarchical clustering. Among each group of similar genes and clusters, the gene function similarity was measured by calculating the average gene function distances in the gene ontology structure. By applying our methodology to find similar genes to certain target genes we were able to improve our understanding of gene expression patterns and gene functions. By applying the clustering analysis method, we obtained significant clusters, which have both very similar gene expression maps and very similar gene functions respectively to their corresponding gene ontologies. The cellular component ontology resulted in prominent clusters expressed in cortex and corpus callosum. The molecular function ontology gave prominent clusters in cortex, corpus callosum and hypothalamus. The biological process ontology resulted in clusters in cortex, hypothalamus and choroid plexus. Clusters from all three ontologies combined were most prominently expressed in cortex and corpus callosum. The experimental
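The query step described above (pick a gene, then find the genes whose feature vectors lie closest in Euclidean distance) can be sketched as follows. The feature vectors stand in for wavelet features of expression maps. Their values, the gene labels and the function names are hypothetical, and the wavelet extraction itself is not shown.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def most_similar(query, features, k=2):
    """Return the k genes whose feature vectors are closest to the query gene."""
    candidates = [gene for gene in features if gene != query]
    candidates.sort(key=lambda gene: euclidean(features[query], features[gene]))
    return candidates[:k]


# Hypothetical feature vectors, one per gene expression map.
features = {
    "geneA": [0.9, 0.1, 0.0],
    "geneB": [0.8, 0.2, 0.1],
    "geneC": [0.0, 0.9, 0.7],
    "geneD": [0.7, 0.0, 0.1],
}

neighbours = most_similar("geneA", features, k=2)
print(neighbours)
```

The functional similarity of the returned group could then be assessed separately, for example with the average gene ontology distance the abstract mentions.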
Genome-Wide Identification and Expression Analysis of the UGlcAE Gene Family in Tomato

Directory of Open Access Journals (Sweden)

Xing Ding

2018-05-01

Full Text Available The UGlcAE has the capability of interconverting UDP-d-galacturonic acid and UDP-d-glucuronic acid, and UDP-d-galacturonic acid is an activated precursor for the synthesis of pectins in plants. In this study, we identified nine UGlcAE protein-encoding genes in tomato. The nine UGlcAE genes were distributed on eight chromosomes in tomato, and the corresponding proteins contained one or two trans-membrane domains. The phylogenetic analysis showed that SlUGlcAE genes could be divided into seven groups, designated UGlcAE1 to UGlcAE6, of which the UGlcAE2 were classified into two groups. Expression profile analysis revealed that the SlUGlcAE genes display diverse expression patterns in various tomato tissues. Selective pressure analysis indicated that all of the amino acid sites of SlUGlcAE proteins are undergoing purifying selection. Fifteen stress-, hormone-, and development-related elements were identified in the upstream regions (0.5 kb) of these SlUGlcAE genes. Furthermore, we investigated the expression patterns of SlUGlcAE genes in response to three hormones (indole-3-acetic acid (IAA), gibberellin (GA), and salicylic acid (SA)). We detected firmness, pectin contents, and expression levels of UGlcAE family genes during the development of tomato fruit. Here, we systematically summarize the general characteristics of the SlUGlcAE genes in tomato, which could provide a basis for further functional studies of tomato UGlcAE genes.
Using Genes as Characters and a Parsimony Analysis to Explore the Phylogenetic Position of Turtles

Science.gov (United States)

Lu, Bin; Yang, Weizhao; Dai, Qiang; Fu, Jinzhong

2013-01-01

The phylogenetic position of turtles within the vertebrate tree of life remains controversial. Conflicting conclusions from different studies are likely a consequence of systematic error in the tree construction process, rather than random error from small amounts of data. Using genomic data, we evaluate the phylogenetic position of turtles with both conventional concatenated data analysis and a “genes as characters” approach. Two datasets were constructed, one with seven species (human, opossum, zebra finch, chicken, green anole, Chinese pond turtle, and western clawed frog) and 4584 orthologous genes, and the second with four additional species (soft-shelled turtle, Nile crocodile, royal python, and tuatara) but only 1638 genes. Our concatenated data analysis strongly supported turtle as the sister-group to archosaurs (the archosaur hypothesis), similar to several recent genomic data based studies using similar methods. When using genes as characters and gene trees as character-state trees with equal weighting for each gene, however, our parsimony analysis suggested that turtles are possibly sister-group to diapsids, archosaurs, or lepidosaurs. None of these resolutions were strongly supported by bootstraps. Furthermore, our incongruence analysis clearly demonstrated that there is a large amount of inconsistency among genes and most of the conflict relates to the placement of turtles. We conclude that the uncertain placement of turtles is a reflection of the true state of nature. Concatenated data analysis of large and heterogeneous datasets likely suffers from systematic error and over-estimates of confidence as a consequence of a large number of characters. Using genes as characters offers an alternative for phylogenomic analysis. It has potential to reduce systematic error, such as data heterogeneity and long-branch attraction, and it can also avoid problems associated with computation time and model selection. Finally, treating genes as characters
Analysis of Single-cell Gene Transcription by RNA Fluorescent In Situ Hybridization (FISH)

DEFF Research Database (Denmark)

Ronander, Elena; Bengtsson, Dominique C; Joergensen, Louise

2012-01-01

Adhesion of Plasmodium falciparum infected erythrocytes (IE) to human endothelial receptors during malaria infections is mediated by expression of PfEMP1 protein variants encoded by the var genes. The haploid P. falciparum genome harbors approximately 60 different var genes of which only one has...... been believed to be transcribed per cell at a time during the blood stage of the infection. How such mutually exclusive regulation of var gene transcription is achieved is unclear, as is the identification of individual var genes or sub-groups of var genes associated with different receptors...... fluorescent in situ hybridization (FISH) analysis of var gene transcription by the parasite in individual nuclei of P. falciparum IE(1). Here, we present a detailed protocol for carrying out the RNA-FISH methodology for analysis of var gene transcription in single-nuclei of P. falciparum infected human...
Using OWL reasoning to support the generation of novel gene sets for enrichment analysis.

Science.gov (United States)

Osumi-Sutherland, David J; Ponta, Enrico; Courtot, Melanie; Parkinson, Helen; Badi, Laura

2018-02-14

The Gene Ontology (GO) consists of over 40,000 terms for biological processes, cell components and gene product activities linked into a graph structure by over 90,000 relationships. It has been used to annotate the functions and cellular locations of several million gene products. The graph structure is used by a variety of tools to group annotated genes into sets whose products share function or location. These gene sets are widely used to interpret the results of genomics experiments by assessing which sets are significantly over- or under-represented in results lists. F Hoffmann-La Roche Ltd. has developed a bespoke, manually maintained controlled vocabulary (RCV) for use in over-representation analysis. Many terms in this vocabulary group GO terms in novel ways that cannot easily be derived using the graph structure of the GO. For example, some RCV terms group GO terms by the cell, chemical or tissue type they refer to. Recent improvements in the content and formal structure of the GO make it possible to use logical queries in Web Ontology Language (OWL) to automatically map these cross-cutting classifications to sets of GO terms. We used this approach to automate mapping between RCV and GO, largely replacing the increasingly unsustainable manual mapping process. We then tested the utility of the resulting groupings for over-representation analysis. We successfully mapped 85% of RCV terms to logical OWL definitions and showed that these could be used to recapitulate and extend manual mappings between RCV terms and the sets of GO terms subsumed by them. We also show that gene sets derived from the resulting GO term sets can be used to detect the signatures of cell and tissue types in whole genome expression data. The rich formal structure of the GO makes it possible to use reasoning to dynamically generate novel, biologically relevant groupings of GO terms. GO term groupings generated with this approach can be used in over-representation analysis to detect
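The over-representation analysis mentioned in this abstract is commonly implemented as a one-sided hypergeometric test. The following is a minimal sketch under that assumption, not the authors' pipeline; the function name and the example counts are hypothetical and chosen only for illustration.

```python
from math import comb


def hypergeom_upper_tail(population, in_set, drawn, hits):
    """Return P(X >= hits) for X ~ Hypergeometric(population, in_set, drawn).

    population: number of annotated genes in the background (assumed input)
    in_set:     number of background genes belonging to the gene set
    drawn:      size of the results list (e.g. differentially expressed genes)
    hits:       number of results-list genes that belong to the gene set
    """
    upper = min(in_set, drawn)
    total = comb(population, drawn)
    # Sum the probability mass for every overlap at least as large as observed.
    tail = sum(
        comb(in_set, i) * comb(population - in_set, drawn - i)
        for i in range(hits, upper + 1)
    )
    return tail / total


# Hypothetical numbers: 20,000 background genes, a 200-gene set,
# a 500-gene results list, 15 of which fall in the set.
p_value = hypergeom_upper_tail(20000, 200, 500, 15)
print(f"over-representation p-value: {p_value:.3g}")
```

A small p-value suggests the set is over-represented in the results list; in practice the p-values for many gene sets would also need multiple-testing correction, which this sketch omits.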
Transcriptome profiling and digital gene expression analysis of genes associated with salinity resistance in peanut

Directory of Open Access Journals (Sweden)

Jiongming Sui

2018-03-01

Background: Soil salinity can significantly reduce crop production, but the molecular mechanism of salinity tolerance in peanut is poorly understood. A mutant (S1) with higher salinity resistance than its mutagenic parent HY22 (S3) was obtained. Transcriptome sequencing and digital gene expression (DGE) analysis were performed with leaves of S1 and S3 before and after plants were irrigated with 250 mM NaCl. Results: A total of 107,725 comprehensive transcripts were assembled into 67,738 unigenes using TIGR Gene Indices clustering tools (TGICL). All unigenes were searched against the euKaryotic Ortholog Groups (KOG), gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases, and these unigenes were assigned to 26 functional KOG categories, 56 GO terms and 32 KEGG groups, respectively. In total, 112 differentially expressed genes (DEGs) between S1 and S3 after salinity stress were screened; among them, 86 were responsive to salinity stress in S1 and/or S3. These 86 DEGs included genes that encoded the following kinds of proteins that are known to be involved in resistance to salinity stress: late embryogenesis abundant proteins (LEAs), major intrinsic proteins (MIPs) or aquaporins, metallothioneins (MTs), lipid transfer protein (LTP), calcineurin B-like protein-interacting protein kinases (CIPKs), 9-cis-epoxycarotenoid dioxygenase (NCED) and oleosins, etc. Of these 86 DEGs, 18 could not be matched with known proteins. Conclusion: The results from this study will be useful for further research on the mechanism of salinity resistance and will provide a useful gene resource for the variety breeding of salinity resistance in peanut. Keywords: Digital gene expression, Gene, Mutant, NaCl, Peanut (Arachis hypogaea L.), RNA-seq, Salinity stress, Salinity tolerance, Soil salinity, Transcripts, Unigenes
Comparison between smaller ruptured intracranial aneurysm and larger un-ruptured intracranial aneurysm: gene expression profile analysis.

Science.gov (United States)

Li, Hao; Li, Haowen; Yue, Haiyan; Wang, Wen; Yu, Lanbing; Wang, Shuo; Cao, Yong; Zhao, Jizong

2017-07-01

As it grows in size, an intracranial aneurysm (IA) is prone to rupture. In this study, we compared two extreme groups of IAs, ruptured IAs (RIAs) smaller than 10 mm and un-ruptured IAs (UIAs) larger than 10 mm, to investigate the genes involved in the facilitation and prevention of IA rupture. The aneurismal walls of 6 smaller saccular RIAs (size smaller than 10 mm), 6 larger saccular UIAs (size larger than 10 mm) and 12 paired control arteries were obtained during surgery. The transcription profiles of these samples were studied by microarray analysis. RT-qPCR was used to confirm the expression of the genes of interest. In addition, functional group analysis of the differentially expressed genes was performed. Between smaller RIAs and larger UIAs, 101 genes and 179 genes were significantly over-expressed, respectively. In addition, functional group analysis demonstrated that the up-regulated genes in smaller RIAs mainly participated in the cellular response to metal ions and inorganic substances, while most of the up-regulated genes in larger UIAs were involved in inflammation and extracellular matrix (ECM) organization. Moreover, compared with control arteries, inflammation was up-regulated and muscle-related biological processes were down-regulated in both smaller RIAs and larger UIAs. The genes involved in the cellular response to metal ions and inorganic substances may facilitate the rupture of IAs. In addition, the healing process, involving inflammation and ECM organization, may protect IAs from rupture.
Discovery of new candidate genes for rheumatoid arthritis through integration of genetic association data with expression pathway analysis.

Science.gov (United States)

Shchetynsky, Klementy; Diaz-Gallo, Lina-Marcella; Folkersen, Lasse; Hensvold, Aase Haj; Catrina, Anca Irinel; Berg, Louise; Klareskog, Lars; Padyukov, Leonid

2017-02-02

Here we integrate verified signals from previous genetic association studies with gene expression and pathway analysis for discovery of new candidate genes and signaling networks relevant for rheumatoid arthritis (RA). RNA-sequencing (RNA-seq)-based expression analysis of 377 genes from previously verified RA-associated loci was performed in blood cells from 5 newly diagnosed, non-treated patients with RA, 7 patients with treated RA and 12 healthy controls. Differentially expressed genes sharing a similar expression pattern in treated and untreated RA sub-groups were selected for pathway analysis. A set of "connector" genes derived from pathway analysis was tested for differential expression in the initial discovery cohort and validated in blood cells from 73 patients with RA and in 35 healthy controls. There were 11 qualifying genes selected for pathway analysis and these were grouped into two evidence-based functional networks, containing 29 and 27 additional connector molecules. The expression of genes corresponding to connector molecules was then tested in the initial RNA-seq data. Differences in the expression of ERBB2, TP53 and THOP1 were similar in both treated and non-treated patients with RA and an additional nine genes were differentially expressed in at least one group of patients compared to healthy controls. The ERBB2, TP53 and THOP1 expression profile was successfully replicated in RNA-seq data from peripheral blood mononuclear cells from healthy controls and non-treated patients with RA, in an independent collection of samples. Integration of RNA-seq data with findings from association studies, and consequent pathway analysis, implicate new candidate genes, ERBB2, TP53 and THOP1, in the pathogenesis of RA.
Loss of lager specific genes and subtelomeric regions define two different Saccharomyces cerevisiae lineages for Saccharomyces pastorianus Group I and II strains.

Science.gov (United States)

Monerawela, Chandre; James, Tharappel C; Wolfe, Kenneth H; Bond, Ursula

2015-03-01

Lager yeasts, Saccharomyces pastorianus, are interspecies hybrids between S. cerevisiae and S. eubayanus and are classified into Group I and Group II clades. The genome of the Group II strain, Weihenstephan 34/70, contains eight so-called 'lager-specific' genes that are located in subtelomeric regions. We evaluated the origins of these genes through bioinformatic and PCR analyses of Saccharomyces genomes. We determined that four are of cerevisiae origin while four originate from S. eubayanus. The Group I yeasts contain all four S. eubayanus genes but individual strains contain only a subset of the cerevisiae genes. We identified S. cerevisiae strains that contain all four cerevisiae 'lager-specific' genes, and distinct patterns of loss of these genes in other strains. Analysis of the subtelomeric regions uncovered patterns of loss in different S. cerevisiae strains. We identify two classes of S. cerevisiae strains: ale yeasts (Foster O) and stout yeasts with patterns of 'lager-specific' genes and subtelomeric regions identical to Group I and II S. pastorianus yeasts, respectively. These findings lead us to propose that Group I and II S. pastorianus strains originate from separate hybridization events involving different S. cerevisiae lineages. Using the combined bioinformatic and PCR data, we describe a potential classification map for industrial yeasts. © FEMS 2015. All rights reserved. For permissions, please e-mail: journals.permission@oup.com.
Comparative analysis of chromatin landscape in regulatory regions of human housekeeping and tissue specific genes

Directory of Open Access Journals (Sweden)

Dasgupta Dipayan

2005-05-01

Background: Global regulatory mechanisms involving chromatin assembly and remodelling in the promoter regions of genes are implicated in eukaryotic transcription control, especially for genes subjected to spatial and temporal regulation. The potential to utilise global regulatory mechanisms for controlling gene expression might depend upon the architecture of the chromatin in and around the gene. In-silico analysis can yield important insights into this aspect, facilitating comparison of two or more classes of genes comprising a large number of genes within each group. Results: In the present study, we carried out a comparative analysis of chromatin characteristics in terms of the scaffold/matrix attachment regions, nucleosome formation potential and the occurrence of repetitive sequences, in the upstream regulatory regions of housekeeping and tissue specific genes. Our data show that putative scaffold/matrix attachment regions are more abundant and nucleosome formation potential is higher in the 5' regions of tissue specific genes as compared to the housekeeping genes. Conclusion: The differences in the chromatin features between the two groups of genes indicate the involvement of chromatin organisation in the control of gene expression. The presence of global regulatory mechanisms mediated through chromatin organisation can decrease the burden of invoking gene specific regulators for maintenance of the active/silenced state of gene expression. This could partially explain the lower number of genes estimated in the human genome.

A Strategy for Identifying Quantitative Trait Genes Using Gene Expression Analysis and Causal Analysis

Directory of Open Access Journals (Sweden)

Akira Ishikawa

2017-11-01

Large numbers of quantitative trait loci (QTL) affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs) for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.
A Strategy for Identifying Quantitative Trait Genes Using Gene Expression Analysis and Causal Analysis.

Science.gov (United States)

Ishikawa, Akira

2017-11-27

Large numbers of quantitative trait loci (QTL) affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs) for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.
Recent adaptive events in human brain revealed by meta-analysis of positively selected genes.

Directory of Open Access Journals (Sweden)

Yue Huang

BACKGROUND AND OBJECTIVES: Analysis of positively-selected genes can help us understand how humans evolved, especially the evolution of highly developed cognitive functions. However, previous works have reached conflicting conclusions regarding whether human neuronal genes are over-represented among genes under positive selection. METHODS AND RESULTS: We divided positively-selected genes into four groups according to the identification approaches, compiling a comprehensive list from 27 previous studies. We showed that genes that are highly expressed in the central nervous system are enriched in recent positive selection events in human history identified by intra-species genomic scan, especially in brain regions related to cognitive functions. This pattern holds when different datasets, parameters and analysis pipelines are used. Functional category enrichment analysis supported these findings, showing that synapse-related functions are enriched in genes under recent positive selection. In contrast, immune-related functions, for instance, are enriched in genes under ancient positive selection revealed by inter-species coding region comparison. We further demonstrated that most of these patterns still hold even after controlling for genomic characteristics that might bias genome-wide identification of positively-selected genes, including gene length, gene density, GC composition, and intensity of negative selection. CONCLUSION: Our rigorous analysis resolved previous conflicting conclusions and revealed recent adaptation of human brain functions.
Integrative sparse principal component analysis of gene expression data.

Science.gov (United States)

Liu, Mengque; Fan, Xinyan; Fang, Kuangnan; Zhang, Qingzhao; Ma, Shuangge

2017-12-01

In the analysis of gene expression data, dimension reduction techniques have been extensively adopted. The most popular one is perhaps the PCA (principal component analysis). To generate more reliable and more interpretable results, the SPCA (sparse PCA) technique has been developed. With the "small sample size, high dimensionality" characteristic of gene expression data, the analysis results generated from a single dataset are often unsatisfactory. Under contexts other than dimension reduction, integrative analysis techniques, which jointly analyze the raw data of multiple independent datasets, have been developed and shown to outperform "classic" meta-analysis and other multidatasets techniques and single-dataset analysis. In this study, we conduct integrative analysis by developing the iSPCA (integrative SPCA) method. iSPCA achieves the selection and estimation of sparse loadings using a group penalty. To take advantage of the similarity across datasets and generate more accurate results, we further impose contrasted penalties. Different penalties are proposed to accommodate different data conditions. Extensive simulations show that iSPCA outperforms the alternatives under a wide spectrum of settings. The analysis of breast cancer and pancreatic cancer data further shows iSPCA's satisfactory performance. © 2017 WILEY PERIODICALS, INC.
Gene organization in rice revealed by full-length cDNA mapping and gene expression analysis through microarray.

Directory of Open Access Journals (Sweden)

Kouji Satoh

Rice (Oryza sativa L.) is a model organism for the functional genomics of monocotyledonous plants since the genome size is considerably smaller than those of other monocotyledonous plants. Although highly accurate genome sequences of indica and japonica rice are available, additional resources such as full-length complementary DNA (FL-cDNA) sequences are also indispensable for comprehensive analyses of gene structure and function. We cross-referenced 28.5K individual loci in the rice genome defined by mapping of 578K FL-cDNA clones with the 56K loci predicted in the TIGR genome assembly. Based on the annotation status and the presence of corresponding cDNA clones, genes were classified into 23K annotated expressed (AE) genes, 33K annotated non-expressed (ANE) genes, and 5.5K non-annotated expressed (NAE) genes. We developed a 60mer oligo-array for analysis of gene expression from each locus. Analysis of gene structures and expression levels revealed that the general features of gene structure and expression of NAE and ANE genes were considerably different from those of AE genes. The results also suggested that the cloning efficiency of rice FL-cDNA is associated with the transcription activity of the corresponding genetic locus, although other factors may also have an effect. Comparison of the coverage of FL-cDNA among gene families suggested that FL-cDNA from genes encoding rice- or eukaryote-specific domains, and those involved in regulatory functions were difficult to produce in bacterial cells. Collectively, these results indicate that rice genes can be divided into distinct groups based on transcription activity and gene structure, and that the coverage bias of FL-cDNA clones exists due to the incompatibility of certain eukaryotic genes in bacteria.
MAGMA: Generalized Gene-Set Analysis of GWAS Data

NARCIS (Netherlands)

de Leeuw, C.A.; Mooij, J.M.; Heskes, T.; Posthuma, D.

2015-01-01

By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical
MAGMA: generalized gene-set analysis of GWAS data.

NARCIS (Netherlands)

de Leeuw, C.A.; Mooij, J.M.; Heskes, T.; Posthuma, D.

2015-01-01

By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical
Identification of Putative Genes Involved in Limonoids Biosynthesis in Citrus by Comparative Transcriptomic Analysis

Directory of Open Access Journals (Sweden)

Fusheng Wang

2017-05-01

Limonoids produced by citrus are a group of highly bioactive secondary metabolites which provide health benefits for humans. Currently there is a lack of information derived from research on the genetic mechanisms controlling the biosynthesis of limonoids, which has limited the improvement of citrus for high production of limonoids. In this study, the transcriptome sequences of leaves, phloems and seeds of pummelo (Citrus grandis (L.) Osbeck) at different development stages with variances in limonoids contents were used for digital gene expression profiling analysis in order to identify the genes corresponding to the biosynthesis of limonoids. Pair-wise comparison of transcriptional profiles between different tissues identified 924 differentially expressed genes commonly shared between them. Expression pattern analysis suggested that 382 genes from three conjunctive groups of K-means clustering could be possibly related to the biosynthesis of limonoids. Correlation analysis with the samples from different genotypes, and different developing tissues of the citrus revealed that the expression of 15 candidate genes was highly correlated with the contents of limonoids. Among them, the cytochrome P450s (CYP450s) and transcriptional factor MYB demonstrated significantly high correlation coefficients, which indicated the importance of those genes for the biosynthesis of limonoids. The CiOSC gene encoding the critical enzyme oxidosqualene cyclase (OSC) for biosynthesis of the precursor of triterpene scaffolds was found to correspond positively to the accumulation of limonoids during the development of seeds. Suppressing the expression of CiOSC with VIGS (virus-induced gene silencing) demonstrated that the level of gene silencing was significantly correlated to the reduction of limonoids contents. The results indicated that the CiOSC gene plays a pivotal role in biosynthesis of limonoids.
Form gene clustering method about pan-ethnic-group products based on emotional semantic

Science.gov (United States)

Chen, Dengkai; Ding, Jingjing; Gao, Minzhuo; Ma, Danping; Liu, Donghui

2016-09-01

The use of pan-ethnic-group products form knowledge primarily depends on a designer's subjective experience without user participation. The majority of studies primarily focus on the detection of the perceptual demands of consumers from the target product category. A pan-ethnic-group products form gene clustering method based on emotional semantic is constructed. Consumers' perceptual images of the pan-ethnic-group products are obtained by means of product form gene extraction and coding and computer aided product form clustering technology. A case of form gene clustering about the typical pan-ethnic-group products is investigated which indicates that the method is feasible. This paper opens up a new direction for the future development of product form design which improves the agility of product design process in the era of Industry 4.0.
Three phylogenetic groups of nodA and nifH genes in Sinorhizobium and Mesorhizobium isolates from leguminous trees growing in Africa and Latin America.

Science.gov (United States)

Haukka, K; Lindström, K; Young, J P

1998-02-01

The diversity and phylogeny of nodA and nifH genes were studied by using 52 rhizobial isolates from Acacia senegal, Prosopis chilensis, and related leguminous trees growing in Africa and Latin America. All of the strains had similar host ranges and belonged to the genera Sinorhizobium and Mesorhizobium, as previously determined by 16S rRNA gene sequence analysis. The restriction patterns and a sequence analysis of the nodA and nifH genes divided the strains into the following three distinct groups: sinorhizobia from Africa, sinorhizobia from Latin America, and mesorhizobia from both regions. In a phylogenetic tree also containing previously published sequences, the nodA genes of our rhizobia formed a branch of their own, but within the branch no correlation between symbiotic genes and host trees was apparent. Within the large group of African sinorhizobia, similar symbiotic gene types were found in different chromosomal backgrounds, suggesting that transfer of symbiotic genes has occurred across species boundaries. Most strains had plasmids, and the presence of plasmid-borne nifH was demonstrated by hybridization for some examples. The nodA and nifH genes of Sinorhizobium teranga ORS1009T grouped with the nodA and nifH genes of the other African sinorhizobia, but Sinorhizobium saheli ORS609T had a totally different nodA sequence, although it was closely related based on the 16S rRNA gene and nifH data. This might be because this S. saheli strain was originally isolated from Sesbania sp., which belongs to a different cross-nodulation group than Acacia and Prosopis spp. The factors that appear to have influenced the evolution of rhizobial symbiotic genes vary in importance at different taxonomic levels.
Genome-Wide Analysis of the Aquaporin Gene Family in Chickpea (Cicer arietinum L.).

Science.gov (United States)

Deokar, Amit A; Tar'an, Bunyamin

2016-01-01

Aquaporins (AQPs) are essential membrane proteins that play a critical role in the transport of water and many other solutes across cell membranes. In this study, a comprehensive genome-wide analysis identified 40 AQP genes in chickpea (Cicer arietinum L.). A complete overview of the chickpea AQP (CaAQP) gene family is presented, including their chromosomal locations, gene structure, phylogeny, gene duplication, conserved functional motifs, gene expression, and conserved promoter motifs. To understand AQP's evolution, a comparative analysis of chickpea AQPs with AQP orthologs from soybean, Medicago, common bean, and Arabidopsis was performed. The chickpea AQP genes were found on all of the chickpea chromosomes, except chromosome 7, with a maximum of six genes on chromosome 6, and a minimum of one gene on chromosome 5. Gene duplication analysis indicated that the expansion of chickpea AQP gene family might have been due to segmental and tandem duplications. CaAQPs were grouped into four subfamilies including 15 NOD26-like intrinsic proteins (NIPs), 13 tonoplast intrinsic proteins (TIPs), eight plasma membrane intrinsic proteins (PIPs), and four small basic intrinsic proteins (SIPs) based on sequence similarities and phylogenetic position. Gene structure analysis revealed a highly conserved exon-intron pattern within CaAQP subfamilies supporting the CaAQP family classification. Functional prediction based on conserved Ar/R selectivity filters, Froger's residues, and specificity-determining positions suggested wide differences in substrate specificity among the subfamilies of CaAQPs. Expression analysis of the AQP genes indicated that some of the genes are tissue-specific, whereas a few other AQP genes showed differential expression in response to biotic and abiotic stresses. Promoter profiling of CaAQP genes for conserved cis-acting regulatory elements revealed enrichment of cis-elements involved in circadian control, light response, defense and stress responsiveness
Expression Analysis of Gata4, Tbx5 and Nkx2.5 Genes Involved in Congenital Heart Disease

Directory of Open Access Journals (Sweden)

Mahta Mazaheri-Naeeini

2016-04-01

Background: Congenital heart disease (CHD) is the most widespread congenital disease in newborn babies and is one of the main causes of death worldwide. The causal agent of congenital heart diseases is unknown, but genetic factors have an important role in the prevalence of the disease. Objectives: The main objective of this research is to compare the expression levels of three genes, Gata4, Tbx5 and Nkx2.5, in three groups of children between 6 months and 13 years old with congenital heart disease. Patients and Methods: In this case-control study, 30 samples from each of the cyanotic and acyanotic patient groups and 30 samples from healthy children as controls were used. RNA extraction was done using a commercial kit and gene expression analysis was performed by qRT-PCR in three replicates for the Gata4, Tbx5 and Nkx2.5 genes. Data analysis was done with REST software. Results: The RNA extraction and cDNA synthesis of all samples yielded genetic material of high quantity and quality. The expression level of the tested genes was reduced in both patient groups. In the cyanotic group the reduction was greater than in the acyanotic samples. All tested genes were reduced in both groups. The Tbx5 gene was suppressed more than the other genes. Conclusions: Based on our results we conclude that a gene family plays an important role in cardiogenesis and heart formation. These genes are closely related. Genetic counselling for the parents of these patients to determine probable genetic mutations is therefore recommended.
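The REST software named in this abstract reports relative expression ratios from qRT-PCR data using an efficiency-corrected model (the Pfaffl ratio). The sketch below illustrates only that ratio, under stated assumptions: the abstract gives no reference gene, amplification efficiencies or Ct values, so every number and name here is hypothetical, and REST's randomization test for significance is not reproduced.

```python
def relative_expression_ratio(
    e_target, ct_target_control, ct_target_sample,
    e_ref, ct_ref_control, ct_ref_sample,
):
    """Efficiency-corrected relative expression ratio (Pfaffl model).

    e_target, e_ref: amplification efficiencies (2.0 means perfect doubling).
    ct_*_control:    mean Ct in the control group.
    ct_*_sample:     mean Ct in the patient group.
    A ratio below 1 indicates lower target expression in the patient group.
    """
    delta_target = ct_target_control - ct_target_sample
    delta_ref = ct_ref_control - ct_ref_sample
    return (e_target ** delta_target) / (e_ref ** delta_ref)


# Hypothetical Ct values: the target gene crosses threshold two cycles later
# in patients, while the reference gene is unchanged.
ratio = relative_expression_ratio(2.0, 25.0, 27.0, 2.0, 20.0, 20.0)
print(f"relative expression (patient vs control): {ratio:.2f}")
```

With perfect efficiencies and an unchanged reference gene, a two-cycle delay corresponds to a four-fold reduction, which is the kind of down-regulation the abstract reports qualitatively.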
Genome-Wide Analysis of the Musa WRKY Gene Family: Evolution and Differential Expression during Development and Stress.

Science.gov (United States)

Goel, Ridhi; Pandey, Ashutosh; Trivedi, Prabodh K; Asif, Mehar H

2016-01-01

The WRKY gene family plays an important role in the development and stress responses in plants. As information is not available on the WRKY gene family in Musa species, genome-wide analysis has been carried out in this study using available genomic information from two species, Musa acuminata and Musa balbisiana. Analysis identified 147 and 132 members of the WRKY gene family in M. acuminata and M. balbisiana, respectively. Evolutionary analysis suggests that the WRKY gene family expanded much before the speciation in both the species. Most of the orthologs retained in two species were from the γ duplication event which occurred prior to α and β genome-wide duplication (GWD) events. Analysis also suggests that subtle changes in nucleotide sequences during the course of evolution have led to the development of new motifs which might be involved in neo-functionalization of different WRKY members in two species. Expression and cis-regulatory motif analysis suggest possible involvement of Group II and Group III WRKY members during various stresses and growth/development including fruit ripening process respectively.
Genome-wide analysis of the Musa WRKY gene family: evolution and differential expression during development and stress

Directory of Open Access Journals (Sweden)

Ridhi Goel

2016-03-01

Full Text Available The WRKY gene family plays an important role in the development and stress responses in plants. As information is not available on the WRKY gene family in Musa species, genome-wide analysis has been carried out in this study using available genomic information from two species, Musa acuminata and Musa balbisiana. Analysis identified 147 and 132 members of the WRKY gene family in M. acuminata and M. balbisiana, respectively. Evolutionary analysis suggests that the WRKY gene family expanded much before the speciation in both the species. Most of the orthologs retained in two species were from the γ duplication event which occurred prior to α and β genome-wide duplication (GWD) events. Analysis also suggests that subtle changes in nucleotide sequences during the course of evolution have led to the development of new motifs which might be involved in neo-functionalization of different WRKY members in two species. Expression and cis-regulatory motif analysis suggest possible involvement of Group II and Group III WRKY members during various stresses and growth/development including fruit ripening process, respectively.
ERC analysis: web-based inference of gene function via evolutionary rate covariation.

Science.gov (United States)

Wolfe, Nicholas W; Clark, Nathan L

2015-12-01

The recent explosion of comparative genomics data presents an unprecedented opportunity to construct gene networks via the evolutionary rate covariation (ERC) signature. ERC is used to identify genes that experienced similar evolutionary histories, and thereby draws functional associations between them. The ERC Analysis website allows researchers to exploit genome-wide datasets to infer novel genes in any biological function and to explore deep evolutionary connections between distinct pathways and complexes. The website provides five analytical methods, graphical output, statistical support and access to an increasing number of taxonomic groups. Analyses and data are available at http://csb.pitt.edu/erc_analysis/. Contact: nclark@pitt.edu. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
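As a rough illustration of the idea in the abstract above, ERC between two genes is commonly described as the correlation between their branch-specific evolutionary rates measured over the same set of phylogenetic branches. The sketch below computes a plain Pearson correlation; it is not the ERC Analysis website's implementation, and the function name and the rate values are hypothetical.

```python
import math

def rate_covariation(rates_a, rates_b):
    """Pearson correlation between two genes' per-branch rates.

    rates_a and rates_b must list rates for the same branches in the
    same order. This is an illustrative stand-in for an ERC value.
    """
    if len(rates_a) != len(rates_b) or len(rates_a) < 2:
        raise ValueError("need two equal-length rate vectors with >= 2 branches")
    mean_a = sum(rates_a) / len(rates_a)
    mean_b = sum(rates_b) / len(rates_b)
    cov = sum((a - mean_a) * (b - mean_b) for a, b in zip(rates_a, rates_b))
    var_a = sum((a - mean_a) ** 2 for a in rates_a)
    var_b = sum((b - mean_b) ** 2 for b in rates_b)
    if var_a == 0 or var_b == 0:
        raise ValueError("rates must vary across branches")
    return cov / math.sqrt(var_a * var_b)

# Hypothetical relative rates on five shared branches.
gene_x = [0.8, 1.2, 1.5, 0.6, 1.0]
gene_y = [0.7, 1.3, 1.4, 0.5, 1.1]
print(round(rate_covariation(gene_x, gene_y), 3))
```

A value near 1 would indicate that the two genes sped up and slowed down on the same branches, which is the kind of signal used to propose a functional association.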
Network-based differential gene expression analysis suggests cell cycle related genes regulated by E2F1 underlie the molecular difference between smoker and non-smoker lung adenocarcinoma

Science.gov (United States)

2013-01-01

Background Differential gene expression (DGE) analysis is commonly used to reveal the deregulated molecular mechanisms of complex diseases. However, traditional DGE analysis (e.g., the t test or the rank sum test) tests each gene independently without considering interactions between them. Top-ranked differentially regulated genes prioritized by the analysis may not directly relate to the coherent molecular changes underlying complex diseases. Joint analyses of co-expression and DGE have been applied to reveal the deregulated molecular modules underlying complex diseases. Most of these methods consist of separate steps: first to identify gene-gene relationships under the studied phenotype, then to integrate them with gene expression changes for prioritizing signature genes, or vice versa. A method is warranted that can simultaneously consider gene-gene co-expression strength and corresponding expression level changes so that both types of information can be leveraged optimally. Results In this paper, we develop a gene module based method for differential gene expression analysis, named network-based differential gene expression (nDGE) analysis, a one-step integrative process for prioritizing deregulated genes and grouping them into gene modules. We demonstrate that nDGE outperforms existing methods in prioritizing deregulated genes and discovering deregulated gene modules using simulated data sets. When tested on a series of smoker and non-smoker lung adenocarcinoma data sets, we show that top differentially regulated genes identified by the rank sum test in different sets are not consistent while top ranked genes defined by nDGE in different data sets significantly overlap. nDGE results suggest that a differentially regulated gene module, which is enriched for cell cycle related genes and E2F1 targeted genes, plays a role in the molecular differences between smoker and non-smoker lung adenocarcinoma. Conclusions In this paper, we develop nDGE to prioritize
Comparative Genetic Variability in HIV-1 Subtype C vpu Gene in Early Age Groups of Infants.

Science.gov (United States)

Sharma, Uma; Gupta, Poonam; Gupta, Sunil; Venkatesh, S; Husain, Mohammad

2018-01-01

Identifying the genetic variability in vertically transmitted viruses in early infancy is important to understand the disease progression. Being important in HIV-1 disease pathogenesis, the vpu gene, isolated from young infants, was investigated to understand the viral characteristics. Blood samples were obtained from 80 HIV-1 positive infants, categorized in two age groups: acute (<6 months) and early (>6-18 months). A total of 77 PCR positive samples, amplified for the vpu gene, were sequenced and analyzed. 73 isolates belonged to subtype C. Analysis of heterogeneity of amino acid sequences in infant groups showed that in the sequences of the acute age group both insertions and deletions were present, while in the early age group only deletions were present. In the acute age group, a deletion of 3 residues (RAE) in the first alpha helix in one sequence and insertions of 1-2 residues (DM, GH, G and H) in the second alpha helix in 4 sequences were observed. In the early age group, deletion of 2 residues (VN) in the cytoplasmic tail region in 2 sequences was observed. Length of the amino terminal was observed to be gradually increasing with the increasing age of the infants. Protein Variation Effect Analyzer software showed that deleterious mutations were more frequent in the acute than in the early age group. Entropy analysis revealed that heterogeneity of the residues was comparatively higher in the sequences of the acute than of the early age group. Mutations observed in the helices may affect the conformation and cause loss of the ability to degrade CD4 receptors. Heterogeneity was decreasing with the increasing ages of the infants, indicating positive selection for robust virion survival. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
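The entropy analysis mentioned in the abstract above can be pictured as a per-position Shannon entropy over aligned amino acid sequences. The following is a minimal sketch of that calculation, not the tool the authors used: the base-2 logarithm, the treatment of gap characters as ordinary symbols, and the toy sequences are all assumptions.

```python
import math
from collections import Counter

def column_entropy(column):
    """Shannon entropy (bits) of the residues observed at one alignment column."""
    counts = Counter(column)
    total = len(column)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

def alignment_entropy(sequences):
    """Per-column entropy for equal-length aligned sequences."""
    if len({len(s) for s in sequences}) != 1:
        raise ValueError("sequences must be non-empty and of equal length")
    # zip(*sequences) walks the alignment column by column.
    return [column_entropy(col) for col in zip(*sequences)]

# Hypothetical aligned fragments; '-' marks a gap and counts as a symbol here.
aligned = ["MQPL", "MQSL", "MHSL", "M-SL"]
print([round(h, 3) for h in alignment_entropy(aligned)])
```

Fully conserved columns score 0, and the score grows as more residues share a position more evenly, so averaging the per-column values gives one way to compare heterogeneity between groups of sequences.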
Gene expression analysis of flax seed development

Science.gov (United States)

2011-01-01

Background Flax, Linum usitatissimum L., is an important crop whose seed oil and stem fiber have multiple industrial applications. Flax seeds are also well-known for their nutritional attributes, viz., omega-3 fatty acids in the oil and lignans and mucilage from the seed coat. In spite of the importance of this crop, there are few molecular resources that can be utilized toward improving seed traits. Here, we describe flax embryo and seed development and generation of comprehensive genomic resources for the flax seed. Results We describe a large-scale generation and analysis of expressed sequences in various tissues. Collectively, the 13 libraries we have used provide a broad representation of genes active in developing embryos (globular, heart, torpedo, cotyledon and mature stages) seed coats (globular and torpedo stages) and endosperm (pooled globular to torpedo stages) and genes expressed in flowers, etiolated seedlings, leaves, and stem tissue. A total of 261,272 expressed sequence tags (EST) (GenBank accessions LIBEST_026995 to LIBEST_027011) were generated. These EST libraries included transcription factor genes that are typically expressed at low levels, indicating that the depth is adequate for in silico expression analysis. Assembly of the ESTs resulted in 30,640 unigenes and 82% of these could be identified on the basis of homology to known and hypothetical genes from other plants. When compared with fully sequenced plant genomes, the flax unigenes resembled poplar and castor bean more than grape, sorghum, rice or Arabidopsis. Nearly one-fifth of these (5,152) had no homologs in sequences reported for any organism, suggesting that this category represents genes that are likely unique to flax. Digital analyses revealed gene expression dynamics for the biosynthesis of a number of important seed constituents during seed development. Conclusions We have developed a foundational database of expressed sequences and collection of plasmid clones that comprise
Gene expression analysis of flax seed development

Directory of Open Access Journals (Sweden)

Sharpe Andrew

2011-04-01

Full Text Available Abstract Background Flax, Linum usitatissimum L., is an important crop whose seed oil and stem fiber have multiple industrial applications. Flax seeds are also well-known for their nutritional attributes, viz., omega-3 fatty acids in the oil and lignans and mucilage from the seed coat. In spite of the importance of this crop, there are few molecular resources that can be utilized toward improving seed traits. Here, we describe flax embryo and seed development and generation of comprehensive genomic resources for the flax seed. Results We describe a large-scale generation and analysis of expressed sequences in various tissues. Collectively, the 13 libraries we have used provide a broad representation of genes active in developing embryos (globular, heart, torpedo, cotyledon and mature stages), seed coats (globular and torpedo stages) and endosperm (pooled globular to torpedo stages) and genes expressed in flowers, etiolated seedlings, leaves, and stem tissue. A total of 261,272 expressed sequence tags (EST) (GenBank accessions LIBEST_026995 to LIBEST_027011) were generated. These EST libraries included transcription factor genes that are typically expressed at low levels, indicating that the depth is adequate for in silico expression analysis. Assembly of the ESTs resulted in 30,640 unigenes and 82% of these could be identified on the basis of homology to known and hypothetical genes from other plants. When compared with fully sequenced plant genomes, the flax unigenes resembled poplar and castor bean more than grape, sorghum, rice or Arabidopsis. Nearly one-fifth of these (5,152) had no homologs in sequences reported for any organism, suggesting that this category represents genes that are likely unique to flax. Digital analyses revealed gene expression dynamics for the biosynthesis of a number of important seed constituents during seed development. Conclusions We have developed a foundational database of expressed sequences and collection of plasmid
GO-Bayes: Gene Ontology-based overrepresentation analysis using a Bayesian approach.

Science.gov (United States)

Zhang, Song; Cao, Jing; Kong, Y Megan; Scheuermann, Richard H

2010-04-01

A typical approach for the interpretation of high-throughput experiments, such as gene expression microarrays, is to produce groups of genes based on certain criteria (e.g. genes that are differentially expressed). To gain more mechanistic insights into the underlying biology, overrepresentation analysis (ORA) is often conducted to investigate whether gene sets associated with particular biological functions, for example, as represented by Gene Ontology (GO) annotations, are statistically overrepresented in the identified gene groups. However, the standard ORA, which is based on the hypergeometric test, analyzes each GO term in isolation and does not take into account the dependence structure of the GO-term hierarchy. We have developed a Bayesian approach (GO-Bayes) to measure overrepresentation of GO terms that incorporates the GO dependence structure by taking into account evidence not only from individual GO terms, but also from their related terms (i.e. parents, children, siblings, etc.). The Bayesian framework borrows information across related GO terms to strengthen the detection of overrepresentation signals. As a result, this method tends to identify sets of closely related GO terms rather than individual isolated GO terms. The advantage of the GO-Bayes approach is demonstrated with a simulation study and an application example.
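The standard ORA that the abstract above contrasts with GO-Bayes rests on the hypergeometric tail probability: given N background genes of which K carry a GO term, how likely is it to see at least k annotated genes in a selected group of n? A minimal sketch of that baseline test follows; it does not implement GO-Bayes itself, and the function name and the counts in the example are hypothetical.

```python
from math import comb

def ora_pvalue(background, annotated, selected, overlap):
    """Upper-tail hypergeometric p-value P(X >= overlap).

    background: total genes measured (N)
    annotated:  genes in the background carrying the GO term (K)
    selected:   size of the gene group of interest (n)
    overlap:    genes in the group that carry the GO term (k)
    """
    if not (0 <= annotated <= background and 0 <= selected <= background):
        raise ValueError("annotated and selected must lie within the background")
    upper = min(annotated, selected)
    if not 0 <= overlap <= upper:
        raise ValueError("overlap is outside the possible range")
    # math.comb returns 0 when the draw is impossible, so no extra bounds are needed.
    favourable = sum(
        comb(annotated, i) * comb(background - annotated, selected - i)
        for i in range(overlap, upper + 1)
    )
    return favourable / comb(background, selected)

# Hypothetical counts: 20,000 genes, 150 annotated, 300 selected, 9 in common.
print(ora_pvalue(20000, 150, 300, 9))
```

Because each term is tested with its own counts, terms that are parents or children of one another in the GO hierarchy are scored as if they were unrelated, which is the limitation the abstract describes.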

Analysis of MaACS2, a stress-inducible ACC Synthase Gene in Musa acuminata AAA Group Cultivar Pisang Ambon

Directory of Open Access Journals (Sweden)

Resnanti Utami Handayani

2014-07-01

Full Text Available Ethylene has an important function in plant growth and development. Ethylene production generally increases in response to pathogen attacks and other environmental stress conditions. The synthesis of this phytohormone is regulated by two enzymes, ACC synthase (ACS) and ACC oxidase (ACO). ACC synthase, which is encoded by a multigene family, regulates the production of ACC, after which this precursor is converted into ethylene by ACO. Pisang Ambon (Musa sp. AAA group), a banana cultivar originating from Indonesia, has nine ACS genes (MaACS 1-9) and one ACO gene (MaACO). One of the banana ACS genes, MaACS2, is stress-inducible. In this research, we investigated the expression profile of MaACS2 in the root and leaf tissues of infected tissue culture plants. Gene expression was quantified by real-time PCR (qPCR) using Ma18srRNA and MaGAPDH as reference genes. The results showed nine- to ten-fold higher MaACS2 expression levels in the infected root tissues compared to the uninfected root tissues. However, MaACS2 expression in the leaves was only detected in infected tissue.
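The abstract above reports fold changes from qPCR normalized to two reference genes but does not state the calculation used. One widely used option is the 2^-ΔΔCt method, sketched below under the assumptions of roughly 100% amplification efficiency and of averaging the reference-gene Ct values (equivalent to a geometric mean of their expression levels). The function name and all Ct values are hypothetical.

```python
from statistics import mean

def fold_change(target_ct_treated, ref_cts_treated,
                target_ct_control, ref_cts_control):
    """Relative expression by 2^-ddCt, treated versus control.

    ref_cts_* are the Ct values of the reference genes in each sample;
    their arithmetic mean serves as the normalizer (an assumption).
    """
    delta_treated = target_ct_treated - mean(ref_cts_treated)
    delta_control = target_ct_control - mean(ref_cts_control)
    return 2 ** -(delta_treated - delta_control)

# Hypothetical Ct values: target gene with two reference genes per sample.
infected = fold_change(22.0, [18.0, 20.0], 25.0, [18.0, 20.0])
print(infected)  # 8.0: the target crosses threshold 3 cycles earlier after normalization
```

A target that is undetected in the control tissue has no control Ct, so a ratio like this cannot be formed for it, which is consistent with reporting only presence in that case.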
Biclustering methods: biological relevance and application in gene expression analysis.

Directory of Open Access Journals (Sweden)

Ali Oghabian

Full Text Available DNA microarray technologies are used extensively to profile the expression levels of thousands of genes under various conditions, yielding extremely large data-matrices. Thus, analyzing this information and extracting biologically relevant knowledge becomes a considerable challenge. A classical approach for tackling this challenge is to use clustering (also known as one-way clustering) methods, where genes (or, respectively, samples) are grouped together based on the similarity of their expression profiles across the set of all samples (or, respectively, genes). An alternative approach is to develop biclustering methods to identify local patterns in the data. These methods extract subgroups of genes that are co-expressed across only a subset of samples and may feature important biological or medical implications. In this study we evaluate 13 biclustering and 2 clustering (k-means and hierarchical) methods. We use several approaches to compare their performance on two real gene expression data sets. For this purpose we apply four evaluation measures in our analysis: (1) we examine how well the considered (bi)clustering methods differentiate various sample types; (2) we evaluate how well the groups of genes discovered by the (bi)clustering methods are annotated with similar Gene Ontology categories; (3) we evaluate the capability of the methods to differentiate genes that are known to be specific to the particular sample types we study; and (4) we compare the running time of the algorithms. In the end, we conclude that as long as the samples are well defined and annotated, the contamination of the samples is limited, and the samples are well replicated, biclustering methods such as Plaid and SAMBA are useful for discovering relevant subsets of genes and samples.
Digital gene expression analysis in mice lung with coinfection of influenza and streptococcus pneumoniae.

Science.gov (United States)

Luo, Jun; Zhou, Linlin; Wang, Hongren; Qin, Zhen; Xiang, Li; Zhu, Jie; Huang, Xiaojun; Yang, Yuan; Li, Wanyi; Wang, Baoning; Li, Mingyuan

2017-12-22

Influenza A virus (IAV) and Streptococcus pneumoniae (SP) are two major upper respiratory tract pathogens that can also cause infection in polarized bronchial epithelial cells to exacerbate disease in coinfected individuals, which may result in significant morbidity. However, the underlying molecular mechanism is poorly understood. Here, we employed BALB/c ByJ mice infected with SP, IAV, IAV followed by SP (IAV+SP) and PBS (Control) as models to survey the global gene expression using digital gene expression (DGE) profiling. We attempt to gain insights into the underlying genetic basis of this synergy at the expression level. Gene expression profiles were obtained using the Illumina/HiSeq sequencing technique, and further analyzed by enrichment analysis of Gene Ontology (GO) and Pathway function. The hematoxylin-eosin (HE) staining revealed different tissue changes among the groups, of which the IAV+SP group showed the most severe cell apoptosis. Compared with Control, a total of 2731, 3221 and 3946 differentially expressed genes (DEGs) were detected in SP, IAV and IAV+SP, respectively. In addition, sixty-two GO terms were identified by Gene Ontology functional enrichment analysis, such as cell killing, biological regulation, response to stimulus, signaling, biological adhesion, enzyme regulator activity, receptor regulator activity and translation regulator activity. Pathway significant enrichment analysis indicated the dysregulation of multiple pathways, including the apoptosis pathway. Among these, five selected genes were further verified by quantitative reverse transcription-polymerase chain reaction (qRT-PCR). This study shows that infection with SP, IAV or IAV+SP induces apoptosis to different degrees, which might provide insights into the molecular mechanisms to facilitate further research.
Microarray analysis of pancreatic gene expression during biotin repletion in biotin-deficient rats.

Science.gov (United States)

Dakshinamurti, Krishnamurti; Bagchi, Rushita A; Abrenica, Bernard; Czubryt, Michael P

2015-12-01

Biotin is a B vitamin involved in multiple metabolic pathways. In humans, biotin deficiency is relatively rare but can cause dermatitis, alopecia, and perosis. Low biotin levels occur in individuals with type-2 diabetes, and supplementation with biotin plus chromium may improve blood sugar control. The acute effect on pancreatic gene expression of biotin repletion following chronic deficiency is unclear, therefore we induced biotin deficiency in adult male rats by feeding them a 20% raw egg white diet for 6 weeks. Animals were then randomized into 2 groups: one group received a single biotin supplement and returned to normal chow lacking egg white, while the second group remained on the depletion diet. After 1 week, pancreata were removed from biotin-deficient (BD) and biotin-repleted (BR) animals and RNA was isolated for microarray analysis. Biotin depletion altered gene expression in a manner indicative of inflammation, fibrosis, and defective pancreatic function. Conversely, biotin repletion activated numerous repair and anti-inflammatory pathways, reduced fibrotic gene expression, and induced multiple genes involved in pancreatic endocrine and exocrine function. A subset of the results was confirmed by quantitative real-time PCR analysis, as well as by treatment of pancreatic AR42J cells with biotin. The results indicate that biotin repletion, even after lengthy deficiency, results in the rapid induction of repair processes in the pancreas.
[BIOINFORMATIC SEARCH AND PHYLOGENETIC ANALYSIS OF THE CELLULOSE SYNTHASE GENES OF FLAX (LINUM USITATISSIMUM)].

Science.gov (United States)

Pydiura, N A; Bayer, G Ya; Galinousky, D V; Yemets, A I; Pirko, Ya V; Podvitski, T A; Anisimova, N V; Khotyleva, L V; Kilchevsky, A V; Blume, Ya B

2015-01-01

A bioinformatic search for sequences encoding cellulose synthase genes in the flax genome, and their comparison to dicot orthologs, was carried out. The analysis revealed 32 cellulose synthase gene candidates, 16 of which are highly likely to encode cellulose synthases, and the remaining 16 to encode cellulose synthase-like proteins (Csl). Phylogenetic analysis of the products of the cellulose synthase genes distinguished 6 groups of cellulose synthase genes of different classes: CesA1/10, CesA3, CesA4, CesA5/6/2/9, CesA7 and CesA8. Paralogous sequences within classes CesA1/10 and CesA5/6/2/9, which are associated with primary cell wall formation, are characterized by a greater similarity within these classes than orthologous sequences, whereas the genes controlling the biosynthesis of secondary cell wall cellulose form distinct clades: CesA4, CesA7, and CesA8. The analysis of the 16 identified flax cellulose synthase gene candidates shows the presence of at least 12 different cellulose synthase gene variants in the flax genome, which are represented in all six clades of cellulose synthase genes. Thus, at this point genes of all ten known cellulose synthase classes have been identified in the flax genome, but their correct classification requires additional research.
Genome-wide analysis of the WRKY gene family in physic nut (Jatropha curcas L.).

Science.gov (United States)

Xiong, Wangdan; Xu, Xueqin; Zhang, Lin; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

2013-07-25

The WRKY proteins, which contain highly conserved WRKYGQK amino acid sequences and zinc-finger-like motifs, constitute a large family of transcription factors in plants. They participate in diverse physiological and developmental processes. WRKY genes have been identified and characterized in a number of plant species. We identified a total of 58 WRKY genes (JcWRKY) in the genome of the physic nut (Jatropha curcas L.). On the basis of their conserved WRKY domain sequences, all of the JcWRKY proteins could be assigned to one of the previously defined groups, I-III. Phylogenetic analysis of JcWRKY genes with Arabidopsis and rice WRKY genes, and separately with castor bean WRKY genes, revealed no evidence of recent gene duplication in the JcWRKY gene family. Transcript abundance of the JcWRKY genes was examined in different tissues under normal growth conditions. In addition, 47 WRKY genes responded to at least one abiotic stress (drought, salinity, phosphate starvation and nitrogen starvation) in individual tissues (leaf, root and/or shoot cortex). Our study provides a useful reference data set as the basis for cloning and functional analysis of physic nut WRKY genes. Copyright © 2013 Elsevier B.V. All rights reserved.
Genes involved in immunity and apoptosis are associated with human presbycusis based on microarray analysis.

Science.gov (United States)

Dong, Yang; Li, Ming; Liu, Puzhao; Song, Haiyan; Zhao, Yuping; Shi, Jianrong

2014-06-01

Genes involved in immunity and apoptosis were associated with human presbycusis. CCR3 and GILZ played an important role in the pathogenesis of presbycusis, probably through regulating chemokine receptor, T-cell apoptosis, or T-cell activation pathways. To identify genes associated with human presbycusis and explore the molecular mechanism of presbycusis. Hearing function was tested by pure-tone audiometry. Microarray analysis was performed to identify presbycusis-correlated genes by Illumina Human-6 BeadChip using the peripheral blood samples of subjects. To identify biological process categories and pathways associated with presbycusis-correlated genes, bioinformatics analysis was carried out by Gene Ontology Tree Machine (GOTM) and database for annotation, visualization, and integrated discovery (DAVID). Quantitative RT-PCR (qRT-PCR) was used to validate the microarray data. Microarray analysis identified 469 up-regulated genes and 323 down-regulated genes. Both the dominant biological processes by Gene Ontology (GO) analysis and the enriched pathways by Kyoto encyclopedia of genes and genomes (KEGG) and BIOCARTA showed that genes involved in immunity and apoptosis were associated with presbycusis. In addition, CCR3, GILZ, CXCL10, and CX3CR1 genes showed consistent difference between groups for both the gene chip and qRT-PCR data. The differences of CCR3 and GILZ between presbycusis patients and controls were statistically significant (p < 0.05).
Global Analysis of miRNA Gene Clusters and Gene Families Reveals Dynamic and Coordinated Expression

Directory of Open Access Journals (Sweden)

Li Guo

2014-01-01

Full Text Available To further understand the potential expression relationships of miRNAs in miRNA gene clusters and gene families, a global analysis was performed in 4 paired tumor (breast cancer) and adjacent normal tissue samples using deep sequencing datasets. The compositions of miRNA gene clusters and families are not random, and clustered and homologous miRNAs may have close relationships with overlapped miRNA species. Members in the miRNA group always had various expression levels, and some even showed larger expression divergence. Despite the dynamic expression as well as individual differences, these miRNAs always indicated consistent or similar deregulation patterns. The consistent deregulated expression may contribute to dynamic and coordinated interaction between different miRNAs in the regulatory network. Further, we found that those clustered or homologous miRNAs that were also identified as sense and antisense miRNAs showed larger expression divergence. miRNA gene clusters and families indicated important biological roles, and the specific distribution and expression further enrich and ensure the flexible and robust regulatory network.
Genome-wide identification and expression analysis of MAPK and MAPKK gene family in Malus domestica.

Science.gov (United States)

Zhang, Shizhong; Xu, Ruirui; Luo, Xiaocui; Jiang, Zesheng; Shu, Huairui

2013-12-01

MAPK signal transduction modules, which are composed of three classes of hierarchically organized protein kinases, namely MAPKKKs, MAPKKs, and MAPKs, play crucial roles in regulating many biological processes in plants. Although genome-wide analysis of this family has been carried out in some species, little is known about MAPK and MAPKK genes in apple (Malus domestica). In this study, a total of 26 putative apple MAPK genes (MdMPKs) and 9 putative apple MAPKK genes (MdMKKs) have been identified and located within the apple genome. Phylogenetic analysis revealed that MdMAPKs and MdMAPKKs could each be divided into 4 subfamilies (groups A, B, C and D). The predicted MdMAPKs and MdMAPKKs were distributed across 13 out of 17 chromosomes with different densities. In addition, analysis of exon-intron junctions and of intron phase inside the predicted coding region of each candidate gene has revealed high levels of conservation within and between phylogenetic groups. According to the microarray and expressed sequence tag (EST) analysis, the different expression patterns indicate that they may play different roles during fruit development and the rootstock-scion interaction process. Moreover, expression profile analyses of MAPK and MAPKK genes were performed in different tissues (root, stem, leaf, flower and fruit), and all of the selected genes were expressed in at least one of the tissues tested, indicating that the MAPKs and MAPKKs are involved in various aspects of physiological and developmental processes of apple. To our knowledge, this is the first report of a genome-wide analysis of the apple MAPK and MAPKK gene family. This study provides valuable information for understanding the classification and putative functions of the MAPK signal in apple. © 2013.
Characterization of the bovine pregnancy-associated glycoprotein gene family – analysis of gene sequences, regulatory regions within the promoter and expression of selected genes

Directory of Open Access Journals (Sweden)

Walker Angela M

2009-04-01

Full Text Available Abstract Background The Pregnancy-associated glycoproteins (PAGs) belong to a large family of aspartic peptidases expressed exclusively in the placenta of species in the Artiodactyla order. In cattle, the PAG gene family is comprised of at least 22 transcribed genes, as well as some variants. Phylogenetic analyses have shown that the PAG family segregates into 'ancient' and 'modern' groupings. Along with sequence differences between family members, there are clear distinctions in their spatio-temporal distribution and in their relative level of expression. In this report, (1) we performed an in silico analysis of the bovine genome to further characterize the PAG gene family, (2) we scrutinized proximal promoter sequences of the PAG genes to evaluate the evolutionary pressures operating on them and to identify putative regulatory regions, (3) we determined relative transcript abundance of selected PAGs during pregnancy and (4) we performed preliminary characterization of the putative regulatory elements for one of the candidate PAGs, bovine (bo) PAG-2. Results From our analysis of the bovine genome, we identified 18 distinct PAG genes and 14 pseudogenes. We observed that the first 500 base pairs upstream of the translational start site contained multiple regions that are conserved among all boPAGs. However, a preponderance of conserved regions that harbor recognition sites for putative transcriptional factors (TFs) were found to be unique to the modern boPAG grouping and were not found in the ancient boPAGs. We gathered evidence by means of Q-PCR and screening of EST databases to show that boPAG-2 is the most abundant of all boPAG transcripts. Finally, we provided preliminary evidence for the role of ETS- and DDVL-related TFs in the regulation of the boPAG-2 gene. Conclusion PAGs represent a relatively large gene family in the bovine genome. The proximal promoter regions of these genes display differences in putative TF binding sites, likely contributing to observed
Analysis of a Gene Regulatory Cascade Mediating Circadian Rhythm in Zebrafish

Science.gov (United States)

Wang, Haifang; Du, Jiulin; Yan, Jun

2013-01-01

In the study of circadian rhythms, it has been a puzzle how a limited number of circadian clock genes can control diverse aspects of physiology. Here we investigate circadian gene expression genome-wide using larval zebrafish as a model system. We made use of a spatial gene expression atlas to investigate the expression of circadian genes in various tissues and cell types. Comparison of genome-wide circadian gene expression data between zebrafish and mouse revealed a nearly anti-phase relationship and allowed us to detect novel evolutionarily conserved circadian genes in vertebrates. We identified three groups of zebrafish genes with distinct responses to light entrainment: fast light-induced genes, slow light-induced genes, and dark-induced genes. Our computational analysis of the circadian gene regulatory network revealed several transcription factors (TFs) involved in diverse aspects of circadian physiology through transcriptional cascade. Of these, microphthalmia-associated transcription factor a (mitfa), a dark-induced TF, mediates a circadian rhythm of melanin synthesis, which may be involved in zebrafish's adaptation to daily light cycling. Our study describes a systematic method to discover previously unidentified TFs involved in circadian physiology in complex organisms. PMID:23468616
Bioinformatics Analysis Reveals Genes Involved in the Pathogenesis of Ameloblastoma and Keratocystic Odontogenic Tumor.

Science.gov (United States)

Santos, Eliane Macedo Sobrinho; Santos, Hércules Otacílio; Dos Santos Dias, Ivoneth; Santos, Sérgio Henrique; Batista de Paula, Alfredo Maurício; Feltenberger, John David; Sena Guimarães, André Luiz; Farias, Lucyana Conceição

2016-01-01

The pathogenesis of odontogenic tumors is not well known. It is important to identify genetic deregulations and molecular alterations. This study aimed to investigate, through bioinformatic analysis, the possible genes involved in the pathogenesis of ameloblastoma (AM) and keratocystic odontogenic tumor (KCOT). Genes involved in the pathogenesis of AM and KCOT were identified in GeneCards. The gene list was expanded, and the gene interaction network was mapped using the STRING software. The "weighted number of links" (WNL) was calculated to identify "leader genes" (highest WNL). Genes were ranked by the K-means method and the Kruskal-Wallis test was used (Preview data was used to corroborate the bioinformatics data. CDK1 was identified as the leader gene for AM. In the KCOT group, the results show PCNA and TP53. Both tumors exhibit power law behavior. Our topological analysis suggested leader genes possibly important in the pathogenesis of AM and KCOT, by the clustering coefficient calculated for both odontogenic tumors (0.028 for AM, zero for KCOT). The results obtained in the scatter diagram suggest an important relationship of these genes with the molecular processes involved in AM and KCOT. Ontological analysis for both AM and KCOT demonstrated different mechanisms. The bioinformatics analyses were confirmed through literature review. These results may suggest the involvement of promising genes for a better understanding of the pathogenesis of AM and KCOT.
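The "weighted number of links" step in the record above can be illustrated with a minimal sketch. It assumes that WNL is the sum of the interaction scores on every edge touching a gene and that scores are integers on a 0-1000 scale; the gene names G1-G4 and their scores are hypothetical. The abstract ranks genes with K-means, whereas this sketch simply takes the gene(s) with the highest WNL.

```python
from collections import defaultdict


def weighted_number_of_links(edges):
    """Sum the interaction scores of all edges touching each gene."""
    wnl = defaultdict(int)
    for gene_a, gene_b, score in edges:
        wnl[gene_a] += score
        wnl[gene_b] += score
    return dict(wnl)


def leader_genes(wnl):
    """Return the gene(s) with the highest WNL (simplified, no K-means)."""
    top = max(wnl.values())
    return sorted(gene for gene, value in wnl.items() if value == top)


# Hypothetical interaction network: (gene, gene, score on a 0-1000 scale).
edges = [
    ("G1", "G2", 900),
    ("G1", "G3", 800),
    ("G2", "G3", 500),
    ("G3", "G4", 300),
]

wnl = weighted_number_of_links(edges)
print(wnl)
print(leader_genes(wnl))
```

With these made-up scores G1 has the largest weighted degree, so it is the only gene returned by `leader_genes`.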
Bone Metastasis in Advanced Breast Cancer: Analysis of Gene Expression Microarray.

Science.gov (United States)

Cosphiadi, Irawan; Atmakusumah, Tubagus D; Siregar, Nurjati C; Muthalib, Abdul; Harahap, Alida; Mansyur, Muchtarruddin

2018-03-08

Approximately 30% to 40% of breast cancer recurrences involve bone metastasis (BM). Certain genes have been linked to BM; however, none have been able to predict bone involvement. In this study, we analyzed gene expression profiles in advanced breast cancer patients to elucidate genes that can be used to predict BM. A total of 92 advanced breast cancer patients, including 46 patients with BM and 46 patients without BM, were identified for this study. Immunohistochemistry and gene expression analysis was performed on 81 formalin-fixed paraffin-embedded samples. Data were collected through medical records, and gene expression of 200 selected genes compiled from 6 previous studies was performed using NanoString nCounter. Genetic expression profiles showed that 22 genes were significantly differentially expressed between breast cancer patients with metastasis in bone and other organs (BM+) and non-BM, whereas subjects with only BM showed 17 significantly differentially expressed genes. The following genes were associated with an increasing incidence of BM in the BM+ group: estrogen receptor 1 (ESR1), GATA binding protein 3 (GATA3), and melanophilin with an area under the curve (AUC) of 0.804. In the BM group, the following genes were associated with an increasing incidence of BM: ESR1, progesterone receptor, B-cell lymphoma 2, Rab escort protein, N-acetyltransferase 1, GATA3, annexin A9, and chromosome 9 open reading frame 116. ESR1 and GATA3 showed an increased strength of association with an AUC of 0.928. A combination of the identified 3 genes in BM+ and 8 genes in BM showed better prediction than did each individual gene, and this combination can be used as a training set. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Associations of candidate genes to age-related macular degeneration among racial/ethnic groups in the multi-ethnic study of atherosclerosis.

Science.gov (United States)

Klein, Ronald; Li, Xiaohui; Kuo, Jane Z; Klein, Barbara E K; Cotch, Mary Frances; Wong, Tien Y; Taylor, Kent D; Rotter, Jerome I

2013-11-01

To describe the relationships of selected candidate genes to the prevalence of early age-related macular degeneration (AMD) in a cohort of whites, blacks, Hispanics, and Chinese Americans. Cross-sectional study. setting: Multicenter study. study population: A total of 2456 persons aged 45-84 years with genotype information and fundus photographs. procedures: Twelve of 2862 single nucleotide polymorphisms (SNPs) from 11 of 233 candidate genes for cardiovascular disease were selected for analysis based on screening with marginal unadjusted P value ethnic groups. Logistic regression models tested for association in case-control samples. main outcome measure: Prevalence of early AMD. Early AMD was present in 4.0% of the cohort and varied from 2.4% in blacks to 6.0% in whites. The odds ratio increased from 2.3 for 1 to 10.0 for 4 risk alleles in a joint effect analysis of Age-Related Maculopathy Susceptibility 2 rs10490924 and Complement Factor H Y402H (P for trend = 4.2 × 10^-7). Frequencies of each SNP varied among the racial/ethnic groups. Adjusting for age and other factors, few statistically significant associations of the 12 SNPs with AMD were consistent across all groups. In a multivariate model, most candidate genes did not attenuate the comparatively higher odds of AMD in whites. The higher frequency of risk alleles for several SNPs in Chinese Americans may partially explain their AMD frequency's approaching that of whites. The relationships of 11 candidate genes to early AMD varied among 4 racial/ethnic groups, and partially explained the observed variations in early AMD prevalence among them. Copyright © 2013 Elsevier Inc. All rights reserved.
Principles of gene microarray data analysis.

Science.gov (United States)

Mocellin, Simone; Rossi, Carlo Riccardo

2007-01-01

The development of several gene expression profiling methods, such as comparative genomic hybridization (CGH), differential display, serial analysis of gene expression (SAGE), and gene microarray, together with the sequencing of the human genome, has provided an opportunity to monitor and investigate the complex cascade of molecular events leading to tumor development and progression. The availability of such large amounts of information has shifted the attention of scientists towards a nonreductionist approach to biological phenomena. High throughput technologies can be used to follow changing patterns of gene expression over time. Among them, gene microarray has become prominent because it is easier to use, does not require large-scale DNA sequencing, and allows for the parallel quantification of thousands of genes from multiple samples. Gene microarray technology is rapidly spreading worldwide and has the potential to drastically change the therapeutic approach to patients affected with tumor. Therefore, it is of paramount importance for both researchers and clinicians to know the principles underlying the analysis of the huge amount of data generated with microarray technology.
Exercise-associated DNA methylation change in skeletal muscle and the importance of imprinted genes: a bioinformatics meta-analysis.

Science.gov (United States)

Brown, William M

2015-12-01

Epigenetics is the study of processes--beyond DNA sequence alteration--producing heritable characteristics. For example, DNA methylation modifies gene expression without altering the nucleotide sequence. A well-studied DNA methylation-based phenomenon is genomic imprinting (ie, genotype-independent parent-of-origin effects). We aimed to elucidate: (1) the effect of exercise on DNA methylation and (2) the role of imprinted genes in skeletal muscle gene networks (ie, gene group functional profiling analyses). Gene ontology (ie, gene product elucidation)/meta-analysis. 26 skeletal muscle and 86 imprinted genes were subjected to g:Profiler ontology analysis. Meta-analysis assessed exercise-associated DNA methylation change. g:Profiler found four muscle gene networks with imprinted loci. Meta-analysis identified 16 articles (387 genes/1580 individuals) associated with exercise. Age, method, sample size, sex and tissue variation could elevate effect size bias. Only skeletal muscle gene networks including imprinted genes were reported. Exercise-associated effect sizes were calculated by gene. Age, method, sample size, sex and tissue variation were moderators. Six imprinted loci (RB1, MEG3, UBE3A, PLAGL1, SGCE, INS) were important for muscle gene networks, while meta-analysis uncovered five exercise-associated imprinted loci (KCNQ1, MEG3, GRB10, L3MBTL1, PLAGL1). DNA methylation decreased with exercise (60% of loci). Exercise-associated DNA methylation change was stronger among older people (ie, age accounted for 30% of the variation). Among older people, genes exhibiting DNA methylation decreases were part of a microRNA-regulated gene network functioning to suppress cancer. Imprinted genes were identified in skeletal muscle gene networks and exercise-associated DNA methylation change. Exercise-associated DNA methylation modification could rewind the 'epigenetic clock' as we age. CRD42014009800. Published by the BMJ Publishing Group Limited. For permission to use (where
Genome-wide methylation analysis identifies genes silenced in non-seminoma cell lines.

Science.gov (United States)

Noor, Dzul Azri Mohamed; Jeyapalan, Jennie N; Alhazmi, Safiah; Carr, Matthew; Squibb, Benjamin; Wallace, Claire; Tan, Christopher; Cusack, Martin; Hughes, Jaime; Reader, Tom; Shipley, Janet; Sheer, Denise; Scotting, Paul J

2016-01-01

Silencing of genes by DNA methylation is a common phenomenon in many types of cancer. However, the genome-wide effect of DNA methylation on gene expression has been analysed in relatively few cancers. Germ cell tumours (GCTs) are a complex group of malignancies. They are unique in developing from a pluripotent progenitor cell. Previous analyses have suggested that non-seminomas exhibit much higher levels of DNA methylation than seminomas. The genomic targets that are methylated, the extent to which this results in gene silencing and the identity of the silenced genes most likely to play a role in the tumours' biology have not yet been established. In this study, genome-wide methylation and expression analysis of GCT cell lines was combined with gene expression data from primary tumours to address this question. Genome methylation was analysed using the Illumina Infinium HumanMethylation450 BeadChip system and gene expression was analysed using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Regulation by methylation was confirmed by demethylation using 5-aza-2'-deoxycytidine and reverse transcription-quantitative PCR. Large differences in the level of methylation of the CpG islands of individual genes between tumour cell lines correlated well with differential gene expression. Treatment of non-seminoma cells with 5-aza-2'-deoxycytidine verified that methylation of all genes tested played a role in their silencing in yolk sac tumour cells and many of these genes were also differentially expressed in primary tumours. Genes silenced by methylation in the various GCT cell lines were identified. Several pluripotency-associated genes were identified as a major functional group of silenced genes.
Identification, isolation and expression analysis of auxin response factor (ARF) genes in Solanum lycopersicum.

Science.gov (United States)

Wu, Jian; Wang, Feiyan; Cheng, Lin; Kong, Fuling; Peng, Zhen; Liu, Songyu; Yu, Xiaolin; Lu, Gang

2011-11-01

Auxin response factor (ARF) genes encode transcription factors that bind specifically to the TGTCTC-containing auxin response elements found in the promoters of primary/early auxin response genes that regulate plant development. In this study, investigation of the tomato genome revealed 21 putative functional ARF genes (SlARFs), a number comparable to that found in Arabidopsis (23) and rice (25). The full cDNA sequences of 15 novel SlARFs were isolated and delineated by sequencing of PCR products. A comprehensive genome-wide analysis of this gene family is presented, including the gene structures, chromosome locations, phylogeny, and conserved motifs. In addition, a comparative analysis between ARF family genes in tomato and maize was performed. A phylogenetic tree generated from alignments of the full-length protein sequences of 25 OsARFs, 23 AtARFs, 31 ZmARFs, and 21 SlARFs revealed that these ARFs were clustered into four major groups. However, we could not find genes in rice, maize, or tomato homologous to AtARF12-15 and AtARF20-23. The expression patterns of tomato ARF genes were analyzed by quantitative real-time PCR. Our comparative analysis will help to define possible functions for many of these newly isolated ARF-family genes in plant development.
Gene set analysis for GWAS

DEFF Research Database (Denmark)

Debrabant, Birgit; Soerensen, Mette

2014-01-01

Abstract We discuss the use of modified Kolmogorov-Smirnov (KS) statistics in the context of gene set analysis and review corresponding null and alternative hypotheses. Especially, we show that, when enhancing the impact of highly significant genes in the calculation of the test statistic, the co...
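The abstract above is truncated, so the authors' exact statistic is not shown here. As a rough illustration of what a Kolmogorov-Smirnov-type running-sum statistic for a gene set can look like, the following sketch uses the widely known weighted form in which genes inside the set contribute in proportion to |score|^weight. Setting `weight=0` gives the classical unweighted statistic, and larger weights emphasize highly significant genes. The function name, gene names, and scores are hypothetical.

```python
def ks_enrichment_score(ranked, gene_set, weight=0.0):
    """Maximum deviation of a KS-like running sum over a ranked gene list.

    ranked: list of (gene, score) pairs sorted by decreasing score.
    gene_set: set of gene names to test.
    weight: exponent applied to |score| for genes in the set.
    """
    hits = [abs(s) ** weight if g in gene_set else 0.0 for g, s in ranked]
    total_hit = sum(hits)
    n_miss = sum(1 for g, _ in ranked if g not in gene_set)
    if total_hit == 0 or n_miss == 0:
        raise ValueError("gene set must partly overlap the ranked list")

    running = 0.0
    best = 0.0
    for (gene, _), hit in zip(ranked, hits):
        if gene in gene_set:
            running += hit / total_hit   # step up on a set member
        else:
            running -= 1.0 / n_miss      # step down otherwise
        if abs(running) > abs(best):
            best = running
    return best


# Hypothetical ranked list of four genes.
ranked = [("g1", 4.0), ("g2", 3.0), ("g3", 2.0), ("g4", 1.0)]
print(ks_enrichment_score(ranked, {"g1", "g2"}))
print(ks_enrichment_score(ranked, {"g1", "g3"}, weight=1.0))
```

A permutation scheme for assessing significance is deliberately left out of the sketch; the choice of null hypothesis is exactly what the record above discusses.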
Investigating the effect of paralogs on microarray gene-set analysis

LENUS (Irish Health Repository)

Faure, Andre J

2011-01-24

Abstract Background: In order to interpret the results obtained from a microarray experiment, researchers often shift focus from analysis of individual differentially expressed genes to analyses of sets of genes. These gene-set analysis (GSA) methods use previously accumulated biological knowledge to group genes into sets and then aim to rank these gene sets in a way that reflects their relative importance in the experimental situation in question. We suspect that the presence of paralogs affects the ability of GSA methods to accurately identify the most important sets of genes for subsequent research. Results: We show that paralogs, which typically have high sequence identity and similar molecular functions, also exhibit high correlation in their expression patterns. We investigate this correlation as a potential confounding factor common to current GSA methods using Indygene (http://www.cbio.uct.ac.za/indygene), a web tool that reduces a supplied list of genes so that it includes no pairwise paralogy relationships above a specified sequence similarity threshold. We use the tool to reanalyse previously published microarray datasets and determine the potential utility of accounting for the presence of paralogs. Conclusions: The Indygene tool efficiently removes paralogy relationships from a given dataset and we found that such a reduction, performed prior to GSA, has the ability to generate significantly different results that often represent novel and plausible biological hypotheses. This was demonstrated for three different GSA approaches when applied to the reanalysis of previously published microarray datasets and suggests that the redundancy and non-independence of paralogs is an important consideration when dealing with GSA methodologies.
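The list reduction described in the record above (no pairwise paralogy relationships above a similarity threshold) can be sketched with a simple greedy filter. This is not Indygene's algorithm, which the abstract does not describe; the function name, gene names, and similarity values are hypothetical, and the sketch assumes the input list is already ordered by priority.

```python
def remove_paralogs(genes, similarity, threshold):
    """Greedily keep genes that have no kept paralog above the threshold.

    genes: gene names in priority order (earlier genes are preferred).
    similarity: dict mapping frozenset({gene_a, gene_b}) to a similarity
        in [0, 1]; missing pairs are treated as unrelated (0.0).
    threshold: pairs with similarity strictly above this are paralogs.
    """
    kept = []
    for gene in genes:
        if all(similarity.get(frozenset((gene, other)), 0.0) <= threshold
               for other in kept):
            kept.append(gene)
    return kept


# Hypothetical pairwise sequence similarities.
similarity = {
    frozenset(("A1", "A2")): 0.92,
    frozenset(("A2", "B1")): 0.35,
    frozenset(("B1", "B2")): 0.71,
}
genes = ["A1", "A2", "B1", "B2"]

print(remove_paralogs(genes, similarity, 0.7))
print(remove_paralogs(genes, similarity, 0.8))
```

Because the filter is greedy, the result depends on the input order; ranking genes by differential expression first would keep the most strongly changed member of each paralog group.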

Correlation between Group B Streptococcal Genotypes, Their Antimicrobial Resistance Profiles, and Virulence Genes among Pregnant Women in Lebanon

Directory of Open Access Journals (Sweden)

Antoine Hannoun

2009-01-01

The antimicrobial susceptibility profiles of 76 Streptococcus agalactiae (Group B Streptococci [GBS]) isolates from vaginal specimens of pregnant women near term were correlated to their genotypes generated by Random Amplified Polymorphic DNA (RAPD) analysis and to their virulence factor-encoding genes cylE, lmb, scpB, rib, and bca, detected by PCR. Based on the distribution of the susceptibility patterns, six profiles were generated. RAPD analysis detected 7 clusters of genotypes. The cylE gene was present in 99% of the isolates, lmb in 96%, scpB in 94.7%, rib in 33%, and bca in 56.5% of isolates. The isolates demonstrated a significant correlation between antimicrobial resistance and genotype clusters, denoting the distribution of particular clones with different antimicrobial resistance profiles and calling for caution in the choice of therapeutic options. All virulence factor-encoding genes were detected in all seven genotypic clusters, with rib and bca not coexisting in the same genome.
Reporter gene bioassays in environmental analysis.

Science.gov (United States)

Köhler, S; Belkin, S; Schmid, R D

2000-01-01

In parallel to the continuous development of increasingly more sophisticated physical and chemical analytical technologies for the detection of environmental pollutants, there is a progressively more urgent need also for bioassays which report not only on the presence of a chemical but also on its bioavailability and its biological effects. As a partial fulfillment of that need, there has been a rapid development of biosensors based on genetically engineered bacteria. Such microorganisms typically combine a promoter-operator, which acts as the sensing element, with reporter gene(s) coding for easily detectable proteins. These sensors have the ability to detect global parameters such as stress conditions, toxicity or DNA-damaging agents as well as specific organic and inorganic compounds. The systems described in this review, designed to detect different groups of target chemicals, vary greatly in their detection limits, specificity, response times and more. These variations reflect on their potential applicability which, for most of the constructs described, is presently rather limited. Nevertheless, present trends promise that additional improvements will make microbial biosensors an important tool for future environmental analysis.
Genome-wide identification, characterisation and expression analysis of the MADS-box gene family in Prunus mume.

Science.gov (United States)

Xu, Zongda; Zhang, Qixiang; Sun, Lidan; Du, Dongliang; Cheng, Tangren; Pan, Huitang; Yang, Weiru; Wang, Jia

2014-10-01

MADS-box genes encode transcription factors that play crucial roles in plant development, especially in flower and fruit development. To gain insight into this gene family in Prunus mume, an important ornamental and fruit plant in East Asia, and to elucidate their roles in flower organ determination and fruit development, we performed a genome-wide identification, characterisation and expression analysis of MADS-box genes in this Rosaceae tree. In this study, 80 MADS-box genes were identified in P. mume and categorised into MIKC, Mα, Mβ, Mγ and Mδ groups based on gene structures and phylogenetic relationships. The MIKC group could be further classified into 12 subfamilies. The FLC subfamily was absent in P. mume and the six tandemly arranged DAM genes might have experienced a species-specific evolution process in P. mume. The MADS-box gene family might have evolved from MIKC genes to Mδ genes to Mα, Mβ and Mγ genes. The expression analysis suggests that P. mume MADS-box genes have diverse functions in P. mume development and that the functions of duplicated genes diverged after the duplication events. In addition to their involvement in the development of female gametophytes, type I genes also play roles in male gametophyte development. In conclusion, this study adds to our understanding of the roles that the MADS-box genes play in flower and fruit development and lays a foundation for selecting candidate genes for functional studies in P. mume and other species. Furthermore, this study also provides a basis to study the evolution of the MADS-box family.
Age-Specific Gene Expression Profiles of Rhesus Monkey Ovaries Detected by Microarray Analysis

Directory of Open Access Journals (Sweden)

Hengxi Wei

2015-01-01

The biological function of human ovaries declines with age. To identify the potential molecular changes in ovarian aging, we performed genome-wide gene expression analysis by microarray of ovaries from young, middle-aged, and old rhesus monkeys. Microarray data was validated by quantitative real-time PCR. Results showed that a total of 503 (60 upregulated, 443 downregulated) and 84 (downregulated) genes were differentially expressed in old ovaries compared to young and middle-aged groups, respectively. No difference in gene expression was found between middle-aged and young groups. Differentially expressed genes were mainly enriched in cell and organelle, cellular and physiological process, binding, and catalytic activity. These genes were primarily associated with KEGG pathways of cell cycle, DNA replication and repair, oocyte meiosis and maturation, MAPK, TGF-beta, and p53 signaling pathway. Genes upregulated were involved in aging, defense response, oxidation reduction, and negative regulation of cellular process; genes downregulated have functions in reproduction, cell cycle, DNA and RNA process, macromolecular complex assembly, and positive regulation of macromolecule metabolic process. These findings show that the monkey ovary undergoes substantial change in global transcription with age. Gene expression profiles are useful in understanding the mechanisms underlying ovarian aging and age-associated infertility in primates.
Digital gene expression analysis of gene expression differences within Brassica diploids and allopolyploids.

Science.gov (United States)

Jiang, Jinjin; Wang, Yue; Zhu, Bao; Fang, Tingting; Fang, Yujie; Wang, Youping

2015-01-27

Brassica includes many successfully cultivated crop species of polyploid origin, either by ancestral genome triplication or by hybridization between two diploid progenitors, displaying complex repetitive sequences and transposons. The U's triangle, which consists of three diploids and three amphidiploids, is optimal for the analysis of complicated genomes after polyploidization. Next-generation sequencing enables the transcriptome profiling of polyploids on a global scale. We examined the gene expression patterns of three diploids (Brassica rapa, B. nigra, and B. oleracea) and three amphidiploids (B. napus, B. juncea, and B. carinata) via digital gene expression analysis. In total, the libraries generated between 5.7 and 6.1 million raw reads, and the clean tags of each library were mapped to 18547-21995 genes of B. rapa genome. The unambiguous tag-mapped genes in the libraries were compared. Moreover, the majority of differentially expressed genes (DEGs) were explored among diploids as well as between diploids and amphidiploids. Gene ontological analysis was performed to functionally categorize these DEGs into different classes. The Kyoto Encyclopedia of Genes and Genomes analysis was performed to assign these DEGs into approximately 120 pathways, among which the metabolic pathway, biosynthesis of secondary metabolites, and peroxisomal pathway were enriched. The non-additive genes in Brassica amphidiploids were analyzed, and the results indicated that orthologous genes in polyploids are frequently expressed in a non-additive pattern. Methyltransferase genes showed differential expression pattern in Brassica species. Our results provided an understanding of the transcriptome complexity of natural Brassica species. The gene expression changes in diploids and allopolyploids may help elucidate the morphological and physiological differences among Brassica species.
[Single nucleotide polymorphisms of HIV coreceptor CCR5 gene in Chinese Yi ethnic group and its association with HIV infection].

Science.gov (United States)

Ma, Li-ying; Hong, Kun-xue; Lu, Xiao-zhi; Qin, Guang-ming; Chen, Jian-ping; Chen, Kang-lin; Ruan, Yu-hua; Xing, Hui; Zhu, Jia-hong; Shao, Yi-ming

2005-11-30

To investigate the single nucleotide polymorphisms (SNPs) of the HIV-1 coreceptor CCR5 gene in the Chinese Yi ethnic group and the association between these SNPs and HIV/AIDS. Peripheral blood samples were collected from 102 HIV-negative persons of Chinese Yi nationality, 87 males and 15 females, aged 23 (12-37), and 68 HIV carriers, 61 males and 7 females, aged 27 (17-51). The regulatory and structural regions of the HIV coreceptor CCR5 gene were amplified from the genomic DNA by nested PCR; each of the two regions was divided into three overlapping gene fragments. High throughput DHPLC was used for screening of unknown mutations in each gene fragment. The PCR products showing different peak traces from wild types in DHPLC were sequenced by forward and reverse primers respectively. The sequences were analyzed with the help of Sequence Navigator software to search for SNP loci. Statistical analysis with SPSS and PPAP software was performed to study the association between these SNPs and HIV infection. Five SNPs (A77G, G316A, T532C, C921T, and G668A) and an AGA deletion of nucleotides 686-688 were discovered in the coding region of this gene in the Chinese Yi ethnic group. The C921T mutation was a nonsense mutation, and the other SNPs (A77G, G316A, T532C, and G668A) were sense mutations, with the amino acid changes K26R, G106R, C178R, and R223Q. Only the frequency of the R223Q allele was high (0.08); those of the others were low (less than 0.01). There was no significant difference in allele frequency between the HIV negative and HIV positive groups (all P > 0.05). Five SNP loci (T58934G, G59029A, T59353C, G59402A, and C59653T) were found in the regulatory region of the CCR5 gene with high allelic frequencies of 0.1912-0.2941. Between the HIV negative and HIV positive groups, there were no differences at the SNP loci (all P > 0.05). Statistical analysis of the association between the linkage of mutation loci with HIV infection suggested a significant difference in the haplotype frequency
Principal Angle Enrichment Analysis (PAEA): Dimensionally Reduced Multivariate Gene Set Enrichment Analysis Tool.

Science.gov (United States)

Clark, Neil R; Szymkiewicz, Maciej; Wang, Zichen; Monteiro, Caroline D; Jones, Matthew R; Ma'ayan, Avi

2015-11-01

Gene set analysis of differential expression, which identifies collectively differentially expressed gene sets, has become an important tool for biology. The power of this approach lies in its reduction of the dimensionality of the statistical problem and its incorporation of biological interpretation by construction. Many approaches to gene set analysis have been proposed, but benchmarking their performance in the setting of real biological data is difficult due to the lack of a gold standard. In a previously published work we proposed a geometrical approach to differential expression which performed highly in benchmarking tests and compared well to the most popular methods of differential gene expression. As reported, this approach has a natural extension to gene set analysis which we call Principal Angle Enrichment Analysis (PAEA). PAEA employs dimensionality reduction and a multivariate approach for gene set enrichment analysis. However, the performance of this method has not previously been assessed, nor has it been implemented as a web-based tool. Here we describe new benchmarking protocols for gene set analysis methods and find that PAEA performs highly. The PAEA method is implemented as a user-friendly web-based tool, which contains 70 gene set libraries and is freely available to the community.
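The geometric idea behind a principal angle can be shown in a simplified special case. The sketch below assumes a single differential-expression direction (one value per gene) and treats a gene set as the coordinate subspace spanned by its member genes; the principal angle is then the arccosine of the fraction of the direction's length that lies inside that subspace. This is not the PAEA implementation, which works in a dimensionally reduced multivariate setting; the function name and data are hypothetical.

```python
import math


def principal_angle(direction, genes, gene_set):
    """Angle (radians) between a direction vector and a gene-set subspace.

    direction: one expression-change value per gene.
    genes: gene names, in the same order as `direction`.
    gene_set: genes whose coordinate axes span the subspace.
    """
    norm_sq = sum(x * x for x in direction)
    if norm_sq == 0:
        raise ValueError("direction must be non-zero")
    # Squared length of the projection onto the gene-set coordinates.
    proj_sq = sum(x * x for g, x in zip(genes, direction) if g in gene_set)
    return math.acos(min(1.0, math.sqrt(proj_sq / norm_sq)))


# Hypothetical direction over three genes.
genes = ["g1", "g2", "g3"]
direction = [3.0, 4.0, 0.0]

print(principal_angle(direction, genes, {"g1", "g2"}))  # direction lies in the subspace
print(principal_angle(direction, genes, {"g3"}))        # direction is orthogonal to it
```

A smaller angle means the gene set captures more of the expression change, so gene sets could be ranked by increasing angle.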
Analysis of baseline gene expression levels from ...

Science.gov (United States)

The use of gene expression profiling to predict chemical mode of action would be enhanced by better characterization of variance due to individual, environmental, and technical factors. Meta-analysis of microarray data from untreated or vehicle-treated animals within the control arm of toxicogenomics studies has yielded useful information on baseline fluctuations in gene expression. A dataset of control animal microarray expression data was assembled by a working group of the Health and Environmental Sciences Institute's Technical Committee on the Application of Genomics in Mechanism Based Risk Assessment in order to provide a public resource for assessments of variability in baseline gene expression. Data from over 500 Affymetrix microarrays from control rat liver and kidney were collected from 16 different institutions. Thirty-five biological and technical factors were obtained for each animal, describing a wide range of study characteristics, and a subset were evaluated in detail for their contribution to total variability using multivariate statistical and graphical techniques. The study factors that emerged as key sources of variability included gender, organ section, strain, and fasting state. These and other study factors were identified as key descriptors that should be included in the minimal information about a toxicogenomics study needed for interpretation of results by an independent source. Genes that are the most and least variable, gender-selectiv
Genome-wide identification and expression analysis of SBP-like transcription factor genes in Moso Bamboo (Phyllostachys edulis).

Science.gov (United States)

Pan, Feng; Wang, Yue; Liu, Huanglong; Wu, Min; Chu, Wenyuan; Chen, Danmei; Xiang, Yan

2017-06-27

The SQUAMOSA promoter binding protein-like (SPL) proteins are plant-specific transcription factors (TFs) that function in a variety of developmental processes including growth, flower development, and signal transduction. SPL proteins are encoded by a gene family, and these genes have been characterized in two model grass species, Zea mays and Oryza sativa. The SPL gene family has not been well studied in moso bamboo (Phyllostachys edulis), a woody grass species. We identified 32 putative PeSPL genes in the P. edulis genome. Phylogenetic analysis arranged the PeSPL protein sequences in eight groups. Similarly, phylogenetic analysis of the SBP-like and SBP proteins from rice and maize clustered them into eight groups analogous to those from P. edulis. Furthermore, the deduced PeSPL proteins in each group contained very similar conserved sequence motifs. Our analyses indicate that the PeSPL genes experienced a large-scale duplication event ~15 million years ago (MYA), and that divergence between the PeSPL and OsSPL genes occurred 34 MYA. The stress-response expression profiles and tissue-specificity of the putative PeSPL gene promoter regions showed that SPL genes in moso bamboo have potential biological functions in stress resistance as well as in growth and development. We therefore examined PeSPL gene expression in response to different plant hormone and drought (polyethylene glycol-6000; PEG) treatments to mimic biotic and abiotic stresses. Expression of three (PeSPL10, -12, -17), six (PeSPL1, -10, -12, -17, -20, -31), and nine (PeSPL5, -8, -9, -14, -15, -19, -20, -31, -32) genes remained relatively stable after treating with salicylic acid (SA), gibberellic acid (GA), and PEG, respectively, while the expression patterns of other genes changed. In addition, analysis of tissue-specific expression of the moso bamboo SPL genes during development showed differences in their spatiotemporal expression patterns, and many were expressed at high levels in flowers and
GxGrare: gene-gene interaction analysis method for rare variants from high-throughput sequencing data.

Science.gov (United States)

Kwon, Minseok; Leem, Sangseob; Yoon, Joon; Park, Taesung

2018-03-19

With the rapid advancement of array-based genotyping techniques, genome-wide association studies (GWAS) have successfully identified common genetic variants associated with common complex diseases. However, it has been shown that only a small proportion of the genetic etiology of complex diseases could be explained by the genetic factors identified from GWAS. This missing heritability could possibly be explained by gene-gene interaction (epistasis) and rare variants. There has been an exponential growth of gene-gene interaction analysis for common variants in terms of methodological developments and practical applications. Also, the recent advancement of high-throughput sequencing technologies makes it possible to conduct rare variant analysis. However, little progress has been made in gene-gene interaction analysis for rare variants. Here, we propose GxGrare, a new gene-gene interaction method for rare variants in the framework of multifactor dimensionality reduction (MDR) analysis. The proposed method consists of three steps: (1) collapsing the rare variants, (2) MDR analysis of the collapsed rare variants, and (3) detection of top candidate interaction pairs. GxGrare can be used for the detection of not only gene-gene interactions, but also interactions within a single gene. The proposed method is illustrated with 1080 whole exome sequencing data of the Korean population in order to identify causal gene-gene interaction for rare variants for type 2 diabetes. The proposed GxGrare performs well for gene-gene interaction detection with collapsing of rare variants. GxGrare is available at http://bibs.snu.ac.kr/software/gxgrare which contains simulation data and documentation. Supported operating systems include Linux and OS X.
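The three steps listed in the record above can be sketched in a few lines. This is an illustrative simplification, not the GxGrare code: it assumes a carrier/non-carrier collapsing rule, labels a genotype cell high-risk when its case:control ratio exceeds the overall ratio, scores each gene pair by balanced accuracy on the full data (no cross-validation), and uses made-up genotypes for hypothetical genes A, B, and C.

```python
from itertools import combinations


def collapse_rare_variants(variants_by_gene):
    """Step 1: per gene, 1 if a person carries any rare allele, else 0."""
    return {gene: [int(any(count > 0 for count in person)) for person in people]
            for gene, people in variants_by_gene.items()}


def mdr_balanced_accuracy(x, y, status):
    """Step 2: MDR-style high/low-risk labelling for one gene pair."""
    n_case = sum(status)
    n_ctrl = len(status) - n_case
    if n_case == 0 or n_ctrl == 0:
        raise ValueError("need both cases and controls")
    cells = {}
    for a, b, s in zip(x, y, status):
        cells.setdefault((a, b), [0, 0])[s] += 1  # [controls, cases]
    true_pos = true_neg = 0
    for ctrl, case in cells.values():
        if case * n_ctrl > ctrl * n_case:  # cell ratio above overall ratio
            true_pos += case               # high-risk cell: cases are correct
        else:
            true_neg += ctrl               # low-risk cell: controls are correct
    return 0.5 * (true_pos / n_case + true_neg / n_ctrl)


def rank_pairs(collapsed, status):
    """Step 3: order gene pairs by decreasing balanced accuracy."""
    scores = {(g1, g2): mdr_balanced_accuracy(collapsed[g1], collapsed[g2], status)
              for g1, g2 in combinations(sorted(collapsed), 2)}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Made-up rare-allele counts: per gene, one list of variant counts per person.
variants_by_gene = {
    "A": [[1, 0], [0, 1], [0, 0], [0, 0], [1, 0], [0, 2], [0, 0], [0, 0]],
    "B": [[0], [0], [1], [1], [1], [1], [0], [0]],
    "C": [[1], [0], [1], [0], [1], [0], [1], [0]],
}
status = [1, 1, 1, 1, 0, 0, 0, 0]  # 1 = case, 0 = control

collapsed = collapse_rare_variants(variants_by_gene)
print(rank_pairs(collapsed, status))
```

In this toy data the cases carry a rare variant in exactly one of A and B, so the A-B pair separates cases from controls while pairs involving C do not.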
Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool.

Science.gov (United States)

Chen, Edward Y; Tan, Christopher M; Kou, Yan; Duan, Qiaonan; Wang, Zichen; Meirelles, Gabriela Vaz; Clark, Neil R; Ma'ayan, Avi

2013-04-15

System-wide profiling of genes and proteins in mammalian cells produces lists of differentially expressed genes/proteins that need to be further analyzed for their collective functions in order to extract new knowledge. Once unbiased lists of genes or proteins are generated from such experiments, these lists are used as input for computing enrichment with existing lists created from prior knowledge organized into gene-set libraries. While many enrichment analysis tools and gene-set library databases have been developed, there is still room for improvement. Here, we present Enrichr, an integrative web-based and mobile software application that includes new gene-set libraries, an alternative approach to rank enriched terms, and various interactive visualization approaches to display enrichment results using the JavaScript library Data Driven Documents (D3). The software can also be embedded into any tool that performs gene list analysis. We applied Enrichr to analyze nine cancer cell lines by comparing their enrichment signatures to the enrichment signatures of matched normal tissues. We observed a common pattern of upregulation of the polycomb group PRC2 and enrichment for the histone mark H3K27me3 in many cancer cell lines, as well as alterations in Toll-like receptor and interleukin signaling in K562 cells when compared with normal myeloid CD33+ cells. Such analyses provide global visualization of critical differences between normal tissues and cancer cell lines but can be applied to many other scenarios. Enrichr is an easy-to-use, intuitive, web-based enrichment analysis tool providing various types of visualization summaries of collective functions of gene lists. Enrichr is open source and freely available online at: http://amp.pharm.mssm.edu/Enrichr.
Model-based gene set analysis for Bioconductor.

Science.gov (United States)

Bauer, Sebastian; Robinson, Peter N; Gagneur, Julien

2011-07-01

Gene Ontology and other forms of gene-category analysis play a major role in the evaluation of high-throughput experiments in molecular biology. Single-category enrichment analysis procedures such as Fisher's exact test tend to flag large numbers of redundant categories as significant, which can complicate interpretation. We have recently developed an approach called model-based gene set analysis (MGSA), that substantially reduces the number of redundant categories returned by the gene-category analysis. In this work, we present the Bioconductor package mgsa, which makes the MGSA algorithm available to users of the R language. Our package provides a simple and flexible application programming interface for applying the approach. The mgsa package has been made available as part of Bioconductor 2.8. It is released under the conditions of the Artistic license 2.0. peter.robinson@charite.de; julien.gagneur@embl.de.
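For context, the single-category test that this abstract contrasts with MGSA can be sketched as a one-sided Fisher's exact test, which is the upper tail of a hypergeometric distribution. The function name and the toy numbers are illustrative assumptions; this is not the mgsa package API.

```python
from math import comb

def enrichment_p(n_universe, n_category, n_study, n_overlap):
    """One-sided Fisher's exact test: P(overlap >= n_overlap) under random sampling."""
    total = comb(n_universe, n_study)
    upper = min(n_category, n_study)
    tail = sum(
        comb(n_category, k) * comb(n_universe - n_category, n_study - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total

# Toy case: 10 genes in the universe, a 3-gene category, a 3-gene study set,
# and all 3 study genes fall in the category.
p = enrichment_p(10, 3, 3, 3)
```

Because each category is tested on its own, nested or overlapping categories that share the same study genes all receive small p-values, which is the redundancy the abstract describes.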
Microarray analysis of the gene expression profile in triethylene glycol dimethacrylate-treated human dental pulp cells.

Science.gov (United States)

Torun, D; Torun, Z Ö; Demirkaya, K; Sarper, M; Elçi, M P; Avcu, F

2017-11-01

Triethylene glycol dimethacrylate (TEGDMA) is an important resin monomer commonly used in the structure of dental restorative materials. Recent studies have shown that unpolymerized resin monomers may be released into the oral environment and cause harmful biological effects. We investigated changes in the gene expression profiles of TEGDMA-treated human dental pulp cells (hDPCs) following short- (1-day) and long-term (7-day) exposure. hDPCs were exposed to a noncytotoxic concentration of TEGDMA, and gene expression profiles were evaluated by microarray analysis. The results were confirmed by quantitative reverse-transcriptase PCR (qRT-PCR). In total, 1282 and 1319 genes (up- or down-regulated) were differentially expressed compared with the control group after the 1- and 7-day incubation periods, respectively. Biological ontology-based analyses revealed that metabolic, cellular, and developmental processes constituted the largest groups of biological functional processes. qRT-PCR analysis of the bone morphogenetic protein-2 (BMP-2), BMP-4, secreted protein, acidic, cysteine-rich, collagen type I alpha 1, oxidative stress-induced growth inhibitor 1, MMP3, interleukin-6, and heme oxygenase-1 genes confirmed the changes in expression observed in the microarray analysis. Our results suggest that TEGDMA can change many functions of hDPCs through large changes in gene expression levels and complex interactions with different signaling pathways.
Meta-analysis of peripheral blood gene expression modules for COPD phenotypes.

Directory of Open Access Journals (Sweden)

Dominik Reinhold

Chronic obstructive pulmonary disease (COPD) occurs typically in current or former smokers, but only a minority of people with a smoking history develop the disease. Besides environmental factors, genetics is an important risk factor for COPD. However, the relationship between genetics, environment and phenotypes is not well understood. Sample sizes for genome-wide expression studies based on lung tissue have been small due to the invasive nature of sample collection. Increasing evidence for the systemic nature of the disease makes blood a good alternative source to study the disease, but there have also been few large-scale blood genomic studies in COPD. Due to the complexity and heterogeneity of COPD, examining groups of interacting genes may have more relevance than identifying individual genes. Therefore, we used Weighted Gene Co-expression Network Analysis to find groups of genes (modules) that are highly connected. However, module definitions may vary between individual data sets. To alleviate this problem, we used a consensus module definition based on two cohorts, COPDGene and ECLIPSE. We studied the relationship between the consensus modules and the COPD phenotypes airflow obstruction and emphysema. We also used these consensus module definitions on an independent cohort (TESRA) and performed a meta-analysis involving all data sets. We found several modules that are associated with COPD phenotypes, are enriched in functional categories and are overrepresented for cell-type specific genes. Of the 14 consensus modules, three were strongly associated with airflow obstruction (meta p ≤ 0.0002), and two had some association with emphysema (meta p ≤ 0.06); some associations were stronger in the case-control cohorts, and others in the cases-only subcohorts. Gene Ontology terms that were overrepresented included "immune response" and "defense response." The cell types whose type-specific genes were overrepresented in modules (p < 0.05) included
Molecular elucidation of a new allelic variation at the Sg-5 gene associated with the absence of group A saponins in wild soybean.

Science.gov (United States)

Sundaramoorthy, Jagadeesh; Park, Gyu Tae; Mukaiyama, Kyosuke; Tsukamoto, Chigen; Chang, Jeong Ho; Lee, Jeong-Dong; Kim, Jeong Hoe; Seo, Hak Soo; Song, Jong Tae

2018-01-01

In soybean, triterpenoid saponin is one of the major secondary metabolites and is further classified into group A and DDMP saponins. Although they have known health benefits for humans and animals, acetylation of group A saponins causes bitterness and gives an astringent taste to soy products. Therefore, several studies are being conducted to eliminate acetylated group A saponins. Previous studies have isolated and characterized the Sg-5 (Glyma.15g243300) gene, which encodes the cytochrome P450 72A69 enzyme and is responsible for soyasapogenol A biosynthesis. In this study, we elucidated the molecular identity of a novel mutant of Glycine soja, 'CWS5095'. Phenotypic analysis using TLC and LC-PDA/MS/MS showed that the mutant 'CWS5095' did not produce any group A saponins. Segregation analysis showed that the absence of group A saponins is controlled by a single recessive allele. The locus was mapped on chromosome 15 (4.3 Mb) between Affx-89193969 and Affx-89134397 where the previously identified Glyma.15g243300 gene is positioned. Sequence analysis of the coding region for the Glyma.15g243300 gene revealed the presence of four SNPs in 'CWS5095' compared to the control lines. One of these four SNPs (G1127A) leads to the amino acid change Arg376Lys in the EXXR motif, which is invariably conserved among the CYP450 superfamily proteins. Co-segregation analysis showed that the missense mutation (Arg376Lys) was tightly linked with the absence of group A saponins in 'CWS5095'. Even though Arg and Lys have similar chemical features, the 3D modelled protein structure indicates that the replacement of Arg with Lys may cause a loss-of-function of the Sg-5 protein by inhibiting the stable binding of a heme cofactor to the CYP72A69 apoenzyme.
Transcriptome analysis describing new immunity and defense genes in peripheral blood mononuclear cells of rheumatoid arthritis patients.

Directory of Open Access Journals (Sweden)

Vitor Hugo Teixeira

BACKGROUND: Large-scale gene expression profiling of peripheral blood mononuclear cells from Rheumatoid Arthritis (RA) patients could provide a molecular description that reflects the contribution of diverse cellular responses associated with this disease. The aim of our study was to identify peripheral blood gene expression profiles for RA patients, using Illumina technology, to gain insights into RA molecular mechanisms. METHODOLOGY/PRINCIPAL FINDINGS: The Illumina Human-6v2 Expression BeadChips were used for a complete genome-wide transcript profiling of peripheral blood mononuclear cells (PBMCs) from 18 RA patients and 15 controls. Differential analysis per gene was performed with one-way analysis of variance (ANOVA) and P values were adjusted to control the False Discovery Rate (FDR<5%). Genes differentially expressed at a significant level between patients and controls were analyzed using Gene Ontology (GO) in the PANTHER database to identify biological processes. Differential expression of 339 Reference Sequence genes (238 down-regulated and 101 up-regulated) between the two groups was observed. We identified a remarkably elevated expression of a spectrum of genes involved in Immunity and Defense in PBMCs of RA patients compared to controls. This result is confirmed by GO analysis, suggesting that these genes could be activated systemically in RA. No significant down-regulated ontology groups were found. Microarray data were validated by real-time PCR in a set of nine genes showing a high degree of correlation. CONCLUSIONS/SIGNIFICANCE: Our study highlighted several new genes that could contribute to the identification of innovative clinical biomarkers for diagnostic procedures and therapeutic interventions.
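The abstract says P values were adjusted to control the FDR but does not name the procedure. As a hedged illustration only, the sketch below assumes the widely used Benjamini-Hochberg step-up adjustment; the function name and the example P values are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted P values in the input order (assumed procedure)."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down so each adjusted value is monotone in rank.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

# Hypothetical per-gene ANOVA P values.
raw = [0.01, 0.04, 0.03, 0.5]
adj = benjamini_hochberg(raw)
significant = [i for i, q in enumerate(adj) if q < 0.05]
```

With these four made-up values only the first gene stays below the 5% threshold after adjustment, even though three raw P values are below 0.05.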
PHYLOGENETIC RELATIONSHIPS AMONGST 10 Durio SPECIES BASED ON PCR-RFLP ANALYSIS OF TWO CHLOROPLAST GENES

Directory of Open Access Journals (Sweden)

Panca J. Santoso

2013-07-01

Twenty-seven species of Durio have been identified in Sabah and Sarawak, Malaysia, but their relationships have not been studied. This study was conducted to analyse phylogenetic relationships amongst 10 Durio species in Malaysia using PCR-RFLP on two chloroplast DNA genes, i.e. ndhC-trnV and rbcL. DNAs were extracted from young leaves of 11 accessions from 10 Durio species collected from the Tenom Agriculture Research Station, Sabah, and University Agriculture Park, Universiti Putra Malaysia. Two pairs of oligonucleotide primers, N1-N2 and rbcL1-rbcL2, were used to flank the target regions ndhC-trnV and rbcL. Eight restriction enzymes, HindIII, BsuRI, PstI, TaqI, MspI, SmaI, BshNI, and EcoR130I, were used to digest the amplicons. Based on the results of PCR-RFLP on the ndhC-trnV gene, the 10 Durio species were grouped into five distinct clusters, and the accessions generally showed high variation. However, based on the results of PCR-RFLP on the rbcL gene, the species were grouped into three distinct clusters and generally showed low variation. This means that the ndhC-trnV gene is more reliable for phylogenetic analysis at lower taxonomic levels of Durio species or for diversity analysis, while the rbcL gene is a reliable marker for phylogenetic analysis at higher taxonomic levels. PCR-RFLP on the ndhC-trnV and rbcL genes could therefore be considered a useful marker system for phylogenetic analysis amongst Durio species. These findings might be used for further molecular marker-assisted work in Durio breeding programs.
Genome-wide analysis of the MYB gene family in physic nut (Jatropha curcas L.).

Science.gov (United States)

Zhou, Changpin; Chen, Yanbo; Wu, Zhenying; Lu, Wenjia; Han, Jinli; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

2015-11-01

The MYB proteins comprise one of the largest transcription factor families in plants, and play key roles in regulatory networks controlling development, metabolism, and stress responses. A total of 125 MYB genes (JcMYB) have been identified in the physic nut (Jatropha curcas L.) genome, including 120 2R-type MYB, 4 3R-MYB, and 1 4R-MYB genes. Based on exon-intron arrangement of MYBs from both lower (Physcomitrella patens) and higher (physic nut, Arabidopsis, and rice) plants, we can classify plant MYB genes into ten groups (MI-X), except for MIX genes which are nonexistent in higher plants. We also observed that MVIII genes may be one of the most ancient MYB types which consist of both R2R3- and 3R-MYB genes. Most MYB genes (76.8% in physic nut) belong to the MI group which can be divided into 34 subgroups. The JcMYB genes were nonrandomly distributed on its 11 linkage groups (LGs). The expansion of MYB genes across several subgroups was observed and resulted from genome triplication of ancient dicotyledons and from both ancient and recent tandem duplication events in the physic nut genome. The expression patterns of several MYB duplicates in the physic nut showed differences in four tissues (root, stem, leaf, and seed), and 34 MYB genes responded to at least one abiotic stressor (drought, salinity, phosphate starvation, and nitrogen starvation) in leaves and/or roots based on the data analysis of digital gene expression tags. Overexpression of the JcMYB001 gene in Arabidopsis increased its sensitivity to drought and salinity stresses. Copyright © 2015 Elsevier B.V. All rights reserved.
Final report of the specific research. Investigations on the analysis of bio-protective factors against radiation. 1998-2000 FY (Research Group of NIRS)

International Nuclear Information System (INIS)

2002-03-01

This report concerns investigations in the title conducted by 8 groups of the National Institute of Radiological Sciences (NIRS) during the period of 1998-2000. The groups are for investigation of: Effects of p53 tumor suppressor gene in radiation-induced leukemia, Role of atm-gene in dose rate effect of ionizing radiation, Function of DNA-dependent protein kinase catalytic subunit (DNA-PKcs), Functional complementation of radiation-sensitive mutant M10 cell line by human XRCC4 cDNA expression, Role of radiation-induced apoptosis in digital defects in embryonic mice, Functional analysis of S-phase specific novel nuclear protein NP95 by gene targeting, Role of chemokine in T cell development and lymphomagenesis, and establishment of production techniques of gene-modified mice using embryonic stem cells for genetic analysis of radiation-sensitive genes. The groups describe summaries of their studies, and published original articles are also given. (N.I.)
Final report of the specific research. Investigations on the analysis of bio-protective factors against radiation. 1998-2000 FY (Research Group of NIRS)

Energy Technology Data Exchange (ETDEWEB)

NONE

2002-03-01

This report concerns investigations in the title conducted by 8 groups of the National Institute of Radiological Sciences (NIRS) during the period of 1998-2000. The groups are for investigation of: Effects of p53 tumor suppressor gene in radiation-induced leukemia, Role of atm-gene in dose rate effect of ionizing radiation, Function of DNA-dependent protein kinase catalytic subunit (DNA-PKcs), Functional complementation of radiation-sensitive mutant M10 cell line by human XRCC4 cDNA expression, Role of radiation-induced apoptosis in digital defects in embryonic mice, Functional analysis of S-phase specific novel nuclear protein NP95 by gene targeting, Role of chemokine in T cell development and lymphomagenesis, and establishment of production techniques of gene-modified mice using embryonic stem cells for genetic analysis of radiation-sensitive genes. The groups describe summaries of their studies, and published original articles are also given. (N.I.)

batman Interacts with polycomb and trithorax group genes and encodes a BTB/POZ protein that is included in a complex containing GAGA factor.

Science.gov (United States)

Faucheux, M; Roignant, J-Y; Netter, S; Charollais, J; Antoniewski, C; Théodore, L

2003-02-01

Polycomb and trithorax group genes maintain the appropriate repressed or activated state of homeotic gene expression throughout Drosophila melanogaster development. We have previously identified the batman gene as a Polycomb group candidate since its function is necessary for the repression of Sex combs reduced. However, our present genetic analysis indicates functions of batman in both activation and repression of homeotic genes. The 127-amino-acid Batman protein consists almost entirely of a BTB/POZ domain, an evolutionarily conserved protein-protein interaction domain found in a large protein family. We show that this domain is involved in the interaction between Batman and the DNA-binding GAGA factor encoded by the Trithorax-like gene. The GAGA factor and Batman codistribute on polytene chromosomes, coimmunoprecipitate from nuclear embryonic and larval extracts, and interact in the yeast two-hybrid assay. Batman, together with the GAGA factor, binds to MHS-70, a 70-bp fragment of the bithoraxoid Polycomb response element. This binding, like that of the GAGA factor, requires the presence of d(GA)n sequences. Together, our results suggest that batman belongs to a subset of the Polycomb/trithorax group of genes that includes Trithorax-like, whose products are involved in both activation and repression of homeotic genes.
Analysis of breast cancer metastasis candidate genes from next generation-sequencing via systematic functional genomics

DEFF Research Database (Denmark)

Blomstrøm, Monica Marie

2016-01-01

several growth modulators and invasion modulators were identified and independently validated. These candidates revealed a group of genes with metastasis-related functions in vitro that are involved in RNA-related processes, such as RNA-processing. Moreover, a general feature was that proliferation......) and non-CSCs. The main goal of this project was to functionally characterize a set of candidate genes recovered from next-generation sequencing analysis for their role in breast cancer metastasis formation. The starting gene set comprised 104 gene variants; i.e. 57 wildtype and 47 mutated variants. During...
Genome-wide identification and expression analysis of the WRKY gene family in cassava

Directory of Open Access Journals (Sweden)

Yunxie Wei

2016-02-01

The WRKY family, a large family of transcription factors (TFs) found in higher plants, plays central roles in many aspects of physiological processes and adaptation to the environment. However, little information is available regarding the WRKY family in cassava (Manihot esculenta). In the present study, 85 WRKY genes were identified from the cassava genome and classified into three groups according to conserved WRKY domains and zinc-finger structure. Conserved motif analysis showed that all of the identified MeWRKYs had the conserved WRKY domain. Gene structure analysis suggested that the number of introns in MeWRKY genes varied from 1 to 5, with the majority of MeWRKY genes containing 3 exons. Expression profiles of MeWRKY genes in different tissues and in response to drought stress were analyzed using the RNA-seq technique. The results showed that 72 MeWRKY genes had differential expression in their transcript abundance and 78 MeWRKY genes were differentially expressed in response to drought stresses in different accessions, indicating their contribution to plant developmental processes and drought stress resistance in cassava. Finally, the expression of 9 WRKY genes was analyzed by qRT-PCR under osmotic, salt, ABA, H2O2, and cold treatments, indicating that MeWRKYs may be involved in different signaling pathways. Taken together, this systematic analysis identifies some tissue-specific and abiotic stress-responsive candidate MeWRKY genes for further functional assays in planta, and provides a solid foundation for understanding of abiotic stress responses and signal transduction mediated by WRKYs in cassava.
Genome-Wide Identification and Expression Analysis of the WRKY Gene Family in Cassava.

Science.gov (United States)

Wei, Yunxie; Shi, Haitao; Xia, Zhiqiang; Tie, Weiwei; Ding, Zehong; Yan, Yan; Wang, Wenquan; Hu, Wei; Li, Kaimian

2016-01-01

The WRKY family, a large family of transcription factors (TFs) found in higher plants, plays central roles in many aspects of physiological processes and adaptation to the environment. However, little information is available regarding the WRKY family in cassava (Manihot esculenta). In the present study, 85 WRKY genes were identified from the cassava genome and classified into three groups according to conserved WRKY domains and zinc-finger structure. Conserved motif analysis showed that all of the identified MeWRKYs had the conserved WRKY domain. Gene structure analysis suggested that the number of introns in MeWRKY genes varied from 1 to 5, with the majority of MeWRKY genes containing three exons. Expression profiles of MeWRKY genes in different tissues and in response to drought stress were analyzed using the RNA-seq technique. The results showed that 72 MeWRKY genes had differential expression in their transcript abundance and 78 MeWRKY genes were differentially expressed in response to drought stresses in different accessions, indicating their contribution to plant developmental processes and drought stress resistance in cassava. Finally, the expression of 9 WRKY genes was analyzed by qRT-PCR under osmotic, salt, ABA, H2O2, and cold treatments, indicating that MeWRKYs may be involved in different signaling pathways. Taken together, this systematic analysis identifies some tissue-specific and abiotic stress-responsive candidate MeWRKY genes for further functional assays in planta, and provides a solid foundation for understanding of abiotic stress responses and signal transduction mediated by WRKYs in cassava.
Genome-wide survey of flavonoid biosynthesis genes and gene expression analysis between black- and yellow-seeded Brassica napus

Directory of Open Access Journals (Sweden)

Cunmin Qu

2016-12-01

Flavonoids, the compounds that impart color to fruits, flowers, and seeds, are the most widespread secondary metabolites in plants. However, a systematic analysis of these loci has not been performed in Brassicaceae. In this study, we isolated 649 nucleotide sequences related to flavonoid biosynthesis, i.e., the Transparent Testa (TT) genes, and their associated amino acid sequences in 17 Brassicaceae species, grouped into Arabidopsis or Brassicaceae subgroups. Moreover, 36 copies of 21 genes of the flavonoid biosynthesis pathway were identified in A. thaliana, 53 in B. rapa, 50 in B. oleracea, and 95 in B. napus, followed by analysis of their genomic distribution, collinearity, and gene triplication among Brassicaceae species. The results showed that extensive gene loss, whole-genome triplication, and diploidization occurred after divergence from the common ancestor. Using qRT-PCR, we analyzed the expression of eighteen flavonoid biosynthesis genes in 6 yellow- and black-seeded B. napus inbred lines with different genetic backgrounds and found that 12 of them were preferentially expressed during seed development, whereas the remaining genes were expressed in all B. napus tissues examined. Moreover, fourteen of these genes showed significant differences in expression level during seed development, and all but four of these (i.e., BnTT5, BnTT7, BnTT10, and BnTTG1) had similar expression patterns among the yellow- and black-seeded B. napus. Results showed that the structural genes (BnTT3, BnTT18, and BnBAN), regulatory genes (BnTTG2 and BnTT16), and three genes encoding transfer proteins (BnTT12, BnTT19, and BnAHA10) might play crucial roles in the formation of different seed coat colors in B. napus. These data will be helpful for illustrating the molecular mechanisms of flavonoid biosynthesis in Brassicaceae species.
Separate enrichment analysis of pathways for up- and downregulated genes.

Science.gov (United States)

Hong, Guini; Zhang, Wenjing; Li, Hongdong; Shen, Xiaopei; Guo, Zheng

2014-03-06

Two strategies are often adopted for enrichment analysis of pathways: the analysis of all differentially expressed (DE) genes together or the analysis of up- and downregulated genes separately. However, few studies have examined the rationales of these enrichment analysis strategies. Using both microarray and RNA-seq data, we show that gene pairs with functional links in pathways tended to have positively correlated expression levels, which could result in an imbalance between the up- and downregulated genes in particular pathways. We then show that the imbalance could greatly reduce the statistical power for finding disease-associated pathways through the analysis of all-DE genes. Further, using gene expression profiles from five types of tumours, we illustrate that the separate analysis of up- and downregulated genes could identify more pathways that are really pertinent to phenotypic difference. In conclusion, analysing up- and downregulated genes separately is more powerful than analysing all of the DE genes together.
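The loss of power described in this abstract can be illustrated with a toy calculation. The sketch below assumes a one-sided hypergeometric enrichment test and entirely hypothetical gene counts; the abstract does not state which enrichment statistic the authors used.

```python
from math import comb

def upper_tail(universe, pathway, study, overlap):
    """One-sided hypergeometric p-value: P(X >= overlap)."""
    total = comb(universe, study)
    hits = range(overlap, min(pathway, study) + 1)
    return sum(
        comb(pathway, k) * comb(universe - pathway, study - k) for k in hits
    ) / total

# Hypothetical numbers: a 50-gene pathway in a 1000-gene universe whose
# differentially expressed members are all upregulated (an imbalanced pathway).
UNIVERSE, PATHWAY = 1000, 50
N_UP, N_DOWN = 100, 100
UP_IN_PATHWAY, DOWN_IN_PATHWAY = 15, 0

p_all = upper_tail(UNIVERSE, PATHWAY, N_UP + N_DOWN, UP_IN_PATHWAY + DOWN_IN_PATHWAY)
p_up = upper_tail(UNIVERSE, PATHWAY, N_UP, UP_IN_PATHWAY)
print(f"all DE genes: p = {p_all:.3g}; upregulated only: p = {p_up:.3g}")
```

Pooling the 100 downregulated genes doubles the study set without adding any pathway hits, so the same 15 genes look far less surprising; testing the upregulated list alone gives a much smaller p-value.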
Identification of conserved drought-adaptive genes using a cross-species meta-analysis approach.

Science.gov (United States)

Shaar-Moshe, Lidor; Hübner, Sariel; Peleg, Zvi

2015-05-03

Drought is the major environmental stress threatening crop-plant productivity worldwide. Identification of new genes and metabolic pathways involved in plant adaptation to progressive drought stress at the reproductive stage is of great interest for agricultural research. We developed a novel Cross-Species meta-Analysis of progressive Drought stress at the reproductive stage (CSA:Drought) to identify key drought-adaptive genes and mechanisms and to test their evolutionary conservation. Empirically defined filtering criteria were used to facilitate a robust integration of 17 deposited microarray experiments (148 arrays) of Arabidopsis, rice, wheat and barley. By prioritizing consistency over intensity, our approach was able to identify 225 differentially expressed genes shared across studies and taxa. Gene ontology enrichment and pathway analyses classified the shared genes into functional categories involved predominantly in metabolic processes (e.g. amino acid and carbohydrate metabolism), regulatory function (e.g. protein degradation and transcription) and response to stimulus. We further investigated drought-related cis-acting elements in the shared gene promoters, and the evolutionary conservation of shared genes. The universal nature of the identified drought-adaptive genes was further validated in a fifth species, Brachypodium distachyon, that was not included in the meta-analysis. qPCR analysis of 27 randomly selected shared orthologs showed expression patterns similar to those found by the CSA:Drought. In accordance, morpho-physiological characterization of progressive drought stress in B. distachyon highlighted the key role of osmotic adjustment as an evolutionarily conserved drought-adaptive mechanism. Our CSA:Drought strategy highlights major drought-adaptive genes and metabolic pathways that were only partially, if at all, reported in the original studies included in the meta-analysis. These genes include a group of unclassified genes that could be involved
Identification and expression profiling analysis of TCP family genes involved in growth and development in maize.

Science.gov (United States)

Chai, Wenbo; Jiang, Pengfei; Huang, Guoyu; Jiang, Haiyang; Li, Xiaoyu

2017-10-01

The TCP family is a group of plant-specific transcription factors. TCP genes encode proteins harboring a bHLH structure, known as the TCP domain, which is implicated in DNA binding and protein-protein interactions. TCP genes play important roles in plant development and have been evolutionarily and functionally elaborated in various plants; however, no overall phylogenetic analysis or expression profiling of TCP genes in Zea mays has been reported. In the present study, a systematic analysis of molecular evolution and functional prediction of TCP family genes in maize (Z. mays L.) has been conducted. We performed a genome-wide survey of TCP genes in maize, revealing the gene structure, chromosomal location and phylogenetic relationship of family members. Microsynteny between grass species and tissue-specific expression profiles were also investigated. In total, 29 TCP genes were identified in the maize genome, unevenly distributed on the 10 maize chromosomes. Additionally, ZmTCP genes were categorized into nine classes based on phylogeny, and purifying selection may largely be responsible for maintaining the functions of maize TCP genes. Moreover, microsynteny analysis suggested that TCP genes have been conserved during evolution. Finally, expression analysis revealed that most TCP genes are expressed in the stem and ear, which suggests that ZmTCP genes influence stem and ear growth. This result is consistent with the previous finding that maize TCP genes repress the growth of axillary organs and enable the formation of female inflorescences. Altogether, this study presents a thorough overview of the TCP family in maize and provides a new perspective on the evolution of this gene family. The results also indicate that TCP family genes may be involved in developmental stages under plant growing conditions. Additionally, our results will be useful for further functional analysis of the TCP gene family in maize.
Gene expression analysis reveals new possible mechanisms of vancomycin-induced nephrotoxicity and identifies gene markers candidates.

Science.gov (United States)

Dieterich, Christine; Puey, Angela; Lin, Sylvia; Lyn, Sylvia; Swezey, Robert; Furimsky, Anna; Fairchild, David; Mirsalis, Jon C; Ng, Hanna H

2009-01-01

Vancomycin, one of few effective treatments against methicillin-resistant Staphylococcus aureus, is nephrotoxic. The goals of this study were to (1) gain insights into molecular mechanisms of nephrotoxicity at the genomic level, (2) evaluate gene markers of vancomycin-induced kidney injury, and (3) compare gene expression responses after iv and ip administration. Groups of six female BALB/c mice were treated with seven daily iv or ip doses of vancomycin (50, 200, and 400 mg/kg) or saline, and sacrificed on day 8. Clinical chemistry and histopathology demonstrated kidney injury at 400 mg/kg only. Hierarchical clustering analysis revealed that kidney gene expression profiles of all mice treated at 400 mg/kg clustered with those of mice administered 200 mg/kg iv. Transcriptional profiling might thus be more sensitive than current clinical markers for detecting kidney damage, though the profiles can differ with the route of administration. Analysis of transcripts whose expression was changed by at least twofold compared with vehicle saline after high iv and ip doses of vancomycin suggested the possibility of oxidative stress and mitochondrial damage in vancomycin-induced toxicity. In addition, our data showed changes in expression of several transcripts from the complement and inflammatory pathways. Such expression changes were confirmed by relative real-time reverse transcription-polymerase chain reaction. Finally, our results further substantiate the use of gene markers of kidney toxicity such as KIM-1/Havcr1, as indicators of renal injury.
Gene Module Identification from Microarray Data Using Nonnegative Independent Component Analysis

Directory of Open Access Journals (Sweden)

Ting Gong

2007-01-01

Full Text Available Genes mostly interact with each other to form transcriptional modules for performing single or multiple functions. It is important to unravel such transcriptional modules and to determine how disturbances in them may lead to disease. Here, we propose a non-negative independent component analysis (nICA) approach for transcriptional module discovery. The nICA method utilizes the non-negativity constraint to enforce the independence of biological processes within the participating genes. As such, nICA decomposes the observed gene expression into positive independent components, which fits better with the reality of the corresponding putative biological processes. In conjunction with nICA modeling, the visual statistical data analyzer (VISDA) is applied to group genes into modules in latent variable space. We demonstrate the usefulness of the approach through the identification of composite modules from yeast data and the discovery of pathway modules in muscle regeneration.
Evolutionary and genetic analysis of the VP2 gene of canine parvovirus.

Science.gov (United States)

Li, Gairu; Ji, Senlin; Zhai, Xiaofeng; Zhang, Yuxiang; Liu, Jie; Zhu, Mengyan; Zhou, Jiyong; Su, Shuo

2017-07-17

Canine parvovirus (CPV) type 2 emerged in 1978 in the USA and quickly spread among dog populations all over the world with high morbidity. Although CPV is a DNA virus, its genomic substitution rate is similar to that of some RNA viruses. Therefore, it is important to trace the evolution of CPV to monitor the appearance of mutations that might affect vaccine effectiveness. Our analysis shows that the VP2 genes of CPV isolated from 1979 to 2016 are divided into six groups: GI, GII, GIII, GIV, GV, and GVI. Amino acid mutation analysis revealed several previously undiscovered important mutation sites: F267Y, Y324I, and T440A. Of note, the evolutionary rate of the CPV VP2 gene from Asia and Europe decreased. Codon usage analysis showed that the VP2 gene of CPV exhibits high bias, with an ENC ranging from 34.93 to 36.7. Furthermore, we demonstrate that natural selection plays a greater role than mutation pressure in driving CPV evolution. There are few studies on the codon usage of CPV. Here, we comprehensively studied the genetic evolution, codon usage pattern, and evolutionary characterization of the VP2 gene of CPV. The novel findings revealing the evolutionary process of CPV will greatly serve future CPV research.
A Key Gene, PLIN1, Can Affect Porcine Intramuscular Fat Content Based on Transcriptome Analysis.

Science.gov (United States)

Li, Bojiang; Weng, Qiannan; Dong, Chao; Zhang, Zengkai; Li, Rongyang; Liu, Jingge; Jiang, Aiwen; Li, Qifa; Jia, Chao; Wu, Wangjun; Liu, Honglin

2018-04-04

Intramuscular fat (IMF) content is an important indicator for meat quality evaluation. However, the key genes and molecular regulatory mechanisms affecting IMF deposition remain unclear. In the present study, we identified 75 differentially expressed genes (DEGs) between the higher (H) and lower (L) IMF content of pigs using transcriptome analysis, of which 27 were upregulated and 48 were downregulated. Notably, Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis indicated that the DEG perilipin-1 ( PLIN1 ) was significantly enriched in the fat metabolism-related peroxisome proliferator-activated receptor (PPAR) signaling pathway. Furthermore, we determined the expression patterns and functional role of porcine PLIN1. Our results indicate that PLIN1 was highly expressed in porcine adipose tissue, and its expression level was significantly higher in the H IMF content group when compared with the L IMF content group, and expression was increased during adipocyte differentiation. Additionally, our results confirm that PLIN1 knockdown decreases the triglyceride (TG) level and lipid droplet (LD) size in porcine adipocytes. Overall, our data identify novel candidate genes affecting IMF content and provide new insight into PLIN1 in porcine IMF deposition and adipocyte differentiation.
A Key Gene, PLIN1, Can Affect Porcine Intramuscular Fat Content Based on Transcriptome Analysis

Directory of Open Access Journals (Sweden)

Bojiang Li

2018-04-01

Full Text Available Intramuscular fat (IMF) content is an important indicator for meat quality evaluation. However, the key genes and molecular regulatory mechanisms affecting IMF deposition remain unclear. In the present study, we identified 75 differentially expressed genes (DEGs) between the higher (H) and lower (L) IMF content of pigs using transcriptome analysis, of which 27 were upregulated and 48 were downregulated. Notably, Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis indicated that the DEG perilipin-1 (PLIN1) was significantly enriched in the fat metabolism-related peroxisome proliferator-activated receptor (PPAR) signaling pathway. Furthermore, we determined the expression patterns and functional role of porcine PLIN1. Our results indicate that PLIN1 was highly expressed in porcine adipose tissue, and its expression level was significantly higher in the H IMF content group when compared with the L IMF content group, and expression was increased during adipocyte differentiation. Additionally, our results confirm that PLIN1 knockdown decreases the triglyceride (TG) level and lipid droplet (LD) size in porcine adipocytes. Overall, our data identify novel candidate genes affecting IMF content and provide new insight into PLIN1 in porcine IMF deposition and adipocyte differentiation.
Differential gene expression in granulosa cells from polycystic ovary syndrome patients with and without insulin resistance: identification of susceptibility gene sets through network analysis.

Science.gov (United States)

Kaur, Surleen; Archer, Kellie J; Devi, M Gouri; Kriplani, Alka; Strauss, Jerome F; Singh, Rita

2012-10-01

Polycystic ovary syndrome (PCOS) is a heterogeneous, genetically complex, endocrine disorder of uncertain etiology in women. Our aim was to compare the gene expression profiles in stimulated granulosa cells of PCOS women with and without insulin resistance vs. matched controls. This study included 12 normal ovulatory women (controls), 12 women with PCOS without evidence for insulin resistance (PCOS non-IR), and 16 women with insulin resistance (PCOS-IR) undergoing in vitro fertilization. Granulosa cell gene expression profiling was accomplished using Affymetrix Human Genome-U133 arrays. Differentially expressed genes were classified according to gene ontology using ingenuity pathway analysis tools. Microarray results for selected genes were confirmed by real-time quantitative PCR. A total of 211 genes were differentially expressed in PCOS non-IR and PCOS-IR granulosa cells (fold change≥1.5; P≤0.001) vs. matched controls. Diabetes mellitus and inflammation genes were significantly increased in PCOS-IR patients. Real-time quantitative PCR confirmed higher expression of NCF2 (2.13-fold), TCF7L2 (1.92-fold), and SERPINA1 (5.35-fold). Increased expression of inflammation genes ITGAX (3.68-fold) and TAB2 (1.86-fold) was confirmed in PCOS non-IR. Different cardiometabolic disease genes were differentially expressed in the two groups. Decreased expression of CAV1 (-3.58-fold) in PCOS non-IR and SPARC (-1.88-fold) in PCOS-IR was confirmed. Differential expression of genes involved in TGF-β signaling (IGF2R, increased; and HAS2, decreased), and oxidative stress (TXNIP, increased) was confirmed in both groups. Microarray analysis demonstrated differential expression of genes linked to diabetes mellitus, inflammation, cardiovascular diseases, and infertility in the granulosa cells of PCOS women with and without insulin resistance. Because these dysregulated genes are also involved in oxidative stress, lipid metabolism, and insulin signaling, we hypothesize that these
Genome-Wide Identification and Structural Analysis of bZIP Transcription Factor Genes in Brassica napus.

Science.gov (United States)

Zhou, Yan; Xu, Daixiang; Jia, Ledong; Huang, Xiaohu; Ma, Guoqiang; Wang, Shuxian; Zhu, Meichen; Zhang, Aoxiang; Guan, Mingwei; Lu, Kun; Xu, Xinfu; Wang, Rui; Li, Jiana; Qu, Cunmin

2017-10-24

The basic region/leucine zipper motif (bZIP) transcription factor family is one of the largest families of transcriptional regulators in plants. bZIP genes have been systematically characterized in some plants, but not in rapeseed (Brassica napus). In this study, we identified 247 BnbZIP genes in the rapeseed genome, which we classified into 10 subfamilies based on phylogenetic analysis of their deduced protein sequences. The BnbZIP genes were grouped into functional clades with Arabidopsis genes with similar putative functions, indicating functional conservation. Genome mapping analysis revealed that the BnbZIPs are distributed unevenly across all 19 chromosomes, and that some of these genes arose through whole-genome duplication and dispersed duplication events. Expression profiles of all 247 bZIP genes were extracted from RNA-sequencing data obtained from 17 different B. napus ZS11 tissues at 42 developmental stages. These genes exhibited different expression patterns in various tissues, revealing that these genes are differentially regulated. Our results provide a valuable foundation for functional dissection of the different BnbZIP homologs in B. napus and its parental lines and for molecular breeding studies of bZIP genes in B. napus.
Gene Circuit Analysis of the Terminal Gap Gene huckebein

Science.gov (United States)

Ashyraliyev, Maksat; Siggens, Ken; Janssens, Hilde; Blom, Joke; Akam, Michael; Jaeger, Johannes

2009-01-01

The early embryo of Drosophila melanogaster provides a powerful model system to study the role of genes in pattern formation. The gap gene network constitutes the first zygotic regulatory tier in the hierarchy of the segmentation genes involved in specifying the position of body segments. Here, we use an integrative, systems-level approach to investigate the regulatory effect of the terminal gap gene huckebein (hkb) on gap gene expression. We present quantitative expression data for the Hkb protein, which enable us to include hkb in gap gene circuit models. Gap gene circuits are mathematical models of gene networks used as computational tools to extract regulatory information from spatial expression data. This is achieved by fitting the model to gap gene expression patterns, in order to obtain estimates for regulatory parameters which predict a specific network topology. We show how considering variability in the data combined with analysis of parameter determinability significantly improves the biological relevance and consistency of the approach. Our models are in agreement with earlier results, which they extend in two important respects: First, we show that Hkb is involved in the regulation of the posterior hunchback (hb) domain, but does not have any other essential function. Specifically, Hkb is required for the anterior shift in the posterior border of this domain, which is now reproduced correctly in our models. Second, gap gene circuits presented here are able to reproduce mutants of terminal gap genes, while previously published models were unable to reproduce any null mutants correctly. As a consequence, our models now capture the expression dynamics of all posterior gap genes and some variational properties of the system correctly. This is an important step towards a better, quantitative understanding of the developmental and evolutionary dynamics of the gap gene network. PMID:19876378
Transcriptome analysis of WRKY gene family in Oryza officinalis Wall ex Watt and WRKY genes involved in responses to Xanthomonas oryzae pv. oryzae stress.

Science.gov (United States)

Jiang, Chunmiao; Shen, Qingxi J; Wang, Bo; He, Bin; Xiao, Suqin; Chen, Ling; Yu, Tengqiong; Ke, Xue; Zhong, Qiaofang; Fu, Jian; Chen, Yue; Wang, Lingxian; Yin, Fuyou; Zhang, Dunyu; Ghidan, Walid; Huang, Xingqi; Cheng, Zaiquan

2017-01-01

Oryza officinalis Wall ex Watt, a very important and special wild rice species, shows abundant genetic diversity and disease resistance features, especially high resistance to bacterial blight. The molecular mechanisms of bacterial blight resistance in O. officinalis have not yet been elucidated. The WRKY transcription factor family is one of the largest gene families involved in plant growth, development and stress response. However, little is known about the numbers, structure, molecular phylogenetics, and expression of the WRKY genes under Xanthomonas oryzae pv. oryzae (Xoo) stress in O. officinalis, owing to the lack of an O. officinalis genome. Therefore, based on the RNA-sequencing data of O. officinalis, we performed a comprehensive study of WRKY genes in O. officinalis and identified 89 OoWRKY genes. The 89 OoWRKY genes were then classified into three groups based on the WRKY domains and zinc finger motifs. Phylogenetic analysis strongly supported that the evolution of OoWRKY genes was consistent with previous studies of WRKYs, and that subgroup IIc OoWRKY genes were the original ancestors of some group II and group III OoWRKYs. Among the 89 OoWRKY genes, eight OoWRKYs displayed significantly different expression (>2-fold, p [...] WRKY family of transcription factors in O. officinalis. Insight was gained into the classification, evolution, and function of the OoWRKY genes, revealing the putative roles of the eight significantly differentially expressed OoWRKYs in responses to stress from Xoo strains PXO99 and C5 in O. officinalis. This study provided a better understanding of the evolution and functions of O. officinalis WRKY genes, and suggested that manipulating the eight significantly differentially expressed OoWRKYs would enhance resistance to bacterial blight.
Gene expression data clustering and its application in differential analysis of leukemia

Directory of Open Access Journals (Sweden)

M. Vahedi

2008-02-01

Full Text Available Introduction: The DNA microarray technique is one of the most important topics in bioinformatics; by making it possible to monitor thousands of expressed genes, it has recently resulted in the creation of giant databases of gene expression data. Statistical analysis of such databases includes normalization, clustering, classification, etc. Materials and Methods: Golub et al. (1999) collected leukemia data based on the oligonucleotide method. The data are available on the internet. In this paper, we analyzed gene expression data. The data were clustered by several methods, including multidimensional scaling and hierarchical and non-hierarchical clustering. The data set included 20 Acute Lymphoblastic Leukemia (ALL) patients and 14 Acute Myeloid Leukemia (AML) patients. The results of two clustering methods were compared with the real grouping (ALL and AML). R software was used for data analysis. Results: Specificity and sensitivity of divisive hierarchical clustering in diagnosing ALL patients were 75% and 92%, respectively. Specificity and sensitivity of partitioning around medoids in diagnosing ALL patients were 90% and 93%, respectively. These results showed good performance of both clustering methods. Notably, according to the clustering results, one of the samples that was in the AML group by clinical test was placed in the ALL group. Conclusion: Given the concordance of the results with the real grouping of the data, these methods can be used in cases where accurate information on the real grouping is not available. Moreover, clustering results might distinguish subgroups of data that would need to be checked for concordance with clinical outcomes, laboratory results, and so on.
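The sensitivity and specificity figures quoted above compare cluster assignments against the clinical grouping. A minimal Python sketch of that comparison follows, treating ALL as the positive class. The function name and the toy cluster assignments are hypothetical; they are not the study's data and are not meant to reproduce its reported percentages.

```python
def sensitivity_specificity(true_labels, predicted_labels, positive="ALL"):
    """Return (sensitivity, specificity) for one class treated as positive."""
    tp = fn = tn = fp = 0
    for truth, pred in zip(true_labels, predicted_labels):
        if truth == positive:
            if pred == positive:
                tp += 1  # positive sample placed in the positive cluster
            else:
                fn += 1  # positive sample placed in the other cluster
        else:
            if pred == positive:
                fp += 1  # negative sample placed in the positive cluster
            else:
                tn += 1  # negative sample placed in the other cluster
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical example: 20 ALL and 14 AML samples, as in the abstract,
# with made-up cluster assignments (2 ALL and 1 AML sample misplaced).
truth = ["ALL"] * 20 + ["AML"] * 14
clusters = ["ALL"] * 18 + ["AML"] * 2 + ["AML"] * 13 + ["ALL"] * 1

sens, spec = sensitivity_specificity(truth, clusters)
print(f"sensitivity={sens:.3f}, specificity={spec:.3f}")
```

Because clustering is unsupervised, each cluster first has to be mapped to a class (for instance by majority vote) before these counts are meaningful; the sketch assumes that mapping has already been done.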
Blood Groups Distribution and Gene Diversity of the ABO and Rh (D) Loci in the Mexican Population

Directory of Open Access Journals (Sweden)

Adrián Canizalez-Román

2018-01-01

Full Text Available Objective. To determine the frequency and distribution of ABO and Rh (D) antigens and, additionally, investigate gene diversity and the structure of Mexican populations. Materials and Methods. Blood groups were tested in 271,164 subjects from 2014 to 2016. The ABO blood group and the Rh factor were determined by agglutination using anti-A and anti-B antibodies and anti-D antibody, respectively. Results. The overall distribution of ABO and Rh (D) groups in the population studied was as follows: O: 61.82%; A: 27.44%; B: 8.93%; and AB: 1.81%. For the Rh group, 95.58% of people were Rh (D), and 4.42% were Rh (d). Different distributions of blood groups across regions were found; additionally, genetic analysis revealed that the IO and ID alleles showed an increasing trend from the north to the center, while the IA and Id alleles tended to increase from the center to the north. Also, we found more gene diversity in both loci in the north compared with the center, suggesting population structure in Mexico. Conclusion. This work could help health institutions to identify where they can obtain blood products necessary for medical interventions. Moreover, this information contributes to the knowledge of the genetic structure of the Mexican populations, which could have significant implications in different fields of biomedicine.
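Allele frequencies such as those discussed above can be estimated from phenotype frequencies. The abstract does not say which estimator the authors used, so the Python sketch below assumes Hardy-Weinberg equilibrium and applies Bernstein's square-root estimates to the overall phenotype frequencies reported above. The function names are hypothetical.

```python
import math


def abo_allele_frequencies(freq_a, freq_b, freq_o):
    """Bernstein estimates of the IA, IB and IO allele frequencies.

    Assumes Hardy-Weinberg equilibrium, under which
    freq(B) + freq(O) = (1 - p)^2 and freq(A) + freq(O) = (1 - q)^2.
    """
    r = math.sqrt(freq_o)                 # IO
    p = 1.0 - math.sqrt(freq_b + freq_o)  # IA
    q = 1.0 - math.sqrt(freq_a + freq_o)  # IB
    return p, q, r


def rh_allele_frequencies(freq_rh_negative):
    """Estimate the D and d allele frequencies from the Rh (d) phenotype frequency."""
    d = math.sqrt(freq_rh_negative)  # recessive allele: phenotype frequency = d^2
    return 1.0 - d, d


# Overall phenotype frequencies quoted in the abstract above.
p, q, r = abo_allele_frequencies(freq_a=0.2744, freq_b=0.0893, freq_o=0.6182)
big_d, small_d = rh_allele_frequencies(0.0442)
print(f"IA={p:.4f} IB={q:.4f} IO={r:.4f} ID={big_d:.4f} Id={small_d:.4f}")
```

The three ABO estimates need not sum exactly to 1; the size of the discrepancy gives a rough indication of how well the equilibrium assumption fits.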
Genome-Wide Analysis of the AP2/ERF Gene Family in Physic Nut and Overexpression of the JcERF011 Gene in Rice Increased Its Sensitivity to Salinity Stress.

Science.gov (United States)

Tang, Yuehui; Qin, Shanshan; Guo, Yali; Chen, Yanbo; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

2016-01-01

The AP2/ERF transcription factors play crucial roles in plant growth, development and responses to biotic and abiotic stresses. A total of 119 AP2/ERF genes (JcAP2/ERFs) have been identified in the physic nut genome; they include 16 AP2, 4 RAV, 1 Soloist, and 98 ERF genes. Phylogenetic analysis indicated that physic nut AP2 genes could be divided into 3 subgroups, while ERF genes could be classed into 11 groups or 43 subgroups. The AP2/ERF genes are non-randomly distributed across the 11 linkage groups of the physic nut genome and retain many duplicates which arose from ancient duplication events. The expression patterns of several JcAP2/ERF duplicates in the physic nut showed differences among four tissues (root, stem, leaf, and seed), and 38 JcAP2/ERF genes responded to at least one abiotic stressor (drought, salinity, phosphate starvation, and nitrogen starvation) in leaves and/or roots according to analysis of digital gene expression tag data. The expression of JcERF011 was downregulated by salinity stress in physic nut roots. Overexpression of the JcERF011 gene in rice plants increased its sensitivity to salinity stress. The increased expression levels of several salt tolerance-related genes were impaired in the JcERF011-overexpressing plants under salinity stress.

Discrimination of the Lactobacillus acidophilus group using sequencing, species-specific PCR and SNaPshot mini-sequencing technology based on the recA gene.

Science.gov (United States)

Huang, Chien-Hsun; Chang, Mu-Tzu; Huang, Mu-Chiou; Wang, Li-Tin; Huang, Lina; Lee, Fwu-Ling

2012-10-01

To clearly identify specific species and subspecies of the Lactobacillus acidophilus group using phenotypic and genotypic (16S rDNA sequence analysis) techniques alone is difficult. The aim of this study was to use the recA gene for species discrimination in the L. acidophilus group, as well as to develop a species-specific primer and single nucleotide polymorphism primer based on the recA gene sequence for species and subspecies identification. The average sequence similarity for the recA gene among type strains was 80.0%, and most members of the L. acidophilus group could be clearly distinguished. The species-specific primer was designed according to the recA gene sequencing, which was employed for polymerase chain reaction with the template DNA of Lactobacillus strains. A single 231-bp species-specific band was found only in L. delbrueckii. A SNaPshot mini-sequencing assay using recA as a target gene was also developed. The specificity of the mini-sequencing assay was evaluated using 31 strains of L. delbrueckii species and was able to unambiguously discriminate strains belonging to the subspecies L. delbrueckii subsp. bulgaricus. The phylogenetic relationships of most strains in the L. acidophilus group can be resolved using recA gene sequencing, and a novel method to identify the species and subspecies of the L. delbrueckii and L. delbrueckii subsp. bulgaricus was developed by species-specific polymerase chain reaction combined with SNaPshot mini-sequencing. Copyright © 2012 Society of Chemical Industry.
Dynamic association rules for gene expression data analysis.

Science.gov (United States)

Chen, Shu-Chuan; Tsai, Tsung-Hsien; Chung, Cheng-Han; Li, Wen-Hsiung

2015-10-14

The purpose of gene expression analysis is to look for the association between regulation of gene expression levels and phenotypic variations. This association, based on gene expression profiles, has been used to determine whether the induction/repression of genes corresponds to phenotypic variations, including cell regulation, clinical diagnoses and drug development. Statistical analyses of microarray data have been developed to resolve the gene selection issue. However, these methods do not inform us of causality between genes and phenotypes. In this paper, we propose the dynamic association rule algorithm (DAR algorithm), which helps one to efficiently select a subset of significant genes for subsequent analysis. The DAR algorithm is based on association rules from market basket analysis in marketing. We first propose a statistical way, based on constructing a one-sided confidence interval and hypothesis testing, to determine if an association rule is meaningful. Based on the proposed statistical method, we then developed the DAR algorithm for gene expression data analysis. The method was applied to analyze four microarray datasets and one Next Generation Sequencing (NGS) dataset: the Mice Apo A1 dataset, the whole genome expression dataset of mouse embryonic stem cells, expression profiling of the bone marrow of Leukemia patients, the Microarray Quality Control (MAQC) data set and the RNA-seq dataset of a mouse genomic imprinting study. A comparison of the proposed method with the t-test on the expression profiling of the bone marrow of Leukemia patients was conducted. We developed a statistical way, based on the concept of confidence interval, to determine the minimum support and minimum confidence for mining association relationships among items. With the minimum support and minimum confidence, one can find significant rules in one single step. The DAR algorithm was then developed for gene expression data analysis. Four gene expression datasets showed that the proposed
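Support and confidence are the two standard quantities behind association rules from market basket analysis. The Python sketch below computes them for a rule of the form "gene up-regulated implies phenotype". It only illustrates those two definitions and is not the DAR algorithm or its confidence-interval thresholds; the item names and sample sets are hypothetical.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= set(t))
    return hits / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimate P(consequent | antecedent) as support(A and C) / support(A)."""
    base = support(transactions, antecedent)
    if base == 0:
        return 0.0  # rule never fires, so confidence is undefined; report 0
    joint = support(transactions, set(antecedent) | set(consequent))
    return joint / base


# Hypothetical discretised samples: each set lists the events seen in one sample.
samples = [
    {"geneA_up", "geneB_up", "disease"},
    {"geneA_up", "disease"},
    {"geneA_up", "geneB_up"},
    {"geneB_up"},
    {"geneA_up", "geneB_up", "disease"},
]

rule_support = support(samples, {"geneA_up", "disease"})
rule_confidence = confidence(samples, {"geneA_up"}, {"disease"})
print(f"support={rule_support:.2f}, confidence={rule_confidence:.2f}")
```

A rule is normally kept only if both values exceed chosen minimums; the abstract describes deriving those minimums statistically rather than fixing them by hand.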
Average correlation clustering algorithm (ACCA) for grouping of co-regulated genes with similar pattern of variation in their expression values.

Science.gov (United States)

Bhattacharya, Anindya; De, Rajat K

2010-08-01

Distance-based clustering algorithms can group genes that show similar expression values under multiple experimental conditions. They are unable to identify a group of genes that have a similar pattern of variation in their expression values. Previously we developed an algorithm called divisive correlation clustering algorithm (DCCA) to tackle this situation, which is based on the concept of correlation clustering. But this algorithm may also fail in certain cases. To overcome these situations, we propose a new clustering algorithm, called average correlation clustering algorithm (ACCA), which is able to produce a better clustering solution than that produced by some others. ACCA is able to find groups of genes having more common transcription factors and a similar pattern of variation in their expression values. Moreover, ACCA is more efficient than DCCA with respect to execution time. Like DCCA, ACCA uses the concept of correlation clustering introduced by Bansal et al. ACCA uses the correlation matrix in such a way that all genes in a cluster have the highest average correlation values with the genes in that cluster. We have applied ACCA and some well-known conventional methods, including DCCA, to two artificial and nine gene expression datasets, and compared the performance of the algorithms. The clustering results of ACCA are found to be more significantly relevant to the biological annotations than those of the other methods. Analysis of the results shows the superiority of ACCA over some others in determining a group of genes having more common transcription factors and a similar pattern of variation in their expression profiles. Availability of the software: The software has been developed using the C and Visual Basic languages, and can be executed on Microsoft Windows platforms. The software may be downloaded as a zip file from http://www.isical.ac.in/~rajat. Then it needs to be installed. Two word files (included in the zip file) need to
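The criterion stated above, that each gene should have its highest average correlation with the genes of its own cluster, can be illustrated with one reassignment pass in Python using NumPy. This is only a sketch of that criterion, not the published ACCA procedure. The function name and the toy expression matrix are hypothetical, and including a gene's self-correlation in the average is a simplifying assumption made here.

```python
import numpy as np


def reassign_by_average_correlation(expr, clusters):
    """One pass: move each gene to the cluster with the highest average correlation.

    expr     : array of shape (n_genes, n_conditions)
    clusters : dict mapping cluster label -> list of gene indices
    The average includes the gene's correlation with itself when it is a
    member of the cluster (a simplification assumed for this sketch).
    """
    corr = np.corrcoef(expr)  # gene-by-gene Pearson correlation matrix
    labels = []
    for gene in range(expr.shape[0]):
        best_label, best_avg = None, -np.inf
        for label, members in clusters.items():
            avg = corr[gene, members].mean()
            if avg > best_avg:
                best_label, best_avg = label, avg
        labels.append(best_label)
    return labels


# Toy data: genes 0 and 1 rise together at very different magnitudes,
# genes 2 and 3 fall together. Gene 2 starts in the wrong cluster.
expr = np.array([
    [1.0, 2.0, 3.0, 4.0, 5.0],
    [10.0, 20.0, 30.0, 40.0, 50.0],
    [5.0, 4.0, 3.0, 2.0, 1.0],
    [50.0, 40.0, 30.0, 20.0, 10.0],
])
initial = {0: [0, 1, 2], 1: [3]}
labels = reassign_by_average_correlation(expr, initial)
print(labels)
```

Genes 0 and 1 differ tenfold in magnitude, so a distance-based method would tend to separate them; correlation groups them because their pattern of variation is the same. A full algorithm would repeat such passes until no gene changes cluster.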
A resampling-based meta-analysis for detection of differential gene expression in breast cancer

International Nuclear Information System (INIS)

Gur-Dedeoglu, Bala; Konu, Ozlen; Kir, Serkan; Ozturk, Ahmet Rasit; Bozkurt, Betul; Ergul, Gulusan; Yulug, Isik G

2008-01-01

proposed meta-analysis approach has the ability to detect a set of differentially expressed genes with the least amount of within-group variability, thus providing highly stable gene lists for class prediction. Increased statistical power and stringent filtering criteria used in the present study also make identification of novel candidate genes possible and may provide further insight to improve our understanding of breast cancer development
A resampling-based meta-analysis for detection of differential gene expression in breast cancer

Directory of Open Access Journals (Sweden)

Ergul Gulusan

2008-12-01

-time qRT-PCR supported the meta-analysis results. Conclusion The proposed meta-analysis approach has the ability to detect a set of differentially expressed genes with the least amount of within-group variability, thus providing highly stable gene lists for class prediction. Increased statistical power and stringent filtering criteria used in the present study also make identification of novel candidate genes possible and may provide further insight to improve our understanding of breast cancer development.
Meta Analysis of Gene Expression Data within and Across Species.

Science.gov (United States)

Fierro, Ana C; Vandenbussche, Filip; Engelen, Kristof; Van de Peer, Yves; Marchal, Kathleen

2008-12-01

Since the second half of the 1990s, a large number of genome-wide analyses have been described that study gene expression at the transcript level. To this end, two major strategies have been adopted, a first one relying on hybridization techniques such as microarrays, and a second one based on sequencing techniques such as serial analysis of gene expression (SAGE), cDNA-AFLP, and analysis based on expressed sequence tags (ESTs). Despite both types of profiling experiments becoming routine techniques in many research groups, their application remains costly and laborious. As a result, the number of conditions profiled in individual studies is still relatively small and usually varies from only two to few hundreds of samples for the largest experiments. More and more, scientific journals require the deposit of these high throughput experiments in public databases upon publication. Mining the information present in these databases offers molecular biologists the possibility to view their own small-scale analysis in the light of what is already available. However, so far, the richness of the public information remains largely unexploited. Several obstacles such as the correct association between ESTs and microarray probes with the corresponding gene transcript, the incompleteness and inconsistency in the annotation of experimental conditions, and the lack of standardized experimental protocols to generate gene expression data, all impede the successful mining of these data. Here, we review the potential and difficulties of combining publicly available expression data from respectively EST analyses and microarray experiments. With examples from literature, we show how meta-analysis of expression profiling experiments can be used to study expression behavior in a single organism or between organisms, across a wide range of experimental conditions. We also provide an overview of the methods and tools that can aid molecular biologists in exploiting these public data.
Gene set analysis using variance component tests.

Science.gov (United States)

Huang, Yen-Tsung; Lin, Xihong

2013-06-28

Gene set analyses have become increasingly important in genomic research, as alterations of numerous genes jointly contribute to many complex diseases. Genes often coordinate together as a functional repertoire, e.g., a biological pathway/network, and are highly correlated. However, most of the existing gene set analysis methods do not fully account for the correlation among the genes. Here we propose to tackle this important feature of a gene set to improve statistical power in gene set analyses. We propose to model the effects of an independent variable, e.g., exposure/biological status (yes/no), on multiple gene expression values in a gene set using a multivariate linear regression model, where the correlation among the genes is explicitly modeled using a working covariance matrix. We develop TEGS (Test for the Effect of a Gene Set), a variance component test for the gene set effects by assuming a common distribution for regression coefficients in multivariate linear regression models, and calculate the p-values using permutation and a scaled chi-square approximation. We show using simulations that type I error is protected under different choices of working covariance matrices and power is improved as the working covariance approaches the true covariance. The global test is a special case of TEGS when correlation among genes in a gene set is ignored. Using both simulation data and a published diabetes dataset, we show that our test outperforms the commonly used approaches, the global test and gene set enrichment analysis (GSEA). We develop a gene set analysis method (TEGS) under the multivariate regression framework, which directly models the interdependence of the expression values in a gene set using a working covariance. TEGS outperforms two widely used methods, GSEA and the global test, in both simulations and a diabetes microarray dataset.
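The permutation step mentioned above can be sketched in Python with NumPy for a deliberately simplified gene set statistic. The statistic used here, a sum of squared per-gene scores, ignores gene-gene correlation and is an assumption chosen for brevity; it is not the TEGS statistic, and the function name and simulated data are hypothetical.

```python
import numpy as np


def gene_set_permutation_pvalue(expr, group, n_perm=999, seed=0):
    """Permutation p-value for a simple gene set statistic.

    expr  : array of shape (n_genes, n_samples), expression of the gene set
    group : array of shape (n_samples,), binary exposure/status labels
    The statistic is the sum over genes of the squared covariance-like score
    between expression and the centred labels; correlation among genes is
    ignored (a simplification assumed for this sketch).
    """
    rng = np.random.default_rng(seed)
    expr = np.asarray(expr, dtype=float)
    group = np.asarray(group, dtype=float)

    def statistic(labels):
        centred = labels - labels.mean()
        scores = expr @ centred  # one score per gene
        return float(np.sum(scores ** 2))

    observed = statistic(group)
    exceed = sum(
        statistic(rng.permutation(group)) >= observed for _ in range(n_perm)
    )
    # Add-one correction keeps the p-value strictly positive.
    return (exceed + 1) / (n_perm + 1)


# Simulated example: 5 genes, 20 samples, all genes shifted in the exposed group.
rng = np.random.default_rng(42)
group = np.array([0] * 10 + [1] * 10)
expr = rng.normal(size=(5, 20)) + 3.0 * group
p_value = gene_set_permutation_pvalue(expr, group)
print(p_value)
```

Permuting the labels rather than the genes preserves the correlation structure within the gene set under the null hypothesis, which is why permutation is a common way to calibrate gene set tests.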
The WRKY Transcription Factor Genes in Lotus japonicus.

Science.gov (United States)

Song, Hui; Wang, Pengfei; Nan, Zhibiao; Wang, Xingjun

2014-01-01

WRKY transcription factor genes play critical roles in plant growth and development, as well as stress responses. WRKY genes have been examined in various higher plants, but they have not been characterized in Lotus japonicus. The recent release of the L. japonicus whole genome sequence provides an opportunity for a genome-wide analysis of WRKY genes in this species. In this study, we identified 61 WRKY genes in the L. japonicus genome. Based on the WRKY protein structure, L. japonicus WRKY (LjWRKY) genes can be classified into three groups (I-III). Investigations of gene copy number and gene clusters indicate that only one gene duplication event occurred on chromosome 4 and no clustered genes were detected on chromosomes 3 or 6. Researchers previously believed that group II and III WRKY domains were derived from the C-terminal WRKY domain of group I. Our results suggest that some WRKY genes in group II originated from the N-terminal domain of group I WRKY genes. Additional evidence to support this hypothesis was obtained by Medicago truncatula WRKY (MtWRKY) protein motif analysis. We found that LjWRKY and MtWRKY group III genes are under purifying selection, suggesting that WRKY genes will become increasingly structured and functionally conserved.
Mining the archives: a cross-platform analysis of gene ...

Science.gov (United States)

Formalin-fixed paraffin-embedded (FFPE) tissue samples represent a potentially invaluable resource for genomic research into the molecular basis of disease. However, use of FFPE samples in gene expression studies has been limited by technical challenges resulting from degradation of nucleic acids. Here we evaluated gene expression profiles derived from fresh-frozen (FRO) and FFPE mouse liver tissues using two DNA microarray protocols and two whole transcriptome sequencing (RNA-seq) library preparation methodologies. The ribo-depletion protocol outperformed the other three methods by having the highest correlations of differentially expressed genes (DEGs) and best overlap of pathways between FRO and FFPE groups. We next tested the effect of sample time in formalin (18 hours or 3 weeks) on gene expression profiles. Hierarchical clustering of the datasets indicated that test article treatment, and not preservation method, was the main driver of gene expression profiles. Meta- and pathway analyses indicated that biological responses were generally consistent for 18-hour and 3-week FFPE samples compared to FRO samples. However, clear erosion of signal intensity with time in formalin was evident, and DEG numbers differed by platform and preservation method. Lastly, we investigated the effect of age in FFPE block on genomic profiles. RNA-seq analysis of 8-, 19-, and 26-year-old control blocks using the ribo-depletion protocol resulted in comparable quality metrics, inc
Genome-Wide Identification and Analysis of the TIFY Gene Family in Grape

Science.gov (United States)

Zhang, Yucheng; Gao, Min; Singer, Stacy D.; Fei, Zhangjun; Wang, Hua; Wang, Xiping

2012-01-01

Background: The TIFY gene family constitutes a plant-specific group of genes with a broad range of functions. This family encodes four subfamilies of proteins, including ZML, TIFY, PPD and JASMONATE ZIM-Domain (JAZ) proteins. JAZ proteins are targets of the SCFCOI1 complex, and function as negative regulators in the JA signaling pathway. Recently, it has been reported in both Arabidopsis and rice that TIFY genes, and especially JAZ genes, may be involved in plant defense against insect feeding, wounding, pathogens and abiotic stresses. Nonetheless, knowledge concerning the specific expression patterns and evolutionary history of plant TIFY family members is limited, especially in a woody species such as grape. Methodology/Principal Findings: A total of two TIFY, four ZML, two PPD and 11 JAZ genes were identified in the Vitis vinifera genome. Phylogenetic analysis of TIFY protein sequences from grape, Arabidopsis and rice indicated that the grape TIFY proteins are more closely related to those of Arabidopsis than to those of rice. Both segmental and tandem duplication events have been major contributors to the expansion of the grape TIFY family. In addition, synteny analysis between grape and Arabidopsis demonstrated that homologues of several grape TIFY genes were found in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of the lineages that led to grape and Arabidopsis. Analyses of microarray and quantitative real-time RT-PCR expression data revealed that grape TIFY genes are not major players in the defense against biotrophic pathogens or viruses. However, many of these genes were responsive to JA and ABA, but not SA or ET. Conclusion: The genome-wide identification, evolutionary and expression analyses of grape TIFY genes should facilitate further research on this gene family and provide new insights regarding their evolutionary history and regulatory control. PMID:22984514
Serial analysis of gene expression (SAGE)

NARCIS (Netherlands)

van Ruissen, Fred; Baas, Frank

2007-01-01

In 1995, serial analysis of gene expression (SAGE) was developed as a versatile tool for gene expression studies. SAGE technology does not require pre-existing knowledge of the genome that is being examined and therefore SAGE can be applied to many different model systems. In this chapter, the SAGE
Network Graph Analysis of Gene-Gene Interactions in Genome-Wide Association Study Data

Directory of Open Access Journals (Sweden)

Sungyoung Lee

2012-12-01

Most common complex traits, such as obesity, hypertension, diabetes, and cancers, are known to be associated with multiple genes, environmental factors, and their epistasis. Recently, the development of advanced genotyping technologies has allowed us to perform genome-wide association studies (GWASs). For detecting the effects of multiple genes on complex traits, many approaches have been proposed for GWASs. Multifactor dimensionality reduction (MDR) is one of the powerful and efficient methods for detecting high-order gene-gene (GxG) interactions. However, the biological interpretation of GxG interactions identified by MDR analysis is not easy. In order to aid the interpretation of MDR results, we propose a network graph analysis to elucidate the meaning of identified GxG interactions. The proposed network graph analysis consists of three steps. The first step is for performing GxG interaction analysis using MDR analysis. The second step is to draw the network graph using the MDR result. The third step is to provide biological evidence of the identified GxG interaction using external biological databases. The proposed method was applied to Korean Association Resource (KARE) data, containing 8838 individuals with 327,632 single-nucleotide polymorphisms, in order to perform GxG interaction analysis of body mass index (BMI). Our network graph analysis successfully showed that many identified GxG interactions have known biological evidence related to BMI. We expect that our network graph analysis will be helpful to interpret the biological meaning of GxG interactions.
Network graph analysis of gene-gene interactions in genome-wide association study data.

Science.gov (United States)

Lee, Sungyoung; Kwon, Min-Seok; Park, Taesung

2012-12-01

Most common complex traits, such as obesity, hypertension, diabetes, and cancers, are known to be associated with multiple genes, environmental factors, and their epistasis. Recently, the development of advanced genotyping technologies has allowed us to perform genome-wide association studies (GWASs). For detecting the effects of multiple genes on complex traits, many approaches have been proposed for GWASs. Multifactor dimensionality reduction (MDR) is one of the powerful and efficient methods for detecting high-order gene-gene (GxG) interactions. However, the biological interpretation of GxG interactions identified by MDR analysis is not easy. In order to aid the interpretation of MDR results, we propose a network graph analysis to elucidate the meaning of identified GxG interactions. The proposed network graph analysis consists of three steps. The first step is for performing GxG interaction analysis using MDR analysis. The second step is to draw the network graph using the MDR result. The third step is to provide biological evidence of the identified GxG interaction using external biological databases. The proposed method was applied to Korean Association Resource (KARE) data, containing 8838 individuals with 327,632 single-nucleotide polymorphisms, in order to perform GxG interaction analysis of body mass index (BMI). Our network graph analysis successfully showed that many identified GxG interactions have known biological evidence related to BMI. We expect that our network graph analysis will be helpful to interpret the biological meaning of GxG interactions.
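The second of the three steps described above, turning a list of identified GxG pairs into a network graph, can be sketched as follows. This is only an illustration of the data structure: the input format (a list of two-element tuples), the function names, and the idea of ranking nodes by degree are assumptions, not details taken from the paper.

```python
from collections import defaultdict

def build_interaction_graph(pairs):
    """Build an undirected graph (node -> set of neighbours) from a list of
    (node_a, node_b) interaction pairs, e.g. pairs reported by an MDR run."""
    graph = defaultdict(set)
    for a, b in pairs:
        graph[a].add(b)
        graph[b].add(a)
    return dict(graph)

def rank_by_degree(graph):
    """Order nodes from most to least connected; ties are broken by name."""
    return sorted(graph, key=lambda node: (-len(graph[node]), node))

# Hypothetical labels; real input would be SNP or gene identifiers.
example_pairs = [("A", "B"), ("A", "C"), ("B", "C"), ("A", "D")]
example_graph = build_interaction_graph(example_pairs)
hubs = rank_by_degree(example_graph)
```

Highly connected nodes found this way would be the natural first candidates to look up in external biological databases in the third step.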
SIGNATURE: A workbench for gene expression signature analysis

Directory of Open Access Journals (Sweden)

Chang Jeffrey T

2011-11-01

Background: The biological phenotype of a cell, such as a characteristic visual image or behavior, reflects activities derived from the expression of collections of genes. As such, an ability to measure the expression of these genes provides an opportunity to develop more precise and varied sets of phenotypes. However, using this approach requires computational methods that are difficult to implement and apply, and thus there is a critical need for intelligent software tools that can reduce the technical burden of the analysis. Tools for gene expression analyses are unusually difficult to implement in a user-friendly way because their application requires a combination of biological data curation, statistical computational methods, and database expertise. Results: We have developed SIGNATURE, a web-based resource that simplifies gene expression signature analysis by providing software, data, and protocols to perform the analysis successfully. This resource uses Bayesian methods for processing gene expression data coupled with a curated database of gene expression signatures, all carried out within a GenePattern web interface for easy use and access. Conclusions: SIGNATURE is available for public use at http://genepattern.genome.duke.edu/signature/.
Identifying key genes in rheumatoid arthritis by weighted gene co-expression network analysis.

Science.gov (United States)

Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin

2017-08-01

This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets, GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls), were downloaded from the Gene Expression Omnibus database. Characteristic genes were identified using the metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance, defined as the average gene significance of all genes in a module, was used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subjected to functional annotation and pathway enrichment analysis using the Database for Annotation, Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in a total of 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs, sanguinarine and papaverine, were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA. © 2017 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.
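The definition of module significance as the average gene significance can be made concrete with a short sketch. It assumes, as is common in WGCNA-style analyses but not stated in the abstract, that gene significance is the absolute Pearson correlation between a gene's expression profile and the disease status; the function names and the toy data are hypothetical.

```python
import math

def pearson(u, v):
    """Pearson correlation of two equal-length sequences (neither constant)."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    su = math.sqrt(sum((a - mu) ** 2 for a in u))
    sv = math.sqrt(sum((b - mv) ** 2 for b in v))
    return cov / (su * sv)

def module_significance(expression, modules, trait):
    """expression: gene -> list of expression values (one per sample)
    modules:    gene -> module label (e.g. a colour name)
    trait:      list of trait values per sample (e.g. 1 = RA, 0 = control)
    Returns module label -> mean gene significance."""
    per_module = {}
    for gene, label in modules.items():
        gs = abs(pearson(expression[gene], trait))  # assumed definition of gene significance
        per_module.setdefault(label, []).append(gs)
    return {label: sum(vals) / len(vals) for label, vals in per_module.items()}

# Toy data with hypothetical gene names.
trait = [0, 0, 1, 1]
expression = {"g1": [1, 1, 2, 2], "g2": [2, 2, 1, 1], "g3": [1, 2, 1, 2]}
modules = {"g1": "green", "g2": "green", "g3": "red"}
significance = module_significance(expression, modules, trait)
```

In this toy example the "green" module tracks the trait closely and the "red" module does not, which is the kind of contrast used to pick disease-related modules.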
Classification and evolutionary analysis of the basic helix-loop-helix gene family in the green anole lizard, Anolis carolinensis.

Science.gov (United States)

Liu, Ake; Wang, Yong; Zhang, Debao; Wang, Xuhua; Song, Huifang; Dang, Chunwang; Yao, Qin; Chen, Keping

2013-08-01

Basic helix-loop-helix (bHLH) proteins play essential regulatory roles in a variety of biological processes. These highly conserved proteins form a large transcription factor superfamily, and are commonly identified in large numbers within animal, plant, and fungal genomes. The bHLH domain has been well studied in many animal species, but has not yet been characterized in non-avian reptiles. In this study, we identified 102 putative bHLH genes in the genome of the green anole lizard, Anolis carolinensis. Based on phylogenetic analysis, these genes were classified into 43 families, with 43, 24, 16, 3, 10, and 3 members assigned to groups A, B, C, D, E, and F, respectively, and 3 members categorized as "orphans". Within-group evolutionary relationships inferred from the phylogenetic analysis were consistent with highly conserved patterns observed for introns and additional domains. Results from phylogenetic analysis of the H/E(spl) family suggest that genome and tandem gene duplications have contributed to this family's expansion. Our classification and evolutionary analysis has provided insights into the evolutionary diversification of animal bHLH genes, and should aid future studies on bHLH protein regulation of key growth and developmental processes.
Ignalina Safety Analysis Group

International Nuclear Information System (INIS)

Ushpuras, E.

1995-01-01

The article describes the fields of activity of the Ignalina NPP Safety Analysis Group (ISAG) at the Lithuanian Energy Institute and gives an overview of the main achievements since the group's establishment in 1992. The group works under the following guidelines: in-depth analysis of the fundamental physical processes of RBMK-1500 reactors; collection, systematization and verification of the design and operational data; simulation and analysis of potential accident consequences; analysis of thermohydraulic and neutronic characteristics of the plant; and provision of technical and scientific consultations to VATESI, governmental authorities, and international institutions participating in various projects aimed at Ignalina NPP safety enhancement. The ISAG carries out broad scientific co-operation programs with both Eastern and Western scientific groups, supplying engineering assistance for Ignalina NPP. ISAG is also participating in the joint Lithuanian-Swedish-Russian project Barselina, the first Probabilistic Safety Assessment (PSA) study of Ignalina NPP. Work is underway together with Maryland University (USA) on assessment of the accident confinement system for a range of breaks in the primary circuit. At present the ISAG personnel are also involved in a project funded by a grant from the Nuclear Safety Account, administered by the European Bank for Reconstruction and Development, for the preparation and review of an in-depth safety assessment of the Ignalina plant.
Association between polymorphism in STAT4 gene and risk of rheumatoid arthritis: a meta-analysis.

Science.gov (United States)

Tong, Guanghui; Zhang, Xiaochen; Tong, Weiwei; Liu, Yong

2013-05-01

Rheumatoid arthritis (RA) is a common chronic inflammatory autoimmune disease, affecting 1% of the population worldwide. Single nucleotide polymorphisms (SNPs) of the signal transducer and activator of transcription 4 (STAT4) gene are suspected to be related to the risk of RA. This meta-analysis aimed to evaluate the relationship between the rs7574865 polymorphism in the STAT4 gene and RA, and to examine whether the reported associations differ between ethnic groups. We retrieved the relevant articles from the PubMed, EMBASE and China National Knowledge Infrastructure (CNKI) databases. The odds ratios (ORs) and their 95% confidence intervals (95% CIs) associated with the minor T allele of the STAT4 rs7574865 SNP were extracted from the published studies and included in the analysis. Meta-analyses were performed on the total data set and separately for the major ethnic groups and by RF and anti-CCP status. All analyses were performed using the Stata software. Twenty-three articles were included in the present analysis. Meta-analysis showed an association between the STAT4 polymorphism and RA in all subjects (OR=1.299, 95% CI=1.230-1.371). The rs7574865 T allele was significantly associated with RA in both Caucasians and Asians, in both RF-positive and RF-negative patients versus controls, and in both anti-CCP-positive and anti-CCP-negative patients. As for genotypes of the rs7574865 polymorphism, all results were significant, whether in total subjects or in analyses stratified by ethnic group or by RF and anti-CCP status. The rs7574865 polymorphism in the STAT4 gene might be associated with RA susceptibility in total subjects, in the major ethnic groups and across anti-CCP and RF status. Copyright © 2012 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
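How per-study odds ratios and their 95% confidence intervals can be combined into a pooled estimate is sketched below. The abstract does not say whether a fixed-effect or random-effects model was used in Stata, so this sketch assumes the simpler fixed-effect, inverse-variance method on the log scale; the function name and the input values are illustrative only and are not data from the meta-analysis.

```python
import math

Z95 = 1.959963984540054  # standard normal quantile for a two-sided 95% interval

def pool_odds_ratios(studies):
    """Fixed-effect inverse-variance pooling (an assumed model choice).
    studies: list of (odds_ratio, ci_lower, ci_upper) tuples.
    Returns (pooled_or, pooled_lower, pooled_upper)."""
    weighted_sum = 0.0
    weight_total = 0.0
    for odds_ratio, lower, upper in studies:
        log_or = math.log(odds_ratio)
        # Recover the standard error of log(OR) from the reported 95% CI.
        se = (math.log(upper) - math.log(lower)) / (2 * Z95)
        weight = 1.0 / se ** 2
        weighted_sum += weight * log_or
        weight_total += weight
    pooled_log = weighted_sum / weight_total
    pooled_se = math.sqrt(1.0 / weight_total)
    return (math.exp(pooled_log),
            math.exp(pooled_log - Z95 * pooled_se),
            math.exp(pooled_log + Z95 * pooled_se))

# Two made-up studies with identical estimates: the pooled OR stays the same
# while the pooled interval becomes narrower.
pooled = pool_odds_ratios([(2.0, 1.0, 4.0), (2.0, 1.0, 4.0)])
```

A random-effects model would add a between-study variance term to each weight, which matters when the included studies are heterogeneous.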
Genome-wide identification and analysis of the SBP-box family genes in apple (Malus × domestica Borkh.).

Science.gov (United States)

Li, Jun; Hou, Hongmin; Li, Xiaoqin; Xiang, Jiang; Yin, Xiangjing; Gao, Hua; Zheng, Yi; Bassett, Carole L; Wang, Xiping

2013-09-01

SQUAMOSA promoter binding protein (SBP)-box genes encode a family of plant-specific transcription factors and play many crucial roles in plant development. In this study, 27 SBP-box gene family members were identified in the apple (Malus × domestica Borkh.) genome, 15 of which were suggested to be putative targets of MdmiR156. Plant SBPs were classified into eight groups according to the phylogenetic analysis of SBP-domain proteins. Gene structure, gene chromosomal location and synteny analyses of MdSBP genes within the apple genome demonstrated that tandem and segmental duplications, as well as whole genome duplications, have likely contributed to the expansion and evolution of the SBP-box gene family in apple. Additionally, synteny analysis between apple and Arabidopsis indicated that several paired homologs of MdSBP and AtSPL genes were located in syntenic genomic regions. Tissue-specific expression analysis of MdSBP genes in apple demonstrated their diversified spatiotemporal expression patterns. Most MdmiR156-targeted MdSBP genes, which had relatively high transcript levels in stems, leaves, apical buds and some floral organs, exhibited a more differential expression pattern than most MdmiR156-nontargeted MdSBP genes. Finally, expression analysis of MdSBP genes in leaves upon various plant hormone treatments showed that many MdSBP genes were responsive to different plant hormones, indicating that MdSBP genes may be involved in responses to hormone signaling during stress or in apple development. Copyright © 2013 Elsevier Masson SAS. All rights reserved.
Transcriptome Analysis and Discovery of Genes Involved in Immune Pathways from Coelomocytes of Sea Cucumber (Apostichopus japonicus) after Vibrio splendidus Challenge.

Science.gov (United States)

Gao, Qiong; Liao, Meijie; Wang, Yingeng; Li, Bin; Zhang, Zheng; Rong, Xiaojun; Chen, Guiping; Wang, Lan

2015-07-17

Vibrio splendidus has been identified as one of the major pathogenic factors for skin ulceration syndrome in sea cucumber (Apostichopus japonicus), which has severely limited the development of the sea cucumber culture industry. In order to screen for immune genes involved in the response to Vibrio splendidus challenge in sea cucumber and to explore the molecular mechanism of this process, transcriptome and gene expression profiling data from resistant and susceptible biotypes of sea cucumber challenged with Vibrio splendidus were collected for analysis. A total of 319,455,942 trimmed reads were obtained, which were assembled into 186,658 contigs. After that, 89,891 representative contigs (without isoforms) were clustered. Analysis of the gene expression profiles identified 358 differentially expressed genes (DEGs) in the bacterial-resistant group and 102 DEGs in the bacterial-susceptible group, compared with the control group. According to the published literature and annotation information from BLAST, GO and KEGG, 30 putative bacterial-resistant genes and 19 putative bacterial-susceptible genes were identified from the DEGs. The qRT-PCR results were consistent with the RNA-Seq results. Furthermore, many DEGs were involved in immune signaling related pathways, such as the Endocytosis, Lysosome, MAPK, Chemokine and ERBB signaling pathways.

Novel groups and unique distribution of phage phoH genes in paddy waters in northeast China

Science.gov (United States)

Wang, Xinzhen; Liu, Junjie; Yu, Zhenhua; Jin, Jian; Liu, Xiaobing; Wang, Guanghua

2016-01-01

Although bacteriophages are ubiquitous in various environments, their genetic diversity is primarily investigated in pelagic marine environments. Corresponding studies in terrestrial environments are few. In this study, we conducted the first survey of phage diversity in the paddy ecosystem by targeting a new viral biomarker gene, phoH. A total of 424 phoH sequences were obtained from four paddy waters generated from a pot experiment with different soils collected from open paddy fields in northeast China. The majority of phoH sequences in paddy waters were novel, with the highest identity of ≤70% with known phoH sequences. Four unique groups (Group α, Group β, Group γ and Group δ) and seven new subgroups (Group 2b, Group 3d, Group 3e, Group 6a, Group 6b, Group 6c and Group 6d) were formed exclusively with the clones from the paddy waters, suggesting novel phage phoH groups exist in the paddy ecosystem. Additionally, the distribution proportions of phoH clones in different groups varied among paddy water samples, suggesting the phage community in paddy fields is biogeographically distributed. Furthermore, non-metric multidimensional scaling analysis indicated that phage phoH assemblages in paddy waters were distinct from those in marine waters. PMID:27910929
Identification of the WRKY gene family and functional analysis of two genes in Caragana intermedia.

Science.gov (United States)

Wan, Yongqing; Mao, Mingzhu; Wan, Dongli; Yang, Qi; Yang, Feiyun; Mandlaa; Li, Guojing; Wang, Ruigang

2018-02-09

WRKY transcription factors, one of the largest families of transcriptional regulators in plants, play important roles in plant development and various stress responses. The WRKYs of Caragana intermedia are still not well characterized, although many WRKYs have been identified in various plant species. We identified 53 CiWRKY genes from C. intermedia transcriptome data, 28 of which exhibited complete open reading frames (ORFs). These CiWRKYs were divided into three groups via phylogenetic analysis according to their WRKY domains and zinc finger motifs. Conserved domain analysis showed that the CiWRKY proteins contain a highly conserved WRKYGQK motif and two variant motifs (WRKYGKK and WKKYEEK). The subcellular localization of CiWRKY26 and CiWRKY28-1 indicated that these two proteins localized exclusively to nuclei, supporting their role as transcription factors. The expression patterns of the 28 CiWRKYs with complete ORFs were examined through quantitative real-time PCR (qRT-PCR) in various tissues and under different abiotic stresses (drought, cold, salt, high-pH and abscisic acid (ABA)). The results showed that each CiWRKY responded to at least one stress treatment. Furthermore, overexpression of CiWRKY75-1 and CiWRKY40-4 in Arabidopsis thaliana suppressed the drought stress tolerance of the plants and delayed leaf senescence, respectively. Fifty-three CiWRKY genes from the C. intermedia transcriptome were identified and divided into three groups via phylogenetic analysis. The expression patterns of the 28 CiWRKYs under different abiotic stresses suggested that each CiWRKY responded to at least one stress treatment. Overexpression of CiWRKY75-1 and CiWRKY40-4 suppressed the drought stress tolerance of Arabidopsis and delayed leaf senescence, respectively. These results provide a basis for the molecular mechanism through which CiWRKYs mediate stress tolerance.
Global gene expression analysis of apple fruit development from the floral bud to ripe fruit

Directory of Open Access Journals (Sweden)

McArtney Steve

2008-02-01

Background: Apple fruit develop over a period of 150 days from anthesis to fully ripe. An array representing approximately 13000 genes (15726 oligonucleotides of 45–55 bases designed from apple ESTs) has been used to study gene expression over eight time points during fruit development. This analysis of gene expression lays the groundwork for a molecular understanding of fruit growth and development in apple. Results: Using ANOVA of the microarray data, 1955 genes showed significant changes in expression over this time course. Gene expression is coordinated, with four major patterns of expression observed: high in floral buds; high during cell division; high when starch levels and cell expansion rates peak; and high during ripening. Functional analysis associated cell cycle genes with early fruit development, and three core cell cycle genes are significantly up-regulated in the early stages of fruit development. Starch metabolic genes were associated with changes in starch levels during fruit development. Comparison with microarrays of ethylene-treated apple fruit identified a group of ethylene-induced genes that are also induced in normal fruit ripening. Comparison with fruit development microarrays in tomato has been used to identify 16 genes for which expression patterns are similar in apple and tomato; these genes may play fundamental roles in fruit development. The early phase of cell division and tissue specification that occurs in the first 35 days after pollination has been associated with up-regulation of a cluster of genes that includes core cell cycle genes. Conclusion: Gene expression in apple fruit is coordinated with specific developmental stages. The array results are reproducible, and comparisons with experiments in other species have been used to identify genes that may play a fundamental role in fruit development.
Gene Environment Interactions and Predictors of Colorectal Cancer in Family-Based, Multi-Ethnic Groups.

Science.gov (United States)

Shiao, S Pamela K; Grayson, James; Yu, Chong Ho; Wasek, Brandi; Bottiglieri, Teodoro

2018-02-16

For the personalization of polygenic/omics-based health care, the purpose of this study was to examine the gene-environment interactions and predictors of colorectal cancer (CRC) by including five key genes in the one-carbon metabolism pathways. In this proof-of-concept study, we included a total of 54 families and 108 participants, 54 CRC cases and 54 matched family friends, representing four major racial/ethnic groups in southern California (White, Asian, Hispanic, and Black). We used three phases of data analytics: exploratory, family-based analyses adjusting for the dependence within the family due to shared genetic heritage; the ensemble method; and generalized regression models for predictive modeling, with a machine learning validation procedure to validate the results for enhanced prediction and reproducibility. The results revealed that despite the family members sharing genetic heritage, the CRC group had greater combined gene polymorphism rates than the family controls (p [...] relation to gene-environment interactions in the prevention of CRC.
Stem Cell Gene Therapy for Fanconi Anemia: Report from the 1st International Fanconi Anemia Gene Therapy Working Group Meeting

Science.gov (United States)

Tolar, Jakub; Adair, Jennifer E; Antoniou, Michael; Bartholomae, Cynthia C; Becker, Pamela S; Blazar, Bruce R; Bueren, Juan; Carroll, Thomas; Cavazzana-Calvo, Marina; Clapp, D Wade; Dalgleish, Robert; Galy, Anne; Gaspar, H Bobby; Hanenberg, Helmut; Von Kalle, Christof; Kiem, Hans-Peter; Lindeman, Dirk; Naldini, Luigi; Navarro, Susana; Renella, Raffaele; Rio, Paula; Sevilla, Julián; Schmidt, Manfred; Verhoeyen, Els; Wagner, John E; Williams, David A; Thrasher, Adrian J

2011-01-01

Survival rates after allogeneic hematopoietic cell transplantation (HCT) for Fanconi anemia (FA) have increased dramatically since 2000. However, the use of autologous stem cell gene therapy, whereby the patient's own blood stem cells are modified to express the wild-type gene product, could potentially avoid the early and late complications of allogeneic HCT. Over the last decades, gene therapy has experienced a high degree of optimism interrupted by periods of diminished expectation. Optimism stems from recent examples of successful gene correction in several congenital immunodeficiencies, whereas diminished expectations come from the realization that gene therapy will not be free of side effects. The goal of the 1st International Fanconi Anemia Gene Therapy Working Group Meeting was to determine the optimal strategy for moving stem cell gene therapy into clinical trials for individuals with FA. To this end, key investigators examined vector design, transduction method, criteria for large-scale clinical-grade vector manufacture, hematopoietic cell preparation, and eligibility criteria for FA patients most likely to benefit. The report summarizes the roadmap for the development of gene therapy for FA. PMID:21540837
Detection and sequence analysis of accessory gene regulator genes of Staphylococcus pseudintermedius isolates

Directory of Open Access Journals (Sweden)

M. Ananda Chitra

2015-07-01

Background: Staphylococcus pseudintermedius (SP) is the major pathogenic species of dogs involved in a wide variety of skin and soft tissue infections. The accessory gene regulator (agr) locus of Staphylococcus aureus has been extensively studied, and it influences the expression of many virulence genes. It encodes a two-component signal transduction system that leads to down-regulation of surface proteins and up-regulation of secreted proteins during in vitro growth of S. aureus. The objective of this study was to detect and sequence-analyze the agrA, B, and D genes of SP isolated from canine skin infections. Materials and Methods: In this study, we isolated and identified SP from canine pyoderma and otitis cases by polymerase chain reaction (PCR) and confirmed the identification by PCR-restriction fragment length polymorphism. Primers for the SP agrA and agrBD genes were designed using online primer designing software and BLAST searched for specificity. Amplification of the agr genes was carried out for 53 isolates of SP by PCR, and sequencing of agrA, B, and D was carried out for five isolates and analyzed using DNAstar and Mega5.2 software. Results: A total of 53 (59%) SP isolates were obtained from 90 samples. Fifteen isolates (28%) were confirmed to be methicillin-resistant SP (MRSP) by detection of the mecA gene. Accessory gene regulator A, B, and D genes were detected in all the SP isolates. Complete nucleotide sequences of the above three genes for five isolates were submitted to GenBank, and their accession numbers are KJ133557 to KJ133571. AgrA amino acid sequence analysis showed that it is mainly made of alpha-helices and is hydrophilic in nature. AgrB is a transmembrane protein, and AgrD encodes the precursor of the autoinducing peptide (AIP). Sequencing of the agrD gene revealed that the 5 canine SP strains tested could be divided into three Agr specificity groups (RIPTSTGFF, KIPTSTGFF, and RIPISTGFF) based on the putative AIP produced by each strain.
Identification of a gene module associated with BMD through the integration of network analysis and genome-wide association data.

Science.gov (United States)

Farber, Charles R

2010-11-01

Bone mineral density (BMD) is influenced by a complex network of gene interactions; therefore, elucidating the relationships between genes and how those genes, in turn, influence BMD is critical for developing a comprehensive understanding of osteoporosis. To investigate the role of transcriptional networks in the regulation of BMD, we performed a weighted gene coexpression network analysis (WGCNA) using microarray expression data on monocytes from young individuals with low or high BMD. WGCNA groups genes into modules based on patterns of gene coexpression, and our analysis identified 11 gene modules. We observed that the overall expression of one module (referred to as module 9) was significantly higher in the low-BMD group (p = .03). Module 9 was highly enriched for genes belonging to the immune system-related gene ontology (GO) category "response to virus" (p = 7.6 × 10^-11). Using publicly available genome-wide association study data, we independently validated the importance of module 9 by demonstrating that highly connected module 9 hubs were more likely, relative to less highly connected genes, to be genetically associated with BMD. This study highlights the advantages of systems-level analyses to uncover coexpression modules associated with bone mass and suggests that particular monocyte expression patterns may mediate differences in BMD. © 2010 American Society for Bone and Mineral Research.
Ultrahigh-dimensional variable selection method for whole-genome gene-gene interaction analysis

Directory of Open Access Journals (Sweden)

Ueki Masao

2012-05-01

Full Text Available Background: Genome-wide gene-gene interaction analysis using single nucleotide polymorphisms (SNPs) is an attractive way to identify genetic components that confer susceptibility to human complex diseases. However, individual hypothesis testing for SNP-SNP pairs, as in a common genome-wide association study (GWAS), makes it difficult to set an overall p-value because of the complicated correlation structure; that is, the multiple testing problem causes unacceptable false negative results. The number of SNP-SNP pairs is much larger than the sample size, the so-called large p small n problem, which precludes simultaneous analysis using multiple regression. A method that overcomes these issues is thus needed. Results: We adopt a recent method for ultrahigh-dimensional variable selection, termed sure independence screening (SIS), to handle the very large number of SNP-SNP interactions by including them as predictor variables in logistic regression. We propose a ranking strategy using promising dummy coding methods, followed by the variable selection procedure of the SIS method, suitably modified for gene-gene interaction analysis. We also implemented the procedures in a software program, EPISIS, using cost-effective GPGPU (general-purpose computing on graphics processing units) technology. EPISIS can complete an exhaustive search for SNP-SNP interactions in a standard GWAS dataset within several hours. The proposed method works successfully in simulation experiments and in application to real WTCCC (Wellcome Trust Case-Control Consortium) data. Conclusions: Based on the machine-learning principle, the proposed method gives a powerful and flexible genome-wide search for various patterns of gene-gene interaction.
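The screening step can be sketched as follows. This is a simplified illustration and not the EPISIS implementation: it assumes 0/1 genotype coding, codes each SNP-SNP interaction as a simple product, and ranks pairs by absolute Pearson correlation with the phenotype as a stand-in for the marginal logistic-regression utility described above. The function names and the toy data are hypothetical.

```python
from itertools import combinations
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length lists (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / sqrt(sxx * syy)

def screen_interactions(genotypes, phenotype, keep):
    """Rank every SNP-SNP product term by marginal association; keep the top `keep` pairs."""
    n_snps = len(genotypes[0])
    scores = []
    for i, j in combinations(range(n_snps), 2):
        feature = [row[i] * row[j] for row in genotypes]  # assumed product coding
        scores.append((abs(pearson(feature, phenotype)), (i, j)))
    scores.sort(key=lambda t: (-t[0], t[1]))  # strongest first, ties by pair index
    return [pair for _, pair in scores[:keep]]

# Toy data (made up): 8 individuals, 4 SNPs coded 0/1.
genotypes = [
    [1, 0, 1, 0],
    [1, 1, 1, 1],
    [1, 1, 0, 0],
    [0, 1, 1, 0],
    [0, 0, 1, 1],
    [1, 0, 0, 1],
    [0, 1, 0, 1],
    [1, 1, 1, 0],
]
# Case status is driven by SNP 0 and SNP 2 acting together.
phenotype = [row[0] * row[2] for row in genotypes]

print(screen_interactions(genotypes, phenotype, keep=2))
```

Only the retained pairs would then enter a joint model, which is what makes the large p small n setting tractable.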
QTL mapping and transcriptome analysis of cowpea reveals candidate genes for root-knot nematode resistance.

Science.gov (United States)

Santos, Jansen Rodrigo Pereira; Ndeve, Arsenio Daniel; Huynh, Bao-Lam; Matthews, William Charles; Roberts, Philip Alan

2018-01-01

Cowpea is one of the most important food and forage legumes in drier regions of the tropics and subtropics. However, cowpea yield worldwide is markedly below the known potential due to abiotic and biotic stresses, including parasitism by root-knot nematodes (Meloidogyne spp., RKN). Two resistance genes with dominant effect, Rk and Rk2, have been reported to provide resistance against RKN in cowpea. Despite their description and use in breeding for resistance to RKN and particularly genetic mapping of the Rk locus, the exact genes conferring resistance to RKN remain unknown. In the present work, QTL mapping using recombinant inbred line (RIL) population 524B x IT84S-2049 segregating for a newly mapped locus and analysis of the transcriptome changes in two cowpea near-isogenic lines (NIL) were used to identify candidate genes for Rk and the newly mapped locus. A major QTL, designated QRk-vu9.1, associated with resistance to Meloidogyne javanica reproduction, was detected and mapped on linkage group LG9 at position 13.37 cM using egg production data. Transcriptome analysis on resistant and susceptible NILs 3 and 9 days after inoculation revealed up-regulation of 109 and 98 genes and down-regulation of 110 and 89 genes, respectively, out of 19,922 unique genes mapped to the common bean reference genome. Among the differentially expressed genes, four and nine genes were found within the QRk-vu9.1 and QRk-vu11.1 QTL intervals, respectively. Six of these genes belong to the TIR-NBS-LRR family of resistance genes and three were upregulated at one or more time-points. Quantitative RT-PCR validated gene expression to be positively correlated with RNA-seq expression pattern for eight genes. Future functional analysis of these cowpea genes will enhance our understanding of Rk-mediated resistance and identify the specific gene responsible for the resistance.
Whole genome sequencing as a tool for phylogenetic analysis of clinical strains of Mitis group streptococci

DEFF Research Database (Denmark)

Rasmusen, L. H.; Dargis, R.; Iversen, Katrine Højholt

2016-01-01

Identification of Mitis group streptococci (MGS) to the species level is challenging for routine microbiology laboratories. Correct identification is crucial for the diagnosis of infective endocarditis, identification of treatment failure, and/or infection relapse. Eighty MGS from Danish patients... observed in single gene analyses. Species identification based on single gene analysis showed its limitations when more strains were included. In contrast, analyses incorporating more sequence data, like MLSA, SNPs and core-genome analyses, provided more distinct clustering. The core-genome tree showed...
In silico analysis of miRNA-mediated gene regulation in OCA and OA genes.

Science.gov (United States)

Kamaraj, Balu; Gopalakrishnan, Chandrasekhar; Purohit, Rituraj

2014-12-01

Albinism is an autosomal recessive genetic disorder caused by low melanin production. The oculocutaneous albinism (OCA) and ocular albinism (OA) genes are responsible for melanin production and are also potential targets for miRNAs. miRNAs inhibit protein synthesis partially or completely by binding to the 3'UTR of the mRNA, thus regulating gene expression. In this analysis, we predicted genetic variation in the 3'UTR of the transcript that could reduce melanin production and thereby cause albinism. Single nucleotide polymorphisms (SNPs) in the 3'UTR of the mRNA of OCA and OA genes can create new binding sites for miRNAs, which may then inhibit translation partially or completely, alter gene expression and lead to hypopigmentation. We developed a computational procedure to identify SNPs in the 3'UTR of the mRNA of OCA (TYR, OCA2, TYRP1 and SLC45A2) and OA (GPR143) genes that are a potential cause of albinism. We identified 37 SNPs in five genes that are predicted to create 87 new binding sites on mRNA, which may lead to abrogation of the translation process. Expression analysis confirms that these genes are highly expressed in skin and eye regions. Enrichment analysis supports that these genes are mainly involved in eye pigmentation and the melanin biosynthesis process. The network analysis also shows how the genes interact and are expressed in a complex network. These insights offer clues for wet-lab research into the expression pattern of OCA and OA genes and the binding of mRNA and miRNA upon mutation, which is responsible for inhibition of the translation process.
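How a 3'UTR SNP can create a new miRNA binding site can be shown with a toy check. This is a sketch under simplifying assumptions, not the procedure used in this record: it treats a site as a perfect match to the reverse complement of miRNA nucleotides 2-8 (a 7-mer seed), whereas real predictors weigh many more features. The sequences and function names are made up.

```python
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}

def seed_site(mirna):
    """3'UTR motif that pairs with miRNA nucleotides 2-8 (reverse complement of the seed)."""
    seed = mirna[1:8]
    return "".join(COMPLEMENT[base] for base in reversed(seed))

def count_sites(utr, site):
    """Number of (possibly overlapping) occurrences of the site in the UTR."""
    width = len(site)
    return sum(1 for i in range(len(utr) - width + 1) if utr[i:i + width] == site)

def snp_creates_site(utr, pos, alt, mirna):
    """True if substituting `alt` at 0-based `pos` adds a seed match for this miRNA."""
    site = seed_site(mirna)
    mutated = utr[:pos] + alt + utr[pos + 1:]
    return count_sites(mutated, site) > count_sites(utr, site)

# Hypothetical RNA sequences, written 5' to 3'.
mirna = "UACGGAUCCAGUUGACCAUGGA"
utr = "AAGAUCCAUAA"

print(seed_site(mirna))
print(snp_creates_site(utr, pos=7, alt="G", mirna=mirna))
```

Scanning every 3'UTR SNP against every miRNA seed in this way is what yields counts such as the SNPs and new binding sites reported above, although the exact matching rules there may differ.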
MiR-210 disturbs mitotic progression through regulating a group of mitosis-related genes.

Science.gov (United States)

He, Jie; Wu, Jiangbin; Xu, Naihan; Xie, Weidong; Li, Mengnan; Li, Jianna; Jiang, Yuyang; Yang, Burton B; Zhang, Yaou

2013-01-07

MiR-210 is up-regulated in multiple cancer types, but its function is disputed and further investigation is necessary. Using a bioinformatics approach, we identified the putative target genes of miR-210 in hypoxia-induced CNE cells on a genome-wide scale. Two functional gene groups related to cell cycle and RNA processing were recognized as the major targets of miR-210. Here, we investigated the molecular mechanism and biological consequence of miR-210 in cell cycle regulation, particularly mitosis. Hypoxia-induced up-regulation of miR-210 was highly correlated with the down-regulation of a group of mitosis-related genes, including Plk1, Cdc25B, Cyclin F, Bub1B and Fam83D. MiR-210 suppressed the expression of these genes by directly targeting their 3'-UTRs. Over-expression of exogenous miR-210 disturbed mitotic progression and caused aberrant mitosis. Furthermore, a miR-210 mimic at pharmacological doses reduced tumor formation in a mouse metastatic tumor model. Taken together, these results imply that miR-210 disturbs mitosis by targeting multiple genes involved in mitotic progression, which may contribute to its inhibitory role in tumor formation.
Selection of reference genes for expression analysis of Kumamoto and Portuguese oysters and their hybrid

Science.gov (United States)

Yan, Lulu; Su, Jiaqi; Wang, Zhaoping; Yan, Xiwu; Yu, Ruihai

2017-12-01

Quantitative real-time polymerase chain reaction (qRT-PCR) is a rapid and reliable technique that has been widely used to quantify gene transcripts (expression analysis). It is also employed for studying heterosis, hybridization breeding and hybrid tolerability of oysters, an ecologically and economically important taxonomic group. For these studies, selection of a suitable set of housekeeping genes as references is crucial for correct interpretation of qRT-PCR data. To identify suitable reference genes for oysters during low temperature and low salinity stresses, we analyzed twelve genes from the gill tissue of Crassostrea sikamea (SS), Crassostrea angulata (AA) and their hybrid (SA), which included three ribosomal genes, 28S ribosomal protein S5 (RPS5), ribosomal protein L35 (RPL35), and 60S ribosomal protein L29 (RPL29); three structural genes, tubulin gamma (TUBγ), annexin A6 and A7 (AA6 and AA7); three metabolic pathway genes, ornithine decarboxylase (OD), glyceraldehyde-3-phosphate dehydrogenase (GAPDH) and glutathione S-transferase P1 (GSP); two transcription factors, elongation factor 1 alpha and beta (EF1α and EF1β); and one protein synthesis gene, ubiquitin (UBQ). Primers specific for these genes were successfully developed for the three groups of oysters. Three different algorithms, geNorm, NormFinder and BestKeeper, were used to evaluate the expression stability of these candidate genes. The BestKeeper program was found to be the most reliable. Based on our analysis, we found that the expression of RPL35 and EF1α was stable under low salinity stress, and the expression of OD, GAPDH and EF1α was stable under low temperature stress in hybrid (SA) oyster; the expression of RPS5 and GAPDH was stable under low salinity stress, and the expression of RPS5, UBQ, GAPDH was stable under low temperature stress in SS oyster; the expression of RPS5, GAPDH, EF1β and AA7 was stable under low salinity stress, and the expression of RPL35, EF1α, GAPDH
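As an illustration of how expression stability can be scored, the sketch below computes a geNorm-style stability value M: for each candidate gene, the mean over all other candidates of the standard deviation (across samples) of the log2 expression ratio. Lower M suggests more stable expression. It is a minimal sketch, not the geNorm software; it assumes the inputs are relative expression quantities rather than raw Ct values, uses the sample standard deviation, and runs on made-up numbers. Only the gene labels are borrowed from the abstract.

```python
from math import log2
from statistics import mean, stdev

def genorm_m(quantities):
    """geNorm-style M per gene: mean SD of log2 ratios against every other gene."""
    genes = list(quantities)
    m_values = {}
    for j in genes:
        pairwise_sd = []
        for k in genes:
            if k == j:
                continue
            log_ratios = [log2(a / b) for a, b in zip(quantities[j], quantities[k])]
            pairwise_sd.append(stdev(log_ratios))
        m_values[j] = mean(pairwise_sd)
    return m_values

# Made-up relative quantities for four samples (not data from the study).
quantities = {
    "RPS5":  [1.00, 1.05, 0.95, 1.02],
    "GAPDH": [2.00, 2.10, 1.90, 2.04],
    "OD":    [1.00, 2.50, 0.40, 1.80],
}

for gene, m in sorted(genorm_m(quantities).items(), key=lambda t: t[1]):
    print(gene, round(m, 3))
```

In this toy set the two genes that vary in proportion to each other receive low M values, while the gene that fluctuates independently is ranked least stable.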
Genome-wide analysis of the Dof transcription factor gene family reveals soybean-specific duplicable and functional characteristics.

Directory of Open Access Journals (Sweden)

Yong Guo

Full Text Available The Dof domain protein family is a classic plant-specific zinc-finger transcription factor family involved in a variety of biological processes. There is great diversity in the number of Dof genes in different plants. However, there are only very limited reports on the characterization of Dof transcription factors in soybean (Glycine max). In the present study, 78 putative Dof genes were identified from the whole-genome sequence of soybean. The predicted GmDof genes were non-randomly distributed within and across 19 out of 20 chromosomes, and 97.4% (38 pairs) were preferentially retained duplicate paralogous genes located in duplicated regions of the genome. Soybean-specific segmental duplications contributed significantly to the expansion of the soybean Dof gene family. These Dof proteins were phylogenetically clustered into nine distinct subgroups among which the gene structure and motif compositions were considerably conserved. Comparative phylogenetic analysis of these Dof proteins revealed four major groups, similar to those reported for Arabidopsis and rice. Most of the GmDofs showed specific expression patterns based on RNA-seq data analyses. The expression patterns of some duplicate genes were partially redundant while others showed functional diversity, suggesting the occurrence of sub-functionalization during subsequent evolution. Comprehensive expression profile analysis also provided insights into the soybean-specific functional divergence among members of the Dof gene family. Cis-regulatory element analysis of these GmDof genes suggested diverse functions associated with different processes. Taken together, our results provide useful information for the functional characterization of soybean Dof genes by combining phylogenetic analysis with global gene-expression profiling.
Human microRNA target analysis and gene ontology clustering by GOmir, a novel stand-alone application.

Science.gov (United States)

Roubelakis, Maria G; Zotos, Pantelis; Papachristoudis, Georgios; Michalopoulos, Ioannis; Pappa, Kalliopi I; Anagnou, Nicholas P; Kossida, Sophia

2009-06-16

microRNAs (miRNAs) are single-stranded RNA molecules of about 20-23 nucleotides in length found in a wide variety of organisms. miRNAs regulate gene expression by interacting with target mRNAs at specific sites in order to induce cleavage of the message or inhibit translation. Predicting or verifying mRNA targets of specific miRNAs is a difficult process of great importance. GOmir is a novel stand-alone application consisting of two separate tools: JTarget and TAGGO. JTarget integrates miRNA target prediction and functional analysis by combining the predicted target genes from the TargetScan, miRanda, RNAhybrid and PicTar computational tools with the experimentally supported targets from TarBase, and by providing a full gene description and functional analysis for each target gene. TAGGO, in turn, is designed to automatically group gene ontology annotations, taking advantage of the Gene Ontology (GO), in order to extract the main attributes of sets of proteins. GOmir thus integrates two separate Java applications into one stand-alone Java application. Using up to five different databases, GOmir presents predicted miRNA targets accompanied by (a) full gene description, (b) functional analysis and (c) detailed gene ontology clustering. Additionally, a reverse search initiated by a potential target can also be conducted. GOmir can be freely downloaded from BRFAA.
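Combining target lists from several predictors, as described for JTarget, can be pictured with plain set operations. The sketch below is not GOmir's code or its actual logic: it simply counts how many tools predict each gene and keeps those meeting a chosen threshold. The tool names come from the abstract, while the gene identifiers, the function name and the `min_tools` parameter are hypothetical.

```python
from collections import Counter

def combine_targets(predictions, min_tools=1):
    """Genes predicted by at least `min_tools` tools (min_tools=1 gives the union)."""
    support = Counter(
        gene for targets in predictions.values() for gene in set(targets)
    )
    return sorted(gene for gene, n in support.items() if n >= min_tools)

# Made-up predictions for one miRNA.
predictions = {
    "TargetScan": {"GENE1", "GENE2", "GENE3"},
    "miRanda": {"GENE2", "GENE3", "GENE4"},
    "RNAhybrid": {"GENE3", "GENE5"},
    "PicTar": {"GENE2", "GENE3"},
}

print(combine_targets(predictions))               # union of all tools
print(combine_targets(predictions, min_tools=3))  # supported by three or more tools
print(combine_targets(predictions, min_tools=4))  # predicted by every tool
```

Requiring agreement between more tools trades sensitivity for specificity, which is one reason to expose the threshold rather than fix it.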
Genome-wide identification and comparative expression analysis of LEA genes in watermelon and melon genomes.

Science.gov (United States)

Celik Altunoglu, Yasemin; Baloglu, Mehmet Cengiz; Baloglu, Pinar; Yer, Esra Nurten; Kara, Sibel

2017-01-01

Late embryogenesis abundant (LEA) proteins are a large and diverse group of polypeptides that were first identified during seed dehydration and then in vegetative plant tissues during different stress responses. Gene family members of LEA proteins have now been detected in various organisms. However, there was no report of this protein family in watermelon and melon until this study. A total of 73 LEA genes from watermelon (ClLEA) and 61 LEA genes from melon (CmLEA) were identified in this comprehensive study. They were classified into four and three distinct clusters in watermelon and melon, respectively. There was a correlation between gene structure and motif composition within each LEA group. Segmental duplication played an important role in LEA gene expansion in watermelon. Maximum gene ontology of LEA genes was observed with poplar LEA genes. To evaluate tissue-specific expression patterns of ClLEA and CmLEA genes, publicly available RNA-seq data were analyzed. The expression of selected LEA genes in root and leaf tissues of drought-stressed watermelon and melon was examined using qRT-PCR. Among them, the ClLEA-12, ClLEA-17 and ClLEA-46 genes were quickly induced after drought application. Therefore, they might be considered early response genes for water limitation conditions in watermelon. In addition, the CmLEA-42 and CmLEA-43 genes were found to be up-regulated in both tissues of melon under drought stress. Our results can open up new frontiers in understanding the functions of these important family members under normal developmental stages and stress conditions through bioinformatics and transcriptomic approaches.
Comprehensive analysis of gene expression patterns of hedgehog-related genes

Directory of Open Access Journals (Sweden)

Baillie David

2006-10-01

Full Text Available Background: The Caenorhabditis elegans genome encodes ten proteins that share sequence similarity with the Hedgehog signaling molecule through their C-terminal autoprocessing Hint/Hog domain. These proteins contain novel N-terminal domains, and C. elegans encodes dozens of additional proteins containing only these N-terminal domains. These gene families are called warthog, groundhog, ground-like and quahog, collectively called hedgehog (hh)-related genes. Previously, the expression pattern of seventeen genes was examined, which showed that they are primarily expressed in the ectoderm. Results: With the completion of the C. elegans genome sequence in November 2002, we reexamined and identified 61 hh-related ORFs. Further, we identified 49 hh-related ORFs in C. briggsae. ORF analysis revealed that 30% of the genes still had errors in their predictions and we improved these predictions here. We performed a comprehensive expression analysis using GFP fusions of the putative intergenic regulatory sequence with one or two transgenic lines for most genes. The hh-related genes are expressed in one or a few of the following tissues: hypodermis, seam cells, excretory duct and pore cells, vulval epithelial cells, rectal epithelial cells, pharyngeal muscle or marginal cells, arcade cells, support cells of sensory organs, and neuronal cells. Using time-lapse recordings, we discovered that some hh-related genes are expressed in a cyclical fashion in phase with molting during larval development. We also generated several translational GFP fusions, but they did not show any subcellular localization. In addition, we also studied the expression patterns of two genes with similarity to Drosophila frizzled, T23D8.1 and F27E11.3A, and the ortholog of the Drosophila gene dally-like, gpn-1, which is a heparan sulfate proteoglycan. The two frizzled homologs are expressed in a few neurons in the head, and gpn-1 is expressed in the pharynx. Finally, we compare the
Toxigenic genes, spoilage potential, and antimicrobial resistance of Bacillus cereus group strains from ice cream.

Science.gov (United States)

Arslan, Seza; Eyi, Ayla; Küçüksarı, Rümeysa

2014-02-01

Bacillus spp. can be recovered from almost every environment. They are also found readily in foods, where they may cause food spoilage and/or food poisoning due to their toxigenic and pathogenic nature and their extracellular enzymes. In this study, 29 Bacillus cereus group strains from ice cream were examined for the presence of the virulence genes hblC, nheA, cytK and ces, and tested for a range of extracellular enzymes and for antimicrobial susceptibility. The strains were found to produce extracellular enzymes: proteolytic and lipolytic activity, gelatin hydrolysis and lecithinase production (100%), DNase production (93.1%) and amylase activity (93.1%). Of 29 strains examined, 24 (82.8%) showed hemolytic activity on blood agar. Beta-lactamase was produced by only 20.7% of the B. cereus group strains. Among the 29 B. cereus group strains from ice cream, nheA was the most common virulence gene, detected in 44.8% of the strains, followed by hblC with 17.2%. Four (13.8%) of the 29 strains were positive for both hblC and nheA. In contrast, cytK and ces were not detected in any of the strains. Antimicrobial susceptibility of the ice cream isolates to 14 different antimicrobial agents was tested using the disc diffusion method. We detected resistance to penicillin and ampicillin at the same rate of 89.7%. Thirty-one percent of the strains were multiresistant to three or more antibiotics. This study emphasizes that natural isolates of Bacillus spp. that harbor one or more enterotoxin genes, produce extracellular enzymes that may cause spoilage, and acquire antibiotic resistance might be of crucial importance for food safety and quality. Copyright © 2013 Elsevier Ltd. All rights reserved.
Coverage analysis of lists of genes involved in heterogeneous ...

Indian Academy of Sciences (India)

Genes involved in myopathies: 82 genes, based on the disease groups ... 605517 Muscular dystrophy-dystroglycanopathy (congenital with brain and eye ..... Epilepsy, X-linked, with variable learning disabilities and behavior disorders. 300491.
Transcriptomics Analysis of Crassostrea hongkongensis for the Discovery of Reproduction-Related Genes

Science.gov (United States)

Tong, Ying; Zhang, Yang; Huang, Jiaomei; Xiao, Shu; Zhang, Yuehuan; Li, Jun; Chen, Jinhui; Yu, Ziniu

2015-01-01

Background: The reproductive mechanisms of mollusk species have been interesting targets in biological research because of the diverse reproductive strategies observed in this phylum. These species have also been studied for the development of fishery technologies in molluscan aquaculture. Although the molecular mechanisms underlying the reproductive process have been well studied in animal models, the relevant information from mollusks remains limited, particularly in species of great commercial interest. Crassostrea hongkongensis is the dominant oyster species that is distributed along the coast of the South China Sea and little genomic information on this species is available. Currently, high-throughput sequencing techniques have been widely used for investigating the basis of physiological processes and facilitating the establishment of adequate genetic selection programs. Results: The C. hongkongensis transcriptome included a total of 1,595,855 reads, which were generated by 454 sequencing and were assembled into 41,472 contigs using de novo methods. Contigs were clustered into 33,920 isotigs and further grouped into 22,829 isogroups. Approximately 77.6% of the isogroups were successfully annotated by the Nr database. More than 1,910 genes were identified as being related to reproduction. Some key genes involved in germline development, sex determination and differentiation were identified for the first time in C. hongkongensis (nanos, piwi, ATRX, FoxL2, β-catenin, etc.). Gene expression analysis indicated that vasa, nanos, piwi, ATRX, FoxL2, β-catenin and SRD5A1 were highly or specifically expressed in C. hongkongensis gonads. Additionally, 94,056 single nucleotide polymorphisms (SNPs) and 1,699 simple sequence repeats (SSRs) were compiled. Conclusions: Our study significantly increased C. hongkongensis genomic information based on transcriptomics analysis. The group of reproduction-related genes identified in the present study constitutes a new tool for research

Transcriptomics Analysis of Crassostrea hongkongensis for the Discovery of Reproduction-Related Genes.

Directory of Open Access Journals (Sweden)

Ying Tong

Full Text Available The reproductive mechanisms of mollusk species have been interesting targets in biological research because of the diverse reproductive strategies observed in this phylum. These species have also been studied for the development of fishery technologies in molluscan aquaculture. Although the molecular mechanisms underlying the reproductive process have been well studied in animal models, the relevant information from mollusks remains limited, particularly in species of great commercial interest. Crassostrea hongkongensis is the dominant oyster species that is distributed along the coast of the South China Sea and little genomic information on this species is available. Currently, high-throughput sequencing techniques have been widely used for investigating the basis of physiological processes and facilitating the establishment of adequate genetic selection programs. The C. hongkongensis transcriptome included a total of 1,595,855 reads, which were generated by 454 sequencing and were assembled into 41,472 contigs using de novo methods. Contigs were clustered into 33,920 isotigs and further grouped into 22,829 isogroups. Approximately 77.6% of the isogroups were successfully annotated by the Nr database. More than 1,910 genes were identified as being related to reproduction. Some key genes involved in germline development, sex determination and differentiation were identified for the first time in C. hongkongensis (nanos, piwi, ATRX, FoxL2, β-catenin, etc.). Gene expression analysis indicated that vasa, nanos, piwi, ATRX, FoxL2, β-catenin and SRD5A1 were highly or specifically expressed in C. hongkongensis gonads. Additionally, 94,056 single nucleotide polymorphisms (SNPs) and 1,699 simple sequence repeats (SSRs) were compiled. Our study significantly increased C. hongkongensis genomic information based on transcriptomics analysis. The group of reproduction-related genes identified in the present study constitutes a new tool for research on bivalve
When Is Hub Gene Selection Better than Standard Meta-Analysis?

Science.gov (United States)

Langfelder, Peter; Mischel, Paul S.; Horvath, Steve

2013-01-01

Since hub nodes have been found to play important roles in many networks, highly connected hub genes are expected to play an important role in biology as well. However, the empirical evidence remains ambiguous. An open question is whether (or when) hub gene selection leads to more meaningful gene lists than a standard statistical analysis based on significance testing when analyzing genomic data sets (e.g., gene expression or DNA methylation data). Here we address this question for the special case when multiple genomic data sets are available. This is of great practical importance since for many research questions multiple data sets are publicly available. In this case, the data analyst can decide between a standard statistical approach (e.g., based on meta-analysis) and a co-expression network analysis approach that selects intramodular hubs in consensus modules. We assess the performance of these two types of approaches according to two criteria. The first criterion evaluates the biological insights gained and is relevant in basic research. The second criterion evaluates the validation success (reproducibility) in independent data sets and often applies in clinical diagnostic or prognostic applications. We compare meta-analysis with consensus network analysis based on weighted correlation network analysis (WGCNA) in three comprehensive and unbiased empirical studies: (1) Finding genes predictive of lung cancer survival, (2) finding methylation markers related to age, and (3) finding mouse genes related to total cholesterol. The results demonstrate that intramodular hub gene status with respect to consensus modules is more useful than a meta-analysis p-value when identifying biologically meaningful gene lists (reflecting criterion 1). However, standard meta-analysis methods perform as good as (if not better than) a consensus network approach in terms of validation success (criterion 2). The article also reports a comparison of meta-analysis techniques applied to
When is hub gene selection better than standard meta-analysis?

Directory of Open Access Journals (Sweden)

Peter Langfelder

Full Text Available Since hub nodes have been found to play important roles in many networks, highly connected hub genes are expected to play an important role in biology as well. However, the empirical evidence remains ambiguous. An open question is whether (or when) hub gene selection leads to more meaningful gene lists than a standard statistical analysis based on significance testing when analyzing genomic data sets (e.g., gene expression or DNA methylation data). Here we address this question for the special case when multiple genomic data sets are available. This is of great practical importance since for many research questions multiple data sets are publicly available. In this case, the data analyst can decide between a standard statistical approach (e.g., based on meta-analysis) and a co-expression network analysis approach that selects intramodular hubs in consensus modules. We assess the performance of these two types of approaches according to two criteria. The first criterion evaluates the biological insights gained and is relevant in basic research. The second criterion evaluates the validation success (reproducibility) in independent data sets and often applies in clinical diagnostic or prognostic applications. We compare meta-analysis with consensus network analysis based on weighted correlation network analysis (WGCNA) in three comprehensive and unbiased empirical studies: (1) Finding genes predictive of lung cancer survival, (2) finding methylation markers related to age, and (3) finding mouse genes related to total cholesterol. The results demonstrate that intramodular hub gene status with respect to consensus modules is more useful than a meta-analysis p-value when identifying biologically meaningful gene lists (reflecting criterion 1). However, standard meta-analysis methods perform as good as (if not better than) a consensus network approach in terms of validation success (criterion 2). The article also reports a comparison of meta-analysis techniques
Gene Ontology-Based Analysis of Zebrafish Omics Data Using the Web Tool Comparative Gene Ontology.

Science.gov (United States)

Ebrahimie, Esmaeil; Fruzangohar, Mario; Moussavi Nik, Seyyed Hani; Newman, Morgan

2017-10-01

Gene Ontology (GO) analysis is a powerful tool in systems biology, which uses a defined nomenclature to annotate genes/proteins within three categories: "Molecular Function," "Biological Process," and "Cellular Component." GO analysis can assist in revealing functional mechanisms underlying observed patterns in transcriptomic, genomic, and proteomic data. The already extensive and increasing use of zebrafish for modeling genetic and other diseases highlights the need to develop a GO analytical tool for this organism. The web tool Comparative GO was originally developed for GO analysis of bacterial data in 2013 (www.comparativego.com). We have now upgraded and elaborated this web tool for analysis of zebrafish genetic data using GOs and annotations from the Gene Ontology Consortium.
A comparative study of three different gene expression analysis methods.

Science.gov (United States)

Choe, Jae Young; Han, Hyung Soo; Lee, Seon Duk; Lee, Hanna; Lee, Dong Eun; Ahn, Jae Yun; Ryoo, Hyun Wook; Seo, Kang Suk; Kim, Jong Kun

2017-12-04

TNF-α regulates immune cells and acts as an endogenous pyrogen. Reverse transcription polymerase chain reaction (RT-PCR) is one of the most commonly used methods for gene expression analysis. Among the alternatives to PCR, loop-mediated isothermal amplification (LAMP) shows good potential in terms of specificity and sensitivity. However, few studies have compared RT-PCR and LAMP for human gene expression analysis. Therefore, in the present study, we compared one-step RT-PCR, two-step RT-LAMP and one-step RT-LAMP for human gene expression analysis, using the human TNF-α gene from peripheral blood cells as a biomarker. Total RNA from three selected febrile patients was subjected to the three methods. The detection limits of one-step RT-PCR and one-step RT-LAMP were the same, while that of two-step RT-LAMP was inferior. One-step RT-LAMP takes less time, and the experimental result is easy to determine. One-step RT-LAMP is a potentially useful and complementary tool that is fast and reasonably sensitive. In addition, one-step RT-LAMP could be useful in environments lacking specialized equipment or expertise.
Genome-wide analysis of the SBP-box gene family in Chinese cabbage (Brassica rapa subsp. pekinensis).

Science.gov (United States)

Tan, Hua-Wei; Song, Xiao-Ming; Duan, Wei-Ke; Wang, Yan; Hou, Xi-Lin

2015-11-01

The SQUAMOSA PROMOTER BINDING PROTEIN (SBP)-box gene family contains highly conserved plant-specific transcription factors that play an important role in plant development, especially in flowering. Chinese cabbage (Brassica rapa subsp. pekinensis) is a leafy vegetable grown worldwide and is used as a model crop for research in genome duplication. The present study aimed to characterize the SBP-box transcription factor genes in Chinese cabbage. Twenty-nine SBP-box genes were identified in the Chinese cabbage genome and classified into six groups. We identified 23 orthologous and 5 co-orthologous SBP-box gene pairs between Chinese cabbage and Arabidopsis. An interaction network among these genes was constructed. Sixteen SBP-box genes were expressed more abundantly in flowers than in other tissues, suggesting their involvement in flowering. We show that the MiR156/157 family members may regulate the coding regions or 3'-UTR regions of Chinese cabbage SBP-box genes. As SBP-box genes were found to potentially participate in some plant development pathways, quantitative real-time PCR analysis was performed and showed that Chinese cabbage SBP-box genes were also sensitive to the exogenous hormones methyl jasmonic acid and salicylic acid. The SBP-box genes have undergone gene duplication and loss, evolving a more refined regulation for diverse stimulation in plant tissues. Our comprehensive genome-wide analysis provides insights into the SBP-box gene family of Chinese cabbage.
Identification of pathogenic genes related to rheumatoid arthritis through integrated analysis of DNA methylation and gene expression profiling.

Science.gov (United States)

Zhang, Lei; Ma, Shiyun; Wang, Huailiang; Su, Hang; Su, Ke; Li, Longjie

2017-11-15

The purpose of our study was to identify new pathogenic genes used for exploring the pathogenesis of rheumatoid arthritis (RA). To screen pathogenic genes of RA, an integrated analysis was performed by using the microarray datasets in RA derived from the Gene Expression Omnibus (GEO) database. The functional annotation and potential pathways of differentially expressed genes (DEGs) were further discovered by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis. Afterwards, the integrated analysis of DNA methylation and gene expression profiling was used to screen crucial genes. In addition, we used RT-PCR and MSP to verify the expression levels and methylation status of these crucial genes in 20 synovial biopsy samples obtained from 10 RA model mice and 10 normal mice. BCL11B, CCDC88C, FCRLA and APOL6 were both up-regulated and hypomethylated in RA according to integrated analysis, RT-PCR and MSP verification. Four crucial genes (BCL11B, CCDC88C, FCRLA and APOL6) identified and analyzed in this study might be closely connected with the pathogenesis of RA. Copyright © 2017. Published by Elsevier B.V.
Analysis and visualization of gene expression data using ...

African Journals Online (AJOL)

Several clustering and biclustering methods have been introduced to analyze the gene expression data by identifying the similar patterns and grouping genes into subsets that share biological significance. However, it is not clear how the different methods compare with each other with respect to the biological relevance of ...
Transcriptome Analysis and Discovery of Genes Involved in Immune Pathways from Coelomocytes of Sea Cucumber (Apostichopus japonicus) after Vibrio splendidus Challenge

Directory of Open Access Journals (Sweden)

Qiong Gao

2015-07-01

Vibrio splendidus is identified as one of the major pathogenic factors for the skin ulceration syndrome in sea cucumber (Apostichopus japonicus), which has vastly limited the development of the sea cucumber culture industry. In order to screen the immune genes involved in the response to Vibrio splendidus challenge in sea cucumber and explore the molecular mechanism of this process, the related transcriptome and gene expression profiling of resistant and susceptible biotypes of sea cucumber with Vibrio splendidus challenge were collected for analysis. A total of 319,455,942 trimmed reads were obtained, which were assembled into 186,658 contigs. After that, 89,891 representative contigs (without isoforms) were clustered. The analysis of the gene expression profiling identified 358 differentially expressed genes (DEGs) in the bacterial-resistant group, and 102 DEGs in the bacterial-susceptible group, compared with the control group. According to the reported references and annotation information from BLAST, GO and KEGG, 30 putative bacterial-resistant genes and 19 putative bacterial-susceptible genes were identified from DEGs. The qRT-PCR results were consistent with the RNA-Seq results. Furthermore, many DEGs were involved in immune signaling related pathways, such as Endocytosis, Lysosome, MAPK, Chemokine and the ERBB signaling pathway.
Gene Locater

DEFF Research Database (Denmark)

Anwar, Muhammad Zohaib; Sehar, Anoosha; Rehman, Inayat-Ur

2012-01-01

software's for calculating recombination frequency is mostly limited to the range and flexibility of this type of analysis. GENE LOCATER is a fully customizable program for calculating recombination frequency, written in JAVA. Through an easy-to-use interface, GENE LOCATER allows users a high degree...... of flexibility in calculating genetic linkage and displaying linkage groups. Among other features, this software enables users to identify linkage groups with output visualized graphically. The program calculates interference and coefficient of coincidence with elevated accuracy in sample datasets. AVAILABILITY...
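The quantities named in this abstract (recombination frequency, coefficient of coincidence, interference) can be illustrated with a minimal Python sketch for a three-point test cross. This is not the GENE LOCATER implementation, which is written in Java and not shown in the record; the function name, the argument names, the assumed gene order A-B-C, and the offspring counts are all hypothetical.

```python
def three_point_cross(parental, single_ab, single_bc, double):
    """Summarise a three-point test cross with assumed gene order A-B-C.

    parental  -- count of non-recombinant offspring
    single_ab -- count of single crossovers between A and B
    single_bc -- count of single crossovers between B and C
    double    -- count of double crossovers
    """
    total = parental + single_ab + single_bc + double
    # Double crossovers recombine both intervals, so they count in each.
    rf_ab = (single_ab + double) / total
    rf_bc = (single_bc + double) / total
    # Expected double crossovers if the two intervals were independent.
    expected_double = rf_ab * rf_bc * total
    coincidence = double / expected_double
    return {
        "rf_ab": rf_ab,
        "rf_bc": rf_bc,
        "coincidence": coincidence,
        "interference": 1 - coincidence,
    }


# Invented counts, for illustration only.
result = three_point_cross(parental=810, single_ab=90, single_bc=95, double=5)
print(result)
```

An interference value above zero means fewer double crossovers were observed than the two interval frequencies would predict on their own.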
Functional analysis of the putative peroxidase domain of FANCA, the Fanconi anemia complementation group A protein.

Science.gov (United States)

Ren, J; Youssoufian, H

2001-01-01

Fanconi anemia (FA) is an autosomal recessive disorder manifested by chromosomal breakage, birth defects, and susceptibility to bone marrow failure and cancer. At least seven complementation groups have been identified, and the genes defective in four groups have been cloned. The most common subtype is complementation group A. Although the normal functions of the gene products defective in FA cells are not completely understood, a clue to the function of the FA group A gene product (FANCA) was provided by the detection of limited homology in the amino terminal region to a class of heme peroxidases. We evaluated this hypothesis by mutagenesis and functional complementation studies. We substituted alanine residues for the most conserved FANCA residues in the putative peroxidase domain and tested their effects on known biochemical and cellular functions of FANCA. While the substitution mutants were comparable to wild-type FANCA with regard to their stability, subcellular localization, and interaction with FANCG, only the Trp(183)-to-Ala substitution (W183A) abolished the ability of FANCA to complement the sensitivity of FA group A cells to mitomycin C. By contrast, TUNEL assays for apoptosis after exposure to H2O2 showed no differences between parental FA group A cells, cells complemented with wild-type FANCA, and cells complemented with the W183A of FANCA. Moreover, semiquantitative RT-PCR analysis for the expression of the peroxide-sensitive heme oxygenase gene showed appropriate induction after H2O2 exposure. Thus, W183A appears to be essential for the in vivo activity of FANCA in a manner independent of its interaction with FANCG. Moreover, neither wild-type FANCA nor the W183A mutation appears to alter the peroxide-induced apoptosis or peroxide-sensing ability of FA group A cells. Copyright 2001 Academic Press.
Validation of suitable reference genes for quantitative gene expression analysis in Panax ginseng

Directory of Open Access Journals (Sweden)

Meizhen Wang

2016-01-01

Reverse transcription-qPCR (RT-qPCR) has become a popular method for gene expression studies. Its results require data normalization by housekeeping genes. No single gene has been proven to be stably expressed under all experimental conditions. Therefore, systematic evaluation of reference genes is necessary. With the aim of identifying optimum reference genes for RT-qPCR analysis of gene expression in different tissues of Panax ginseng and in seedlings grown under heat stress, we investigated the expression stability of eight candidate reference genes, including elongation factor 1-beta (EF1-β), elongation factor 1-gamma (EF1-γ), eukaryotic translation initiation factor 3G (IF3G), eukaryotic translation initiation factor 3B (IF3B), actin (ACT), actin11 (ACT11), glyceraldehyde-3-phosphate dehydrogenase (GAPDH) and cyclophilin ABH-like protein (CYC), using four widely used computational programs: geNorm, Normfinder, BestKeeper, and the comparative ΔCt method. The results were then integrated using the web-based tool RefFinder. As a result, EF1-γ, IF3G and EF1-β were the three most stable genes in different tissues of P. ginseng, while IF3G, ACT11 and GAPDH were the top three-ranked genes in seedlings treated with heat. Using three better reference genes alone or in combination as internal control, we examined the expression profiles of MAR, a multiple function-associated mRNA-like non-coding RNA (mlncRNA) in P. ginseng. Taken together, we recommended EF1-γ/IF3G and IF3G/ACT11 as the suitable pairs of reference genes for RT-qPCR analysis of gene expression in different tissues of P. ginseng and the seedlings grown under heat stress, respectively. The results serve as a foundation for future studies on P. ginseng functional genomics.
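Of the four stability measures listed, the comparative ΔCt method is the simplest to sketch: for each pair of candidate genes, the standard deviation of their Ct difference across samples is computed, and a gene's stability score is the mean of those standard deviations over all its pairings (lower means more stable). The Python below is a minimal illustration under that assumption, not the authors' pipeline or RefFinder; the gene labels and Ct values are invented.

```python
from itertools import combinations
from statistics import mean, stdev


def delta_ct_stability(ct_values):
    """Rank candidate reference genes by the comparative delta-Ct approach.

    ct_values maps a gene name to its Ct values, one per sample, with all
    genes measured in the same sample order. Returns a dict mapping each
    gene to the mean standard deviation of its pairwise delta-Ct values.
    """
    pairwise_sd = {gene: [] for gene in ct_values}
    for gene_1, gene_2 in combinations(ct_values, 2):
        deltas = [a - b for a, b in zip(ct_values[gene_1], ct_values[gene_2])]
        sd = stdev(deltas)
        pairwise_sd[gene_1].append(sd)
        pairwise_sd[gene_2].append(sd)
    return {gene: mean(sds) for gene, sds in pairwise_sd.items()}


# Invented Ct values for three hypothetical candidate genes in three samples.
example = {
    "gene_A": [20, 21, 22],
    "gene_B": [25, 26, 27],
    "gene_C": [18, 22, 19],
}
scores = delta_ct_stability(example)
for gene, score in sorted(scores.items(), key=lambda item: item[1]):
    print(gene, round(score, 3))
```

Here gene_A and gene_B move in parallel across samples, so their mutual ΔCt is constant and both score as more stable than gene_C.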
Monitoring expression profiles of rice (Oryza sativa L.) genes under abiotic stresses using cDNA Microarray Analysis (abstract)

International Nuclear Information System (INIS)

Rabbani, M.A.

2005-01-01

Transcript regulation in response to cold, drought, high salinity and ABA application was investigated in rice (Oryza sativa L., Nipponbare) with microarray analysis including approx. 1700 independent DNA elements derived from three cDNA libraries constructed from 15-day old rice seedlings stressed with drought, cold and high salinity. A total of 141 non-redundant genes were identified, whose expression ratios were more than three-fold compared with the control genes for at least one of stress treatments in microarray analysis. However, after RNA gel blot analysis, a total of 73 genes were identified, among them the transcripts of 36, 62, 57 and 43 genes were found increased after cold, drought, high salinity and ABA application, respectively. Sixteen of these identified genes have been reported previously to be stress inducible in rice, while 57 of which are novel that have not been reported earlier as stress responsive in rice. We observed a strong association in the expression patterns of stress responsive genes and found 15 stress inducible genes that responded to all four treatments. Based on Venn diagram analysis, 56 genes were induced by both drought and high salinity, whereas 22 genes were upregulated by both cold and high salinity stress. Similarly 43 genes were induced by both drought stress and ABA application, while only 17 genes were identified as cold and ABA inducible genes. These results indicated the existence of greater cross talk between drought, ABA and high salinity stress signaling processes than those between cold and ABA, and cold and high salinity stress signaling pathways. The cold, drought, high salinity and ABA inducible genes were classified into four gene groups from their expression profiles. Analysis of data enabled us to identify a number of promoters and possible cis-acting DNA elements of several genes induced by a variety of abiotic stresses by combining expression data with genomic sequence data of rice. Comparative analysis of
Multiscale Embedded Gene Co-expression Network Analysis.

Science.gov (United States)

Song, Won-Min; Zhang, Bin

2015-11-01

Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|^3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.
A Regulatory Network Analysis of Orphan Genes in Arabidopsis Thaliana

Science.gov (United States)

Singh, Pramesh; Chen, Tianlong; Arendsee, Zebulun; Wurtele, Eve S.; Bassler, Kevin E.

Orphan genes, which are genes unique to each particular species, have recently drawn significant attention for their potential usefulness for organismal robustness. Their origin and regulatory interaction patterns remain largely undiscovered. Recently, methods that use the context likelihood of relatedness to infer a network followed by modularity maximizing community detection algorithms on the inferred network to find the functional structure of regulatory networks were shown to be effective. We apply improved versions of these methods to gene expression data from Arabidopsis thaliana, identify groups (clusters) of interacting genes with related patterns of expression and analyze the structure within those groups. Focusing on clusters that contain orphan genes, we compare the identified clusters to gene ontology (GO) terms, regulons, and pathway designations and analyze their hierarchical structure. We predict new regulatory interactions and unravel the structure of the regulatory interaction patterns of orphan genes. Work supported by the NSF through Grants DMR-1507371 and IOS-1546858.
Genome-Wide Detection and Analysis of Multifunctional Genes

Science.gov (United States)

Pritykin, Yuri; Ghersi, Dario; Singh, Mona

2015-01-01

Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655
Causal relationship between the AHSG gene and BMD through fetuin-A and BMI: multiple mediation analysis.

Science.gov (United States)

Sritara, C; Thakkinstian, A; Ongphiphadhanakul, B; Chailurkit, L; Chanprasertyothin, S; Ratanachaiwong, W; Vathesatogkit, P; Sritara, P

2014-05-01

Using mediation analysis, a causal relationship between the AHSG gene and bone mineral density (BMD) through fetuin-A and body mass index (BMI) mediators was suggested. Fetuin-A, a multifunctional protein of hepatic origin, is associated with bone mineral density. It is unclear if this association is causal. This study aimed at clarification of this issue. A cross-sectional study was conducted among 1,741 healthy workers from the Electricity Generating Authority of Thailand (EGAT) cohort. The alpha-2-Heremans-Schmid glycoprotein (AHSG) rs2248690 gene was genotyped. Three mediation models were constructed using seemingly unrelated regression analysis. First, the ln[fetuin-A] group was regressed on the AHSG gene. Second, the BMI group was regressed on the AHSG gene and the ln[fetuin-A] group. Finally, the BMD model was constructed by fitting BMD on two mediators (ln[fetuin-A] and BMI) and the independent AHSG variable. All three analyses were adjusted for confounders. The prevalence of the minor T allele for the AHSG locus was 15.2%. The AHSG locus was highly related to serum fetuin-A levels (P Multiple mediation analyses showed that AHSG was significantly associated with BMD through the ln[fetuin-A] and BMI pathway, with beta coefficients of 0.0060 (95% CI 0.0038, 0.0083) and 0.0030 (95% CI 0.0020, 0.0045) at the total hip and lumbar spine, respectively. About 27.3 and 26.0% of total genetic effects on hip and spine BMD, respectively, were explained by the mediation effects of fetuin-A and BMI. Our study suggested evidence of a causal relationship between the AHSG gene and BMD through fetuin-A and BMI mediators.
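The three regression models described in this abstract can be written out as a worked sketch. The symbols are assumptions introduced here for illustration and do not come from the record: G is the AHSG genotype, F is ln[fetuin-A], and the adjustment for confounders is omitted for brevity.

```latex
\begin{align}
F &= \alpha_0 + a\,G + \varepsilon_1 \\
\mathrm{BMI} &= \beta_0 + d_1\,G + d_2\,F + \varepsilon_2 \\
\mathrm{BMD} &= \gamma_0 + c'\,G + b_1\,F + b_2\,\mathrm{BMI} + \varepsilon_3
\end{align}
% Indirect (mediated) effects of G on BMD under this model:
\begin{align}
\text{via fetuin-A only:} &\quad a\,b_1 \\
\text{via BMI only:} &\quad d_1\,b_2 \\
\text{via fetuin-A then BMI:} &\quad a\,d_2\,b_2 \\
\text{total effect:} &\quad c' + a\,b_1 + d_1\,b_2 + a\,d_2\,b_2
\end{align}
```

Under this product-of-coefficients reading, a "proportion mediated" such as the 27.3% and 26.0% figures quoted above would be an indirect effect divided by the total effect. Which of the pathways the reported beta coefficients correspond to is not stated in the record.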
Host Gene Expression Analysis in Sri Lankan Melioidosis Patients

Science.gov (United States)

2017-06-19

CCL5 Chemokine (C-C motif) ligand 5 /RANTES. IFNγ Interferon gamma TNFα Tumor necrosis factor alpha HMGB1 High mobility group box 1 protein /high...aim of this study was to analyze gene expression levels of human host factors in melioidosis patients and establish useful correlation with disease...PBMC’s) of study subjects. Gene expression profiles of 25 gene targets including 19 immune response genes and 6 epigenetic factors were analyzed by

Genome-Wide Identification, Evolutionary Analysis, and Stress Responses of the GRAS Gene Family in Castor Beans

Directory of Open Access Journals (Sweden)

Wei Xu

2016-06-01

Plant-specific GRAS transcription factors play important roles in regulating growth, development, and stress responses. Castor beans (Ricinus communis) are important non-edible oilseed plants, cultivated worldwide for their seed oils and their adaptability to growth conditions. In this study, we identified and characterized a total of 48 GRAS genes based on the castor bean genome. Combined with phylogenetic analysis, the castor bean GRAS members were divided into 13 distinct groups. Functional divergence analysis revealed the presence of mostly Type-I functional divergence. The gene structures and conserved motifs, both within and outside the GRAS domain, were characterized. Gene expression analysis, performed in various tissues and under a range of abiotic stress conditions, uncovered the potential functions of GRAS members in regulating plant growth development and stress responses. The results obtained from this study provide valuable information toward understanding the potential molecular mechanisms of GRAS proteins in castor beans. These findings also serve as a resource for identifying the genes that allow castor beans to grow in stressful conditions and to enable further breeding and genetic improvements in agriculture.
Participation of Polycomb group gene extra sex combs in hedgehog signaling pathway

International Nuclear Information System (INIS)

Shindo, Norihisa; Sakai, Atsushi; Yamada, Kouji; Higashinakagawa, Toru

2004-01-01

Polycomb group (PcG) genes are required for stable inheritance of epigenetic states across cell divisions, a phenomenon termed cellular memory. PcG proteins form a multimeric nuclear complex which modifies the chromatin structure of target sites. The Drosophila PcG gene extra sex combs (esc) and its vertebrate orthologs encode a member of the ESC-E(Z) complex, which possesses histone methyltransferase activity. Here we report isolation and characterization of the medaka esc homolog, termed oleed. Hypomorphic knock-down of oleed using morpholino antisense oligonucleotides resulted in the fusion of eyes, termed cyclopia. Prechordal plate formation was not substantially impaired, but expression of hedgehog target genes was dependent on oleed, suggesting some link with hedgehog signaling. In support of this implication, histone methylation, which requires the activity of the esc gene product, is increased in hedgehog-stimulated mouse NIH-3T3 cells. Our data argue for a novel role of esc in hedgehog signaling and provide fundamental insight into epigenetic mechanisms in general.
Analysis of Gene Expression Variance in Schizophrenia Using Structural Equation Modeling

Directory of Open Access Journals (Sweden)

Anna A. Igolkina

2018-06-01

Schizophrenia (SCZ) is a psychiatric disorder of unknown etiology. There is evidence suggesting that aberrations in neurodevelopment are a significant attribute of schizophrenia pathogenesis and progression. To identify biologically relevant molecular abnormalities affecting neurodevelopment in SCZ we used cultured neural progenitor cells derived from olfactory neuroepithelium (CNON cells). Here, we tested the hypothesis that variance in gene expression differs between individuals from SCZ and control groups. In CNON cells, variance in gene expression was significantly higher in SCZ samples in comparison with control samples. Variance in gene expression was enriched in five molecular pathways: serine biosynthesis, PI3K-Akt, MAPK, neurotrophin and focal adhesion. More than 14% of variance in disease status was explained within the logistic regression model (C-value = 0.70) by predictors accounting for gene expression in 69 genes from these five pathways. Structural equation modeling (SEM) was applied to explore how the structure of these five pathways was altered between SCZ patients and controls. Four out of five pathways showed differences in the estimated relationships among genes: between KRAS and NF1, and KRAS and SOS1 in the MAPK pathway; between PSPH and SHMT2 in serine biosynthesis; between AKT3 and TSC2 in the PI3K-Akt signaling pathway; and between CRK and RAPGEF1 in the focal adhesion pathway. Our analysis provides evidence that variance in gene expression is an important characteristic of SCZ, and SEM is a promising method for uncovering altered relationships between specific genes thus suggesting affected gene regulation associated with the disease. We identified altered gene-gene interactions in pathways enriched for genes with increased variance in expression in SCZ. These pathways and loci were previously implicated in SCZ, providing further support for the hypothesis that gene expression variance plays an important role in the etiology
Impact of two common xeroderma pigmentosum group D (XPD) gene polymorphisms on risk of prostate cancer.

Directory of Open Access Journals (Sweden)

Yuanyuan Mi

BACKGROUND: DNA repair genes (e.g., xeroderma pigmentosum group D, XPD) may affect the capacity of encoded DNA repair enzymes to effectively remove DNA adducts or lesions, which may result in enhanced cancer risk. The association between XPD gene polymorphisms and the susceptibility of prostate cancer (PCa) was inconsistent in previous studies. METHODOLOGY/PRINCIPAL FINDINGS: A meta-analysis based on 9 independent case-control studies involving 3165 PCa patients and 3539 healthy controls for XPD Gln751Lys SNP (single nucleotide polymorphism) and 2555 cases and 3182 controls for Asn312Asp SNP was performed to address this association. Meanwhile, odds ratio (OR) and 95% confidence intervals (CIs) were used to evaluate this relationship. Statistical analysis was performed with STATA10.0. No significant association was found between XPD Gln751Lys SNP and PCa risk. On the other hand, in subgroup analysis based on ethnicity, associations were observed in Asian (e.g., Asn vs. Asp: OR = 1.34, 95%CI = 1.16-1.55; Asn/Asn+Asn/Asp vs. Asp/Asp: OR = 1.23, 95%CI = 1.07-1.42) and African (e.g., Asn vs. Asp: OR = 1.31, 95%CI = 1.01-1.70; Asn/Asn vs. Asp/Asp: OR = 1.71, 95%CI = 1.03-7.10) populations for Asn312Asp SNP. Moreover, similar associations were detected in studies with hospital-based controls; the frequency of the Asn/Asn genotype in men with early-stage PCa was slightly higher than in men with advanced-stage PCa (OR = 1.45, 95%CI = 1.00-2.11). CONCLUSION/SIGNIFICANCE: Our investigations demonstrate that the XPD Asn312Asp SNP, not the Gln751Lys SNP, might slightly increase PCa risk in Asians and Africans; moreover, this SNP may be associated with the tumor stage of PCa. Further studies based on larger sample size and gene-environment interactions should be conducted to determine the role of XPD gene polymorphisms in PCa risk.
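The odds ratios and 95% confidence intervals quoted above follow a standard calculation, sketched here in Python. The record says the analysis was run in STATA but does not say whether a fixed- or random-effects model was used, so the log-scale (Woolf) interval and the inverse-variance fixed-effect pooling below are assumptions chosen for illustration. The function names and the 2x2 counts are hypothetical.

```python
import math


def odds_ratio_ci(case_exp, case_unexp, ctrl_exp, ctrl_unexp, z=1.96):
    """Odds ratio with a log-scale (Woolf) confidence interval.

    Returns (odds_ratio, lower, upper, se_log_or) for one 2x2 table.
    """
    odds_ratio = (case_exp * ctrl_unexp) / (case_unexp * ctrl_exp)
    se = math.sqrt(1 / case_exp + 1 / case_unexp + 1 / ctrl_exp + 1 / ctrl_unexp)
    log_or = math.log(odds_ratio)
    return odds_ratio, math.exp(log_or - z * se), math.exp(log_or + z * se), se


def pooled_fixed_effect(studies, z=1.96):
    """Inverse-variance fixed-effect pooling of (odds_ratio, se_log_or) pairs."""
    weights = [1 / se ** 2 for _, se in studies]
    log_or = sum(w * math.log(o) for w, (o, _) in zip(weights, studies)) / sum(weights)
    se = math.sqrt(1 / sum(weights))
    return math.exp(log_or), math.exp(log_or - z * se), math.exp(log_or + z * se)


# Invented counts for two hypothetical case-control studies.
study_1 = odds_ratio_ci(40, 60, 30, 70)
study_2 = odds_ratio_ci(55, 45, 42, 58)
pooled = pooled_fixed_effect([(study_1[0], study_1[3]), (study_2[0], study_2[3])])
print(study_1[:3], study_2[:3], pooled)
```

An interval whose lower bound sits at or near 1.00, as in the early-stage versus advanced-stage comparison above, indicates a borderline association.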
Systematic analysis of gene expression patterns associated with postmortem interval in human tissues.

Science.gov (United States)

Zhu, Yizhang; Wang, Likun; Yin, Yuxin; Yang, Ence

2017-07-14

Postmortem mRNA degradation is considered to be the major concern in gene expression research utilizing human postmortem tissues. A key factor in this process is the postmortem interval (PMI), which is defined as the interval between death and sample collection. However, global patterns of postmortem mRNA degradation at individual gene levels across diverse human tissues remain largely unknown. In this study, we performed a systematic analysis of alteration of gene expression associated with PMI in human tissues. From the Genotype-Tissue Expression (GTEx) database, we evaluated gene expression levels of 2,016 high-quality postmortem samples from 316 donors of European descent, with PMI ranging from 1 to 27 hours. We found that PMI-related mRNA degradation is tissue-specific, gene-specific, and even genotype-dependent, thus drawing a more comprehensive picture of PMI-associated gene expression across diverse human tissues. Additionally, we also identified 266 differentially variable (DV) genes, such as DEFB4B and IFNG, whose expression is significantly dispersed between short PMI (S-PMI) and long PMI (L-PMI) groups. In summary, our analyses provide a comprehensive profile of PMI-associated gene expression, which will help interpret gene expression patterns in the evaluation of postmortem tissues.
Presence of fibronectin-binding protein gene prtF2 in invasive group A streptococci in tropical Australia is associated with increased internalisation efficiency.

Science.gov (United States)

Gorton, Davina; Norton, Robert; Layton, Ramon; Smith, Helen; Ketheesan, Natkunam

2005-03-01

The fibronectin-binding proteins (FnBPs) PrtF1 and PrtF2 are considered to be major group A streptococcal virulence factors, mediating adherence to and internalisation of host cells. The present study investigated an association between the presence of prtF1 and prtF2 genes and internalisation efficiency in group A streptococci (GAS) isolated from patients with invasive disease. Of the 80 isolates tested, 58 (73%) had prtF1 and 71 (89%) possessed prtF2. Three isolates (4%) had neither gene, seven (9%) had prtF1 only, 19 (24%) had prtF2 only and 51 isolates (64%) had both prtF1 and prtF2. prtF2-positive isolates internalised up to three times more efficiently than isolates that had prtF1 alone (Pinternalisation efficiency and presence of the prtF1 gene. Analysis of the fibronectin-binding repeat domain (FBRD) of prtF2 revealed that this gene can contain 2, 3, 4 or 5 repeat regions and that five repeat regions conferred very high internalisation efficiency in invasive GAS isolates.
Cloning, annotation and expression analysis of mycoparasitism-related genes in Trichoderma harzianum 88.

Science.gov (United States)

Yao, Lin; Yang, Qian; Song, Jinzhu; Tan, Chong; Guo, Changhong; Wang, Li; Qu, Lianhai; Wang, Yun

2013-04-01

Trichoderma harzianum 88, a filamentous soil fungus, is an effective biocontrol agent against several plant pathogens. High-throughput sequencing was used here to study the mycoparasitism mechanisms of T. harzianum 88. Plate confrontation tests of T. harzianum 88 against plant pathogens were conducted, and a cDNA library was constructed from T. harzianum 88 mycelia in the presence of plant pathogen cell walls. Randomly selected transcripts from the cDNA library were compared with eukaryotic plant and fungal genomes. Of the 1,386 transcripts sequenced, the most abundant Gene Ontology (GO) classification group was "physiological process". Differential expression of 19 genes was confirmed by real-time RT-PCR at different mycoparasitism stages against plant pathogens. Gene expression analysis revealed the transcription of various genes involved in mycoparasitism of T. harzianum 88. Our study provides helpful insights into the mechanisms of T. harzianum 88-plant pathogen interactions.
New Dimensions in Microbial Ecology—Functional Genes in Studies to Unravel the Biodiversity and Role of Functional Microbial Groups in the Environment

Science.gov (United States)

Imhoff, Johannes F.

2016-01-01

During the past decades, tremendous advances have been made in the possibilities to study the diversity of microbial communities in the environment. The development of methods to study these communities on the basis of 16S rRNA gene sequences analysis was a first step into the molecular analysis of environmental communities and the study of biodiversity in natural habitats. A new dimension in this field was reached with the introduction of functional genes of ecological importance and the establishment of genetic tools to study the diversity of functional microbial groups and their responses to environmental factors. Functional gene approaches are excellent tools to study the diversity of a particular function and to demonstrate changes in the composition of prokaryote communities contributing to this function. The phylogeny of many functional genes largely correlates with that of the 16S rRNA gene, and microbial species may be identified on the basis of functional gene sequences. Functional genes are perfectly suited to link culture-based microbiological work with environmental molecular genetic studies. In this review, the development of functional gene studies in environmental microbiology is highlighted with examples of genes relevant for important ecophysiological functions. Examples are presented for bacterial photosynthesis and two types of anoxygenic phototrophic bacteria, with genes of the Fenna-Matthews-Olson-protein (fmoA) as target for the green sulfur bacteria and of two reaction center proteins (pufLM) for the phototrophic purple bacteria, with genes of adenosine-5′phosphosulfate (APS) reductase (aprA), sulfate thioesterase (soxB) and dissimilatory sulfite reductase (dsrAB) for sulfur oxidizing and sulfate reducing bacteria, with genes of ammonia monooxygenase (amoA) for nitrifying/ammonia-oxidizing bacteria, with genes of particulate nitrate reductase and nitrite reductases (narH/G, nirS, nirK) for denitrifying bacteria and with genes of methane
New Dimensions in Microbial Ecology—Functional Genes in Studies to Unravel the Biodiversity and Role of Functional Microbial Groups in the Environment

Directory of Open Access Journals (Sweden)

Johannes F. Imhoff

2016-05-01

During the past decades, tremendous advances have been made in the possibilities to study the diversity of microbial communities in the environment. The development of methods to study these communities on the basis of 16S rRNA gene sequences analysis was a first step into the molecular analysis of environmental communities and the study of biodiversity in natural habitats. A new dimension in this field was reached with the introduction of functional genes of ecological importance and the establishment of genetic tools to study the diversity of functional microbial groups and their responses to environmental factors. Functional gene approaches are excellent tools to study the diversity of a particular function and to demonstrate changes in the composition of prokaryote communities contributing to this function. The phylogeny of many functional genes largely correlates with that of the 16S rRNA gene, and microbial species may be identified on the basis of functional gene sequences. Functional genes are perfectly suited to link culture-based microbiological work with environmental molecular genetic studies. In this review, the development of functional gene studies in environmental microbiology is highlighted with examples of genes relevant for important ecophysiological functions. Examples are presented for bacterial photosynthesis and two types of anoxygenic phototrophic bacteria, with genes of the Fenna-Matthews-Olson-protein (fmoA) as target for the green sulfur bacteria and of two reaction center proteins (pufLM) for the phototrophic purple bacteria, with genes of adenosine-5′phosphosulfate (APS) reductase (aprA), sulfate thioesterase (soxB) and dissimilatory sulfite reductase (dsrAB) for sulfur oxidizing and sulfate reducing bacteria, with genes of ammonia monooxygenase (amoA) for nitrifying/ammonia-oxidizing bacteria, with genes of particulate nitrate reductase and nitrite reductases (narH/G, nirS, nirK) for denitrifying bacteria and with genes
GhWRKY25, a group I WRKY gene from cotton, confers differential tolerance to abiotic and biotic stresses in transgenic Nicotiana benthamiana.

Science.gov (United States)

Liu, Xiufang; Song, Yunzhi; Xing, Fangyu; Wang, Ning; Wen, Fujiang; Zhu, Changxiang

2016-09-01

WRKY transcription factors are involved in various processes, ranging from plant growth to abiotic and biotic stress responses. Group I WRKY members have been rarely reported compared with group II or III members, particularly in cotton (Gossypium hirsutum). In this study, a group I WRKY gene, namely, GhWRKY25, was cloned from cotton and characterized. Expression analysis revealed that GhWRKY25 can be induced or repressed by abiotic stress treatments and by multiple defense-related signaling molecules. Overexpression of GhWRKY25 in Nicotiana benthamiana reduced plant tolerance to drought stress but enhanced tolerance to salt stress. Moreover, more MDA and ROS accumulated in transgenic plants after drought treatment, with lower activities of SOD, POD, and CAT. Our study further demonstrated that GhWRKY25 overexpression in plants enhanced sensitivity to the fungal pathogen Botrytis cinerea by reducing the expression of SA or ET signaling related genes and inducing the expression of genes involved in the JA signaling pathway. These results indicated that GhWRKY25 plays negative or positive roles in response to abiotic stresses, and the reduced pathogen resistance may be related to the crosstalk of the SA and JA/ET signaling pathways.
Genome-wide analysis of WRKY gene family in the sesame genome and identification of the WRKY genes involved in responses to abiotic stresses.

Science.gov (United States)

Li, Donghua; Liu, Pan; Yu, Jingyin; Wang, Linhai; Dossa, Komivi; Zhang, Yanxin; Zhou, Rong; Wei, Xin; Zhang, Xiurong

2017-09-11

Sesame (Sesamum indicum L.) is one of the world's most important oil crops. However, it is susceptible to abiotic stresses in general, and to waterlogging and drought stresses in particular. The molecular mechanisms of abiotic stress tolerance in sesame have not yet been elucidated. The WRKY domain transcription factors play significant roles in plant growth, development, and responses to stresses. However, little is known about the number, location, structure, molecular phylogenetics, and expression of the WRKY genes in sesame. We performed a comprehensive study of the WRKY gene family in sesame and identified 71 SiWRKYs. In total, 65 of these genes were mapped to 15 linkage groups within the sesame genome. A phylogenetic analysis was performed using a related species (Arabidopsis thaliana) to investigate the evolution of the sesame WRKY genes. Tissue expression profiles of the WRKY genes demonstrated that six SiWRKY genes were highly expressed in all organs, suggesting that these genes may be important for plant growth and organ development in sesame. Analysis of the SiWRKY gene expression patterns revealed that 33 and 26 SiWRKYs respond strongly to waterlogging and drought stresses, respectively. Changes in the expression of 12 SiWRKY genes were observed at different times after the waterlogging and drought treatments had begun, demonstrating that sesame gene expression patterns vary in response to abiotic stresses. In this study, we analyzed the WRKY family of transcription factors encoded by the sesame genome. Insight was gained into the classification, evolution, and function of the SiWRKY genes, revealing their putative roles in a variety of tissues. Responses to abiotic stresses in different sesame cultivars were also investigated. The results of our study provide a better understanding of the structures and functions of sesame WRKY genes and suggest that manipulating these WRKYs could enhance resistance to waterlogging and drought.
Analysis of the clonal repertoire of gene-corrected cells in gene therapy.

Science.gov (United States)

Paruzynski, Anna; Glimm, Hanno; Schmidt, Manfred; Kalle, Christof von

2012-01-01

Gene therapy-based clinical phase I/II studies using integrating retroviral vectors have successfully treated different monogenic inherited diseases. However, with increased efficiency of this therapy, severe side effects occurred in various gene therapy trials. In all cases, integration of the vector close to or within a proto-oncogene contributed substantially to the development of the malignancies. Thus, the in-depth analysis of integration site patterns is of high importance to uncover potential clonal outgrowth and to assess the safety of gene transfer vectors and gene therapy protocols. Standard and nonrestrictive linear amplification-mediated PCR (nrLAM-PCR), combined with high-throughput sequencing, make it possible to comprehensively analyze the clonal repertoire of gene-corrected cells and to assess the safety of the vector system used at an early stage on the molecular level. This approach helps clarify the biological consequences of the vector system on the fate of the transduced cell. Furthermore, downstream real-time PCR allows a quantitative estimation of the clonality of individual cells and their clonal progeny. Here, we present a guideline that should allow researchers to perform comprehensive integration site analysis in preclinical and clinical studies. Copyright © 2012 Elsevier Inc. All rights reserved.
Canonical correlation analysis for gene-based pleiotropy discovery.

Directory of Open Access Journals (Sweden)

Jose A Seoane

2014-10-01

Genome-wide association studies have identified a wealth of genetic variants involved in complex traits and multifactorial diseases. There is now considerable interest in testing variants for association with multiple phenotypes (pleiotropy) and for testing multiple variants for association with a single phenotype (gene-based association tests). Such approaches can increase statistical power by combining evidence for association over multiple phenotypes or genetic variants respectively. Canonical Correlation Analysis (CCA) measures the correlation between two sets of multidimensional variables, and thus offers the potential to combine these two approaches. To apply CCA, we must restrict the number of attributes relative to the number of samples. Hence we consider modules of genetic variation that can comprise a gene, a pathway or another biologically relevant grouping, and/or a set of phenotypes. In order to do this, we use an attribute selection strategy based on a binary genetic algorithm. Applied to a UK-based prospective cohort study of 4286 women (the British Women's Heart and Health Study), we find improved statistical power in the detection of previously reported genetic associations, and identify a number of novel pleiotropic associations between genetic variants and phenotypes. New discoveries include gene-based association of NSF with triglyceride levels and several genes (ACSM3, ERI2, IL18RAP, IL23RAP and NRG1) with left ventricular hypertrophy phenotypes. In multiple-phenotype analyses we find association of NRG1 with left ventricular hypertrophy phenotypes, fibrinogen and urea and pleiotropic relationships of F7 and F10 with Factor VII, Factor IX and cholesterol levels.
In Silico Analysis of FMR1 Gene Missense SNPs.

Science.gov (United States)

Tekcan, Akin

2016-06-01

The FMR1 gene, a member of the fragile X-related gene family, is responsible for fragile X syndrome (FXS). Missense single-nucleotide polymorphisms (SNPs) are responsible for many complex diseases. The effect of FMR1 gene missense SNPs is unknown. The aim of this study, using in silico techniques, was to analyze all known missense mutations that can affect the functionality of the FMR1 gene, leading to mental retardation (MR) and FXS. Data on the human FMR1 gene were collected from the Ensembl database (release 81), National Center for Biotechnology Information dbSNP Short Genetic Variations database, 1000 Genomes Browser, and NHLBI Exome Sequencing Project Exome Variant Server. In silico analysis was then performed. One hundred twenty different missense SNPs of the FMR1 gene were determined. Of these, 11.66% were in highly conserved domains, and 83.33% were in domains with high variety. The results of the in silico prediction analysis showed that 31.66% of the FMR1 gene SNPs were disease related and that 50% of SNPs had a pathogenic effect. The results of the structural and functional analysis revealed that although the R138Q mutation did not seem to have a damaging effect on the protein, the G266E and I304N SNPs appeared to disturb the interaction between the domains and affect the function of the protein. This is the first study to analyze all missense SNPs of the FMR1 gene. The results indicate the applicability of a bioinformatics approach to FXS and other FMR1-related diseases. I think that the analysis of FMR1 gene missense SNPs using bioinformatics methods would help in the diagnosis of FXS and other FMR1-related diseases.
Automatic identification of optimal marker genes for phenotypic and taxonomic groups of microorganisms.

Directory of Open Access Journals (Sweden)

Elad Segev

Finding optimal markers for microorganisms important in the medical, agricultural, environmental or ecological fields is of great importance. Thousands of complete microbial genomes now available allow us, for the first time, to exhaustively identify marker proteins for groups of microbial organisms. In this work, we model the biological task as the well-known mathematical "hitting set" problem, solving it based on both greedy and randomized approximation algorithms. We identify unique markers for 17 phenotypic and taxonomic microbial groups, including proteins related to the nitrite reductase enzyme as markers for the non-anammox nitrifying bacteria group, and two transcription regulation proteins, nusG and yhiF, as markers for the Archaea and Escherichia/Shigella taxonomic groups, respectively. Additionally, we identify marker proteins for three subtypes of pathogenic E. coli, which previously had no known optimal markers. Practically, depending on the completeness of the database, this algorithm can be used to identify marker genes for any microbial group. These marker genes may be prime candidates for understanding the genetic basis of the group's phenotype or for helping to discover novel functions that are uniquely shared among a group of microbes. We show that our method is both theoretically and practically efficient, while establishing an upper bound on its time complexity and approximation ratio; thus, it promises to remain efficient and permit the identification of marker proteins that are specific to phenotypic or taxonomic groups, even as more and more bacterial genomes are being sequenced.
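The greedy approach to the hitting set problem mentioned in this abstract can be illustrated with a minimal sketch. It is a generic textbook heuristic, not the authors' implementation; the function name `greedy_hitting_set`, the toy genome names and the protein labels `a`, `b`, `c` are hypothetical.

```python
from typing import Dict, List, Set


def greedy_hitting_set(genomes: Dict[str, Set[str]]) -> List[str]:
    """Pick proteins until every genome contains at least one picked protein.

    `genomes` maps a genome name to the set of candidate marker proteins
    it encodes (hypothetical input layout, chosen for illustration).
    """
    uncovered = set(genomes)
    chosen: List[str] = []
    while uncovered:
        # Count how many still-uncovered genomes carry each protein.
        counts: Dict[str, int] = {}
        for name in uncovered:
            for protein in genomes[name]:
                counts[protein] = counts.get(protein, 0) + 1
        if not counts:
            raise ValueError("a genome without candidate proteins cannot be hit")
        # Greedy step: take the protein hitting the most uncovered genomes
        # (ties broken alphabetically so the result is deterministic).
        best = max(sorted(counts), key=lambda p: counts[p])
        chosen.append(best)
        uncovered = {name for name in uncovered if best not in genomes[name]}
    return chosen


toy = {"g1": {"nusG", "a"}, "g2": {"nusG", "b"}, "g3": {"c", "b"}}
markers = greedy_hitting_set(toy)
```

Each round removes every genome hit by the chosen protein, so the loop ends after at most one round per genome. The greedy choice does not guarantee the smallest possible marker set; it only gives an approximation.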
A gene network bioinformatics analysis for pemphigoid autoimmune blistering diseases.

Science.gov (United States)

Barone, Antonio; Toti, Paolo; Giuca, Maria Rita; Derchi, Giacomo; Covani, Ugo

2015-07-01

In this theoretical study, a text mining search and clustering analysis of data related to genes potentially involved in human pemphigoid autoimmune blistering diseases (PAIBD) was performed using web tools to create a gene/protein interaction network. The Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) database was employed to identify a final set of PAIBD-involved genes and to calculate the overall significant interactions among genes: for each gene, the weighted number of links, or WNL, was registered and a clustering procedure was performed using the WNL analysis. Genes were ranked in class (leader, B, C, D and so on, up to orphans). An ontological analysis was performed for the set of 'leader' genes. Using the above-mentioned data network, 115 genes represented the final set; leader genes numbered 7 (intercellular adhesion molecule 1 (ICAM-1), interferon gamma (IFNG), interleukin (IL)-2, IL-4, IL-6, IL-8 and tumour necrosis factor (TNF)), class B genes were 13, whereas the orphans were 24. The ontological analysis attested that the molecular action was focused on extracellular space and cell surface, whereas the activation and regulation of the immunity system was widely involved. Despite the limited knowledge of the present pathologic phenomenon, attested by the presence of 24 genes revealing no protein-protein direct or indirect interactions, the network showed significant pathways gathered in several subgroups: cellular components, molecular functions, biological processes and the pathologic phenomenon obtained from the Kyoto Encyclopaedia of Genes and Genomes (KEGG) database. The molecular basis for PAIBD was summarised and expanded, which will perhaps give researchers promising directions for the identification of new therapeutic targets.
QTL mapping and transcriptome analysis of cowpea reveals candidate genes for root-knot nematode resistance.

Directory of Open Access Journals (Sweden)

Jansen Rodrigo Pereira Santos

Cowpea is one of the most important food and forage legumes in drier regions of the tropics and subtropics. However, cowpea yield worldwide is markedly below the known potential due to abiotic and biotic stresses, including parasitism by root-knot nematodes (Meloidogyne spp., RKN). Two resistance genes with dominant effect, Rk and Rk2, have been reported to provide resistance against RKN in cowpea. Despite their description and use in breeding for resistance to RKN and particularly genetic mapping of the Rk locus, the exact genes conferring resistance to RKN remain unknown. In the present work, QTL mapping using recombinant inbred line (RIL) population 524B x IT84S-2049 segregating for a newly mapped locus and analysis of the transcriptome changes in two cowpea near-isogenic lines (NIL) were used to identify candidate genes for Rk and the newly mapped locus. A major QTL, designated QRk-vu9.1, associated with resistance to Meloidogyne javanica reproduction, was detected and mapped on linkage group LG9 at position 13.37 cM using egg production data. Transcriptome analysis on resistant and susceptible NILs 3 and 9 days after inoculation revealed up-regulation of 109 and 98 genes and down-regulation of 110 and 89 genes, respectively, out of 19,922 unique genes mapped to the common bean reference genome. Among the differentially expressed genes, four and nine genes were found within the QRk-vu9.1 and QRk-vu11.1 QTL intervals, respectively. Six of these genes belong to the TIR-NBS-LRR family of resistance genes and three were upregulated at one or more time-points. Quantitative RT-PCR validated gene expression to be positively correlated with RNA-seq expression pattern for eight genes. Future functional analysis of these cowpea genes will enhance our understanding of Rk-mediated resistance and identify the specific gene responsible for the resistance.
Genome-wide analysis of Aux/IAA gene family in Solanaceae species using tomato as a model.

Science.gov (United States)

Wu, Jian; Peng, Zhen; Liu, Songyu; He, Yanjun; Cheng, Lin; Kong, Fuling; Wang, Jie; Lu, Gang

2012-04-01

Auxin plays key roles in a wide variety of plant activities, including embryo development, leaf formation, phototropism, fruit development and root initiation and development. Auxin/indoleacetic acid (Aux/IAA) genes, encoding short-lived nuclear proteins, are key regulators in the auxin transduction pathway. But how they work is still unknown. In order to conduct a systematic analysis of this gene family in Solanaceae species, a genome-wide search for the homologues of auxin response genes was carried out. Here, 26 and 27 non-redundant AUX/IAAs were identified in tomato and potato, respectively. Using tomato as a model, a comprehensive overview of the SlIAA gene family is presented, including the gene structures, phylogeny, chromosome locations, conserved motifs and cis-elements in promoter sequences. A phylogenetic tree generated from alignments of the predicted protein sequences of 31 OsIAAs, 29 AtIAAs, 31 ZmIAAs, and 26 SlIAAs revealed that these IAAs were clustered into three major groups and ten subgroups. Among them, seven subgroups were present in both monocot and dicot species, which indicated that the major functional diversification within the IAA family predated the monocot/dicot divergence. In contrast, group C and some other subgroups seemed to be species-specific. Quantitative real-time PCR (qRT-PCR) analysis showed that 19 of the 26 SlIAA genes could be detected in all tomato organs/tissues, whereas seven were specifically expressed in some tomato tissues. The transcript abundance of 17 SlIAA genes increased within a few hours when the seedlings were treated with exogenous IAA, whereas that of six other SlIAAs decreased. The results of stress treatments showed that most SlIAA family genes responded to at least one of the three stress treatments, although they exhibited diverse expression levels under different abiotic stress conditions in tomato seedlings. SlIAA20, SlIAA21 and SlIAA22 were not significantly influenced by stress
Gene coexpression network analysis of fruit transcriptomes uncovers a possible mechanistically distinct class of sugar/acid ratio-associated genes in sweet orange.

Science.gov (United States)

Qiao, Liang; Cao, Minghao; Zheng, Jian; Zhao, Yihong; Zheng, Zhi-Liang

2017-10-30

The ratio of sugars to organic acids, two of the major metabolites in fleshy fruits, has been considered the most important contributor to fruit sweetness. Although accumulation of sugars and acids have been extensively studied, whether plants evolve a mechanism to maintain, sense or respond to the fruit sugar/acid ratio remains a mystery. In a prior study, we used an integrated systems biology tool to identify a group of 39 acid-associated genes from the fruit transcriptomes in four sweet orange varieties (Citrus sinensis L. Osbeck) with varying fruit acidity, Succari (acidless), Bingtang (low acid), and Newhall and Xinhui (normal acid). We reanalyzed the prior sweet orange fruit transcriptome data, leading to the identification of 72 genes highly correlated with the fruit sugar/acid ratio. The majority of these sugar/acid ratio-related genes are predicted to be involved in regulatory functions such as transport, signaling and transcription or encode enzymes involved in metabolism. Surprisingly, only three of these sugar/acid ratio-correlated genes are weakly correlated with sugar level and none of them overlaps with the acid-associated genes. Weighted Gene Coexpression Network Analysis (WGCNA) has revealed that these genes belong to four modules, Blue, Grey, Brown and Turquoise, with the former two modules being unique to the sugar/acid ratio control. Our results indicate that orange fruits contain a possible mechanistically distinct class of genes that may potentially be involved in maintaining fruit sugar/acid ratios and/or responding to the cellular sugar/acid ratio status. Therefore, our analysis of orange transcriptomes provides an intriguing insight into the potentially novel genetic or molecular mechanisms controlling the sugar/acid ratio in fruits.
Neuropeptide Y receptor genes on human chromosome 4q31-q32 map to conserved linkage groups on mouse chromosomes 3 and 8

Energy Technology Data Exchange (ETDEWEB)

Lutz, C.M.; Frankel, W.N. [Jackson Lab., Bar Harbor, ME (United States)]; Richards, J.E. [Univ. of Michigan Medical School, Ann Arbor, MI (United States)] [and others]

1997-05-01

Npy1r and Npy2r, the genes encoding mouse type 1 and type 2 neuropeptide Y receptors, have been mapped by interspecific backcross analysis. Previous studies have localized the human genes encoding these receptors to chromosome 4q31-q32. We have now assigned Npy1r and Npy2r to conserved linkage groups on mouse Chr 8 and Chr 3, respectively, which correspond to the distal region of human chromosome 4q. Using yeast artificial chromosomes, we have estimated the distance between the human genes to be approximately 6 cM. Although ancient tandem duplication events may account for some closely spaced G-protein-coupled receptor genes, the large genetic distance between the human type 1 and type 2 neuropeptide Y receptor genes raises questions about whether this mechanism accounts for their proximity. 20 refs., 1 fig.

LEGO: a novel method for gene set over-representation analysis by incorporating network-based gene weights.

Science.gov (United States)

Dong, Xinran; Hao, Yun; Wang, Xiao; Tian, Weidong

2016-01-11

Pathway or gene set over-representation analysis (ORA) has become a routine task in functional genomics studies. However, currently widely used ORA tools employ statistical methods such as Fisher's exact test that reduce a pathway into a list of genes, ignoring the constitutive functional non-equivalent roles of genes and the complex gene-gene interactions. Here, we develop a novel method named LEGO (functional Link Enrichment of Gene Ontology or gene sets) that takes into consideration these two types of information by incorporating network-based gene weights in ORA. In three benchmarks, LEGO achieves better performance than Fisher and three other network-based methods. To further evaluate LEGO's usefulness, we compare LEGO with five gene expression-based and three pathway topology-based methods using a benchmark of 34 disease gene expression datasets compiled by a recent publication, and show that LEGO is among the top-ranked methods in terms of both sensitivity and prioritization for detecting target KEGG pathways. In addition, we develop a cluster-and-filter approach to reduce the redundancy among the enriched gene sets, making the results more interpretable to biologists. Finally, we apply LEGO to two lists of autism genes, and identify autism-relevant gene sets that could not be found by Fisher.
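For context, the conventional ORA baseline that this abstract contrasts with LEGO can be sketched as a one-sided Fisher's exact test, which is the upper tail of a hypergeometric distribution. The sketch below shows only that unweighted baseline, not LEGO's network-based weighting; the function name `ora_p_value` and its parameter names are assumptions made for illustration.

```python
from math import comb


def ora_p_value(universe: int, set_size: int, hits_total: int, overlap: int) -> float:
    """Probability of seeing at least `overlap` gene-set members by chance.

    universe   -- number of background genes
    set_size   -- number of background genes annotated to the gene set
    hits_total -- number of genes in the query list
    overlap    -- number of query genes that belong to the gene set
    """
    upper = min(set_size, hits_total)
    # Sum hypergeometric probabilities for k = overlap .. upper.
    tail = sum(
        comb(set_size, k) * comb(universe - set_size, hits_total - k)
        for k in range(overlap, upper + 1)
    )
    return tail / comb(universe, hits_total)


# Toy numbers: 20,000 background genes, a 150-gene pathway,
# a 300-gene query list, 12 of which fall in the pathway.
p = ora_p_value(20000, 150, 300, 12)
```

Every gene counts equally in this test, which is the simplification the abstract criticizes: a pathway is reduced to a flat list of members, with no weights and no interaction information.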
Comparative genomics of Mycoplasma: analysis of conserved essential genes and diversity of the pan-genome.

Directory of Open Access Journals (Sweden)

Wei Liu

Mycoplasma, the smallest self-replicating organism with a minimal metabolism and little genomic redundancy, is expected to be a close approximation to the minimal set of genes needed to sustain bacterial life. This study employs comparative evolutionary analysis of twenty Mycoplasma genomes to gain an improved understanding of essential genes. By analyzing the core genome of mycoplasmas, we identified the set of conserved essential genes required for mycoplasma survival. Further analysis showed that the core genome set has many characteristics in common with experimentally identified essential genes. Several key genes, which are related to DNA replication and repair and can be disrupted in transposon mutagenesis studies, may be critical for bacterial survival, especially over long periods of natural selection. Phylogenomic reconstructions based on 3,355 homologous groups allowed robust estimation of phylogenetic relatedness among mycoplasma strains. To obtain deeper insight into the relative roles of molecular evolution in pathogen adaptation to their hosts, we also analyzed the positive selection pressures on particular sites and lineages. There appears to be an approximate correlation between the divergence of species and the level of positive selection detected in corresponding lineages.
Cloning and Functional Analysis of the MADS-box CiMADS9 Gene from Carya illinoinensis

Directory of Open Access Journals (Sweden)

Zhang Jiyu

2015-07-01

A MADS-box gene, CiMADS9, was cloned from the male flowers of Carya illinoinensis by rapid amplification of cDNA ends. The gene was 1 077 bp with a 768 bp open reading frame encoding 255 amino acids. Multiple sequence comparisons revealed that CiMADS9 is a typical MIKC-type MADS-box gene with a MADS-box domain and a K semi-conserved region. Phylogenetic analysis indicated that CiMADS9 belongs to the AGL15 group of the MADS-box gene family. Quantitative reverse transcription polymerase chain reaction analysis indicated that the expression levels in reproductive organs (i.e., flowers and young fruits) were considerably higher than in vegetative tissues (i.e., leaves and branches). The highest expression levels were observed in male flowers. An overexpression vector for CiMADS9 was constructed and the gene was inserted into the Arabidopsis thaliana genome. CiMADS9 expression was confirmed in all transgenic lines. Compared with wild-type plants, transgenic A. thaliana plants overexpressing CiMADS9 exhibited delayed flowering and an increased number of leaves.
Significant Microsynteny with New Evolutionary Highlights Is Detected through Comparative Genomic Sequence Analysis of Maize CCCH IX Gene Subfamily

Directory of Open Access Journals (Sweden)

Wei-Jun Chen

2015-01-01

CCCH zinc finger proteins, which are characterized by the presence of three cysteine residues and one histidine residue, play important roles in RNA processing in plants. Subfamily IX CCCH proteins were recently shown to function in stress tolerances. In this study, we analyzed CCCH IX genes in Zea mays, Oryza sativa, and Sorghum bicolor. These genes, which are almost intronless, were divided into four groups based on phylogenetic analysis. Microsynteny analysis revealed microsynteny in regions of some gene pairs, indicating that segmental duplication has played an important role in the expansion of this gene family. In addition, we calculated the dates of duplication by Ks analysis, finding that all microsynteny blocks were formed after the monocot-eudicot divergence. We found that deletions, multiplications, and inversions occurred over the course of evolution. Moreover, the Ka/Ks ratios indicated that the genes in these three grass species are under strong purifying selection. Finally, we investigated the evolutionary patterns of some gene pairs conferring tolerance to abiotic stress, laying the foundation for future functional studies of these transcription factors.
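The two calculations named in this abstract, dating a duplication from Ks and reading selection from Ka/Ks, are commonly done with the relations T = Ks / (2r) and the comparison of Ka/Ks against 1. The sketch below assumes those standard relations; the abstract does not state the substitution rate used, so the default `rate` of 6.5e-9 synonymous substitutions per site per year is an assumed placeholder, and both function names are hypothetical.

```python
def duplication_age_mya(ks: float, rate: float = 6.5e-9) -> float:
    """Estimate duplication age in million years as T = Ks / (2 * rate).

    `rate` is synonymous substitutions per site per year (assumed value).
    """
    if rate <= 0:
        raise ValueError("rate must be positive")
    return ks / (2 * rate) / 1e6


def selection_mode(ka: float, ks: float) -> str:
    """Classify a gene pair by its Ka/Ks ratio."""
    if ks <= 0:
        raise ValueError("Ks must be positive to form a ratio")
    ratio = ka / ks
    if ratio < 1:
        return "purifying"  # nonsynonymous changes are selected against
    if ratio > 1:
        return "positive"   # nonsynonymous changes are favoured
    return "neutral"


age = duplication_age_mya(0.13)   # illustrative Ks value, not from the study
mode = selection_mode(0.02, 0.2)  # illustrative Ka and Ks values
```

The factor of 2 reflects that both duplicated copies accumulate substitutions independently after the duplication event. A ratio well below 1, as in the second call, corresponds to the strong purifying selection reported in the abstract.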
Molecular cloning and expression analysis of the sucrose transporter gene family from Theobroma cacao L.

Science.gov (United States)

Li, Fupeng; Wu, Baoduo; Qin, Xiaowei; Yan, Lin; Hao, Chaoyun; Tan, Lehe; Lai, Jianxiong

2014-08-10

In this study, we performed cloning and expression analysis of six putative sucrose transporter genes, designated TcSUT1, TcSUT2, TcSUT3, TcSUT4, TcSUT5 and TcSUT6, from the cacao genotype 'TAS-R8'. The combination of cDNA and genomic DNA sequences revealed that the cacao SUT genes contained exon numbers ranging from 1 to 14. The average molecular mass of all six deduced proteins was approximately 56 kDa (range 52 to 66 kDa). All six proteins were predicted to exhibit typical features of sucrose transporters with 12 trans-membrane spanning domains. Phylogenetic analysis revealed that TcSUT2 and TcSUT4 belonged to Group 2 SUT and Group 4 SUT, respectively, and the other TcSUT proteins belonged to Group 1 SUT. Real-time PCR was conducted to investigate the expression pattern of each member of the SUT family in cacao. Our experiment showed that TcSUT1 was expressed dominantly in pods and that TcSUT3 and TcSUT4 were highly expressed in both pods and bark with phloem. Within pods, TcSUT1 and TcSUT4 were expressed more in the seed coat and seed from the pod enlargement stage to the ripening stage. TcSUT5 expression sharply increased to its highest level in the seed coat during the ripening stage. Expression pattern analysis indicated that TcSUT genes may be associated with photoassimilate transport into developing seeds and may, therefore, have an impact on seed production. Copyright © 2014 Elsevier B.V. All rights reserved.
A Serial Analysis of Gene Expression (SAGE) database analysis of chemosensitivity

DEFF Research Database (Denmark)

Stein, Wilfred D; Litman, Thomas; Fojo, Tito

2004-01-01

are their corresponding solid tumors. We used the Serial Analysis of Gene Expression (SAGE) database to identify differences between solid tumors and cell lines, hoping to detect genes that could potentially explain differences in drug sensitivity. SAGE libraries were available for both solid tumors and cell lines from...
The Screening of Genes Sensitive to Long-Term, Low-Level Microwave Exposure and Bioinformatic Analysis of Potential Correlations to Learning and Memory

Institute of Scientific and Technical Information of China (English)

ZHAO Ya Li; LI Ying Xian; MA Hong Bo; LI Dong; LI Hai Liang; JIANG Rui; KAN Guang Han; YANG Zhen Zhong; HUANG Zeng Xin

2015-01-01

Objective To gain a better understanding of gene expression changes in the brain following microwave exposure in mice. This study hopes to reveal mechanisms contributing to microwave-induced learning and memory dysfunction. Methods Mice were exposed to whole body 2100 MHz microwaves with specific absorption rates (SARs) of 0.45 W/kg, 1.8 W/kg, and 3.6 W/kg for 1 hour daily for 8 weeks. Differentially expressed genes in the brains were screened using high-density oligonucleotide arrays, with genes showing more significant differences further confirmed by RT-PCR. Results The gene chip results demonstrated that 41 genes (0.45 W/kg group), 29 genes (1.8 W/kg group), and 219 genes (3.6 W/kg group) were differentially expressed. GO analysis revealed that these differentially expressed genes were primarily involved in metabolic processes, cellular metabolic processes, regulation of biological processes, macromolecular metabolic processes, biosynthetic processes, cellular protein metabolic processes, transport, developmental processes, cellular component organization, etc. KEGG pathway analysis showed that these genes are mainly involved in pathways related to ribosome, Alzheimer's disease, Parkinson's disease, long-term potentiation, Huntington's disease, and neurotrophin signaling. Construction of a protein interaction network identified several important regulatory genes including synbindin (sbdn), Crystallin (CryaB), PPP1CA, Ywhaq, Psap, Psmb1, Pcbp2, etc., which play important roles in the processes of learning and memory. Conclusion Long-term, low-level microwave exposure may inhibit learning and memory by affecting protein and energy metabolic processes and signaling pathways relating to neurological functions or diseases.
Detecting coordinated regulation of multi-protein complexes using logic analysis of gene expression

Directory of Open Access Journals (Sweden)

Yeates Todd O

2009-12-01

Full Text Available Abstract Background Many of the functional units in cells are multi-protein complexes such as RNA polymerase, the ribosome, and the proteasome. For such units to work together, one might expect a high level of regulation to enable co-appearance or repression of sets of complexes at the required time. However, this type of coordinated regulation between whole complexes is difficult to detect by existing methods for analyzing mRNA co-expression. We propose a new methodology that is able to detect such higher order relationships. Results We detect coordinated regulation of multiple protein complexes using logic analysis of gene expression data. Specifically, we identify gene triplets composed of genes whose expression profiles are found to be related by various types of logic functions. In order to focus on complexes, we associate the members of a gene triplet with the distinct protein complexes to which they belong. In this way, we identify complexes related by specific kinds of regulatory relationships. For example, we may find that the transcription of complex C is increased only if the transcription of both complex A AND complex B is repressed. We identify hundreds of examples of coordinated regulation among complexes under various stress conditions. Many of these examples involve the ribosome. Some of our examples have been previously identified in the literature, while others are novel. One notable example is the relationship between the transcription of the ribosome, RNA polymerase and mannosyltransferase II, which is involved in N-linked glycan processing in the Golgi. Conclusions The analysis proposed here focuses on relationships among triplets of genes that are not evident when genes are examined in a pairwise fashion as in typical clustering methods. By grouping gene triplets, we are able to decipher coordinated regulation among sets of three complexes. Moreover, using all triplets that involve coordinated regulation with the ribosome
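The idea of relating a gene triplet by a logic function can be sketched in a few lines. This is a simplified illustration, not the authors' method: it assumes expression profiles have already been binarized into on/off calls per condition, and it scores each candidate logic function by plain agreement with the third gene rather than by the statistical measure used in the study. All names and data below are hypothetical.

```python
# Candidate two-input logic functions relating genes A and B to gene C
LOGIC = {
    "AND": lambda a, b: a and b,
    "OR": lambda a, b: a or b,
    "NOR": lambda a, b: not (a or b),
    "XOR": lambda a, b: a != b,
}


def agreement(x, y):
    """Fraction of conditions in which two boolean profiles match."""
    return sum(p == q for p, q in zip(x, y)) / len(x)


def best_logic(a, b, c):
    """Return the logic function f maximizing agreement between f(A, B) and C."""
    scores = {
        name: agreement([f(p, q) for p, q in zip(a, b)], c)
        for name, f in LOGIC.items()
    }
    best = max(scores, key=scores.get)
    return best, scores[best]


# Hypothetical on/off profiles over six conditions: C is on only when
# both A and B are off (the "both repressed" case from the abstract)
a = [True, True, False, False, True, False]
b = [True, False, True, False, False, False]
c = [False, False, False, True, False, True]

print(best_logic(a, b, c))
# Pairwise view: C against "not A" alone matches less well than the triplet rule
print(agreement([not p for p in a], c))
```

The contrast between the two printed scores is the point of the approach: the triplet relationship explains C fully in this toy case, while either gene alone does not.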
Genome-Wide Analysis of Soybean LATERAL ORGAN BOUNDARIES Domain-Containing Genes: A Functional Investigation of GmLBD12

Directory of Open Access Journals (Sweden)

Hui Yang

2017-03-01

Full Text Available Plant-specific LATERAL ORGAN BOUNDARIES domain (LBD) genes play critical roles in various plant growth and development processes. However, the number and characteristics of LBD genes in soybean [Glycine max (L.) Merr.] remain unknown. Here, we identified 90 homologous genes in the soybean genome that phylogenetically clustered into two classes (I and II). The majority of the genes were evenly distributed across all 20 soybean chromosomes, and 77 (81.11%) of them were detected in segmental duplicated regions. Furthermore, the exon–intron organization and motif composition for each were analyzed. A close phylogenetic relationship was identified between the soybean genes and 41 previously reported genes of different plants in the same group, providing insights into their putative functions. Expression analysis indicated that more than half of the genes were expressed, with the two gene classes showing differential tissue expression characteristics; in addition, they were differentially induced by biotic and abiotic stresses. To further explore the functions of LBD genes in soybean, GmLBD12 was selected for functional characterization. GmLBD12 was mainly localized to the nucleus and showed high expression in root and seed tissues. Overexpressing GmLBD12 in Arabidopsis thaliana (L.) Heynh. resulted in increases in lateral root (LR) number and plant height. Quantitative real-time polymerase chain reaction (qRT-PCR) analysis demonstrated that GmLBD12 was induced by drought, salt, cold, indole acetic acid (IAA), abscisic acid (ABA), and salicylic acid (SA) treatments. This study provides the first comprehensive analysis of the soybean LBD gene family and a valuable foundation for future functional studies of LBD genes.
Combined chromatin and expression analysis reveals specific regulatory mechanisms within cytokine genes in the macrophage early immune response.

Directory of Open Access Journals (Sweden)

Maria Jesus Iglesias

Full Text Available Macrophages play a critical role in innate immunity, and the expression of early response genes orchestrates much of the initial response of the immune system. Macrophages undergo extensive transcriptional reprogramming in response to inflammatory stimuli such as lipopolysaccharide (LPS). To identify gene transcription regulation patterns involved in early innate immune responses, we used two genome-wide approaches: gene expression profiling and chromatin immunoprecipitation-sequencing (ChIP-seq) analysis. We examined the effect of 2 hrs LPS stimulation on early gene expression and its relation to chromatin remodeling (H3 acetylation; H3Ac) and promoter binding of Sp1 and RNA polymerase II phosphorylated at serine 5 (S5P RNAPII), which is a marker for transcriptional initiation. Our results indicate novel and alternative gene regulatory mechanisms for certain proinflammatory genes. We identified two groups of up-regulated inflammatory genes with respect to chromatin modification and promoter features. One group, including highly up-regulated genes such as tumor necrosis factor (TNF), was characterized by H3Ac, high CpG content and lack of TATA boxes. The second group, containing inflammatory mediators (interleukins and CCL chemokines), was up-regulated upon LPS stimulation despite lacking H3Ac in their annotated promoters, which were low in CpG content but did contain TATA boxes. Genome-wide analysis showed that few H3Ac peaks were unique to either +/-LPS condition. However, within these, an unpacking/expansion of already existing H3Ac peaks was observed upon LPS stimulation. In contrast, a significant proportion of S5P RNAPII peaks (approx 40%) was unique to either condition. Furthermore, data indicated a large portion of previously unannotated TSSs, particularly in LPS-stimulated macrophages, where only 28% of unique S5P RNAPII peaks overlap annotated promoters. The regulation of the inflammatory response appears to occur in a very specific manner at
Genome-wide analysis of the Solanum tuberosum (potato) trehalose-6-phosphate synthase (TPS) gene family: evolution and differential expression during development and stress.

Science.gov (United States)

Xu, Yingchun; Wang, Yanjie; Mattson, Neil; Yang, Liu; Jin, Qijiang

2017-12-01

Trehalose-6-phosphate synthase (TPS) serves important functions in plant desiccation tolerance and response to environmental stimuli. At present, a comprehensive analysis, i.e. functional classification, molecular evolution, and expression patterns, of this gene family is still lacking in Solanum tuberosum (potato). In this study, a comprehensive analysis of the TPS gene family was conducted in potato. A total of eight putative potato TPS genes (StTPSs) were identified by searching the latest potato genome sequence. The amino acid identity among the eight StTPSs varied from 59.91 to 89.54%. Analysis of dN/dS ratios suggested that regions in the TPP (trehalose-6-phosphate phosphatase) domains evolved faster than the TPS domains. Although the sequences of the eight StTPSs showed high similarity (2571-2796 bp), their gene length is highly differentiated (3189-8406 bp). Many regulatory elements possibly related to phytohormones, abiotic stress and development were identified in different TPS genes. Based on the phylogenetic tree constructed using TPS genes of potato and four other Solanaceae plants, TPS genes could be categorized into 6 distinct groups. Analysis revealed that purifying selection most likely played a major role during the evolution of this family. Amino acid changes detected in specific branches of the phylogenetic tree suggest that relaxed constraints might have contributed to functional divergence among groups. Moreover, StTPSs were found to exhibit tissue- and treatment-specific expression patterns upon analysis of transcriptome data and qRT-PCR. This study provides a reference for genome-wide identification of the potato TPS gene family and sets a framework for further functional studies of this important gene family in development and stress response.
Gene set analysis of purine and pyrimidine antimetabolites cancer therapies.

Science.gov (United States)

Fridley, Brooke L; Batzler, Anthony; Li, Liang; Li, Fang; Matimba, Alice; Jenkins, Gregory D; Ji, Yuan; Wang, Liewei; Weinshilboum, Richard M

2011-11-01

Responses to therapies, either with regard to toxicities or efficacy, are expected to involve complex relationships of gene products within the same molecular pathway or functional gene set. Therefore, pathways or gene sets, as opposed to single genes, may better reflect the true underlying biology and may be more appropriate units for analysis of pharmacogenomic studies. Application of such methods to pharmacogenomic studies may enable the detection of more subtle effects of multiple genes in the same pathway that may be missed by assessing each gene individually. A gene set analysis of 3821 gene sets is presented assessing the association between basal messenger RNA expression and drug cytotoxicity using ethnically defined human lymphoblastoid cell lines for two classes of drugs: pyrimidines [gemcitabine (dFdC) and arabinoside] and purines [6-thioguanine and 6-mercaptopurine]. The gene set nucleoside-diphosphatase activity was found to be significantly associated with both dFdC and arabinoside, whereas gene set γ-aminobutyric acid catabolic process was associated with dFdC and 6-thioguanine. These gene sets were significantly associated with the phenotype even after adjusting for multiple testing. In addition, five associated gene sets were found in common between the pyrimidines and two gene sets for the purines (3',5'-cyclic-AMP phosphodiesterase activity and γ-aminobutyric acid catabolic process) with a P value of less than 0.0001. Functional validation was attempted with four genes each in gene sets for thiopurine and pyrimidine antimetabolites. All four genes selected from the pyrimidine gene sets (PSME3, CANT1, ENTPD6, ADRM1) were validated, but only one (PDE4D) was validated for the thiopurine gene sets. In summary, results from the gene set analysis of pyrimidine and purine therapies, used often in the treatment of various cancers, provide novel insight into the relationship between genomic variation and drug response.
Genomic identification of WRKY transcription factors in carrot (Daucus carota) and analysis of evolution and homologous groups for plants.

Science.gov (United States)

Li, Meng-Yao; Xu, Zhi-Sheng; Tian, Chang; Huang, Ying; Wang, Feng; Xiong, Ai-Sheng

2016-03-15

WRKY transcription factors belong to one of the largest transcription factor families. These factors possess functions in plant growth and development, signal transduction, and stress response. Here, we identified 95 DcWRKY genes in carrot based on the carrot genomic and transcriptomic data, and divided them into three groups. Phylogenetic analysis of WRKY proteins from carrot and Arabidopsis divided these proteins into seven subgroups. To elucidate the evolution and distribution of WRKY transcription factors in different species, we constructed a schematic of the phylogenetic tree and compared the WRKY family factors among 22 species, including plants, slime mold and protozoan. An in-depth study was performed to clarify the homologous factor groups of nine divergent taxa in lower and higher plants. Based on the orthologous factors between carrot and Arabidopsis, 38 DcWRKY proteins were calculated to interact with other proteins in the carrot genome. Yeast two-hybrid assay showed that DcWRKY20 can interact with DcMAPK1 and DcMAPK4. The expression patterns of the selected DcWRKY genes based on transcriptome data and qRT-PCR suggested that those selected DcWRKY genes are involved in root development and in biotic and abiotic stress response. This comprehensive analysis provides a basis for investigating the evolution and function of WRKY genes.
Gene Environment Interactions and Predictors of Colorectal Cancer in Family-Based, Multi-Ethnic Groups

Directory of Open Access Journals (Sweden)

S. Pamela K. Shiao

2018-02-01

Full Text Available For the personalization of polygenic/omics-based health care, the purpose of this study was to examine the gene–environment interactions and predictors of colorectal cancer (CRC) by including five key genes in the one-carbon metabolism pathways. In this proof-of-concept study, we included a total of 54 families and 108 participants, 54 CRC cases and 54 matched family friends representing four major racial ethnic groups in southern California (White, Asian, Hispanics, and Black). We used three phases of data analytics, including exploratory, family-based analyses adjusting for the dependence within the family for sharing genetic heritage, the ensemble method, and generalized regression models for predictive modeling with a machine learning validation procedure to validate the results for enhanced prediction and reproducibility. The results revealed that despite the family members sharing genetic heritage, the CRC group had greater combined gene polymorphism rates than the family controls (p < 0.05) on MTHFR C677T, MTR A2756G, MTRR A66G, and DHFR 19 bp, except MTHFR A1298C. Four racial groups presented different polymorphism rates for four genes (all p < 0.05, except MTHFR A1298C). Following the ensemble method, the most influential factors were identified, and the best predictive models were generated by using the generalized regression models, with Akaike’s information criterion and leave-one-out cross validation methods. Body mass index (BMI) and gender were consistent predictors of CRC for both models when individual genes versus total polymorphism counts were used, and alcohol use was interactive with BMI status. Body mass index status was also interactive with both gender and MTHFR C677T gene polymorphism, and the exposure to environmental pollutants was an additional predictor. These results point to the important roles of environmental and modifiable factors in relation to gene–environment interactions in the prevention of CRC.
Gene expression profiling of resting and activated vascular smooth muscle cells by serial analysis of gene expression and clustering analysis

NARCIS (Netherlands)

Beauchamp, Nicholas J.; van Achterberg, Tanja A. E.; Engelse, Marten A.; Pannekoek, Hans; de Vries, Carlie J. M.

2003-01-01

Migration and proliferation of vascular smooth muscle cells (SMCs) are key events in atherosclerosis. However, little is known about alterations in gene expression upon transition of the quiescent, contractile SMC to the proliferative SMC. We performed serial analysis of gene expression (SAGE) of
Lynx web services for annotations and systems analysis of multi-gene disorders.

Science.gov (United States)

Sulakhe, Dinanath; Taylor, Andrew; Balasubramanian, Sandhya; Feng, Bo; Xie, Bingqing; Börnigen, Daniela; Dave, Utpal J; Foster, Ian T; Gilliam, T Conrad; Maltsev, Natalia

2014-07-01

Lynx is a web-based integrated systems biology platform that supports annotation and analysis of experimental data and generation of weighted hypotheses on molecular mechanisms contributing to human phenotypes and disorders of interest. Lynx has integrated multiple classes of biomedical data (genomic, proteomic, pathways, phenotypic, toxicogenomic, contextual and others) from various public databases as well as manually curated data from our group and collaborators (LynxKB). Lynx provides tools for gene list enrichment analysis using multiple functional annotations and network-based gene prioritization. Lynx provides access to the integrated database and the analytical tools via REST based Web Services (http://lynx.ci.uchicago.edu/webservices.html). This comprises data retrieval services for specific functional annotations, services to search across the complete LynxKB (powered by Lucene), and services to access the analytical tools built within the Lynx platform. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Expressed sequence enrichment for candidate gene analysis of citrus tristeza virus resistance.

Science.gov (United States)

Bernet, G P; Bretó, M P; Asins, M J

2004-02-01

Several studies have reported markers linked to a putative resistance gene from Poncirus trifoliata (Ctv-R) located at linkage group 4 that confers resistance against one of the most important citrus pathogens, citrus tristeza virus (CTV). To be successful in both marker-assisted selection and transformation experiments, its accurate mapping is needed. Several factors may affect its localization, among them two are considered here: the definition of resistance and the genetic background of progeny. Two progenies derived from P. trifoliata, by self-pollination and by crossing with sour orange (Citrus aurantium), a citrus rootstock well-adapted to arid and semi-arid areas, were used for linkage group-4 marker enrichment. Two new methodologies were used to enrich this region with expressed sequences. The enrichment of group 4 resulted in the fusion of several C. aurantium linkage groups. The new one A(7+3+4) is now saturated with 48 markers including expressed sequences. Surprisingly, sour orange was as resistant to the CTV isolate tested as was P. trifoliata, and three hybrids that carry Ctv-R, as deduced from its flanking markers, are susceptible to CTV. The new linkage maps were used to map Ctv-R under the hypothesis of monogenic inheritance. Its position on linkage group 4 of P. trifoliata differs from the location previously reported in other progenies. The genetic analysis of virus-plant interaction in the family derived from C. aurantium after a CTV chronic infection showed the segregation of five types of interaction, which is not compatible with the hypothesis of a single gene controlling resistance. Two major issues are discussed: another type of genetic analysis of CTV resistance is needed to avoid the assumption of monogenic inheritance, and transferring Ctv-R from P. trifoliata to sour orange might not avoid the CTV decline of sweet orange trees.
Reference genes for gene expression analysis by real-time reverse transcription polymerase chain reaction of renal cell carcinoma.

Science.gov (United States)

Bjerregaard, Henriette; Pedersen, Shona; Kristensen, Søren Risom; Marcussen, Niels

2011-12-01

Differentiation between malignant renal cell carcinoma and benign oncocytoma is of great importance to choose the optimal treatment. Accurate preoperative diagnosis of renal tumor is therefore crucial; however, existing imaging techniques and histologic examinations are incapable of providing an optimal differentiation profile. Analysis of gene expression of molecular markers is a new possibility but relies on appropriate standardization to compare different samples. The aim of this study was to identify stably expressed reference genes suitable for the normalization of results extracted from gene expression analysis of renal tumors. Expression levels of 8 potential reference genes (ATP5J, HMBS, HPRT1, PPIA, TBP, 18S, GAPDH, and POLR2A) were examined by real-time reverse transcription polymerase chain reaction in tumor and normal tissue from kidneys removed from 13 patients with renal cell carcinoma and 5 patients with oncocytoma. The expression levels of genes were compared by gene stability value M, average gene stability M, pairwise variation V, and coefficient of variation CV. Several candidates were not suitable for the purpose, but a combination of HMBS, PPIA, ATP5J, and TBP was found to be the best combination with an average gene stability value M of 0.9 and a CV of 0.4 in the 18 tumors and normal tissues. A combination of 4 genes, HMBS, PPIA, ATP5J, and TBP, is a possible reference in renal tumor gene expression analysis by reverse transcription polymerase chain reaction. These four genes, being stably expressed in tissues from RCC, are possible reference genes for gene expression analysis.
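The gene stability value M referred to above is, in the widely used geNorm formulation, the mean over all other candidate genes of the standard deviation of the pairwise log2 expression ratios across samples; a lower M indicates a more stable gene. The sketch below assumes that definition. The abstract does not state which software was used, and the gene names and expression values are hypothetical.

```python
import math
from statistics import stdev


def stability_m(expr):
    """Compute a geNorm-style stability value M for each candidate gene.

    expr maps gene name -> relative expression quantities, one per sample,
    with samples in the same order for every gene.
    """
    genes = list(expr)
    m = {}
    for g in genes:
        sds = []
        for h in genes:
            if h == g:
                continue
            # Variation of the log2 ratio between gene g and gene h across samples
            log_ratios = [math.log2(x / y) for x, y in zip(expr[g], expr[h])]
            sds.append(stdev(log_ratios))
        m[g] = sum(sds) / len(sds)
    return m


# Hypothetical data: geneA and geneB track each other, geneC does not
expr = {
    "geneA": [1.0, 2.0, 4.0],
    "geneB": [2.0, 4.0, 8.0],
    "geneC": [1.0, 1.0, 8.0],
}
m_values = stability_m(expr)
print(m_values)
```

In practice the least stable gene is usually dropped and M recomputed until a small stable set remains, which is one way a four-gene combination such as the one reported could be reached.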
Harmonic Analysis and Group Representation

CERN Document Server

Figa-Talamanca, Alessandro

2011-01-01

This title includes: Lectures - A. Auslander, R. Tolimeri - Nilpotent groups and abelian varieties, M Cowling - Unitary and uniformly bounded representations of some simple Lie groups, M. Duflo - Construction de representations unitaires d'un groupe de Lie, R. Howe - On a notion of rank for unitary representations of the classical groups, V.S. Varadarajan - Eigenfunction expansions of semisimple Lie groups, and R. Zimmer - Ergodic theory, group representations and rigidity; and, Seminars - A. Koranyi - Some applications of Gelfand pairs in classical analysis.
Genome-wide Identification and Expression Analysis of the CDPK Gene Family in Grape, Vitis spp.

Science.gov (United States)

Zhang, Kai; Han, Yong-Tao; Zhao, Feng-Li; Hu, Yang; Gao, Yu-Rong; Ma, Yan-Fei; Zheng, Yi; Wang, Yue-Jin; Wen, Ying-Qiang

2015-06-30

Calcium-dependent protein kinases (CDPKs) play vital roles in plant growth and development, biotic and abiotic stress responses, and hormone signaling. Little is known about the CDPK gene family in grapevine. In this study, we performed a genome-wide analysis of the 12X grape genome (Vitis vinifera) and identified nineteen CDPK genes. Comparison of the structures of grape CDPK genes allowed us to examine their functional conservation and differentiation. Segmentally duplicated grape CDPK genes showed high structural conservation and contributed to gene family expansion. Additional comparisons between grape and Arabidopsis thaliana demonstrated that several grape CDPK genes occurred in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of grapevine and Arabidopsis. Phylogenetic analysis divided the grape CDPK genes into four groups. Furthermore, we examined the expression of the corresponding nineteen homologous CDPK genes in the Chinese wild grape (Vitis pseudoreticulata) under various conditions, including biotic stress, abiotic stress, and hormone treatments. The expression profiles derived from reverse transcription and quantitative PCR suggested that a large number of VpCDPKs responded to various stimuli on the transcriptional level, indicating their versatile roles in the responses to biotic and abiotic stresses. Moreover, we examined the subcellular localization of VpCDPKs by transiently expressing six VpCDPK-GFP fusion proteins in Arabidopsis mesophyll protoplasts; this revealed high variability consistent with potential functional differences. Taken as a whole, our data provide significant insights into the evolution and function of grape CDPKs and a framework for future investigation of grape CDPK genes.

Genome-Wide Identification and Expression Analysis of WRKY Gene Family in Capsicum annuum L.

Science.gov (United States)

Diao, Wei-Ping; Snyder, John C; Wang, Shu-Bin; Liu, Jin-Bing; Pan, Bao-Gui; Guo, Guang-Jun; Wei, Ge

2016-01-01

The WRKY family of transcription factors is one of the most important families of plant transcriptional regulators, with members regulating multiple biological processes, especially defense against biotic and abiotic stresses. However, little information is available about WRKYs in pepper (Capsicum annuum L.). The recent release of completely assembled genome sequences of pepper allowed us to perform a genome-wide investigation for pepper WRKY proteins. In the present study, a total of 71 WRKY genes were identified in the pepper genome. According to structural features of their encoded proteins, the pepper WRKY genes (CaWRKY) were classified into three main groups, with the second group further divided into five subgroups. Genome mapping analysis revealed that CaWRKY were enriched on four chromosomes, especially on chromosome 1, and 15.5% of the family members were tandemly duplicated genes. A phylogenetic tree was constructed based on WRKY domain sequences derived from pepper and Arabidopsis. The expression of 21 selected CaWRKY genes in response to seven different biotic and abiotic stresses (salt, heat shock, drought, Phytophthora capsici, SA, MeJA, and ABA) was evaluated by quantitative RT-PCR. Some CaWRKYs were highly expressed and up-regulated by stress treatment. Our results will provide a platform for functional identification and molecular breeding studies of WRKY genes in pepper.
Localization of the xeroderma pigmentosum group B-correcting gene ERCC-3 to human chromosome 2q21.

NARCIS (Netherlands)

G. Weeda (Geert); J. Wiegant; M. van der Ploeg; A.H.M. Geurts van Kessel (Ad); A.J. van der Eb; J.H.J. Hoeijmakers (Jan)

1991-01-01

The human excision-repair gene ERCC3 was cloned after DNA-mediated gene transfer to the uv-sensitive Chinese hamster ovary mutant cell line 27-1, a member of complementation group 3 of the excision-defective rodent cell lines. The ERCC3 gene specifically corrects the DNA repair defect of
Phytoplasma phylogenetics based on analysis of secA and 23S rRNA gene sequences for improved resolution of candidate species of 'Candidatus Phytoplasma'.

Science.gov (United States)

Hodgetts, Jennifer; Boonham, Neil; Mumford, Rick; Harrison, Nigel; Dickinson, Matthew

2008-08-01

Phytoplasma phylogenetics has focused primarily on sequences of the non-coding 16S rRNA gene and the 16S-23S rRNA intergenic spacer region (16-23S ISR), and primers that enable amplification of these regions from all phytoplasmas by PCR are well established. In this study, primers based on the secA gene have been developed into a semi-nested PCR assay that results in a sequence of the expected size (about 480 bp) from all 34 phytoplasmas examined, including strains representative of 12 16Sr groups. Phylogenetic analysis of secA gene sequences showed similar clustering of phytoplasmas when compared with clusters resolved by similar sequence analyses of a 16-23S ISR-23S rRNA gene contig or of the 16S rRNA gene alone. The main differences between trees were in the branch lengths, which were elongated in the 16-23S ISR-23S rRNA gene tree when compared with the 16S rRNA gene tree and elongated still further in the secA gene tree, despite this being a shorter sequence. The improved resolution in the secA gene-derived phylogenetic tree resulted in the 16SrII group splitting into two distinct clusters, while phytoplasmas associated with coconut lethal yellowing-type diseases split into three distinct groups, thereby supporting past proposals that they represent different candidate species within 'Candidatus Phytoplasma'. The ability to differentiate 16Sr groups and subgroups by virtual RFLP analysis of secA gene sequences suggests that this gene may provide an informative alternative molecular marker for pathogen identification and diagnosis of phytoplasma diseases.
Comparative genomic analysis of Drosophila melanogaster and vector mosquito developmental genes.

Directory of Open Access Journals (Sweden)

Susanta K Behura

Full Text Available Genome sequencing projects have presented the opportunity for analysis of developmental genes in three vector mosquito species: Aedes aegypti, Culex quinquefasciatus, and Anopheles gambiae. A comparative genomic analysis of developmental genes in Drosophila melanogaster and these three important vectors of human disease was performed in this investigation. While the study was comprehensive, special emphasis centered on genes that (1) are components of developmental signaling pathways, (2) regulate fundamental developmental processes, (3) are critical for the development of tissues of vector importance, (4) function in developmental processes known to have diverged within insects, and (5) encode microRNAs (miRNAs) that regulate developmental transcripts in Drosophila. While most fruit fly developmental genes are conserved in the three vector mosquito species, several genes known to be critical for Drosophila development were not identified in one or more mosquito genomes. In other cases, mosquito lineage-specific gene gains with respect to D. melanogaster were noted. Sequence analyses also revealed that numerous repetitive sequences are a common structural feature of Drosophila and mosquito developmental genes. Finally, analysis of predicted miRNA binding sites in fruit fly and mosquito developmental genes suggests that the repertoire of developmental genes targeted by miRNAs is species-specific. The results of this study provide insight into the evolution of developmental genes and processes in dipterans and other arthropods, serve as a resource for those pursuing analysis of mosquito development, and will promote the design and refinement of functional analysis experiments.
Maternal intake of methyl-group donors affects DNA methylation of metabolic genes in infants.

Science.gov (United States)

Pauwels, Sara; Ghosh, Manosij; Duca, Radu Corneliu; Bekaert, Bram; Freson, Kathleen; Huybrechts, Inge; Langie, Sabine A S; Koppen, Gudrun; Devlieger, Roland; Godderis, Lode

2017-01-01

Maternal nutrition during pregnancy and infant nutrition in the early postnatal period (lactation) are critically involved in the development and health of the newborn infant. The Maternal Nutrition and Offspring's Epigenome (MANOE) study was set up to assess the effect of maternal methyl-group donor intake (choline, betaine, folate, methionine) on infant DNA methylation. Maternal intake of dietary methyl-group donors was assessed using a food-frequency questionnaire (FFQ). Before and during pregnancy, we evaluated maternal methyl-group donor intake through diet and supplementation (folic acid) in relation to gene-specific (IGF2 DMR, DNMT1, LEP, RXRA) buccal epithelial cell DNA methylation in 6-month-old infants (n = 114) via pyrosequencing. In the early postnatal period, we determined the effect of maternal choline intake during lactation (in mothers who breast-fed for at least 3 months) on gene-specific buccal DNA methylation (n = 65). Maternal dietary and supplemental intake of methyl-group donors (folate, betaine, folic acid), only in the periconception period, was associated with buccal cell DNA methylation in genes related to growth (IGF2 DMR), metabolism (RXRA), and appetite control (LEP). A negative association was found between maternal folate and folic acid intake before pregnancy and infant LEP (slope = -1.233, 95% CI -2.342; -0.125, p = 0.0298) and IGF2 DMR methylation (slope = -0.706, 95% CI -1.242; -0.107, p = 0.0101), respectively. Positive associations were observed for maternal betaine (slope = 0.875, 95% CI 0.118; 1.633, p = 0.0241) and folate (slope = 0.685, 95% CI 0.245; 1.125, p = 0.0027) intake before pregnancy and RXRA methylation. Buccal DNMT1 methylation in the infant was negatively associated with maternal methyl-group donor intake in the first and second trimester of pregnancy and positively in the third trimester. We found no clear association between maternal choline intake
Evaluating the performance of clinical criteria for predicting mismatch repair gene mutations in Lynch syndrome: a comprehensive analysis of 3,671 families.

Science.gov (United States)

Steinke, Verena; Holzapfel, Stefanie; Loeffler, Markus; Holinski-Feder, Elke; Morak, Monika; Schackert, Hans K; Görgens, Heike; Pox, Christian; Royer-Pokora, Brigitte; von Knebel-Doeberitz, Magnus; Büttner, Reinhard; Propping, Peter; Engel, Christoph

2014-07-01

Carriers of mismatch repair (MMR) gene mutations have a high lifetime risk for colorectal and endometrial cancers, as well as other malignancies. As mutation analysis to detect these patients is expensive and time-consuming, clinical criteria and tumor-tissue analysis are widely used as pre-screening methods. The aim of our study was to evaluate the performance of commonly applied clinical criteria (the Amsterdam I and II Criteria, and the original and revised Bethesda Guidelines) and the results of tumor-tissue analysis in predicting MMR gene mutations. We analyzed 3,671 families from the German HNPCC Registry and divided them into nine mutually exclusive groups with different clinical criteria. A total of 680 families (18.5%) were found to have a pathogenic MMR gene mutation. Among all 1,284 families with microsatellite instability-high (MSI-H) colorectal cancer, the overall mutation detection rate was 53.0%. Mutation frequencies and their distribution between the four MMR genes differed significantly between clinical groups (p small-bowel cancer (p small-bowel cancer were clinically relevant predictors for Lynch syndrome. © 2013 UICC.
Characterization of the global profile of genes expressed in cervical epithelium by Serial Analysis of Gene Expression (SAGE)

Directory of Open Access Journals (Sweden)

Piña-Sanchez Patricia

2005-09-01

Full Text Available Abstract Background Serial Analysis of Gene Expression (SAGE) is a new technique that provides detailed quantitative and qualitative knowledge of a gene expression profile without prior knowledge of the sequences of the analyzed genes. We carried out a modification of the SAGE methodology (microSAGE), useful for the analysis of limited quantities of tissue samples, on normal human cervical tissue obtained from a donor without histopathological lesions. Cervical epithelium consists mainly of cervical keratinocytes, which are the targets of human papilloma virus (HPV); persistent HPV infection of cervical epithelium is associated with an increased risk of developing cervical carcinomas (CC). Results We report here a transcriptome analysis of cervical tissue by SAGE, derived from 30,418 sequenced tags that provide a wealth of information about the gene products involved in normal cervical epithelium physiology, as well as genes not previously found in uterine cervix tissue that are involved in the process of epidermal differentiation. Conclusion This first comprehensive analysis of the uterine cervix transcriptome should be useful for the identification of genes involved in normal uterine cervix function and of candidate genes associated with cervical carcinoma.
Gene expression analysis to identify molecular correlates of pre- and post-conditioning derived neuroprotection.

Science.gov (United States)

Prasad, Shiv S; Russell, Marsha; Nowakowska, Margeryta; Williams, Andrew; Yauk, Carole

2012-06-01

Mild ischaemic exposures before or after severe injurious ischaemia that elicit neuroprotective responses are referred to as preconditioning and post-conditioning. The corresponding molecular mechanisms of neuroprotection are not completely understood. Identification of the genes and associated pathways of corresponding neuroprotection would provide insight into neuronal survival, potential therapeutic approaches and assessments of therapies for stroke. The objectives of this study were to use global gene expression approach to infer the molecular mechanisms in pre- and post-conditioning-derived neuroprotection in cortical neurons following oxygen and glucose deprivation (OGD) in vitro and then to apply these findings to predict corresponding functional pathways. To this end, microarray analysis was applied to rat cortical neurons with or without the pre- and post-conditioning treatments at 3-h post-reperfusion, and differentially expressed transcripts were subjected to statistical, hierarchical clustering and pathway analyses. The expression patterns of 3,431 genes altered under all conditions of ischaemia (with and without pre- or post-conditioning). We identified 1,595 genes that were commonly regulated within both the pre- and post-conditioning treatments. Cluster analysis revealed that transcription profiles clustered tightly within controls, non-conditioned OGD and neuroprotected groups. Two clusters defining neuroprotective conditions associated with up- and downregulated genes were evident. The five most upregulated genes within the neuroprotective clusters were Tagln, Nes, Ptrf, Vim and Adamts9, and the five most downregulated genes were Slc7a3, Bex1, Brunol4, Nrxn3 and Cpne4. Pathway analysis revealed that the intracellular and second messenger signalling pathways in addition to cell death were predominantly associated with downregulated pre- and post-conditioning associated genes, suggesting that modulation of cell death and signal transduction pathways
Biotechnological applications of mobile group II introns and their reverse transcriptases: gene targeting, RNA-seq, and non-coding RNA analysis.

Science.gov (United States)

Enyeart, Peter J; Mohr, Georg; Ellington, Andrew D; Lambowitz, Alan M

2014-01-13

Mobile group II introns are bacterial retrotransposons that combine the activities of an autocatalytic intron RNA (a ribozyme) and an intron-encoded reverse transcriptase to insert site-specifically into DNA. They recognize DNA target sites largely by base pairing of sequences within the intron RNA and achieve high DNA target specificity by using the ribozyme active site to couple correct base pairing to RNA-catalyzed intron integration. Algorithms have been developed to program the DNA target site specificity of several mobile group II introns, allowing them to be made into 'targetrons.' Targetrons function for gene targeting in a wide variety of bacteria and typically integrate at efficiencies high enough to be screened easily by colony PCR, without the need for selectable markers. Targetrons have found wide application in microbiological research, enabling gene targeting and genetic engineering of bacteria that had been intractable to other methods. Recently, a thermostable targetron has been developed for use in bacterial thermophiles, and new methods have been developed for using targetrons to position recombinase recognition sites, enabling large-scale genome-editing operations, such as deletions, inversions, insertions, and 'cut-and-pastes' (that is, translocation of large DNA segments), in a wide range of bacteria at high efficiency. Using targetrons in eukaryotes presents challenges due to the difficulties of nuclear localization and sub-optimal magnesium concentrations, although supplementation with magnesium can increase integration efficiency, and directed evolution is being employed to overcome these barriers. Finally, spurred by new methods for expressing group II intron reverse transcriptases that yield large amounts of highly active protein, thermostable group II intron reverse transcriptases from bacterial thermophiles are being used as research tools for a variety of applications, including qRT-PCR and next-generation RNA sequencing (RNA-seq). The
Analysis of phenotype, genotype and serotype distribution in erythromycin-resistant group B streptococci isolated from vaginal flora in Southern Ireland.

LENUS (Irish Health Repository)

Kiely, R A

2010-02-01

The screening of 2000 women of childbearing age in Cork between 2004 and 2006 produced 37 erythromycin-resistant group B streptococcus (GBS) isolates. PCR analysis was performed to determine the basis for erythromycin resistance. The ermTR gene was most frequently expressed (n = 19), followed by the ermB gene (n = 8). Four isolates harboured the mefA gene. Six isolates yielded no PCR products. Some phenotype-genotype correlation was observed. All isolates expressing the mefA gene displayed the M phenotype whilst all those expressing ermB displayed the constitutive macrolide resistance (cMLS(B)) phenotype. Of 19 isolates that expressed the ermTR gene, 16 displayed the inducible macrolide resistance (iMLS(B)) phenotype. Serotype analysis revealed that serotypes III and V predominated in these isolates. The identification of two erythromycin-resistant serotype VIII isolates among this collection represents the first reported finding of erythromycin resistance in this serotype. A single isolate was non-typable using two latex agglutination serotyping kits.
Gene Dosage Analysis in a Clinical Environment: Gene-Targeted Microarrays as the Platform-of-Choice

Directory of Open Access Journals (Sweden)

Donald R. Love

2013-03-01

Full Text Available The role of gene deletion and duplication in the aetiology of disease has become increasingly evident over the last decade. In addition to the classical deletion/duplication disorders diagnosed using molecular techniques, such as Duchenne Muscular Dystrophy and Charcot-Marie-Tooth Neuropathy Type 1A, the significance of partial or whole gene deletions in the pathogenesis of a large number of single-gene disorders is becoming more apparent. A variety of dosage analysis methods are available to the diagnostic laboratory, but the widespread application of many of these techniques is limited by the expense of the kits/reagents and restrictive targeting to a particular gene or portion of a gene. These limitations are particularly important in the context of a small diagnostic laboratory with modest sample throughput. We have developed a gene-targeted, custom-designed comparative genomic hybridisation (CGH) array that allows twelve clinical samples to be interrogated simultaneously for exonic deletions/duplications within any gene (or panel of genes) on the array. We report here on the use of the array in the analysis of a series of clinical samples processed by our laboratory over a twelve-month period. The array has proven itself to be robust, flexible and highly suited to the diagnostic environment.
Phylogenetic Analysis of Seven WRKY Genes across the Palm Subtribe Attaleinae (Arecaceae) Identifies Syagrus as Sister Group of the Coconut

Science.gov (United States)

Meerow, Alan W.; Noblick, Larry; Borrone, James W.; Couvreur, Thomas L. P.; Mauro-Herrera, Margarita; Hahn, William J.; Kuhn, David N.; Nakamura, Kyoko; Oleas, Nora H.; Schnell, Raymond J.

2009-01-01

Background The Cocoseae is one of 13 tribes of Arecaceae subfam. Arecoideae, and contains a number of palms with significant economic importance, including the monotypic and pantropical Cocos nucifera L., the coconut, the origins of which have been one of the “abominable mysteries” of palm systematics for decades. Previous studies with predominantly plastid genes weakly supported American ancestry for the coconut but ambiguous sister relationships. In this paper, we use multiple single copy nuclear loci to address the phylogeny of the Cocoseae subtribe Attaleinae, and resolve the closest extant relative of the coconut. Methodology/Principal Findings We present the results of combined analysis of DNA sequences of seven WRKY transcription factor loci across 72 samples of Arecaceae tribe Cocoseae subtribe Attaleinae, representing all genera classified within the subtribe, and three outgroup taxa with maximum parsimony, maximum likelihood, and Bayesian approaches, producing highly congruent and well-resolved trees that robustly identify the genus Syagrus as sister to Cocos and resolve novel and well-supported relationships among the other genera of the Attaleinae. We also address incongruence among the gene trees with gene tree reconciliation analysis, and assign estimated ages to the nodes of our tree. Conclusions/Significance This study represents the as yet most extensive phylogenetic analyses of Cocoseae subtribe Attaleinae. We present a well-resolved and supported phylogeny of the subtribe that robustly indicates a sister relationship between Cocos and Syagrus. This is not only of biogeographic interest, but will also open fruitful avenues of inquiry regarding evolution of functional genes useful for crop improvement. Establishment of two major clades of American Attaleinae occurred in the Oligocene (ca. 37 MYBP) in Eastern Brazil. The divergence of Cocos from Syagrus is estimated at 35 MYBP. The biogeographic and morphological congruence that we see for
Digital Gene Expression Analysis Based on De Novo Transcriptome Assembly Reveals New Genes Associated with Floral Organ Differentiation of the Orchid Plant Cymbidium ensifolium.

Directory of Open Access Journals (Sweden)

Fengxi Yang

Full Text Available Cymbidium ensifolium belongs to the genus Cymbidium of the orchid family. Owing to its spectacular flower morphology, C. ensifolium has considerable ecological and cultural value. However, limited genetic data are available for this non-model plant, and the molecular mechanism underlying floral organ identity is still poorly understood. In this study, we characterize the floral transcriptome of C. ensifolium and present, for the first time, extensive sequence and transcript abundance data of individual floral organs. After sequencing, over 10 Gb clean sequence data were generated and assembled into 111,892 unigenes with an average length of 932.03 base pairs, including 1,227 clusters and 110,665 singletons. Assembled sequences were annotated with gene descriptions, gene ontology, clusters of orthologous group terms, the Kyoto Encyclopedia of Genes and Genomes, and the plant transcription factor database. From these annotations, 131 flowering-associated unigenes, 61 CONSTANS-LIKE (COL) unigenes and 90 floral homeotic genes were identified. In addition, four digital gene expression libraries were constructed for the sepal, petal, labellum and gynostemium, and 1,058 genes corresponding to individual floral organ development were identified. Among them, eight MADS-box genes were further investigated by full-length cDNA sequence analysis and expression validation, which revealed two APETALA1/AGL9-like MADS-box genes preferentially expressed in the sepal and petal, two AGAMOUS-like genes particularly restricted to the gynostemium, and four DEF-like genes distinctively expressed in different floral organs. The spatial expression of these genes varied distinctly among floral mutants with different floral morphogenesis, which validated their specialized roles in floral patterning and further supported the effectiveness of our in silico analysis. This dataset generated in our study provides new insights into the molecular mechanisms
CDNA Microarray Based Comparative Gene Expression Analysis of Primary Breast Tumors Versus In Vitro Transformed Neoplastic Breast Epithelium

National Research Council Canada - National Science Library

Szallasi, Zoltan

2001-01-01

.... The first group of clones is being sorted by their ability to form tumors. We are currently performing cDNA microarray analysis quantifying the expression level of about 15,000 genes in these cell lines...
Population genomic analysis of strain variation in Leptospirillum group II bacteria involved in acid mine drainage formation.

Science.gov (United States)

Simmons, Sheri L; Dibartolo, Genevieve; Denef, Vincent J; Goltsman, Daniela S Aliaga; Thelen, Michael P; Banfield, Jillian F

2008-07-22

Deeply sampled community genomic (metagenomic) datasets enable comprehensive analysis of heterogeneity in natural microbial populations. In this study, we used sequence data obtained from the dominant member of a low-diversity natural chemoautotrophic microbial community to determine how coexisting closely related individuals differ from each other in terms of gene sequence and gene content, and to uncover evidence of evolutionary processes that occur over short timescales. DNA sequence obtained from an acid mine drainage biofilm was reconstructed, taking into account the effects of strain variation, to generate a nearly complete genome tiling path for a Leptospirillum group II species closely related to L. ferriphilum (sampling depth approximately 20x). The population is dominated by one sequence type, yet we detected evidence for relatively abundant variants (>99.5% sequence identity to the dominant type) at multiple loci, and a few rare variants. Blocks of other Leptospirillum group II types (approximately 94% sequence identity) have recombined into one or more variants. Variant blocks of both types are more numerous near the origin of replication. Heterogeneity in genetic potential within the population arises from localized variation in gene content, typically focused in integrated plasmid/phage-like regions. Some laterally transferred gene blocks encode physiologically important genes, including quorum-sensing genes of the LuxIR system. Overall, results suggest inter- and intrapopulation genetic exchange involving distinct parental genome types and implicate gain and loss of phage and plasmid genes in recent evolution of this Leptospirillum group II population. Population genetic analyses of single nucleotide polymorphisms indicate variation between closely related strains is not maintained by positive selection, suggesting that these regions do not represent adaptive differences between strains. Thus, the most likely explanation for the observed patterns of
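The abstract above distinguishes variants by percent sequence identity to the dominant type (>99.5% versus approximately 94%). As a minimal sketch of how such a figure can be computed, assuming the two sequences are already aligned to equal length and that gap columns are simply skipped (both are assumptions made for this illustration, not the authors' stated procedure), the hypothetical helper `percent_identity` below counts matching positions:

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of identical positions between two pre-aligned sequences.

    Columns in which either sequence has a gap ('-') are skipped.
    This gap handling is an illustrative assumption.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # ignore gap columns
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


# Toy sequences, invented for the example: 7 of 8 positions match.
print(percent_identity("ACGTACGT", "ACGTACGA"))
```

Real analyses would normally rely on an alignment tool to produce the aligned input; the function here only covers the final counting step.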
Population genomic analysis of strain variation in Leptospirillum group II bacteria involved in acid mine drainage formation.

Directory of Open Access Journals (Sweden)

Sheri L Simmons

2008-07-01

Full Text Available Deeply sampled community genomic (metagenomic) datasets enable comprehensive analysis of heterogeneity in natural microbial populations. In this study, we used sequence data obtained from the dominant member of a low-diversity natural chemoautotrophic microbial community to determine how coexisting closely related individuals differ from each other in terms of gene sequence and gene content, and to uncover evidence of evolutionary processes that occur over short timescales. DNA sequence obtained from an acid mine drainage biofilm was reconstructed, taking into account the effects of strain variation, to generate a nearly complete genome tiling path for a Leptospirillum group II species closely related to L. ferriphilum (sampling depth approximately 20x). The population is dominated by one sequence type, yet we detected evidence for relatively abundant variants (>99.5% sequence identity to the dominant type) at multiple loci, and a few rare variants. Blocks of other Leptospirillum group II types (approximately 94% sequence identity) have recombined into one or more variants. Variant blocks of both types are more numerous near the origin of replication. Heterogeneity in genetic potential within the population arises from localized variation in gene content, typically focused in integrated plasmid/phage-like regions. Some laterally transferred gene blocks encode physiologically important genes, including quorum-sensing genes of the LuxIR system. Overall, results suggest inter- and intrapopulation genetic exchange involving distinct parental genome types and implicate gain and loss of phage and plasmid genes in recent evolution of this Leptospirillum group II population. Population genetic analyses of single nucleotide polymorphisms indicate variation between closely related strains is not maintained by positive selection, suggesting that these regions do not represent adaptive differences between strains. Thus, the most likely explanation for the
41 CFR 60-2.12 - Job group analysis.

Science.gov (United States)

2010-07-01

... 41 Public Contracts and Property Management 1 2010-07-01 2010-07-01 true Job group analysis. 60-2... group analysis. (a) Purpose: A job group analysis is a method of combining job titles within the... employed. (b) In the job group analysis, jobs at the establishment with similar content, wage rates, and...
Combined analysis of DNA methylome and transcriptome reveal novel candidate genes with susceptibility to bovine Staphylococcus aureus subclinical mastitis.

Science.gov (United States)

Song, Minyan; He, Yanghua; Zhou, Huangkai; Zhang, Yi; Li, Xizhi; Yu, Ying

2016-07-14

Subclinical mastitis is a widespread disease of lactating cows. Its major pathogen is Staphylococcus aureus (S. aureus). In this study, we performed genome-wide integrative analysis of DNA methylation and transcriptional expression to identify candidate genes and pathways relevant to bovine S. aureus subclinical mastitis. The genome-scale DNA methylation profiles of peripheral blood lymphocytes in cows with S. aureus subclinical mastitis (SA group) and healthy controls (CK) were generated by methylated DNA immunoprecipitation combined with microarrays. We identified 1078 differentially methylated genes in SA cows compared with the controls. By integrating DNA methylation and transcriptome data, 58 differentially methylated genes were shared with differentially expressed genes, in which 20.7% distinctly hypermethylated genes showed down-regulated expression in SA versus CK, whereas 14.3% dramatically hypomethylated genes showed up-regulated expression. Integrated pathway analysis suggested that these genes were related to inflammation, ErbB signalling pathway and mismatch repair. Further functional analysis revealed that three genes, NRG1, MST1 and NAT9, were strongly correlated with the progression of S. aureus subclinical mastitis and could be used as powerful biomarkers for the improvement of bovine mastitis resistance. Our studies lay the groundwork for epigenetic modification and mechanistic studies on susceptibility of bovine mastitis.
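The integration step described above (finding genes that are both differentially methylated and differentially expressed, then checking whether the direction of change is concordant) can be sketched as a set intersection. The function name `integrate`, the label strings, and the placeholder gene names are all hypothetical and chosen only for illustration; the abstract does not describe the authors' actual pipeline.

```python
def integrate(methylation: dict, expression: dict) -> dict:
    """Label genes found in both inputs by methylation/expression direction.

    methylation: gene -> "hyper" or "hypo"
    expression:  gene -> "up" or "down"
    Returns gene -> "hyper/down", "hypo/up", or "other".
    """
    shared = sorted(set(methylation) & set(expression))
    labels = {}
    for gene in shared:
        pair = (methylation[gene], expression[gene])
        if pair == ("hyper", "down"):
            labels[gene] = "hyper/down"  # hypermethylated and down-regulated
        elif pair == ("hypo", "up"):
            labels[gene] = "hypo/up"     # hypomethylated and up-regulated
        else:
            labels[gene] = "other"       # shared, but directions not inverse
    return labels


# Placeholder gene names and states, invented for the example.
meth = {"geneA": "hyper", "geneB": "hypo", "geneC": "hyper"}
expr = {"geneA": "down", "geneB": "up", "geneD": "up"}
print(integrate(meth, expr))
```

Percentages such as those quoted in the abstract would then follow from counting each label and dividing by the number of shared genes.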
Identification of suitable reference genes for gene expression studies of shoulder instability.

Directory of Open Access Journals (Sweden)

Mariana Ferreira Leal

Full Text Available Shoulder instability is a common shoulder injury, and patients present with plastic deformation of the glenohumeral capsule. Gene expression analysis may be a useful tool for increasing the general understanding of capsule deformation, and reverse-transcription quantitative polymerase chain reaction (RT-qPCR) has become an effective method for such studies. Although RT-qPCR is highly sensitive and specific, it requires the use of suitable reference genes for data normalization to guarantee meaningful and reproducible results. In the present study, we evaluated the suitability of a set of reference genes using samples from the glenohumeral capsules of individuals with and without shoulder instability. We analyzed the expression of six commonly used reference genes (ACTB, B2M, GAPDH, HPRT1, TBP and TFRC) in the antero-inferior, antero-superior and posterior portions of the glenohumeral capsules of cases and controls. The stability of the candidate reference gene expression was determined using four software packages: NormFinder, geNorm, BestKeeper and DataAssist. Overall, HPRT1 was the best single reference gene, and HPRT1 and B2M composed the best pair of reference genes from different analysis groups, including simultaneous analysis of all tissue samples. GenEx software was used to identify the optimal number of reference genes to be used for normalization and demonstrated that the accumulated standard deviation resulting from the use of 2 reference genes was similar to that resulting from the use of 3 or more reference genes. To identify the optimal combination of reference genes, we evaluated the expression of COL1A1. Although the use of different reference gene combinations yielded variable normalized quantities, the relative quantities within sample groups were similar and confirmed that no obvious differences were observed when using 2, 3 or 4 reference genes. Consequently, the use of 2 stable reference genes for normalization, especially
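To illustrate what normalizing a target gene against two reference genes involves, the sketch below uses the common delta-Ct calculation. It assumes 100% amplification efficiency (a doubling per cycle) and invented Ct values; the function name `normalized_expression` is hypothetical, and the abstract does not state which formula the authors applied.

```python
from statistics import mean


def normalized_expression(ct_target: float, ct_references: list) -> float:
    """Relative quantity of a target gene, 2 ** -(delta Ct).

    delta Ct = Ct(target) - mean Ct(reference genes).
    Averaging Ct values is equivalent to taking the geometric mean of the
    reference quantities, assuming 100% PCR efficiency (an assumption).
    """
    delta_ct = ct_target - mean(ct_references)
    return 2.0 ** (-delta_ct)


# Invented Ct values: a target (e.g. COL1A1) at 22.0 and two reference
# genes (e.g. HPRT1 and B2M) at 26.0 and 20.0 cycles.
print(normalized_expression(22.0, [26.0, 20.0]))
```

Adding a third or fourth reference gene only changes the averaged Ct, which is consistent with the abstract's observation that relative quantities within sample groups stayed similar across 2, 3 or 4 reference genes.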
Photoreceptor dysplasia (pd) in miniature schnauzer dogs: evaluation of candidate genes by molecular genetic analysis.

Science.gov (United States)

Zhang, Q; Baldwin, V J; Acland, G M; Parshall, C J; Haskel, J; Aguirre, G D; Ray, K

1999-01-01

Photoreceptor dysplasia (pd) is one of a group of at least six distinct autosomal and one X-linked retinal disorders identified in dogs which are collectively known as progressive retinal atrophy (PRA). It is an early onset retinal disease identified in miniature schnauzer dogs, and pedigree analysis and breeding studies have established autosomal recessive inheritance of the disease. Using a gene-based approach, a number of retina-expressed genes, including some members of the phototransduction pathway, have been causally implicated in retinal diseases of humans and other animals. Here we examined seven such potential candidate genes (opsin, RDS/peripherin, ROM1, rod cGMP-gated cation channel alpha-subunit, and three subunits of transducin) for their causal association with the pd locus by testing segregation of intragenic markers with the disease locus, or, in the absence of informative polymorphisms, sequencing of the coding regions of the genes. Based on these results, we have conclusively excluded four photoreceptor-specific genes as candidates for pd by linkage analysis. For three other photoreceptor-specific genes, we did not find any mutation in the coding sequences of the genes and have excluded them provisionally. Formal exclusion would require investigation of the levels of expression of the candidate genes in pd-affected dogs relative to age-matched controls. At present we are building suitable informative pedigrees for the disease locus with a sufficient number of meioses to be useful for genomewide screening. This should identify markers linked to the disease locus and eventually permit progress toward the identification of the photoreceptor dysplasia gene and the disease-causing mutation.

Group Counseling with United States Racial Minority Groups: A 25-Year Content Analysis

Science.gov (United States)

Stark-Rose, Rose M.; Livingston-Sacin, Tina M.; Merchant, Niloufer; Finley, Amanda C.

2012-01-01

A 25-year content analysis was conducted of published group work articles that focused on 5 racial groups (African American, Asian American/Pacific Islander, Latino/a, Native American, and Intercultural group). Articles were included if they described an intervention or conceptual model with 1 of the racial groups. The analysis revealed 15 content…
Evolutionary Analysis of Minor Histocompatibility Genes In Hydra

KAUST Repository

Aalismail, Nojood

2016-05-01

Hydra is a simple, solitary freshwater polyp used as a model system for studying evolution. Its immune response has not been studied extensively, and its immune response genes have not been identified or characterized, although immune responses have been investigated and genetic analyses begun in other lower invertebrates. In the present study, we examined self/nonself recognition in hydra and its relation to the immune response. Phylogenetic analysis to search for annotated immune genes in hydra allowed us to analyze the expression of minor histocompatibility genes, which have been shown to play a major role in grafting and transplantation in mammals. We obtained a cDNA library that shows expression of minor histocompatibility genes and confirmed that the sequences annotated in databases are actually present. In addition, grafting experiments suggested, although the results are still preliminary, that homografts elicited a weaker rejection response than heterografts. The involvement of possible minor histocompatibility gene orthologs in the immune response was examined by qPCR.
The Screening of Genes Sensitive to Long-Term, Low-Level Microwave Exposure and Bioinformatic Analysis of Potential Correlations to Learning and Memory.

Science.gov (United States)

Zhao, Ya Li; Li, Ying Xian; Ma, Hong Bo; Li, Dong; Li, Hai Liang; Jiang, Rui; Kan, Guang Han; Yang, Zhen Zhong; Huang, Zeng Xin

2015-08-01

This study aimed to gain a better understanding of gene expression changes in the mouse brain following microwave exposure and to reveal mechanisms contributing to microwave-induced learning and memory dysfunction. Mice were exposed to whole body 2100 MHz microwaves with specific absorption rates (SARs) of 0.45 W/kg, 1.8 W/kg, and 3.6 W/kg for 1 hour daily for 8 weeks. Differentially expressed genes in the brains were screened using high-density oligonucleotide arrays, with genes showing more significant differences further confirmed by RT-PCR. The gene chip results demonstrated that 41 genes (0.45 W/kg group), 29 genes (1.8 W/kg group), and 219 genes (3.6 W/kg group) were differentially expressed. GO analysis revealed that these differentially expressed genes were primarily involved in metabolic processes, cellular metabolic processes, regulation of biological processes, macromolecular metabolic processes, biosynthetic processes, cellular protein metabolic processes, transport, developmental processes, cellular component organization, etc. KEGG pathway analysis showed that these genes are mainly involved in pathways related to ribosome, Alzheimer's disease, Parkinson's disease, long-term potentiation, Huntington's disease, and Neurotrophin signaling. Construction of a protein interaction network identified several important regulatory genes including synbindin (sbdn), Crystallin (CryaB), PPP1CA, Ywhaq, Psap, Psmb1, Pcbp2, etc., which play important roles in the processes of learning and memory. Long-term, low-level microwave exposure may inhibit learning and memory by affecting protein and energy metabolic processes and signaling pathways relating to neurological functions or diseases. Copyright © 2015 The Editorial Board of Biomedical and Environmental Sciences. Published by China CDC. All rights reserved.
An 80-gene set to predict response to preoperative chemoradiotherapy for rectal cancer by principle component analysis.

Science.gov (United States)

Empuku, Shinichiro; Nakajima, Kentaro; Akagi, Tomonori; Kaneko, Kunihiko; Hijiya, Naoki; Etoh, Tsuyoshi; Shiraishi, Norio; Moriyama, Masatsugu; Inomata, Masafumi

2016-05-01

Preoperative chemoradiotherapy (CRT) for locally advanced rectal cancer not only improves the postoperative local control rate, but also induces downstaging. However, it has not been established how to individually select patients who receive effective preoperative CRT. The aim of this study was to identify a predictor of response to preoperative CRT for locally advanced rectal cancer. This study is additional to our multicenter phase II study evaluating the safety and efficacy of preoperative CRT using oral fluorouracil (UMIN ID: 03396). From April 2009 to August 2011, 26 biopsy specimens obtained prior to CRT were analyzed by cyclopedic microarray analysis. Response to CRT was evaluated according to a histological grading system using surgically resected specimens. To decide on the number of genes for dividing into responder and non-responder groups, we statistically analyzed the data using a dimension reduction method, principal component analysis. Of the 26 cases, 11 were responders and 15 non-responders. No significant difference was found in clinical background data between the two groups. We determined that the optimal number of genes for the prediction of response was 80 of 40,000 and the functions of these genes were analyzed. When comparing non-responders with responders, genes expressed at a high level functioned in alternative splicing, whereas those expressed at a low level functioned in the septin complex. Thus, an 80-gene expression set that predicts response to preoperative CRT for locally advanced rectal cancer was identified using a novel statistical method.
Characterization of a new Vaccinia virus isolate reveals the C23L gene as a putative genetic marker for autochthonous Group 1 Brazilian Vaccinia virus.

Directory of Open Access Journals (Sweden)

Felipe L Assis

Full Text Available Since 1999, several Vaccinia virus (VACV) isolates, the etiological agents of bovine vaccinia (BV), have been frequently isolated and characterized with various biological and molecular methods. The results from these approaches have grouped these VACV isolates into two different clusters. This dichotomy has elicited debates surrounding the origin of the Brazilian VACV and its epidemiological significance. To ascertain vital information to settle these debates, we and other research groups have made efforts to identify molecular markers to discriminate VACV from other viruses of the genus Orthopoxvirus (OPV) and other VACV-BR groups. In this way, some genes have been identified as useful markers to discriminate between the VACV-BR groups. However, new markers are needed to infer ancestry and to correlate each sample or group with its unique epidemiological and biological features. The aims of this work were to characterize a new VACV isolate (VACV DMTV-2005) molecularly and biologically using conserved and non-conserved gene analyses for phylogenetic inference and to search for new genes that would elucidate the VACV-BR dichotomy. The VACV DMTV-2005 isolate reported in this study is biologically and phylogenetically clustered with other strains of Group 1 VACV-BR, the most prevalent VACV group that was isolated during the bovine vaccinia outbreaks in Brazil. Sequence analysis of C23L, the gene that encodes the CC-chemokine-binding protein, revealed a ten-nucleotide deletion, which is a new Group 1 Brazilian VACV genetic marker. This deletion in the C23L open reading frame produces a premature stop-codon that is shared by all Group 1 VACV-BR strains and may also reflect the VACV-BR dichotomy; the deletion can also be considered to be a putative genetic marker for non-virulent Brazilian VACV isolates and may be used for the detection and molecular characterization of new isolates.
Genome-Wide Identification, Evolution and Expression Analysis of the Grape (Vitis vinifera L.) Zinc Finger-Homeodomain Gene Family

Directory of Open Access Journals (Sweden)

Hao Wang

2014-04-01

Full Text Available Plant zinc finger-homeodomain (ZHD) genes encode a family of transcription factors that have been demonstrated to play an important role in the regulation of plant growth and development. In this study, we identified a total of 13 ZHD genes (VvZHD) in the grape genome that were further classified into at least seven groups. Genome synteny analysis revealed that a number of VvZHD genes were present in the corresponding syntenic blocks of Arabidopsis, indicating that they arose before the divergence of these two species. Gene expression analysis showed that the identified VvZHD genes displayed distinct spatiotemporal expression patterns, and were differentially regulated under various stress conditions and hormone treatments, suggesting that the grape VvZHDs might also be involved in plant response to a variety of biotic and abiotic insults. Our work provides insightful information and knowledge about the ZHD genes in grape, which provides a framework for further characterization of their roles in regulation of stress tolerance as well as other aspects of grape productivity.
Prediction of Metastasis and Recurrence in Colorectal Cancer Based on Gene Expression Analysis: Ready for the Clinic?

Energy Technology Data Exchange (ETDEWEB)

Shibayama, Masaki [Sysmex Corporation, Central Research Laboratories, Kobe 651-2271 (Japan); Maak, Matthias; Nitsche, Ulrich [Chirurgische Klinik, Klinikum Rechts der Isar der TUM, München 81657 (Germany); Gotoh, Kengo [Sysmex Corporation, Central Research Laboratories, Kobe 651-2271 (Japan); Rosenberg, Robert; Janssen, Klaus-Peter, E-mail: klaus-peter.janssen@lrz.tum.de [Chirurgische Klinik, Klinikum Rechts der Isar der TUM, München 81657 (Germany)

2011-07-07

Cancers of the colon and rectum, which rank among the most frequent human tumors, are currently treated by surgical resection in locally restricted tumor stages. However, disease recurrence and formation of local and distant metastasis frequently occur even in cases with successful curative resection of the primary tumor (R0). Recent technological advances in molecular diagnostic analysis have led to a wealth of knowledge about the changes in gene transcription in all stages of colorectal tumors. Differential gene expression, or transcriptome analysis, has been proposed by many groups to predict disease recurrence, clinical outcome, and also response to therapy, in addition to the well-established clinico-pathological factors. However, the clinical usability of gene expression profiling as a reliable and robust prognostic tool that allows evidence-based clinical decisions is currently under debate. In this review, we will discuss the most recent data on the prognostic significance and potential clinical application of genome wide expression analysis in colorectal cancer.
Prediction of Metastasis and Recurrence in Colorectal Cancer Based on Gene Expression Analysis: Ready for the Clinic?

International Nuclear Information System (INIS)

Shibayama, Masaki; Maak, Matthias; Nitsche, Ulrich; Gotoh, Kengo; Rosenberg, Robert; Janssen, Klaus-Peter

2011-01-01

Cancers of the colon and rectum, which rank among the most frequent human tumors, are currently treated by surgical resection in locally restricted tumor stages. However, disease recurrence and formation of local and distant metastasis frequently occur even in cases with successful curative resection of the primary tumor (R0). Recent technological advances in molecular diagnostic analysis have led to a wealth of knowledge about the changes in gene transcription in all stages of colorectal tumors. Differential gene expression, or transcriptome analysis, has been proposed by many groups to predict disease recurrence, clinical outcome, and also response to therapy, in addition to the well-established clinico-pathological factors. However, the clinical usability of gene expression profiling as a reliable and robust prognostic tool that allows evidence-based clinical decisions is currently under debate. In this review, we will discuss the most recent data on the prognostic significance and potential clinical application of genome wide expression analysis in colorectal cancer
Gene set analysis: limitations in popular existing methods and proposed improvements.

Science.gov (United States)

Mishra, Pashupati; Törönen, Petri; Leino, Yrjö; Holm, Liisa

2014-10-01

Gene set analysis is the analysis of a set of genes that collectively contribute to a biological process. Most popular gene set analysis methods are based on empirical P-values that require a large number of permutations. Despite numerous gene set analysis methods developed in the past decade, the most popular methods still suffer from serious limitations. We present a gene set analysis method (mGSZ) based on Gene Set Z-scoring function (GSZ) and asymptotic P-values. Asymptotic P-value calculation requires fewer permutations, and thus speeds up the gene set analysis process. We compare the GSZ-scoring function with seven popular gene set scoring functions and show that GSZ stands out as the best scoring function. In addition, we show improved performance of the GSA method when the max-mean statistic is replaced by the GSZ scoring function. We demonstrate the importance of both gene and sample permutations by showing the consequences in the absence of one or the other. A comparison of asymptotic and empirical methods of P-value estimation demonstrates a clear advantage of asymptotic P-value over empirical P-value. We show that mGSZ outperforms the state-of-the-art methods based on two different evaluations. We compared mGSZ results with permutation and rotation tests and show that rotation does not improve our asymptotic P-values. We also propose well-known asymptotic distribution models for three of the compared methods. mGSZ is available as an R package from cran.r-project.org. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
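To make the contrast with empirical P-values concrete, the following is a minimal sketch of a gene-permutation empirical P-value for a single gene set. It is not the mGSZ method or the GSZ scoring function: the score used here (mean inside the set minus mean outside) and the names `set_score` and `empirical_p_value` are illustrative assumptions.

```python
import random
from statistics import mean


def set_score(gene_scores, gene_set):
    """Illustrative score: mean of genes in the set minus mean of all others."""
    inside = [s for g, s in gene_scores.items() if g in gene_set]
    outside = [s for g, s in gene_scores.items() if g not in gene_set]
    return mean(inside) - mean(outside)


def empirical_p_value(gene_scores, gene_set, n_perm=1000, seed=0):
    """Fraction of random same-size gene sets scoring at least as high."""
    rng = random.Random(seed)
    observed = set_score(gene_scores, gene_set)
    genes = list(gene_scores)
    hits = 0
    for _ in range(n_perm):
        # Gene permutation: draw a random set of the same size.
        random_set = set(rng.sample(genes, len(gene_set)))
        if set_score(gene_scores, random_set) >= observed:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_perm + 1)
```

The smallest P-value this estimator can report is 1 / (n_perm + 1), which is why empirical approaches need many permutations to resolve small P-values.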
Ensemble attribute profile clustering: discovering and characterizing groups of genes with similar patterns of biological features

Directory of Open Access Journals (Sweden)

Bissell MJ

2006-03-01

Full Text Available Background: Ensemble attribute profile clustering is a novel, text-based strategy for analyzing a user-defined list of genes and/or proteins. The strategy exploits annotation data present in gene-centered corpora and utilizes ideas from statistical information retrieval to discover and characterize properties shared by subsets of the list. The practical utility of this method is demonstrated by employing it in a retrospective study of two non-overlapping sets of genes defined by a published investigation as markers for normal human breast luminal epithelial cells and myoepithelial cells. Results: Each genetic locus was characterized using a finite set of biological properties and represented as a vector of features indicating attributes associated with the locus (a gene attribute profile). In this study, the vector space models for a pre-defined list of genes were constructed from the Gene Ontology (GO) terms and the Conserved Domain Database (CDD) protein domain terms assigned to the loci by the gene-centered corpus LocusLink. This data set of GO- and CDD-based gene attribute profiles, vectors of binary random variables, was used to estimate multiple finite mixture models and each ensuing model utilized to partition the profiles into clusters. The resultant partitionings were combined using a unanimous voting scheme to produce consensus clusters, sets of profiles that co-occurred consistently in the same cluster. Attributes that were important in defining the genes assigned to a consensus cluster were identified. The clusters and their attributes were inspected to ascertain the GO and CDD terms most associated with subsets of genes and, in conjunction with external knowledge such as chromosomal location, used to gain functional insights into human breast biology. The 52 luminal epithelial cell markers and 89 myoepithelial cell markers are disjoint sets of genes. Ensemble attribute profile clustering-based analysis indicated that both lists
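The unanimous voting step described in this abstract can be sketched as follows. This is a minimal illustration that assumes the individual partitionings are already available as item-to-label mappings; it does not fit the finite mixture models, and the function name `consensus_clusters` is hypothetical.

```python
from collections import defaultdict


def consensus_clusters(partitions):
    """Group items that share a cluster in every partition.

    `partitions` is a list of dicts, each mapping item -> cluster label.
    Two items land in the same consensus cluster only if all partitions
    agree on placing them together (unanimous vote).
    """
    groups = defaultdict(list)
    for item in partitions[0]:
        # The tuple of labels across partitions identifies the consensus cluster.
        signature = tuple(p[item] for p in partitions)
        groups[signature].append(item)
    return sorted(sorted(members) for members in groups.values())
```

For example, if one partitioning groups genes a and b together and c and d together, while a second groups a, b and c together and leaves d alone, the consensus clusters are {a, b}, {c} and {d}.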
A Comprehensive Classification and Evolutionary Analysis of Plant Homeobox Genes

OpenAIRE

Mukherjee, Krishanu; Brocchieri, Luciano; Bürglin, Thomas R.

2009-01-01

The full complement of homeobox transcription factor sequences, including genes and pseudogenes, was determined from the analysis of 10 complete genomes from flowering plants, moss, Selaginella, unicellular green algae, and red algae. Our exhaustive genome-wide searches resulted in the discovery in each class of a greater number of homeobox genes than previously reported. All homeobox genes can be unambiguously classified by sequence evolutionary analysis into 14 distinct classes also charact...
Genome-Wide Gene Set Analysis for Identification of Pathways Associated with Alcohol Dependence

Science.gov (United States)

Biernacka, Joanna M.; Geske, Jennifer; Jenkins, Gregory D.; Colby, Colin; Rider, David N.; Karpyak, Victor M.; Choi, Doo-Sup; Fridley, Brooke L.

2013-01-01

It is believed that multiple genetic variants with small individual effects contribute to the risk of alcohol dependence. Such polygenic effects are difficult to detect in genome-wide association studies that test for association of the phenotype with each single nucleotide polymorphism (SNP) individually. To overcome this challenge, gene set analysis (GSA) methods that jointly test for the effects of pre-defined groups of genes have been proposed. Rather than testing for association between the phenotype and individual SNPs, these analyses evaluate the global evidence of association with a set of related genes enabling the identification of cellular or molecular pathways or biological processes that play a role in development of the disease. It is hoped that by aggregating the evidence of association for all available SNPs in a group of related genes, these approaches will have enhanced power to detect genetic associations with complex traits. We performed GSA using data from a genome-wide study of 1165 alcohol dependent cases and 1379 controls from the Study of Addiction: Genetics and Environment (SAGE), for all 200 pathways listed in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Results demonstrated a potential role of the “Synthesis and Degradation of Ketone Bodies” pathway. Our results also support the potential involvement of the “Neuroactive Ligand Receptor Interaction” pathway, which has previously been implicated in addictive disorders. These findings demonstrate the utility of GSA in the study of complex disease, and suggest specific directions for further research into the genetic architecture of alcohol dependence. PMID:22717047
Selection and validation of reference genes for gene expression analysis in switchgrass (Panicum virgatum) using quantitative real-time RT-PCR.

Directory of Open Access Journals (Sweden)

Jacinta Gimeno

Full Text Available Switchgrass (Panicum virgatum) has received a lot of attention as a forage and bioenergy crop during the past few years. Gene expression studies are in progress to improve new traits and develop new cultivars. Quantitative real time PCR (qRT-PCR) has emerged as an important technique to study gene expression. For accurate and reliable results, normalization of data with reference genes is essential. In this work, we evaluate the stability of expression of genes to use as reference for qRT-PCR in the grass P. virgatum. Eleven candidate reference genes, including eEF-1α, UBQ6, ACT12, TUB6, eIF-4a, GAPDH, SAMDC, TUA6, CYP5, U2AF, and FTSH4, were validated for qRT-PCR normalization in different plant tissues and under different stress conditions. The expression stability of these genes was verified by the use of two distinct algorithms, geNorm and NormFinder. Differences were observed after comparison of the ranking of the candidate reference genes identified by both programs, but eEF-1α, eIF-4a, CYP5 and U2AF are ranked as the most stable genes in the sample sets under study. Both programs discard the use of SAMDC and TUA6 for normalization. Validation of the reference genes proposed by geNorm and NormFinder was performed by normalization of transcript abundance of a group of target genes in different samples. Results show similar expression patterns when the best reference genes selected by both programs were used, but differences were detected in the transcript abundance of the target genes. Based on the above research, we recommend the use of different statistical algorithms to identify the best reference genes for expression data normalization. The best genes selected in this study will help to improve the quality of gene expression data in a wide variety of samples in switchgrass.
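As an illustration of how a stability ranking of this kind can be computed, the sketch below follows the commonly cited definition of the geNorm stability measure M: for each candidate gene, the average over all other candidates of the standard deviation of the log2 expression ratio across samples. It is not the geNorm software itself, the function name `genorm_m` is hypothetical, and the input is assumed to be relative quantities on a linear scale.

```python
import math
from statistics import mean, stdev


def genorm_m(expression):
    """Return a dict gene -> M value (lower means more stable).

    `expression` maps each candidate gene to a list of relative
    quantities, one per sample, in the same sample order.
    """
    m_values = {}
    for gene, values in expression.items():
        pairwise_sd = []
        for other, other_values in expression.items():
            if other == gene:
                continue
            # Pairwise variation: SD of the log2 ratio across samples.
            log_ratios = [math.log2(a / b) for a, b in zip(values, other_values)]
            pairwise_sd.append(stdev(log_ratios))
        m_values[gene] = mean(pairwise_sd)
    return m_values
```

Two genes whose ratio is constant across samples contribute zero pairwise variation to each other, so a gene that fluctuates independently of the rest receives the highest M and would be discarded first.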
Use of a fragment of the tuf gene for phytoplasma 16Sr group/subgroup differentiation

DEFF Research Database (Denmark)

Contaldo, Nicoletta; Canel, Alessandro; Makarova, Olga

2011-01-01

The usefulness of RFLP analyses on a 435 bp fragment of the tuf gene for preliminary identification of phytoplasmas from a number of phytoplasma ribosomal groups and/or 'Candidatus Phytoplasma' was verified. The strains employed belong to thirteen 16Sr DNA groups and 22 different subgroups...
Rice Transcriptome Analysis to Identify Possible Herbicide Quinclorac Detoxification Genes

Directory of Open Access Journals (Sweden)

Wenying Xu

2015-09-01

Full Text Available Quinclorac is a highly selective auxin-type herbicide, and is widely used in the effective control of barnyard grass in paddy rice fields, improving the world’s rice yield. The herbicide mode of action of quinclorac has been proposed and hormone interactions affect quinclorac signaling. Because of widespread use, quinclorac may be transported outside rice fields with the drainage waters, leading to soil and water pollution and environmental health problems. In this study, we used the 57K Affymetrix rice whole-genome array to identify quinclorac signaling response genes to study the molecular mechanisms of action and detoxification of quinclorac in rice plants. Overall, 637 probe sets were identified with differential expression levels under either 6 or 24 h of quinclorac treatment. Auxin-related genes such as GH3 and OsIAAs responded to quinclorac treatment. Gene Ontology analysis showed that detoxification-related gene families were significantly enriched, including cytochrome P450, GST, UGT, and ABC and drug transporter genes. Moreover, real-time RT-PCR analysis showed that top candidate P450 families such as CYP81, CYP709C and CYP72A genes were universally induced by different herbicides. Some Arabidopsis genes for the same P450 family were up-regulated under quinclorac treatment. We conducted rice whole-genome GeneChip analysis and the first global identification of quinclorac response genes. This work may provide potential markers for detoxification of quinclorac and biomonitors of environmental chemical pollution.
Gene Expression Analysis in Tubule Interstitial Compartments Reveals Candidate Agents for IgA Nephropathy

Directory of Open Access Journals (Sweden)

Jinling Wang

2014-09-01

Full Text Available Background/Aims: Our aim was to explore the molecular mechanism underlying development of IgA nephropathy and discover candidate agents for IgA nephropathy. Methods: The differentially expressed genes (DEGs) between patients with IgA nephropathy and normal controls were identified using the GSE35488 data set downloaded from the GEO (Gene Expression Omnibus) database. The co-expressed gene pairs among DEGs were screened to construct the gene-gene interaction network. Gene Ontology (GO) enrichment analysis was performed to analyze the functions of DEGs. The biologically active small molecules capable of targeting IgA nephropathy were identified using the Connectivity Map (cMap) database. Results: A total of 55 genes involved in response to organic substance, transcription factor activity and response to steroid hormone stimulus were identified to be differentially expressed in IgA nephropathy patients compared to healthy individuals. A network with 45 co-expressed gene pairs was constructed. DEGs in the network were significantly enriched in response to organic substance. Additionally, a group of small molecules were identified, such as doxorubicin and thapsigargin. Conclusion: Our work provided a systematic insight in understanding the mechanism of IgA nephropathy. Small molecules such as thapsigargin might be potential candidate agents for the treatment of IgA nephropathy.
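The abstract does not state which statistical test was used for GO enrichment. One common choice is a one-sided hypergeometric test, sketched below under that assumption; the function name `enrichment_p_value` and its parameters are illustrative and not taken from the study.

```python
from math import comb


def enrichment_p_value(n_background, n_annotated, n_selected, n_overlap):
    """P(X >= n_overlap) when X is hypergeometric.

    n_background: genes measured in total
    n_annotated:  background genes carrying the GO term
    n_selected:   differentially expressed genes (DEGs)
    n_overlap:    DEGs carrying the GO term
    """
    upper = min(n_annotated, n_selected)
    total = comb(n_background, n_selected)
    tail = sum(
        comb(n_annotated, k) * comb(n_background - n_annotated, n_selected - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total
```

A small P-value means that the term is carried by more DEGs than would be expected if the DEG list were drawn at random from the background; in practice the P-values across all tested terms would also need a multiple-testing correction.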
Gene frequencies of ABO and Rh blood groups in Nigeria: A review ...

African Journals Online (AJOL)

Background: ABO and Rhesus factor (Rh) blood type are germane in human life in genetics and clinical studies. Aim of the study: The review was undertaken with the objective to provide data on the ABO and Rh(D) blood group distribution and gene frequency across Nigeria which is vital for blood transfusion and ...
Cloning and analysis of two Ceratopteris thalictroides MADS-box genes

Directory of Open Access Journals (Sweden)

XU Daolan

2014-06-01

Full Text Available MADS-box transcription factors, as a large gene family, play an important role in plant growth and development, and especially act as key regulators in controlling the identities of floral organs in flowering plants. They are also significant in revealing evolutionary history. In order to understand MADS-box genes, we need more information on MADS-box genes in non-flowering plants. MADS-box genes of Ceratopteris thalictroides were selected for cloning and analysis using the RACE method. Two MADS-box genes, designated CtMADS1 and CtMADS2 in C. thalictroides, were cloned. Analysis indicates that CtMADS1 belongs to the MIKC*-clade, while CtMADS2 belongs to the MIKCc-clade. Phylogeny suggests that these two MADS-box genes of C. thalictroides have a close relationship with those of flowering plants; the data indicate that at least two different MADS-box genes homologous to floral homeotic genes existed in the last common ancestor of contemporary vascular plants.
Genomic sequence around butterfly wing development genes: annotation and comparative analysis.

Directory of Open Access Journals (Sweden)

Inês C Conceição

Full Text Available BACKGROUND: Analysis of genomic sequence allows characterization of genome content and organization, and access beyond gene-coding regions for identification of functional elements. BAC libraries, where relatively large genomic regions are made readily available, are especially useful for species without a fully sequenced genome and can increase genomic coverage of phylogenetic and biological diversity. For example, no butterfly genome is yet available despite the unique genetic and biological properties of this group, such as diversified wing color patterns. The evolution and development of these patterns is being studied in a few target species, including Bicyclus anynana, where a whole-genome BAC library allows targeted access to large genomic regions. METHODOLOGY/PRINCIPAL FINDINGS: We characterize ∼1.3 Mb of genomic sequence around 11 selected genes expressed in B. anynana developing wings. Extensive manual curation of in silico predictions, also making use of a large dataset of expressed genes for this species, identified repetitive elements and protein coding sequence, and highlighted an expansion of Alcohol dehydrogenase genes. Comparative analysis with orthologous regions of the lepidopteran reference genome allowed assessment of conservation of fine-scale synteny (with detection of new inversions and translocations) and of DNA sequence (with detection of high levels of conservation of non-coding regions around some, but not all, developmental genes). CONCLUSIONS: The general properties and organization of the available B. anynana genomic sequence are similar to the lepidopteran reference, despite the more than 140 MY divergence. Our results lay the groundwork for further studies of new interesting findings in relation to both coding and non-coding sequence: 1) the Alcohol dehydrogenase expansion with higher similarity between the five tandemly-repeated B. anynana paralogs than with the corresponding B. mori orthologs, and 2) the high
Investigating a multigene prognostic assay based on significant pathways for Luminal A breast cancer through gene expression profile analysis.

Science.gov (United States)

Gao, Haiyan; Yang, Mei; Zhang, Xiaolan

2018-04-01

The present study aimed to investigate potential recurrence-risk biomarkers based on significant pathways for Luminal A breast cancer through gene expression profile analysis. Initially, the gene expression profiles of Luminal A breast cancer patients were downloaded from The Cancer Genome Atlas database. The differentially expressed genes (DEGs) were identified using the Limma package and hierarchical clustering analysis was conducted for the DEGs. In addition, the functional pathways were screened using Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses and rank ratio calculation. The multigene prognostic assay was developed based on the statistically significant pathways, and its prognostic function was tested using the training set and verified using the gene expression data and survival data of Luminal A breast cancer patients downloaded from the Gene Expression Omnibus. A total of 300 DEGs were identified between good and poor outcome groups, including 176 upregulated genes and 124 downregulated genes. The DEGs may be used to effectively distinguish Luminal A samples with different prognoses, as verified by hierarchical clustering analysis. There were 9 pathways screened as significant pathways and a total of 18 DEGs involved in these 9 pathways were identified as prognostic biomarkers. According to the survival analysis and receiver operating characteristic curve, the obtained 18-gene prognostic assay exhibited good prognostic function with high sensitivity and specificity for both the training and test samples. In conclusion, the 18-gene prognostic assay including the key genes, transcription factor 7-like 2, anterior parietal cortex and lymphocyte enhancer factor-1, may provide a new method for predicting outcomes and may be conducive to the promotion of precision medicine for Luminal A breast cancer.

Validation of Tuba1a as Appropriate Internal Control for Normalization of Gene Expression Analysis during Mouse Lung Development

Directory of Open Access Journals (Sweden)

Aditi Mehta

2015-02-01

Full Text Available The expression ratio between the analysed gene and an internal control gene is the most widely used normalization method for quantitative RT-PCR (qRT-PCR) expression analysis. The ideal reference gene for a specific experiment is the one whose expression is not affected by the different experimental conditions tested. In this study, we validate the applicability of five commonly used reference genes during different stages of mouse lung development. The stability of expression of five different reference genes (Tuba1a, Actb, Gapdh, Rn18S and Hist4h4) was calculated within five experimental groups using the statistical algorithm of geNorm software. Overall, Tuba1a showed the least variability in expression among the different stages of lung development, while Hist4h4 and Rn18S showed the maximum variability in their expression. Expression analysis of two lung specific markers, surfactant protein C (SftpC) and Clara cell-specific 10 kDa protein (Scgb1a1), normalized to each of the five reference genes tested here, confirmed our results and showed that incorrect reference gene choice can lead to artefacts. Moreover, a combination of two internal controls for normalization of expression analysis during lung development will increase the accuracy and reliability of results.
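The expression-ratio normalization mentioned at the start of this abstract is often computed with the 2^(-ΔΔCt) approach. The sketch below assumes that approach and a perfect amplification efficiency of 2 per cycle; the abstract does not state the exact formula used, and the function name `relative_expression` is hypothetical.

```python
def relative_expression(ct_target, ct_reference, ct_target_cal, ct_reference_cal):
    """Fold change of a target gene relative to a calibrator sample.

    Each Ct is a qRT-PCR threshold cycle. The target is first normalized
    to the reference gene within each sample (delta Ct), then the sample
    is compared with the calibrator (delta-delta Ct).
    """
    delta_sample = ct_target - ct_reference
    delta_calibrator = ct_target_cal - ct_reference_cal
    return 2.0 ** -(delta_sample - delta_calibrator)
```

If the reference gene itself shifts between conditions, that shift passes directly into the computed fold change, which is the kind of artefact the abstract attributes to an incorrect reference gene choice.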
Gastric Cancer Associated Genes Identified by an Integrative Analysis of Gene Expression Data

Directory of Open Access Journals (Sweden)

Bing Jiang

2017-01-01

Full Text Available Gastric cancer is one of the most severe complex diseases, with high morbidity and mortality worldwide. The molecular mechanisms and risk factors for this disease are still not clear because of the cancer heterogeneity caused by different genetic and environmental factors. With more and more expression data accumulated nowadays, we can perform integrative analysis of these data to understand the complexity of gastric cancer and to identify consensus players for the heterogeneous cancer. In the present work, we screened the published gene expression data and analyzed them with an integrative tool, combined with pathway and gene ontology enrichment investigation. We identified several consensus differentially expressed genes and these genes were further confirmed with literature mining; finally, two genes, that is, immunoglobulin J chain and C-X-C motif chemokine ligand 17, were screened as novel gastric cancer associated genes. Experimental validation is proposed to further confirm this finding.
Genomewide analysis of MATE-type gene family in maize reveals ...

Indian Academy of Sciences (India)

Huasheng Zhu and Jiandong Wu contributed equally to this work. As a group of secondary active transporters, the MATE gene family consists of multiple genes that widely exist in ..... Roots of the stress-treated plants were collected at 0,.
GenePublisher: automated analysis of DNA microarray data

DEFF Research Database (Denmark)

Knudsen, Steen; Workman, Christopher; Sicheritz-Ponten, T.

2003-01-01

GenePublisher, a system for automatic analysis of data from DNA microarray experiments, has been implemented with a web interface at http://www.cbs.dtu.dk/services/GenePublisher. Raw data are uploaded to the server together with a specification of the data. The server performs normalization...
Serial Expression Analysis: a web tool for the analysis of serial gene expression data

Science.gov (United States)

Nueda, Maria José; Carbonell, José; Medina, Ignacio; Dopazo, Joaquín; Conesa, Ana

2010-01-01

Serial transcriptomics experiments investigate the dynamics of gene expression changes associated with a quantitative variable such as time or dosage. The statistical analysis of these data implies the study of global and gene-specific expression trends, the identification of significant serial changes, the comparison of expression profiles and the assessment of transcriptional changes in terms of cellular processes. We have created the SEA (Serial Expression Analysis) suite to provide a complete web-based resource for the analysis of serial transcriptomics data. SEA offers five different algorithms based on univariate, multivariate and functional profiling strategies framed within a user-friendly interface and a project-oriented architecture to facilitate the analysis of serial gene expression data sets from different perspectives. SEA is available at sea.bioinfo.cipf.es. PMID:20525784
A multicolor panel of novel lentiviral "gene ontology" (LeGO) vectors for functional gene analysis.

Science.gov (United States)

Weber, Kristoffer; Bartsch, Udo; Stocking, Carol; Fehse, Boris

2008-04-01

Functional gene analysis requires the possibility of overexpression, as well as downregulation of one, or ideally several, potentially interacting genes. Lentiviral vectors are well suited for this purpose as they ensure stable expression of complementary DNAs (cDNAs), as well as short-hairpin RNAs (shRNAs), and can efficiently transduce a wide spectrum of cell targets when packaged within the coat proteins of other viruses. Here we introduce a multicolor panel of novel lentiviral "gene ontology" (LeGO) vectors designed according to the "building blocks" principle. Using a wide spectrum of different fluorescent markers, including drug-selectable enhanced green fluorescent protein (eGFP)- and dTomato-blasticidin-S resistance fusion proteins, LeGO vectors allow simultaneous analysis of multiple genes and shRNAs of interest within single, easily identifiable cells. Furthermore, each functional module is flanked by unique cloning sites, ensuring flexibility and individual optimization. The efficacy of these vectors for analyzing multiple genes in a single cell was demonstrated in several different cell types, including hematopoietic, endothelial, and neural stem and progenitor cells, as well as hepatocytes. LeGO vectors thus represent a valuable tool for investigating gene networks using conditional ectopic expression and knock-down approaches simultaneously.
Suppression subtractive hybridization and comparative expression analysis to identify developmentally regulated genes in filamentous fungi.

Science.gov (United States)

Gesing, Stefan; Schindler, Daniel; Nowrousian, Minou

2013-09-01

Ascomycetes differentiate four major morphological types of fruiting bodies (apothecia, perithecia, pseudothecia and cleistothecia) that are derived from an ancestral fruiting body. Thus, fruiting body differentiation is most likely controlled by a set of common core genes. One way to identify such genes is to search for genes with evolutionary conserved expression patterns. Using suppression subtractive hybridization (SSH), we selected differentially expressed transcripts in Pyronema confluens (Pezizales) by comparing two cDNA libraries specific for sexual and for vegetative development, respectively. The expression patterns of selected genes from both libraries were verified by quantitative real time PCR. Expression of several corresponding homologous genes was found to be conserved in two members of the Sordariales (Sordaria macrospora and Neurospora crassa), a derived group of ascomycetes that is only distantly related to the Pezizales. Knockout studies with N. crassa orthologues of differentially regulated genes revealed a functional role during fruiting body development for the gene NCU05079, encoding a putative MFS peptide transporter. These data indicate conserved gene expression patterns and a functional role of the corresponding genes during fruiting body development; such genes are candidates of choice for further functional analysis. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Genome-wide identification and comparative expression analysis reveal a rapid expansion and functional divergence of duplicated genes in the WRKY gene family of cabbage, Brassica oleracea var. capitata.

Science.gov (United States)

Yao, Qiu-Yang; Xia, En-Hua; Liu, Fei-Hu; Gao, Li-Zhi

2015-02-15

WRKY transcription factors (TFs), one of the ten largest TF families in higher plants, play important roles in regulating plant development and resistance. To date, little is known about the WRKY TF family in Brassica oleracea. The recently completed genome sequence of cabbage (B. oleracea var. capitata) allows us to systematically analyze WRKY genes in this species. A total of 148 WRKY genes were characterized and classified into seven subgroups that belong to three major groups. Phylogenetic and synteny analyses revealed that the repertoire of cabbage WRKY genes was derived from a common ancestor shared with Arabidopsis thaliana. The B. oleracea WRKY genes were found to be preferentially retained after the whole-genome triplication (WGT) event in its recent ancestor, suggesting that the WGT event had largely contributed to a rapid expansion of the WRKY gene family in B. oleracea. The analysis of RNA-Seq data from various tissues (i.e., roots, stems, leaves, buds, flowers and siliques) revealed that most of the identified WRKY genes were expressed in cabbage, and a large portion of them exhibited patterns of differential and tissue-specific expression, demonstrating that these gene members might play essential roles in plant developmental processes. Comparative analysis of the expression level among duplicated genes showed that gene expression divergence was evident among cabbage WRKY paralogs, indicating functional divergence of these duplicated WRKY genes. Copyright © 2014 Elsevier B.V. All rights reserved.
Quantitative RT-PCR analysis of estrogen receptor gene expression in laser microdissected prostate cancer tissue.

Science.gov (United States)

Walton, Thomas J; Li, Geng; McCulloch, Thomas A; Seth, Rashmi; Powe, Desmond G; Bishop, Michael C; Rees, Robert C

2009-06-01

Real-time quantitative RT-PCR analysis of laser microdissected tissue is considered the most accurate technique for determining tissue gene expression. The discovery of estrogen receptor beta (ERbeta) has focussed renewed interest on the role of estrogen receptors in prostate cancer, yet few studies have utilized the technique to analyze estrogen receptor gene expression in prostate cancer. Fresh tissue was obtained from 11 radical prostatectomy specimens and from 6 patients with benign prostate hyperplasia. Pure populations of benign and malignant prostate epithelium were laser microdissected, followed by RNA isolation and electrophoresis. Quantitative RT-PCR was performed using primers for androgen receptor (AR), estrogen receptor beta (ERbeta), estrogen receptor alpha (ERalpha), progesterone receptor (PGR) and prostate specific antigen (PSA), with normalization to two housekeeping genes. Differences in gene expression were analyzed using the Mann-Whitney U-test. Correlation coefficients were analyzed using Spearman's test. Significant positive correlations were seen when AR and AR-dependent PSA, and ERalpha and ERalpha-dependent PGR were compared, indicating a representative population of RNA transcripts. ERbeta gene expression was significantly over-expressed in the cancer group compared with benign controls (P [...] cancer group (P [...] prostate cancer specimens. In concert with recent studies, the findings suggest differential production of ERbeta splice variants, which may play important roles in the genesis of prostate cancer. (c) 2009 Wiley-Liss, Inc.
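The two rank-based statistics named in this abstract can be sketched in a few lines of standard-library Python. This is only an illustration, not the authors' analysis: the function names and the expression values are hypothetical, and no p-values are computed.

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the value at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def mann_whitney_u(group_a, group_b):
    """Mann-Whitney U statistic for group_a, computed from its rank sum."""
    ranks = average_ranks(list(group_a) + list(group_b))
    n_a = len(group_a)
    rank_sum_a = sum(ranks[:n_a])
    return rank_sum_a - n_a * (n_a + 1) / 2


def spearman_rho(x, y):
    """Spearman correlation: Pearson correlation of the two rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


if __name__ == "__main__":
    # Hypothetical normalized expression values, not data from the study.
    cancer = [2.1, 3.4, 2.9, 4.0]
    benign = [1.0, 1.8, 2.0]
    print("U (cancer vs benign):", mann_whitney_u(cancer, benign))
    print("rho:", spearman_rho([1.2, 2.3, 3.1, 4.8], [0.9, 2.0, 2.7, 5.1]))
```

In practice a statistics package would normally be used for both tests, since it also supplies the p-values and tie corrections that this sketch omits.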
De novo transcriptome assembly and analysis of differential gene expression in response to drought in European beech.

Directory of Open Access Journals (Sweden)

Markus Müller

Despite the ecological and economic importance of European beech (Fagus sylvatica L.), genomic resources of this species are still limited. This hampers an understanding of the molecular basis of adaptation to stress. Since beech will most likely be threatened by the consequences of climate change, an understanding of adaptive processes to climate change-related drought stress is of major importance. Here, we used RNA-seq to provide the first drought stress-related transcriptome of beech. In a drought stress trial with beech saplings, 50 samples were taken for RNA extraction at five points in time during a soil desiccation experiment. De novo transcriptome assembly and analysis of differential gene expression revealed 44,335 contigs, and 662 differentially expressed genes between the stress and normally watered control group. Gene expression was specific to the different time points, and only five genes were significantly differentially expressed between the stress and control group on all five sampling days. GO term enrichment showed that mostly genes involved in lipid- and homeostasis-related processes were upregulated, whereas genes involved in oxidative stress response were downregulated in the stressed seedlings. This study gives first insights into the genomic drought stress response of European beech, and provides new genetic resources for adaptation research in this species.
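The distinction drawn here between genes that are differentially expressed on every sampling day and genes that are specific to one time point comes down to set operations. The sketch below assumes each time point has already been reduced to a set of significant gene identifiers; the function names and gene IDs are hypothetical.

```python
def shared_across_timepoints(deg_by_day):
    """Return genes that are differentially expressed at every time point."""
    gene_sets = [set(genes) for genes in deg_by_day.values()]
    if not gene_sets:
        return set()
    return set.intersection(*gene_sets)


def specific_to_one_timepoint(deg_by_day):
    """Map each time point to the genes that are significant only there."""
    result = {}
    for day, genes in deg_by_day.items():
        # Union of the gene sets from all other time points.
        others = set().union(
            *(set(g) for d, g in deg_by_day.items() if d != day)
        )
        result[day] = set(genes) - others
    return result


if __name__ == "__main__":
    # Hypothetical identifiers; the study itself reports five sampling days.
    deg_by_day = {
        "day1": {"g1", "g2", "g3"},
        "day2": {"g1", "g3", "g4"},
        "day3": {"g1", "g5"},
    }
    print(shared_across_timepoints(deg_by_day))
    print(specific_to_one_timepoint(deg_by_day))
```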
Estimating phylogenetic relationships despite discordant gene trees across loci: the species tree of a diverse species group of feather mites (Acari: Proctophyllodidae).

Science.gov (United States)

Knowles, Lacey L; Klimov, Pavel B

2011-11-01

With the increased availability of multilocus sequence data, the lack of concordance of gene trees estimated for independent loci has focused attention on both the biological processes producing the discord and the methodologies used to estimate phylogenetic relationships. What has emerged is a suite of new analytical tools for phylogenetic inference--species tree approaches. In contrast to traditional phylogenetic methods that are stymied by the idiosyncrasies of gene trees, approaches for estimating species trees explicitly take into account the cause of discord among loci and, in the process, provide a direct estimate of phylogenetic history (i.e. the history of species divergence, not divergence of specific loci). We illustrate the utility of species tree estimates with an analysis of a diverse group of feather mites, the pinnatus species group (genus Proctophyllodes). Discord among four sequenced nuclear loci is consistent with theoretical expectations, given the short time separating speciation events (as evidenced by short internodes relative to terminal branch lengths in the trees). Nevertheless, many of the relationships are well resolved in a Bayesian estimate of the species tree; the analysis also highlights ambiguous aspects of the phylogeny that require additional loci. The broad utility of species tree approaches is discussed, and specifically, their application to groups with high speciation rates--a history of diversification with particular prevalence in host/parasite systems where species interactions can drive rapid diversification.
Evolution and expression analysis of the grape (Vitis vinifera L.) WRKY gene family.

Science.gov (United States)

Guo, Chunlei; Guo, Rongrong; Xu, Xiaozhao; Gao, Min; Li, Xiaoqin; Song, Junyang; Zheng, Yi; Wang, Xiping

2014-04-01

WRKY proteins comprise a large family of transcription factors that play important roles in plant defence regulatory networks, including responses to various biotic and abiotic stresses. To date, no large-scale study of WRKY genes has been undertaken in grape (Vitis vinifera L.). In this study, a total of 59 putative grape WRKY genes (VvWRKY) were identified and renamed on the basis of their respective chromosome distribution. A multiple sequence alignment analysis using all predicted grape WRKY gene coding sequences, together with those from Arabidopsis thaliana and tomato (Solanum lycopersicum), indicated that the 59 VvWRKY genes can be classified into three main groups (I-III). An evaluation of the duplication events suggested that several WRKY genes arose before the divergence of the grape and Arabidopsis lineages. Moreover, expression profiles derived from semiquantitative PCR and real-time quantitative PCR analyses showed distinct expression patterns in various tissues and in response to different treatments. Four VvWRKY genes showed a significantly higher expression in roots or leaves, 55 responded to varying degrees to at least one abiotic stress treatment, and the expression of 38 was altered following powdery mildew (Erysiphe necator) infection. Most VvWRKY genes were downregulated in response to abscisic acid or salicylic acid treatments, while the expression of a subset was upregulated by methyl jasmonate or ethylene treatments.
Bioinformatics analysis and detection of gelatinase encoded gene in Lysinibacillus sphaericus

Science.gov (United States)

Repin, Rul Aisyah Mat; Mutalib, Sahilah Abdul; Shahimi, Safiyyah; Khalid, Rozida Mohd.; Ayob, Mohd. Khan; Bakar, Mohd. Faizal Abu; Isa, Mohd Noor Mat

2016-11-01

In this study, we performed a bioinformatics analysis of the genome sequence of Lysinibacillus sphaericus (L. sphaericus) to identify the gene encoding gelatinase. L. sphaericus was isolated from soil, and its gelatinase is species-specific to porcine and bovine gelatin. This bacterium therefore offers the possibility of producing enzymes that are specific to each of the two meat species. The main focus of this research is to identify the gelatinase-encoding gene of L. sphaericus using bioinformatics analysis of the partially sequenced genome. Three candidate genes were identified: gelatinase candidate gene 1 (P1), NODE_71_length_93919_cov_158.931839_21, which is 1563 base pairs (bp) in size and encodes a 520-amino-acid sequence; gelatinase candidate gene 2 (P2), NODE_23_length_52851_cov_190.061386_17, which is 1776 bp in size and encodes a 591-amino-acid sequence; and gelatinase candidate gene 3 (P3), NODE_106_length_32943_cov_169.147919_8, which is 1701 bp in size and encodes a 566-amino-acid sequence. Three pairs of oligonucleotide primers, named F1, R1, F2, R2, F3 and R3, were designed to target short sequences of cDNA by PCR. The amplicons were reliably 1563 bp in size for candidate gene P1 and 1701 bp in size for candidate gene P3. Thus, the bioinformatics analysis of L. sphaericus identified genes encoding gelatinase.
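The gene sizes and protein lengths quoted for the three candidates are consistent with one another if each reported size is assumed to include the stop codon, which the abstract does not state explicitly. A minimal check under that assumption, with a hypothetical helper name:

```python
def protein_length_from_orf(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an open reading frame of orf_bp bases.

    Assumes the frame is complete, so its length must be a multiple of three.
    If includes_stop is True, the final codon is a stop codon and is not
    counted as an amino acid.
    """
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    # Gene sizes as reported in the abstract (bp).
    candidates = {"P1": 1563, "P2": 1776, "P3": 1701}
    for name, size in candidates.items():
        print(name, size, "bp ->", protein_length_from_orf(size), "aa")
```

With the stop codon counted in the gene size, the three candidates come out at 520, 591 and 566 amino acids, matching the lengths given above.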
Multiple Genes Cause Postmating Prezygotic Reproductive Isolation in the Drosophila virilis Group.

Science.gov (United States)

Ahmed-Braimah, Yasir H

2016-12-07

Understanding the genetic basis of speciation is a central problem in evolutionary biology. Studies of reproductive isolation have provided several insights into the genetic causes of speciation, especially in taxa that lend themselves to detailed genetic scrutiny. Reproductive barriers have usually been divided into those that occur before zygote formation (prezygotic) and after (postzygotic), with the latter receiving a great deal of attention over several decades. Reproductive barriers that occur after mating but before zygote formation [postmating prezygotic (PMPZ)] are especially understudied at the genetic level. Here, I present a phenotypic and genetic analysis of a PMPZ reproductive barrier between two species of the Drosophila virilis group: D. americana and D. virilis. This species pair shows strong PMPZ isolation, especially when D. americana males mate with D. virilis females: ∼99% of eggs laid after these heterospecific copulations are not fertilized. Previous work has shown that the paternal loci contributing to this incompatibility reside on two chromosomes, one of which (chromosome 5) likely carries multiple factors. The other (chromosome 2) is fixed for a paracentric inversion that encompasses nearly half the chromosome. Here, I present two results. First, I show that PMPZ in this species cross is largely due to defective sperm storage in heterospecific copulations. Second, using advanced intercross and backcross mapping approaches, I identify genomic regions that carry genes capable of rescuing heterospecific fertilization. I conclude that paternal incompatibility between D. americana males and D. virilis females is underlain by four or more genes on chromosomes 2 and 5. Copyright © 2016 Ahmed-Braimah.
Genome-wide investigation and transcriptome analysis of the WRKY gene family in Gossypium.

Science.gov (United States)

Ding, Mingquan; Chen, Jiadong; Jiang, Yurong; Lin, Lifeng; Cao, YueFen; Wang, Minhua; Zhang, Yuting; Rong, Junkang; Ye, Wuwei

2015-02-01

WRKY transcription factors play important roles in various stress responses in diverse plant species. In cotton, this family has not been well studied, especially in relation to fiber development. Here, the genomes and transcriptomes of Gossypium raimondii and Gossypium arboreum were investigated to identify fiber development related WRKY genes. This represents the first comprehensive comparative study of WRKY transcription factors in both diploid A and D cotton species. In total, 112 G. raimondii and 109 G. arboreum WRKY genes were identified. No significant gene structure or domain alterations were detected between the two species, but many SNPs were distributed unequally in exon and intron regions. Physical mapping revealed that the WRKY genes in G. arboreum were not located in the corresponding chromosomes of G. raimondii, suggesting great chromosome rearrangement in the diploid cotton genomes. The cotton WRKY genes, especially subgroups I and II, have expanded through multiple whole genome duplications and tandem duplications compared with other plant species. Sequence comparison showed many functionally divergent sites between WRKY subgroups, while the genes within each group are under strong purifying selection. Transcriptome analysis suggested that many WRKY genes participate in specific fiber development processes such as fiber initiation, elongation and maturation with different expression patterns between species. Complex WRKY gene expression such as differential Dt and At allelic gene expression in G. hirsutum and alternative splicing events were also observed in both diploid and tetraploid cottons during the fiber development process. In conclusion, this study provides important information on the evolution and function of the WRKY gene family in cotton species.
Comprehensive identification and expression analysis of Hsp90s gene family in Solanum lycopersicum.

Science.gov (United States)

Zai, W S; Miao, L X; Xiong, Z L; Zhang, H L; Ma, Y R; Li, Y L; Chen, Y B; Ye, S G

2015-07-14

Heat shock protein 90 (Hsp90) is a protein produced by plants in response to adverse environmental stresses. In this study, we identified and analyzed Hsp90 gene family members using a bioinformatic method based on genomic data from tomato (Solanum lycopersicum L.). The results illustrated that tomato contains at least 7 Hsp90 genes distributed on 6 chromosomes; protein lengths ranged from 267-794 amino acids. Intron numbers ranged from 2-19 in the genes. The phylogenetic tree revealed that Hsp90 genes in tomato (Solanum lycopersicum L.), rice (Oryza sativa L.), and Arabidopsis (Arabidopsis thaliana L.) could be divided into 5 groups, which included 3 pairs of orthologous genes and 4 pairs of paralogous genes. Expression analysis of RNA-sequence data showed that the Hsp90-1 gene was specifically expressed in mature fruits, while Hsp90-5 and Hsp90-6 showed opposite expression patterns in various tissues of cultivated and wild tomatoes. The expression levels of the Hsp90-1, Hsp90-2, and Hsp90-3 genes in various tissues of cultivated tomatoes were high, while both the expression levels of genes Hsp90-3 and Hsp90-4 were low. Additionally, quantitative real-time polymerase chain reaction showed that these genes were involved in the responses to yellow leaf curl virus in tomato plant leaves. Our results provide a foundation for identifying the function of the Hsp90 gene in tomato.
MiR-210 disturbs mitotic progression through regulating a group of mitosis-related genes

OpenAIRE

He, Jie; Wu, Jiangbin; Xu, Naihan; Xie, Weidong; Li, Mengnan; Li, Jianna; Jiang, Yuyang; Yang, Burton B.; Zhang, Yaou

2012-01-01

MiR-210 is up-regulated in multiple cancer types but its function is disputable and further investigation is necessary. Using a bioinformatics approach, we identified the putative target genes of miR-210 in hypoxia-induced CNE cells from genome-wide scale. Two functional gene groups related to cell cycle and RNA processing were recognized as the major targets of miR-210. Here, we investigated the molecular mechanism and biological consequence of miR-210 in cell cycle regulation, particularly ...
Symbiotic Burkholderia Species Show Diverse Arrangements of nif/fix and nod Genes and Lack Typical High-Affinity Cytochrome cbb3 Oxidase Genes.

Science.gov (United States)

De Meyer, Sofie E; Briscoe, Leah; Martínez-Hidalgo, Pilar; Agapakis, Christina M; de-Los Santos, Paulina Estrada; Seshadri, Rekha; Reeve, Wayne; Weinstock, George; O'Hara, Graham; Howieson, John G; Hirsch, Ann M

2016-08-01

Genome analysis of fourteen mimosoid and four papilionoid beta-rhizobia together with fourteen reference alpha-rhizobia for both nodulation (nod) and nitrogen-fixing (nif/fix) genes has shown phylogenetic congruence between 16S rRNA/MLSA (combined 16S rRNA gene sequencing and multilocus sequence analysis) and nif/fix genes, indicating a free-living diazotrophic ancestry of the beta-rhizobia. However, deeper genomic analysis revealed a complex symbiosis acquisition history in the beta-rhizobia that clearly separates the mimosoid and papilionoid nodulating groups. Mimosoid-nodulating beta-rhizobia have nod genes tightly clustered in the nodBCIJHASU operon, whereas papilionoid-nodulating Burkholderia have nodUSDABC and nodIJ genes, although their arrangement is not canonical because the nod genes are subdivided by the insertion of nif and other genes. Furthermore, the papilionoid Burkholderia spp. contain duplications of several nod and nif genes. The Burkholderia nifHDKEN and fixABC genes are very closely related to those found in free-living diazotrophs. In contrast, nifA is highly divergent between both groups, but the papilionoid species nifA is more similar to alpha-rhizobia nifA than to other groups. Surprisingly, for all Burkholderia, the fixNOQP and fixGHIS genes required for cbb3 cytochrome oxidase production and assembly are missing. In contrast, symbiotic Cupriavidus strains have fixNOQPGHIS genes, revealing a divergence in the evolution of two distinct electron transport chains required for nitrogen fixation within the beta-rhizobia.
Detection of virulence genes and the phylogenetic groups of Escherichia coli isolated from dogs in Brazil

Directory of Open Access Journals (Sweden)

Fernanda Morcatti Coura

2018-02-01

This study identified the virulence genes, pathovars, and phylogenetic groups of Escherichia coli strains obtained from the feces of dogs with and without diarrhea. Virulence genes and phylogenetic group identification were studied using polymerase chain reaction. Thirty-seven E. coli isolates were positive for at least one virulence factor gene. Twenty-one (56.8%) of the positive isolates were isolated from diarrheal feces and sixteen (43.2%) were from the feces of non-diarrheic dogs. Enteropathogenic E. coli (EPEC) were the most frequently detected pathovar (62.2%) in dog feces and were mainly from phylogroups B1 and E. Necrotoxigenic E. coli were detected in 16.2% of the virulence-positive isolates; these contained the cytotoxic necrotizing factor 1 (cnf1) gene and were classified into phylogroups B2 and D. All E. coli strains were negative for the presence of enterotoxigenic E. coli (ETEC) enterotoxin genes, but four strains were positive for ETEC-related fimbriae 987P and F18. Two isolates were Shiga toxin-producing E. coli strains and contained the toxin genes Stx2 or Stx2e, both from phylogroup B1. Our data showed that EPEC was the most frequent pathovar and B1 and E were the most common phylogroups detected in E. coli isolated from the feces of diarrheic and non-diarrheic dogs.
Consensus strategy in genes prioritization and combined bioinformatics analysis for preeclampsia pathogenesis.

Science.gov (United States)

Tejera, Eduardo; Cruz-Monteagudo, Maykel; Burgos, Germán; Sánchez, María-Eugenia; Sánchez-Rodríguez, Aminael; Pérez-Castillo, Yunierkis; Borges, Fernanda; Cordeiro, Maria Natália Dias Soeiro; Paz-Y-Miño, César; Rebelo, Irene

2017-08-08

Preeclampsia is a multifactorial disease with unknown pathogenesis. Although recent studies have explored this disease using several bioinformatics tools, their main objective was not directed to pathogenesis. Additionally, consensus prioritization has proved to be highly efficient in the recognition of gene-disease associations. However, no information is available about the ability of consensus strategies to recognize early the genes directly involved in pathogenesis. Therefore, our aim in this study is to apply several theoretical approaches to explore preeclampsia, specifically those genes directly involved in the pathogenesis. We first evaluated the consensus between 12 prioritization strategies for early recognition of pathogenic genes related to preeclampsia. A communality analysis in the protein-protein interaction network of previously selected genes was done, including further enrichment analysis. The enrichment analysis includes metabolic pathways as well as gene ontology. Microarray data were also collected and used to confirm our results or as a strategy to weight the previously enriched pathways. The consensus prioritized gene list was rationally filtered to 476 genes using several criteria. The communality analysis showed an enrichment of communities connected with the VEGF-signaling pathway. This pathway is also enriched considering the microarray data. Our results point to VEGF, FLT1 and KDR as relevant pathogenic genes, as well as those connected with NO metabolism. Our results revealed that the consensus strategy improves the detection and initial enrichment of pathogenic genes, at least in preeclampsia. Moreover, the combination of the first percent of the prioritized genes with the protein-protein interaction network followed by communality analysis reduces the gene space. This approach identifies well-known genes related to pathogenesis. However, genes like HSP90, PAK2, CD247 and others included in the first 1% of the prioritized list need to be further

Multitarget Effects of Danqi Pill on Global Gene Expression Changes in Myocardial Ischemia

Directory of Open Access Journals (Sweden)

Qiyan Wang

2018-01-01

Danqi pill (DQP) is a widely prescribed traditional Chinese medicine (TCM) in the treatment of cardiovascular diseases. The objective of this study is to systematically characterize the altered gene expression pattern induced by myocardial ischemia (MI) in a rat model and to investigate the effects of DQP on global gene expression. Global mRNA expression was measured. Differentially expressed genes among the sham group, model group, and DQP group were analyzed. The gene ontology enrichment analysis and pathway analysis of differentially expressed genes were carried out. We quantified 10,813 genes. Compared with the sham group, expressions of 339 genes were upregulated and 177 genes were downregulated in the model group. The upregulated genes were enriched in extracellular matrix organization, response to wounding, and defense response pathways. Downregulated genes were enriched in fatty acid metabolism, pyruvate metabolism, PPAR signaling pathways, and so forth. This indicated that energy metabolic disorders occurred in rats with MI. In the DQP group, expressions of genes in the altered pathways were regulated back towards normal levels. DQP reversed expression of 313 of the 516 differentially expressed genes in the model group. This study provides insight into the multitarget mechanism of TCM in the treatment of complex diseases.
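The counts in this abstract fit together arithmetically: 339 upregulated plus 177 downregulated genes give the 516 differentially expressed genes, of which 313 (about 60.7%) were reversed. The abstract does not say how "reversed" was defined, so the sketch below uses an assumed rule (a gene changed in one direction in model versus sham and in the opposite direction in DQP versus model) and an assumed log2 fold-change threshold; the function names and example values are hypothetical.

```python
def classify_change(log2_fc, threshold=1.0):
    """Label a log2 fold change as 'up', 'down' or 'unchanged'.

    The threshold of 1.0 (a two-fold change) is an assumption for
    illustration, not a value taken from the study.
    """
    if log2_fc >= threshold:
        return "up"
    if log2_fc <= -threshold:
        return "down"
    return "unchanged"


def is_reversed(model_vs_sham, treated_vs_model, threshold=1.0):
    """True if treatment moves a gene opposite to the disease-induced change."""
    first = classify_change(model_vs_sham, threshold)
    second = classify_change(treated_vs_model, threshold)
    return (first, second) in {("up", "down"), ("down", "up")}


if __name__ == "__main__":
    # Counts as reported in the abstract.
    upregulated, downregulated, reversed_genes = 339, 177, 313
    total = upregulated + downregulated
    print("total DEGs:", total)
    print("percent reversed:", round(100 * reversed_genes / total, 1))

    # Hypothetical log2 fold changes for two genes.
    print(is_reversed(2.3, -1.7))   # up in model, down after treatment
    print(is_reversed(-1.4, 0.2))   # down in model, no clear change after
```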
Genome-Wide Characterization of bHLH Genes in Grape and Analysis of their Potential Relevance to Abiotic Stress Tolerance and Secondary Metabolite Biosynthesis

Science.gov (United States)

Wang, Pengfei; Su, Ling; Gao, Huanhuan; Jiang, Xilong; Wu, Xinying; Li, Yi; Zhang, Qianqian; Wang, Yongmei; Ren, Fengshan

2018-01-01

Basic helix-loop-helix (bHLH) transcription factors are involved in many abiotic stress responses as well as flavonol and anthocyanin biosynthesis. In grapes (Vitis vinifera L.), flavonols including anthocyanins and condensed tannins are most abundant in the skins of the berries. Flavonols are important phytochemicals for viticulture and enology, but grape bHLH genes have rarely been examined. We identified 94 grape bHLH genes in a genome-wide analysis and performed Nr and GO function analyses for these genes. Phylogenetic analyses placed the genes into 15 clades, with some remaining orphans. 41 duplicate gene pairs were found in the grape bHLH gene family, and all of these duplicate gene pairs underwent purifying selection. Nine triplicate gene groups were found in the grape bHLH gene family and all of these triplicate gene groups underwent purifying selection. Twenty-two grape bHLH genes could be induced by PEG treatment and 17 grape bHLH genes could be induced by cold stress treatment including a homologous form of MYC2, VvbHLH007. Based on the GO or Nr function annotations, we found three other genes that are potentially related to anthocyanin or flavonol biosynthesis: VvbHLH003, VvbHLH007, and VvbHLH010. We also performed a cis-acting regulatory element analysis on some genes involved in flavonoid or anthocyanin biosynthesis and our results showed that most of these gene promoters contained G-box or E-box elements that could be recognized by bHLH family members. PMID:29449854
Bioinformatic Analysis of Strawberry GSTF12 Gene

Science.gov (United States)

Wang, Xiran; Jiang, Leiyu; Tang, Haoru

2018-01-01

GSTF12 is known as a key factor in proanthocyanidin accumulation in the plant testa. Bioinformatics analysis of the nucleotide and encoded protein sequences of GSTF12 is therefore useful for studying genes related to the anthocyanin biosynthesis and accumulation pathway. We chose the GSTF12 genes of 11 species as the research object, downloaded their nucleotide and protein sequences from NCBI, identified the strawberry GSTF12 gene by bioinformatics analysis, and constructed a phylogenetic tree. We also analysed the physical and chemical properties of the strawberry GSTF12 gene and the structure of its protein. The phylogenetic tree showed that strawberry and petunia were the closest relatives. Protein prediction indicated that the protein has one signal peptide and no obvious transmembrane regions.
Final report of the group research. Genome analysis on the biological effects of radiation. Second research group of NIRS

International Nuclear Information System (INIS)

2001-10-01

This report concerns investigations on the title topic conducted by 5 subgroups of the National Institute of Radiological Sciences (NIRS) during the period of 1993-2001. The report covers the organization of research teams and summary reports from the subgroups for Genome sequencing and informatics, Genome analysis on model organisms, The genome analysis on the specific chromosomal region related to radiation-sensitivity, Molecular analysis on the structure and function of particular regions of human genome, and Generation and characterization of DNA repair-deficient model mice. Significant results are as follows: Sequencing of the radiation sensitivity gene ATM, finding of a novel cell cycle regulator gene NPAT and regulation of gene expression of ATM/NPAT; Findings that the cause of the variability related to instability of human genome is derived from particular repeat structures of 5 and 35 bases and of the instability mutation, from the mutation of EPILS (mRNA synthase gene); Program development for novel human genome finding in the DNA sequences and making novel human gene as a resource by polymerase chain reaction (PCR) technique; and generation of the highly UV-sensitive mouse model for human xeroderma pigmentosum G. The conclusion is that the findings will contribute to a better understanding of the genes involved in radiation sensitivity and of the biodefense mechanisms against radiation and other environmental stresses. (N.I.)
An Integrative Analysis to Identify Driver Genes in Esophageal Squamous Cell Carcinoma.

Directory of Open Access Journals (Sweden)

Genta Sawada

Few driver genes have been well established in esophageal squamous cell carcinoma (ESCC). Identification of the genomic aberrations that contribute to changes in gene expression profiles can be used to predict driver genes. We searched for driver genes in ESCC by integrative analysis of gene expression microarray profiles and copy number data. To narrow down candidate genes, we performed survival analysis on expression data and tested the genetic vulnerability of each gene using public RNAi screening data. We confirmed the results by performing RNAi experiments and evaluating the clinical relevance of candidate genes in an independent ESCC cohort. We found 10 significantly recurrent copy number alterations accompanying gene expression changes, including loci 11q13.2, 7p11.2, 3q26.33, and 17q12, which harbored CCND1, EGFR, SOX2, and ERBB2, respectively. Analysis of survival data and RNAi screening data suggested that GRB7, located on 17q12, was a driver gene in ESCC. In ESCC cell lines harboring 17q12 amplification, knockdown of GRB7 reduced the proliferation, migration, and invasion capacities of cells. Moreover, siRNA targeting GRB7 had a synergistic inhibitory effect when combined with trastuzumab, an anti-ERBB2 antibody. Survival analysis of the independent cohort also showed that high GRB7 expression was associated with poor prognosis in ESCC. Our integrative analysis provided important insights into ESCC pathogenesis. We identified GRB7 as a novel ESCC driver gene and potential new therapeutic target.
Identification of putative regulatory motifs in the upstream regions of co-expressed functional groups of genes in Plasmodium falciparum

Directory of Open Access Journals (Sweden)

Joshi NV

2009-01-01

Full Text Available Abstract Background Regulation of gene expression in Plasmodium falciparum (Pf) remains poorly understood. While over half the genes are estimated to be regulated at the transcriptional level, few regulatory motifs and transcription regulators have been found. Results The study seeks to identify putative regulatory motifs in the upstream regions of 13 functional groups of genes expressed in the intraerythrocytic developmental cycle of Pf. Three motif-discovery programs were used for the purpose, and motifs were searched for only on the gene coding strand. Four motifs – the 'G-rich', the 'C-rich', the 'TGTG' and the 'CACA' motifs – were identified, and zero to all four of these occur in the 13 sets of upstream regions. The 'CACA motif' was absent in functional groups expressed during the ring to early trophozoite transition. For functional groups expressed in each transition, the motifs tended to be similar. Upstream motifs in some functional groups showed 'positional conservation' by occurring at similar positions relative to the translational start site (TLS); this increases their significance as regulatory motifs. In the ribonucleotide synthesis, mitochondrial, proteasome and organellar translation machinery genes, G-rich, C-rich, CACA and TGTG motifs, respectively, occur with striking positional conservation. In the organellar translation machinery group, G-rich motifs occur close to the TLS. The same motifs were sometimes identified for multiple functional groups; differences in location and abundance of the motifs appear to ensure different modes of action. Conclusion The identification of positionally conserved over-represented upstream motifs throws light on putative regulatory elements for transcription in Pf.
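The notion of positional conservation described in this abstract can be illustrated with a minimal sketch. It is not the study's method (the three motif-discovery programs are not named here and are not reproduced); the helper name `motif_offsets` and the example sequences are hypothetical. Each upstream sequence is assumed to be given on the coding strand and to end immediately before the translational start site (TLS), so the last base has offset -1.

```python
def motif_offsets(upstream, motif):
    """Return start offsets of `motif` relative to the TLS (negative = upstream).

    `upstream` is assumed to end right before the translational start site,
    so its last base has offset -1. Only the given (coding) strand is scanned,
    and overlapping occurrences are all reported.
    """
    upstream = upstream.upper()
    motif = motif.upper()
    offsets = []
    start = upstream.find(motif)
    while start != -1:
        offsets.append(start - len(upstream))
        start = upstream.find(motif, start + 1)
    return offsets


# Hypothetical upstream regions for one functional group of genes.
group = {
    "geneA": "ATATTGTGATATATAT",
    "geneB": "TTATTGTGATTTATAA",
    "geneC": "ATTGTGTGATATATAT",
}

for name, seq in group.items():
    # Similar offsets across genes would hint at positional conservation.
    print(name, motif_offsets(seq, "TGTG"))
```

If most genes in a group report the motif at similar offsets, that is the kind of pattern the abstract calls positional conservation; a real analysis would also need a background model to judge over-representation.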
GeneAnalytics: An Integrative Gene Set Analysis Tool for Next Generation Sequencing, RNAseq and Microarray Data.

Science.gov (United States)

Ben-Ari Fuchs, Shani; Lieder, Iris; Stelzer, Gil; Mazor, Yaron; Buzhor, Ella; Kaplan, Sergey; Bogoch, Yoel; Plaschkes, Inbar; Shitrit, Alina; Rappaport, Noa; Kohn, Asher; Edgar, Ron; Shenhav, Liraz; Safran, Marilyn; Lancet, Doron; Guan-Golan, Yaron; Warshawsky, David; Shtrichman, Ronit

2016-03-01

Postgenomics data are produced in large volumes by life sciences and clinical applications of novel omics diagnostics and therapeutics for precision medicine. To move from "data-to-knowledge-to-innovation," a crucial missing step in the current era is, however, our limited understanding of biological and clinical contexts associated with data. Prominent among the emerging remedies to this challenge are the gene set enrichment tools. This study reports on GeneAnalytics™ ( geneanalytics.genecards.org ), a comprehensive and easy-to-apply gene set analysis tool for rapid contextualization of expression patterns and functional signatures embedded in the postgenomics Big Data domains, such as Next Generation Sequencing (NGS), RNAseq, and microarray experiments. GeneAnalytics' differentiating features include in-depth evidence-based scoring algorithms, an intuitive user interface and proprietary unified data. GeneAnalytics employs the LifeMap Science's GeneCards suite, including the GeneCards®--the human gene database; the MalaCards-the human diseases database; and the PathCards--the biological pathways database. Expression-based analysis in GeneAnalytics relies on the LifeMap Discovery®--the embryonic development and stem cells database, which includes manually curated expression data for normal and diseased tissues, enabling advanced matching algorithm for gene-tissue association. This assists in evaluating differentiation protocols and discovering biomarkers for tissues and cells. Results are directly linked to gene, disease, or cell "cards" in the GeneCards suite. Future developments aim to enhance the GeneAnalytics algorithm as well as visualizations, employing varied graphical display items. Such attributes make GeneAnalytics a broadly applicable postgenomics data analyses and interpretation tool for translation of data to knowledge-based innovation in various Big Data fields such as precision medicine, ecogenomics, nutrigenomics, pharmacogenomics, vaccinomics
Identification of novel risk genes associated with type 1 diabetes mellitus using a genome-wide gene-based association analysis.

Science.gov (United States)

Qiu, Ying-Hua; Deng, Fei-Yan; Li, Min-Jing; Lei, Shu-Feng

2014-11-01

Type 1 diabetes mellitus is a serious disorder characterized by destruction of pancreatic β-cells, culminating in absolute insulin deficiency. Genetic factors contribute to the susceptibility of type 1 diabetes mellitus. The aim of the present study was to identify more susceptibility genes of type 1 diabetes mellitus. We carried out an initial gene-based genome-wide association study in a total of 4,075 type 1 diabetes mellitus cases and 2,604 controls by using the Gene-based Association Test using Extended Simes procedure. Furthermore, we carried out replication studies, differential expression analysis and functional annotation clustering analysis to support the significance of the identified susceptibility genes. We identified 452 genes associated with type 1 diabetes mellitus, even after adapting the genome-wide threshold for significance (P diabetes mellitus, which were ignored in single-nucleotide polymorphism-based association analysis and were not previously reported. We found that 53 genes have supportive evidence from replication studies and/or differential expression studies. In particular, seven genes including four non-human leukocyte antigen (HLA) genes (RASIP1, STRN4, BCAR1 and MYL2) are replicated in at least one independent population and also differentially expressed in peripheral blood mononuclear cells or monocytes. Furthermore, the associated genes tend to enrich in immune-related pathways or Gene Ontology project terms. The present results suggest the high power of gene-based association analysis in detecting disease-susceptibility genes. Our findings provide more insights into the genetic basis of type 1 diabetes mellitus.
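The gene-based test named in this abstract builds on the Simes procedure for combining SNP-level p-values into one gene-level p-value. The sketch below shows only the plain Simes combination, p_gene = min over j of (m * p_(j) / j) for ordered p-values p_(1) <= ... <= p_(m). As far as we understand, the extended procedure replaces the raw counts with effective numbers of independent SNPs estimated from linkage disequilibrium; that adjustment is not implemented here. The function name `simes_p` and the example p-values are hypothetical.

```python
def simes_p(pvalues):
    """Combine SNP-level p-values into one gene-level p-value (plain Simes).

    p_gene = min_j (m * p_(j) / j), with p_(1) <= ... <= p_(m).
    Assumption: no correction for correlation (LD) between SNPs.
    """
    if not pvalues:
        raise ValueError("need at least one p-value")
    ordered = sorted(pvalues)
    m = len(ordered)
    return min(m * p / rank for rank, p in enumerate(ordered, start=1))


# Hypothetical p-values for four SNPs mapped to one gene.
snp_p = [0.30, 0.001, 0.75, 0.02]
print(simes_p(snp_p))
```

A gene can reach a small combined p-value either through one strong SNP or through several moderately associated SNPs, which is one reason gene-based tests can pick up signals that single-SNP thresholds miss.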
Serial analysis of gene expression (SAGE) in rat liver regeneration

International Nuclear Information System (INIS)

Cimica, Velasco; Batusic, Danko; Haralanova-Ilieva, Borislava; Chen, Yonglong; Hollemann, Thomas; Pieler, Tomas; Ramadori, Giuliano

2007-01-01

We have applied serial analysis of gene expression (SAGE) to study the molecular mechanism of rat liver regeneration in the model of 70% partial hepatectomy (PH). We generated three SAGE libraries from a normal control liver (NL library: 52,343 tags), from a sham control operated liver (Sham library: 51,028 tags), and from a regenerating liver (PH library: 53,061 tags). By SAGE bioinformatics analysis we identified 40 induced genes and 20 repressed genes during liver regeneration. We verified the temporal expression of these genes by real time PCR during the regeneration process and we characterized 13 induced genes and 3 repressed genes. We found that connective tissue growth factor (CTGF) transcript and protein were induced very early, at 4 h after the PH operation, before hepatocyte proliferation is triggered. Our study suggests CTGF as a growth factor signaling mediator that could be involved directly in the mechanism of liver regeneration induction.
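Because the three libraries differ slightly in size, tag counts are usually compared after scaling to a common depth. The following is a minimal sketch of that normalization using the library sizes quoted in the abstract; the tag counts, the pseudocount and the function names are hypothetical, and the abstract does not state which statistical test the authors actually used.

```python
# Library sizes (total tags) as reported in the abstract.
LIBRARY_SIZES = {"NL": 52343, "Sham": 51028, "PH": 53061}


def tags_per_million(count, library):
    """Scale a raw tag count to tags per million for the given library."""
    return count * 1_000_000 / LIBRARY_SIZES[library]


def fold_change(count_a, lib_a, count_b, lib_b, pseudocount=1):
    """Ratio of normalized abundances (library a over library b).

    A pseudocount (an assumption of this sketch) avoids division by zero
    when a tag is absent from one library.
    """
    a = tags_per_million(count_a + pseudocount, lib_a)
    b = tags_per_million(count_b + pseudocount, lib_b)
    return a / b


# Hypothetical counts for one tag: 24 in the PH library, 3 in the Sham library.
print(round(fold_change(24, "PH", 3, "Sham"), 2))
```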
The invasive MED/Q Bemisia tabaci genome: a tale of gene loss and gene gain

Science.gov (United States)

Whiteflies are a group of invasive crop pests that impact global agriculture. An analysis was conducted to compare draft genomes of two whitefly strains, which demonstrated the relative conserved gene order, but a number of genes were either novel (added) or omitted (deleted) between genomes. This...
Genome analysis of Elusimicrobium minutum, the first cultivated representative of the Elusimicrobia phylum (formerly Termite Group 1)

Energy Technology Data Exchange (ETDEWEB)

Herlemann, D. P. R.; Geissinger, O.; Ikeda-Ohtsubo, W.; Kunin, V.; Sun, H.; Lapidus, A.; Hugenholtz, P.; Brune, A.

2009-02-01

The candidate phylum Termite group 1 (TG1) is regularly encountered in termite hindguts but is present also in many other habitats. Here we report the complete genome sequence (1.64 Mbp) of Elusimicrobium minutum strain Pei191{sup T}, the first cultured representative of the TG1 phylum. We reconstructed the metabolism of this strictly anaerobic bacterium isolated from a beetle larva gut and discuss the findings in light of physiological data. E. minutum has all genes required for uptake and fermentation of sugars via the Embden-Meyerhof pathway, including several hydrogenases, and an unusual peptide degradation pathway comprising transamination reactions and leading to the formation of alanine, which is excreted in substantial amounts. The presence of genes encoding lipopolysaccharide biosynthesis and the presence of a pathway for peptidoglycan formation are consistent with ultrastructural evidence of a Gram-negative cell envelope. Even though electron micrographs showed no cell appendages, the genome encodes many genes putatively involved in pilus assembly. We assigned some to a type II secretion system, but the function of 60 pilE-like genes remains unknown. Numerous genes with hypothetical functions, e.g., polyketide synthesis, non-ribosomal peptide synthesis, antibiotic transport, and oxygen stress protection, indicate the presence of hitherto undiscovered physiological traits. Comparative analysis of 22 concatenated single-copy marker genes corroborated the status of Elusimicrobia (formerly TG1) as a separate phylum in the bacterial domain, which was so far based only on 16S rRNA sequence analysis.
Identification of wild soybean (Glycine soja) TIFY family genes and their expression profiling analysis under bicarbonate stress.

Science.gov (United States)

Zhu, Dan; Bai, Xi; Luo, Xiao; Chen, Qin; Cai, Hua; Ji, Wei; Zhu, Yanming

2013-02-01

Wild soybean (Glycine soja L. G07256) exhibits a greater adaptability to soil bicarbonate stress than cultivated soybean, and recent discoveries show that TIFY family genes are involved in the response to several abiotic stresses. A genomic and transcriptomic analysis of all TIFY genes in G. soja, compared with G. max, will provide insight into the function of this gene family in plant bicarbonate stress response. This article identified and characterized 34 TIFY genes in G. soja. Sequence analyses indicated that most GsTIFY proteins had two conserved domains: TIFY and Jas. Phylogenetic analyses suggested that these GsTIFY genes could be classified into two groups. A clustering analysis of all GsTIFY transcript expression profiles from bicarbonate stress treated G. soja showed that there were five different transcript patterns in leaves and six different transcript patterns in roots when the GsTIFY family responds to bicarbonate stress. Moreover, the expression level changes of all TIFY genes in cultivated soybean, treated with bicarbonate stress, were also verified. The expression comparison analysis of TIFYs between wild and cultivated soybeans confirmed that, different from the cultivated soybean, GsTIFY (10a, 10b, 10c, 10d, 10e, 10f, 11a, and 11b) were dramatically up-regulated at the early stage of stress, while GsTIFY 1c and 2b were significantly up-regulated at the later period of stress. The frequently stress responsive and diverse expression profiles of the GsTIFY gene family suggests that this family may play important roles in plant environmental stress responses and adaptation.
Phase analysis of circadian-related genes in two tissues

Directory of Open Access Journals (Sweden)

Li Leping

2006-02-01

Full Text Available Abstract Background Recent circadian clock studies using gene expression microarrays in two different tissues of mouse have revealed that not all circadian-related genes are synchronized in phase or peak expression times across tissues in vivo. Instead, some circadian-related genes may be delayed by 4–8 hrs in peak expression in one tissue relative to the other. These interesting biological observations prompt a statistical question regarding how to distinguish the synchronized genes from genes that are systematically lagged in phase/peak expression time across two tissues. Results We propose a set of techniques from circular statistics to analyze phase angles of circadian-related genes in two tissues. We first estimate the phases of a cycling gene separately in each tissue, which are then used to estimate the paired angular difference of the phase angles of the gene in the two tissues. These differences are modeled as a mixture of two von Mises distributions, which enables us to cluster genes into two groups: one group having synchronized transcripts with the same phase in the two tissues, the other containing transcripts with a discrepancy in phase between the two tissues. For each cluster of genes we assess the association of phases across the tissue types using circular-circular regression. We also develop a bootstrap methodology based on a circular-circular regression model to evaluate the improvement in fit provided by allowing two components versus a one-component von Mises model. Conclusion We applied our proposed methodologies to the circadian-related genes common to heart and liver tissues in Storch et al. 2, and found that an estimated 80% of circadian-related transcripts common to heart and liver tissues were synchronized in phase, and the other 20% of transcripts were lagged about 8 hours in liver relative to heart. The bootstrap p-value for being one cluster is 0.063, which suggests the possibility of two clusters. Our methodologies can
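The paired angular difference that feeds the mixture model can be sketched as follows. This shows only the wrapping step on a 24-hour circle, not the von Mises mixture fit, the circular-circular regression or the bootstrap; the function names and the example peak times are hypothetical.

```python
import math


def phase_to_angle(hours, period=24.0):
    """Map a peak time in hours onto the circle [0, 2*pi)."""
    return 2 * math.pi * (hours % period) / period


def angular_difference(a, b):
    """Signed difference a - b in radians, wrapped into (-pi, pi]."""
    d = (a - b) % (2 * math.pi)
    if d > math.pi:
        d -= 2 * math.pi
    return d


def lag_hours(peak_a, peak_b, period=24.0):
    """Lag of tissue a relative to tissue b, in hours (positive = a is later)."""
    d = angular_difference(phase_to_angle(peak_a, period),
                           phase_to_angle(peak_b, period))
    return d * period / (2 * math.pi)


# Hypothetical gene peaking at 22 h in heart and 2 h (next cycle) in liver:
# a naive subtraction gives -20 h, the circular difference gives a 4 h lag.
print(round(lag_hours(2.0, 22.0), 6))
```

Working on the circle matters because peak times near the start and end of the cycle are close to each other even though their raw difference in hours is large.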
Computer analysis of protein functional sites projection on exon structure of genes in Metazoa.

Science.gov (United States)

Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A

2015-01-01

Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. Given these facts, analysis of the general properties of the structural organization of functional sites, both at the protein level and at the level of the exon-intron structure of the coding gene, remains a relevant problem. One approach to this analysis is the projection of amino acid residue positions of the functional sites, along with the exon boundaries, onto the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were located in the same exon or in neighboring exons. We observed a tendency for exons that code functional sites to cluster; such clusters could be considered a unit of protein evolution. We studied the structural characteristics of exon boundaries for exons that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residues of functional sites, which may be evidence of the existence of evolutionary limitations to exon shuffling. These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and
Analysis of PSPHL as a Candidate Gene Influencing the Racial Disparity in Endometrial Cancer

Energy Technology Data Exchange (ETDEWEB)

Allard, Jay E. [Walter Reed Army Medical Center, Washington, DC (United States); Chandramouli, Gadisetti V. R. [Department of Obstetrics, Gynecology and Reproductive Biology, Michigan State University College of Human Medicine, Grand Rapids, MI (United States); Stagliano, Katherine [Curtis and Elizabeth Anderson Cancer Institute at Memorial Health University Medical Center, Savannah, GA (United States); Hood, Brian L. [Women’s Health Integrated Research Center at Inova Health System, Annandale, VA (United States); Litzi, Tracy [Walter Reed Army Medical Center, Washington, DC (United States); Women’s Health Integrated Research Center at Inova Health System, Annandale, VA (United States); Shoji, Yutaka [Department of Obstetrics, Gynecology and Reproductive Biology, Michigan State University College of Human Medicine, Grand Rapids, MI (United States); Curtis and Elizabeth Anderson Cancer Institute at Memorial Health University Medical Center, Savannah, GA (United States); Boyd, Jeff [Curtis and Elizabeth Anderson Cancer Institute at Memorial Health University Medical Center, Savannah, GA (United States); Fox Chase Cancer Center, Philadelphia, PA (United States); Berchuck, Andrew [Division of Gynecologic Oncology, Duke University, Durham, NC (United States); Conrads, Thomas P. [Curtis and Elizabeth Anderson Cancer Institute at Memorial Health University Medical Center, Savannah, GA (United States); Maxwell, G. Larry [Walter Reed Army Medical Center, Washington, DC (United States); Women’s Health Integrated Research Center at Inova Health System, Annandale, VA (United States); Risinger, John I., E-mail: john.risinger@hc.msu.edu [Department of Obstetrics, Gynecology and Reproductive Biology, Michigan State University College of Human Medicine, Grand Rapids, MI (United States); Curtis and Elizabeth Anderson Cancer Institute at Memorial Health University Medical Center, Savannah, GA (United States)

2012-07-04

Endometrial cancer is the most commonly diagnosed gynecologic malignancy in the United States. A well recognized disparity by race in both incidence and survival outcome exists for this cancer. Specifically Caucasians are about two times more likely to develop endometrial cancer than are African-Americans. However, African-American women are more likely to die from this disease than are Caucasians. The basis for this disparity remains unknown. Previous studies have identified differences in the types and frequencies of gene mutations among endometrial cancers from Caucasians and African-Americans suggesting that the tumors from these two groups might have differing underlying genetic defects. We performed a gene expression microarray study in an effort to identify differentially expressed transcripts between African-American and Caucasian women’s endometrial cancers. Our gene expression screen identified a list of potential biomarkers that are differentially expressed between these two groups of cancers. Of these we identified a poorly characterized transcript with a region of homology to phospho serine phosphatase (PSPH) and designated phospho serine phosphatase like (PSPHL) as the most differentially over-expressed gene in cancers from African-Americans. We further clarified the nature of expressed transcripts. Northern blot analysis confirmed the message was limited to a transcript of under 1 kB. Sequence analysis of transcripts confirmed two alternate open reading frame (ORF) isoforms due to alternative splicing events. Splice specific primer sets confirmed both isoforms were differentially expressed in tissues from Caucasians and African-Americans. We further examined the expression in other tissues from women to include normal endometrium, normal and malignant ovary. In all cases PSPHL expression was more often present in tissues from African-Americans than Caucasians. Our data confirm the African-American based expression of the PSPHL transcript in
Identification of cytokinin-responsive genes using microarray meta-analysis and RNA-Seq in Arabidopsis.

Science.gov (United States)

Bhargava, Apurva; Clabaugh, Ivory; To, Jenn P; Maxwell, Bridey B; Chiang, Yi-Hsuan; Schaller, G Eric; Loraine, Ann; Kieber, Joseph J

2013-05-01

Cytokinins are N(6)-substituted adenine derivatives that play diverse roles in plant growth and development. We sought to define a robust set of genes regulated by cytokinin as well as to query the response of genes not represented on microarrays. To this end, we performed a meta-analysis of microarray data from a variety of cytokinin-treated samples and used RNA-seq to examine cytokinin-regulated gene expression in Arabidopsis (Arabidopsis thaliana). Microarray meta-analysis using 13 microarray experiments combined with empirically defined filtering criteria identified a set of 226 genes differentially regulated by cytokinin, a subset of which has previously been validated by other methods. RNA-seq validated about 73% of the up-regulated genes identified by this meta-analysis. In silico promoter analysis indicated an overrepresentation of type-B Arabidopsis response regulator binding elements, consistent with the role of type-B Arabidopsis response regulators as primary mediators of cytokinin-responsive gene expression. RNA-seq analysis identified 73 cytokinin-regulated genes that were not represented on the ATH1 microarray. Representative genes were verified using quantitative reverse transcription-polymerase chain reaction and NanoString analysis. Analysis of the genes identified reveals a substantial effect of cytokinin on genes encoding proteins involved in secondary metabolism, particularly those acting in flavonoid and phenylpropanoid biosynthesis, as well as in the regulation of redox state of the cell, particularly a set of glutaredoxin genes. Novel splicing events were found in members of some gene families that are known to play a role in cytokinin signaling or metabolism. The genes identified in this analysis represent a robust set of cytokinin-responsive genes that are useful in the analysis of cytokinin function in plants.
Microarray Data Analysis of Space Grown Arabidopsis Leaves for Genes Important in Vascular Patterning. Analysis of Space Grown Arabidopsis with Microarray Data from GeneLab: Identification of Genes Important in Vascular Patterning

Science.gov (United States)

Weitzel, A. J.; Wyatt, S. E.; Parsons-Wingerter, P.

2016-01-01

Venation patterning in leaves is a major determinant of photosynthesis efficiency because of its dependency on vascular transport of photo-assimilates, water, and minerals. Arabidopsis thaliana grown in microgravity show delayed growth and leaf maturation. Gene expression data from the roots, hypocotyl, and leaves of A. thaliana grown during spaceflight vs. ground control analyzed by Affymetrix microarray are available through NASA's GeneLab (GLDS-7). We analyzed the data for differential expression of genes in leaves resulting from the effects of spaceflight on vascular patterning. Two genes were found by preliminary analysis to be up-regulated during spaceflight that may be related to vascular formation. The genes are responsible for coding an ARGOS (Auxin-Regulated Gene Involved in Organ Size)-like protein (potentially affecting cell elongation in the leaves), and an F-box/kelch-repeat protein (possibly contributing to protoxylem specification). Further analysis that will focus on raw data quality assessment and a moderated t-test may further confirm up-regulation of the two genes and/or identify other gene candidates. Plants defective in these genes will then be assessed for phenotype by the mapping and quantification of leaf vascular patterning by NASA's VESsel GENeration (VESGEN) software to model specific vascular differences of plants grown in spaceflight.
The Reconstruction and Analysis of Gene Regulatory Networks.

Science.gov (United States)

Zheng, Guangyong; Huang, Tao

2018-01-01

In the post-genomic era, an important task is to explore the function of individual biological molecules (i.e., gene, noncoding RNA, protein, metabolite) and their organization in living cells. To this end, gene regulatory networks (GRNs) are constructed to show the relationships between biological molecules: the vertices of the network denote biological molecules and the edges represent connections between them (Strogatz, Nature 410:268-276, 2001; Bray, Science 301:1864-1865, 2003). By interpreting GRNs, biologists can understand not only the function of biological molecules but also the organization of the components of living cells, since a gene regulatory network is a comprehensive physiological map of living cells and reflects the influence of genetic and epigenetic factors (Strogatz, Nature 410:268-276, 2001; Bray, Science 301:1864-1865, 2003). In this paper, we review inference methods for GRN reconstruction and approaches for analyzing network structure. Because the network method is a powerful tool for studying complex diseases and biological processes, its applications in pathway analysis and disease gene identification are also introduced.
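The vertex-and-edge view of a GRN described in this abstract can be sketched as a small directed graph. This is an illustrative data structure only, not one of the inference methods the review covers; the class name `RegulatoryNetwork`, the edge signs and the molecule names (TF1, TF2, geneA, geneB) are all hypothetical.

```python
from collections import defaultdict


class RegulatoryNetwork:
    """Directed graph: vertices are molecules, edges are regulatory links."""

    def __init__(self):
        self._edges = defaultdict(dict)  # regulator -> {target: "+" or "-"}
        self._nodes = set()

    def add_edge(self, regulator, target, sign="+"):
        """Record that `regulator` activates (+) or represses (-) `target`."""
        if sign not in ("+", "-"):
            raise ValueError("sign must be '+' (activation) or '-' (repression)")
        self._edges[regulator][target] = sign
        self._nodes.update((regulator, target))

    def targets(self, regulator):
        """Return a copy of {target: sign} for one regulator."""
        return dict(self._edges.get(regulator, {}))

    def out_degree(self):
        """Number of direct targets per node, including nodes with none."""
        return {n: len(self._edges.get(n, {})) for n in sorted(self._nodes)}


# Hypothetical four-node network.
net = RegulatoryNetwork()
net.add_edge("TF1", "geneA", "+")
net.add_edge("TF1", "geneB", "+")
net.add_edge("TF2", "geneB", "-")
net.add_edge("geneA", "TF2", "+")

print(net.out_degree())
print(net.targets("TF1"))
```

Simple structural summaries such as out-degree are a common first step in network analysis, since nodes with many targets are candidate hub regulators.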
Cytogenetic mapping of the Muller F element genes in Drosophila willistoni group.

Science.gov (United States)

Pita, Sebastián; Panzera, Yanina; Lúcia da Silva Valente, Vera; de Melo, Zilpa das Graças Silva; Garcia, Carolina; Garcia, Ana Cristina Lauer; Montes, Martín Alejandro; Rohde, Claudia

2014-10-01

Comparative genomics in Drosophila began in 1940, when Muller stated that the ancestral haploid karyotype of this genus is constituted by five acrocentric chromosomes and one dot chromosome, named A to F elements. In some species of the willistoni group such as Drosophila willistoni and D. insularis, the F element, instead of a dot chromosome, has been incorporated into the E element, forming chromosome III (E + F fusion). The aim of this study was to investigate the scope of the E + F fusion in the willistoni group, evaluating six other species. Fluorescent in situ hybridization was used to locate two previously studied genes of the F element, cubitus interruptus (ci) and eyeless (ey), in species of the willistoni and bocainensis subgroups. Moreover, polytene chromosome photomaps corresponding to the F element (basal portion of chromosome III) were constructed for each species studied. In D. willistoni, D. paulistorum and D. equinoxialis, the ci gene was located in subsection 78B and the ey gene in 78C. In D. tropicalis, ci was located in subsection 76B and ey in 76C. In species of the bocainensis subgroup, ci and ey were localized, respectively, at subsections 76B and 76C in D. nebulosa and D. capricorni, and 76A and 76C in D. fumipennis. Despite the differences in the subsection numbers, all species showed the same position for ci and ey. The results confirm the synteny of E + F fusion in willistoni and bocainensis subgroups, and allow estimating the occurrence of this event at 15 Mya, at least.

Global gene expression analysis of the zoonotic parasite Trichinella spiralis revealed novel genes in host parasite interaction.

Directory of Open Access Journals (Sweden)

Xiaolei Liu

Full Text Available BACKGROUND: Trichinellosis is a typical food-borne zoonotic disease which is epidemic worldwide and the nematode Trichinella spiralis is the main pathogen. The life cycle of T. spiralis contains three developmental stages, i.e. adult worms, newborn larva (newborn L1 larva) and muscular larva (infective L1 larva). Stage-specific gene expression in the parasites has been investigated with various immunological and cDNA cloning approaches, whereas the genome-wide transcriptome and expression features of the parasite have been largely unknown. The availability of the genome sequence information of T. spiralis has made it possible to deeply dissect parasite biology in association with global gene expression and pathogenesis. METHODOLOGY AND PRINCIPAL FINDINGS: In this study, we analyzed the global gene expression patterns in the three developmental stages of T. spiralis using digital gene expression (DGE) analysis. Almost 15 million sequence tags were generated with the Illumina RNA-seq technology, producing expression data for more than 9,000 genes, covering 65% of the genome. The transcriptome analysis revealed thousands of differentially expressed genes within the genome, and importantly, a panel of genes encoding functional proteins associated with parasite invasion and immuno-modulation were identified. More than 45% of the genes were found to be transcribed from both strands, indicating the importance of RNA-mediated gene regulation in the development of the parasite. Further, based on gene ontological analysis, over 3000 genes were functionally categorized and biological pathways in the three life cycle stages were elucidated. CONCLUSIONS AND SIGNIFICANCE: The global transcriptome of T. spiralis in three developmental stages has been profiled, and most gene activity in the genome was found to be developmentally regulated. Many metabolic and biological pathways have been revealed. The findings of the differential expression of several protein
Genome-wide analysis of immune system genes by EST profiling

Science.gov (United States)

Giallourakis, Cosmas; Benita, Yair; Molinie, Benoit; Cao, Zhifang; Despo, Orion; Pratt, Henry E.; Zukerberg, Lawrence R.; Daly, Mark J.; Rioux, John D.; Xavier, Ramnik J.

2013-01-01

Profiling studies of mRNA and miRNA, particularly microarray-based studies, have been extensively used to create compendia of genes that are preferentially expressed in the immune system. In some instances, functional studies have been subsequently pursued. Recent efforts such as ENCODE have demonstrated the benefit of coupling RNA-Seq analysis with information from expressed sequence tags (ESTs) for transcriptomic analysis. However, the full characterization and identification of transcripts that function as modulators of human immune responses remains incomplete. In this study, we demonstrate that an integrated analysis of human ESTs provides a robust platform to identify the immune transcriptome. Beyond recovering a reference set of immune-enriched genes and providing large-scale cross-validation of previous microarray studies, we discovered hundreds of novel genes preferentially expressed in the immune system, including non-coding RNAs. As a result, we have established the Immunogene database, representing an integrated EST “road map” of gene expression in human immune cells, which can be used to further investigate the function of coding and non-coding genes in the immune system. Using this approach, we have uncovered a unique metabolic gene signature of human macrophages and identified PRDM15 as a novel overexpressed gene in human lymphomas. Thus we demonstrate the utility of EST profiling as a basis for further deconstruction of physiologic and pathologic immune processes. PMID:23616578
Spectral map-analysis: a method to analyze gene expression data

OpenAIRE

Bijnens, Luc J.M.; Lewi, Paul J.; Göhlmann, Hinrich W.; Molenberghs, Geert; Wouters, Luc

2004-01-01

bioinformatics; biplot; correspondence factor analysis; data mining; data visualization; gene expression data; microarray data; multivariate exploratory data analysis; principal component analysis; Spectral map analysis
Candidate chemosensory genes identified in the endoparasitoid Meteorus pulchricornis (Hymenoptera: Braconidae) by antennal transcriptome analysis.

Science.gov (United States)

Sheng, Sheng; Liao, Cheng-Wu; Zheng, Yu; Zhou, Yu; Xu, Yan; Song, Wen-Miao; He, Peng; Zhang, Jian; Wu, Fu-An

2017-06-01

Meteorus pulchricornis is an endoparasitoid wasp which attacks the larvae of various lepidopteran pests. We present the first antennal transcriptome dataset for M. pulchricornis. A total of 48,845,072 clean reads were obtained and 34,967 unigenes were assembled. Of these, 15,458 unigenes showed a significant similarity (E-value < 10^-5) to known proteins in the NCBI non-redundant protein database. Gene ontology (GO) and cluster of orthologous groups (COG) analyses were used to classify the functions of M. pulchricornis antennae genes. We identified 16 putative odorant-binding protein (OBP) genes, eight chemosensory protein (CSP) genes, 99 olfactory receptor (OR) genes, 19 ionotropic receptor (IR) genes and one sensory neuron membrane protein (SNMP) gene. BLASTx best hit results and phylogenetic analysis both indicated that these chemosensory genes were most closely related to those found in other hymenopteran species. Real-time quantitative PCR assays showed that 14 MpulOBP genes were antennae-specific. Of these, MpulOBP6, MpulOBP9, MpulOBP10, MpulOBP12, MpulOBP15 and MpulOBP16 were found to have greater expression in the antennae than in other body parts, while MpulOBP2 and MpulOBP3 were expressed predominately in the legs and abdomens, respectively. These results might provide a foundation for future studies of olfactory genes and chemoreception in M. pulchricornis. Copyright © 2017 Elsevier Inc. All rights reserved.
A gene-based linkage map for Bicyclus anynana butterflies allows for a comprehensive analysis of synteny with the lepidopteran reference genome.

Directory of Open Access Journals (Sweden)

Patrícia Beldade

2009-02-01

Lepidopterans (butterflies and moths) are a rich and diverse order of insects, which, despite their economic impact and unusual biological properties, are relatively underrepresented in terms of genomic resources. The genome of the silkworm Bombyx mori has been fully sequenced, but comparative lepidopteran genomics has been hampered by the scarcity of information for other species. This is especially striking for butterflies, even though they have diverse and derived phenotypes (such as color vision and wing color patterns) and are considered prime models for the evolutionary and developmental analysis of ecologically relevant, complex traits. We focus on Bicyclus anynana butterflies, a laboratory system for studying the diversification of novelties and serially repeated traits. With a panel of 12 small families and a biphasic mapping approach, we first assigned 508 expressed genes to segregation groups and then ordered 297 of them within individual linkage groups. We also coarsely mapped seven color pattern loci. This is the richest gene-based map available for any butterfly species and allowed for a broad-coverage analysis of synteny with the lepidopteran reference genome. Based on 462 pairs of mapped orthologous markers in Bi. anynana and Bo. mori, we observed strong conservation of gene assignment to chromosomes, but also evidence for numerous large- and small-scale chromosomal rearrangements. With gene collections growing for a variety of target organisms, the ability to place those genes in their proper genomic context is paramount. Methods to map expressed genes and to compare maps with relevant model systems are crucial to extend genomic-level analysis outside classical model species. Maps with gene-based markers are useful for comparative genomics and to resolve mapped genomic regions to a tractable number of candidate genes, especially if there is synteny with related model species. This is discussed in relation to the identification of
Comprehensive analysis of the flowering genes in Chinese cabbage and examination of evolutionary pattern of CO-like genes in plant kingdom

Science.gov (United States)

Song, Xiaoming; Duan, Weike; Huang, Zhinan; Liu, Gaofeng; Wu, Peng; Liu, Tongkun; Li, Ying; Hou, Xilin

2015-09-01

In plants, flowering is the most important transition from vegetative to reproductive growth. The flowering patterns of monocots and eudicots are distinctly different, but few studies have described the evolutionary patterns of the flowering genes in them. In this study, we analysed the evolutionary pattern, duplication and expression level of these genes. The main results were as follows: (i) characterization of flowering genes in monocots and eudicots, including the identification of family-specific, orthologous and collinear genes; (ii) full characterization of CONSTANS-like genes in Brassica rapa (BraCOL genes), the key flowering genes; (iii) exploration of the evolution of COL genes in plant kingdom and construction of the evolutionary pattern of COL genes; (iv) comparative analysis of CO and FT genes between Brassicaceae and Grass, which identified several family-specific amino acids, and revealed that CO and FT protein structures were similar in B. rapa and Arabidopsis but different in rice; and (v) expression analysis of photoperiod pathway-related genes in B. rapa under different photoperiod treatments by RT-qPCR. This analysis will provide resources for understanding the flowering mechanisms and evolutionary pattern of COL genes. In addition, this genome-wide comparative study of COL genes may also provide clues for evolution of other flowering genes.
Gene function analysis by artificial microRNAs in Physcomitrella patens.

KAUST Repository

Khraiwesh, Basel

2011-01-01

MicroRNAs (miRNAs) are ~21 nt long small RNAs transcribed from endogenous MIR genes which form precursor RNAs with a characteristic hairpin structure. miRNAs control the expression of cognate target genes by binding to reverse complementary sequences resulting in cleavage or translational inhibition of the target RNA. Artificial miRNAs (amiRNAs) can be generated by exchanging the miRNA/miRNA* sequence of endogenous MIR precursor genes, while maintaining the general pattern of matches and mismatches in the foldback. Thus, for functional gene analysis amiRNAs can be designed to target any gene of interest. During the last decade the moss Physcomitrella patens emerged as a model plant for functional gene analysis based on its unique ability to integrate DNA into the nuclear genome by homologous recombination which allows for the generation of targeted gene knockout mutants. In addition to this, we developed a protocol to express amiRNAs in P. patens that has particular advantages over the generation of knockout mutants and might be used to speed up reverse genetics approaches in this model species.
Genetic polymorphisms of xeroderma pigmentosum group D gene Asp312Asn and Lys751Gln and susceptibility to prostate cancer: a systematic review and meta-analysis.

Science.gov (United States)

Ma, Qingtong; Qi, Can; Tie, Chong; Guo, Zhanjun

2013-11-10

Many studies have reported on the association of xeroderma pigmentosum group D (XPD) with prostate cancer risk, but the results remain controversial. To derive a more precise estimate of the relationship, a meta-analysis was performed. Odds ratios (ORs) with 95% confidence intervals (CIs) were estimated to assess the association between XPD Asp312Asn and Lys751Gln polymorphisms and prostate cancer risk. A total of 8 studies including 2620 cases and 3225 controls described Asp312Asn genotypes, and 10 articles involving 3230 cases and 3582 controls described Lys751Gln genotypes; all were included in this meta-analysis. When all the eligible studies were pooled, a significant association between prostate cancer risk and the XPD Asp312Asn polymorphism was found. For the Asp312Asn polymorphism, in the analysis stratified by ethnicity and source of controls, an association with prostate cancer risk was observed in co-dominant, dominant and recessive models, while no evidence of any association of the XPD Lys751Gln polymorphism with prostate cancer was found in the overall or subgroup analyses. Our meta-analysis supports that the XPD Asp312Asn polymorphism contributes to the risk of prostate cancer based on currently available evidence. However, a study with a larger sample size is needed to further evaluate gene-environment interaction on XPD Asp312Asn and Lys751Gln polymorphisms and prostate cancer risk. © 2013.
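As a rough illustration of the odds ratio and 95% confidence interval calculation mentioned in the record above, the following Python sketch computes an OR from a single 2x2 genotype table with the Woolf (log) method and pools several studies with fixed-effect inverse-variance weighting. The function names and the counts in the usage note are hypothetical, and the pooling model is an assumption; the abstract does not state which model the authors used.

```python
import math

def odds_ratio_ci(case_exp, case_unexp, ctrl_exp, ctrl_unexp, z=1.96):
    """Odds ratio and confidence interval (Woolf log method) for one 2x2 table.

    All four counts must be positive. z=1.96 gives an approximate 95% interval.
    """
    or_ = (case_exp * ctrl_unexp) / (case_unexp * ctrl_exp)
    # Standard error of log(OR)
    se = math.sqrt(1 / case_exp + 1 / case_unexp + 1 / ctrl_exp + 1 / ctrl_unexp)
    lower = math.exp(math.log(or_) - z * se)
    upper = math.exp(math.log(or_) + z * se)
    return or_, lower, upper

def pooled_odds_ratio(tables, z=1.96):
    """Fixed-effect (inverse-variance) pooled OR over several 2x2 tables.

    Assumption: a fixed-effect model; a random-effects model would differ.
    """
    num = 0.0
    den = 0.0
    for a, b, c, d in tables:
        log_or = math.log((a * d) / (b * c))
        weight = 1.0 / (1 / a + 1 / b + 1 / c + 1 / d)  # 1 / variance of log(OR)
        num += weight * log_or
        den += weight
    pooled_log = num / den
    se = math.sqrt(1.0 / den)
    return (math.exp(pooled_log),
            math.exp(pooled_log - z * se),
            math.exp(pooled_log + z * se))
```

For example, with made-up counts of 20 variant carriers and 80 non-carriers among cases, and 10 carriers and 90 non-carriers among controls, `odds_ratio_ci(20, 80, 10, 90)` returns an odds ratio of 2.25 together with its interval bounds.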
Screening Key Genes Associated with the Development and Progression of Non-small Cell Lung Cancer Based on Gene-enrichment Analysis and Meta-analysis

Directory of Open Access Journals (Sweden)

Wenwu HE

2012-07-01

Background and objective: Non-small cell lung cancer (NSCLC) is one of the most common malignant tumors; however, its causes are still not completely understood. This study was designed to screen the key genes and pathways related to NSCLC occurrence and development and to establish the scientific foundation for the genetic mechanisms and targeted therapy of NSCLC. Methods: Both gene set-enrichment analysis (GSEA) and meta-analysis (meta) were used to screen the critical pathways and genes that might be correlated with the development and progression of lung cancer at the transcription level. Results: Using the GSEA and meta methods, focal adhesion and regulation of actin cytoskeleton were determined to be the more prominent overlapping significant pathways. In the focal adhesion pathway, 31 genes were statistically significant (P<0.05), whereas in the regulation of actin cytoskeleton pathway, 32 genes were statistically significant (P<0.05). Conclusion: The focal adhesion and the regulation of actin cytoskeleton pathways might play important roles in the occurrence and development of NSCLC. Further studies are needed to determine the biological function of the positive genes.
StreptoBase: An Oral Streptococcus mitis Group Genomic Resource and Analysis Platform.

Directory of Open Access Journals (Sweden)

Wenning Zheng

The oral streptococci are spherical Gram-positive bacteria categorized under the phylum Firmicutes which are among the most common causative agents of bacterial infective endocarditis (IE) and are also important agents in septicaemia in neutropenic patients. The Streptococcus mitis group is comprised of 13 species including some of the most common human oral colonizers such as S. mitis, S. oralis, S. sanguinis and S. gordonii as well as species such as S. tigurinus, S. oligofermentans and S. australis that have only recently been classified and are poorly understood at present. We present StreptoBase, which provides a specialized free resource focusing on the genomic analyses of oral species from the mitis group. It currently hosts 104 S. mitis group genomes including 27 novel mitis group strains that we sequenced using the high throughput Illumina HiSeq technology platform, and provides a comprehensive set of genome sequences for analyses, particularly comparative analyses and visualization of both cross-species and cross-strain characteristics of S. mitis group bacteria. StreptoBase incorporates sophisticated in-house designed bioinformatics web tools such as Pairwise Genome Comparison (PGC) tool and Pathogenomic Profiling Tool (PathoProT), which facilitate comparative pathogenomics analysis of Streptococcus strains. Examples are provided to demonstrate how StreptoBase can be employed to compare genome structure of different S. mitis group bacteria and putative virulence genes profile across multiple streptococcal strains. In conclusion, StreptoBase offers access to a range of streptococci genomic resources as well as analysis tools and will be an invaluable platform to accelerate research in streptococci. Database URL: http://streptococcus.um.edu.my.
StreptoBase: An Oral Streptococcus mitis Group Genomic Resource and Analysis Platform.

Science.gov (United States)

Zheng, Wenning; Tan, Tze King; Paterson, Ian C; Mutha, Naresh V R; Siow, Cheuk Chuen; Tan, Shi Yang; Old, Lesley A; Jakubovics, Nicholas S; Choo, Siew Woh

2016-01-01

The oral streptococci are spherical Gram-positive bacteria categorized under the phylum Firmicutes which are among the most common causative agents of bacterial infective endocarditis (IE) and are also important agents in septicaemia in neutropenic patients. The Streptococcus mitis group is comprised of 13 species including some of the most common human oral colonizers such as S. mitis, S. oralis, S. sanguinis and S. gordonii as well as species such as S. tigurinus, S. oligofermentans and S. australis that have only recently been classified and are poorly understood at present. We present StreptoBase, which provides a specialized free resource focusing on the genomic analyses of oral species from the mitis group. It currently hosts 104 S. mitis group genomes including 27 novel mitis group strains that we sequenced using the high throughput Illumina HiSeq technology platform, and provides a comprehensive set of genome sequences for analyses, particularly comparative analyses and visualization of both cross-species and cross-strain characteristics of S. mitis group bacteria. StreptoBase incorporates sophisticated in-house designed bioinformatics web tools such as Pairwise Genome Comparison (PGC) tool and Pathogenomic Profiling Tool (PathoProT), which facilitate comparative pathogenomics analysis of Streptococcus strains. Examples are provided to demonstrate how StreptoBase can be employed to compare genome structure of different S. mitis group bacteria and putative virulence genes profile across multiple streptococcal strains. In conclusion, StreptoBase offers access to a range of streptococci genomic resources as well as analysis tools and will be an invaluable platform to accelerate research in streptococci. Database URL: http://streptococcus.um.edu.my.
Gene set analysis of the EADGENE chicken data-set

DEFF Research Database (Denmark)

Skarman, Axel; Jiang, Li; Hornshøj, Henrik

2009-01-01

Abstract Background: Gene set analysis is considered to be a way of improving our biological interpretation of the observed expression patterns. This paper describes different methods applied to analyse expression data from a chicken DNA microarray dataset. Results: Applying different gene set...... analyses to the chicken expression data led to different ranking of the Gene Ontology terms tested. A method for prediction of possible annotations was applied. Conclusion: Biological interpretation based on gene set analyses dependent on the statistical method used. Methods for predicting the possible...
Role of fruA and csgA genes in gene expression during development of Myxococcus xanthus. Analysis by two-dimensional gel electrophoresis.

Science.gov (United States)

Horiuchi, Takayuki; Taoka, Masato; Isobe, Toshiaki; Komano, Teruya; Inouye, Sumiko

2002-07-26

Two genes, fruA and csgA, encoding a putative transcription factor and C-factor, respectively, are essential for fruiting body formation of Myxococcus xanthus. To investigate the role of fruA and csgA genes in developmental gene expression, developing cells as well as vegetative cells of M. xanthus wild-type, fruA::Tc, and csgA731 strains were pulse-labeled with [(35)S]methionine, and the whole cell proteins were analyzed using two-dimensional immobilized pH gradient/SDS-PAGE. Differences in protein synthesis patterns among more than 700 protein spots were detected during development of the three strains. Fourteen proteins showing distinctly different expression patterns in mutant cells were analyzed in more detail. Five of the 14 proteins were identified as elongation factor Tu (EF-Tu), Dru, DofA, FruA, and protein S by immunoblot analysis and mass spectroscopy. A gene encoding DofA was cloned and sequenced. Although both fruA and csgA genes regulate early development of M. xanthus, they were found to differently regulate expression of several developmental genes. The production of six proteins, including DofA and protein S, was dependent on fruA, whereas the production of two proteins was dependent on csgA, and one protein was dependent on both fruA and csgA. To explain the present findings, a new model was presented in which different levels of FruA phosphorylation may distinctively regulate the expression of two groups of developmental genes.
GECKO: a complete large-scale gene expression analysis platform

Directory of Open Access Journals (Sweden)

Heuer Michael

2004-12-01

Background: Gecko (Gene Expression: Computation and Knowledge Organization) is a complete, high-capacity centralized gene expression analysis system, developed in response to the needs of a distributed user community. Results: Based on a client-server architecture, with a centralized repository of typically many tens of thousands of Affymetrix scans, Gecko includes automatic processing pipelines for uploading data from remote sites, a data base, a computational engine implementing ~ 50 different analysis tools, and a client application. Among available analysis tools are clustering methods, principal component analysis, supervised classification including feature selection and cross-validation, multi-factorial ANOVA, statistical contrast calculations, and various post-processing tools for extracting data at given error rates or significance levels. On account of its open architecture, Gecko also allows for the integration of new algorithms. The Gecko framework is very general: non-Affymetrix and non-gene expression data can be analyzed as well. A unique feature of the Gecko architecture is the concept of the Analysis Tree (actually, a directed acyclic graph), in which all successive results in ongoing analyses are saved. This approach has proven invaluable in allowing a large (~ 100 users) and distributed community to share results, and to repeatedly return over a span of years to older and potentially very complex analyses of gene expression data. Conclusions: The Gecko system is being made publicly available as free software http://sourceforge.net/projects/geckoe. In totality or in parts, the Gecko framework should prove useful to users and system developers with a broad range of analysis needs.
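The "Analysis Tree" idea in the record above, where every intermediate result is kept as a node in a directed acyclic graph, can be sketched in a few lines of Python. This is only an illustration of the general data structure: the class `AnalysisNode`, the function `lineage`, and the step names are hypothetical and are not taken from the Gecko software.

```python
from dataclasses import dataclass, field

@dataclass
class AnalysisNode:
    """One saved analysis result; parents are the results it was derived from."""
    name: str
    parents: list = field(default_factory=list)
    result: object = None  # placeholder for the stored output of this step

def lineage(node):
    """Return the names of all steps needed to reproduce `node`.

    Depth-first traversal of the DAG; every parent is listed before its
    children, and a step shared by several branches appears only once.
    """
    seen = set()
    order = []

    def visit(current):
        if id(current) in seen:
            return
        seen.add(id(current))
        for parent in current.parents:
            visit(parent)
        order.append(current.name)

    visit(node)
    return order

# Hypothetical analysis history: two branches share one normalization step
scans = AnalysisNode("scans")
normalized = AnalysisNode("normalized", parents=[scans])
clustering = AnalysisNode("clustering", parents=[normalized])
anova = AnalysisNode("anova", parents=[normalized])
contrast = AnalysisNode("contrast", parents=[clustering, anova])
```

Here `lineage(contrast)` yields `['scans', 'normalized', 'clustering', 'anova', 'contrast']`. Because a node may have several parents, the structure is a graph rather than a strict tree, which matches the parenthetical remark in the abstract.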
Phylogenetic analysis of the expansion of the MATH-BTB gene family in the grasses.

Science.gov (United States)

Juranić, Martina; Dresselhaus, Thomas

2014-01-01

MATH-BTB proteins are known to act as substrate-specific adaptors of cullin3 (CUL3)-based ubiquitin E3 ligases to target protein for ubiquitination. In a previous study we reported the presence of 31 MATH-BTB genes in the maize genome and determined the regulatory role of the MATH-BTB protein MAB1 during meiosis to mitosis transition. In contrast to maize, there are only 6 homologous genes in the model plant Arabidopsis, while this family has largely expanded in grasses. Here, we report a phylogenetic analysis of the MATH-BTB gene family in 9 land plant species including various mosses, eudicots, and grasses. We extend a previous classification of the plant MATH-BTB family and additionally arrange the expanded group into 5 grass-specific clades. Synteny studies indicate that expansion occurred to a large extent due to local gene duplications. Expression studies of 3 closely related MATH-BTB genes in maize (MAB1-3) indicate highly specific expression pattern. In summary, this work provides a solid base for further studies comparing genetic and functional information of the MATH-BTB family especially in the grasses.
Identification of a new gene regulatory circuit involving B cell receptor activated signaling using a combined analysis of experimental, clinical and global gene expression data

Science.gov (United States)

Schrader, Alexandra; Meyer, Katharina; Walther, Neele; Stolz, Ailine; Feist, Maren; Hand, Elisabeth; von Bonin, Frederike; Evers, Maurits; Kohler, Christian; Shirneshan, Katayoon; Vockerodt, Martina; Klapper, Wolfram; Szczepanowski, Monika; Murray, Paul G.; Bastians, Holger; Trümper, Lorenz; Spang, Rainer; Kube, Dieter

2016-01-01

To discover new regulatory pathways in B lymphoma cells, we performed a combined analysis of experimental, clinical and global gene expression data. We identified a specific cluster of genes that was coherently expressed in primary lymphoma samples and suppressed by activation of the B cell receptor (BCR) through αIgM treatment of lymphoma cells in vitro. This gene cluster, which we called BCR.1, includes numerous cell cycle regulators. A reduced expression of BCR.1 genes after BCR activation was observed in different cell lines and also in CD10+ germinal center B cells. We found that BCR activation led to a delayed entry to and progression of mitosis and defects in metaphase. Cytogenetic changes were detected upon long-term αIgM treatment. Furthermore, an inverse correlation of BCR.1 genes with c-Myc co-regulated genes in distinct groups of lymphoma patients was observed. Finally, we showed that the BCR.1 index discriminates activated B cell-like and germinal centre B cell-like diffuse large B cell lymphoma supporting the functional relevance of this new regulatory circuit and the power of guided clustering for biomarker discovery. PMID:27166259
Polycomb-group genes sustaining the stem cell activity

International Nuclear Information System (INIS)

Takihara, Yoshihiro

2006-01-01

Polycomb-group (PcG) genes help constitute the cellular memory mechanisms by which phenotypes, once expressed during development, are transmitted thereafter. This review describes the molecular bases of that memory, including control of the histone code, together with the authors' findings that PcG products sustain hematopoietic stem cell activity. Recent investigations have gradually elucidated the outline of the epigenetic control mechanisms of the memory: messages are set up as a histone code in the chromatin, and the PcG complex, recruited by recognition of the code, regulates the chromatin structure, leading to DNA transcription and maintenance of the phenotype. Ex vivo proliferation of hematopoietic stem cells will become possible if the exact and detailed mechanisms of PcG action are made clear in the future. Such ex vivo techniques are especially awaited for marrow remodeling treatment of hematopoietic failure induced by radiation exposure. (T.I.)
Next-generation sequencing analysis of the ARMS2 gene in Turkish exudative age-related macular degeneration patients.

Science.gov (United States)

Bardak, H; Gunay, M; Ercalik, Y; Bardak, Y; Ozbas, H; Bagci, O

2017-01-23

Age-related macular degeneration (AMD) is the leading cause of blindness in developed countries. It is a complex disease with both genetic and environmental risk factors. To improve clinical management of this condition, it is important to develop risk assessment and prevention strategies for environmental influences, and establish a more effective treatment approach. The aim of the present study was to investigate age-related maculopathy susceptibility protein 2 (ARMS2) gene sequences among Turkish patients with exudative AMD. In addition to 39 advanced exudative AMD patients, 250 healthy individuals for whom exome sequencing data were available were included as a control group. Patients with a history of known environmental and systemic AMD risk factors were excluded. Genomic DNA was isolated from peripheral blood and analyzed using next-generation sequencing. All coding exons of the ARMS2 gene were assessed. Three different ARMS2 sequence variations (rs10490923, rs2736911, and rs10490924) were identified in both the patient and control group. Within the control group, two further ARMS2 gene variants (rs7088128 and rs36213074) were also detected. Logistic regression analysis revealed a relationship between the rs10490924 polymorphism and AMD in the Turkish population.
Gene profile analysis of osteoblast genes differentially regulated by histone deacetylase inhibitors

Directory of Open Access Journals (Sweden)

Lamblin Anne-Francoise

2007-10-01

Background: Osteoblast differentiation requires the coordinated stepwise expression of multiple genes. Histone deacetylase inhibitors (HDIs) accelerate the osteoblast differentiation process by blocking the activity of histone deacetylases (HDACs), which alter gene expression by modifying chromatin structure. We previously demonstrated that HDIs and HDAC3 shRNAs accelerate matrix mineralization and the expression of osteoblast maturation genes (e.g. alkaline phosphatase, osteocalcin). Identifying other genes that are differentially regulated by HDIs might identify new pathways that contribute to osteoblast differentiation. Results: To identify other osteoblast genes that are altered early by HDIs, we incubated MC3T3-E1 preosteoblasts with HDIs (trichostatin A, MS-275, or valproic acid) for 18 hours in osteogenic conditions. The promotion of osteoblast differentiation by HDIs in this experiment was confirmed by osteogenic assays. Gene expression profiles relative to vehicle-treated cells were assessed by microarray analysis with Affymetrix GeneChip 430 2.0 arrays. The regulation of several genes by HDIs in MC3T3-E1 cells and primary osteoblasts was verified by quantitative real-time PCR. Nine genes were differentially regulated by at least two-fold after exposure to each of the three HDIs and six were verified by PCR in osteoblasts. Four of the verified genes (solute carrier family 9 isoform 3 regulator 1 (Slc9a3r1), sorbitol dehydrogenase 1, a kinase anchor protein, and glutathione S-transferase alpha 4) were induced. Two genes (proteasome subunit, beta type 10 and adaptor-related protein complex AP-4 sigma 1) were suppressed. We also identified eight growth factors and growth factor receptor genes that are significantly altered by each of the HDIs, including Frizzled related proteins 1 and 4, which modulate the Wnt signaling pathway. Conclusion: This study identifies osteoblast genes that are regulated early by HDIs and indicates pathways that
Steroidogenesis-related gene expression in the rat ovary exposed to melatonin supplementation

Directory of Open Access Journals (Sweden)

Gisele Negro Lima

2015-02-01

OBJECTIVE: To analyze steroidogenesis-related gene expression in the rat ovary exposed to melatonin supplementation. METHODS: Thirty-two virgin adult female rats were randomized to two groups as follows: the control group GI received vehicle and the experimental group GII received melatonin supplementation (10 µg/night per animal) for 60 consecutive days. After the treatment, animals were anesthetized and the collected ovaries were immediately placed in liquid nitrogen for complementary deoxyribonucleic acid microarray analyses. A GeneChip® Kit Rat Genome 230 2.0 Affymetrix Array was used for gene analysis and the experiment was repeated three times for each group. The results were normalized with the GeneChip® Operating Software program and confirmed through analysis with the secondary deoxyribonucleic acid-Chip Analyzer (dChip) software. The data were confirmed by real-time reverse transcription polymerase chain reaction analysis. Genes related to ovarian function were further confirmed by immunohistochemistry. RESULTS: We found the upregulation of the type 9 adenylate cyclase and inhibin beta B genes and the downregulation of the cyclic adenosine monophosphate response element modulator and cytochrome P450 family 17a1 genes in the ovarian tissue of GII compared to those of the control group. CONCLUSION: Our data suggest that melatonin supplementation decreases gene expression of cyclic adenosine monophosphate, which changes ovarian steroidogenesis.

Evolutionary Analysis of Minor Histocompatibility Genes In Hydra

KAUST Repository

Aalismail, Nojood

2016-01-01

In the present study we took initiative to study the self/nonself recognition in hydra and its relation to the immune response. Moreover, performing phylogenetic analysis to look for annotated immune genes in hydra gave us a potential to analyze the expression of minor histocompatibility genes that have been shown to play a major role in grafting and transplantation in mammals. Here we obtained the cDNA library that shows expression of minor histocompatibility genes and confirmed that the annotated sequences in databases are actually present. In addition, grafting experiments suggested, although still preliminary, that homograft showed less rejection response than in heterograft. Involvement of possible minor histocompatibility gene orthologous in immune response was examined by qPCR.
Analysis of gene expression profile microarray data in complex regional pain syndrome.

Science.gov (United States)

Tan, Wulin; Song, Yiyan; Mo, Chengqiang; Jiang, Shuangjian; Wang, Zhongxing

2017-09-01

The aim of the present study was to predict key genes and proteins associated with complex regional pain syndrome (CRPS) using bioinformatics analysis. The gene expression profiling microarray data, GSE47603, which included peripheral blood samples from 4 patients with CRPS and 5 healthy controls, was obtained from the Gene Expression Omnibus (GEO) database. The differentially expressed genes (DEGs) in CRPS patients compared with healthy controls were identified using the GEO2R online tool. Functional enrichment analysis was then performed using The Database for Annotation Visualization and Integrated Discovery online tool. Protein‑protein interaction (PPI) network analysis was subsequently performed using Search Tool for the Retrieval of Interaction Genes database and analyzed with Cytoscape software. A total of 257 DEGs were identified, including 243 upregulated genes and 14 downregulated ones. Genes in the human leukocyte antigen (HLA) family were most significantly differentially expressed. Enrichment analysis demonstrated that signaling pathways, including immune response, cell motion, adhesion and angiogenesis were associated with CRPS. PPI network analysis revealed that key genes, including early region 1A binding protein p300 (EP300), CREB‑binding protein (CREBBP), signal transducer and activator of transcription (STAT)3, STAT5A and integrin α M were associated with CRPS. The results suggest that the immune response may therefore serve an important role in CRPS development. In addition, genes in the HLA family, such as HLA‑DQB1 and HLA‑DRB1, may present potential biomarkers for the diagnosis of CRPS. Furthermore, EP300, its paralog CREBBP, and the STAT family genes, STAT3 and STAT5 may be important in the development of CRPS.
EST analysis in Ginkgo biloba: an assessment of conserved developmental regulators and gymnosperm specific genes

Directory of Open Access Journals (Sweden)

Runko Suzan J

2005-10-01

Background: Ginkgo biloba L. is the only surviving member of one of the oldest living seed plant groups with medicinal, spiritual and horticultural importance worldwide. As an evolutionary relic, it displays many characters found in the early, extinct seed plants and extant cycads. To establish a molecular base to understand the evolution of seeds and pollen, we created a cDNA library and EST dataset from the reproductive structures of male (microsporangiate), female (megasporangiate), and vegetative organs (leaves) of Ginkgo biloba. Results: RNA from newly emerged male and female reproductive organs and immature leaves was used to create three distinct cDNA libraries from which 6,434 ESTs were generated. These 6,434 ESTs from Ginkgo biloba were clustered into 3,830 unigenes. A comparison of our Ginkgo unigene set against the fully annotated genomes of rice and Arabidopsis, and all available ESTs in GenBank revealed that 256 Ginkgo unigenes match only genes among the gymnosperms and non-seed plants – many with multiple matches to genes in non-angiosperm plants. Conversely, another group of unigenes in Ginkgo had highly significant homology to transcription factors in angiosperms involved in development, including MADS box genes as well as post-transcriptional regulators. Several of the conserved developmental genes found in Ginkgo had top BLAST homology to cycad genes. We also note here the presence of ESTs in G. biloba similar to genes that to date have only been found in gymnosperms and an additional 22 Ginkgo genes common only to genes from cycads. Conclusion: Our analysis of an EST dataset from G. biloba revealed genes potentially unique to gymnosperms. Many of these genes showed homology to fully sequenced clones from our cycad EST dataset found in common only with gymnosperms. Other Ginkgo ESTs are similar to developmental regulators in higher plants. This work sets the stage for future studies on Ginkgo to better understand seed and
Protease of Stenotrophomonas sp. from Indonesian fermented food: gene cloning and analysis

Directory of Open Access Journals (Sweden)

Frans Kurnia

2018-02-01

Full Text Available Screening of proteolytic and fibrinolytic bacteria from the Indonesian soybean-based fermented food Oncom revealed several potential isolates. Based on 16S rDNA gene analysis, the isolate with the highest proteolytic and fibrinolytic activity was identified as Stenotrophomonas sp. The protease gene was amplified to generate a 1749 bp polymerase chain reaction product, and BLAST analysis revealed 90% homology with a gene encoding a protease enzyme from Stenotrophomonas maltophilia. The putative amino acid sequence indicated a serine protease enzyme with the typical amino acids aspartate, histidine and serine in the catalytic triad. The gene was translated into a pre-pro-protein consisting of a cleavage site at its N terminus and a Pre-Peptidase C-terminal domain. Cloning of the protease gene in pET22b with Escherichia coli BL21 DE3 as the host showed that the gene was expressed as an insoluble protein fraction. This is the first report on the analysis of a protease gene from a food-origin Stenotrophomonas sp.
Genome-wide analysis of carotenoid cleavage oxygenase genes and their responses to various phytohormones and abiotic stresses in apple (Malus domestica).

Science.gov (United States)

Chen, Hongfei; Zuo, Xiya; Shao, Hongxia; Fan, Sheng; Ma, Juanjuan; Zhang, Dong; Zhao, Caiping; Yan, Xiangyan; Liu, Xiaojie; Han, Mingyu

2018-02-01

Carotenoid cleavage oxygenases (CCOs) are able to cleave carotenoids to produce apocarotenoids and their derivatives, which are important for plant growth and development. In this study, 21 apple CCO genes were identified and divided into six groups based on their phylogenetic relationships. We further characterized the apple CCO genes in terms of chromosomal distribution, structure and the presence of cis-elements in the promoter. We also predicted the cellular localization of the encoded proteins. An analysis of the synteny within the apple genome revealed that tandem, segmental, and whole-genome duplication events likely contributed to the expansion of the apple carotenoid oxygenase gene family. An additional integrated synteny analysis identified orthologous carotenoid oxygenase genes between apple and Arabidopsis thaliana, which served as references for the functional analysis of the apple CCO genes. The net photosynthetic rate, transpiration rate, and stomatal conductance of leaves decreased, while leaf stomatal density increased under drought and saline conditions. Tissue-specific gene expression analyses revealed diverse spatiotemporal expression patterns. Finally, hormone and abiotic stress treatments indicated that many apple CCO genes are responsive to various phytohormones as well as drought and salinity stresses. The genome-wide identification of apple CCO genes and the analyses of their expression patterns described herein may provide a solid foundation for future studies examining the regulation and functions of this gene family. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
Microarray analysis of the gene expression profile in triethylene ...

African Journals Online (AJOL)

Microarray analysis of the gene expression profile in triethylene glycol dimethacrylate-treated human dental pulp cells. ... Conclusions: Our results suggest that TEGDMA can change the many functions of hDPCs through large changes in gene expression levels and complex interactions with different signaling pathways.
Gene ontology analysis of pairwise genetic associations in two genome-wide studies of sporadic ALS

Directory of Open Access Journals (Sweden)

Kim Nora

2012-07-01

Full Text Available Abstract Background It is increasingly clear that common human diseases have a complex genetic architecture characterized by both additive and nonadditive genetic effects. The goal of the present study was to determine whether patterns of both additive and nonadditive genetic associations aggregate in specific functional groups as defined by the Gene Ontology (GO). Results We first estimated all pairwise additive and nonadditive genetic effects using the multifactor dimensionality reduction (MDR) method that makes few assumptions about the underlying genetic model. Statistical significance was evaluated using permutation testing in two genome-wide association studies of ALS. The detection data consisted of 276 subjects with ALS and 271 healthy controls while the replication data consisted of 221 subjects with ALS and 211 healthy controls. Both studies included genotypes from approximately 550,000 single-nucleotide polymorphisms (SNPs). Each SNP was mapped to a gene if it was within 500 kb of the start or end. Each SNP was assigned a p-value based on its strongest joint effect with the other SNPs. We then used the Exploratory Visual Analysis (EVA) method and software to assign a p-value to each gene based on the overabundance of significant SNPs at the α = 0.05 level in the gene. We also used EVA to assign p-values to each GO group based on the overabundance of significant genes at the α = 0.05 level. A GO category was determined to replicate if that category was significant at the α = 0.05 level in both studies. We found two GO categories that replicated in both studies. The first, ‘Regulation of Cellular Component Organization and Biogenesis’, a GO Biological Process, had p-values of 0.010 and 0.014 in the detection and replication studies, respectively. The second, ‘Actin Cytoskeleton’, a GO Cellular Component, had p-values of 0.040 and 0.046 in the detection and replication studies, respectively. Conclusions Pathway
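The mapping and replication rules stated in this abstract can be sketched in a few lines of Python. This is only an illustration, not the EVA software: the function names, the inclusive window handling, and the third "hypothetical category" entry are assumptions; the other p-values are the ones quoted in the abstract.

```python
WINDOW_BP = 500_000  # 500 kb flank, as stated in the abstract


def snp_maps_to_gene(snp_pos, gene_start, gene_end, window=WINDOW_BP):
    """True if the SNP lies inside the gene or within `window` bp of either end.

    Assumption: SNPs inside the gene body also count as mapped.
    """
    return gene_start - window <= snp_pos <= gene_end + window


def replicated(detection_p, replication_p, alpha=0.05):
    """GO categories significant at `alpha` in both studies."""
    return sorted(
        name
        for name, p in detection_p.items()
        if p < alpha and replication_p.get(name, 1.0) < alpha
    )


detection = {
    "Regulation of Cellular Component Organization and Biogenesis": 0.010,
    "Actin Cytoskeleton": 0.040,
    "hypothetical category": 0.030,  # invented entry for contrast
}
replication = {
    "Regulation of Cellular Component Organization and Biogenesis": 0.014,
    "Actin Cytoskeleton": 0.046,
    "hypothetical category": 0.200,  # invented entry for contrast
}
print(replicated(detection, replication))
```

Only the two categories reported in the abstract pass both thresholds; the invented third category is dropped because it fails in the replication data.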
[The role of cytochrome P450 in nonalcoholic fatty liver induced by high-fat diet: a gene expression profile analysis].

Science.gov (United States)

Liu, Y; Cheng, F; Luo, Y X; Hu, P; Ren, H; Peng, M L

2017-04-20

Objective: To clarify the role of cytochrome P450 in nonalcoholic fatty liver disease (NAFLD) by RNA-Seq and bioinformatics analysis. Methods: A total of 20 male C57BL/6 mice were used. Ten mice were fed with high-fat diet (D12492, 60% kcal fat) for 16 weeks to establish a mouse model of NAFLD, and the other 10 mice were fed with low-fat diet (D12450B, 10% kcal fat) as control group. At the end of the experiment, the body weight, liver weight, and hepatic triglyceride (TG) content were measured. Meanwhile, HE staining and RNA-Seq analysis were performed for the liver tissues. The differentially expressed genes were screened out and subjected to bioinformatics analysis, including KEGG and GO BP enrichment analyses and interaction network analysis. Comparison of means between the two groups was made using t-test. Results: Compared with the control group, the mice in the model group were obviously obese, with significantly increased body weight (41.41 ± 6.01 g vs 28.78 ± 1.79 g, t = 6.04, P steatosis, accompanied by a small amount of inflammatory cell infiltration, but with no obvious fibrosis, according to the results of HE staining. In addition, the hepatic TG content in the model group was significantly increased compared with that in the control group (0.64 ± 0.01 mg/mg vs 0.29 ± 0.06 mg/mg, t = 10.11, P = 0.04). Compared with the control group, a total of 367 differentially expressed genes, including 211 down-regulated and 156 up-regulated ones, were identified in the model group according to the RNA-seq results. Meanwhile, 19 CYP450 subtypes, accounting for 5% of the differentially expressed genes, were identified, and CYP2E1, CYP2C70, CYP3A11, CYP3A25, CYP2D26, CYP4A10, CYP17A1, CYP2B10, and CYP2C38 were involved in oxidative stress, steroid hormone metabolism, fatty acid metabolism, arachidonic acid metabolism, and the PPAR signaling pathway. An interaction network was constructed with 30 nodes, and CYP2E1 and CYP2C70 were identified as key nodes. RT
Association of adipocyte genes with ASP expression: a microarray analysis of subcutaneous and omental adipose tissue in morbidly obese subjects

Directory of Open Access Journals (Sweden)

Lu HuiLing

2010-01-01

Full Text Available Abstract Background Prevalence of obesity is increasing to pandemic proportions. However, obese subjects differ in insulin resistance, adipokine production and co-morbidities. Based on fasting plasma analysis, obese subjects were grouped as Low Acylation Stimulating protein (ASP) and Triglyceride (TG) (LAT) vs High ASP and TG (HAT). Subcutaneous (SC) and omental (OM) adipose tissues (n = 21) were analysed by microarray, and biologic pathways in lipid metabolism and inflammation were specifically examined. Methods LAT and HAT groups were matched in age, obesity, insulin, and glucose, and had similar expression of insulin-related genes (InsR, IRS-1). ASP related genes tended to be increased in the HAT group and were correlated (factor B, adipsin, complement C3, p Results HAT adipose tissue demonstrated increased lipid related genes for storage (CD36, DGAT1, DGAT2, SCD1, FASN, and LPL), lipolysis (HSL, CES1, perilipin), fatty acid binding proteins (FABP1, FABP3) and adipocyte differentiation markers (CEBPα, CEBPβ, PPARγ). By contrast, oxidation related genes were decreased (AMPK, UCP1, CPT1, FABP7). HAT subjects had increased anti-inflammatory genes TGFB1, TIMP1, TIMP3, and TIMP4 while proinflammatory PIG7 and MMP2 were also significantly increased; all genes, p Conclusion Taken together, the profile of C5L2 receptor, ASP gene expression and metabolic factors in adipose tissue from morbidly obese HAT subjects suggests a compensatory response associated with the increased plasma ASP and TG.
Sequencing and transcriptional analysis of the Streptococcus thermophilus histamine biosynthesis gene cluster: factors that affect differential hdcA expression

DEFF Research Database (Denmark)

Calles-Enríquez, Marina; Hjort, Benjamin Benn; Andersen, Pia Skov

2010-01-01

to produce histamine. The hdc clusters of S. thermophilus CHCC1524 and CHCC6483 were sequenced, and the factors that affect histamine biosynthesis and histidine-decarboxylating gene (hdcA) expression were studied. The hdc cluster began with the hdcA gene, was followed by a transporter (hdcP), and ended...... with the hdcB gene, which is of unknown function. The three genes were orientated in the same direction. The genetic organization of the hdc cluster showed a unique organization among the lactic acid bacterial group and resembled those of Staphylococcus and Clostridium species, thus indicating possible...... acquisition through a horizontal transfer mechanism. Transcriptional analysis of the hdc cluster revealed the existence of a polycistronic mRNA covering the three genes. The histidine-decarboxylating gene (hdcA) of S. thermophilus demonstrated maximum expression during the stationary growth phase, with high...
Comparative analysis of codon usage patterns and identification of predicted highly expressed genes in five Salmonella genomes

Directory of Open Access Journals (Sweden)

Mondal U

2008-01-01

Full Text Available Purpose: To analyse codon usage patterns of five complete genomes of Salmonella, predict highly expressed genes, examine horizontally transferred pathogenicity-related genes to detect their presence in the strains, and scrutinize the nature of highly expressed genes to infer upon their lifestyle. Methods: Protein coding genes, ribosomal protein genes, and pathogenicity-related genes were analysed with Codon W and CAI (codon adaptation index) Calculator. Results: Translational efficiency plays a role in codon usage variation in Salmonella genes. Low bias was noticed in most of the genes. GC3 (guanine cytosine at third position) composition does not influence codon usage variation in the genes of these Salmonella strains. Among the cluster of orthologous groups (COGs), translation, ribosomal structure biogenesis [J], and energy production and conversion [C] contained the highest number of potentially highly expressed (PHX) genes. Correspondence analysis reveals the conserved nature of the genes. Highly expressed genes were detected. Conclusions: Selection for translational efficiency is the major source of variation of codon usage in the genes of Salmonella. Evolution of pathogenicity-related genes as a unit suggests their ability to infect and exist as a pathogen. Presence of a lot of PHX genes in the information and storage-processing category of COGs indicated their lifestyle and revealed that they were not subjected to genome reduction.
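For readers unfamiliar with the codon adaptation index, the sketch below shows the usual definition: the geometric mean of per-codon relative adaptiveness values, each being a codon's count in a reference set of highly expressed genes divided by the count of the most used synonymous codon. It is not the Codon W or CAI Calculator implementation; the three-family codon table, the 0.5 pseudocount for unseen codons, and the toy sequences are assumptions made to keep the example short.

```python
import math
from collections import Counter

# Toy subset of synonymous codon families (assumption: a real analysis
# would use the full standard genetic code).
SYNONYMOUS = {
    "K": ["AAA", "AAG"],
    "E": ["GAA", "GAG"],
    "F": ["TTT", "TTC"],
}


def codons(seq):
    """Split a coding sequence into complete codons, dropping any remainder."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def relative_adaptiveness(reference_seqs):
    """w(codon) = count / max count among its synonymous codons."""
    counts = Counter(c for s in reference_seqs for c in codons(s))
    weights = {}
    for family in SYNONYMOUS.values():
        top = max(counts[c] for c in family)
        if top == 0:
            continue  # amino acid absent from the reference set
        for c in family:
            # Pseudocount of 0.5 for unseen codons avoids log(0).
            weights[c] = max(counts[c], 0.5) / top
    return weights


def cai(seq, weights):
    """Geometric mean of w over the codons that have a weight."""
    logs = [math.log(weights[c]) for c in codons(seq) if c in weights]
    return math.exp(sum(logs) / len(logs)) if logs else float("nan")


reference = ["AAAAAAAAGGAAGAATTC"]  # hypothetical highly expressed gene
w = relative_adaptiveness(reference)
print(cai("AAAGAATTC", w), cai("AAGAAG", w))
```

A gene built only from the preferred codons of the reference set scores 1, while one using less favoured codons scores lower.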
Gene dosage, expression, and ontology analysis identifies driver genes in the carcinogenesis and chemoradioresistance of cervical cancer.

Directory of Open Access Journals (Sweden)

Malin Lando

2009-11-01

Full Text Available Integrative analysis of gene dosage, expression, and ontology (GO) data was performed to discover driver genes in the carcinogenesis and chemoradioresistance of cervical cancers. Gene dosage and expression profiles of 102 locally advanced cervical cancers were generated by microarray techniques. Fifty-two of these patients were also analyzed with the Illumina expression method to confirm the gene expression results. An independent cohort of 41 patients was used for validation of gene expressions associated with clinical outcome. Statistical analysis identified 29 recurrent gains and losses and 3 losses (on 3p, 13q, 21q) associated with poor outcome after chemoradiotherapy. The intratumor heterogeneity, assessed from the gene dosage profiles, was low for these alterations, showing that they had emerged prior to many other alterations and probably were early events in carcinogenesis. Integration of the alterations with gene expression and GO data identified genes that were regulated by the alterations and revealed five biological processes that were significantly overrepresented among the affected genes: apoptosis, metabolism, macromolecule localization, translation, and transcription. Four genes on 3p (RYBP, GBE1) and 13q (FAM48A, MED4) correlated with outcome at both the gene dosage and expression level and were satisfactorily validated in the independent cohort. These integrated analyses yielded 57 candidate drivers of 24 genetic events, including novel loci responsible for chemoradioresistance. Further mapping of the connections among genetic events, drivers, and biological processes suggested that each individual event stimulates specific processes in carcinogenesis through the coordinated control of multiple genes. The present results may provide novel therapeutic opportunities for both early and advanced stage cervical cancers.
Prioritization of epilepsy associated candidate genes by convergent analysis.

Science.gov (United States)

Jia, Peilin; Ewers, Jeffrey M; Zhao, Zhongming

2011-02-24

Epilepsy is a severe neurological disorder affecting a large number of individuals, yet the underlying genetic risk factors for epilepsy remain unclear. Recent studies have revealed several recurrent copy number variations (CNVs) that are more likely to be associated with epilepsy. The responsible gene(s) within these regions have yet to be definitively linked to the disorder, and the implications of their interactions are not fully understood. Identification of these genes may contribute to a better pathological understanding of epilepsy, and serve to implicate novel therapeutic targets for further research. In this study, we examined genes within heterozygous deletion regions identified in a recent large-scale study, encompassing a diverse spectrum of epileptic syndromes. By integrating additional protein-protein interaction data, we constructed subnetworks for these CNV-region genes and also those previously studied for epilepsy. We observed 20 genes common to both networks, primarily concentrated within a small molecular network populated by GABA receptor, BDNF/MAPK signaling, and estrogen receptor genes. From among the hundreds of genes in the initial networks, these were designated by convergent evidence for their likely association with epilepsy. Importantly, the identified molecular network was found to contain complex interrelationships, providing further insight into epilepsy's underlying pathology. We further performed pathway enrichment and crosstalk analysis and revealed a functional map which indicates the significant enrichment of closely related neurological, immune, and kinase regulatory pathways. The convergent framework we proposed here provides a unique and powerful approach to screening and identifying promising disease genes out of typically hundreds to thousands of genes in disease-related CNV-regions. Our network and pathway analysis provides important implications for the underlying molecular mechanisms for epilepsy. The strategy can be
Prioritization of epilepsy associated candidate genes by convergent analysis.

Directory of Open Access Journals (Sweden)

Peilin Jia

2011-02-01

Full Text Available Epilepsy is a severe neurological disorder affecting a large number of individuals, yet the underlying genetic risk factors for epilepsy remain unclear. Recent studies have revealed several recurrent copy number variations (CNVs) that are more likely to be associated with epilepsy. The responsible gene(s) within these regions have yet to be definitively linked to the disorder, and the implications of their interactions are not fully understood. Identification of these genes may contribute to a better pathological understanding of epilepsy, and serve to implicate novel therapeutic targets for further research. In this study, we examined genes within heterozygous deletion regions identified in a recent large-scale study, encompassing a diverse spectrum of epileptic syndromes. By integrating additional protein-protein interaction data, we constructed subnetworks for these CNV-region genes and also those previously studied for epilepsy. We observed 20 genes common to both networks, primarily concentrated within a small molecular network populated by GABA receptor, BDNF/MAPK signaling, and estrogen receptor genes. From among the hundreds of genes in the initial networks, these were designated by convergent evidence for their likely association with epilepsy. Importantly, the identified molecular network was found to contain complex interrelationships, providing further insight into epilepsy's underlying pathology. We further performed pathway enrichment and crosstalk analysis and revealed a functional map which indicates the significant enrichment of closely related neurological, immune, and kinase regulatory pathways. The convergent framework we proposed here provides a unique and powerful approach to screening and identifying promising disease genes out of typically hundreds to thousands of genes in disease-related CNV-regions. Our network and pathway analysis provides important implications for the underlying molecular mechanisms for epilepsy. The
Screening key candidate genes and pathways involved in insulinoma by microarray analysis.

Science.gov (United States)

Zhou, Wuhua; Gong, Li; Li, Xuefeng; Wan, Yunyan; Wang, Xiangfei; Li, Huili; Jiang, Bin

2018-06-01

Insulinoma is a rare type of tumor and its genetic features remain largely unknown. This study aimed to search for potential key genes and relevant enriched pathways of insulinoma. The gene expression data from GSE73338 were downloaded from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) were identified between insulinoma tissues and normal pancreas tissues, followed by pathway enrichment analysis, protein-protein interaction (PPI) network construction, and module analysis. The expressions of candidate key genes were validated by quantitative real-time polymerase chain reaction (RT-PCR) in insulinoma tissues. A total of 1632 DEGs were obtained, including 1117 upregulated genes and 514 downregulated genes. Pathway enrichment results showed that upregulated DEGs were significantly implicated in insulin secretion, and downregulated DEGs were mainly enriched in pancreatic secretion. PPI network analysis revealed 7 hub genes with degrees more than 10, including GCG (glucagon), GCGR (glucagon receptor), PLCB1 (phospholipase C, beta 1), CASR (calcium sensing receptor), F2R (coagulation factor II thrombin receptor), GRM1 (glutamate metabotropic receptor 1), and GRM5 (glutamate metabotropic receptor 5). DEGs involved in the significant modules were enriched in calcium signaling pathway, protein ubiquitination, and platelet degranulation. Quantitative RT-PCR data confirmed that the expression trends of these hub genes were similar to the results of bioinformatic analysis. The present study demonstrated that candidate DEGs and enriched pathways were the potential critical molecule events involved in the development of insulinoma, and these findings were useful for better understanding of insulinoma genesis.
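The hub-gene criterion mentioned in this abstract (degree greater than 10 in the PPI network) amounts to counting distinct interaction partners per gene. The sketch below is a minimal illustration under that reading; the edge list is invented (partner names such as P0 are placeholders) and is not taken from the study's network.

```python
from collections import defaultdict


def hub_genes(edges, min_degree=10):
    """Return genes with more than `min_degree` distinct interaction partners."""
    neighbors = defaultdict(set)
    for a, b in edges:
        if a == b:
            continue  # ignore self-interactions
        neighbors[a].add(b)
        neighbors[b].add(a)
    return sorted(g for g, n in neighbors.items() if len(n) > min_degree)


# Hypothetical edges: GCG gets 11 partners, GCGR only 10.
edges = [("GCG", f"P{i}") for i in range(11)]
edges += [("GCGR", f"Q{i}") for i in range(10)]
edges.append(("GCG", "P0"))  # duplicate edge, counted once

print(hub_genes(edges))
```

Using sets of neighbours rather than raw edge counts keeps duplicate edges from inflating a gene's degree.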
Analysis of the structural genes encoding M-factor in the fission yeast Schizosaccharomyces pombe: identification of a third gene, mfm3

DEFF Research Database (Denmark)

Kjaerulff, S; Davey, William John; Nielsen, O

1994-01-01

We previously identified two genes, mfm1 and mfm2, with the potential to encode the M-factor mating pheromone of the fission yeast Schizosaccharomyces pombe (J. Davey, EMBO J. 11:951-960, 1992), but further analysis revealed that a mutant strain lacking both genes still produced active M-factor. ...... that is not rescued by addition of exogenous M-factor. A mutational analysis reveals that all three mfm genes contribute to the production of M-factor. Their transcription is limited to M cells and requires the mat1-Mc and ste11 gene products. Each gene is induced when the cells are starved of nitrogen and further...
A novel statistical algorithm for gene expression analysis helps differentiate pregnane X receptor-dependent and independent mechanisms of toxicity.

Directory of Open Access Journals (Sweden)

M Ann Mongan

Full Text Available Genome-wide gene expression profiling has become standard for assessing potential liabilities as well as for elucidating mechanisms of toxicity of drug candidates under development. Analysis of microarray data is often challenging due to the lack of a statistical model that is amenable to biological variation in a small number of samples. Here we present a novel non-parametric algorithm that requires minimal assumptions about the data distribution. Our method for determining differential expression consists of two steps: (1) we apply a nominal threshold on fold change and platform p-value to designate whether a gene is differentially expressed in each treated and control sample relative to the averaged control pool, and (2) we compare the number of samples satisfying the criteria in step 1 between the treated and control groups to estimate the statistical significance based on a null distribution established by sample permutations. The method captures group effect without being too sensitive to anomalies as it allows tolerance for potential non-responders in the treatment group and outliers in the control group. Performance and results of this method were compared with the Significance Analysis of Microarrays (SAM) method. These two methods were applied to investigate hepatic transcriptional responses of wild-type (PXR(+/+)) and pregnane X receptor-knockout (PXR(-/-)) mice after 96 h exposure to CMP013, an inhibitor of β-secretase (β-site of amyloid precursor protein cleaving enzyme 1, or BACE1). Our results showed that CMP013 led to transcriptional changes in hallmark PXR-regulated genes and induced a cascade of gene expression changes that explained the hepatomegaly observed only in PXR(+/+) animals. Comparison of concordant expression changes between PXR(+/+) and PXR(-/-) mice also suggested a PXR-independent association between CMP013 and perturbations to cellular stress, lipid metabolism, and biliary transport.
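The two-step procedure described in this abstract can be illustrated for a single gene as follows. This is a rough sketch of the general idea, not the authors' algorithm: the cut-off values, the use of log2 fold changes, the test statistic (difference in flagged-sample counts), and all function names are assumptions.

```python
import random


def flag_samples(log2_fold_changes, p_values, fc_cut=1.0, p_cut=0.01):
    """Step 1: flag each sample in which the gene passes both nominal cut-offs.

    Fold changes are assumed to be log2 ratios against the averaged control pool.
    """
    return [
        abs(fc) >= fc_cut and p <= p_cut
        for fc, p in zip(log2_fold_changes, p_values)
    ]


def permutation_p_value(treated_flags, control_flags, n_perm=2000, seed=0):
    """Step 2: compare flagged-sample counts between groups by label permutation."""
    rng = random.Random(seed)
    observed = sum(treated_flags) - sum(control_flags)
    pooled = list(treated_flags) + list(control_flags)
    n_treated = len(treated_flags)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)  # reassign group labels at random
        stat = sum(pooled[:n_treated]) - sum(pooled[n_treated:])
        if stat >= observed:
            hits += 1
    # Add-one correction keeps the estimate away from exactly zero.
    return (hits + 1) / (n_perm + 1)


treated = flag_samples([2.1, 1.8, 2.5, 1.4, 0.2], [0.001] * 5)  # one non-responder
control = flag_samples([0.1, -0.2, 0.0, 0.3, 1.9], [0.5, 0.4, 0.6, 0.7, 0.001])  # one outlier
print(treated, control, permutation_p_value(treated, control))
```

Because the statistic counts flagged samples rather than averaging expression values, a single non-responder or outlier shifts it by one count at most, which is the tolerance the abstract refers to.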
Time-Course Analysis of Gene Expression During the Saccharomyces cerevisiae Hypoxic Response

Directory of Open Access Journals (Sweden)

Nasrine Bendjilali

2017-01-01

Full Text Available Many cells experience hypoxia, or low oxygen, and respond by dramatically altering gene expression. In the yeast Saccharomyces cerevisiae, genes that respond are required for many oxygen-dependent cellular processes, such as respiration, biosynthesis, and redox regulation. To more fully characterize the global response to hypoxia, we exposed yeast to hypoxic conditions, extracted RNA at different times, and performed RNA sequencing (RNA-seq) analysis. Time-course statistical analysis revealed hundreds of genes that changed expression by up to 550-fold. The genes responded with varying kinetics suggesting that multiple regulatory pathways are involved. We identified most known oxygen-regulated genes and also uncovered new regulated genes. Reverse transcription-quantitative PCR (RT-qPCR) analysis confirmed that the lysine methyltransferase EFM6 and the recombinase DMC1, both conserved in humans, are indeed oxygen-responsive. Looking more broadly, oxygen-regulated genes participate in expected processes like respiration and lipid metabolism, but also in unexpected processes like amino acid and vitamin metabolism. Using principal component analysis, we discovered that the hypoxic response largely occurs during the first 2 hr and then a new steady-state expression state is achieved. Moreover, we show that the oxygen-dependent genes are not part of the previously described environmental stress response (ESR) consisting of genes that respond to diverse types of stress. While hypoxia appears to cause a transient stress, the hypoxic response is mostly characterized by a transition to a new state of gene expression. In summary, our results reveal that hypoxia causes widespread and complex changes in gene expression to prepare the cell to function with little or no oxygen.
Time-Course Analysis of Gene Expression During the Saccharomyces cerevisiae Hypoxic Response.

Science.gov (United States)

Bendjilali, Nasrine; MacLeon, Samuel; Kalra, Gurmannat; Willis, Stephen D; Hossian, A K M Nawshad; Avery, Erica; Wojtowicz, Olivia; Hickman, Mark J

2017-01-05

Many cells experience hypoxia, or low oxygen, and respond by dramatically altering gene expression. In the yeast Saccharomyces cerevisiae, genes that respond are required for many oxygen-dependent cellular processes, such as respiration, biosynthesis, and redox regulation. To more fully characterize the global response to hypoxia, we exposed yeast to hypoxic conditions, extracted RNA at different times, and performed RNA sequencing (RNA-seq) analysis. Time-course statistical analysis revealed hundreds of genes that changed expression by up to 550-fold. The genes responded with varying kinetics suggesting that multiple regulatory pathways are involved. We identified most known oxygen-regulated genes and also uncovered new regulated genes. Reverse transcription-quantitative PCR (RT-qPCR) analysis confirmed that the lysine methyltransferase EFM6 and the recombinase DMC1, both conserved in humans, are indeed oxygen-responsive. Looking more broadly, oxygen-regulated genes participate in expected processes like respiration and lipid metabolism, but also in unexpected processes like amino acid and vitamin metabolism. Using principal component analysis, we discovered that the hypoxic response largely occurs during the first 2 hr and then a new steady-state expression state is achieved. Moreover, we show that the oxygen-dependent genes are not part of the previously described environmental stress response (ESR) consisting of genes that respond to diverse types of stress. While hypoxia appears to cause a transient stress, the hypoxic response is mostly characterized by a transition to a new state of gene expression. In summary, our results reveal that hypoxia causes widespread and complex changes in gene expression to prepare the cell to function with little or no oxygen. Copyright © 2017 Bendjilali et al.
Genomewide identification and expression analysis of the ARF gene ...

Indian Academy of Sciences (India)

Figure 1. Phylogenetic relation of apple ARF genes. The phylogenetic tree was constructed based on a complete protein sequence alignment of MdARFs by the neighbour-joining method with bootstrapping analysis (1000 replicates). The scale bar represents 0.05 amino acid substitutions per site. Paralogous gene pairs ...

Identification of Tunisian Leishmania spp. by PCR amplification of cysteine proteinase B (cpb) genes and phylogenetic analysis.

Science.gov (United States)

Chaouch, Melek; Fathallah-Mili, Akila; Driss, Mehdi; Lahmadi, Ramzi; Ayari, Chiraz; Guizani, Ikram; Ben Said, Moncef; Benabderrazak, Souha

2013-03-01

Discrimination of the Old World Leishmania parasites is important for diagnosis and epidemiological studies of leishmaniasis. We have developed PCR assays that allow the discrimination between the Tunisian species Leishmania major, Leishmania tropica and Leishmania infantum. The identification was performed by a simple PCR targeting cysteine protease B (cpb) gene copies. These PCR assays can serve as routine molecular biology tools for discrimination of Leishmania spp. from different geographical origins and different clinical forms. Our assays can also inform studies of the cpb gene for drug, diagnostics and vaccine research. The PCR products of the cpb gene and the N-acetylglucosamine-1-phosphate transferase (nagt) Leishmania gene were sequenced and aligned. Phylogenetic trees of Leishmania based on cpb and nagt sequences are close in topology and present the classic distribution of Leishmania in the Old World. The phylogenetic analysis has enabled the characterization and identification of different strains, using both multicopy (cpb) and single copy (nagt) genes. Indeed, the cpb phylogenetic analysis allowed us to identify the Tunisian Leishmania killicki species, and a group which gathers the least evolved isolates of the Leishmania donovani complex, which originated from East Africa. This clustering confirms the African origin for the visceralizing species of the L. donovani complex. Copyright © 2012 Elsevier B.V. All rights reserved.
Super-delta: a new differential gene expression analysis procedure with robust data normalization.

Science.gov (United States)

Liu, Yuhang; Zhang, Jinfeng; Qiu, Xing

2017-12-21

-delta provides new insights into the area of differential gene expression analysis. A solid theoretical foundation supports its asymptotic unbiasedness and technical noise-free properties. Implementation on real and simulated datasets demonstrates its decent performance compared with state-of-the-art procedures. It also has the potential to be extended to other data types and/or more general between-group comparison problems.
Analysis of the functional gene structure and metabolic potential of microbial community in high arsenic groundwater.

Science.gov (United States)

Li, Ping; Jiang, Zhou; Wang, Yanhong; Deng, Ye; Van Nostrand, Joy D; Yuan, Tong; Liu, Han; Wei, Dazhun; Zhou, Jizhong

2017-10-15

Microbial functional potential in high arsenic (As) groundwater ecosystems remains largely unknown. In this study, the microbial community functional composition of nineteen groundwater samples was investigated using a functional gene array (GeoChip 5.0). Samples were divided into low and high As groups based on the clustering analysis of geochemical parameters and microbial functional structures. The results showed that As related genes (arsC, arrA), sulfate related genes (dsrA and dsrB), nitrogen cycling related genes (ureC, amoA, and hzo) and methanogen genes (mcrA, hdrB) in groundwater samples were correlated with As, SO₄²⁻, NH₄⁺ or CH₄ concentrations, respectively. Canonical correspondence analysis (CCA) results indicated that some geochemical parameters including As, total organic content, SO₄²⁻, NH₄⁺, oxidation-reduction potential (ORP) and pH were important factors shaping the functional microbial community structures. Alkaline and reducing conditions with relatively low SO₄²⁻ and ORP and high NH₄⁺, as well as SO₄²⁻ and Fe reduction and ammonification involved in microbially-mediated geochemical processes could be associated with As enrichment in groundwater. This study provides an overall picture of functional microbial communities in high As groundwater aquifers, and also provides insights into the critical role of microorganisms in As biogeochemical cycling. Copyright © 2017 Elsevier Ltd. All rights reserved.
Protein functional links in Trypanosoma brucei, identified by gene fusion analysis

Directory of Open Access Journals (Sweden)

Trimpalis Philip

2011-07-01

Full Text Available Abstract Background Domain or gene fusion analysis is a bioinformatics method for detecting gene fusions in one organism by comparing its genome to that of other organisms. The occurrence of gene fusions suggests that the two original genes that participated in the fusion are functionally linked, i.e. their gene products interact either as part of a multi-subunit protein complex, or in a metabolic pathway. Gene fusion analysis has been used to identify protein functional links in prokaryotes as well as in eukaryotic model organisms, such as yeast and Drosophila. Results In this study we have extended this approach to include a number of recently sequenced protists, four of which are pathogenic, to identify fusion linked proteins in Trypanosoma brucei, the causative agent of African sleeping sickness. We have also examined the evolution of the gene fusion events identified, to determine whether they can be attributed to fusion or fission, by looking at the conservation of the fused genes and of the individual component genes across the major eukaryotic and prokaryotic lineages. We find relatively limited occurrence of gene fusions/fissions within the protist lineages examined. Our results point to two trypanosome-specific gene fissions, which have recently been experimentally confirmed, one fusion involving proteins involved in the same metabolic pathway, as well as two novel putative functional links between fusion-linked protein pairs. Conclusions This is the first study of protein functional links in T. brucei identified by gene fusion analysis. We have used strict thresholds and only discuss results which are highly likely to be genuine and which either have already been or can be experimentally verified. We discuss the possible impact of the identification of these novel putative protein-protein interactions, to the development of new trypanosome therapeutic drugs.
Association of vitamin D receptor BsmI gene polymorphism with risk of tuberculosis: a meta-analysis of 15 studies.

Directory of Open Access Journals (Sweden)

Yu-Jiao Wu

Full Text Available BACKGROUND: Genetic variations in vitamin D receptor (VDR) may contribute to tuberculosis (TB) risk. Many studies have investigated the association between VDR BsmI gene polymorphism and TB risk, but yielded inconclusive results. METHODOLOGY/PRINCIPAL FINDINGS: We performed a comprehensive meta-analysis of 15 publications with a total of 2309 cases and 3568 controls. We assessed the strength of the association between VDR BsmI gene polymorphism and TB risk and performed sub-group analyses by ethnicity, sample size and Hardy-Weinberg equilibrium (HWE). We found a statistically significant correlation between VDR BsmI gene polymorphism and decreased TB risk in four comparison models: allele model (b vs. B: OR = 0.78, 95% CI = 0.67, 0.89; P_heterogeneity = 0.004), homozygote model (bb vs. BB: OR = 0.61, 95% CI = 0.43, 0.87; P_heterogeneity = 0.001), recessive model (bb vs. Bb+BB: OR = 0.70, 95% CI = 0.56, 0.88; P_heterogeneity = 0.005) and dominant model (bb+Bb vs. BB: OR = 0.77, 95% CI = 0.61, 0.97; P_heterogeneity = 0.010), especially in studies based on Asian population. Sub-group analyses also revealed that there was a statistically decreased TB risk in "small" studies (0.5. Meta-regression and stratification analysis both showed that the ethnicity and sample size contributed to heterogeneity. CONCLUSIONS: This meta-analysis suggests that VDR BsmI gene polymorphism is associated with a significant decreased TB risk, especially in Asian population.
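The odds ratios and 95% confidence intervals quoted in this record are the standard summary of a case-control comparison. As a minimal sketch of how one such value is obtained from a single 2x2 table, the following uses the usual log-odds (Woolf) interval. The function name and the counts are hypothetical, chosen only for illustration. They are not data from the meta-analysis, and the pooling across studies that a meta-analysis performs is not shown.

```python
import math

def odds_ratio_ci(case_exp, case_unexp, ctrl_exp, ctrl_unexp, z=1.96):
    """Odds ratio and Woolf (log-scale) confidence interval for a 2x2 table.

    case_exp / case_unexp: cases with / without the allele of interest
    ctrl_exp / ctrl_unexp: controls with / without the allele of interest
    z: normal quantile (1.96 gives an approximate 95% interval)
    """
    odds_ratio = (case_exp * ctrl_unexp) / (case_unexp * ctrl_exp)
    # Standard error of log(OR) is the root of the summed reciprocal counts.
    se_log_or = math.sqrt(
        1 / case_exp + 1 / case_unexp + 1 / ctrl_exp + 1 / ctrl_unexp
    )
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper

# Hypothetical counts, for illustration only.
or_value, ci_low, ci_high = odds_ratio_ci(40, 60, 50, 50)
print(f"OR = {or_value:.2f}, 95% CI = {ci_low:.2f}, {ci_high:.2f}")
```

An interval that lies entirely below 1, as in the four models reported above, is what supports the reading of decreased risk. An interval that spans 1 does not.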
Transcriptomic network analysis of micronuclei-related genes: a case study

DEFF Research Database (Denmark)

van Leeuwen, D. M.; Pedersen, Marie; Knudsen, Lisbeth E.

2011-01-01

Mechanistically relevant information on responses of humans to xenobiotic exposure in relation to chemically induced biological effects, such as micronuclei (MN) formation, can be obtained through large-scale transcriptomics studies. Network analysis may enhance the analysis and visualisation of such data. Therefore, this study aimed to develop a 'MN formation' network based on a priori knowledge, by using the pathway tool MetaCore. The gene network contained 27 genes and three gene complexes that are related to processes involved in MN formation, e.g. spindle assembly checkpoint, cell cycle checkpoint and aneuploidy. The MN-related gene network was tested against a transcriptomics case study associated with MN measurements. In this case study, transcriptomic data from children and adults differentially exposed to ambient air pollution in the Czech Republic were analysed and visualised...
Genome-wide analysis of the cellulose synthase-like (Csl) gene family in bread wheat (Triticum aestivum L.).

Science.gov (United States)

Kaur, Simerjeet; Dhugga, Kanwarpal S; Beech, Robin; Singh, Jaswinder

2017-11-03

Hemicelluloses are a diverse group of complex, non-cellulosic polysaccharides, which constitute approximately one-third of the plant cell wall and find use as dietary fibres, food additives and raw materials for biofuels. Genes involved in hemicellulose synthesis have not been extensively studied in small grain cereals. In efforts to isolate the sequences for the cellulose synthase-like (Csl) gene family from wheat, we identified 108 genes (hereafter referred to as TaCsl). Each gene was represented by two to three homeoalleles, which are named as TaCslXY_ZA, TaCslXY_ZB, or TaCslXY_ZD, where X denotes the Csl subfamily, Y the gene number and Z the wheat chromosome where it is located. A quarter of these genes were predicted to have 2 to 3 splice variants, resulting in a total of 137 putative translated products. Approximately 45% of TaCsl genes were located on chromosomes 2 and 3. Sequences from the subfamilies C and D were interspersed between the dicots and grasses but those from subfamily A clustered within each group of plants. Proximity of the dicot-specific subfamilies B and G, to the grass-specific subfamilies H and J, respectively, points to their common origin. In silico expression analysis in different tissues revealed that most of the genes were expressed ubiquitously and some were tissue-specific. More than half of the genes had introns in phase 0, one-third in phase 2, and a few in phase 1. Detailed characterization of the wheat Csl genes has enhanced the understanding of their structural, functional, and evolutionary features. This information will be helpful in designing experiments for genetic manipulation of hemicellulose synthesis with the goal of developing improved cultivars for biofuel production and increased tolerance against various stresses.
Phylogenomic analysis of UDP glycosyltransferase 1 multigene family in Linum usitatissimum identified genes with varied expression patterns

Science.gov (United States)

2012-01-01

Background The glycosylation process, catalyzed by ubiquitous glycosyltransferase (GT) family enzymes, is a prevalent modification of plant secondary metabolites that regulates various functions such as hormone homeostasis, detoxification of xenobiotics and biosynthesis and storage of secondary metabolites. Flax (Linum usitatissimum L.) is a commercially grown oilseed crop, important because of its essential fatty acids and health promoting lignans. Identification and characterization of UDP glycosyltransferase (UGT) genes from flax could provide valuable basic information about this important gene family and help to explain the seed specific glycosylated metabolite accumulation and other processes in plants. Plant genome sequencing projects are useful to discover complexity within this gene family and also pave way for the development of functional genomics approaches. Results Taking advantage of the newly assembled draft genome sequence of flax, we identified 137 UDP glycosyltransferase (UGT) genes from flax using a conserved signature motif. Phylogenetic analysis of these protein sequences clustered them into 14 major groups (A-N). Expression patterns of these genes were investigated using publicly available expressed sequence tag (EST), microarray data and reverse transcription quantitative real time PCR (RT-qPCR). Seventy-three per cent of these genes (100 out of 137) showed expression evidence in 15 tissues examined and indicated varied expression profiles. The RT-qPCR results of 10 selected genes were also coherent with the digital expression analysis. Interestingly, five duplicated UGT genes were identified, which showed differential expression in various tissues. Of the seven intron loss/gain positions detected, two intron positions were conserved among most of the UGTs, although a clear relationship about the evolution of these genes could not be established. Comparison of the flax UGTs with orthologs from four other sequenced dicot genomes indicated that
Phylogenomic analysis of UDP glycosyltransferase 1 multigene family in Linum usitatissimum identified genes with varied expression patterns

Directory of Open Access Journals (Sweden)

Barvkar Vitthal T

2012-05-01

Full Text Available Abstract Background The glycosylation process, catalyzed by ubiquitous glycosyltransferase (GT) family enzymes, is a prevalent modification of plant secondary metabolites that regulates various functions such as hormone homeostasis, detoxification of xenobiotics and biosynthesis and storage of secondary metabolites. Flax (Linum usitatissimum L.) is a commercially grown oilseed crop, important because of its essential fatty acids and health promoting lignans. Identification and characterization of UDP glycosyltransferase (UGT) genes from flax could provide valuable basic information about this important gene family and help to explain the seed specific glycosylated metabolite accumulation and other processes in plants. Plant genome sequencing projects are useful to discover complexity within this gene family and also pave way for the development of functional genomics approaches. Results Taking advantage of the newly assembled draft genome sequence of flax, we identified 137 UDP glycosyltransferase (UGT) genes from flax using a conserved signature motif. Phylogenetic analysis of these protein sequences clustered them into 14 major groups (A-N). Expression patterns of these genes were investigated using publicly available expressed sequence tag (EST), microarray data and reverse transcription quantitative real time PCR (RT-qPCR). Seventy-three per cent of these genes (100 out of 137) showed expression evidence in 15 tissues examined and indicated varied expression profiles. The RT-qPCR results of 10 selected genes were also coherent with the digital expression analysis. Interestingly, five duplicated UGT genes were identified, which showed differential expression in various tissues. Of the seven intron loss/gain positions detected, two intron positions were conserved among most of the UGTs, although a clear relationship about the evolution of these genes could not be established. Comparison of the flax UGTs with orthologs from four other sequenced dicot
Sequencing and analysis of the gene-rich space of cowpea

Directory of Open Access Journals (Sweden)

Cheung Foo

2008-02-01

Full Text Available Abstract Background Cowpea, Vigna unguiculata (L.) Walp., is one of the most important food and forage legumes in the semi-arid tropics because of its drought tolerance and ability to grow on poor quality soils. Approximately 80% of cowpea production takes place in the dry savannahs of tropical West and Central Africa, mostly by poor subsistence farmers. Despite its economic and social importance in the developing world, cowpea remains to a large extent an underexploited crop. Among the major goals of cowpea breeding and improvement programs is the stacking of desirable agronomic traits, such as disease and pest resistance and response to abiotic stresses. Implementation of marker-assisted selection and breeding programs is severely limited by a paucity of trait-linked markers and a general lack of information on gene structure and organization. With a nuclear genome size estimated at ~620 Mb, the cowpea genome is an ideal target for reduced representation sequencing. Results We report here the sequencing and analysis of the gene-rich, hypomethylated portion of the cowpea genome selectively cloned by methylation filtration (MF) technology. Over 250,000 gene-space sequence reads (GSRs) with an average length of 610 bp were generated, yielding ~160 Mb of sequence information. The GSRs were assembled, annotated by BLAST homology searches of four public protein annotation databases and four plant proteomes (A. thaliana, M. truncatula, O. sativa, and P. trichocarpa), and analyzed using various domain and gene modeling tools. A total of 41,260 GSR assemblies and singletons were annotated, of which 19,786 have unique GenBank accession numbers. Within the GSR dataset, 29% of the sequences were annotated using the Arabidopsis Gene Ontology (GO), with the largest categories of assigned function being catalytic activity and metabolic processes, groups that include the majority of cellular enzymes and components of amino acid, carbohydrate and lipid metabolism. A
Transcriptome Analysis Reveals Regulation of Gene Expression for Lipid Catabolism in Young Broilers by Butyrate Glycerides

Science.gov (United States)

Yin, Fugui; Yu, Hai; Lepp, Dion; Shi, Xuejiang; Yang, Xiaojian; Hu, Jielun; Leeson, Steve; Yang, Chengbo; Nie, Shaoping; Hou, Yongqing; Gong, Joshua

2016-01-01

Background & Aims Butyrate has been shown to potently regulate energy expenditure and lipid metabolism in animals, yet the underlying mechanisms remain to be fully understood. The aim of this study was to investigate the molecular mechanisms of butyrate (in the form of butyrate glycerides, BG)-induced lipid metabolism at the level of gene expression in the jejunum and liver of broilers. Methodology/Principal Findings Two animal experiments were included in this study. In Experiment 1, two hundred and forty male broiler chickens were equally allocated into two groups: 1) basal diet (BD), 2) BG diets (BD + BG). Growth performance was compared between treatments for the 41-day trial. In Experiment 2, forty male broiler chickens were equally allocated into two groups. The general experimental design, group and management were the same as described in Experiment 1 except for reduced bird numbers and 21-day duration of the trial. Growth performance, abdominal fat deposition, serum lipid profiles as well as serum and tissue concentrations of key enzymes involved in lipid metabolism were compared between treatments. RNA-seq was employed to identify both differentially expressed genes (DEGs) and treatment specifically expressed genes (TSEGs). Functional clustering of DEGs and TSEGs and signaling pathways associated with lipid metabolism were identified using Ingenuity Pathways Analysis (IPA) and DAVID Bioinformatics Resources 6.7 (DAVID-BR). Quantitative PCR (qPCR) assays were subsequently conducted to further examine the expression of genes in the peroxisome proliferator-activated receptors (PPAR) signaling pathway identified by DAVID-BR. Dietary BG intervention significantly reduced abdominal fat ratio (abdominal fat weight/final body weight) in broilers. The decreased fat deposition in BG-fed chickens was in accordance with serum lipid profiles as well as the level of lipid metabolism-related enzymes in the serum, abdominal adipose, jejunum and liver. RNA-seq analysis
Functional Module Analysis for Gene Coexpression Networks with Network Integration.

Science.gov (United States)

Zhang, Shuqin; Zhao, Hongyu; Ng, Michael K

2015-01-01

Network has been a general tool for studying the complex interactions between different genes, proteins, and other small molecules. Module as a fundamental property of many biological networks has been widely studied and many computational methods have been proposed to identify the modules in an individual network. However, in many cases, a single network is insufficient for module analysis due to the noise in the data or the tuning of parameters when building the biological network. The availability of a large amount of biological networks makes network integration study possible. By integrating such networks, more informative modules for some specific disease can be derived from the networks constructed from different tissues, and consistent factors for different diseases can be inferred. In this paper, we have developed an effective method for module identification from multiple networks under different conditions. The problem is formulated as an optimization model, which combines the module identification in each individual network and alignment of the modules from different networks together. An approximation algorithm based on eigenvector computation is proposed. Our method outperforms the existing methods, especially when the underlying modules in multiple networks are different in simulation studies. We also applied our method to two groups of gene coexpression networks for humans, which include one for three different cancers, and one for three tissues from the morbidly obese patients. We identified 13 modules with three complete subgraphs, and 11 modules with two complete subgraphs, respectively. The modules were validated through Gene Ontology enrichment and KEGG pathway enrichment analysis. We also showed that the main functions of most modules for the corresponding disease have been addressed by other researchers, which may provide the theoretical basis for further studying the modules experimentally.
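The abstract above mentions an approximation algorithm based on eigenvector computation. The authors' multi-network optimization model is not described in enough detail here to reproduce, so the following is only a sketch of a related and much simpler idea: spectral bisection of a single network, where the sign pattern of the second-smallest eigenvector of the graph Laplacian (the Fiedler vector) splits the nodes into two modules. The toy adjacency matrix is an assumption made for illustration.

```python
import numpy as np

# Toy undirected network (hypothetical): two triangles, nodes 0-2 and 3-5,
# joined by a single edge between node 2 and node 3.
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
n_nodes = 6
adjacency = np.zeros((n_nodes, n_nodes))
for i, j in edges:
    adjacency[i, j] = adjacency[j, i] = 1.0

# Unnormalized graph Laplacian L = D - A.
laplacian = np.diag(adjacency.sum(axis=1)) - adjacency

# eigh returns eigenvalues in ascending order for symmetric matrices;
# column 1 is the eigenvector of the second-smallest eigenvalue.
eigenvalues, eigenvectors = np.linalg.eigh(laplacian)
fiedler_vector = eigenvectors[:, 1]

# Nodes with the same sign in the Fiedler vector fall in the same module.
module_labels = (fiedler_vector > 0).astype(int)
print(module_labels)
```

Which triangle receives label 0 and which receives label 1 depends on the arbitrary sign of the eigenvector, so only the grouping is meaningful, not the label values.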
Association between variations in the disrupted in schizophrenia 1 gene and schizophrenia: A meta-analysis.

Science.gov (United States)

Xu, Yiliang; Ren, Jun; Ye, Haihong

2018-04-20

Schizophrenia is a severe psychiatric disorder. Genetic and functional studies have strongly implicated the disrupted in schizophrenia 1 gene (DISC1) as a candidate susceptibility gene for schizophrenia. Moreover, recent association studies have indicated that several DISC1 single nucleotide polymorphisms (SNPs) are associated with schizophrenia. However, the association has rarely been replicated across different ethnic groups. Here, we performed a meta-analysis of the association between DISC1 SNPs and schizophrenia in which the samples were divided into subgroups according to ethnicity. Neither rs3738401 nor rs821616 showed a significant association with schizophrenia in the Caucasian, Asian, Japanese or Han Chinese populations. Copyright © 2018 Elsevier B.V. All rights reserved.
uvsI mutants defective in UV mutagenesis define a fourth epistatic group of uvs genes in Aspergillus.

Science.gov (United States)

Chae, S K; Kafer, E

1993-01-01

Three UV-sensitive mutations of A. nidulans, uvsI, uvsJ and uvsA, were tested for epistatic relationships with members of the previously established groups, here called the "UvsF", "UvsC", and "UvsB" groups. uvsI mutants are defective for spontaneous and induced reversion of certain point mutations and differ also for other properties from previously analyzed uvs types. They are very sensitive to the killing effects of UV-light and 4-NQO (4-nitro-quinoline-N-oxide) but not to MMS (methylmethane sulfonate). When double- and single-mutant uvs strains were compared for sensitivity to these three agents, synergistic or additive effects were found for uvsI with all members of the three groups. The uvsI gene may therefore represent a fourth epistatic group, possibly involved in mutagenic repair. On the other hand, uvsJ was clearly epistatic with members of the UvsF group and fitted well into this group also by phenotype. The uvsA gene was tentatively assigned to the UvsC group. uvsA showed epistatic interactions with uvsC in all tests, and like UvsC-group mutants is UV-sensitive mainly in dividing cells. However, the uvsA mutation does not cause the defects in recombination and UV mutagenesis typical for this group.
Molecular characterization, sequence analysis and tissue expression of a porcine gene – MOSPD2

Directory of Open Access Journals (Sweden)

Yang Jie

2017-01-01

Full Text Available The full-length cDNA sequence of a porcine gene, MOSPD2, was amplified using the rapid amplification of cDNA ends method based on a pig expressed sequence tag sequence which was highly homologous to the coding sequence of the human MOSPD2 gene. Sequence prediction analysis revealed that the open reading frame of this gene encodes a protein of 491 amino acids that has high homology with the motile sperm domain-containing protein 2 (MOSPD2) of five species: horse (89%), human (90%), chimpanzee (89%), rhesus monkey (89%) and mouse (85%); thus, it could be defined as a porcine MOSPD2 gene. This novel porcine gene was assigned GeneID: 100153601. This gene is structured in 15 exons and 14 introns as revealed by computer-assisted analysis. The phylogenetic analysis revealed that the porcine MOSPD2 gene has a closer genetic relationship with the MOSPD2 gene of horse. Tissue expression analysis indicated that the porcine MOSPD2 gene is generally and differentially expressed in the spleen, muscle, skin, kidney, lung, liver, fat and heart. Our experiment is the first to establish the primary foundation for further research on the porcine MOSPD2 gene.
Identification of clinically relevant nonhemolytic Streptococci on the basis of sequence analysis of 16S-23S intergenic spacer region and partial gdh gene

DEFF Research Database (Denmark)

Nielsen, Xiaohui Chen; Justesen, Ulrik Stenz; Dargis, Rimtas

2009-01-01

Nonhemolytic streptococci (NHS) cause serious infections, such as endocarditis and septicemia. Many conventional phenotypic methods are insufficient for the identification of bacteria in this group to the species level. Genetic analysis has revealed that single-gene analysis is insufficient...
Capturing heterogeneity in gene expression studies by surrogate variable analysis.

Directory of Open Access Journals (Sweden)

Jeffrey T Leek

2007-09-01

Full Text Available It has unambiguously been shown that genetic, environmental, demographic, and technical factors may have substantial effects on gene expression levels. In addition to the measured variable(s) of interest, there will tend to be sources of signal due to factors that are unknown, unmeasured, or too complicated to capture through simple models. We show that failing to incorporate these sources of heterogeneity into an analysis can have widespread and detrimental effects on the study. Not only can this reduce power or induce unwanted dependence across genes, but it can also introduce sources of spurious signal to many genes. This phenomenon is true even for well-designed, randomized studies. We introduce "surrogate variable analysis" (SVA) to overcome the problems caused by heterogeneity in expression studies. SVA can be applied in conjunction with standard analysis techniques to accurately capture the relationship between expression and any modeled variables of interest. We apply SVA to disease class, time course, and genetics of gene expression studies. We show that SVA increases the biological accuracy and reproducibility of analyses in genome-wide expression studies.
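The core intuition behind capturing unmodeled heterogeneity can be illustrated with a deliberately simplified sketch: regress each gene on the modeled variable, then look for dominant shared structure in the residuals with a singular value decomposition. This is not the SVA procedure itself, whose published form involves further steps that are omitted here. The simulated data, the hidden `batch` factor and all variable names are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n_genes, n_samples = 200, 20

# Modeled variable of interest (e.g. disease class) and a hidden,
# unmeasured factor; both are simulated, hypothetical covariates.
group = np.repeat([0.0, 1.0], 10)
batch = np.tile([0.0, 1.0], 10)

# Simulated expression matrix (genes x samples): noise + group + hidden batch.
expression = (
    rng.normal(0.0, 0.5, (n_genes, n_samples))
    + np.outer(rng.normal(0.0, 1.0, n_genes), group)
    + np.outer(rng.normal(0.0, 2.0, n_genes), batch)
)

# Step 1: remove the modeled variable gene by gene with least squares.
design = np.column_stack([np.ones(n_samples), group])
coefficients, *_ = np.linalg.lstsq(design, expression.T, rcond=None)
residuals = expression.T - design @ coefficients  # samples x genes

# Step 2: the leading left singular vector of the residuals is a
# sample-level pattern shared across many genes; use it as a surrogate.
left_vectors, singular_values, _ = np.linalg.svd(residuals, full_matrices=False)
surrogate = left_vectors[:, 0]

# In this simulation the surrogate should track the hidden factor closely.
correlation = np.corrcoef(surrogate, batch)[0, 1]
print(f"|correlation with hidden factor| = {abs(correlation):.2f}")
```

In a real analysis the estimated surrogate would then be added as a covariate to the per-gene model alongside the variable of interest.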
Analysis of Pigeon (Columba) Ovary Transcriptomes to Identify Genes Involved in Blue Light Regulation

Science.gov (United States)

Wang, Ying; Ding, Jia-tong; Yang, Hai-ming; Yan, Zheng-jie; Cao, Wei; Li, Yang-bai

2015-01-01

Monochromatic light is widely applied to promote poultry reproductive performance, yet little is currently known regarding the mechanism by which light wavelengths affect pigeon reproduction. Recently, high-throughput sequencing technologies have been used to provide genomic information for solving this problem. In this study, we employed Illumina Hiseq 2000 to identify differentially expressed genes in ovary tissue from pigeons under blue and white light conditions and de novo transcriptome assembly to construct a comprehensive sequence database containing information on the mechanisms of follicle development. A total of 157,774 unigenes (mean length: 790 bp) were obtained by the Trinity program, and 35.83% of these unigenes were matched to genes in a non-redundant protein database. Gene description, gene ontology, and the clustering of orthologous group terms were performed to annotate the transcriptome assembly. Differentially expressed genes between blue and white light conditions included those related to oocyte maturation, hormone biosynthesis, and circadian rhythm. Furthermore, 17,574 SSRs and 533,887 potential SNPs were identified in this transcriptome assembly. This work is the first transcriptome analysis of the Columba ovary using Illumina technology, and the resulting transcriptome and differentially expressed gene data can facilitate further investigations into the molecular mechanism of the effect of blue light on follicle development and reproduction in pigeons and other bird species. PMID:26599806
Analysis of Pigeon (Columba) Ovary Transcriptomes to Identify Genes Involved in Blue Light Regulation.

Directory of Open Access Journals (Sweden)

Ying Wang

Full Text Available Monochromatic light is widely applied to promote poultry reproductive performance, yet little is currently known regarding the mechanism by which light wavelengths affect pigeon reproduction. Recently, high-throughput sequencing technologies have been used to provide genomic information for solving this problem. In this study, we employed Illumina Hiseq 2000 to identify differentially expressed genes in ovary tissue from pigeons under blue and white light conditions and de novo transcriptome assembly to construct a comprehensive sequence database containing information on the mechanisms of follicle development. A total of 157,774 unigenes (mean length: 790 bp) were obtained by the Trinity program, and 35.83% of these unigenes were matched to genes in a non-redundant protein database. Gene description, gene ontology, and the clustering of orthologous group terms were performed to annotate the transcriptome assembly. Differentially expressed genes between blue and white light conditions included those related to oocyte maturation, hormone biosynthesis, and circadian rhythm. Furthermore, 17,574 SSRs and 533,887 potential SNPs were identified in this transcriptome assembly. This work is the first transcriptome analysis of the Columba ovary using Illumina technology, and the resulting transcriptome and differentially expressed gene data can facilitate further investigations into the molecular mechanism of the effect of blue light on follicle development and reproduction in pigeons and other bird species.
Genome-wide profiling of 24 hr diel rhythmicity in the water flea, Daphnia pulex: network analysis reveals rhythmic gene expression and enhances functional gene annotation.

Science.gov (United States)

Rund, Samuel S C; Yoo, Boyoung; Alam, Camille; Green, Taryn; Stephens, Melissa T; Zeng, Erliang; George, Gary F; Sheppard, Aaron D; Duffield, Giles E; Milenković, Tijana; Pfrender, Michael E

2016-08-18

Marine and freshwater zooplankton exhibit daily rhythmic patterns of behavior and physiology which may be regulated directly by the light:dark (LD) cycle and/or a molecular circadian clock. One of the best-studied zooplankton taxa, the freshwater crustacean Daphnia, has a 24 h diel vertical migration (DVM) behavior whereby the organism travels up and down through the water column daily. DVM plays a critical role in resource tracking and the behavioral avoidance of predators and damaging ultraviolet radiation. However, there is little information at the transcriptional level linking the expression patterns of genes to the rhythmic physiology/behavior of Daphnia. Here we analyzed genome-wide temporal transcriptional patterns from Daphnia pulex collected over a 44 h time period under a 12:12 LD cycle (diel) conditions using a cosine-fitting algorithm. We used a comprehensive network modeling and analysis approach to identify novel co-regulated rhythmic genes that have similar network topological properties and functional annotations as rhythmic genes identified by the cosine-fitting analyses. Furthermore, we used the network approach to predict with high accuracy novel gene-function associations, thus enhancing current functional annotations available for genes in this ecologically relevant model species. Our results reveal that genes in many functional groupings exhibit 24 h rhythms in their expression patterns under diel conditions. We highlight the rhythmic expression of immunity, oxidative detoxification, and sensory process genes. We discuss differences in the chronobiology of D. pulex from other well-characterized terrestrial arthropods. This research adds to a growing body of literature suggesting the genetic mechanisms governing rhythmicity in crustaceans may be divergent from other arthropod lineages including insects. Lastly, these results highlight the power of using a network analysis approach to identify differential gene expression and provide novel
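The record above refers to a cosine-fitting algorithm for detecting 24 h rhythms but does not say which implementation was used. A common way to fit a cosine of known period is linear least squares on cosine and sine terms (often called a cosinor fit), sketched below. The 4 h sampling interval, the simulated expression values and the variable names are assumptions, not details from the study.

```python
import numpy as np

period_h = 24.0
omega = 2.0 * np.pi / period_h

# Hypothetical sampling every 4 h across a 44 h window.
times_h = np.arange(0.0, 44.0, 4.0)

# Simulated expression of one gene: mean 10, amplitude 3, peak at hour 6.
rng = np.random.default_rng(1)
expression = (
    10.0
    + 3.0 * np.cos(omega * (times_h - 6.0))
    + rng.normal(0.0, 0.2, times_h.size)
)

# y = mesor + a*cos(wt) + b*sin(wt) is linear in (mesor, a, b).
design = np.column_stack(
    [np.ones_like(times_h), np.cos(omega * times_h), np.sin(omega * times_h)]
)
(mesor, a, b), *_ = np.linalg.lstsq(design, expression, rcond=None)

# Convert the two linear coefficients to amplitude and time of peak.
amplitude = np.hypot(a, b)
peak_time_h = (np.arctan2(b, a) / omega) % period_h

print(f"mesor={mesor:.2f} amplitude={amplitude:.2f} peak at {peak_time_h:.1f} h")
```

A gene would typically be called rhythmic only if the fitted amplitude is large relative to the residual scatter, which requires a significance test not shown here.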

Genome-Wide Analysis of the RNA Helicase Gene Family in Gossypium raimondii

Directory of Open Access Journals (Sweden)

Jie Chen

2014-03-01

Full Text Available The RNA helicases, which help to unwind stable RNA duplexes, and have important roles in RNA metabolism, belong to a class of motor proteins that play important roles in plant development and responses to stress. Although this family of genes has been the subject of systematic investigation in Arabidopsis, rice, and tomato, it has not yet been characterized in cotton. In this study, we identified 161 putative RNA helicase genes in the genome of the diploid cotton species Gossypium raimondii. We classified these genes into three subfamilies, based on the presence of either a DEAD-box (51 genes), DEAH-box (52 genes), or DExD/H-box (58 genes) in their coding regions. Chromosome location analysis showed that the genes that encode RNA helicases are distributed across all 13 chromosomes of G. raimondii. Syntenic analysis revealed that 62 of the 161 G. raimondii helicase genes (38.5%) are within the identified syntenic blocks. Sixty-six (40.99%) helicase genes from G. raimondii have one or several putative orthologs in tomato. Additionally, GrDEADs have more conserved gene structures and more simple domains than GrDEAHs and GrDExD/Hs. Transcriptome sequencing data demonstrated that many of these helicases, especially GrDEADs, are highly expressed at the fiber initiation stage and in mature leaves. To our knowledge, this is the first report of a genome-wide analysis of the RNA helicase gene family in cotton.
Finding Combination of Features from Promoter Regions for Ovarian Cancer-related Gene Group Classification

KAUST Repository

Olayan, Rawan S.

2012-01-01

In classification problems, it is always important to use the suitable combination of features that will be employed by classifiers. Generating the right combination of features usually results in good classifiers. In the situation when the problem is not well understood, data items are usually described by many features in the hope that some of these may be the relevant or most relevant ones. In this study, we focus on one such problem related to genes implicated in ovarian cancer (OC). We try to recognize two important OC-related gene groups: oncogenes, which support the development and progression of OC, and oncosuppressors, which oppose such tendencies. For this, we use the properties of promoters of these genes. We identified potential “regulatory features” that characterize OC-related oncogenes and oncosuppressors promoters. In our study, we used 211 oncogenes and 39 oncosuppressors. For these, we identified 538 characteristic sequence motifs from their promoters. Promoters are annotated by these motifs and derived feature vectors used to develop classification models. We made a comparison of a number of classification models in their ability to distinguish oncogenes from oncosuppressors. Based on 10-fold cross-validation, the resultant model was able to separate the two classes with sensitivity of 96% and specificity of 100% with the complete set of features. Moreover, we developed another recognition model where we attempted to distinguish oncogenes and oncosuppressors as one group from other OC-related genes. That model achieved accuracy of 82%. We believe that the results of this study will help in discovering other OC-related oncogenes and oncosuppressors not identified as yet.
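Sensitivity and specificity, as reported above, are computed from the counts of correct and incorrect predictions in each class. The sketch below shows the calculation on a small set of made-up labels. Which class the study treated as positive is not stated in the abstract, so treating oncogenes as the positive class is an assumption, and the labels are illustrative rather than the study's predictions.

```python
def sensitivity_specificity(true_labels, predicted_labels, positive):
    """Return (sensitivity, specificity) for a two-class problem.

    sensitivity = TP / (TP + FN): share of positives that were recovered
    specificity = TN / (TN + FP): share of negatives that were recovered
    """
    pairs = list(zip(true_labels, predicted_labels))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    return tp / (tp + fn), tn / (tn + fp)

# Hypothetical labels: 5 oncogenes and 3 oncosuppressors, one oncogene missed.
true_labels = ["oncogene"] * 5 + ["suppressor"] * 3
predicted_labels = ["oncogene"] * 4 + ["suppressor"] * 4

sensitivity, specificity = sensitivity_specificity(
    true_labels, predicted_labels, positive="oncogene"
)
print(f"sensitivity={sensitivity:.2f} specificity={specificity:.2f}")
```

With strongly unbalanced classes, such as the 211 oncogenes and 39 oncosuppressors used in the study, reporting both measures is more informative than overall accuracy, because a classifier that always predicts the larger class would still score high on accuracy.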
Finding Combination of Features from Promoter Regions for Ovarian Cancer-related Gene Group Classification

KAUST Repository

Olayan, Rawan S.

2012-12-01

In classification problems, it is always important to use the suitable combination of features that will be employed by classifiers. Generating the right combination of features usually results in good classifiers. In the situation when the problem is not well understood, data items are usually described by many features in the hope that some of these may be the relevant or most relevant ones. In this study, we focus on one such problem related to genes implicated in ovarian cancer (OC). We try to recognize two important OC-related gene groups: oncogenes, which support the development and progression of OC, and oncosuppressors, which oppose such tendencies. For this, we use the properties of promoters of these genes. We identified potential “regulatory features” that characterize OC-related oncogenes and oncosuppressors promoters. In our study, we used 211 oncogenes and 39 oncosuppressors. For these, we identified 538 characteristic sequence motifs from their promoters. Promoters are annotated by these motifs and derived feature vectors used to develop classification models. We made a comparison of a number of classification models in their ability to distinguish oncogenes from oncosuppressors. Based on 10-fold cross-validation, the resultant model was able to separate the two classes with sensitivity of 96% and specificity of 100% with the complete set of features. Moreover, we developed another recognition model where we attempted to distinguish oncogenes and oncosuppressors as one group from other OC-related genes. That model achieved accuracy of 82%. We believe that the results of this study will help in discovering other OC-related oncogenes and oncosuppressors not identified as yet.
Analysis of new lactotransferrin gene variants in a case-control study related to periodontal disease in dog.

Science.gov (United States)

Morinha, Francisco; Albuquerque, Carlos; Requicha, João; Dias, Isabel; Leitão, José; Gut, Ivo; Guedes-Pinto, Henrique; Viegas, Carlos; Bastos, Estela

2012-04-01

Molecular and genetic research has contributed to a better understanding of periodontal disease (PD) in humans and has shown that many genes play a role in the predisposition to and progression of this complex disease. Variations in the human lactotransferrin (LTF) gene appear to affect the anti-microbial functions of this molecule, influencing PD susceptibility. PD is also a major health problem in small animal practice, being the most common inflammatory disease found in dogs. Nevertheless, genetic predisposition to PD is an unexplored subject in this species. This work aims to contribute to the characterization of the genetic basis of canine PD. In order to identify genetic variations and verify their association with PD, a molecular analysis of the LTF gene was performed in a case-control approach, including 40 dogs in the PD cases group and 50 dogs in the control group. Eight new single nucleotide variations in the dog LTF gene were detected and characterized in this study. Genotype and allele frequencies of these variations showed no statistically significant differences between the control and PD cases groups. Our data do not give evidence for the contribution of these LTF variations to the genetic background of canine PD. Nevertheless, the sequence variant L/15_g.411C > T leads to an amino acid change (Proline to Leucine) and was predicted to be possibly damaging to the LTF protein. Further investigations would be of extreme value to clarify the biological importance of these new findings.
Molecular bases of the ABO blood groups of Indians from the Brazilian Amazon region.

Science.gov (United States)

Franco, R F; Simões, B P; Guerreiro, J F; Santos, S E; Zago, M A

1994-01-01

Phenotype studies of ABO blood groups in most Amerindian populations revealed the exclusive presence of group O. Since group O is the result of the absence of glycosyltransferase activity, its molecular bases may be heterogeneous. We carried out ABO blood group genotyping by analysis of DNA of 30 Indians from 2 Amazonian tribes (Yanomami and Arara), and compared the findings with other populations (Caucasians and Blacks). Two segments of the glycosyltransferase gene were amplified by PCR and digested with KpnI or AluI to detect deletion or base change at positions 258 and 700, respectively. For all subjects, the gene basis of blood group O is the deletion of a single nucleotide at position 258 of the glycosyltransferase A gene, similar to that observed in Caucasoids and Negroids. DNA sequencing of limited regions of the gene supports this conclusion. This finding does not exclude, however, that a heterogeneity of the O allele may be revealed by a more extensive analysis.
Genetic analysis and gene mapping of a low stigma exposed mutant gene by high-throughput sequencing.

Directory of Open Access Journals (Sweden)

Xiao Ma

Rice is one of the main food crops and several studies have examined the molecular mechanism of the exposure of the rice plant stigma. The improvement in the exposure of the stigma in female parent hybrid combinations can enhance the efficiency of hybrid breeding. In the present study, a mutant plant with low exposed stigma (lesr) was discovered among the descendants of the indica thermo-sensitive sterile line 115S. The ES% rate of the mutant decreased by 70.64% compared with the wild type variety. The F2 population was established for genetic analysis, using the mutant as the female parent and the restorer line 93S as the male parent. The results indicated a normal F1 population, while a clear division was noted between the high and low exposed stigma groups in the F2 population. The low exposed stigma group accounted for 25% of the F2 population. This was in agreement with the ratio of 3:1, which indicated that the mutant was controlled by a recessive main-effect QTL locus, temporarily named LESR. Genome-wide comparison of the SNP profiles between the high and low exposed stigma bulks constructed from F2 plants was performed using bulked segregant analysis in combination with high-throughput sequencing technology. The results demonstrated that the candidate locus was located on chromosome 10 of rice. Following screening of the recombinant rice plants with newly developed molecular markers, the genetic region was narrowed down to 0.25 Mb. This region was flanked by InDel-2 and InDel-2 at the physical location from 13.69 to 13.94 Mb. Within this region, 7 genes indicated base differences between parents. A total of 2 genes exhibited differences at the coding region and upstream of the coding region, respectively. The present study aimed to further clone the LESR gene, verify its function and identify the stigma variation.
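Agreement with a 3:1 F2 segregation ratio is conventionally checked with a chi-square goodness-of-fit test. The abstract does not report plant counts or name the test used, so the following is only a minimal sketch with hypothetical counts; the function name is likewise illustrative.

```python
import math

def chi_square_3_to_1(n_dominant, n_recessive):
    """Chi-square goodness-of-fit test (1 degree of freedom) against a 3:1 ratio."""
    total = n_dominant + n_recessive
    expected = (0.75 * total, 0.25 * total)
    observed = (n_dominant, n_recessive)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # With 1 degree of freedom the upper-tail probability is erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value

# Hypothetical F2 counts: 310 high exposed stigma plants, 90 low exposed stigma plants.
chi2, p_value = chi_square_3_to_1(n_dominant=310, n_recessive=90)
print(f"chi2={chi2:.3f}, p={p_value:.3f}")
```

A p-value above the chosen threshold (commonly 0.05) means the counts are compatible with single-locus recessive inheritance; it does not prove it.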
[FANCA gene mutation analysis in Fanconi anemia patients].

Science.gov (United States)

Chen, Fei; Peng, Guang-Jie; Zhang, Kejian; Hu, Qun; Zhang, Liu-Qing; Liu, Ai-Guo

2005-10-01

To screen the FANCA gene mutation and explore the FANCA protein function in Fanconi anemia (FA) patients. FANCA protein expression and its interaction with FANCF were analyzed using Western blot and immunoprecipitation in 3 cases of FA-A. Genomic DNA was used for MLPA analysis followed by sequencing. FANCA protein was undetectable and FANCA and FANCF protein interaction was impaired in these 3 cases of FA-A. Each case of FA-A contained biallelic pathogenic mutations in FANCA gene. No functional FANCA protein was found in these 3 cases of FA-A, and intragenic deletion, frame shift and splice site mutation were the major pathogenic mutations found in FANCA gene.
Gene set analysis for interpreting genetic studies

DEFF Research Database (Denmark)

Pers, Tune H

2016-01-01

Interpretation of genome-wide association study (GWAS) results is lagging behind the discovery of new genetic associations. Consequently, there is an urgent need for data-driven methods for interpreting genetic association studies. Gene set analysis (GSA) can identify aetiologic pathways...
Suitable Reference Genes for Accurate Gene Expression Analysis in Parsley (Petroselinum crispum) for Abiotic Stresses and Hormone Stimuli.

Science.gov (United States)

Li, Meng-Yao; Song, Xiong; Wang, Feng; Xiong, Ai-Sheng

2016-01-01

Parsley, one of the most important vegetables in the Apiaceae family, is widely used in the food, medicinal, and cosmetic industries. Recent studies on parsley mainly focus on its chemical composition, and further research involving the analysis of the plant's gene functions and expressions is required. qPCR is a powerful method for detecting very low quantities of target transcript levels and is widely used to study gene expression. To ensure the accuracy of results, a suitable reference gene is necessary for expression normalization. In this study, four software tools, namely geNorm, NormFinder, BestKeeper, and RefFinder, were used to evaluate the expression stabilities of eight candidate reference genes of parsley (GAPDH, ACTIN, eIF-4α, SAND, UBC, TIP41, EF-1α, and TUB) under various conditions, including abiotic stresses (heat, cold, salt, and drought) and hormone stimuli treatments (GA, SA, MeJA, and ABA). Results showed that EF-1α and TUB were the most stable genes for abiotic stresses, whereas EF-1α, GAPDH, and TUB were the top three choices for hormone stimuli treatments. Moreover, EF-1α and TUB were the most stable reference genes among all tested samples, and UBC was the least stable one. Expression analysis of PcDREB1 and PcDREB2 further verified that the selected stable reference genes were suitable for gene expression normalization. This study can guide the selection of suitable reference genes in gene expression in parsley.
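For readers unfamiliar with how such tools rank reference genes, the sketch below computes a geNorm-style stability measure M: for each gene, the average over all other candidates of the standard deviation of their log2 expression ratios across samples. Lower M means more stable. This is a simplified single pass (the full geNorm procedure also removes the least stable gene step by step), and the gene labels and expression values are hypothetical, not data from the study.

```python
import math
import statistics

def genorm_m_values(expression):
    """Map each gene to its geNorm-style M value (mean pairwise variation)."""
    genes = list(expression)
    m_values = {}
    for j in genes:
        pairwise_sd = []
        for k in genes:
            if k == j:
                continue
            # Log2 ratio of gene j to gene k in every sample.
            log_ratios = [math.log2(a / b)
                          for a, b in zip(expression[j], expression[k])]
            pairwise_sd.append(statistics.stdev(log_ratios))
        m_values[j] = sum(pairwise_sd) / len(pairwise_sd)
    return m_values

# Hypothetical relative quantities for three candidate genes in four samples.
expression = {
    "ref_A": [1.0, 2.0, 4.0, 8.0],
    "ref_B": [2.0, 4.0, 8.0, 16.0],   # constant ratio to ref_A
    "ref_C": [1.0, 4.0, 2.0, 16.0],   # ratio to the others fluctuates
}
m = genorm_m_values(expression)
for gene in sorted(m, key=m.get):
    print(gene, round(m[gene], 3))
```

In this toy input ref_A and ref_B track each other exactly, so they receive the lowest M, while ref_C is ranked least stable.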
Suitable reference genes for accurate gene expression analysis in parsley (Petroselinum crispum) for abiotic stresses and hormone stimuli

Directory of Open Access Journals (Sweden)

Meng-Yao Li

2016-09-01

Parsley is one of the most important vegetables in the Apiaceae family and is widely used in the food, medicinal, and cosmetic industries. Recent studies on parsley mainly focus on chemical composition; further research involving the analysis of gene functions and expression will be required. qPCR is a powerful method for detecting very low quantities of target transcript levels and is widely used for gene expression studies. To ensure the accuracy of results, a suitable reference gene is necessary for expression normalization. In this study, three software tools (geNorm, NormFinder, and BestKeeper) were used to evaluate the expression stabilities of eight candidate reference genes (GAPDH, ACTIN, eIF-4α, SAND, UBC, TIP41, EF-1α, and TUB) under various conditions including abiotic stresses (heat, cold, salt, and drought) and hormone stimuli treatments (GA, SA, MeJA, and ABA). The results showed that EF-1α and TUB were the most stable genes for abiotic stresses, while EF-1α, GAPDH, and TUB were the top three choices for hormone stimuli treatments. Moreover, EF-1α and TUB were the most stable reference genes across all the tested samples, while UBC was the least stable one. The expression analysis of PcDREB1 and PcDREB2 further verified that the selected stable reference genes were suitable for gene expression normalization. This study provides a guideline for selecting suitable reference genes for gene expression analysis in parsley.
Soybean (Glycine max) SWEET gene family: insights through comparative genomics, transcriptome profiling and whole genome re-sequence analysis.

Science.gov (United States)

Patil, Gunvant; Valliyodan, Babu; Deshmukh, Rupesh; Prince, Silvas; Nicander, Bjorn; Zhao, Mingzhe; Sonah, Humira; Song, Li; Lin, Li; Chaudhary, Juhi; Liu, Yang; Joshi, Trupti; Xu, Dong; Nguyen, Henry T

2015-07-11

SWEET (MtN3_saliva) domain proteins, a recently identified group of efflux transporters, play an indispensable role in sugar efflux, phloem loading, plant-pathogen interaction and reproductive tissue development. The SWEET gene family is predominantly studied in Arabidopsis and members of the family are being investigated in rice. To date, no transcriptome or genomics analysis of soybean SWEET genes has been reported. In the present investigation, we explored the evolutionary aspect of the SWEET gene family in diverse plant species including primitive single cell algae to angiosperms with a major emphasis on Glycine max. Evolutionary features showed expansion and duplication of the SWEET gene family in land plants. Homology searches with BLAST tools and Hidden Markov Model-directed sequence alignments identified 52 SWEET genes that were mapped to 15 chromosomes in the soybean genome as tandem duplication events. Soybean SWEET (GmSWEET) genes showed a wide range of expression profiles in different tissues and developmental stages. Analysis of public transcriptome data and expression profiling using quantitative real time PCR (qRT-PCR) showed that a majority of the GmSWEET genes were confined to reproductive tissue development. Several natural genetic variants (non-synonymous SNPs, premature stop codons and haplotype) were identified in the GmSWEET genes using whole genome re-sequencing data analysis of 106 soybean genotypes. A significant association was observed between SNP-haplogroup and seed sucrose content in three gene clusters on chromosome 6. Present investigation utilized comparative genomics, transcriptome profiling and whole genome re-sequencing approaches and provided a systematic description of soybean SWEET genes and identified putative candidates with probable roles in the reproductive tissue development. Gene expression profiling at different developmental stages and genomic variation data will aid as an important resource for the soybean research
Gene expression patterns during the larval development of European sea bass (Dicentrarchus labrax) by microarray analysis.

Science.gov (United States)

Darias, M J; Zambonino-Infante, J L; Hugot, K; Cahu, C L; Mazurais, D

2008-01-01

During the larval period, marine teleosts undergo very fast growth and dramatic changes in morphology, metabolism, and behavior to accomplish their metamorphosis into juvenile fish. Regulation of gene expression is widely thought to be a key mechanism underlying the management of the biological processes required for harmonious development over this phase of life. To provide an overall analysis of gene expression in the whole body during sea bass larval development, we monitored the expression of 6,626 distinct genes at 10 different points in time between 7 and 43 days post-hatching (dph) by using heterologous hybridization of a rainbow trout cDNA microarray. The differentially expressed genes (n = 485) could be grouped into two categories: genes that were generally up-expressed early, between 7 and 23 dph, and genes up-expressed between 25 and 43 dph. Interestingly, among the genes regulated during the larval period, those related to organogenesis, energy pathways, biosynthesis, and digestion were over-represented compared with total set of analyzed genes. We discuss the quantitative regulation of whole-body contents of these specific transcripts with regard to the ontogenesis and maturation of essential functions that take place over larval development. Our study is the first utilization of a transcriptomic approach in sea bass and reveals dynamic changes in gene expression patterns in relation to marine finfish larval development.
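Over-representation of a functional category among regulated genes is commonly assessed with a one-sided hypergeometric test. The abstract does not say which test the authors used, so this is an illustrative sketch only: the totals (6,626 genes on the array, 485 differentially expressed) come from the abstract, while the category size and overlap are hypothetical.

```python
from math import comb

def overrepresentation_pvalue(overlap, category_size, n_selected, n_total):
    """P(X >= overlap) when n_selected genes are drawn from n_total without
    replacement and category_size of the n_total genes belong to the category."""
    upper = min(category_size, n_selected)
    favourable = sum(
        comb(category_size, i) * comb(n_total - category_size, n_selected - i)
        for i in range(overlap, upper + 1)
    )
    return favourable / comb(n_total, n_selected)

# 6,626 genes and 485 regulated genes are from the abstract; a category of
# 200 genes with 30 of them regulated is a hypothetical example.
p = overrepresentation_pvalue(overlap=30, category_size=200,
                              n_selected=485, n_total=6626)
print(f"p = {p:.2e}")
```

About 15 of the 200 category genes would be expected among the regulated set by chance, so an overlap of 30 yields a small p-value. In practice many categories are tested at once and the p-values need a multiple-testing correction.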
Clinical Omics Analysis of Colorectal Cancer Incorporating Copy Number Aberrations and Gene Expression Data

Directory of Open Access Journals (Sweden)

Tsuyoshi Yoshida

2010-07-01

Background: Colorectal cancer (CRC) is one of the most frequently occurring cancers in Japan, and thus a wide range of methods have been deployed to study the molecular mechanisms of CRC. In this study, we performed a comprehensive analysis of CRC, incorporating copy number aberration (CNA) and gene expression data. For the last four years, we have been collecting data from CRC cases and organizing the information as an “omics” study by integrating many kinds of analysis into a single comprehensive investigation. In our previous studies, we had experienced difficulty in finding genes related to CRC, as we observed higher noise levels in the expression data than in the data for other cancers. Because chromosomal aberrations are often observed in CRC, here, we have performed a combination of CNA analysis and expression analysis in order to identify some new genes responsible for CRC. This study was performed as part of the Clinical Omics Database Project at Tokyo Medical and Dental University. The purpose of this study was to investigate the mechanism of genetic instability in CRC by this combination of expression analysis and CNA, and to establish a new method for the diagnosis and treatment of CRC. Materials and methods: Comprehensive gene expression analysis was performed on 79 CRC cases using an Affymetrix Gene Chip, and comprehensive CNA analysis was performed using an Affymetrix DNA Sty array. To avoid the contamination of cancer tissue with normal cells, laser micro-dissection was performed before DNA/RNA extraction. Data analysis was performed using original software written in the R language. Result: We observed a high percentage of CNA in colorectal cancer, including copy number gains at 7, 8q, 13 and 20q, and copy number losses at 8p, 17p and 18. Gene expression analysis provided many candidates for CRC-related genes, but their association with CRC did not reach the level of statistical significance. The combination of CNA and gene
Gene Expression Analysis Reveals New Possible Mechanisms of Vancomycin-Induced Nephrotoxicity and Identifies Gene Markers Candidates

OpenAIRE

Dieterich, Christine; Puey, Angela; Lyn, Sylvia; Swezey, Robert; Furimsky, Anna; Fairchild, David; Mirsalis, Jon C.; Ng, Hanna H.

2008-01-01

Vancomycin, one of few effective treatments against methicillin-resistant Staphylococcus aureus, is nephrotoxic. The goals of this study were to (1) gain insights into molecular mechanisms of nephrotoxicity at the genomic level, (2) evaluate gene markers of vancomycin-induced kidney injury, and (3) compare gene expression responses after iv and ip administration. Groups of six female BALB/c mice were treated with seven daily iv or ip doses of vancomycin (50, 200, and 400 mg/kg) or saline, and...
Transcriptomic changes in human breast cancer progression as determined by serial analysis of gene expression

International Nuclear Information System (INIS)

Abba, Martin C; Aldaz, C Marcelo; Drake, Jeffrey A; Hawkins, Kathleen A; Hu, Yuhui; Sun, Hongxia; Notcovich, Cintia; Gaddis, Sally; Sahin, Aysegul; Baggerly, Keith

2004-01-01

Genomic and transcriptomic alterations affecting key cellular processes such as cell proliferation, differentiation and genomic stability are considered crucial for the development and progression of cancer. Most invasive breast carcinomas are known to derive from precursor in situ lesions. It is proposed that major global expression abnormalities occur in the transition from normal to premalignant stages and further progression to invasive stages. Serial analysis of gene expression (SAGE) was employed to generate a comprehensive global gene expression profile of the major changes occurring during breast cancer malignant evolution. In the present study we combined various normal and tumor SAGE libraries available in the public domain with sets of breast cancer SAGE libraries recently generated and sequenced in our laboratory. A recently developed modified t test was used to detect the genes differentially expressed. We accumulated a total of approximately 1.7 million breast tissue-specific SAGE tags and monitored the behavior of more than 25,157 genes during early breast carcinogenesis. We detected 52 transcripts commonly deregulated across the board when comparing normal tissue with ductal carcinoma in situ, and 149 transcripts when comparing ductal carcinoma in situ with invasive ductal carcinoma (P < 0.01). A major novelty of our study was the use of a statistical method that correctly accounts for the intra-SAGE and inter-SAGE library sources of variation. The most useful result of applying this modified t statistics beta binomial test is the identification of genes and gene families commonly deregulated across samples within each specific stage in the transition from normal to preinvasive and invasive stages of breast cancer development. Most of the gene expression abnormalities detected at the in situ stage were related to specific genes in charge of regulating the proper homeostasis between cell death and cell proliferation. The comparison of in situ lesions
Large scale gene expression meta-analysis reveals tissue-specific, sex-biased gene expression in humans

Directory of Open Access Journals (Sweden)

Benjamin Mayne

2016-10-01

The severity and prevalence of many diseases are known to differ between the sexes. Organ specific sex-biased gene expression may underpin these and other sexually dimorphic traits. To further our understanding of sex differences in transcriptional regulation, we performed meta-analyses of sex biased gene expression in multiple human tissues. We analysed 22 publicly available human gene expression microarray data sets including over 2500 samples from 15 different tissues and 9 different organs. Briefly, by using an inverse-variance method we determined the effect size difference of gene expression between males and females. We found the greatest sex differences in gene expression in the brain, specifically in the anterior cingulate cortex (1818 genes), followed by the heart (375 genes), kidney (224 genes), colon (218 genes) and thyroid (163 genes). More interestingly, we found different parts of the brain with varying numbers and identity of sex-biased genes, indicating that specific cortical regions may influence sexually dimorphic traits. The majority of sex-biased genes in other tissues such as the bladder, liver, lungs and pancreas were on the sex chromosomes or involved in sex hormone production. On average in each tissue, 32% of autosomal genes that were expressed in a sex-biased fashion contained androgen or estrogen hormone response elements. Interestingly, across all tissues, we found approximately two-thirds of autosomal genes that were sex-biased were not under direct influence of sex hormones. To our knowledge this is the largest analysis of sex-biased gene expression in human tissues to date. We identified many sex-biased genes that were not under the direct influence of sex chromosome genes or sex hormones. These may provide targets for future development of sex-specific treatments for diseases.
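The inverse-variance method mentioned above weights each study's effect size by the reciprocal of its variance, so more precise studies count for more. The sketch below shows the fixed-effect form for a single gene. Whether the authors used a fixed-effect or random-effects model is not stated in the abstract, so the fixed-effect choice is an assumption, and the per-study values are hypothetical.

```python
import math

def inverse_variance_pool(effects, variances):
    """Fixed-effect inverse-variance pooling.

    Returns the pooled effect size and its standard error.
    """
    weights = [1.0 / v for v in variances]
    pooled = sum(w * e for w, e in zip(weights, effects)) / sum(weights)
    standard_error = math.sqrt(1.0 / sum(weights))
    return pooled, standard_error

# Hypothetical male-versus-female effect sizes for one gene in three data sets.
effects = [0.5, 0.3, 0.4]
variances = [0.04, 0.01, 0.02]
pooled, se = inverse_variance_pool(effects, variances)
print(f"pooled effect = {pooled:.3f}, SE = {se:.3f}, z = {pooled / se:.2f}")
```

The pooled estimate sits closest to the second study because its variance is smallest and its weight is therefore largest.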
A Gene Module-Based eQTL Analysis Prioritizing Disease Genes and Pathways in Kidney Cancer

Directory of Open Access Journals (Sweden)

Mary Qu Yang

Clear cell renal cell carcinoma (ccRCC) is the most common and most aggressive form of renal cell cancer (RCC). The incidence of RCC has increased steadily in recent years. The pathogenesis of renal cell cancer remains poorly understood. Many of the tumor suppressor genes, oncogenes, and dysregulated pathways in ccRCC need to be revealed for improvement of the overall clinical outlook of the disease. Here, we developed a systems biology approach to prioritize the somatic mutated genes that lead to dysregulation of pathways in ccRCC. The method integrated multi-layer information to infer causative mutations and disease genes. First, we identified differential gene modules in ccRCC by coupling transcriptome and protein-protein interactions. Each of these modules consisted of interacting genes that were involved in similar biological processes and their combined expression alterations were significantly associated with disease type. Then, subsequent gene module-based eQTL analysis revealed somatic mutated genes that had driven the expression alterations of differential gene modules. Our study yielded a list of candidate disease genes, including several known ccRCC causative genes such as BAP1 and PBRM1, as well as novel genes such as NOD2, RRM1, CSRNP1, SLC4A2, TTLL1 and CNTN1. The differential gene modules and their driver genes revealed by our study provided a new perspective for understanding the molecular mechanisms underlying the disease. Moreover, we validated the results in independent ccRCC patient datasets. Our study provided a new method for prioritizing disease genes and pathways. Keywords: ccRCC, Causative mutation, Pathways, Protein-protein interaction, Gene module, eQTL
Gene expression profiling with principal component analysis depicts the biological continuum from essential thrombocythemia over polycythemia vera to myelofibrosis

DEFF Research Database (Denmark)

Skov, Vibe; Thomassen, Mads; Riley, Caroline H

2012-01-01

The recent discovery of the Janus activating kinase 2 V617F mutation in most patients with polycythemia vera (PV) and half of those with essential thrombocythemia (ET) and primary myelofibrosis (PMF) has favored the hypothesis of a biological continuum from ET over PV to PMF. We performed gene...... with biological relevant overlaps between the different entities. Moreover, the analysis separates Janus activating kinase 2-negative ET patients from Janus activating kinase 2-positive ET patients. Functional annotation analysis demonstrates that clusters of gene ontology terms related to inflammation, immune...... system, apoptosis, RNA metabolism, and secretory system were the most significantly deregulated terms in the three different disease groups. Our results yield further support for the hypothesis of a biological continuum originating from ET over PV to PMF. Functional analysis suggests an important...
Length bias correction in gene ontology enrichment analysis using logistic regression.

Science.gov (United States)

Mi, Gu; Di, Yanming; Emerson, Sarah; Cumbie, Jason S; Chang, Jeff H

2012-01-01

When assessing differential gene expression from RNA sequencing data, commonly used statistical tests tend to have greater power to detect differential expression of genes encoding longer transcripts. This phenomenon, called "length bias", will influence subsequent analyses such as Gene Ontology enrichment analysis. In the presence of length bias, Gene Ontology categories that include longer genes are more likely to be identified as enriched. These categories, however, are not necessarily biologically more relevant. We show that one can effectively adjust for length bias in Gene Ontology analysis by including transcript length as a covariate in a logistic regression model. The logistic regression model makes the statistical issue underlying length bias more transparent: transcript length becomes a confounding factor when it correlates with both the Gene Ontology membership and the significance of the differential expression test. The inclusion of the transcript length as a covariate allows one to investigate the direct correlation between the Gene Ontology membership and the significance of testing differential expression, conditional on the transcript length. We present both real and simulated data examples to show that the logistic regression approach is simple, effective, and flexible.
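The confounding argument can be illustrated with a small simulation. In the sketch below, transcript length drives both the chance that a gene is called differentially expressed and its chance of belonging to a category, while category membership itself has no effect. The response/predictor layout (differential expression as response, membership and log length as predictors), the simulation settings, and the hand-written Newton-Raphson fit are all assumptions made for illustration; they are not the authors' implementation.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def solve(a, b):
    """Solve the small linear system a x = b by Gauss-Jordan elimination."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]

def fit_logistic(rows, y, n_iter=25):
    """Fit logistic regression by Newton-Raphson; rows include an intercept column."""
    k = len(rows[0])
    beta = [0.0] * k
    for _ in range(n_iter):
        grad = [0.0] * k
        info = [[0.0] * k for _ in range(k)]  # X' W X
        for x, yi in zip(rows, y):
            p = sigmoid(sum(b * xi for b, xi in zip(beta, x)))
            w = p * (1.0 - p)
            for i in range(k):
                grad[i] += (yi - p) * x[i]
                for j in range(k):
                    info[i][j] += w * x[i] * x[j]
        beta = [b + s for b, s in zip(beta, solve(info, grad))]
    return beta

# Simulated genes: length affects both outcomes; membership has no direct effect.
random.seed(0)
n_genes = 2000
log_len = [random.gauss(0.0, 1.0) for _ in range(n_genes)]
in_category = [1.0 if random.random() < sigmoid(-1.0 + l) else 0.0 for l in log_len]
is_de = [1.0 if random.random() < sigmoid(-1.0 + l) else 0.0 for l in log_len]

naive = fit_logistic([[1.0, g] for g in in_category], is_de)
adjusted = fit_logistic([[1.0, g, l] for g, l in zip(in_category, log_len)], is_de)
print("membership coefficient, unadjusted:     ", round(naive[1], 3))
print("membership coefficient, length-adjusted:", round(adjusted[1], 3))
```

Without the length covariate the membership coefficient is clearly positive, which would be read as enrichment. Once log length enters the model, the coefficient should shrink toward zero, which is the behaviour the abstract describes.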
Application of biclustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials

Directory of Open Access Journals (Sweden)

Andrew Williams

2015-12-01

Background: The presence of diverse types of nanomaterials (NMs) in commerce is growing at an exponential pace. As a result, human exposure to these materials in the environment is inevitable, necessitating the need for rapid and reliable toxicity testing methods to accurately assess the potential hazards associated with NMs. In this study, we applied biclustering and gene set enrichment analysis methods to derive essential features of altered lung transcriptome following exposure to NMs that are associated with lung-specific diseases. Several datasets from public microarray repositories describing pulmonary diseases in mouse models following exposure to a variety of substances were examined and functionally related biclusters of genes showing similar expression profiles were identified. The identified biclusters were then used to conduct a gene set enrichment analysis on pulmonary gene expression profiles derived from mice exposed to nano-titanium dioxide (nano-TiO2), carbon black (CB) or carbon nanotubes (CNTs) to determine the disease significance of these data-driven gene sets. Results: Biclusters representing inflammation (chemokine activity), DNA binding, cell cycle, apoptosis, reactive oxygen species (ROS) and fibrosis processes were identified. All of the NM studies were significant with respect to the bicluster related to chemokine activity (DAVID; FDR p-value = 0.032). The bicluster related to pulmonary fibrosis was enriched in studies where toxicity induced by CNT and CB studies was investigated, suggesting the potential for these materials to induce lung fibrosis. The pro-fibrogenic potential of CNTs is well established. Although CB has not been shown to induce fibrosis, it induces stronger inflammatory, oxidative stress and DNA damage responses than nano-TiO2 particles. Conclusion: The results of the analysis correctly identified all NMs to be inflammogenic and only CB and CNTs as potentially fibrogenic. In addition to identifying several

Cartilage-selective genes identified in genome-scale analysis of non-cartilage and cartilage gene expression

Directory of Open Access Journals (Sweden)

Cohn Zachary A

2007-06-01

Background: Cartilage plays a fundamental role in the development of the human skeleton. Early in embryogenesis, mesenchymal cells condense and differentiate into chondrocytes to shape the early skeleton. Subsequently, the cartilage anlagen differentiate to form the growth plates, which are responsible for linear bone growth, and the articular chondrocytes, which facilitate joint function. However, despite the multiplicity of roles of cartilage during human fetal life, surprisingly little is known about its transcriptome. To address this, a whole genome microarray expression profile was generated using RNA isolated from 18–22 week human distal femur fetal cartilage and compared with a database of control normal human tissues aggregated at UCLA, termed Celsius. Results: 161 cartilage-selective genes were identified, defined as genes significantly expressed in cartilage with low expression and little variation across a panel of 34 non-cartilage tissues. Among these 161 genes were cartilage-specific genes such as cartilage collagen genes and 25 genes which have been associated with skeletal phenotypes in humans and/or mice. Many of the other cartilage-selective genes do not have established roles in cartilage or are novel, unannotated genes. Quantitative RT-PCR confirmed the unique pattern of gene expression observed by microarray analysis. Conclusion: Defining the gene expression pattern for cartilage has identified new genes that may contribute to human skeletogenesis as well as provided further candidate genes for skeletal dysplasias. The data suggest that fetal cartilage is a complex and transcriptionally active tissue and demonstrate that the set of genes selectively expressed in the tissue has been greatly underestimated.
Comparative genomic analysis of the PKS genes in five species and expression analysis in upland cotton

Directory of Open Access Journals (Sweden)

Xueqiang Su

2017-10-01

Plant type III polyketide synthase (PKS) can catalyse the formation of a series of secondary metabolites with different structures and different biological functions; the enzyme plays an important role in plant growth, development and resistance to stress. At present, the PKS gene has been identified and studied in a variety of plants. Here, we identified 11 PKS genes from upland cotton (Gossypium hirsutum) and compared them with 41 PKS genes in Populus tremula, Vitis vinifera, Malus domestica and Arabidopsis thaliana. According to the phylogenetic tree, a total of 52 PKS genes can be divided into four subfamilies (I–IV). The analysis of gene structures and conserved motifs revealed that most of the PKS genes were composed of two exons and one intron and there are two characteristic conserved domains (Chal_sti_synt_N and Chal_sti_synt_C) of the PKS gene family. In our study of the five species, gene duplication was found in addition to Arabidopsis thaliana and we determined that purifying selection has been of great significance in maintaining the function of PKS gene family. From qRT-PCR analysis and a combination of the role of the accumulation of proanthocyanidins (PAs) in brown cotton fibers, we concluded that five PKS genes are candidate genes involved in brown cotton fiber pigment synthesis. These results are important for the further study of brown cotton PKS genes. It not only reveals the relationship between PKS gene family and pigment in brown cotton, but also creates conditions for improving the quality of brown cotton fiber.
Effective identification of Lactobacillus casei group species: genome-based selection of the gene mutL as the target of a novel multiplex PCR assay.

Science.gov (United States)

Bottari, Benedetta; Felis, Giovanna E; Salvetti, Elisa; Castioni, Anna; Campedelli, Ilenia; Torriani, Sandra; Bernini, Valentina; Gatti, Monica

2017-07-01

Lactobacillus casei, Lactobacillus paracasei and Lactobacillus rhamnosus form a closely related taxonomic group (the L. casei group) within the facultatively heterofermentative lactobacilli. Strains of these species have been used for a long time as probiotics in a wide range of products, and they represent the dominant species of nonstarter lactic acid bacteria in ripened cheeses, where they contribute to flavour development. The close genetic relationship among those species, as well as the similarity of biochemical properties of the strains, hinders the development of an adequate selective method to identify these bacteria. Despite this being a hot topic, as demonstrated by the large amount of literature about it, the results of different proposed identification methods are often ambiguous and unsatisfactory. The aim of this study was to develop a more robust species-specific identification assay for differentiating the species of the L. casei group. A taxonomy-driven comparative genomic analysis was carried out to select the potential target genes whose similarity could better reflect genome-wide diversity. The gene mutL appeared to be the most promising one and, therefore, a novel species-specific multiplex PCR assay was developed to rapidly and effectively distinguish L. casei, L. paracasei and L. rhamnosus strains. The analysis of a collection of 76 wild dairy isolates, previously identified as members of the L. casei group combining the results of multiple approaches, revealed that the newly designed primers, especially in combination with already existing ones, were able to improve the discrimination power at the species level and reveal previously undiscovered intraspecific biodiversity.
Implications of XRCC1, XPD and APE1 gene polymorphism in North Indian population: a comparative approach in different ethnic groups worldwide.

Science.gov (United States)

Gangwar, Ruchika; Manchanda, Parmeet Kaur; Mittal, Rama Devi

2009-05-01

Identifying risk factors for human cancers should consider combinations of genetic variations and environmental exposures. Several polymorphisms in DNA repair genes have an impact on repair and cancer susceptibility. We focused on X-ray repair cross-complementing group 1 (XRCC1), Xeroderma pigmentosum D (XPD) and apurinic/apyrimidinic endonuclease (APE1) as these are the most extensively studied in cancer. The present study was conducted to determine the distribution of XRCC1 C26304T, G27466A, G23591A, APE1 T2197G and XPD A35931C gene polymorphisms in a North Indian population and to compare it with different populations globally. PCR-based analysis was conducted in 209 normal healthy individuals of similar ethnicity. Wild-type allele frequencies were 91.1% C(Arg) for XRCC1 C26304T; 62.9% G(Arg) for G27466A; 60.3% G(Arg) for G23591A; 75.1% T(Asp) for APE1 T2197G and 71.8% A(Lys) for XPD A35931C. The variant allele frequencies were 8.9% T(Trp) in XRCC1 C26304T; 37.1% A(His) in G27466A; 39.7% A(Gln) in G23591A; 24.9% G(Glu) in APE1 and 28.2% C(Gln) in XPD, respectively. We further compared the frequency distribution for these genes with various published studies in different ethnic groups. Our results suggest that allele frequencies in these DNA repair genes exhibit a distinctive pattern in India that could be attributed to ethnic variation. This could assist in high-risk screening of humans exposed to environmental carcinogens and of cancer predisposition in different ethnic groups.
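The allele frequencies quoted in this record can be turned into expected genotype frequencies under Hardy-Weinberg equilibrium. The sketch below is only an illustration: the abstract does not report genotype counts or any equilibrium test, so the equilibrium assumption, the function names and the use of the sample size of 209 to scale the frequencies are all assumptions made here.

```python
# Allele frequencies (wild type, variant) as quoted in the abstract above.
ALLELE_FREQS = {
    "XRCC1 C26304T": (0.911, 0.089),
    "XRCC1 G27466A": (0.629, 0.371),
    "XRCC1 G23591A": (0.603, 0.397),
    "APE1 T2197G": (0.751, 0.249),
    "XPD A35931C": (0.718, 0.282),
}


def hardy_weinberg(p, q):
    """Return expected frequencies of (wild/wild, heterozygous, variant/variant).

    Assumes Hardy-Weinberg equilibrium, which the abstract does not confirm.
    """
    if abs(p + q - 1.0) > 1e-6:
        raise ValueError("allele frequencies must sum to 1")
    return p * p, 2 * p * q, q * q


def expected_counts(p, q, n):
    """Scale the expected genotype frequencies to a sample of n individuals."""
    return tuple(round(f * n, 1) for f in hardy_weinberg(p, q))


if __name__ == "__main__":
    for name, (p, q) in ALLELE_FREQS.items():
        # 209 is the number of individuals stated in the abstract.
        print(name, expected_counts(p, q, 209))
```

Comparing such expected counts with observed genotype counts (not given in this record) would be the usual next step before comparing populations.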
Mutation analysis of the cathepsin C gene in Indian families with Papillon-Lefèvre syndrome

Directory of Open Access Journals (Sweden)

Srivastava Satish

2003-07-01

Background. PLS is a rare autosomal recessive disorder characterized by early onset periodontopathia and palmar plantar keratosis. PLS is caused by mutations in the cathepsin C (CTSC) gene. Dipeptidyl-peptidase I encoded by the CTSC gene removes dipeptides from the amino-terminus of protein substrates and mainly plays an immune and inflammatory role. Several mutations have been reported in this gene in patients from several ethnic groups. We report here mutation analysis of the CTSC gene in three Indian families with PLS. Methods. Peripheral blood samples were obtained from individuals belonging to three Indian families with PLS for genomic DNA isolation. Exon-specific intronic primers were used to amplify DNA samples from individuals. PCR products were subsequently sequenced to detect mutations. PCR-SSCP and ASOH analyses were used to determine if mutations were present in normal control individuals. Results. All patients from three families had a classic PLS phenotype, which included palmoplantar keratosis and early-onset severe periodontitis. Sequence analysis of the CTSC gene showed three novel nonsense mutations (viz., p.Q49X, p.Q69X and p.Y304X) in homozygous state in affected individuals from these Indian families. Conclusions. This study reported three novel nonsense mutations in three Indian families. These novel nonsense mutations are predicted to produce truncated dipeptidyl-peptidase I causing PLS phenotype in these families. A review of the literature along with three novel mutations reported here showed that the total number of mutations in the CTSC gene described to date is 41 with 17 mutations being located in exon 7.
The Alcohol Dehydrogenase Gene Family in Melon (Cucumis melo L.): Bioinformatic Analysis and Expression Patterns

Directory of Open Access Journals (Sweden)

Yazhong Jin

2016-05-01

Alcohol dehydrogenases (ADH), encoded by a multigene family in plants, play a critical role in plant growth, development, adaptation, fruit ripening and aroma production. Thirteen ADH genes were identified in the melon genome, including 12 ADHs and one formaldehyde dehydrogenase (FDH), designated CmADH1-12 and CmFDH1, of which CmADH1 and CmADH2 have been isolated in Cantaloupe. ADH genes shared a lower identity with each other at the protein level and had different intron-exon structures at the nucleotide level. No typical signal peptides were found in any of the CmADHs, and CmADH proteins might be located in the cytoplasm. The phylogenetic tree revealed that the 13 ADH genes were divided into 3 groups, namely the long-, medium- and short-chain ADH subfamilies, and CmADH1,3-11, which belong to the medium-chain ADH subfamily, fell into 6 medium-chain ADH subgroups. CmADH12 may belong to the long-chain ADH subfamily, while CmFDH1 may be a Class III ADH and serve as an ancestral ADH in melon. Expression profiling revealed that CmADH1, CmADH2, CmADH10 and CmFDH1 were moderately or strongly expressed in different vegetative tissues and in fruit at medium and late developmental stages, while CmADH8 and CmADH12 were highly expressed in fruit after 20 days. CmADH3 showed preferential expression in young tissues. CmADH4 was only slightly expressed in root. Promoter analysis revealed several motifs of CmADH genes involved in gene expression modulated by various hormones, and the response patterns of CmADH genes to ABA, IAA and ethylene were different. These CmADHs were divided into ethylene-sensitive and -insensitive groups, and the functions of CmADHs were discussed.
Analysis of gene evolution and metabolic pathways using the Candida Gene Order Browser

LENUS (Irish Health Repository)

Fitzpatrick, David A

2010-05-10

Background. Candida species are the most common cause of opportunistic fungal infection worldwide. Recent sequencing efforts have provided a wealth of Candida genomic data. We have developed the Candida Gene Order Browser (CGOB), an online tool that aids comparative syntenic analyses of Candida species. CGOB incorporates all available Candida clade genome sequences including two Candida albicans isolates (SC5314 and WO-1) and 8 closely related species (Candida dubliniensis, Candida tropicalis, Candida parapsilosis, Lodderomyces elongisporus, Debaryomyces hansenii, Pichia stipitis, Candida guilliermondii and Candida lusitaniae). Saccharomyces cerevisiae is also included as a reference genome. Results. CGOB assignments of homology were manually curated based on sequence similarity and synteny. In total CGOB includes 65617 genes arranged into 13625 homology columns. We have also generated improved Candida gene sets by merging/removing partial genes in each genome. Interrogation of CGOB revealed that the majority of tandemly duplicated genes are under strong purifying selection in all Candida species. We identified clusters of adjacent genes involved in the same metabolic pathways (such as catabolism of biotin, galactose and N-acetyl glucosamine) and we showed that some clusters are species or lineage-specific. We also identified one example of intron gain in C. albicans. Conclusions. Our analysis provides an important resource that is now available for the Candida community. CGOB is available at http://cgob.ucd.ie.
Next generation sequencing based transcriptome analysis of septic-injury responsive genes in the beetle Tribolium castaneum.

Directory of Open Access Journals (Sweden)

Boran Altincicek

Beetles (Coleoptera) are the most diverse animal group on earth and interact with numerous symbiotic or pathogenic microbes in their environments. The red flour beetle Tribolium castaneum is a genetically tractable model beetle species and its whole genome sequence has recently been determined. To advance our understanding of the molecular basis of beetle immunity, we analyzed the whole transcriptome of T. castaneum by high-throughput next generation sequencing technology. Here, we demonstrate that the Illumina/Solexa sequencing approach of cDNA samples from T. castaneum, including over 9.7 million reads of 72 base pairs (bp) in length (approximately 700 million bp of sequence information) with about 30× transcriptome coverage, confirms the expression of most predicted genes and enabled subsequent qualitative and quantitative transcriptome analysis. This approach recapitulates our recent quantitative real-time PCR studies of immune-challenged and naïve T. castaneum beetles, validating our approach. Furthermore, this sequencing analysis resulted in the identification of 73 genes that were differentially expressed upon immune challenge with statistical significance, by comparing expression data to calculated values derived by fitting to generalized linear models. We identified upregulation of diverse immune-related genes (e.g. Toll receptor, serine proteinases, DOPA decarboxylase and thaumatin) and of numerous genes encoding proteins with as yet unknown functions. Of note, septic injury also resulted in the elevated expression of genes encoding heat-shock proteins or cytochrome P450s, supporting the view that there is crosstalk between immune and stress responses in T. castaneum. The present study provides a first comprehensive overview of septic-injury responsive genes in T. castaneum beetles. The identified genes advance our understanding of T. castaneum specific gene expression alteration upon immune challenge in particular and may help to understand beetle immunity.
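The sequencing figures in this record can be checked with simple arithmetic. The sketch below multiplies the stated read count by the read length and then divides by the stated coverage; the implied transcriptome size is an inference made here, not a number given in the abstract, and the function names are illustrative.

```python
def total_bases(n_reads, read_length_bp):
    """Total sequenced bases for fixed-length reads."""
    return n_reads * read_length_bp


def implied_target_size(total_bp, coverage):
    """Target size implied by a given mean coverage (coverage = total_bp / size)."""
    return total_bp / coverage


if __name__ == "__main__":
    # "over 9.7 million reads" of 72 bp, so this is a lower bound.
    bases = total_bases(9_700_000, 72)
    print(f"total bases: {bases:,}")  # close to the quoted ~700 million bp
    # "about 30x" coverage implies a transcriptome of roughly this size (assumption).
    print(f"implied transcriptome size: {implied_target_size(bases, 30):,.0f} bp")
```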
Tumor-directed gene therapy in mice using a composite nonviral gene delivery system consisting of the piggyBac transposon and polyethylenimine

International Nuclear Information System (INIS)

Kang, Yu; Zhang, Xiaoyan; Jiang, Wei; Wu, Chaoqun; Chen, Chunmei; Zheng, Yufang; Gu, Jianren; Xu, Congjian

2009-01-01

Compared with viral vectors, nonviral vectors are less immunogenic, more stable, safer and easier to replicate for application in cancer gene therapy. However, nonviral gene delivery systems have not been extensively used because of their low transfection efficiency and short transgene expression, especially in vivo. It is desirable to develop a nonviral gene delivery system that can support stable genomic integration and persistent gene expression in vivo. Here, we used a composite nonviral gene delivery system consisting of the piggyBac (PB) transposon and polyethylenimine (PEI) for long-term transgene expression in mouse ovarian tumors. A recombinant plasmid PB [Act-RFP, HSV-tk] encoding both the herpes simplex thymidine kinase (HSV-tk) and the monomeric red fluorescent protein (mRFP1) under PB transposon elements was constructed. This plasmid and the PBase plasmid were injected into ovarian cancer tumor xenografts in mice by the in vivo PEI system. The antitumor effects of the HSV-tk/ganciclovir (GCV) system were observed after intraperitoneal injection of GCV. Histological analysis and TUNEL assay were performed on cryostat sections of the tumor tissue. Plasmid construction was confirmed by PCR analysis combined with restriction enzyme digestion. mRFP1 expression could be visualized under fluorescence microscopy three weeks after the last transfection of pPB/TK. After GCV administration, the tumor volume of the PB/TK group was significantly reduced and the tumor inhibitory rate was 81.96%, compared with 43.07% in the TK group. Histological analysis showed that there was extensive necrosis and lymphocyte infiltration in the tumor tissue of the PB/TK group but only limited necrosis and infiltration in the tissue of the control group. TUNEL assays suggested that the transfected cells were undergoing apoptosis after GCV administration in vivo. Our results show that the nonviral gene delivery system coupling the PB transposon with PEI can be used as an efficient tool for gene therapy in ovarian cancer.
Singular Perturbation Analysis and Gene Regulatory Networks with Delay

Science.gov (United States)

Shlykova, Irina; Ponosov, Arcady

2009-09-01

There are different ways to model gene regulatory networks. Differential equations allow for a detailed description of the network's dynamics and provide an explicit model of the gene concentration changes over time. Production and relative degradation rate functions used in such models depend on the vector of steeply sloped threshold functions which characterize the activity of genes. The most popular example of the threshold functions comes from the Boolean network approach, where the threshold functions are given by step functions. The system of differential equations then becomes piecewise linear. The dynamics of this system can be described very easily between the thresholds, but not in the switching domains. For instance, this approach fails to analyze stationary points of the system and to define continuous solutions in the switching domains. These problems were studied in [2], [3], but the proposed model did not take into account a time delay in cellular systems. However, analysis of real gene expression data shows a considerable number of time-delayed interactions, suggesting that time delay is essential in gene regulation. Therefore, delays may have a great effect on the dynamics of the system, presenting one of the critical factors that should be considered in the reconstruction of gene regulatory networks. The goal of this work is to apply singular perturbation analysis to certain systems with delay and to obtain an analog of Tikhonov's theorem, which provides sufficient conditions for constructing the limit system in the delay case.
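To make the role of steep threshold functions and delay concrete, here is a minimal numerical sketch of a single self-repressing gene. It is not the model or the result of the paper: the equation dx/dt = k(1 - Z(x(t - tau))) - gamma x(t), the Hill-type sigmoid Z, all parameter values and the forward Euler scheme are assumptions chosen for illustration.

```python
import math


def hill(x, theta, q):
    """Steep sigmoid Z(x) = x^(1/q) / (x^(1/q) + theta^(1/q)).

    As q -> 0 this tends to a step function at the threshold theta.
    """
    if x <= 0.0:
        return 0.0
    z = math.log(x / theta) / q
    if z > 700.0:   # avoid overflow in exp
        return 1.0
    if z < -700.0:
        return 0.0
    r = math.exp(z)
    return r / (1.0 + r)


def simulate(k=2.0, gamma=1.0, theta=1.0, q=0.05, tau=1.0,
             x0=0.5, dt=0.01, t_end=20.0):
    """Forward Euler integration of a delayed negative feedback loop.

    The history on [-tau, 0] is taken to be the constant x0 (an assumption).
    """
    n_delay = int(round(tau / dt))
    n_steps = int(round(t_end / dt))
    xs = [x0] * (n_delay + 1)
    for _ in range(n_steps):
        x_now = xs[-1]
        x_delayed = xs[-1 - n_delay]
        dx = k * (1.0 - hill(x_delayed, theta, q)) - gamma * x_now
        xs.append(x_now + dt * dx)
    return xs[n_delay:]  # trajectory from t = 0 to t = t_end


if __name__ == "__main__":
    no_delay = simulate(tau=0.0)
    with_delay = simulate(tau=1.0)
    print("tau=0 final value:", round(no_delay[-1], 4))
    print("tau=1 range over second half:",
          round(min(with_delay[1000:]), 3), round(max(with_delay[1000:]), 3))
```

With these assumed parameters the undelayed system settles at the threshold, while the delayed one keeps crossing it, which is the kind of qualitative difference that motivates treating delay explicitly.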
Identification of a new complementation group of the peroxisome biogenesis disorders and PEX14 as the mutated gene

NARCIS (Netherlands)

Shimozawa, Nobuyuki; Tsukamoto, Toshiro; Nagase, Tomoko; Takemoto, Yasuhiko; Koyama, Naoki; Suzuki, Yasuyuki; Komori, Masayuki; Osumi, Takashi; Jeannette, Gootjes; Wanders, Ronald J. A.; Kondo, Naomi

2004-01-01

Peroxisome biogenesis disorders (PBD) are lethal hereditary diseases caused by abnormalities in the biogenesis of peroxisomes. At present, 12 different complementation groups have been identified and to date, all genes responsible for each of these complementation groups have been identified. The
Xylella fastidiosa gene expression analysis by DNA microarrays

Directory of Open Access Journals (Sweden)

Regiane F. Travensolo

2009-01-01

Xylella fastidiosa genome sequencing has generated valuable data by identifying genes acting either on metabolic pathways or in associated pathogenicity and virulence. Based on available information on these genes, new strategies for studying their expression patterns, such as microarray technology, were employed. A total of 2,600 primer pairs were synthesized and then used to generate fragments using the PCR technique. The arrays were hybridized against cDNAs labeled during reverse transcription reactions and which were obtained from bacteria grown under two different conditions (liquid XDM2 and liquid BCYE). All data were statistically analyzed to verify which genes were differentially expressed. In addition to exploring conditions for X. fastidiosa genome-wide transcriptome analysis, the present work observed the differential expression of several classes of genes (energy, protein, amino acid and nucleotide metabolism, transport, degradation of substances, toxins and hypothetical proteins, among others). The understanding of expressed genes in these two different media will be useful in comprehending the metabolic characteristics of X. fastidiosa, and in evaluating how important certain genes are for the functioning and survival of these bacteria in plants.
MimiLook: A Phylogenetic Workflow for Detection of Gene Acquisition in Major Orthologous Groups of Megavirales.

Science.gov (United States)

Jain, Sourabh; Panda, Arup; Colson, Philippe; Raoult, Didier; Pontarotti, Pierre

2017-04-07

With the inclusion of new members, understanding the evolutionary mechanisms and processes by which members of the proposed order, Megavirales, have evolved has become a key area of interest. The central role of gene acquisition has been shown in previous studies. However, the major drawback in gene acquisition studies is the focus on few MV families or putative families with large variation in their genetic structure. Thus, here we have tried to develop a methodology by which we can detect horizontal gene transfers (HGTs), taking into consideration orthologous groups of distantly related Megavirale families. Here, we report an automated workflow MimiLook, prepared as a Perl command line program, that deduces orthologous groups (OGs) from ORFomes of Megavirales and constructs phylogenetic trees by performing alignment generation, alignment editing and protein-protein BLAST (BLASTP) searching across the National Center for Biotechnology Information (NCBI) non-redundant (nr) protein sequence database. Finally, this tool detects statistically validated events of gene acquisition with the help of the T-REX algorithm by comparing individual gene trees with the NCBI species tree. In between the steps, the workflow decides about handling paralogs, filtering outputs, identifying Megavirale-specific OGs, detection of HGTs, along with retrieval of information about those OGs that are monophyletic with organisms from cellular domains of life. By implementing MimiLook, we noticed that nine percent of Megavirale gene families (i.e., OGs) have been acquired by HGT, 80% of OGs were Megavirale-specific, eight percent were found to share common ancestry with members of cellular domains (Eukaryote, Bacteria, Archaea, Phages or other viruses) and three percent were ambivalent. The results are briefly discussed to emphasize methodology. Also, MimiLook is relevant for detecting evolutionary scenarios in other targeted phyla with user defined modifications. It can be accessed at
Transcriptomic analysis of ‘Suli’ pear (Pyrus pyrifolia white pear group) buds during the dormancy by RNA-Seq

Directory of Open Access Journals (Sweden)

Liu Guoqin

2012-12-01

Background. Bud dormancy is a critical developmental process that allows perennial plants to survive unfavorable environmental conditions. Pear is one of the most important deciduous fruit trees in the world, but the mechanisms regulating bud dormancy in this species are unknown. Because genomic information for pear is currently unavailable, transcriptome and digital gene expression data for this species would be valuable resources to better understand the molecular and biological mechanisms regulating its bud dormancy. Results. We performed de novo transcriptome assembly and digital gene expression (DGE) profiling analyses of ‘Suli’ pear (Pyrus pyrifolia white pear group) using the Illumina RNA-seq system. RNA-Seq generated approximately 100 M high-quality reads that were assembled into 69,393 unigenes (mean length = 853 bp), including 14,531 clusters and 34,194 singletons. A total of 51,448 (74.1%) unigenes were annotated using public protein databases with a cut-off E-value of 10^-5. We mainly compared gene expression levels at four time-points during bud dormancy. Between Nov. 15 and Dec. 15, Dec. 15 and Jan. 15, and Jan. 15 and Feb. 15, 1,978, 1,024, and 3,468 genes were differentially expressed, respectively. Hierarchical clustering analysis arranged 190 significantly differentially-expressed genes into seven groups. Seven genes were randomly selected to confirm their expression levels using quantitative real-time PCR. Conclusions. The new transcriptomes offer comprehensive sequence and DGE profiling data for a dynamic view of transcriptomic variation during bud dormancy in pear. These data provided a basis for future studies of metabolism during bud dormancy in non-model but economically-important perennial species.
A Model-Based Joint Identification of Differentially Expressed Genes and Phenotype-Associated Genes

Science.gov (United States)

Seo, Minseok; Shin, Su-kyung; Kwon, Eun-Young; Kim, Sung-Eun; Bae, Yun-Jung; Lee, Seungyeoun; Sung, Mi-Kyung; Choi, Myung-Sook; Park, Taesung

2016-01-01

Over the last decade, many analytical methods and tools have been developed for microarray data. The detection of differentially expressed genes (DEGs) among different treatment groups is often a primary purpose of microarray data analysis. In addition, association studies investigating the relationship between genes and a phenotype of interest such as survival time are also popular in microarray data analysis. Phenotype association analysis provides a list of phenotype-associated genes (PAGs). However, it is sometimes necessary to identify genes that are both DEGs and PAGs. We consider the joint identification of DEGs and PAGs in microarray data analyses. The first approach we used was a naïve approach that detects DEGs and PAGs separately and then identifies the genes in an intersection of the list of PAGs and DEGs. The second approach we considered was a hierarchical approach that detects DEGs first and then chooses PAGs from among the DEGs or vice versa. In this study, we propose a new model-based approach for the joint identification of DEGs and PAGs. Unlike the previous two-step approaches, the proposed method identifies genes simultaneously that are DEGs and PAGs. This method uses standard regression models but adopts different null hypothesis from ordinary regression models, which allows us to perform joint identification in one-step. The proposed model-based methods were evaluated using experimental data and simulation studies. The proposed methods were used to analyze a microarray experiment in which the main interest lies in detecting genes that are both DEGs and PAGs, where DEGs are identified between two diet groups and PAGs are associated with four phenotypes reflecting the expression of leptin, adiponectin, insulin-like growth factor 1, and insulin. Model-based approaches provided a larger number of genes, which are both DEGs and PAGs, than other methods. Simulation studies showed that they have more power than other methods. Through analysis of
A Model-Based Joint Identification of Differentially Expressed Genes and Phenotype-Associated Genes.

Directory of Open Access Journals (Sweden)

Samuel Sunghwan Cho

Over the last decade, many analytical methods and tools have been developed for microarray data. The detection of differentially expressed genes (DEGs) among different treatment groups is often a primary purpose of microarray data analysis. In addition, association studies investigating the relationship between genes and a phenotype of interest such as survival time are also popular in microarray data analysis. Phenotype association analysis provides a list of phenotype-associated genes (PAGs). However, it is sometimes necessary to identify genes that are both DEGs and PAGs. We consider the joint identification of DEGs and PAGs in microarray data analyses. The first approach we used was a naïve approach that detects DEGs and PAGs separately and then identifies the genes in an intersection of the list of PAGs and DEGs. The second approach we considered was a hierarchical approach that detects DEGs first and then chooses PAGs from among the DEGs or vice versa. In this study, we propose a new model-based approach for the joint identification of DEGs and PAGs. Unlike the previous two-step approaches, the proposed method identifies genes simultaneously that are DEGs and PAGs. This method uses standard regression models but adopts different null hypothesis from ordinary regression models, which allows us to perform joint identification in one-step. The proposed model-based methods were evaluated using experimental data and simulation studies. The proposed methods were used to analyze a microarray experiment in which the main interest lies in detecting genes that are both DEGs and PAGs, where DEGs are identified between two diet groups and PAGs are associated with four phenotypes reflecting the expression of leptin, adiponectin, insulin-like growth factor 1, and insulin. Model-based approaches provided a larger number of genes, which are both DEGs and PAGs, than other methods. Simulation studies showed that they have more power than other methods.
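The two baseline strategies described in this record, the naïve intersection and the hierarchical selection, can be sketched in a few lines. The sketch assumes per-gene p-values are already available and uses a Bonferroni cut-off purely for simplicity; the abstract does not say which tests or corrections the authors used, the gene names and p-values are made up, and the proposed model-based method itself is not implemented here.

```python
def bonferroni_hits(pvalues, alpha=0.05):
    """Genes whose p-value passes a Bonferroni threshold over the supplied tests."""
    if not pvalues:
        return set()
    cutoff = alpha / len(pvalues)
    return {gene for gene, p in pvalues.items() if p <= cutoff}


def naive_joint(deg_p, pag_p, alpha=0.05):
    """Naive approach: call DEGs and PAGs separately, then intersect the lists."""
    return bonferroni_hits(deg_p, alpha) & bonferroni_hits(pag_p, alpha)


def hierarchical_joint(deg_p, pag_p, alpha=0.05):
    """Hierarchical approach: call DEGs first, then test PAGs only among the DEGs."""
    degs = bonferroni_hits(deg_p, alpha)
    restricted = {gene: pag_p[gene] for gene in degs if gene in pag_p}
    return bonferroni_hits(restricted, alpha)


if __name__ == "__main__":
    # Hypothetical p-values for four genes, for illustration only.
    deg_p = {"g1": 0.001, "g2": 0.002, "g3": 0.5, "g4": 0.004}
    pag_p = {"g1": 0.003, "g2": 0.015, "g3": 0.001, "g4": 0.9}
    print("naive:", sorted(naive_joint(deg_p, pag_p)))
    print("hierarchical:", sorted(hierarchical_joint(deg_p, pag_p)))
```

In this toy input the hierarchical route keeps one more gene than the intersection, because the second-stage correction is applied over fewer tests.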
Analysis of cassava (Manihot esculenta) ESTs: A tool for the discovery of genes

International Nuclear Information System (INIS)

Zapata, Andres; Neme, Rafik; Sanabria, Carolina; Lopez, Camilo

2011-01-01

Cassava (Manihot esculenta) is the main source of calories for more than 1,000 million people around the world and has been consolidated as the fourth most important crop after rice, corn and wheat. Cassava is considered tolerant to abiotic and biotic stress conditions; nevertheless, these characteristics are mainly present in non-commercial varieties. Genetic breeding strategies represent an alternative to introduce the desirable characteristics into commercial varieties. A fundamental step for accelerating the genetic breeding process in cassava is the identification of genes associated with these characteristics. One rapid strategy for the identification of genes is to have a large collection of ESTs (expressed sequence tags). In this study, a complete analysis of cassava ESTs was done. The cassava ESTs represent 80,459 sequences which were assembled into a set of 29,231 unique genes (unigenes), comprising 10,945 contigs and 18,286 singletons. These 29,231 unique genes represent about 80% of the genes of the cassava genome. Between 5% and 10% of the cassava unigenes show no similarity to any sequences present in the NCBI database and could be considered cassava-specific genes. A functional category was assigned to a group of sequences of the unigene set (29%) following the Gene Ontology vocabulary. The molecular function component was the best represented with 43% of the sequences, followed by the biological process component (38%) and finally the cellular component with 19%. In the cassava EST collection, 3,709 microsatellites were identified and they could be used as molecular markers. This study represents an important contribution to the knowledge of the functional genomic structure of cassava and constitutes an important tool for the identification of genes associated with agricultural characteristics of interest that could be employed in cassava breeding programs.
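Microsatellite (simple sequence repeat) detection of the kind mentioned in this record is often done by scanning sequences for short motifs repeated in tandem. The sketch below uses a regular expression for that purpose; the minimum repeat counts, the motif lengths considered and the function names are assumptions made here, since the abstract does not state which tool or criteria the authors used.

```python
import re

# Hypothetical minimum number of tandem repeats per motif length.
MIN_REPEATS = {2: 6, 3: 5, 4: 4}


def is_primitive(motif):
    """True if the motif is not itself a repetition of a shorter unit."""
    n = len(motif)
    return not any(n % k == 0 and motif == motif[:k] * (n // k)
                   for k in range(1, n))


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return (start, motif, repeat_count) for each tandem repeat found."""
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in sorted(min_repeats.items()):
        # A motif of the given length followed by at least (min_rep - 1) copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            if not is_primitive(motif):
                continue  # skip homopolymers and motifs like ATAT
            hits.append((match.start(), motif, len(match.group(0)) // motif_len))
    return hits


if __name__ == "__main__":
    print(find_ssrs("GG" + "AT" * 6 + "CC"))
    print(find_ssrs("TTT" + "CAG" * 5 + "A"))
```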
R229Q Polymorphism of NPHS2 Gene in Group of Iraqi Children with Steroid-Resistant Nephrotic Syndrome

Directory of Open Access Journals (Sweden)

Shatha Hussain Ali

2017-01-01

Background. The polymorphism R229Q is one of the most commonly reported podocin sequence variations among steroid-resistant nephrotic syndromes (SRNS). Aim of the Study. We investigated the frequency and risk of this polymorphism among a group of Iraqi children with SRNS and steroid-sensitive nephrotic syndrome (SSNS). Patients and Methods. A prospective case-control study was conducted in Al-Imamein Al-Kadhimein Medical City, spanning the period from the 1st of April 2015 to the 30th of November 2015. The study sample consisted of 54 children with NS, divided into 2 groups: the patient group consisted of 27 children with SRNS, and the control group comprised 27 children with SSNS. Both groups were screened by real-time polymerase chain reaction for R229Q in exon 5 of the NPHS2 gene. Results. The molecular study showed the R229Q polymorphism in 96.3% of SRNS and 100% of SSNS patients. No phenotypic or histologic characteristics distinguished patients bearing the homozygous R229Q polymorphism from patients with the heterozygous R229Q polymorphism. Conclusion. Polymorphism R229Q of the NPHS2 gene is prevalent in Iraqi children with SRNS and SSNS. Further study of other exons and polymorphisms of the NPHS2 gene in those patients is needed.
GWATCH: a web platform for automated gene association discovery analysis

Science.gov (United States)

2014-01-01

Background. As genome-wide sequence analyses for complex human disease determinants are expanding, it is increasingly necessary to develop strategies to promote discovery and validation of potential disease-gene associations. Findings. Here we present a dynamic web-based platform – GWATCH – that automates and facilitates four steps in genetic epidemiological discovery: 1) Rapid gene association search and discovery analysis of large genome-wide datasets; 2) Expanded visual display of gene associations for genome-wide variants (SNPs, indels, CNVs), including Manhattan plots, 2D and 3D snapshots of any gene region, and a dynamic genome browser illustrating gene association chromosomal regions; 3) Real-time validation/replication of candidate or putative genes suggested from other sources, limiting Bonferroni genome-wide association study (GWAS) penalties; 4) Open data release and sharing by eliminating privacy constraints (The National Human Genome Research Institute (NHGRI) Institutional Review Board (IRB), informed consent, The Health Insurance Portability and Accountability Act (HIPAA) of 1996 etc.) on unabridged results, which allows for open access comparative and meta-analysis. Conclusions. GWATCH is suitable for both GWAS and whole genome sequence association datasets. We illustrate the utility of GWATCH with three large genome-wide association studies for HIV-AIDS resistance genes screened in large multicenter cohorts; however, association datasets from any study can be uploaded and analyzed by GWATCH. PMID:25374661
Molecular characterization and expression analysis of a heat shock protein 90 gene from disk abalone (Haliotis discus).

Science.gov (United States)

Wang, Ning; Whang, Ilson; Lee, Jae-Seong; Lee, Jehee

2011-06-01

Heat shock protein 90s (hsp90s) are chaperones that contribute to the proper folding of cellular proteins and help animals cope with cellular protein damage under stress conditions. In this study, an hsp90 gene was isolated from disk abalone (Haliotis discus). The complete nucleotide sequence of the hsp90 gene contains an open reading frame of 2,184 base pairs, encoding an 84 kDa protein. Disk abalone hsp90 shares high sequence similarity with other hsp90 family proteins. Although the phylogenetic analysis did not classify it into the hsp90α group, the inducibility of this gene was confirmed by heat shock and lipopolysaccharide (LPS) challenge tests. The disk abalone hsp90 gene displayed a rapid and reversible induction response to both typical heat shock exposure and the LPS challenge. After the sublethal heat shock treatment, the transcription of the disk abalone hsp90 gene was significantly up-regulated. With a recovery of 12 h, the transcription of the disk abalone hsp90 gene gradually attenuated to the control level. These observations faithfully reflected the feedback regulation of abalone heat shock responses. In response to LPS challenge, the transcription of the disk abalone hsp90 gene was significantly increased within 2 h, approached maximum induction at 4 h and finally returned to the reference level by 24 h. Taken together, the cloning and expression analysis of the disk abalone hsp90 gene provided useful molecular information on abalone responses under stress conditions and potential ways to monitor chronic stressors in abalone culture environments and diagnose animal health status.

Confirmation of Two Sibling Species among Anopheles fluviatilis Mosquitoes in South and Southeastern Iran by Analysis of Cytochrome Oxidase I Gene.

Science.gov (United States)

Naddaf, Saied Reza; Oshaghi, Mohammad Ali; Vatandoost, Hassan

2012-12-01

Anopheles fluviatilis, one of the major malaria vectors in Iran, is assumed to be a complex of sibling species. The aim of this study was to evaluate the cytochrome oxidase I (COI) gene alongside 28S-D3 as a diagnostic tool for identification of An. fluviatilis sibling species in Iran. DNA samples from 24 An. fluviatilis mosquitoes collected in different geographical areas of south and southeastern Iran were used for amplification of the COI gene followed by sequencing. The 474-475 bp COI sequences obtained in this study were aligned with 59 similar sequences of An. fluviatilis and a sequence of Anopheles minimus, used as the outgroup, from the GenBank database. The distances between groups and between individual sequences were calculated, and a phylogenetic tree for the obtained sequences was generated by the neighbor-joining method using the Kimura two-parameter (K2P) model. Phylogenetic analysis using the COI gene grouped members from Fars Province (central Iran) in two distinct clades separate from other Iranian members representing Hormozgan, Kerman, and Sistan va Baluchestan Provinces. The mean distance between Iranian and Indian individuals was 1.66%, whereas the value between Fars Province individuals and the group comprising individuals from other areas of Iran was 2.06%. A mean distance of 2.06% between individuals from Fars Province and those from other areas of Iran is indicative of at least two sibling species among An. fluviatilis mosquitoes of Iran. This finding confirms earlier results based on RAPD-PCR and 28S-D3 analysis.
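The K2P distances mentioned in this abstract can be illustrated with a minimal sketch. It assumes two already aligned sequences and uses the standard Kimura two-parameter formula d = -1/2 ln((1 - 2P - Q) sqrt(1 - 2Q)), where P and Q are the proportions of transition and transversion differences. The function name `k2p_distance` and the rule of skipping gapped or ambiguous sites are illustrative assumptions, not details taken from the study.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq1, seq2):
    """Kimura two-parameter distance between two aligned DNA sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # assumption: skip gaps and ambiguous bases
        compared += 1
        if a == b:
            continue
        # A<->G and C<->T are transitions; everything else is a transversion
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))
```

Applied to every pair of aligned COI sequences, a function like this would yield the pairwise distance matrix that a neighbor-joining program takes as input.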
Uniform approximation is more appropriate for Wilcoxon Rank-Sum Test in gene set analysis.

Directory of Open Access Journals (Sweden)

Zhide Fang

Gene set analysis is widely used to facilitate biological interpretation in analyses of differential expression from high-throughput profiling data. The Wilcoxon Rank-Sum (WRS) test is one of the commonly used methods in gene set enrichment analysis. It compares the ranks of genes in a gene set against those of genes outside the gene set. This method is easy to implement, and it eliminates the dichotomization of genes into significant and non-significant in competitive hypothesis testing. Due to the large number of genes being examined, it is impractical to calculate the exact null distribution for the WRS test. Therefore, the normal distribution is commonly used as an approximation. However, as we demonstrate in this paper, the normal approximation is problematic when a gene set with a relatively small number of genes is tested against the large number of genes in the complementary set. In this situation, a uniform approximation is substantially more powerful, more accurate, and less computationally intensive. We demonstrate the advantage of the uniform approximation in Gene Ontology (GO) term analysis using simulations and real data sets.
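For orientation, the conventional normal approximation that this abstract criticizes can be sketched as follows. The sketch covers only that baseline; the uniform approximation proposed by the authors is not described in enough detail here to reproduce. The function names are illustrative, and tie correction of the variance is omitted for brevity.

```python
import math


def rank_sum_z(set_scores, other_scores):
    """z-score of the Wilcoxon rank-sum statistic under the normal approximation."""
    m, n = len(set_scores), len(other_scores)
    pooled = sorted([(s, 1) for s in set_scores] + [(s, 0) for s in other_scores])
    ranks = [0.0] * len(pooled)
    i = 0
    while i < len(pooled):
        j = i
        while j + 1 < len(pooled) and pooled[j + 1][0] == pooled[i][0]:
            j += 1
        mid_rank = (i + j) / 2 + 1  # tied scores share their average rank
        for k in range(i, j + 1):
            ranks[k] = mid_rank
        i = j + 1
    # W = sum of the ranks of the genes that belong to the gene set
    w = sum(r for r, (_, in_set) in zip(ranks, pooled) if in_set)
    mean = m * (m + n + 1) / 2
    var = m * n * (m + n + 1) / 12  # assumption: no tie correction
    return (w - mean) / math.sqrt(var)


def two_sided_p(z):
    """Two-sided p-value from a standard normal z-score."""
    return math.erfc(abs(z) / math.sqrt(2))
```

When m is small relative to n, the true distribution of W is far from normal, which is the situation the abstract identifies as problematic.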
Analysis of human reticulocyte genes reveals altered erythropoiesis: potential use to detect recombinant human erythropoietin doping.

Science.gov (United States)

Varlet-Marie, Emmanuelle; Audran, Michel; Lejeune, Mireille; Bonafoux, Béatrice; Sicart, Marie-Therese; Marti, Jacques; Piquemal, David; Commes, Thérèse

2004-08-01

Enhancement of oxygen delivery to tissues is associated with improved sporting performance. One way of enhancing oxygen delivery is to take recombinant human erythropoietin (rHuEpo), which is an unethical and potentially dangerous practice. However, detection of the use of rHuEpo remains difficult in situations such as: i) several days after the end of treatment; ii) when a treatment with low doses is conducted; iii) if the rHuEpo effect is increased by other substances. In an attempt to detect rHuEpo abuse, we selected erythroid gene markers from a SAGE library and analyzed the effects of rHuEpo administration on expression of the HBB, FTL and OAZ genes. Ten athletes were assigned to the rHuEpo or placebo group. The rHuEpo group received subcutaneous injections of rHuEpo (50 IU/kg three times a week for 4 weeks, then 20 IU/kg three times a week for 2 weeks). HBB, FTL and OAZ gene profiles were monitored by real-time polymerase chain reaction (PCR) quantification during and for 3 weeks after drug administration. The global analysis of these targeted genes in whole blood samples showed a characteristic profile in subjects misusing rHuEpo, with an increase above the threshold levels. The individual analysis of OAZ mRNA seemed indicative of rHuEpo treatment. The performance-enhancing effect of rHuEpo treatment outlasts the hematologic changes associated with rHuEpo misuse. Although direct electrophoretic methods to detect rHuEpo have been developed, recombinant isoforms of rHuEpo are not detectable some days after the last subcutaneous injection. To overcome these limitations, indirect OFF models have been developed. Our data suggest that, in the near future, it will be possible to consolidate results achievable with the OFF models by analyzing selected erythroid gene markers as a supplement to indirect methods.
Plasminogen activator inhibitor-1 4G/5G gene polymorphism and coronary artery disease in the Chinese Han population: a meta-analysis.

Directory of Open Access Journals (Sweden)

Yan-yan Li

BACKGROUND: The plasminogen activator inhibitor-1 (PAI-1) 4G/5G gene polymorphism has been reported to be correlated with coronary artery disease (CAD) susceptibility, but study results are still debatable. OBJECTIVE AND METHODS: The present meta-analysis was performed to investigate the association between the PAI-1 4G/5G gene polymorphism and CAD in the Chinese Han population. A total of 879 CAD patients and 628 controls from eight separate studies were involved. The pooled odds ratio (OR) for the distribution of the 4G allele frequency of the PAI-1 4G/5G gene and its corresponding 95% confidence interval (CI) were assessed by the random effect model. RESULTS: The 4G allele frequency was 0.61 for the CAD group and 0.51 for the control group. The association between the PAI-1 4G/5G gene polymorphism and CAD in the Chinese Han population was significant under an allelic genetic model (OR = 1.70, 95% CI = 1.18 to 2.44, P = 0.004). The heterogeneity test was also significant (P<0.0001). Meta-regression was performed to explore the source of heterogeneity. Among the confounding factors, the heterogeneity could be explained by the publication year (P = 0.017), study region (P = 0.014), control group sample size (P = 0.011), total sample size (P = 0.011), and ratio of the case to the control group sample size (RR) (P = 0.019). In a stratified analysis by total sample size, significantly increased risk was detected only in subgroup 2 under an allelic genetic model (OR = 1.93, 95% CI = 1.09 to 3.35, P = 0.02). CONCLUSIONS: In the Chinese Han population, the PAI-1 4G/5G gene polymorphism appears to be associated with increased CAD risk. Carriers of the 4G allele of the PAI-1 4G/5G gene might be predisposed to CAD.
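A pooled odds ratio under a random effect model, as named in this abstract, can be sketched as below. The abstract does not say which estimator was used; this sketch assumes the widely used DerSimonian-Laird method with inverse-variance weights on the log odds ratio. The function name and the layout of the input tuples are hypothetical, and the per-study allele counts of the eight studies are not given in the abstract, so none are reproduced.

```python
import math


def pooled_odds_ratio(tables):
    """DerSimonian-Laird pooled OR with a 95% CI.

    tables: list of (a, b, c, d) allele counts per study, where
    a = case 4G, b = case 5G, c = control 4G, d = control 5G.
    Returns (pooled_or, ci_low, ci_high, tau2).
    """
    log_or = [math.log((a * d) / (b * c)) for a, b, c, d in tables]
    var = [1 / a + 1 / b + 1 / c + 1 / d for a, b, c, d in tables]
    w = [1 / v for v in var]  # fixed-effect weights
    fixed = sum(wi * yi for wi, yi in zip(w, log_or)) / sum(w)
    # Cochran's Q measures between-study heterogeneity
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, log_or))
    scale = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    tau2 = max(0.0, (q - (len(tables) - 1)) / scale) if scale > 0 else 0.0
    w_re = [1 / (v + tau2) for v in var]  # random-effects weights
    mu = sum(wi * yi for wi, yi in zip(w_re, log_or)) / sum(w_re)
    se = math.sqrt(1 / sum(w_re))
    return math.exp(mu), math.exp(mu - 1.96 * se), math.exp(mu + 1.96 * se), tau2
```

A significant heterogeneity test, as reported above, corresponds to a large Q and therefore a positive tau2, which widens the confidence interval relative to a fixed-effect analysis.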
Pathway-based analysis of a melanoma genome-wide association study: analysis of genes related to tumour-immunosuppression.

Directory of Open Access Journals (Sweden)

Nils Schoof

Systemic immunosuppression is a risk factor for melanoma, and sunburn-induced immunosuppression is thought to be causal. Genes in immunosuppression pathways are therefore candidate melanoma-susceptibility genes. If variants within these genes individually have a small effect on disease risk, the association may go undetected in genome-wide association (GWA) studies because of low power to reach a high significance level. Pathway-based approaches have been suggested as a method of incorporating a priori knowledge into the analysis of GWA studies. In this study, the association of 1113 single nucleotide polymorphisms (SNPs) in 43 genes (39 genomic regions) related to immunosuppression was analysed using a gene-set approach in 1539 melanoma cases and 3917 controls from the GenoMEL consortium GWA study. The association between melanoma susceptibility and the whole set of tumour-immunosuppression genes, and also predefined functional subgroups of genes, was considered. The analysis was based on a measure formed by summing the evidence from the most significant SNP in each gene, and significance was evaluated empirically by case-control label permutation. An association was found between melanoma and the complete set of genes (p(emp)=0.002), as well as the subgroups related to the generation of tolerogenic dendritic cells (p(emp)=0.006) and secretion of suppressive factors (p(emp)=0.0004), thus providing preliminary evidence of involvement of tumour-immunosuppression gene polymorphisms in melanoma susceptibility. The analysis was repeated on a second phase of the GenoMEL study, which showed no evidence of an association. As one of the first attempts to replicate a pathway-level association, our results suggest that low power and heterogeneity may present challenges.
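The gene-set measure described in this abstract (sum the evidence from the most significant SNP in each gene, then assess significance by permuting case-control labels) can be sketched as follows. The abstract does not specify the per-SNP statistic, so this sketch assumes a simple trend-type score, N times the squared correlation between allele count and case status. All function names and the dictionary layout of the input are hypothetical.

```python
import random


def trend_stat(genotype_column, labels):
    """N * r^2 between allele counts (0/1/2) and case status (0/1)."""
    n = len(labels)
    mg = sum(genotype_column) / n
    my = sum(labels) / n
    sxy = sum((g - mg) * (y - my) for g, y in zip(genotype_column, labels))
    sxx = sum((g - mg) ** 2 for g in genotype_column)
    syy = sum((y - my) ** 2 for y in labels)
    if sxx == 0 or syy == 0:
        return 0.0  # monomorphic SNP or a single phenotype class
    return n * sxy * sxy / (sxx * syy)


def gene_set_stat(snps_by_gene, labels):
    """Sum over genes of the strongest single-SNP statistic in each gene."""
    return sum(
        max(trend_stat(col, labels) for col in cols)
        for cols in snps_by_gene.values()
    )


def empirical_p(snps_by_gene, labels, n_perm=1000, seed=0):
    """Empirical p-value from case-control label permutation."""
    rng = random.Random(seed)
    observed = gene_set_stat(snps_by_gene, labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if gene_set_stat(snps_by_gene, shuffled) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)  # add-one rule avoids reporting p = 0
```

Permuting labels while leaving genotypes intact preserves the linkage disequilibrium within each gene, so taking the maximum over a gene's SNPs is treated the same way in the observed and the permuted data.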
Survival associated pathway identification with group Lp penalized global AUC maximization

Directory of Open Access Journals (Sweden)

Liu Zhenqiu

2010-08-01

Full Text Available Abstract It has been demonstrated that genes in a cell do not act independently. They interact with one another to complete certain biological processes or to implement certain molecular functions. How to incorporate biological pathways or functional groups into the model and identify survival associated gene pathways is still a challenging problem. In this paper, we propose a novel iterative gradient based method for survival analysis with group Lp penalized global AUC summary maximization. Unlike LASSO, Lp (p 1. We first extend Lp for individual gene identification to group Lp penalty for pathway selection, and then develop a novel iterative gradient algorithm for penalized global AUC summary maximization (IGGAUCS. This method incorporates the genetic pathways into global AUC summary maximization and identifies survival associated pathways instead of individual genes. The tuning parameters are determined using 10-fold cross validation with training data only. The prediction performance is evaluated using test data. We apply the proposed method to survival outcome analysis with gene expression profile and identify multiple pathways simultaneously. Experimental results with simulation and gene expression data demonstrate that the proposed procedures can be used for identifying important biological pathways that are related to survival phenotype and for building a parsimonious model for predicting the survival times.
Replication of type 2 diabetes candidate genes variations in three geographically unrelated Indian population groups.

Science.gov (United States)

Ali, Shafat; Chopra, Rupali; Manvati, Siddharth; Singh, Yoginder Pal; Kaul, Nabodita; Behura, Anita; Mahajan, Ankit; Sehajpal, Prabodh; Gupta, Subash; Dhar, Manoj K; Chainy, Gagan B N; Bhanwer, Amarjit S; Sharma, Swarkar; Bamezai, Rameshwar N K

2013-01-01

Type 2 diabetes (T2D) is a syndrome of multiple metabolic disorders and is genetically heterogeneous. India comprises one of the largest global populations with highest number of reported type 2 diabetes cases. However, limited information about T2D associated loci is available for Indian populations. It is, therefore, pertinent to evaluate the previously associated candidates as well as identify novel genetic variations in Indian populations to understand the extent of genetic heterogeneity. We chose to do a cost effective high-throughput mass-array genotyping and studied the candidate gene variations associated with T2D in literature. In this case-control candidate genes association study, 91 SNPs from 55 candidate genes have been analyzed in three geographically independent population groups from India. We report the genetic variants in five candidate genes: TCF7L2, HHEX, ENPP1, IDE and FTO, are significantly associated (after Bonferroni correction, ppopulation. Interestingly, SNP rs7903146 of the TCF7L2 gene passed the genome wide significance threshold (combined P value = 2.05E-08) in the studied populations. We also observed the association of rs7903146 with blood glucose (fasting and postprandial) levels, supporting the role of TCF7L2 gene in blood glucose homeostasis. Further, we noted that the moderate risk provided by the independently associated loci in combined population with Odds Ratio (OR)<1.38 increased to OR = 2.44, (95%CI = 1.67-3.59) when the risk providing genotypes of TCF7L2, HHEX, ENPP1 and FTO genes were combined, suggesting the importance of gene-gene interactions evaluation in complex disorders like T2D.
Group adaptation, formal darwinism and contextual analysis.

Science.gov (United States)

Okasha, S; Paternotte, C

2012-06-01

We consider the question: under what circumstances can the concept of adaptation be applied to groups, rather than individuals? Gardner and Grafen (2009, J. Evol. Biol. 22: 659-671) develop a novel approach to this question, building on Grafen's 'formal Darwinism' project, which defines adaptation in terms of links between evolutionary dynamics and optimization. They conclude that only clonal groups, and to a lesser extent groups in which reproductive competition is repressed, can be considered as adaptive units. We re-examine the conditions under which the selection-optimization links hold at the group level. We focus on an important distinction between two ways of understanding the links, which have different implications regarding group adaptationism. We show how the formal Darwinism approach can be reconciled with G.C. Williams' famous analysis of group adaptation, and we consider the relationships between group adaptation, the Price equation approach to multi-level selection, and the alternative approach based on contextual analysis. © 2012 The Authors. Journal of Evolutionary Biology © 2012 European Society For Evolutionary Biology.
Transcriptional response of polycomb group genes to status epilepticus in mice is modified by prior exposure to epileptic preconditioning

Directory of Open Access Journals (Sweden)

James Reynolds

2015-03-01

Exposure of the brain to brief, non-harmful seizures can activate protective mechanisms that temporarily generate a damage-refractory state. This process, termed epileptic tolerance, is associated with large-scale down-regulation of gene expression. Polycomb group proteins are master controllers of gene silencing during development that are re-activated by injury to the brain. Here we explored the transcriptional response of genes associated with polycomb repressor complex (PRC) 1 (Ring1A, Ring1B and Bmi1) and PRC2 (Ezh1, Ezh2 and Suz12), as well as the additional transcriptional regulators Sirt1, Yy1 and Yy2, in a mouse model of status epilepticus. Findings were contrasted with changes after status epilepticus in mice previously given brief seizures to evoke tolerance. Real-time quantitative PCR showed that status epilepticus prompted an early (1 h) increase in expression of several genes in PRC1 and PRC2 in the hippocampus, followed by down-regulation of many of the same genes at later time points (4, 8 and 24 h). Spatio-temporal differences were found among PRC2 genes in epileptic tolerance, including increased expression of Ezh2, Suz12 and Yy2 relative to the normal injury response to status epilepticus. In contrast, PRC1 complex genes including Ring1B and Bmi1 displayed differential down-regulation in epileptic tolerance. The present study characterizes polycomb group gene expression following status epilepticus and shows that prior seizure exposure produces select changes to PRC1 and PRC2 composition that may influence differential gene expression in epileptic tolerance.
THE ANALYSIS OF METHYLENETETRAHYDROFOLATE REDUCTASE GENE POLYMORPHISM IN THE PATIENTS WITH ARTERIAL HYPERTENSION IN THE REPUBLIC OF MORDOVIA

Directory of Open Access Journals (Sweden)

Lyudmila Goncharova

2016-03-01

Hypertension (HTN or HT) is the main risk factor for cerebrovascular accidents and myocardial infarction, since it leads to imbalances in the vascular and thrombocytic components of hemostasis. In most cases, HTN is genetic in nature. Mutation of the methylenetetrahydrofolate reductase (MTHFR) gene at positions C677T and A1298C is supposedly one of the major factors in the development of this condition. A high percentage of patients with complicated hypertension persists in the Republic of Mordovia, so the article provides an analysis of MTHFR gene polymorphism in patients with primary hypertension of Mordvinian and Russian ethnicity residing in the Republic. Materials and Methods The study involved 113 patients (50.4% of Mordvinian and 49.6% of Russian nationality) with hypertension (stages II-III in the classification of the Society of Cardiology of the Russian Federation, year 2008, BP <140/90 mm Hg). Along with the traditional clinical and instrumental studies, the authors identified alleles of polymorphic markers by the polymerase chain reaction method. Statistical analysis was performed with the software packages “Statistica for Windows 6.0” (StatSoft), “SPSS” (version 14.0) and “MS Excel XP” (Microsoft). The authors used the χ2 test to compare the frequencies of genotypes and alleles in individual groups of patients. Results Analysis of the distribution of genotypes of the MTHFR gene at positions 677 and A1298C revealed the predominance of the intermediate genotypes CT and AC in male and female patients with hypertension, with no correlation to nationality. The adverse CT genotype of the MTHFR gene at position 677 is found in 20% of patients with hypertension among Mordvinian males and 2.5% among hypertensive Russian females. The pathological CC genotype of the MTHFR gene at the A1298C position was identified in both Mordvinian (from 2.3% to 27%) and Russian (from 19.3% to 33.7%) patients. Discussion and
Analysis of pharmacogenomic variants associated with population differentiation.

Science.gov (United States)

Yeon, Bora; Ahn, Eunyong; Kim, Kyung-Im; Kim, In-Wha; Oh, Jung Mi; Park, Taesung

2015-01-01

In the present study, we systematically investigated population differentiation of drug-related (DR) genes in order to identify common genetic features underlying population-specific responses to drugs. To do so, we used the International HapMap project release 27 Data and Pharmacogenomics Knowledge Base (PharmGKB) database. First, we compared four measures for assessing population differentiation: the chi-square test, the analysis of variance (ANOVA) F-test, Fst, and Nearest Shrunken Centroid Method (NSCM). Fst showed high sensitivity with stable specificity among varying sample sizes; thus, we selected Fst for determining population differentiation. Second, we divided DR genes from PharmGKB into two groups based on the degree of population differentiation as assessed by Fst: genes with a high level of differentiation (HD gene group) and genes with a low level of differentiation (LD gene group). Last, we conducted a gene ontology (GO) analysis and pathway analysis. Using all genes in the human genome as the background, the GO analysis and pathway analysis of the HD genes identified terms related to cell communication. "Cell communication" and "cell-cell signaling" had the lowest Benjamini-Hochberg's q-values (0.0002 and 0.0006, respectively), and "drug binding" was highly enriched (16.51) despite its relatively high q-value (0.0142). Among the 17 genes related to cell communication identified in the HD gene group, five genes (STX4, PPARD, DCK, GRIK4, and DRD3) contained single nucleotide polymorphisms with Fst values greater than 0.5. Specifically, the Fst values for rs10871454, rs6922548, rs3775289, rs1954787, and rs167771 were 0.682, 0.620, 0.573, 0.531, and 0.510, respectively. In the analysis using DR genes as the background, the HD gene group contained six significant terms. Five were related to reproduction, and one was "Wnt signaling pathway," which has been implicated in cancer. Our analysis suggests that the HD gene group from PharmGKB is associated with
Gene Frequency and Heritability of Rh Blood Group Gene in 44 Human Populations

Directory of Open Access Journals (Sweden)

Supriyo CHAKRABORTY

2010-09-01

The frequency of the RhD and Rhd alleles of the Rh blood group gene was estimated from RhD phenotypic data in 44 human populations distributed all over the world. The average frequencies of the RhD and Rhd alleles over these populations were 0.70 and 0.30, respectively. A higher frequency of the RhD allele than the expected estimate (0.50) in all the populations, under the Hardy-Weinberg equilibrium condition assuming equal frequency of both alleles in the initial population, indicated inbreeding at the RhD/d locus as well as natural selection for the RhD allele. A very high heritability estimate (84.04%) of Rh allele frequency revealed that this trait was under weak selection pressure, which resulted in greater genetic variation in existing populations. This is consistent with Fisher's fundamental theorem of natural selection. The results from the present study suggest that inbreeding at the RhD/d locus and some other factors (possibly mutation, migration and genetic drift) other than natural selection alone played major roles in changing the Rh allele frequency in these populations.
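The abstract does not spell out how allele frequencies were derived from phenotype counts. A common textbook approach, assumed here, treats Rhd as recessive under Hardy-Weinberg equilibrium, so that the Rh-negative phenotype frequency equals q squared. The function name below is hypothetical.

```python
import math


def rh_allele_frequencies(n_rh_negative, n_total):
    """Return (freq_RhD, freq_Rhd) from phenotype counts.

    Assumes Hardy-Weinberg equilibrium with Rhd recessive, so the
    Rh-negative phenotype frequency equals q**2.
    """
    if n_total <= 0 or not 0 <= n_rh_negative <= n_total:
        raise ValueError("counts must satisfy 0 <= n_rh_negative <= n_total, n_total > 0")
    q = math.sqrt(n_rh_negative / n_total)
    return 1 - q, q
```

Under this assumption, the reported average Rhd frequency of 0.30 corresponds to an Rh-negative phenotype frequency of about 9%.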
Identification of Genetic Susceptibility to Childhood Cancer through Analysis of Genes in Parallel

Science.gov (United States)

Plon, Sharon E.; Wheeler, David A.; Strong, Louise C.; Tomlinson, Gail E.; Pirics, Michael; Meng, Qingchang; Cheung, Hannah C.; Begin, Phyllis R.; Muzny, Donna M.; Lewis, Lora; Biegel, Jaclyn A.; Gibbs, Richard A.

2011-01-01

Clinical cancer genetic susceptibility analysis typically proceeds sequentially beginning with the most likely causative gene. The process is time consuming and the yield is low particularly for families with unusual patterns of cancer. We determined the results of in parallel mutation analysis of a large cancer-associated gene panel. We performed deletion analysis and sequenced the coding regions of 45 genes (8 oncogenes and 37 tumor suppressor or DNA repair genes) in 48 childhood cancer patients who also (1) were diagnosed with a second malignancy under age 30, (2) have a sibling diagnosed with cancer under age 30 and/or (3) have a major congenital anomaly or developmental delay. Deleterious mutations were identified in 6 of 48 (13%) families, 4 of which met the sibling criteria. Mutations were identified in genes previously implicated in both dominant and recessive childhood syndromes including SMARCB1, PMS2, and TP53. No pathogenic deletions were identified. This approach has provided efficient identification of childhood cancer susceptibility mutations and will have greater utility as additional cancer susceptibility genes are identified. Integrating parallel analysis of large gene panels into clinical testing will speed results and increase diagnostic yield. The failure to detect mutations in 87% of families highlights that a number of childhood cancer susceptibility genes remain to be discovered. PMID:21356188
Comparative genomic analysis of the WRKY III gene family in populus, grape, arabidopsis and rice.

Science.gov (United States)

Wang, Yiyi; Feng, Lin; Zhu, Yuxin; Li, Yuan; Yan, Hanwei; Xiang, Yan

2015-09-08

WRKY III genes have significant functions in regulating plant development and resistance. The WRKY gene family has been studied in many plant species; however, a comprehensive analysis of WRKY III genes in the woody plant species poplar is still lacking. Three representative lineages of flowering plant species are incorporated in most analyses: Arabidopsis (a model plant for annual herbaceous dicots), grape (one model plant for perennial dicots) and Oryza sativa (a model plant for monocots). In this study, we identified 10, 6, 13 and 28 WRKY III genes in the genomes of Populus trichocarpa, grape (Vitis vinifera), Arabidopsis thaliana and rice (Oryza sativa), respectively. Phylogenetic analysis revealed that the WRKY III proteins could be divided into four clades. By microsynteny analysis, we found that the duplicated regions were more conserved between poplar and grape than between poplar and Arabidopsis or rice. We dated the duplications by Ks analysis of Populus WRKY III genes and demonstrated that all the blocks were formed after the divergence of monocots and dicots. Strong purifying selection has played a key role in the maintenance of WRKY III genes in Populus. Tissue expression analysis of the WRKY III genes in Populus revealed that five were most highly expressed in the xylem. We also performed quantitative real-time reverse transcription PCR analysis of WRKY III genes in Populus treated with salicylic acid, abscisic acid and polyethylene glycol to explore their stress-related expression patterns. This study highlighted the duplication and diversification of the WRKY III gene family in Populus and provided a comprehensive analysis of this gene family in the Populus genome. Our results indicated that the majority of WRKY III genes in Populus arose through large-scale gene duplication. The expression patterns of the PtrWRKYIII genes indicated that these genes play important roles in the xylem during poplar growth and development, and may play a crucial role in defense to drought
ABCG2 in peptic ulcer: gene expression and mutation analysis.

Science.gov (United States)

Salagacka-Kubiak, Aleksandra; Żebrowska, Marta; Wosiak, Agnieszka; Balcerczak, Mariusz; Mirowski, Marek; Balcerczak, Ewa

2016-08-01

The aim of this study was to evaluate the role of the polymorphism at position C421A and of mRNA expression of the ABCG2 gene in the development of peptic ulcers, a very common and severe disease. ABCG2, encoded by the ABCG2 gene, has been found inter alia in the gastrointestinal tract, where it plays a protective role by eliminating xenobiotics from cells into the extracellular environment. The materials for the study were biopsies of gastric mucosa taken during routine endoscopy. For genotyping at position C421A by polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP), DNA was isolated from 201 samples, while for measurement of the mRNA expression level by real-time PCR, RNA was isolated from 60 patients. The control group of healthy individuals consisted of 97 blood donors. The dominant genotype among both peptic ulcer patients and healthy individuals was homozygous CC. No statistically significant differences were found between healthy individuals and the whole group of peptic ulcer patients or between the subgroups of peptic ulcer patients (infected and uninfected with Helicobacter pylori). ABCG2 expression relative to GAPDH expression was found in 38 of the 60 gastric mucosa samples. The expression level of the gene varied greatly among cases. A statistically significant association between the intensity of H. pylori infection and ABCG2 gene expression was shown (p = 0.0375): the more intense the infection, the higher the level of ABCG2 expression.
Comparative analysis of clustering methods for gene expression time course data

Directory of Open Access Journals (Sweden)

Ivan G. Costa

2004-01-01

This work performs a data-driven comparative study of clustering methods used in the analysis of gene expression time courses (or time series). Five clustering methods found in the literature of gene expression analysis are compared: agglomerative hierarchical clustering, CLICK, dynamical clustering, k-means and self-organizing maps. In order to evaluate the methods, a k-fold cross-validation procedure adapted to unsupervised methods is applied. The accuracy of the results is assessed by comparing the partitions obtained in these experiments with gene annotation, such as protein function and series classification.
Microarray analysis reveals key genes and pathways in Tetralogy of Fallot

Science.gov (United States)

He, Yue-E; Qiu, Hui-Xian; Jiang, Jian-Bing; Wu, Rong-Zhou; Xiang, Ru-Lian; Zhang, Yuan-Hai

2017-01-01

The aim of the present study was to identify key genes that may be involved in the pathogenesis of Tetralogy of Fallot (TOF) using bioinformatics methods. The GSE26125 microarray dataset, which includes cardiovascular tissue samples derived from 16 children with TOF and five healthy age-matched control infants, was downloaded from the Gene Expression Omnibus database. Differential expression analysis was performed between TOF and control samples to identify differentially expressed genes (DEGs) using Student's t-test and the R/limma package, with a log2 fold-change of >2 and a false discovery rate of <0.01 set as thresholds. The biological functions of DEGs were analyzed using the ToppGene database. The ReactomeFIViz application was used to construct functional interaction (FI) networks, and the genes in each module were subjected to pathway enrichment analysis. The iRegulon plugin was used to identify transcription factors predicted to regulate the DEGs in the FI network, and the gene-transcription factor pairs were then visualized using Cytoscape software. A total of 878 DEGs were identified, including 848 upregulated genes and 30 downregulated genes. The gene FI network contained seven function modules, all of which were composed of upregulated genes. Genes in Module 1 were enriched in the following three neurological disorder-associated signaling pathways: Parkinson's disease, Alzheimer's disease and Huntington's disease. Genes in Modules 0, 3 and 5 were predominantly enriched in pathways associated with ribosomes and protein translation. The X-box binding protein 1 transcription factor was demonstrated to be involved in the regulation of genes encoding the subunits of cytoplasmic and mitochondrial ribosomes, as well as genes involved in neurodegenerative disorders. Therefore, dysfunction of genes involved in signaling pathways associated with neurodegenerative disorders, ribosome function and protein translation may contribute to the pathogenesis of TOF.
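The thresholding step described in this abstract (log2 fold-change above 2 and false discovery rate below 0.01) can be illustrated with a minimal sketch. It is not the limma workflow the authors used: it assumes per-gene p-values and log2 fold-changes are already available, applies the Benjamini-Hochberg adjustment, and filters on the absolute fold-change so that both up- and downregulated genes are kept. Function names and default cutoffs are illustrative.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m, 0, -1):  # walk from the largest p-value down
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def select_degs(genes, log2_fc, p_values, fc_cutoff=2.0, fdr_cutoff=0.01):
    """Keep genes passing both the fold-change and the FDR threshold."""
    fdr = benjamini_hochberg(p_values)
    return [
        gene
        for gene, fc, q in zip(genes, log2_fc, fdr)
        if abs(fc) > fc_cutoff and q < fdr_cutoff
    ]
```

Combining an effect-size cutoff with an FDR cutoff, as the abstract describes, keeps genes that are both statistically supported and strongly changed.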
Gene expression signatures in peripheral blood cells from Japanese women exposed to environmental cadmium

International Nuclear Information System (INIS)

Dakeshita, Satoru; Kawai, Tomoko; Uemura, Hirokazu; Hiyoshi, Mineyoshi; Oguma, Etsuko; Horiguchi, Hyogo; Kayama, Fujio; Aoshima, Keiko; Shirahama, Satoshi; Rokutan, Kazuhito; Arisawa, Kokichi

2009-01-01

The objective of this study was to examine the effects of environmental cadmium (Cd) exposure on the gene expression profile of peripheral blood cells, using an original oligoDNA microarray. The study population consisted of 20 female residents in a Cd-polluted area (Cd-exposed group) and 20 female residents in a non-Cd-polluted area individually matched for age (control group). The mRNA levels in Cd-exposed subjects were compared with those in respective controls, using a microarray containing oligoDNA probes for 1867 genes. Median Cd concentrations in blood (3.55 μg/l) and urine (8.25 μg/g creatinine) from the Cd-exposed group were 2.4- and 1.9-times higher than those of the control group, respectively. Microarray analysis revealed that 137 genes were significantly up-regulated and 80 genes were significantly down-regulated in the Cd-exposed group, compared with the control group. The Ingenuity Pathway Analysis Application (IPA) revealed that differentially expressed genes were likely to modify oxidative stress and mitochondria-dependent apoptosis pathways. Among differentially expressed genes, the expression of five genes was positively correlated with Cd concentrations in blood or urine. Quantitative real-time PCR (RT-PCR) analysis validated the significant up-regulation of CASP9, TNFRSF1B, GPX3, HYOU1, SLC3A2, SLC19A1, SLC35A4 and ITGAL, and down-regulation of BCL2A1 and COX7B. After adjustment for differences in the background characteristics of the two groups, we finally identified seven Cd-responsive genes (CASP9, TNFRSF1B, GPX3, SLC3A2, ITGAL, BCL2A1, and COX7B), all of which, according to IPA, constituted a network that controls the oxidative stress response. These seven genes may be marker genes useful for the health risk assessment of chronic low level exposure to Cd.
Hierarchy in the home cage affects behaviour and gene expression in group-housed C57BL/6 male mice.

Science.gov (United States)

Horii, Yasuyuki; Nagasawa, Tatsuhiro; Sakakibara, Hiroyuki; Takahashi, Aki; Tanave, Akira; Matsumoto, Yuki; Nagayama, Hiromichi; Yoshimi, Kazuto; Yasuda, Michiko T; Shimoi, Kayoko; Koide, Tsuyoshi

2017-08-01

Group-housed male mice exhibit aggressive behaviour towards their cage mates and form a social hierarchy. Here, we describe how social hierarchy in standard group-housed conditions affects behaviour and gene expression in male mice. Four male C57BL/6 mice were kept in each cage used in the study, and the social hierarchy was determined from observation of video recordings of aggressive behaviour. After formation of a social hierarchy, the behaviour and hippocampal gene expression were analysed in the mice. Higher anxiety- and depression-like behaviours and elevated gene expression of hypothalamic corticotropin-releasing hormone and hippocampal serotonin receptor subtypes were observed in subordinate mice compared with those of dominant mice. These differences were alleviated by orally administering fluoxetine, which is an antidepressant of the selective serotonin reuptake inhibitor class. We concluded that hierarchy in the home cage affects behaviour and gene expression in male mice, resulting in anxiety- and depression-like behaviours being regulated differently in dominant and subordinate mice.

p13 from group II baculoviruses is a killing-associated gene

Directory of Open Access Journals (Sweden)

Yipeng Qi

2012-12-01

The p13 gene was first described in Leucania separata multinuclear polyhedrosis virus (Ls-p13) several years ago, but the function of P13 protein has not been experimentally investigated to date. In this article, we show that the expression of p13 from Heliothis armigera single nucleocapsid nucleopolyhedrovirus (Ha-p13) was regulated by both early and late promoters. Luciferase assay demonstrated that the activity of the Ha-p13 promoter with the hr4 enhancer was more than 100 times higher in heterologous Sf9 cells than in the natural host Hz-AM1 cells. Both Ls-P13 and Ha-P13 are transmembrane proteins. Confocal microscopic analysis showed that both were mainly located in the cytoplasmic membrane at 48 h. Results of RNA interference indicated that Ha-p13 was a killing-associated gene for the host insect H. armigera. AcMNPV acquired this killing activity and showed a markedly accelerated killing rate when expressing Ls-p13. In conclusion, p13 is a killing-associated gene in both homologous and heterologous nucleopolyhedroviruses.
Disease gene characterization through large-scale co-expression analysis.

Directory of Open Access Journals (Sweden)

Allen Day

2009-12-01

In the post-genome era, a major goal of biology is the identification of specific roles for individual genes. We report a new genomic tool for gene characterization, the UCLA Gene Expression Tool (UGET). Celsius, the largest co-normalized microarray dataset of Affymetrix-based gene expression, was used to calculate the correlation between all possible gene pairs on all platforms, and generate stored indexes in a web-searchable format. The size of Celsius makes UGET a powerful gene characterization tool. Using a small seed list of known cartilage-selective genes, UGET extended the list of known genes by identifying 32 new highly cartilage-selective genes. Of these, 7 of 10 tested were validated by qPCR, including the novel cartilage-specific genes SDK2 and FLJ41170. In addition, we retrospectively tested UGET and other gene expression based prioritization tools to identify disease-causing genes within known linkage intervals. We first demonstrated this utility with UGET using genetically heterogeneous disorders such as Joubert syndrome, microcephaly, neuropsychiatric disorders and type 2 limb girdle muscular dystrophy (LGMD2), and then compared UGET to other gene expression based prioritization programs which use small but discrete and well-annotated datasets. Finally, we observed a significantly higher gene correlation shared between genes in disease networks associated with similar complex or Mendelian disorders. UGET is an invaluable resource for a geneticist that permits the rapid inclusion of expression criteria from one to hundreds of genes in genomic intervals linked to disease. By using thousands of arrays, UGET annotates and prioritizes genes better than other tools, especially with rare tissue disorders or complex multi-tissue biological processes. This information can be critical in prioritization of candidate genes for sequence analysis.
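The seed-list idea described for UGET, ranking candidate genes by how strongly their expression correlates with a few known genes, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: Pearson correlation, averaging over seeds, the function names and the toy expression profiles are hypothetical and are not taken from UGET or Celsius.

```python
import math
from typing import Dict, List, Sequence, Tuple


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation of two equal-length, non-constant profiles."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def rank_by_seed_correlation(
    expr: Dict[str, List[float]], seeds: Sequence[str]
) -> List[Tuple[str, float]]:
    """Rank non-seed genes by their mean correlation with the seed genes."""
    scores = {}
    for gene, profile in expr.items():
        if gene in seeds:
            continue
        total = sum(pearson(profile, expr[seed]) for seed in seeds)
        scores[gene] = total / len(seeds)
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical expression profiles across four arrays.
expr = {
    "seed1": [1.0, 2.0, 3.0, 4.0],
    "seed2": [2.0, 4.0, 6.0, 8.5],
    "cand_up": [0.5, 1.0, 1.5, 2.0],
    "cand_down": [4.0, 3.0, 2.0, 1.0],
}
ranking = rank_by_seed_correlation(expr, ["seed1", "seed2"])
```

Here cand_up tracks both seeds and ranks first, while cand_down is anti-correlated and ranks last. A real tool would precompute and index the correlations rather than recompute them per query, and would need to handle constant profiles, which this sketch does not.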
Stochastic biological response to radiation. Comprehensive analysis of gene expression

International Nuclear Information System (INIS)

Inoue, Tohru; Hirabayashi, Yoko

2012-01-01

Authors explain that the radiation effect on biological systems is stochastic, following the laws of physics, and differs from chemical effects, using instances of Cs-137 gamma-ray (GR) and benzene (BZ) exposures to mice and of resultant comprehensive analyses of gene expression. Single GR irradiation is done with Gamma Cell 40 (CSR) to C57BL/6 or C3H/He mouse at 0, 0.6 and 3 Gy. BZ is given orally at 150 mg/kg/day for 5 days x 2 weeks. Bone marrow cells are sampled 1 month after the exposure. Comprehensive gene expression is analyzed by Gene Chip Mouse Genome 430 2.0 Array (Affymetrix) and data are processed by programs like case normalization, statistics, network generation, functional analysis etc. GR irradiation brings about changes of gene expression, which are classifiable in common genes variable commonly on the dose change and stochastic genes variable stochastically within each dose: e.g., with Welch-t-test, significant differences are between 0/3 Gy (dose-specific difference, 455 pbs (probe set), in stochastic 2113 pbs), 0/0.6 Gy (267 in 1284 pbs) and 0.6/3 Gy (532 pbs); and with one-way analysis of variance (ANOVA) and hierarchical/dendrographic analyses, 520 pbs are shown to involve the dose-dependent 226 and dose-specific 294 pbs. It is also shown that at 3 Gy, expression of common genes is rather suppressed, including those related to the proliferation/apoptosis of B/T cells, and of stochastic genes, related to cell division/signaling. Venn diagram of the common genes of above 520 pbs, stochastic 2113 pbs at 3 Gy and 1284 pbs at 0.6 Gy shows the overlapping genes 29, 2 and 4, respectively, indicating only 35 pbs are overlapping in total. Network analysis of changes by GR shows the rather high expression of genes around hub of cAMP response element binding protein (CREB) at 0.6 Gy, and rather variable expression around CREB hub/suppressed expression of kinesin hub at 3 Gy; in the network by BZ exposure, unchanged or low expression around p53 hub and suppression
Bioinformatics analysis of RNA-seq data revealed critical genes in colon adenocarcinoma.

Science.gov (United States)

Xi, W-D; Liu, Y-J; Sun, X-B; Shan, J; Yi, L; Zhang, T-T

2017-07-01

RNA-seq data of colon adenocarcinoma (COAD) were analyzed with bioinformatics tools to discover critical genes in the disease. Relevant small molecule drugs, transcription factors (TFs) and microRNAs (miRNAs) were also investigated. RNA-seq data of COAD were downloaded from The Cancer Genome Atlas (TCGA). Differential analysis was performed with package edgeR. False positive discovery (FDR) 1 were set as the cut-offs to screen out differentially expressed genes (DEGs). A gene coexpression network was constructed with package EBcoexpress. GO enrichment analysis was performed for the DEGs in the gene coexpression network with DAVID. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis was also performed for the genes with KOBAS 2.0. Modules were identified with MCODE of Cytoscape. Relevant small molecule drugs were predicted by Connectivity Map. Relevant miRNAs and TFs were searched by WebGestalt. A total of 457 DEGs, including 255 up-regulated and 202 down-regulated genes, were identified from 437 COAD and 39 control samples. A gene coexpression network was constructed containing 40 DEGs and 101 edges. The genes were mainly associated with collagen fibril organization, extracellular matrix organization and translation. Two modules were identified from the gene coexpression network, which were implicated in muscle contraction and extracellular matrix organization, respectively. Several critical genes were disclosed, such as MYH11, COL5A2 and ribosomal proteins. Nine relevant small molecule drugs were identified, such as scriptaid and STOCK1N-35874. Accordingly, a total of 17 TFs and 10 miRNAs related to COAD were acquired, such as ETS2, NFAT, AP4, miR-124A, miR-9, miR-96 and let-7. Several critical genes and relevant drugs, TFs and miRNAs were revealed in COAD. These findings could advance the understanding of the disease and benefit therapy development.
The Genome Sequence of Leishmania (Leishmania) amazonensis: Functional Annotation and Extended Analysis of Gene Models

Science.gov (United States)

Real, Fernando; Vidal, Ramon Oliveira; Carazzolle, Marcelo Falsarella; Mondego, Jorge Maurício Costa; Costa, Gustavo Gilson Lacerda; Herai, Roberto Hirochi; Würtele, Martin; de Carvalho, Lucas Miguel; e Ferreira, Renata Carmona; Mortara, Renato Arruda; Barbiéri, Clara Lucia; Mieczkowski, Piotr; da Silveira, José Franco; Briones, Marcelo Ribeiro da Silva; Pereira, Gonçalo Amarante Guimarães; Bahia, Diana

2013-01-01

We present the sequencing and annotation of the Leishmania (Leishmania) amazonensis genome, an etiological agent of human cutaneous leishmaniasis in the Amazon region of Brazil. L. (L.) amazonensis shares features with Leishmania (L.) mexicana but also exhibits unique characteristics regarding geographical distribution and clinical manifestations of cutaneous lesions (e.g. borderline disseminated cutaneous leishmaniasis). Predicted genes were scored for orthologous gene families and conserved domains in comparison with other human pathogenic Leishmania spp. Carboxypeptidase, aminotransferase, and 3′-nucleotidase genes and ATPase, thioredoxin, and chaperone-related domains were represented more abundantly in L. (L.) amazonensis and L. (L.) mexicana species. Phylogenetic analysis revealed that these two species share groups of amastin surface proteins unique to the genus that could be related to specific features of disease outcomes and host cell interactions. Additionally, we describe a hypothetical hybrid interactome of potentially secreted L. (L.) amazonensis proteins and host proteins under the assumption that parasite factors mimic their mammalian counterparts. The model predicts an interaction between an L. (L.) amazonensis heat-shock protein and mammalian Toll-like receptor 9, which is implicated in important immune responses such as cytokine and nitric oxide production. The analysis presented here represents valuable information for future studies of leishmaniasis pathogenicity and treatment. PMID:23857904
Global Gene Expression Analysis of Yeast Cells during Sake Brewing▿ †

Science.gov (United States)

Wu, Hong; Zheng, Xiaohong; Araki, Yoshio; Sahara, Hiroshi; Takagi, Hiroshi; Shimoi, Hitoshi

2006-01-01

During the brewing of Japanese sake, Saccharomyces cerevisiae cells produce a high concentration of ethanol compared with other ethanol fermentation methods. We analyzed the gene expression profiles of yeast cells during sake brewing using DNA microarray analysis. This analysis revealed some characteristics of yeast gene expression during sake brewing and provided a scaffold for a molecular level understanding of the sake brewing process. PMID:16997994
Gene Regulation, Modulation, and Their Applications in Gene Expression Data Analysis

Directory of Open Access Journals (Sweden)

Mario Flores

2013-01-01

Common microarray and next-generation sequencing data analyses concentrate on tumor subtype classification, marker detection, and transcriptional regulation discovery during biological processes by exploring correlated gene expression patterns and their shared functions. Genetic regulatory network (GRN) based approaches have been employed in many large studies in order to scrutinize for dysregulation and potential treatment controls. In addition to gene regulation and network construction, the concept of the network modulator that has significant systemic impact has been proposed, and detection algorithms have been developed in past years. Here we provide a unified mathematical description of these methods, followed by a brief survey of these modulator identification algorithms. As an early attempt to extend the concept to a new RNA regulation mechanism, competitive endogenous RNA (ceRNA), into a modulator framework, we provide two applications to illustrate the network construction, modulation effect, and the preliminary findings from these networks. The methods we surveyed and developed are used to dissect the regulated network under different modulators. Beyond these, the concept of “modulation” can be adapted to various biological mechanisms to discover novel gene regulation mechanisms.
In Silico Identification, Phylogenetic and Bioinformatic Analysis of Argonaute Genes in Plants

Directory of Open Access Journals (Sweden)

Khaled Mirzaei

2014-01-01

The Argonaute protein family is a key player in the pathways of gene silencing and small regulatory RNAs in different organisms. Argonaute proteins can bind small noncoding RNAs and control protein synthesis, affect messenger RNA stability, and even participate in the production of new forms of small RNAs. The aim of this study was to characterize and perform bioinformatic analysis of Argonaute proteins in 32 plant species whose genomes have been sequenced. A total of 437 Argonaute genes were identified and analyzed based on length, gene structure, and protein structure. Results showed that Argonaute proteins were highly conserved across the plant kingdom. Phylogenetic analysis divided plant Argonautes into three classes. Argonaute proteins have three conserved domains: PAZ, MID, and PIWI. In addition to these three conserved domains, we identified a few more domains in the AGO proteins of some plant species. Expression profile analysis showed that expression of Argonaute genes varies in most tissues, which means that these proteins are involved in the regulation of most pathways of the plant system. The numbers of alternative transcripts of Argonaute genes were highly variable among the plants. A thorough analysis of a large number of putative Argonaute genes revealed several interesting aspects associated with this protein and brought novel information with promising usefulness for both basic and biotechnological applications.
Genome-wide comparative analysis of NBS-encoding genes between Brassica species and Arabidopsis thaliana.

Science.gov (United States)

Yu, Jingyin; Tehrim, Sadia; Zhang, Fengqi; Tong, Chaobo; Huang, Junyan; Cheng, Xiaohui; Dong, Caihua; Zhou, Yanqiu; Qin, Rui; Hua, Wei; Liu, Shengyi

2014-01-03

Plant disease resistance (R) genes with the nucleotide binding site (NBS) play an important role in offering resistance to pathogens. The availability of complete genome sequences of Brassica oleracea and Brassica rapa provides an important opportunity for researchers to identify and characterize NBS-encoding R genes in Brassica species and to compare them with analogues in Arabidopsis thaliana based on a comparative genomics approach. However, little is known about the evolutionary fate of NBS-encoding genes in the Brassica lineage after the split from A. thaliana. Here we present a genome-wide analysis of NBS-encoding genes in B. oleracea, B. rapa and A. thaliana. Through the employment of HMM search and manual curation, we identified 157, 206 and 167 NBS-encoding genes in B. oleracea, B. rapa and A. thaliana genomes, respectively. Phylogenetic analysis among the 3 species classified NBS-encoding genes into 6 subgroups. Tandem duplication and whole genome triplication (WGT) analyses revealed that after WGT of the Brassica ancestor, NBS-encoding homologous gene pairs on triplicated regions in the Brassica ancestor were deleted or lost quickly, but NBS-encoding genes in Brassica species experienced species-specific gene amplification by tandem duplication after divergence of B. rapa and B. oleracea. Expression profiling of NBS-encoding orthologous gene pairs indicated the differential expression pattern of retained orthologous gene copies in B. oleracea and B. rapa. Furthermore, evolutionary analysis of CNL type NBS-encoding orthologous gene pairs among the 3 species suggested that orthologous genes in B. rapa have undergone stronger negative selection than those in B. oleracea. For the TNL type, however, there are no significant differences in the orthologous gene pairs between the two species. This study is the first identification and characterization of NBS-encoding genes in B. rapa and B. oleracea based on whole genome sequences. Through tandem duplication and whole genome
Meta-analysis of differentiating mouse embryonic stem cell gene expression kinetics reveals early change of a small gene set.

Directory of Open Access Journals (Sweden)

Clive H Glover

2006-11-01

Stem cell differentiation involves critical changes in gene expression. Identification of these should provide endpoints useful for optimizing stem cell propagation as well as potential clues about mechanisms governing stem cell maintenance. Here we describe the results of a new meta-analysis methodology applied to multiple gene expression datasets from three mouse embryonic stem cell (ESC) lines obtained at specific time points during the course of their differentiation into various lineages. We developed methods to identify genes with expression changes that correlated with the altered frequency of functionally defined, undifferentiated ESC in culture. In each dataset, we computed a novel statistical confidence measure for every gene which captured the certainty that a particular gene exhibited an expression pattern of interest within that dataset. This permitted a joint analysis of the datasets, despite the different experimental designs. Using a ranking scheme that favored genes exhibiting patterns of interest, we focused on the top 88 genes whose expression was consistently changed when ESC were induced to differentiate. Seven of these (103728_at, 8430410A17Rik, Klf2, Nr0b1, Sox2, Tcl1, and Zfp42) showed a rapid decrease in expression concurrent with a decrease in frequency of undifferentiated cells and remained predictive when evaluated in additional maintenance and differentiating protocols. Through a novel meta-analysis, this study identifies a small set of genes whose expression is useful for identifying changes in stem cell frequencies in cultures of mouse ESC. The methods and findings have broader applicability to understanding the regulation of self-renewal of other stem cell types.
Characterization of Soybean WRKY Gene Family and Identification of Soybean WRKY Genes that Promote Resistance to Soybean Cyst Nematode.

Science.gov (United States)

Yang, Yan; Zhou, Yuan; Chi, Yingjun; Fan, Baofang; Chen, Zhixiang

2017-12-19

WRKY proteins are a superfamily of plant transcription factors with important roles in plants. WRKY proteins have been extensively analyzed in plant species including Arabidopsis and rice. Here we report characterization of soybean WRKY gene family and their functional analysis in resistance to soybean cyst nematode (SCN), the most important soybean pathogen. Through search of the soybean genome, we identified 174 genes encoding WRKY proteins that can be classified into seven groups as established in other plants. WRKY variants including a WRKY-related protein unique to legumes have also been identified. Expression analysis reveals both diverse expression patterns in different soybean tissues and preferential expression of specific WRKY groups in certain tissues. Furthermore, a large number of soybean WRKY genes were responsive to salicylic acid. To identify soybean WRKY genes that promote soybean resistance to SCN, we first screened soybean WRKY genes for enhancing SCN resistance when over-expressed in transgenic soybean hairy roots. To confirm the results, we transformed five WRKY genes into a SCN-susceptible soybean cultivar and generated transgenic soybean lines. Transgenic soybean lines overexpressing three WRKY transgenes displayed increased resistance to SCN. Thus, WRKY genes could be explored to develop new soybean cultivars with enhanced resistance to SCN.
Genomic analysis of primordial dwarfism reveals novel disease genes.

Science.gov (United States)

Shaheen, Ranad; Faqeih, Eissa; Ansari, Shinu; Abdel-Salam, Ghada; Al-Hassnan, Zuhair N; Al-Shidi, Tarfa; Alomar, Rana; Sogaty, Sameera; Alkuraya, Fowzan S

2014-02-01

Primordial dwarfism (PD) is a disease in which severely impaired fetal growth persists throughout postnatal development and results in stunted adult size. The condition is highly heterogeneous clinically, but the use of certain phenotypic aspects such as head circumference and facial appearance has proven helpful in defining clinical subgroups. In this study, we present the results of clinical and genomic characterization of 16 new patients in whom a broad definition of PD was used (e.g., 3M syndrome was included). We report a novel PD syndrome with distinct facies in two unrelated patients, each with a different homozygous truncating mutation in CRIPT. Our analysis also reveals, in addition to mutations in known PD disease genes, the first instance of biallelic truncating BRCA2 mutation causing PD with normal bone marrow analysis. In addition, we have identified a novel locus for Seckel syndrome based on a consanguineous multiplex family and identified a homozygous truncating mutation in DNA2 as the likely cause. An additional novel PD disease candidate gene XRCC4 was identified by autozygome/exome analysis, and the knockout mouse phenotype is highly compatible with PD. Thus, we add a number of novel genes to the growing list of PD-linked genes, including one which we show to be linked to a novel PD syndrome with a distinct facial appearance. PD is extremely heterogeneous genetically and clinically, and genomic tools are often required to reach a molecular diagnosis.
Sensitization to group direction in the postgraduate training on Group-Analysis

Directory of Open Access Journals (Sweden)

Simone Bruschetta

2014-09-01

The psychodynamic training group introduced here is part of the General Training on Group Analysis of the Centre of Palermo of the COIRAG Postgraduate School on Analytic Psychotherapy. The training project, built for the third-year class, develops a sensitization device that provides a unique "aquarium" setting. The aim of this methodological artifice is not to engage students in specific group management techniques, but to allow the whole class group to bring into play the complexity of relations of which one must be aware in order to lead a group within an institutional context. The main clinical referents that we chose to monitor in this experience are the relationship between conductors and participants and the relationship between group, task and setting. The brief description of this methodology also includes reports of two "cases" treated in the course of training. Keywords: Group leadership, Founding dimension, Cultural themes
In silico identification and analysis of phytoene synthase genes in plants.

Science.gov (United States)

Han, Y; Zheng, Q S; Wei, Y P; Chen, J; Liu, R; Wan, H J

2015-08-14

In this study, we examined phytoene synthase (PSY), the first key limiting enzyme in the synthesis of carotenoids and catalyzing the formation of geranylgeranyl pyrophosphate in terpenoid biosynthesis. We used known amino acid sequences of the PSY gene in tomato plants to conduct a genome-wide search and identify putative candidates in 34 sequenced plants. A total of 101 homologous genes were identified. Phylogenetic analysis revealed that PSY evolved independently in algae as well as monocotyledonous and dicotyledonous plants. Our results showed that the amino acid structures exhibited 5 motifs (motifs 1 to 5) in algae and that those in higher plants were highly conserved. The PSY gene structures showed that the number of introns in algae varied widely, while the number of introns in higher plants was 4 to 5. Identification of PSY genes in plants and the analysis of the gene structure may provide a theoretical basis for studying evolutionary relationships in future analyses.
Performance of PCR-restriction fragment length polymorphism analysis of the Helicobacter pylori ureB gene in differentiating gene variants

DEFF Research Database (Denmark)

Colding, H; Hartzen, S H; Mohammadi, M

2003-01-01

Recently, PCR-restriction fragment length polymorphism (PCR-RFLP) of the urease genes of Helicobacter pylori was evaluated in a meta-analysis; acceptable discriminatory indices of the ureAB and C genes were found. In the present investigation, we found a discriminatory index of 0.95 for 191...... is comparable to typing of other H. pylori urease genes....
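The abstract above is truncated and does not say how its discriminatory index was computed. As a hedged illustration, a widely used measure for typing methods is Simpson's index of diversity in the Hunter-Gaston form, D = 1 - Σ n_j(n_j - 1) / (N(N - 1)), where N is the number of isolates and n_j the number assigned to type j. The Python sketch below assumes that formula; the function name and toy labels are hypothetical.

```python
from collections import Counter
from typing import Hashable, Sequence


def discriminatory_index(type_labels: Sequence[Hashable]) -> float:
    """Hunter-Gaston form of Simpson's index for a list of assigned types.

    Returns the probability that two isolates drawn at random without
    replacement are assigned to different types.
    """
    n = len(type_labels)
    if n < 2:
        raise ValueError("need at least two isolates")
    counts = Counter(type_labels)
    # Number of ordered pairs of isolates that share a type.
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n * (n - 1))


# Hypothetical RFLP pattern labels for five isolates.
labels = ["A", "A", "B", "C", "D"]
d_index = discriminatory_index(labels)
```

With five isolates of which two share a pattern, two of the twenty ordered pairs match, so the index is 1 - 2/20. An index near 1 means the method separates almost every pair of unrelated isolates.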
Development of gene diagnosis for diabetes and cholecystitis based on gene analysis of CCK-A receptor

International Nuclear Information System (INIS)

Kono, Akira

1999-01-01

Base sequence analysis of the CCKAR gene (the gene for the A-type cholecystokinin receptor) from the OLETF rat, a model rat for insulin-independent diabetes, was made based on the base sequence of the wild-type CCKAR gene, which had been clarified in the previous year. DNA was extracted from the pancreas of the OLETF rat, fragmented and transduced into λ phage to construct the OLETF gene library. Then, a λ phage DNA clone that bound labelled cDNA of the CCKAR gene was analyzed and its gene structure was compared with that of the wild-type gene. It was demonstrated that the CCKAR gene of OLETF had a deletion (6800 b.p.) ranging from the promoter region to Exon 2, suggesting that the CCKAR gene is not functional in the OLETF rat. The whole sequence of this mutant gene was registered in the Japan DNA Bank (D 50610). Then, F2 offspring rats were obtained by crossing OLETF (female) and F344 (male), and the time-course changes in the blood glucose level after glucose loading were compared among them. The blood glucose level after glucose loading was significantly higher in the homo-mutant F2 (CCKAR,-/-) as well as the parent OLETF rat than in the hetero-mutant F2 (CCKAR,-/+) or the wild-type rat (CCKAR,+/+). This suggests that the CCKAR gene might be involved in the control of blood glucose level and that an alteration of the expression level or the functions of the CCKAR gene might affect the blood glucose level. (M.N.)
Network Analysis of Human Genes Influencing Susceptibility to Mycobacterial Infections

Science.gov (United States)

Lipner, Ettie M.; Garcia, Benjamin J.; Strong, Michael

2016-01-01

Tuberculosis and nontuberculous mycobacterial infections constitute a high burden of pulmonary disease in humans, resulting in over 1.5 million deaths per year. Building on the premise that genetic factors influence the instance, progression, and defense of infectious disease, we undertook a systems biology approach to investigate relationships among genetic factors that may play a role in increased susceptibility or control of mycobacterial infections. We combined literature and database mining with network analysis and pathway enrichment analysis to examine genes, pathways, and networks, involved in the human response to Mycobacterium tuberculosis and nontuberculous mycobacterial infections. This approach allowed us to examine functional relationships among reported genes, and to identify novel genes and enriched pathways that may play a role in mycobacterial susceptibility or control. Our findings suggest that the primary pathways and genes influencing mycobacterial infection control involve an interplay between innate and adaptive immune proteins and pathways. Signaling pathways involved in autoimmune disease were significantly enriched as revealed in our networks. Mycobacterial disease susceptibility networks were also examined within the context of gene-chemical relationships, in order to identify putative drugs and nutrients with potential beneficial immunomodulatory or anti-mycobacterial effects. PMID:26751573
Serial analysis of gene expression in the silkworm, Bombyx mori.

Science.gov (United States)

Huang, Jianhua; Miao, Xuexia; Jin, Weirong; Couble, Pierre; Mita, Kasuei; Zhang, Yong; Liu, Wenbin; Zhuang, Leijun; Shen, Yan; Keime, Celine; Gandrillon, Olivier; Brouilly, Patrick; Briolay, Jerome; Zhao, Guoping; Huang, Yongping

2005-08-01

The silkworm Bombyx mori is one of the most economically important insects and serves as a model for Lepidoptera insects. We used serial analysis of gene expression (SAGE) to derive profiles of expressed genes during the developmental life cycle of the silkworm and to create a reference for understanding silkworm metamorphosis. We generated four SAGE libraries, one from each of the four developmental stages of the silkworm. In total we obtained 257,964 SAGE tags, of which 39,485 were unique tags. Sorted by copy number, 14.1% of the unique tags were detected at a median to high level (five or more copies), 24.2% at lower levels (two to four copies), and 61.7% as single copies. Using a basic local alignment search tool on the EST database, 35% of the tags matched known silkworm expressed sequence tags. SAGE demonstrated that a number of the genes were up- or down-regulated during the four developmental phases of the egg, larva, pupa, and adult. Furthermore, we found that the generation of longer cDNA fragments from SAGE tags constituted the most efficient method of gene identification, which facilitated the analysis of a large number of unknown genes.
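The copy-number breakdown reported above (single copies, two to four copies, five or more copies) amounts to binning unique tags by their observed count. A minimal Python sketch of that tally follows; the function name, the bin labels and the toy tag library are assumptions made for illustration, not data from the study.

```python
from typing import Dict


def abundance_classes(tag_counts: Dict[str, int]) -> Dict[str, float]:
    """Percentage of unique tags falling in each copy-number class.

    Assumes every tag was observed at least once (count >= 1).
    """
    if not tag_counts:
        raise ValueError("need at least one tag")
    bins = {"single": 0, "two_to_four": 0, "five_or_more": 0}
    for copies in tag_counts.values():
        if copies >= 5:
            bins["five_or_more"] += 1
        elif copies >= 2:
            bins["two_to_four"] += 1
        else:
            bins["single"] += 1
    total = len(tag_counts)
    return {name: 100.0 * n / total for name, n in bins.items()}


# Hypothetical toy library: tag identifier -> observed copy number.
toy_tags = {
    "tag01": 12,
    "tag02": 2, "tag03": 3, "tag04": 4,
    "tag05": 1, "tag06": 1, "tag07": 1,
    "tag08": 1, "tag09": 1, "tag10": 1,
}
classes = abundance_classes(toy_tags)
```

The percentages are taken over unique tags rather than total tag counts, which matches the way the abstract reports its three classes summing to 100% of the unique tags.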
Bioinformatics analysis of the factors controlling type I IFN gene expression in autoimmune disease and virus-induced immunity

Directory of Open Access Journals (Sweden)

Di Feng

2013-09-01

Patients with systemic lupus erythematosus (SLE) and Sjögren's syndrome (SS) display increased levels of type I IFN-induced genes. Plasmacytoid dendritic cells (PDCs) are natural interferon-producing cells and are considered to be a primary source of IFN-α in these two diseases. Differential expression patterns of type I IFN-inducible transcripts can be found in different immune cell subsets and in patients with both active and inactive autoimmune disease. A type I IFN gene signature generally consists of three groups of IFN-induced genes: those regulated in response to virus-induced type I IFN, those regulated by the IFN-induced mitogen-activated protein kinase/extracellular-regulated kinase (MAPK/ERK) pathway, and those regulated by the IFN-induced phosphoinositide-3 kinase (PI-3K) pathway. These three groups of type I IFN-regulated genes control important cellular processes such as apoptosis, survival, adhesion, and chemotaxis that, when dysregulated, contribute to autoimmunity. With the recent generation of large datasets in the public domain from next-generation sequencing and DNA microarray experiments, one can perform detailed analyses of cell type-specific gene signatures as well as identify distinct transcription factors that differentially regulate these gene signatures. We have performed bioinformatics analysis of data in the public domain and experimental data from our lab to gain insight into the regulation of type I IFN gene expression. We have found that the genetic landscape of the IFNA and IFNB genes is occupied by transcription factors, such as the insulators CTCF and cohesin, that negatively regulate transcription, as well as IRF5 and IRF7, that positively and distinctly regulate IFNA subtypes. A detailed understanding of the factors controlling type I IFN gene transcription will significantly aid in the identification and development of new therapeutic strategies targeting the IFN pathway in autoimmune disease.
GOexpress: an R/Bioconductor package for the identification and visualisation of robust gene ontology signatures through supervised learning of gene expression data.

Science.gov (United States)

Rue-Albrecht, Kévin; McGettigan, Paul A; Hernández, Belinda; Nalpas, Nicolas C; Magee, David A; Parnell, Andrew C; Gordon, Stephen V; MacHugh, David E

2016-03-11

Identification of gene expression profiles that differentiate experimental groups is critical for discovery and analysis of key molecular pathways and also for selection of robust diagnostic or prognostic biomarkers. While integration of differential expression statistics has been used to refine gene set enrichment analyses, such approaches are typically limited to single gene lists resulting from simple two-group comparisons or time-series analyses. In contrast, functional class scoring and machine learning approaches provide powerful alternative methods to leverage molecular measurements for pathway analyses, and to compare continuous and multi-level categorical factors. We introduce GOexpress, a software package for scoring and summarising the capacity of gene ontology features to simultaneously classify samples from multiple experimental groups. GOexpress integrates normalised gene expression data (e.g., from microarray and RNA-seq experiments) and phenotypic information of individual samples with gene ontology annotations to derive a ranking of genes and gene ontology terms using a supervised learning approach. The default random forest algorithm allows interactions between all experimental factors, and competitive scoring of expressed genes to evaluate their relative importance in classifying predefined groups of samples. GOexpress enables rapid identification and visualisation of ontology-related gene panels that robustly classify groups of samples and supports both categorical (e.g., infection status, treatment) and continuous (e.g., time-series, drug concentrations) experimental factors. The use of standard Bioconductor extension packages and publicly available gene ontology annotations facilitates straightforward integration of GOexpress within existing computational biology pipelines.
